基于贝叶斯模型的多标签分类算法被引量：4

Multi-label classification algorithm based on Bayesian model

下载PDF

导出

摘要针对二元关联法(BR)未考虑标签之间相关性,容易造成分类器输出在训练集中不存在或次数较少标签的不足,提出了基于贝叶斯模型的多标签分类算法(MLBM)和马尔可夫型多标签分类算法(MMLBM)。首先,建立仿真模型分析BR算法的不足,考虑到标签的取值应由属性置信度和标签置信度共同决定,提出MLBM。其中,通过传统的分类算法计算获得属性置信度,以及通过训练集得到标签置信度。然后,考虑到MLBM在计算属性置信度时必须考虑所有已分类的标签,分类器的性能容易受无关或弱关系的标签影响,所以使用马尔可夫模型简化置信度的计算提出了MMLBM。理论分析和仿真实验表明,与BR算法相比,MMLBM的平均分类精度在emotions数据集上提高约4.8%,在yeast数据集上提高约9.8%,在flags数据集上提高约7.3%。实验结果表明,当数据集中实例的标签基数较大时,相对于BR算法,MMLBM的准确性有较大的提升。 Since the relation of labels in Binary Relevance（ BR） is ignored, it is easy to cause the multi-label classifier to output not exist or less emergent labels in training data. The Multi-Label classification algorithm based on Bayesian Model（ MLBM） and Markov Multi-Label classification algorithm based on Bayesian Model（ MMLBM） were proposed. Firstly, to analyze the shortcomings of BR algorithm, the simulation model was established; considering the value of label should be decided by the attribute confidence and label confidence, MLBM was proposed. Particularly, the attribute confidence was calculated by traditional classification and the label confidence was obtained directly from the training data. Secondly, when MLBM calculated label confidence, it had to consider all the classified labels, thus some of no-relation or weak-relation labels would affect performance of the classifier. To overcome the weakness of MLBM, MMLBM was proposed, which used Markov model to simplify the calculation of label confidence. The theoretical analyses and simulation experiment results demonstrate that, in comparison with BR algorithm, the average classification accuracy of MMLBM increased by 4. 8% on emotions dataset, 9. 8% on yeast dataset and 7. 3% on flags dataset. The experimental results show that MMLBM can effectively improve the classification accuracy when the label cardinality is larger in the training data.

作者张洛阳毛嘉莉刘斌吴涛

机构地区西华师范大学计算机学院华东师范大学软件学院

出处《计算机应用》 CSCD 北大核心 2016年第1期52-56,71,共6页 journal of Computer Applications

基金四川省自然科学基金资助项目(14ZB0140)~~

关键词多标签贝叶斯模型马尔可夫模型 K近邻置信度 multi-label Bayesian model Markov model K Nearest Neighbor（KNN） confidence

分类号 TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献19

1ZHANG M, ZHOU Z. A review on multi-label learning algorithms [J]. IEEE transactions on knowledge and data engineering, 2014, 26(8): 1819-1837.
2READ J. A pruned problem transformation method for multi-label classification [C]// Proceedings of the 2008 New Zealand Computer Science Research Student Conference. Hamilton, New Zealand: [s.n.], 2008: 143-150.
3TSOUMAKAS G, KATAKIS I, VLAHAVAS I. Random k-labelsets for multilabel classification [J]. IEEE transactions on knowledge and data engineering, 2011, 23(7): 1079-1089.
4READ J, PFAHRINGER B, HOLMES G, et al. Classifier chains for multi-label classification [J]. Machine learning, 2011, 85(3): 333-359.
5READ J, PFAHRINGER B, HOLMES G, et al. Classifiers chains for multi-label classification [C]// Proceedings of the 2009 European Conference on Machine Learning and Knowledge Discovery in Databases. Berlin: Springer, 2009: 254-269.
6CHENG W, HVLLERMEIER E, DEMBCZYNSKI K J. An analysis of chaining in multi-label classification [C]// Proceedings of the 20th European Conference on Artificial Intelligence. Amsterdam: IOS Press, 2012: 294-299.
7CHENG W, HüLLERMEIER E, DEMBCZYNSKI K J. Bayes optimal multilabel classification via probabilistic classifier chains [C]// Proceedings of the 27th International Conference on Machine Learning. New York: ACM, 2010: 279-286.
8SUCAR L E, BIELZA C, MORALES E F, et al. Multi-label classification with Bayesian network-based chain classifiers [J]. Pattern recognition letters, 2014, 41(9):12-22.
9YU Y, PEDRYCZ W, MIAO D. Multi-label classification by exploiting label correlations [J]. Expert systems with applications, 2014, 41(6): 2989-3004.
10ZHANG M L, ZHOU Z H. ML-KNN: a lazy learning approach to multilabel learning [J]. Pattern recognition, 2007, 40(7): 2038-2048.

同被引文献12

1胡斌,宫宁生,郇洪江.改进的RBF学习算法及其相似性应用[J].计算机工程与设计,2009,30(18):4287-4289. 被引量：5
2吕小勇,石洪波.基于频繁项集的多标签文本分类算法[J].计算机工程,2010,36(15):83-85. 被引量：4
3张敏灵.一种新型多标记懒惰学习算法[J].计算机研究与发展,2012,49(11):2271-2282. 被引量：39
4何朋,周丽娟.基于联合概率的多标签分类算法[J].计算机应用,2015,35(3):659-662. 被引量：4
5周恩波,叶荣华,张微微,周子涵.一种基于成对标签的Rakel算法改进[J].计算机与现代化,2016(3):16-18. 被引量：3
6金永贤,张微微,周恩波.一种改进的RAKEL多标签分类算法[J].浙江师范大学学报（自然科学版）,2016,39(4):386-391. 被引量：2
7茅硕.泛娱乐时代全IP产业的发展趋势[J].出版广角,2016(18):24-26. 被引量：7
8李峰,苗夺谦,张志飞,张维.基于互信息的粒化特征加权多标签学习k近邻算法[J].计算机研究与发展,2017,54(5):1024-1035. 被引量：22
9马建刚,张鹏,马应龙.基于知识块摘要和词转移距离的高效司法文档分类[J].计算机应用,2019,39(5):1293-1298. 被引量：5
10马建刚,马应龙.语义驱动的司法文档学习分类方法[J].计算机应用,2019,39(6):1696-1700. 被引量：2

引证文献4

1梁睿博,王思远,李壮,刘亚松.基于RAKEL算法的商品评论多标签分类研究与实现[J].软件工程,2019,22(1):8-11. 被引量：3
2刘云,肖添,肖雪.基于相似度的多标签分类算法优化[J].计算机与数字工程,2022,50(2):243-246.
3李锦烨,黄瑞章,秦永彬,陈艳平,田小瑜.基于反绎学习的裁判文书量刑情节识别[J].计算机应用,2022,42(6):1802-1807. 被引量：2
4陈若愚,刘秀磊,于汝意.面向泛娱乐文本的层次多标签分类方法[J].计算机应用与软件,2023,40(1):60-65.

二级引证文献5

1谈俊林.大数据技术在通信运营商异网获客系统的应用[J].软件工程,2020,23(1):27-29. 被引量：1
2赵静,韩京宇,钱龙,毛毅.基于改进的RAKEL算法的心电图诊断分类[J].计算机应用,2022,42(6):1892-1897.
3肖新正,黄瑞章,陈艳平,秦永彬,宋玉梅,周裕林.Corrective-Net:面向多标签文本分类的标签关联学习模块[J].计算机工程与科学,2024,46(6):1092-1100.
4张鹏,郝国生,王霞,许文阳,祝义.反绎学习支持下的自动问答及其应用[J].计算机工程与应用,2024,60(17):139-147.
5冯心昊,吕学强,马登豪,滕尚志,田晶晶.融合双通道标签语义的多标签文本分类模型[J].北京信息科技大学学报（自然科学版）,2024,39(4):49-54.

1J.A.RINCON,J.BAJO,A.FERNANDEZ,V.JULIAN,C.CARRASCOSA.Using emotions for the development of human-agent societies[J].Frontiers of Information Technology & Electronic Engineering,2016,17(4):325-337.
2魏娜.基于Excel的简单仿真模型分析[J].科技资讯,2006,4(33):222-223.
3檀何凤,刘政怡.基于标签相关性的K近邻多标签分类方法[J].计算机应用,2015,35(10):2761-2765. 被引量：10
4周磊,覃俊,刘晶.基于微博交互信息的社交网络推荐算法[J].软件导刊,2015,14(4):63-66. 被引量：1
5林芷羽.微博传播的弱关系性质分析[J].戏剧之家,2016(13):263-263. 被引量：2
6老万.恢复浏览器的标签位置[J].电脑爱好者,2015,0(14):62-62.
7陈裕国.类比法、关联法在微机原理教学中的运用[J].科技信息,2008(23):167-167. 被引量：3
8龙士工,李祥.基于着色Petri网仿真模型的安全协议分析[J].计算机仿真,2005,22(6):95-97. 被引量：1
9Kang Xin,Ren Fuji.Predicting Complex Word Emotions and Topics through a Hierarchical Bayesian Network[J].China Communications,2012,9(3):99-109. 被引量：2
10王文俊,张军英.基于核的类别非局保留投影[J].模式识别与人工智能,2009,22(5):769-773. 被引量：4

计算机应用

2016年第1期

浏览历史

内容加载中请稍等...

基于贝叶斯模型的多标签分类算法被引量：4

参考文献19

同被引文献12

引证文献4

二级引证文献5

相关作者

相关机构

相关主题

浏览历史

基于贝叶斯模型的多标签分类算法 被引量：4

参考文献19

同被引文献12

引证文献4

二级引证文献5

相关作者

相关机构

相关主题

浏览历史

基于贝叶斯模型的多标签分类算法被引量：4