期刊文献+

基于关联规则和拓扑序列的分类器链方法 被引量:1

Classifier Chains Method Based on Association Rules and Topological Sequences
下载PDF
导出
摘要 在分类器链方法中,如何确定标签学习次序至关重要,为此,提出一种基于关联规则和拓扑序列的分类器链方法 (TSECC).首先结合频繁模式设计了一种基于强关联规则的标签依赖度量策略;接下来通过标签间依赖关系构建有向无环图,对图中所有顶点进行拓扑排序;最后将得到的拓扑序列作为分类器链方法中标签的学习次序,对每个标签的分类器依次迭代更新.特别地,为减少无标签依赖或标签依赖度较低的“孤独”标签对其余标签预测性能的影响,将“孤独”标签排在拓扑序列之外,利用二元关联模型训练.在多种公共多标签数据集上的实验结果表明, TSECC能够有效提升分类性能. The order of label learning is crucial to a classifier chains method.Therefore,this study proposes a classifier chains method based on the association rules and topological sequence(TSECC).Specifically,a measurement strategy for label dependencies based on strong association rules is designed by leveraging frequent patterns.Then,a directed acyclic graph is constructed according to the dependency relationships among the labels to topologically sort all the vertices in the graph.Finally,the topological sequence obtained is used as the order of label learning to iteratively update each label’s classifier successively.In particular,to reduce the impact of“lonely”labels with no or low label dependencies on the prediction performance on the other labels,TSECC excludes“lonely”labels out of the topological sequence and uses a binary relevance model to train them separately.Experimental results on a variety of public multi-label datasets show that TSECC can effectively improve classification performance.
作者 丁家满 周蜀杰 李润鑫 付晓东 贾连印 DING Jia-Man;ZHOU Shu-Jie;LI Run-Xin;FU Xiao-Dong;JIA Lian-Yin(Faculty of Information Engineering and Automation,Kunming University of Science and Technology,Kunming 650500,China;Artificial Intelligence Key Laboratory of Yunnan Province(Kunming University of Science and Technology),Kunming 650500,China)
出处 《软件学报》 EI CSCD 北大核心 2023年第9期4210-4224,共15页 Journal of Software
基金 国家自然科学基金(61562054)。
关键词 多标签学习 分类器链 标签依赖 强关联规则 拓扑序列 二元关联 multi-label learning classifier chains label dependent strong association rules topological sequence binary relevance
  • 相关文献

参考文献3

二级参考文献39

  • 1Mitchell TM. Machine Learning. New York: McGraw-Hill, 1997.
  • 2Tsoumakas G, Katakis I. Multi-Label classification: An overview. Int’l Journal of Data Warehousing and Mining, 2007,3(3): 1-13. [doi: 10.4018/j dwm.2007070101].
  • 3Zhang ML, Zhou ZH. A review on multi-label learning algorithms. IEEE Trans, on Knowledge and Data Engineering, 2014,26(8): 1819-1837. [doi: 10.1109/TKDE.2013.39].
  • 4Schapire RE, Singer Y. BoosTexter: A boosting-based system for text categorization. Machine Learning, 2000,39(2/3):135-168. [doi: 10.1023/A: 1007649029923].
  • 5McCallum A. Multi-Label text classification with a mixture model trained by EM. In: Proc. of the Working Notes of the AAAI’99 Workshop on Text Learning. 1999.
  • 6Boutell MR, Luo J, Shen X, Brown CM. Learning multi-label scene classification. Patter Recognition, 2004,37(9):1757-1771. [doi: 10.1016/j.patcog.2004.03.009].
  • 7Elisseeff A, Weston J. A kernel method for multi-labelled classification. In: Proc. of the Advances in Neural Information Processing Systems 14. Cambridge: MIT Press, 2002. 681-687.
  • 8Clare A, King RD. Knowledge discovery in multi-label phenotype data. Lecture Notes in Computer Science, 2001,2168:42-53. [doi: 10.1007/3-540-44794-6 4].
  • 9Barutcuoglu Z, Schapire RE, Troyanskaya OG. Hierarchical multi-label prediction of gene function. Bioinformatics, 2006,22(7): 830-836. [doi: 10.1093/bioinformatics/btk048].
  • 10Song Y, Zhang L, Giles LC. A sparse Gaussian processes classification framework for fast tag suggestions. In: Proc. of the 17th ACM Conf. on Information and Knowledge Management. 2008. 93-102. [doi: 10.1145/1458082.1458098].

共引文献11

同被引文献11

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部