期刊文献+

基于标签聚类的多标签分类算法 被引量:10

A Multi-Label Classification Algorithm Based on Label Clustering
下载PDF
导出
摘要 多标签分类的实质就是为给定实例预测一个与其关联的标签集合。典型方法可以分为两类:问题转换型和算法适应型。本文主要研究基于标签幂集的问题转换型算法。由于已有的标签幂集算法很难发现甚至可能忽略隐藏在训练集中的重要标签集合,因此,本文提出了一种基于标签聚类的标签幂集方法,通过改进平衡k-means聚类来发现训练集中潜在的重要标签集合,并用于形成新的训练集进行多标签分类。经实验验证,该算法在多个评价指标上较原有的标签幂集方法具有更好的分类性能。 The essence of a multi-label classifier is to assign a set of labels to a given instance. There are the two classical methods: problem transformation and algorithm adaptation. This paper mainly explores the problem transformation of label powerset. By analyzing existing label powerset methods, we find out that it is easy for them to underutilize multi label information. Therefore, this paper proposed a novel label powerset method based on label clustering. Firstly, it identifies unseen multilabels by improving balanced k-means clustering. Then based on that unseen multilabels, it forms new training data for multi-label classification. The experimental results show that the new method has competitive performance with respect to multiple evaluation metrics.
出处 《软件》 2014年第8期16-21,共6页 Software
基金 北京市自然科学基金资助(4142042)
关键词 多标签 分类器 标签聚类 标签集合 Multi Label Classifier Label Clustering Label Set
  • 相关文献

参考文献14

  • 1郑文超,徐鹏.利用word2vec对中文词进行聚类的研究[J].软件,2013,34(12):160-162. 被引量:29
  • 2L. Enrique Sucar,Concha Bielza,Eduardo F. Morales,Pablo Hernandez-Leal,Julio H. Zaragoza,Pedro Larra?aga.Multi-label classification with Bayesian network-based chain classifiers[J]. Pattern Recognition Letters . 2013
  • 3Gjorgji Madjarov,Dragi Kocev,Dejan Gjorgjevikj,Sa?o D?eroski.An extensive experimental comparison of methods for multi-label learning[J]. Pattern Recognition . 2012 (9)
  • 4Arindam Banerjee,Joydeep Ghosh.Scalable Clustering Algorithms with Balancing Constraints[J]. Data Mining and Knowledge Discovery . 2006 (3)
  • 5Grigorios Tsoumakas,Ioannis Katakis,Ioannis Vlahavas.Random k-Labelsets for Multilabel Classification. IEEE Transactions on Knowledge and Data Engineering . 2011
  • 6Tsoumakas, Grigorios,Spyromitros-Xioufis, Eleftherios,Vilcek, Jozef,Vlahavas, Ioannis.MULAN: A Java library for multi-label learning. Journal of Machine Learning Research . 2011
  • 7Read J,Pfahringer B,Holmes G,et al.Classifier chains for multi-label classification. Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases:Part II . 2009
  • 8Jesse Read.A pruned problem transformation method for multi-label classification. Proceedings of the New Zealand Computer Science Research Student Conference . 2008
  • 9Zhang M,Zhou Z.A k-nearest neighbor based algorithm for multi-label classification. Proceedings of the IEEE International Conference on Granular Computing . 2005
  • 10Goncalves E C,Freitas A A.A Genetic Algorithm for Optimizing the Label Ordering in Multi-Label Classifier Chains. IEEE International Conference on Tools with Artificial Intelligence (ICTAI) . 2013

二级参考文献5

  • 1袁方,周志勇,宋鑫.初始聚类中心优化的k-means算法[J].计算机工程,2007,33(3):65-66. 被引量:152
  • 2曾元颍.词袋模型.
  • 3维基百科.语言模型.
  • 4Yoshua B,Rejean D,Pascal V,Christian J. A Neural Probabilistic Language Model[J].{H}JOURNAL OF MACHINE LEARNING RESEARCH,2003,(0):1137-1155.
  • 5曾俊瑀;王方.Softmax回归.

共引文献29

同被引文献103

引证文献10

二级引证文献83

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部