期刊文献+

基于Tri-training的半监督多标记学习算法 被引量:4

Semi-supervised multi-label learning algorithm based on Tri-training
下载PDF
导出
摘要 传统的多标记学习是监督意义下的学习,它要求获得完整的类别标记.但是当数据规模较大且类别数目较多时,获得完整类别标记的训练样本集是非常困难的.因而,在半监督协同训练思想的框架下,提出了基于Tri-training的半监督多标记学习算法(SMLT).在学习阶段,SMLT引入一个虚拟类标记,然后针对每一对类别标记,利用协同训练机制Tri-training算法训练得到对应的分类器;在预测阶段,给定一个新的样本,将其代入上述所得的分类器中,根据类别标记得票数的多少将多标记学习问题转化为标记排序问题,并将虚拟类标记的得票数作为阈值对标记排序结果进行划分.在UCI中4个常用的多标记数据集上的对比实验表明,SMLT算法在4个评价指标上的性能大多优于其他对比算法,验证了该算法的有效性. Traditional multi-label learning is in the sense of supervision , in which the complete category labels arerequired.However, when the size of data is large and there are several categories of labels , it is quite difficult toobtain the training sample sets with complete labels .Therefore, a semi-supervised multi-label learning algorithmbased on Tri-training (SMLT) is proposed.In the learning stage, SMLT initially introduces a virtual label, then foreach pair of virtual labels, the Tri-training algorithm is utilized to train the corresponding classifiers for each pair oflabels.In the forecast stage, a new sample is given, which will be substituted into the obtained classifier describedabove.According to the votes of each label, the multi-label learning problem is transformed into a label rankingproblem, subsequently; the votes of the virtual label are taken as the threshold for distinguishing the label rankingresults.The contrast experiments on four commonly used UCI multi -label datasets show the SMLT algorithm behavesbetter than other comparative algorithms in four evaluation indices and the effectiveness of the proposed algorithm isverified.
出处 《智能系统学报》 CSCD 北大核心 2013年第5期439-445,共7页 CAAI Transactions on Intelligent Systems
基金 国家"973"计划前期研究专项(2011CB311805) 山西省科技攻关计划资助项目(20110321027-01) 山西省科技基础条件平台建设项目(2012091002-0101)
关键词 多标记学习 半监督学习 TRI-TRAINING multi-label learning semi-supervised learning Tri-training
  • 相关文献

参考文献19

  • 1TSOUMAKAS G, KATAKIS I. Multi-label classification: an overview[J]. International Journal of Data Warehousing and Mining, 2007, 3(3): 1-13..
  • 2ZHU Xiaojin. Semi-supervised learning literature survey [R]. Madison, USA: University of WisconsinMadison, 2008..
  • 3常瑜,梁吉业,高嘉伟,杨静.一种基于Seeds集和成对约束的半监督聚类算法[J].南京大学学报(自然科学版),2012,48(4):405-411. 被引量:7
  • 4ZHOU Zhihua, ZHANG Minling, HUANG Shengjun, et al. Multi-instance multi-label learning[J]. Artificial Intelligence, 2012, 176(1): 2291-2320..
  • 5ZHANG Minling, ZHANG Kun. Multi-label learning by exploiting label dependency[C]//Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Washington, DC, USA,2010: 999-1007..
  • 6BOUTELL M R, LUO Jiebo, SHEN Xipeng, et al. Learning multi-label scene classification[J]. Pattern Recognition, 2004, 37(9): 1757-1771..
  • 7FURNKRANZ J, HULLERMEIER E, MENCIA E L, et al. Multi-label classification via calibrated label ranking[J]. Machine Learning, 2008, 73(2): 133-153..
  • 8TSOUMAKAS G, VLAHAVAS I. Random k-labelsets: an ensemble method for multilabel classification[C]//Proceedings of the 18th European Conference on Machine Learning. Berlin: Springer, 2007: 406-417..
  • 9ZHANG Minling, ZHOU Zhihua. ML-kNN: a lazy learning approach to multi-label learning[J]. Pattern Recognition, 2007, 40(7): 2038-2048.
  • 10ELISSEEFF A, WESTON J. A kernel method for multi-labelled classification[M]//DIETTERICH T G, BECKER S, GHAHRAMANI Z. Advances in Neural Information Processing Systems 14. Cambridge, USA: The MIT Press, 2002: 681-687..

二级参考文献73

  • 1姜远,周志华.基于词频分类器集成的文本分类方法[J].计算机研究与发展,2006,43(10):1681-1687. 被引量:22
  • 2薛晓冰,韩洁凌,姜远,周志华.基于多示例学习技术的Web目录页面链接推荐[J].计算机研究与发展,2007,44(3):406-411. 被引量:6
  • 3Schapire R E, Singer Y. Boostexter: A boosting-based system for text categorization [J]. Machine Learning, 2000, 39(2/3) : 135-168
  • 4McCallum A. Multi-label text classification with a mixture model trained by EM [C]//Working Notes of the AAAI'99 Workshop on Text Learning. Menlo Park, CA.-AAAI Press, 1999
  • 5Ueda N, Saito K. Parametric mixture models for multilabeled text [C]//Beeker S, Thrun S, Obermayer K. Advances in Neural Information Processing Systems 15 (NIPS'02). Cambridge, MA:MIT Press, 2003:721-728
  • 6De Comite F, Gilleron R, Tommasi M. Learning multi label alternating decision trees from texts and data [C] //Proc of the 3rd Int Conf on Machine Learning and Data Mining in Pattern Recognition (MLDM'03). Berlin: Springer, 2003: 35-49
  • 7Zhang M-L, Zhou Z-H. Multi-label neural networks with applications to functional genomics and text categorization[J]. IEEE Trans on Knowledge and Data Engineering, 2006, 18(10): 1338-1351
  • 8Zhang M L, Zhou Z-H. ML-kNN: A lazy learning approach to multi-label learning [J]. Pattern Recognition, 2007, 40 (7) : 2038-2048
  • 9Elisseeff A, Weston J. A kernel method for multi-labelled classification [C]//Dietterich T G, Becker S, Ghahramani Z. Advances in Neural Information Processing Systems 14 (NIPS'01). Cambridge, MA: MIT Press, 2002:681-687
  • 10Boutell M R, Luo J, Shen X, et al. Learning multi-label scene classification [J]. Pattern Recognition, 2004, 37(9): 1757-1771

共引文献36

同被引文献59

  • 1钱志明,杨家宽,段连鑫.基于视频的车辆检测与跟踪研究进展[J].中南大学学报(自然科学版),2013,44(S2):222-227. 被引量:13
  • 2徐蓉,姜峰,姚鸿勋.流形学习概述[J].智能系统学报,2006,1(1):44-51. 被引量:67
  • 3杨剑,王珏,钟宁.流形上的Laplacian半监督回归[J].计算机研究与发展,2007,44(7):1121-1127. 被引量:15
  • 4Tsoumakas G, Katakis I. Multi-label classification:An overview[J]. International Journal of Data Warehou- sing and Mining: 2007,3(3): 1-13.
  • 5Zhang Minling, Zhang K. Multi-label learning by ex- ploiting label dependency[C]//Proeeedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, D. C., 2010, 999-1007.
  • 6Zhu Xiaojin. Semi-supervised Learning Literature Sur- vey[R]. Madison University of Wisconsin,2008.
  • 7Zhang Minling, ZhouZhihua. ML-kNN.. A lazy learn- ing approach to multi label learning[J]. Pattern Rec- ognition, 2007, 40(7): 2038-2048.
  • 8Robert E. Schapire, Yoram Singer. BoosTexter: a boosting-based system for text categorization[J]. Ma- chine Learning, 2000, 39(2-3) :135-168.
  • 9Amanda Clare, Ross D. King. Knowledge discovery in multi-label phenotype data[J]. Lecture Notes in Com- puter Science, 2001, 2168:42-53.
  • 10Liu Yi, Jin Rong, Yang Liu. Semi-supervised multi- label learning by constrained non-negative matrix fac- torization[C]//Proceedings of the 21 st National Con- Ierence on ArtiIieial Intelligence. Menlo Park.. AAAI,2006 : 421-426.

引证文献4

二级引证文献18

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部