期刊文献+

基于信息熵的主动学习半监督分类研究 被引量:8

Active Learning Based on Information Entropy for Semi-supervised Classification
下载PDF
导出
摘要 针对小规模训练样本不足以支持学习器对含有大量潜在不确定因素的未标样本集分类的问题,提出了一种基于信息熵的主动学习方法,引入信息熵的离散事件概率估计理论,通过对未标文档熵值的计算,结合二阶段学习策略,主动学习利用现有知识,结合实验样本环境,主动地选取最有可能的解决问题的样本并标注它们的类别,获得新的参数,重新训练分类器,选择最有利分类器性能的样本,迭代直到未标样本集为空。实验结果表明,该方法取得了较好的分类效果。 Most of supervised machine learning methods led to poor performance when work on limited tagged data. Investigated a novel semi- supervised learning method based on active learning with information entropy. An optimization strategy of selecting part of instances from unlabeled examples for classifying in each iteration, based on active learning from unhbeled examples, was presented. The experiment results show that our method achieve high performance on small tagged data.
作者 陈锦禾 沈洁
出处 《计算机技术与发展》 2010年第2期110-113,共4页 Computer Technology and Development
基金 国家自然科学基金(60673060)
关键词 信息熵 半监督学习 主动学习 分类 information entropy semi-supervised learning active learning elassification
  • 相关文献

参考文献13

  • 1Rocchio J. Relevant feedback in information retrieval[ M]//In Salton G. The smart retrieval system - experiments in automatic document processing. Englewood Cliffs, NJ: [s. n. ], 1971.
  • 2MeCaUum A, Nigam K. A comparison of event models for naive Bayes text classification [ C]//AAAI - 98 Workshop on Learning for Text Categorization. [s. l. ] :AAAI Press, 1998.
  • 3Guyon I, Boser B, Vapnik V. Automatic capacity tuning of very large Vcdimension classifiers[J ]. Advances in Neural Information Processing Systems, 1993(5):147- 155.
  • 4Igam K,McCallum A,Thrun S,et al. Learning to classify text from labeled and unlabeled documents [ C]//In: Mostow J, Madison C R. Proceedings d the 15th National Conference on Artificial Intelligence. Wisconsin: AAAI Press, 1998:792- 799.
  • 5刘晶,郭雷,聂晶鑫.基于SVM的一种新的分类器设计方法[J].计算机应用研究,2006,23(7):181-182. 被引量:5
  • 6Engelbreeht A P, Cloete I. Incremental Learning Using Sensitivity Analysis[C]//Neural Networks, 1999. IJCNN apos; 99. International Joint Conferenoe. [s. l. ] : IEEE Press, 1999: 1350 - 1355.
  • 7陈耀东,王挺,陈火旺.半监督学习和主动学习相结合的浅层语义分析[J].中文信息学报,2008,22(2):70-75. 被引量:13
  • 8Thompson C A,Califf M E,Mooney R J. Active Learning for Natural Language Parsing and Information Extraction[C]// In:Proceedings of the sixteenth International Machine Learning Conference. Slovenia: [ s. n. ], 1999.
  • 9张健沛,徐华.支持向量机(SVM)主动学习方法研究与应用[J].计算机应用,2004,24(1):1-3. 被引量:51
  • 10Cohn D A, Ghahramani Z, Jordan M I. Active learning with statistical models [ J ]. J. of Artidal Intelligence Research, 1996,4:129 - 145.

二级参考文献17

  • 1VAPNIKVN 张学工译.统计学习理论的本质[M].清华大学出版社,2000..
  • 2Andrew R Webb, Statistical Pattern Recognition(2nd edition)[M].Publishing House of Electronics Industry, 2004.5-6,106-111.
  • 3Ira Cohen, Fabio G Cozman, Nieu Sebe, et al. Semisupervised Learning of Classifiers : Theory, Algorithms, and Their Application to Human-Computer Interaction [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2004, 26(12):1553-1567.
  • 4Edgar E Osuna, Robert Freund, Federico Girosi, Support Vector Machines:Training and Applications [R]. Massachusetts Institute of Technology, 1997.28-30.
  • 5Chris J C Burges, Beruhard Scholkopf. Improving the Accuracy and Speed of Support Vector Machines[C]. Advances in Neural Information Processing Systems, MIT Press, 1997, 375-382.
  • 6Thorsten Joachims. Learning to Classify Text Using Support Vector Machines Method, Theory, and Algorithms [M]. Kluwer Academic Publishers, 2002. 140-160.
  • 7Platt J C, Sequential Minimal Optimization: A Fast Algorithm for Training Support Vector Machine [M]. Cambridge, MA: MIT Press,1999. 185-206.
  • 8Xavier Carreras, Lluis Marquez. Introduction to the CoNLL-2005 Shared Task: Semantic Role Labeling[A]. In: Proceedings of CoNLL-2005[C]. 2005.
  • 9T. Joachims. Transductive inference for text classification using support vector machines [A]. In: Proc. of ICML-99[C]. 1999. 200-209.
  • 10Levin, Beth. English Verb Class and Alternations: A Preliminary Investigation [M]. Chicago: University of Chicago Press. 1993.

共引文献64

同被引文献99

引证文献8

二级引证文献59

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部