期刊文献+

基于特征缺失补偿最大熵模型的文本分类 被引量:2

Text Classification Algorithm Based on Maximum Entropy and Compensation Strategy for Unseen Features
下载PDF
导出
摘要 针对自然语言处理中普遍存在的特征缺失问题,提出了基于特征缺失补偿最大熵模型的文本分类方法。为避免数据稀疏时出现训练过适应,采用高斯先验平滑进行特征补偿,并提出基于条件最大熵计算增益和基于特征频数的混合特征选择方法。通过实验将本方法与中心法、最近邻、贝叶斯、SVM和平滑前的最大熵文本分类器进行了比较,实验结果表明基于特征缺失补偿最大熵模型分类器的综合性能超过以上算法。
出处 《情报杂志》 CSSCI 北大核心 2010年第5期141-143,147,共4页 Journal of Intelligence
基金 国家自然科学基金项目"移动电子商务支付模式及安全性能分析研究"(编号:70672092) 国家自然科学基金重大项目"面向服务的商务智能与知识管理方法研究"(编号:708890080)
  • 相关文献

参考文献7

  • 1刘挺,车万翔,李生.基于最大熵分类器的语义角色标注[J].软件学报,2007,18(3):565-573. 被引量:73
  • 2Zheng Ping Jiang,Hwee Tou Ng.Semantic Role Labeling of Nom-Bank:A Maximum Entropy Approach[C].Conference on Empirical Methods in Natural Language Processing,2006:138-145.
  • 3Adwait Ramaparkhi.Maximum Entropy Models for Natural Language Ambiguity Resolution[D].PA,USA:University of Pennsylvania Philadelphia,1998:AAI9840230.
  • 4JAYNES E T.On the Rationale of Maximum-Entropy Methods[C].IEEE,Proceedings,1982,70:939-952.
  • 5Chen Stanley F,Joshua Goodman.An Empirical Study of Smoothing Techniques for Language Modeling[C].Proceedings of the 34th Annual Meeting on Association for Computational Linguistics,Santa Cruz,California,1996:310-318.
  • 6Adma L Berger,Stephen A Della Pietra,Vincent J Della Pietra.A Maximum Entropy Approach to Natural Language Processing[J].Computational Linguistic,1996,22(1):39-71.
  • 7谭松波.王月粉.中文文本分类语料库-TanCorpV1.0[DB]. [2009-10-30]http://www,searchforum,org.cn/tansongbo/corpus,htm.

二级参考文献28

  • 1Chen SF, Rosenfeld R. A Gaussian prior for smoothing maximum entropy models. Technical Report, CMU-CS-99-108, 1999.
  • 2Gildea D, Jurafsky D. Automatic labeling of semantic roles. Computational Linguistics, 2002,28(3):245-288.
  • 3Baker CF, Fillmore CJ, Lowe JB. The Berkeley FrameNet project. In: Boitet C, Whitelock P, eds. Proc. of the ACL&Coling'98.Montreal: ACL, 1998. 86-90.
  • 4Palmer M, Gildea D, Kingsbury P. The Proposition bank: An annotated corpus of semantic roles. Computational Linguistics, 2005,31(1):71-106.
  • 5Erk K, Kowalski A, Pado S, Pinkal M. Towards a resource for lexical semantics: A large german corpus with extensive semantic annotation. In: Hinrichs EW, Roth D, eds. Proc. of the ACL 2003. Sapporo: ACL, 2003. 537-544.
  • 6Chen J, Rainbow O. Use of deep linguistic features for the recognition and labeling of semantic arguments. In: Hinrichs EW, Roth D, eds. Proc. of the EMNLP 2003. Sapporo: ACL, 2003.41-48.
  • 7Nielsen RD, Pradhan S. Mixing weak learners in semantic parsing. In: Lin D, Wu D, eds. Proc. of the EMNLP 2004. Barcelona:ACL, 2004. 80-87.
  • 8Pradhan S, Hacioglu K, Krugler V, Ward W, Martin JH, Jurafsky D. Support vector learning for semantic argument classification.Machine Learning Journal, 2005,60(3): 11-39.
  • 9Carreras X, Marques L, Chrupala G. Hierarchical recognition of propositional arguments with perceptrons. In: Ng HT, Riloff E, eds.Proc. of the CoNLL 2004. Boston: ACL, 2004.106-109.
  • 10Punyakanok V, Koomen P, Roth D, Yih W. Generalized inference with multiple semantic role labeling systems. In: Knight K, Ng HT, Oflazer K, eds. Proc. of the CoNLL 2005. Ann Arbor: ACL, 2005. 181-184.

共引文献72

同被引文献21

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部