期刊文献+

基于信息几何构建朴素贝叶斯分类器 被引量:1

Constructing Naive Bayesian Classifier Based on Information Geometry
下载PDF
导出
摘要 朴素贝叶斯分类器是机器学习中一种简单而又有效的分期方法。但是由于它的属性条件独立性假设在实际应用中经常不成立,这影响了它的分类性能。本文基于信息几何和Fisher分,提出了一种新的创建属性集的方法。把原有属性经过Fisher分映射成新的属性集,并在新属性集上构建贝叶斯分类器。我们在理论上探讨了新属性间的条件依赖关系,证明了在一定条件下新属性间是条件独立的。试验结果表明,该方法较好地提高了朴素贝叶斯分类器的性能。 The Naive Bayesian Classifier (NBC) is a simple yet effective technique for machine learning. But the unpractical condition independence assumption of the Naive Bayesian Classifier (NBC) greatly degrades the performance of classifying. This paper improved this method based on information geometry theory and Fisher score. We map the original attributes to new attribute set according to the Fisher score, and construct the NBC on the new attribute set. We further prove that these new attributes are condition independent of each other on certain conditions. This method shows excellent performance in experiments.
出处 《通讯和计算机(中英文版)》 2005年第2期1-6,共6页 Journal of Communication and Computer
关键词 朴素贝叶斯分类器 信息几何 Fisher分 条件独立 Naive Bayesian Classifier Information Geometry Fisher Score Condition Independence
  • 相关文献

同被引文献15

  • 1张璠.多种策略改进朴素贝叶斯分类器[J].微机发展,2005,15(4):35-36. 被引量:11
  • 2刘静,尹存燕,陈家骏.一种规则和贝叶斯方法相结合的文本自动分类策略[J].计算机应用研究,2005,22(7):84-86. 被引量:7
  • 3Ricardo Baeza-Yates,Berthier Ribeiro-Neto,Modern Information Retrieval.[M] China Machine Press,2003.
  • 4Fuchun Peng,Dale Schuurmans,Shaojun Wang,Augmenting Naive Bayes Classiers with Statistical Language Models[M].School of Computer Science at University of Waterloo,2004.
  • 5http://www.863data.org.cn/[OL].
  • 6D.Hiemstra,Using Language Models for Information Retrieval[D].Centre for Telematics and Information Technology,University of Twente,2001.
  • 7A.McCallum,K.Nigam,A Comparison of Event Models for Naive Bayes Text Classification[R].In:proceedings of AAAI-98 Workshop on "Learning for Text Categorization",1998.
  • 8D.Holmes,R.Forsyth,The Federalist Revisited:New Directions in Authorship Attribution[J].Literary and linguistic Computing,1995 (10):111-127.
  • 9J.Ponte,W.Croft,A Language Modeling Approach to Information Retrieval[A].In:proceeding of ACM Research and Development in Information Retrieval(SIGIR)[C],1998.
  • 10http://www.lemurproject.org[OL].

引证文献1

二级引证文献16

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部