期刊文献+

改进的贝叶斯算法在反垃圾邮件中的应用 被引量:3

The Applying of Improved Bayesian Algorithm to the Anti-spam
下载PDF
导出
摘要 论文首先介绍了向量空间模型(VSM)方法以及特征向量抽取方法,推导和研究了引入“特征之间互相独立”假设的朴素贝叶斯分类算法.在此基础上提出了一种改进的贝叶斯算法,改进的贝叶斯算法假设一部分特征之间相互独立,比朴素贝叶斯分类算法更符合实际需要。并把它应用到反垃圾邮件中。最后介绍了贝叶斯过滤算法反垃圾邮件的基本步骤。 In this paper, we first introduced the vector space model (VSM) and the method of the feature vector extraction, Then deduced and analyzed the Naive Bayesian algorithm that on the supposition of "the characteristic to be mutually independent".On the basis of this,the paper introduced a new improved Bayesian algorithm.The improved Bayesian algorithm supposed that only part of the characteristics are mutually independent,It more Conforms to the actual need than the Naive Bayesian.Then applies it in the spam mail.Finally introduced the fundamental step of filtering spam mail with Bayesian algorithm.
作者 白东燕 BAI Dong-yan (Deparmaent of Power Electronics and Electric Drives,Shijiazhuang Railway Institute,Shijiazhuang 050043,China)
出处 《电脑知识与技术》 2007年第4期154-155,共2页 Computer Knowledge and Technology
关键词 朴素贝叶斯 垃圾邮件 向量空间模型特 征向量抽取 先验概率 Naive Bayesian spare vector space model feature vector extraction priori probability
  • 相关文献

参考文献3

二级参考文献19

  • 1[2]Androutsopoulos I, Paliouras G, Michelakis E. Learning to Filter Unsolicited Commercial E-Mail [R]. Technical Report 2004/2, NCSR "Demokritos", 2004.
  • 2[3]McCallum, Andrew Kachites. Bow: A toolkit for statistical language modeling, text retrieval, classification and clustering [EB/OL]. http://www.cs.cmu.edu/~mccallum/bow, 1996.
  • 3[4]Androutsopoulos I, Koutsias J, Chandrinos K V, et al. An evaluation of naive bayesian anti-spam filtering[C]// Potamias G, Moustakis V, Someren Van M, et al. Proceedings of the Workshop on Machine Learning in the New Information Age. Barcelona: 11th European Conference on Machine Learning (ECML 2000), 2000: 9-17.
  • 4[5]Sahami M. Using Machine Learning to Improve Information Access [EB/OL]. http://ai.stanford.edu/~sahami/bio.html, 1998.
  • 5[6]Sahami M, Dumais S, Heckerman D, et al. A bayesian approach to filtering junk e-mail[C]// Sahami Mehran, Craven Mark, Joachims Thorsten, et al. Learning for Text Categorization: Papers from the 1998 Workshop.[s.l.]: AAAI, 1998.
  • 6[7]Friedman N, Geiger D, Goldszmidt M. Bayesian network classifiers [J]. Machine Learning, 1997, 29:131-163.
  • 7潘文锋.基于内容的垃圾邮件过滤研究[EB/OL].http://www.nlp.org.cn/docs/doclist.php?cat_id=17&type=10,2004-11-20.
  • 8JamesO·Berger 贾乃光.统计决策论及贝叶斯分析[M],吴喜之译[M].北京:中国统计出版社,1998.17-19,130-146.
  • 9盛骤.概率论与数量统计·第二版[M].北京:高等教育出版社,1994.18-25.
  • 10Amor N B,Benferhat S,Elouedi Z.Naive Bayes vs Decision Trees in Intrusion Detection Systems[C].Proceedings of the 2004 ACM Symposium on Applied Computing,2004:420-424.

共引文献16

同被引文献7

引证文献3

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部