期刊文献+

基于朴素贝叶斯算法的个性化垃圾邮件过滤

Personalized Spam Filtering Based on Naive Bayes Algorithm
下载PDF
导出
摘要 目前电子邮件得到了广泛的应用,同时垃圾邮件问题也随之而来。本文针对垃圾邮件的处理,从用户的兴趣角度出发,基于朴素贝叶斯算法对垃圾邮件个性化过滤.在朴素贝叶斯算法的条件概率计算中,本文选用了多变量贝努里事件模型的计算方法,最后以VC++6.0为实验平台在Ling-Spam语料库上进行实验. E - mail is widely used all over the world recently. At the same time, junk e - mail problems also arise. In order to deal with the spare,from the users point of interest, the paper was based on the naive Bayesian algorithm for personalized spare filter. In terms of probability calculation of Naive Bayes algorithm, the paper selected calculation of multi - variable model of Bernoulli event,and carried out an experiment on the Ling- Spam Corpus with VC + + 6.0 platform.
作者 翟军昌
出处 《长春师范学院学报(自然科学版)》 2009年第2期17-20,共4页 Journal of Changchun Teachers College
关键词 垃圾邮件 朴素贝叶斯 信息增益 多变量贝努里事件模型 spare email Naive Bayes information gain multi - variable model of Bernoulli events
  • 相关文献

参考文献4

  • 1Wanli Ma,et al.On Extendable Software Architecture for Spam Email Filtering[J].IAENG International Journal of Computer Science,2007.
  • 2Androutsopoulos I,et al.An Evaluation of Naive Bayesian Anti-Spam Filtering[C].Proc of the Workshop on Machine learning(ECML 2000),2000.
  • 3Vangelis M,I Androutsopoulos,et al.Spam Filtering with Naive Bayes -Which Naive Bayes[C].CEAS 2006 Third Conference on Email and AntiSpam(CEAS 2006),Mountain View,California USA,2006,(27).
  • 4王涛,裘国永,何聚厚.基于改进Nave Bayes的垃圾邮件过滤模型研究[J].计算机工程与应用,2007,43(13):186-190. 被引量:10

二级参考文献9

  • 1胡佳妮,徐蔚然,郭军,邓伟洪.中文文本分类中的特征选择算法研究[J].光通信研究,2005(3):44-46. 被引量:47
  • 2王斌,许洪波,王申.基于结构特征的nBayes双层过滤模型[J].计算机应用,2006,26(1):191-194. 被引量:4
  • 3中国互联网协会反垃圾邮件中心.2006年第三次反垃圾邮件调查相关信息发布以及趋势分析[EB/OL].(2006).http://www.anti-spare.cn/ShowArticle.php?id=4843.
  • 4Schwartz A.SpamAssassian[M].USA:O'Reilly Media,lnc,2004-07:25-30.
  • 5Androutsopoulos l,Koutsias J,Chandrinos K V,et al.An evaluation of Naive Bayesian anti-spare filtering[C]//Proe of the Workshop on Machine Learning in the New Information Age,llth European Conferenee on Machine Learning(ECML'00),Bareelona,Spain,June 3,2000:9-17.
  • 6YANG Y,Pedersen J P.A comparative study on feature selection in text categorization[C]//Proceedings of the Fourteenth International Conference on Machine Learning (ICML'97).San Francisco, CA :Morgan Kaufmann Publishers,1997:412-420.
  • 7Wittern I H,Frank E.Data mining practical machine learning tools and teehniques[M].2nd ed.San Francisco, CA : Morgan Kaufmann Publisher, 2005 : 88-97.
  • 8Vangelis M,Androutsopoulos l,Georgios P.Spam filtering with Naive Bayes-which Naive Bayes?[C]//CEAS 2006 Third Conference on Email and AntiSpam(CEAS 2006),Mountain View,California USA,July 27-28,2006.
  • 9李凡,鲁明羽,陆玉昌.关于文本特征抽取新方法的研究[J].清华大学学报(自然科学版),2001,41(7):98-101. 被引量:78

共引文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部