期刊文献+

Improving Accuracy of Spam Detection

Improving Accuracy of Spam Detection
下载PDF
导出
摘要 Spam, the electronic equivalent of junk mail, affects over 600 million users worldwide. The massive increase of spam is posing a very serious threat to email which has become an important means of communication. Not only does it annoy users, but it also consumes much of the bandwidth of the Internet. Even as anti-spam solutions change to limit the amount of spam sent to users, the senders adapt to make sure their messages are seen, while spam reorganization has different properties comparing with normal text reorganization. Presented are three different classifiers are combined with detailed analysis on various training data set of the given spam database. These classifiers are combined into a mixture of experts (MOE) system which yields overall better performance than any of the individual contributors. The instructions for further improvements on classifiers as well as its requirement on spam databases are also given. Spam, the electronic equivalent of junk mail, affects over 600 million users worldwide. The massive increase of spam is posing a very serious threat to email which has become an important means of communication. Not only does it annoy users, but it also consumes much of the bandwidth of the Internet. Even as anti-spam solutions change to limit the amount of spam sent to users, the senders adapt to make sure their messages are seen, while spam reorganization has different properties comparing with normal text reorganization. Presented are three different classifiers are combined with detailed analysis on various training data set of the given spam database. These classifiers are combined into a mixture of experts (MOE) system which yields overall better performance than any of the individual contributors. The instructions for further improvements on classifiers as well as its requirement on spam databases are also given.
出处 《Computer Aided Drafting,Design and Manufacturing》 2008年第2期103-108,共6页 计算机辅助绘图设计与制造(英文版)
关键词 spam mail classification KNN mixture system spam mail classification KNN mixture system
  • 相关文献

参考文献7

  • 1Spambase. Spambase website  ftp://ftp.ics.uci. edu/pub/machine-learning-databases/ spambase . 1999
  • 2Sandeep Mane,Jaideep Srivastava,Hwang San-Yih,Jamshid Vayghan.Estimation of false negatives in classification[].th IEEE International Conference on Data Mining.2004
  • 3Li Stan Z,Chan Kap Luk,Wang Changliang.Performance evaluation of the nearest feature line method in image classification and retrieval[].IEEE Transactions on Pattern Analysis and Machine Intelligence.2000
  • 4University of California.At Irvine website. http://www.uci.edu . 2000
  • 5X.Carreras,and L.Marquez."Boosting Trees for Anti-Spam Email Filtering"[].Proceedings of Euro Conference Recent Advances in NLP (RANLP-).2001
  • 6Schapire R E.The boosting approach to machine learning: An overview[].MSRI workshop on Nonlinear estimation and classification.2002
  • 7Drucker H,Vapnik V.Support Vector Machines for spam categorization[].IEEE Transactions on Neural Networks.1999

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部