期刊文献+

基于在线线性判别学习模型的垃圾邮件过滤方法

A Method of Spam Filtering Based on Online Linear Discriminative Learning Model
下载PDF
导出
摘要 给出了一种使用在线线性判别学习模型进行垃圾邮件过滤的方法,使用贝叶斯理论进行特征提取,特征按出现的位置进行分类,不同类别的特征赋予不同的权重.在TREC测试集上进行了实验,并和TREC评测的结果进行了对比.实验结果表明,该方法取得了较好的结果. Spam filtering is an important task in the application of internet. In this paper a method of spam filtering based on online linear discriminative Learning Model is presented. We statically derive the features using Bayesian rule, clustering them into groups according to their position and then assigning weights respectively. The model is evaluated by TREC Spam corpus and compared with the TREC results. Experimental results show that our linear discriminative model can produce competitive results.
出处 《哈尔滨理工大学学报》 CAS 2008年第3期48-50,共3页 Journal of Harbin University of Science and Technology
关键词 垃圾邮件过滤 判别学习模型 特征提取 贝叶斯理论 主动学习 spam filtering discriminative learning model feature extraction bayesian theory active learning
  • 相关文献

参考文献6

  • 1CORMACK G V, BRATKO A. Batch and on-line Spam Filter Evaluation [ C ]. Third Conference on Email and AntiSpam ( CEAS), California: Mountain View, 2006,27 - 28.
  • 2SEBASTIANI F. Machine Learning in Automated Text Cate-gorization [ J ]. ACM Computing Surveys,2002,34 ( 1 ) : 1 - 47.
  • 3LYNAM T R, ORMACK C G V. On-line Spam Filter Fusion[ C] // SIGIR 2006. Washington, USA. 2006:123 - 130.
  • 4GOODMAN J, YIH W. Online Discriminative Spam Filter Training[C]// Third Conference on Email and AntiSpam (CEAS). California, USA : Mountain View, 2006,27 - 28.
  • 5SCULLEY D, WACHMAN G M. Relaxed Online SVMs for Spam Filtering[ C] //SIGIR'07. 2007:415 -422.
  • 6YERAZUNIS B. CRM114 Revealed-Or How I Learned To Stop Worrying and Trust My Automatic Monitoring Systems [ EB/OL] [2005 -03 -6]. This is the Complete CRM114 Manual Available for Free Download at http ://crm114. sourceforge. net. [ 2007 -10 -12].

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部