期刊文献+

一种改进KNN个性化邮件过滤的方法 被引量:1

A personalized E-mail classification method based on improved KNN
下载PDF
导出
摘要 针对个性化邮件过滤中接收的邮件没有规律、正常邮件和垃圾邮件存在严重类偏移等问题,提出一种改进的k最近邻(k-nearest neighbor algorithm,KNN)个性化邮件过滤方法。该方法主要是通过建立兴趣度模型(对兴趣度计算的改进,剔除用户习惯对建立兴趣度模型带来的影响)不断改变训练集,使得训练集中的文本始终代表用户最近的兴趣。然后通过对接收邮件的无规律和类偏移问题的研究,提出一种改进的KNN算法,该算法主要是对文本在聚类中的价值重新评定,使其对邮件文本进行了较好的分类。经实际验证,改进后的KNN个性化过滤方法能较好地实现对邮件进行个性化分类。 Regarding such traits as irregularity of receiving the emails and imbalance between email category and the quantity, the thesis puts forward a personalized improved KNN algorithm of filtering the emails. Improving the counting method on interests and removing the influence of the users' habits on the establishment of interest model, the method perfects KNN algorithm and thus achieves better classification of the emails based on the research as to the issues of category deviation. Experiments show that improved KNN filtering method could help the users classify the emails according to the users' interests much more accurately.
出处 《重庆邮电大学学报(自然科学版)》 北大核心 2011年第6期757-760,共4页 Journal of Chongqing University of Posts and Telecommunications(Natural Science Edition)
基金 国家自然科学基金(60773113)~~
关键词 个性化 KNN算法 邮件过滤 用户兴趣 垃圾邮件 personalized KNN algorithm E-mail filtering users' interest spare
  • 相关文献

参考文献7

  • 1王斌,潘文锋.基于内容的垃圾邮件过滤技术综述[J].中文信息学报,2005,19(5):1-10. 被引量:129
  • 2JOAEHIMS T. Text Categorization with Support Vector Machines: Learning with Many Relevant Features [ C ]// Michel Parigot, Andrei Voronkov. Lecture Notes in Com- puter Science: European Conference on Machine Learn- ing. 1998:137-142.
  • 3LI Bao-li, CHEN Yu-zhong, YU Shi-wen. A comparative Study on Automatic Categorization Methods for Chinese Serach Engine [ J ]. Computer Engineering and Applica- tions, 2002,32(2) : 117-120.
  • 4ANROUTSOPOULOS I, KOUTSIAS J, CHANDRINOS K,et al. An Experimental Comparison of Naive Bayesian and Keyword Based Anti-spare Filtering with Enter-typed Personal E-mail Message [ C]//Andrei Voronkov. Pro- eeedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New York:2000: 160-167.
  • 5江涛,陈小莉,张玉芳,熊忠阳.基于聚类算法的KNN文本分类算法研究[J].计算机工程与应用,2009,45(7):153-155. 被引量:30
  • 6张俊麒 董振兴 王国胤.一种基于用户兴趣度的改进贝叶斯邮件过滤方法.计算机科学,2010,:66-71.
  • 7阎艳,黄智兴,邱玉辉.一种基于派系过滤的社区进化发现研究[J].重庆师范大学学报(自然科学版),2009,26(2):90-93. 被引量:4

二级参考文献47

  • 1李渝勤,孙丽华.基于规则的自动分类在文本分类中的应用[J].中文信息学报,2004,18(4):9-14. 被引量:20
  • 2周涓,熊忠阳,张玉芳,任芳.基于最大最小距离法的多中心聚类算法[J].计算机应用,2006,26(6):1425-1427. 被引量:71
  • 3豆增发,王英强,王保保.一种基于信息增益的K-NN改进算法[J].电子科技,2006,19(12):52-56. 被引量:5
  • 4Dasarathy B V.Nearest Neighbor(NN) norms:NN pattern classification techniques[M].LasAlamitos, California : IEEE Computer Society Press, 1991.
  • 5Palla G, Derenyi I, Farkas I, et al. Uncovering the overlapping community structure of complex networks in nature and society [ J ]. Nature, 2005,435 : 814-818.
  • 6Derenyi I, Palla G. Clique percolation in random networks [ J ]. Physical Review Letters, 2005,94 : 160-202.
  • 7Farkas I, Palla G, Vicsek T. Weighted network modules [ J ]. New Journal of Physic,2007,9(6) :180-198.
  • 8Palla G, Farkas I J. Directed network modules [ J ]. New Journal of Physic, 2007 (9) : 186.
  • 9Palla G, Vicsek T C. Quantifying social group evolution [ J ]. Nature, 2007,446 : 664 -667.
  • 10Zhuge H. Communities and emerging semantics in semantic link network : discovery and learning[ J ]. IEEE Transactions on Knowledge and Data Engineering,2008,99 : 1-1.

共引文献160

同被引文献6

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部