期刊文献+

一种用于互动型不良信息过滤的贝叶斯改进方案 被引量:1

An Improved Bayesian Learning Scheme for Interactive Harmful Information Filtering
下载PDF
导出
摘要 信息过滤是文本挖掘领域的重要研究内容之一。针对互动型网络媒体信息(如BBS),提出一种新的信息过滤算法,该算法主要从特征提取和分类器构造两方面对B ayesian方法进行改进。在对不良信息的特征提取过程中,根据网络论坛的特征,在计算中文不良信息特征项的权重时,根据关键词出现的位置、次数以及词长等建立一个特征评估函数,并用它来替换TF-IDF公式中的TF项;同时,考虑到网络论坛中的良性信息与不良信息之间的不平衡分布,采用一种不对称的学习策略来设计B ayesian分类器。实验结果及对比分析表明,该算法具有较高的过滤准确率。 Information filtering plays an important role in the text mining community. A novel Bayesian classification based information filtering algorithm which improves both feature selection and classification is presented. A new function is builded in term of occurrence,length,place and so on to replace the TF part of TF-IDF. At the same time the number of positive information is much fewer than that of harmful one. Hence,A new classification method was designed and it is called Asymmetric Naive Bays classifier. The results of experiments show that the filter designed gains a high accuracy.
出处 《广西师范大学学报(自然科学版)》 CAS 北大核心 2009年第3期134-137,共4页 Journal of Guangxi Normal University:Natural Science Edition
基金 国家自然科学基金资助项目(60773084 60603023) 教育部博士点基金资助项目(20070151009)
关键词 互动型网络媒体 不良信息 信息过滤 interactive network media harmful information information filtering
  • 相关文献

参考文献9

  • 1夏迎炬,黄萱菁,胡恬,吴立德.自适应信息过滤中使用少量正例进行阈值优化(英文)[J].软件学报,2003,14(10):1697-1705. 被引量:6
  • 2TANG Jian-gang,XIONG Guo-ping.A perspective of applying autonomia to content filtering at network[C]//Proceedings of the 2008 International Conference on Computer Science and Software Engineering-Volume 05.Washington DC,USA:IEEE Computer Society,2008,5:1242-1247.
  • 3HAO Xiu-lan,TAO Xiao-peng,ZHANG Cheng-hong,et al.An effective method to improve KNN text classifier[C]// SNPD' 07:Proceedings of the Eighth ACIS International Conference on Software Engineering,Artificial Intelligence,Networking and Parallel/Distributed Computing-Volume 01.Washington DC,USA:IEEE Computer Society,2007,1:379-384.
  • 4ZHANG Qing-guo,ZHANG Cheng-zhi.Automatic chinese keyword extraction based on KNN for implicit subject extraction[J].Knowledge Acquisition and Modeling,2008,11:689-692.
  • 5肖健华.基于支持对象的野点检测方法[J].计算机工程,2003,29(11):43-45. 被引量:23
  • 6胡振宇,张瑞玲,孙富春.基于贝叶斯方法的网络攻击定位和追踪模型[J].郑州大学学报(理学版),2008,40(3):44-47. 被引量:3
  • 7XIONG Jin-zhi,HU Tian-ming,LI Guang-ming,et al.A comparative study of three smooth SVM classifiers[J].Intelligent Control and Automation,2006,2:5962-5966.
  • 8DONG Yan-shi,HAN Ke-song.A comparison of several ensemble methods for text categorization[J].Services Computing,2004,17:419-422.
  • 9石志伟,刘涛,吴功宜.一种快速高效的文本分类方法[J].计算机工程与应用,2005,41(29):180-183. 被引量:15

二级参考文献37

  • 1范金城,胡峰.动态测量数据的抗扰性分析研究[J].数理统计与应用概率,1996,11(3):244-248. 被引量:25
  • 2Knorr E M, Ng R T. Algorithms for Mining Distance-based Outiiers in Large Datasets. Proc. VLDB, 1998:392-403.
  • 3Burges C J C. A Tutorial on Support Vector Machines for Pattern Recognition. Data Mining and Knowledge Discovery, 1998,2(2):121.
  • 4Moya M R, Koch M W, Hostetler L R. One-class Classifier Networks for Target Recognition Applications. Portland:Proceedings World Congress on Neural Networks, 1993:797-801.
  • 5Tax D, Duin R. Data Domain Description Using Support Vectors.Proc. European Symposium Artificial Neural Networks, 1999:251-256.
  • 6Salton G. Develovments in automatic text retrieval. Science, 1991,253:974-979
  • 7Zhai C, Jansen P,Roma N, Stoica E, Evans DA. Optimization in CLARIT adaptive filtering. In:Voorhees EM, Harman DK, eds.Proceedings of the 8th Text Retrieval Conference. 1999.253-258.
  • 8Zhang Y, Callan J. Yfilter at TREC9. In: Voorhees EM, Harman DK, eds, Proceedings of the 9th Text Retrieval Conference.Gaithersburg. 2000. 154-161.
  • 9Allan J. Incremental relevance feedback for information filtering. In:Frei HP, Harman D, Schiuble P, Wilkinson R, eds.Proceedings of the 19th annual international ACM SIGIR conference on Research and Development in Information Retrieval 1996.Zurich, Switzerland. 1996. 270-278.
  • 10Arampatzis A, Beney J, Koster CHA, van der Weide TP. KUN on the TREC9 filtering track: Incrementality, decay, and theshold optimization for adaptive filtering systems. In:Voorhees EM, Harman DK, eds. Proceedings of the 9th Text Retrieval Conference.Gaithersburg, 2000. 87-109.

共引文献43

同被引文献12

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部