期刊文献+

基于信息反馈的文本主题分类过滤方法

Topic-based text categorization and filtering method based on information-feedback
下载PDF
导出
摘要 为了更加准确地分辨文本主题、过滤无关信息,在分析现有分类算法的基础上选取了SNoW作为基本主题分类算法,并提出了信息反馈和阈值过滤的策略以达到准确过滤无关信息的目的,提出了IG和CHI融合特征提取算法进一步提高系统的准确率。实验结果表明,采用信息反馈和阈值过滤以及融合特征提取的分类算法使系统更高效地实现了将相关信息准确地分类到其所属主题领域、过滤掉无关信息的双重功能。 In order to identify the text topic precisely and to filter out the irrelevant information, the multi-class classifier SNoW out of various classifiers as the basic topic-based classifier was selected.Moreover, the information-feedback combined with threshold-filteration strategy was put forward.The recommended feature abstraction, syncretizing IG and CHI as a whole, improvs the accuracy of the system.Comparison of experiments proves all the measures taken has implemented the expected categorization and filtering function with higher efficiency and effectiveness.
出处 《通信学报》 EI CSCD 北大核心 2009年第S1期139-144,共6页 Journal on Communications
关键词 主题分类 信息反馈 阈值过滤 融合特征提取 topic-based classifier information-feedback threshold-filteration feature abstraction
  • 相关文献

参考文献3

二级参考文献15

  • 1KO C, FINK G, LEVITT K. Execution monitoring of security critical programs in distributed systems: A specification-based approach[A].Proceedings of the 1997 IEEE Symposium on Security and Privacy[C]. Oakland, CA: IEEE Computer Society Press, 1997. 134 - 144.
  • 2JAVITZ HS, VALDES A. The SRI statistical anomaly detector[A].Proceedings of the 1991 IEEE Symposium on Research in Security and Privacy[C]. Oakland, CA: IEEE Computer Society Press,1991.
  • 3JOU Y, GONH F, SARGOR C, et al. Design and implementation of a scalable intrusion detection system for the protection of network infrastructure[A]. Proceedings of the DARPA Information Survivability Conference and Exposition [C]. Los Alamitos, CA: IEEE Computer Society, 2000. 69 - 83.
  • 4FORREST S, HOFMEYR SA, SOMAYAJI A. Computer immunology. Communications of the ACM[J]. 1997, 40(10) : 88 -96.
  • 5WOODALL WH, NCUBE MM. Multivariate CUSUM qualitycontrol procedures. Technometrics[J]. 1985, 27:185-192.
  • 6LOWRY CA, WOODALL WH, CHAMP CW, et al. Multivariate exponentially weighted moving average control chart. Technometrics[J]. 1992, 34:46-53.
  • 7MONTGOMERY DC. Introduction to statistical quality control[M].Fourth Edition. New York. John Wiley & Sons, 2001.
  • 8John G H,Kohavi R,Pfleger K.Irrelevant Features and the Subset Selection Problem.In:Proc.of the Eleventh Intl.Conf.on Machine Learning,1994.121~129
  • 9Kohavi R,John G H.Wrappers for feature subset selection.Artificial Intelligence,1997,97 (1-2):273~324
  • 10Liu Huan,Yu Lei.Toward Integrating Feature Selection Algorithms for Classification and Clustering.IEEE Transactions on Knowledge and Data Engineering,2005,17(5):491~502

共引文献90

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部