A Novel Feature Selection Algorithm in Text Categorization
Abstract: The feature selection algorithms commonly used in text categorization consider only the correlation between a feature (term) and a class, and pay too little attention to the correlation among the features themselves. Based on an analysis of feature correlation, this paper proposes a new algorithm that addresses this shortcoming. Experiments confirm the feasibility and effectiveness of the algorithm and indicate that it can improve the precision of text classification.
Source: Information Technology and Informatization (《信息技术与信息化》), 2009, No. 6, pp. 39-41, 45 (4 pages)
Keywords: feature selection; text categorization; text set density
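
This record does not describe the algorithm itself, so the following is only a minimal Python sketch of the general idea stated in the abstract: scoring a term by its relevance to the class while penalizing its correlation with terms already chosen. It is not the authors' method and does not use text set density. The function names `mutual_information` and `select_terms`, the choice of mutual information as the correlation measure, and the toy corpus are all assumptions made for illustration.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Mutual information (in nats) between two equal-length discrete sequences."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px, py = Counter(xs), Counter(ys)
    # Sum of p(x, y) * log(p(x, y) / (p(x) * p(y))) over observed pairs.
    return sum((c / n) * math.log(c * n / (px[x] * py[y]))
               for (x, y), c in joint.items())


def select_terms(docs, labels, k):
    """Greedily pick k terms (hypothetical helper, not from the paper).

    docs is a list of term sets, labels a list of class labels.
    Score = term-class relevance minus mean term-term redundancy.
    """
    vocab = sorted(set().union(*docs))
    # Binary presence vector of each term over the documents.
    presence = {t: [int(t in d) for d in docs] for t in vocab}
    relevance = {t: mutual_information(presence[t], labels) for t in vocab}
    selected = []
    while len(selected) < min(k, len(vocab)):
        best, best_score = None, -math.inf
        for t in vocab:
            if t in selected:
                continue
            redundancy = 0.0
            if selected:
                redundancy = sum(mutual_information(presence[t], presence[s])
                                 for s in selected) / len(selected)
            score = relevance[t] - redundancy
            if score > best_score:
                best, best_score = t, score
        selected.append(best)
    return selected


# Toy corpus (assumed data): "goal" and "score" always co-occur.
docs = [
    {"goal", "score", "news"},
    {"goal", "score", "news"},
    {"goal", "score", "team", "news"},
    {"team", "news"},
    {"news"}, {"news"}, {"news"}, {"news"},
]
labels = ["sports"] * 4 + ["politics"] * 4

print(select_terms(docs, labels, 2))  # ['goal', 'team']
```

In this toy corpus, ranking by term-class relevance alone would place both "goal" and "score" at the top even though they carry identical information. The redundancy penalty makes the greedy procedure choose "team" as the second term instead.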
References (5)

  • 1. I. Guyon, A. Elisseeff. An introduction to variable and feature selection [J]. Journal of Machine Learning Research, 2003, 3: 1157-1182.
  • 2. L. Yu, H. Liu. Feature selection for high-dimensional data: a fast correlation-based filter solution [R]. In Proceedings of the Twentieth International Conference on Machine Learning, 2003: 856-863.
  • 3. R. Caruana, D. Freitag. Greedy attribute selection [R]. Proc. 11th Conf. on Machine Learning, 1994: 28-36.
  • 4. 陈彬, 洪家荣, 王亚东. The optimal feature subset selection problem (最优特征子集选择问题) [J]. Chinese Journal of Computers (计算机学报), 1997, 20(2): 133-138.
  • 5. G. Salton. Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer [M]. Addison-Wesley Publishing, 1989.

