期刊文献+

文本过滤中的特征选择

Feature Selection Method in Text Filtering
下载PDF
导出
摘要 在对目前常用的特征选择算法进行分析的基础上,提出了适用于文本过滤的特征选择模型,很好地解决了文本过滤中的"不平衡数据问题"。经实验验证,这两个模型可以很好地选择出有代表性的特征,提高了文本过滤的精度。 This paper presents a new feature selection model in text filtering on the analysis of present feature selection methods, which can resolve the imbalanced data problem. Simulation results demonstrated that the proposed method can improve the precision of text filtering.
出处 《微计算机信息》 2010年第21期164-165,201,共3页 Control & Automation
关键词 特征选择 文本过滤 不平衡数据 feature selection information f'dtering imbalanced data
  • 相关文献

参考文献8

  • 1W.Ng, H.Goh and K.Low.Feature selection, perceptron learning, and a usability case study for text categarization [C]. ACM SIGIR Conference on Rearch and Develpement in Information Retrieval, 1997:67-73.
  • 2F.Sebastiani. Machine learning in automated text categorization [J].ACM Computing Surveys, 34(1): 1-47, 2002.
  • 3V.Rijsbergen.Information Retrieval [c].Butterworths, London, 1979.
  • 4F.Galavotti,L.Sebastiani and M.Simi.Experiments on the use of feature selection and negative evidence in automated text categorization [c].Proceeding of ECDL-00, 4^th European Conference on Research and Advanced Technology for Digital Libraries,2000:59 - 68.
  • 5F.Sebastiani. Machine learning in automated text categorization [J].ACM Computing Surveys, 2002, 34(1):1-47.
  • 6L Yu, H Liu. Feature Selection for high-dimensional data: a fast correlation-based filter solution[R]. In Proceedings of the twentieth International Conference on Machine Learnlng,2003:856-863.
  • 7Yi Wang, Xiao-Jing Wang. A New Approach to Feature Selectioni Text Classification [R]. Proceeding of the Fourth International Conference on Machine Learning and Cybernetics, Guangzhou, 18- 21August,2005.
  • 8柴玉梅,王宇.基于TFIDF的文本特征选择方法[J].微计算机信息,2006,22(08X):24-26. 被引量:32

二级参考文献6

  • 1王聃,贾云伟,林福严.人脸识别系统中的特征提取[J].微计算机信息,2005,21(07X):53-55. 被引量:18
  • 2Y.Yang.A Comparative Study on Feature Selection in Text Categorization[C].In: Proceeding of the Fourteenth International Conference on Machine Learning (ICML'97),412-420,1997.
  • 3Mlademnic,D.,Grobelnik,M.Feature Selection for unbalanced class distribution and Naive Bayes[A].Proceedings of the Sixteenth International Conference on Machine Learning [C].Bled:Morgan Kaufmann, 1999:258-267.
  • 4Lewis DD. Feature selection and feature extraction for text categorization [A].Proc. of Speech and Natural Language Workshop,February 1992.212-217.
  • 5李凡,鲁明羽,陆玉昌.关于文本特征抽取新方法的研究[J].清华大学学报(自然科学版),2001,41(7):98-101. 被引量:78
  • 6代六玲,黄河燕,陈肇雄.中文文本分类中特征抽取方法的比较研究[J].中文信息学报,2004,18(1):26-32. 被引量:228

共引文献31

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部