Text Classification Approach Based on Fuzzy Soft Set Theory (基于模糊软集合理论的文本分类方法)

Cited by: 8
Abstract: To improve text classification accuracy, a text classification approach based on fuzzy soft set theory is proposed. The text training set is represented as a fuzzy soft set table, and the category of a text to be classified is determined by reducing the table and constructing a soft set comparison table. To address the loss of classification accuracy caused by closely related features during feature extraction, a feature selection algorithm based on normalized mutual information is given, which effectively resolves this problem. Compared with the traditional KNN and SVM classification algorithms, the fuzzy soft set approach improves both the precision and the accuracy of text classification.
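The comparison-table step mentioned in the abstract can be illustrated with a minimal sketch. It follows the generic comparison-table scoring commonly described for fuzzy soft sets (count the parameters on which one row's membership is at least another's, then take row sum minus column sum); it is not taken from the paper itself. The layout of the table (rows as candidate categories, columns as features of the text to classify), the function names `comparison_scores` and `classify`, and all membership values are assumptions made for illustration only.

```python
from typing import Dict, List


def comparison_scores(table: Dict[str, List[float]]) -> Dict[str, int]:
    """Score each row of a fuzzy soft set table via a comparison table.

    c[i][j] counts the parameters (columns) on which row i's membership
    degree is >= row j's. The score of row i is its row sum minus its
    column sum in that comparison table.
    """
    names = list(table)
    c = {
        i: {j: sum(a >= b for a, b in zip(table[i], table[j])) for j in names}
        for i in names
    }
    row_sum = {i: sum(c[i].values()) for i in names}
    col_sum = {j: sum(c[i][j] for i in names) for j in names}
    return {i: row_sum[i] - col_sum[i] for i in names}


def classify(table: Dict[str, List[float]]) -> str:
    """Return the row (category) with the highest comparison score."""
    scores = comparison_scores(table)
    return max(scores, key=scores.get)


# Hypothetical membership degrees: one row per category, one column per
# selected feature of the text being classified (values are made up).
table = {
    "sports": [0.8, 0.1, 0.6],
    "finance": [0.2, 0.9, 0.3],
    "tech": [0.4, 0.5, 0.5],
}

print(comparison_scores(table))
print(classify(table))
```

Because every pairwise comparison adds to one row sum and one column sum, the scores always sum to zero, which is a quick sanity check on any table. The table reduction and the normalized mutual information feature selection described in the abstract are not covered by this sketch.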
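Source: Computer Engineering (《计算机工程》), indexed in CAS, CSCD, and the Peking University Core Journals list; 2010, No. 13, pp. 90-92 (3 pages)
Funding: Natural Science Foundation of Guangdong Province (Grant No. 9151001003000005)
Keywords: text classification; soft set; fuzzy soft set; feature selection; mutual information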

References (7)

  • 1. Molodtsov D. Soft Set Theory - First Results[J]. Computers and Mathematics with Applications, 1999, 37(4): 19-31.
  • 2. Maji P K, Roy A R. An Application of Soft Sets in a Decision Making Problem[J]. Computers and Mathematics with Applications, 2002, 44(8): 1077-1083.
  • 3. Kong Zhi, Gao Liqun. The Normal Parameter Reduction of Soft Sets and Its Algorithm[J]. Computers and Mathematics with Applications, 2008, 56(1): 3029-3037.
  • 4. Maji P K, Biswas R, Roy A R. Fuzzy Soft Sets[J]. Journal of Fuzzy Mathematics, 2001, 9(3): 589-602.
  • 5. Pawlak Z. Rough Sets: Theoretical Aspects of Reasoning About Data[M]. Boston, MA, USA: Kluwer Academic, 1991.
  • 6. Estévez P A, Tesmer M, Perez C A. Normalized Mutual Information Feature Selection[J]. IEEE Trans. on Neural Networks, 2009, 20(2): 189-201.
  • 7. 柴玉梅, 朱国重, 咎红英, 胡达明, 冼家扬. Centroid-based Text Classification Algorithm (基于质心的文本分类算法)[J]. Computer Engineering (计算机工程), 2009, 35(20): 83-85. Cited by: 6

Second-level references (4)

  • 1. 苏金树, 张博锋, 徐昕. Advances in Machine Learning Based Text Categorization (基于机器学习的文本分类技术研究进展)[J]. Journal of Software (软件学报), 2006, 17(9): 1848-1859. Cited by: 378
  • 2. Yang Yiming, Liu Xin. A Re-examination of Text Categorization Methods[C]//Proc. of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, USA: ACM Press, 1999.
  • 3. Han Eui-Hong, Karypis G. Centroid-based Document Classification Algorithms: Analysis & Experimental Results[R]. Minneapolis, USA: Department of Computer Science, University of Minnesota, Technical Report: TR-00-017, 2000.
  • 4. Lertnattee V, Theeramunkong T. Effect of Term Distributions on Centroid-based Text Categorization[J]. Information Sciences, 2004, 158(1): 89-115.

Co-citing documents: 5

Co-cited documents: 75

Citing documents: 8

Second-level citing documents: 55
