期刊文献+

基于互信息和关系积理论的特征选择方法 被引量:11

Feature Selection Method Based on Mutual Information and Attribute Union Theory
下载PDF
导出
摘要 研究互信息理论,针对其不足引进粗糙集并给出一个基于关系积理论的属性约简算法,以此为基础提出一个适用于海量文本数据集的特征选择方法。该方法使用互信息进行特征初选,利用所给的属性约简算法消除冗余,从而获得具有代表性的特征子集。实验结果表明,该特征选择方法效果良好。 This paper analyzes Mutual Information(MI) theory.According to deficiency of MI,rough set is introduced and an attribute reduction algorithm based on attribute union theory is proposed.A feature selection method based on MI and attribute union theory is presented which is suitable for massive text data sets.The method uses MI to select features,and employs the proposed attribute reduction algorithm to eliminate redundancy,so the feature subsets which are more representative can be acquired.Experimental results show that the method is effective.
出处 《计算机工程》 CAS CSCD 北大核心 2010年第13期257-259,共3页 Computer Engineering
基金 四川省教育厅科研基金资助项目(2006A108)
关键词 特征选择 互信息 粗糙集 关系积理论 属性约简 feature selection Mutual Information(MI) rough set attribute union theory attribute reduction
  • 相关文献

参考文献6

二级参考文献16

  • 1陈涛,谢阳群.文本分类中的特征降维方法综述[J].情报学报,2005,24(6):690-695. 被引量:79
  • 2杨丽华,戴齐,杨占华.文本分类技术研究[J].微计算机信息,2006(05X):209-211. 被引量:13
  • 3刘开瑛 由丽萍.汉语框架语义知识库构建工程.中文信息学报,.
  • 4Wang Dianhui, DILLON T S, CHANG E J. A Data Mining Approach for Fuzzy Classification Rule Generation [C]. IEEE 2001,2001
  • 5Yang Yiming, PEDERSON J O.A Comparative Study on Feature Selection in Text Categorization.Proceedings of the 14th International Conference on Machine Learning [C],1997
  • 6Yiming Yang.A Comparative Study on Feature Selection in Text Categorization[J].The ICML97,Nashville,1997.
  • 7Monica Rogati,Y Yang.High-Performing Feature Selection for Text categorization[C].Proceedings of the Fourteenth International Conference on Machine Learning (ICML'99),2000.
  • 8Thorsten Joachims.Text Classification with Support Vector Machines:Learning with Many Relevant Feature[J].Artificial Intelligence Journal special issue:Best of IJCAI-99,2000.
  • 9T Joachims.Making large-Scale SVM Learning Practical.Advances in Kernel Methods-Support Vector Learning[M].B Sch?lkopf and C Burges and A Smola (ed.),MIT-Press,1999.
  • 10SKOWRON A, POLKOWSKI L. Synthesis of decision systems from data tables[M]//LIN T Y, CERCONE N. Rough sets and data mining: analysis for imprecise data. Boston: Kluwer Academic Publishers, 1997: 259-300.

共引文献35

同被引文献90

引证文献11

二级引证文献88

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部