期刊文献+

基于邻域粗糙集的多标记分类特征选择算法 被引量:108

Feature Selection for Multi-Label Classification Based on Neighborhood Rough Sets
下载PDF
导出
摘要 多标记学习是一类复杂的决策任务,同一个对象可能同时属于多个类别.此类任务在文本分类、图像识别、基因功能分析等领域广泛存在.多标记分类任务往往由高维特征描述,存在大量无关和冗余的信息.目前已经提出了大量的单标记特征选择算法以应对维数灾难问题,但对于多标记的属性约简和特征选择却鲜有研究.将粗糙集应用于多标记数据的特征选择中,针对多标记分类任务,重新定义了邻域粗糙集的下近似和依赖度计算方法,探讨了这一模型的性质,进而构造了基于邻域粗糙集的多标记分类任务的特征选择算法,并给出了在公开数据上的实验结果.实验分析证明算法的有效性. Multi-label classification is a kind of complex decision making tasks, where one object may be assigned with more than one decision label. This kind of tasks widely exist in text categorization, image recognition, gene function analysis. Multi label classification is usually described with high- dimensional vectors, and some of the features are superfluous and irrelevant. A great number of feature selection algorithms have been developed for single-label classification to conquer the curse of dimensionality. However, as to multi-label classification, fewer researches have been reported for designing feature selection algorithms. In this work, we introduce rough sets to multi label classification for constructing a feature selection algorithm. We redefine the lower approximation and dependency, and discuss the properties of the model. After that, we design a neighborhood rough sets based feature selection algorithm for multi-label classification. Experimental results show the effectiveness of the proposed algorithm.
出处 《计算机研究与发展》 EI CSCD 北大核心 2015年第1期56-65,共10页 Journal of Computer Research and Development
基金 国家自然科学基金优秀青年科学基金项目(61222210) 国家自然科学基金重点项目(61432011) 国家自然科学基金面上项目(61272095)
关键词 多标记分类 特征选择 邻域粗糙集 依赖度 multi-label classification feature selection neighborhood rough sets dependency
  • 相关文献

参考文献18

二级参考文献94

  • 1徐章艳,刘作鹏,杨炳儒,宋威.一个复杂度为max(O(|C||U|),O(|C^2|U/C|))的快速属性约简算法[J].计算机学报,2006,29(3):391-399. 被引量:234
  • 2李丹,李国正,陆文聪.用于药物活性预报的Co-Training方法[J].计算机科学,2006,33(12):159-161. 被引量:3
  • 3Wilson D R, Martinez T R. Improved Heterogeneous Distance Functions. Journal of Artificial Intelligence Research, 1997, 6( 1 ) : 1 - 34
  • 4Hu Qinghua, Yu Daren, Xie Zongxia. Neighborhood Classifiers. Expert Systems with Applications: An International Journal, 2008, 34 (2) : 866 - 876
  • 5Schapire R E, Singer Y. Boostexter: A boosting-based system for text categorization. Machine Learning, 2000, 39 (2--3):135-168.
  • 6McCallum A. Multi-label text classification with a mixture model trained by EM. Working Notes of the AAAI' 99 Workshop on Text Learning. Orlando: AAAI, 1999.
  • 7Boutell M R, Luo J, Shen X, et al. Learning multi-label scene classification. Pattern Recognition, 2004, 37(9): 1757-1771.
  • 8Yin Z, Zhou Z H. Multi-label dimensionality reduction via dependency maximization. Proceedings of the 23^rd AAAI Conference on Artificial Intelligence, Chicago, IL: AAAI, 2008, 1503-1505.
  • 9Yu K, Yu S P, Tresp V. Multi-label informed latent semantic indexing. Proceedings of the 28^th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, New York, NY:ACM, 2005, 258--265.
  • 10Moody J, Utans J. Principled architecture selection for neural networks: Application to corporate bond rating prediction. Moody J E, Hanson S J, Lippmann R P. Neural Information Processing Systems 4. Morgan Kaufmann Publishers, Inc. 1992, 683-690.

共引文献215

同被引文献675

引证文献108

二级引证文献483

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部