期刊文献+

基于标记增强的多标记代价敏感特征选择算法 被引量:4

Multi-label Cost-sensitive Feature Selection Algorithm Based on Label Enhancement
下载PDF
导出
摘要 多标记特征选择是机器学习和人工智能领域的研究热点之一,现有多标记学习的研究是假设每个示例的标记呈均匀分布,即每个示例的各个相关标记的重要程度相同.然而,在许多应用领域中这些相关标记的重要程度往往不同.为此,本文提出了一种标记增强方法,可将多标记数据中传统的逻辑标记转化为监督信息更丰富的标记分布;同时,从代价敏感学习视角,构造了基于特征代价与特征依赖度的特征重要性度量准则,在此基础上,设计了面向标记分布数据的代价敏感特征选择算法;最后,通过在真实的多标记数据集上的实验对比与分析,验证了算法的有效性和可行性. Multi-label feature selection is one of the research issues in the fields of machine learning and artificial intelligence.Existing researches on multi-label learning assume that the related labels of every instance are uniformly distributed,i.e.,the significance of different related labels of every instance is equivalent.However,the significance of related labels tends to be different in the real-word applications.To this end,this paper proposes a label enhancement method,which can transform the traditional logic distribution in multi-label data into more comprehensive label distribution.Then,based on the cost-sensitive learning perspective,a metric of feature significance is constructed using the feature cost and feature dependency,simultaneously.On this basis,a cost-sensitive feature selection algorithm for label-distributed data is designed.Finally,the effectiveness and feasibility of the algorithm are verified by experimental comparison and analysis on real multi-label data sets.
作者 黄锦涛 钱文彬 王映龙 HUANG Jin-tao;QIAN Wen-bin;WANG Ying-long(School of Computer and Information Engineering,Jiangxi Agricultural University,Nanchang 330045,China;School of Software,Jiangxi Agricultural University,Nanchang 330045,China)
出处 《小型微型计算机系统》 CSCD 北大核心 2020年第4期685-691,共7页 Journal of Chinese Computer Systems
基金 国家自然科学基金项目(61966016,61502213)资助 江西省自然科学基金项目(20192BAB207018)资助 江西省教育厅科学技术研究项目(GJJ180200)资助 江西省研究生创新专项基金项目(YC2018-S192)资助。
关键词 特征选择 粗糙集 属性约简 多标记学习 代价敏感 标记增强 feature selection rough sets attribute reduction multi-label learning cost-sensitive label enhancement
  • 相关文献

参考文献4

二级参考文献13

  • 1Tsoumakas G, Katakis 1. Multi-label Classification: An Overview. InternationalJournal of Data Warehousing and Mining, 2007, 3(3) :1-13.
  • 2Boutell M R, LuoJ B, Shen X P, et al. Learning Multi-label Scene Classification. Pattern Recognition, 2004, 37(9): 1757-1771.
  • 3Zhang M L, Zhou Z H. ML-KNN: A Lazy Learning Approach to Multi-Iabel Learning. Pattern Recognition, 2007, 40 (7) : 2038- 2048.
  • 4Miao D Q, Zhang Z F, Wei Z H, et al. DW-ML-kNN: A Dual Weighted Multi-label kNN Algorithm / / Proc of the 8th International Conference on Natural Language Processing and Knowledge Engineering. Hefei, China, 2012: 519-529.
  • 5Subasic P, Huettner A. Affect Analysis of Text Using Fuzzy Semantic Typing. IEEE Trans on Fuzzy Systems, 200 1 , 9 ( 4 ) : 483 -496.
  • 6Yao Y Y, Wong S K M. A Decision Theoretic Framework for Approximating Concepts. InternationalJournal of Man-Machine Studies, 1992, 37 (6) : 793-809.
  • 7Pawlak Z. Rough Sets. InternationalJournal of Computer and Information Sciences, 1982, 11(5): 341-356.
  • 8Wong S K M, Ziarko W. Comparison of the Probabilistic Approximate Classification and the Fuzzy Set Model. Fuzzy Sets and Systerns, 1987, 21 (3) : 357 -362.
  • 9Pawlak Z, Wong S K M, Ziarko W. Rough Sets: Probabilistic versus Deterministic Approach. InternationalJournal of Man-Machine Studies, 1988,29(1): 81-95.
  • 10Yao Y Y. Three-Way Decisions with Probabilistic Rough Sets. Information Sciences, 2010, 180( 3) : 341-353.

共引文献26

同被引文献13

引证文献4

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部