
Feature selection algorithm of multi-labeled data based on weighted information granulation
Abstract Feature selection removes irrelevant and redundant features and is an effective tool for addressing the curse of dimensionality in multi-label data. Existing multi-label feature selection algorithms do not consider the correlations that exist in the label space, assume that all relevant labels of a sample are equally important, and overlook that the feature space may be an intrinsic factor behind differences in label importance. As a result, the selected features cannot describe the samples accurately and comprehensively, and the computation is complex. To address this, the paper uses the correlations between labels to partition the label space and thereby simplify computation, and defines a label importance measure and feature weights. On this basis, it proposes a multi-label feature selection algorithm based on weighted information granulation. Comparative experiments on real multi-label datasets show that the proposed algorithm outperforms the other compared algorithms on all evaluation metrics, which verifies its effectiveness and feasibility.
Authors HU Jun; WANG Haifeng (College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China; Chongqing Key Laboratory of Computational Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, China)
Source CAAI Transactions on Intelligent Systems (《智能系统学报》), indexed in CSCD and the Peking University Core Journals list, 2023, No. 3, pp. 619-628 (10 pages)
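Funding National Natural Science Foundation of China (61936001, 62276038); Natural Science Foundation of Chongqing (cstc2019jcyj-cxttX0002, cstc2021ycjh-bgzxm0013); Key Cooperation Project of the Chongqing Municipal Education Commission (HZ2021008).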
Keywords neighborhood rough set; information granulation; multi-label learning; label significance; label relationship; feature weight; feature selection; spectral clustering
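The abstract names weighted information granulation and neighborhood rough sets but does not give the definitions used in the paper. The following is only a minimal sketch of the general idea, under assumptions that are not taken from the paper: a weighted Euclidean distance, a fixed neighborhood radius `delta`, and a simple label-weighted consistency score. All function and variable names are hypothetical.

```python
from math import sqrt


def weighted_distance(x, y, weights):
    """Weighted Euclidean distance between two feature vectors (assumed metric)."""
    return sqrt(sum(w * (a - b) ** 2 for a, b, w in zip(x, y, weights)))


def neighborhood_granules(samples, weights, delta):
    """For each sample, return the indices of all samples within radius delta."""
    return [
        {j for j, other in enumerate(samples)
         if weighted_distance(sample, other, weights) <= delta}
        for sample in samples
    ]


def label_consistency(granules, labels, label_weights):
    """Label-weighted share of (sample, label) pairs whose whole granule
    carries the same value for that label (an illustrative score only)."""
    total = sum(label_weights) * len(granules)
    score = 0.0
    for i, granule in enumerate(granules):
        for k, w in enumerate(label_weights):
            if all(labels[j][k] == labels[i][k] for j in granule):
                score += w
    return score / total


# Toy data: feature 0 separates the two label patterns, feature 1 does not.
samples = [[0.0, 0.0], [0.1, 0.9], [1.0, 0.1], [0.9, 1.0]]
labels = [[1, 0], [1, 0], [0, 1], [0, 1]]
label_weights = [0.5, 0.5]  # hypothetical label importance values

by_feature_0 = neighborhood_granules(samples, [1.0, 0.0], delta=0.2)
by_feature_1 = neighborhood_granules(samples, [0.0, 1.0], delta=0.2)
consistency_0 = label_consistency(by_feature_0, labels, label_weights)
consistency_1 = label_consistency(by_feature_1, labels, label_weights)
```

In this toy setting, granules built from feature 0 are label-consistent while granules built from feature 1 mix samples with different labels, so a selection procedure driven by such a score would prefer feature 0. The label importance measure, the feature weights, and the label-space partition proposed in the paper may differ substantially from this sketch.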
