期刊文献+

基于互信息的Fisher Score多标记特征选择 被引量:2

Multilabel feature selection based on Fisher Score with mutual information
下载PDF
导出
摘要 目前,Fisher Score模型在处理多标记数据时没有考虑样本和整个特征空间之间以及特征和标记之间的关系.提出一种基于互信息的Fisher Score多标记特征选择方法.首先,在多标记决策系统中考虑整个样本空间对特征选择的影响,根据异类样本与同类样本之间的欧式距离定义权重公式,并在特征空间下对标记赋予权重衡量标记的重要程度.然后,基于互信息理论定义特征与每个标记之间的互信息来计算每个特征和每个标记之间的相关度,将特征与标记之间的相关度与该标记所占的权重相结合来定义特征和标记集之间的总相关度.将Fisher得分与总相关度结合,定义每个特征的新的Fisher得分,进而构建多标记Fisher Score模型.最后,设计了一种基于互信息的Fisher Score多标记特征选择算法.在六个多标记数据集上的实验证明,提出的算法与其他算法相比,其四种评价指标都表现良好,分类性能出色. When processing multilabel data,the current Fisher Score models do not consider the relationship between samples and the entire feature space,and also ignore the relationship between features and labels.To address the issues,a multilabel feature selection method based on Fisher Score with mutual information is developed in this paper.Firstly,in multilabel decision systems,to consider the influence of the entire sample space on feature selection,a weight formula is defined based on the Euclidean distance between the heterogeneous samples and the similar samples,and the weights are assigned to labels in feature space to measure the importance of labels.Secondly,the mutual information theory is introduced to define the mutual information between the feature and each label to calculate the correlation between each feature and each label.When combining this correlation between feature and label with the token weight of the label,the total correlation between the feature and the label set is defined.This total correlation is employed to design a new Fisher Score for each feature,and then,a multilabel Fisher Score model is constructed.Finally,a multilabel feature selection algorithm based on Fisher Score with mutual information is designed.Experimental results applied to six multilabel datasets show that the proposed algorithm shows great classification performance in terms of four evaluation metrics when compared with the other related algorithms.
作者 孙林 张起峰 徐久成 Sun Lin;Zhang Qifeng;Xu Jiucheng(College of Computer and Information Engineering,Henan Normal University,Xinxiang,453007,China)
出处 《南京大学学报(自然科学版)》 CAS CSCD 北大核心 2023年第1期55-66,共12页 Journal of Nanjing University(Natural Science)
基金 国家自然科学基金(62076089,61976082) 河南省科技攻关项目(212102210136)。
关键词 多标记学习 特征选择 互信息 Fisher Score multilabel learning feature selection mutual information Fisher Score
  • 相关文献

参考文献13

二级参考文献51

  • 1吴迪,郭嗣琮.改进的Fisher Score特征选择方法及其应用[J].辽宁工程技术大学学报(自然科学版),2019,38(5):472-479. 被引量:9
  • 2毛勇,周晓波,夏铮,尹征,孙优贤.特征选择算法研究综述[J].模式识别与人工智能,2007,20(2):211-218. 被引量:94
  • 3LEE M C.Using support vector machine with a hybrid feature selection method to the stock trend prediction[J].Expert Systems with Applications,2009,36(8):10896-10904.
  • 4MALDONADO S,WEBER R.A wrapper method for feature selection using support machines[J].Information Sciences,2009,179(13):2208-2217.
  • 5LIU Y,ZHENG Y F.FS_SFS:A novel feature selection method for support vector machines[J].Pattern Recognition,2006,39 (7):1333-1345.
  • 6HUA J P,TEMBE W D,DOUGHERTY E R.Performance of feature-selection methods in the classification of high-dimensian data[J].Pattern Recognition,2009,42(3):409-424.
  • 7GUNAL S,GEREK O N,ECE D G,et al.The search for optimal feature set in power quality event classification[J].Expert Systems with Applications,2009,36(7):10266-10273.
  • 8WIDODO A,YANG B S.Application of nonlinear feature extraction and support vector machines for fault diagnosis of induction motors[J].Expert Systems with Applications,2007,33(1):241-250.
  • 9GUYON I,ELISSEEFF A.An introduction to variable and feature selection[J].Machine Learning Research,2003,3:1157-1182.
  • 10TALAVERA L.An evaluation of filter and wrapper methods for feature selection in categorical clustering[C]// Proceedings of 6th International Symposium on Intelligent Data Analysis.Madrid:Springer,2005:440-451.

共引文献100

同被引文献17

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部