期刊文献+

基于相似性特征聚类的加权无监督特征选择算法 被引量:1

Weighted unsupervised feature selection algorithm basedon similarity feature clustering
下载PDF
导出
摘要 目前存在的无监督特征选择算法中往往会忽略特征与特征之间的关系,从而使得特征选择结果不理想。针对这个问题,提出了基于相似性特征聚类的加权无监督特征选择算法(Weighted unsupervised feature selection algorithm based on similarity feature clustering,WFSFC)。其基本思想是:给特征f的k个近邻赋以属性权重,从而定义了新的k近邻密度以及平均冗余度,然后根据新提出的的k近邻密度及平均冗余度选择聚类中心,将特征空间中剩余的特征按照类间距离最大,类内距离最小的原则分配到已经选好的聚类中心所在的类中,最后对所选特征进行k-means聚类,并对7个UCI数据集进行实验。实验结果表明,该算法的特征选择结果较优。 In the existing unsupervised feature selection algorithms,the relationship between features is often ignored,which makes the result of feature selection unsatisfactory.Aiming at this problem,this paper puts forward Weighted unsupervised feature selection algorithm based on similarity feature clustering(WFSFC).Its basic idea is:attribute weights were assigned to k neighbors of the characteric f,thus the new k neighbors'density and average redundancty were defined.Then the clustering center is selected according to the proposed k neighbors'density and average redundancy.In accordance with the principle of the largest distance between classes and the minimum distance within the class,the remaining features in the feature space were allocated to the class where the selected clustering center is.Finally,the selected features were clustered with k-means and seven UCI data sets were experienced.The results show that the algorithm has a better feature selection.
作者 李顺勇 王改变 余曼 LI Shunyong;WANG Gaibian;YU Man(School of Mathematical Sciences,Shanxi University,Taiyuan,Shanxi 030006,China)
出处 《贵州师范大学学报(自然科学版)》 CAS 2021年第1期49-57,共9页 Journal of Guizhou Normal University:Natural Sciences
基金 国家基金(81803962) 山西省基础研究计划项目(201901D111320) 山西省回国留学人员科研资助项目(2017-020) 山西省研究生教育改革项目(2019JG023) 山西省留学人员科技活动择优资助项目(2019) 太原市科技计划研发项目(2018140105000084)资助。
关键词 无监督 特征选择 属性权重 聚类 unsupervised feature selection attribute weight clustering
  • 相关文献

参考文献7

二级参考文献76

  • 1刘杰,金弟,杜惠君,刘大有.一种新的混合特征选择方法RRK[J].吉林大学学报(工学版),2009,39(2):419-423. 被引量:7
  • 2赵飞,刘奇志,张剡,柏文阳.一种大域数据流中缺失值的填充方法[J].南京大学学报(自然科学版),2011,47(1):32-39. 被引量:4
  • 3陈彬,洪家荣,王亚东.最优特征子集选择问题[J].计算机学报,1997,20(2):133-138. 被引量:96
  • 4GUYON I,ELISSEEFF A.An Introduction to Variable and Feature Selection[J].Journal of Machine Learning Research,2003,3(3):1157-1182.
  • 5HUA J,TEMBE W,DOUGHERTY E.Performance of Feature-Selection Methods in the Classification of High-Dimension Data[J].Pattern Recognition,2009,42(3):409-424.
  • 6HE Xiao-fei,CAI Deng,NIYOGI P.Laplacian Score for Feature Selection[C]∥Advances in Neural Information Processing Systems.Vancouver,Canada:[s.n.],2005,18:507-514.
  • 7PRAISAN P,CHIDCHANOK L,KHAMRON S.Univariate Filter Technique for Unsupervised Feature Selection Using a New Laplacian Score Based Local Nearest Neighbors[C]∥Proceedings of Asia-Pacific Conference on Information Processing.Shenzhen,China:[s.n.],2009,2:196-200.
  • 8ZHANG Dao-qiang,CHEN Song-can,ZHOU Zhi-hua.Constraint Score:A New Filter Method for Feature Selection with Pairwise Constraints[J].Pattern Recognition,2008,41(5):1440-1451.
  • 9SUN Dan,ZHANG Dao-qiang.Bagging Constraint Score for Feature Selection with Pairwise Constraints[J].Pattern Recognition,2010,43(6):2106-2118.
  • 10BISHOP C M.Neural Networks for Pattern Recognition[M].[S.l.]:Oxford University Press,1995:105-112.

共引文献118

同被引文献8

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部