基于自适应邻域和自表示正则的无监督特征选择算法

Unsupervised feature selection based on adaptive neighborhood regularized self-representation

下载PDF

导出

摘要为了更好地预处理未标记数据,大多数基于图正则的无监督特征选择算法通过构造样本的相似性矩阵来删除冗余信息并选择具有代表性的特征子集。这些方法中的大多数图都是用固定数量的近邻数来初始化,忽略了数据分布不均匀的问题。为了解决这个问题,提出了一种基于自适应邻域和自表示正则的无监督特征选择算法(Adaptive neighborhood regularized self-representation,ANRSR)来选择具有代表性和判别性的特征子集。为了保留局部内在结构,该算法将基于自适应邻域的流形正则化运用到自表示模型中,并利用了一种迭代方法来解决此优化问题。最后,选取4种经典的无监督特征选择算法,在几个基准数据集上进行了对比实验,验证所提算法能够选出具有更高聚类精度和互信息的判别性特征子集。 To better pre-process unlabeled data,most existing graph-based unsupervised feature selection algorithms remove redundant information and select representative feature subsets by constructing the similarity matrix of samples.However,most of the graphs in these methods are initialized with a fixed number of neighbors,ignoring the problem of uneven data distribution.Aiming to tackle this defect,an unsupervised feature selection based on adaptive neighborhood regularized self-representation(ANRSR)is proposed to select the representative and discriminative feature subsets.To preserve the local intrinsic structure,this paper incorporates manifold regularization based on adaptive neighborhood into the self-representation model and uses an iterative method to solve the optimization problem.Comparative experiments on several benchmark datasets among four classic algorithms and the proposed algorithm are conducted to validate that the proposed algorithm can select discriminative feature subsets which have higher clustering accuracy and mutual information.

作者彭明张继炎王慧玲黄宏昆刘艳芳 Peng Ming;Zhang Jiyan;Wang Huiling;Huang Hongkun;Liu Yanfang(College of Mathematics and Information Engineering,Longyan University,Longyan 364012,China;Department of Electronics and Information Engineering,Yili Normal University,Yining 835000,China)

机构地区龙岩学院数学与信息工程学院伊犁师范大学电子与信息工程分院

出处《南京理工大学学报》 CAS CSCD 北大核心 2021年第4期439-446,共8页 Journal of Nanjing University of Science and Technology

基金福建省中青年教师教育科研项目(科技类)(JAT190743) 龙岩市科技计划项目(2019LYF13002)。

关键词自适应邻域自表示流形学习特征选择无监督学习 adaptive neighborhood self-representation manifold learning feature selection unsupervised learning

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献4

1钱晓东,曹阳.基于社区极大类发现的大数据并行聚类算法[J].南京理工大学学报,2016,40(1):117-123. 被引量：6
2孙云云,江朝晖,单桂朋,刘海秋,饶元.最优距离聚类和特征融合表达的关键帧提取[J].南京理工大学学报,2018,42(4):416-423. 被引量：7
3刘艳芳,叶东毅.基于邻域保持学习的无监督特征选择算法[J].模式识别与人工智能,2018,31(12):1096-1102. 被引量：8
4刘艳芳,李文斌,高阳.基于自适应邻域嵌入的无监督特征选择算法[J].计算机研究与发展,2020,57(8):1639-1649. 被引量：9

二级参考文献36

1蓝章礼,帅丹,李益才.基于帧间相关性的道路监控视频关键帧提取[J].微电子学与计算机,2015,32(5):51-54. 被引量：4
2张婵,高新波,姬红兵.视频关键帧提取的可能性C-模式聚类算法[J].计算机辅助设计与图形学学报,2005,17(9):2040-2045. 被引量：21
3王方石,须德,吴伟鑫.基于自适应阈值的自动提取关键帧的聚类算法[J].计算机研究与发展,2005,42(10):1752-1757. 被引量：32
4Gantz J,Reinsel D.2011 Digital universe study:extracting value from chaos[M].USA:IDC Go-to-Market Services,2011.
5Bughin J,Chui M,Manyika J.Clouds,big data and smart assets:ten tech-enabled business trends to watch[J].McKinsey Quarterly,2010,8:1-14.
6Guha S,Rastogi R,Shim K.Cure:an efficient clustering algorithm for large databases[J].Information System Journal,1998,26(1):35～58.
7Kantabutra S,Couch A L.Parallel k-means clustering algorithm on nows[J].Nectec Technical Journal,2000,1(6):243-247.
8Clauset A.Finding local community structure in networks[J].Physics Review E,2005,72:1-6.
9Lancichinetti A,Fortunato S,Kertesz J.Detection of the overlapping and hierarchical community structure in complex networks[J].New Journal of Physics,2009,11:1-18.
10Nicosia V,Mangioni G,Carchiolo V,et al.Extending the definition of modularity to directed graphs with overlapping communities[J].Journal of Statistical Mechanics:Theory and Experiment,2009,3:03024.

共引文献26

1屈洁.虚拟环境下大数据智能并行聚类方法研究[J].计算机测量与控制,2017,25(6):257-260. 被引量：4
2刘先花.基于群体协同智能聚类的大数据存储系统设计[J].现代电子技术,2017,40(23):130-133. 被引量：7
3李京政,杨习贝,王平心,陈向坚.模糊粗糙集的稳定约简方法[J].南京理工大学学报,2018,42(1):68-75. 被引量：11
4李立,江克勤.最小串行策略下脉冲神经膜系统的语言产生能力[J].南京理工大学学报,2018,42(5):597-603. 被引量：1
5张辉,刘万军,吕欢欢.小波核局部Fisher判别分析的高光谱遥感影像特征提取[J].模式识别与人工智能,2019,32(7):624-632. 被引量：6
6葛婷,詹天明,牟善祥.基于多核协同表示分类的脑肿瘤分割算法[J].南京理工大学学报,2019,43(5):578-585. 被引量：6
7蔡晓峰.基于聚类的数字图书馆用户隐私保护方法[J].电子测量技术,2020,43(2):123-127. 被引量：5
8王韫烨,孔珊,李亚伦.基于结构近似度的社交网络聚类[J].南京理工大学学报,2020,44(2):230-235. 被引量：4
9林志达,吴石松.Dockers容器在人工智能研发平台中的关键技术研究[J].自动化与仪器仪表,2020(6):192-196. 被引量：3
10牛慧,赵艳东.基于改进Gabor小波变换的人脸情感识别[J].电子测量技术,2020,43(5):124-129. 被引量：4

1刘艳芳,李文斌,高阳.基于自适应邻域嵌入的无监督特征选择算法[J].计算机研究与发展,2020,57(8):1639-1649. 被引量：9
2王鑫,汪国强.基于子空间划分和自我表示学习的高光谱波段选择[J].黑龙江大学自然科学学报,2021,38(2):228-237. 被引量：1
3赵晨,李开成,林寿英,曾子莹,林炜鑫.基于Jerk流形正则化深度极限学习机的电能质量复合扰动识别[J].华南师范大学学报（自然科学版）,2021,53(4):8-16. 被引量：2
4Jiahai Wang,Yuyan Sun,Zizhen Zhang,Shangce Gao.Solving Multitrip Pickup and Delivery Problem With Time Windows and Manpower Planning Using Multiobjective Algorithms[J].IEEE/CAA Journal of Automatica Sinica,2020,7(4):1134-1153. 被引量：6

南京理工大学学报

2021年第4期

浏览历史

内容加载中请稍等...

基于自适应邻域和自表示正则的无监督特征选择算法

参考文献4

二级参考文献36

共引文献26

相关作者

相关机构

相关主题

浏览历史