基于稀疏图表示的特征选择方法研究被引量：1

A feature selection method based on sparse graph representation

下载PDF

导出

摘要特征选择旨在降低待处理数据的维度,剔除冗余特征,是机器学习领域的关键问题之一。现有的半监督特征选择方法一般借助图模型提取数据集的聚类结构,但其所提取的聚类结构缺乏清晰的边界,影响了特征选择的效果。为此,提出一种基于稀疏图表示的半监督特征选择方法,构建了聚类结构和特征选择的联合学习模型,采用l__1范数约束图模型以得到清晰的聚类结构,并引入l_2,1范数以避免噪声的干扰并提高特征选择的准确度。为了验证本方法的有效性,选择了目前流行的几种特征方法进行对比分析,实验结果表明了本方法的有效性。 Feature selection, which aims to reduce data＇s dimensionality by removing redundant features, is one of the main issues in the field of machine learning. Most of existing graph-based semi-supervised feature selection algorithms are suffering from neglecting clear cluster structure. We propose a semi-supervised algorithm based on l1-norm graph in this paper. A joint learning framework is built upon cluster structure and feature selection; l1 - norm is imposed to guarantee the sparsity of the cluster structure, which is suitable for feature selection. To select the most relevant features and reduce the effect of outliers, the l2,1-norm regularization is added into the objective function. We evaluate the performance of the proposed algorithm over several data sets and compare the results with state-of-the-art semi--supervised feature selection algorithms. The results demonstrate the effectiveness of the proposed algorithm.

作者王晓栋严菲谢勇江慧琴

机构地区厦门理工学院计算机与信息工程学院

出处《计算机工程与科学》 CSCD 北大核心 2015年第12期2372-2378,共7页 Computer Engineering & Science

基金国家自然科学基金资助项目(61502405) 福建省教育厅中青年教师教育科研资助项目(JA15385 JA15368) 厦门理工学院对外科技合作专项资助项目(E201400400)

关键词特征选择半监督学习 l2.1范数 L1范数 feature selection semi-supervised learning l2,1 -norm l1-norm

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献2

1简彩仁,陈晓云.基于局部保持投影和稀疏表示的无监督特征选择方法[J].模式识别与人工智能,2015,28(3):247-252. 被引量：8
2谢娟英,谢维信.基于特征子集区分度与支持向量机的特征选择算法[J].计算机学报,2014,37(8):1704-1718. 被引量：64

二级参考文献49

1张莉,孙钢,郭军.基于K-均值聚类的无监督的特征选择方法[J].计算机应用研究,2005,22(3):23-24. 被引量：29
2毛勇,周晓波,夏铮,尹征,孙优贤.特征选择算法研究综述[J].模式识别与人工智能,2007,20(2):211-218. 被引量：94
3Guyon I, Elisseeff A. An introduction to variable and feature selection. The Journal of Machine Learning Research, 2003, 3:1157-1182.
4Guyon I, Weston J, Barnhill S, et al. Gene selection for cancer classification using support vector machines. Machine Learning, 2002, 46(1-3): 389-422.
5Rakotomamonjy A. Variable selection using svm based criteria. The Journal of Machine Learning Research, 2003, 3: 1357- 1370.
6Duan K B, Rajapakse J C, Wang H, et al. Multiple SVM- RFE for gene selection in cancer classification with expression data. IEEE Transactions on NanoBioscience, 2005, 4(3): 228-234.
7Xia H, Hu B Q. Feature selection using fuzzy support vector machines. Fuzzy Optimization and Decision Making, 2006, 5(2): 187-192.
8Zhou X, Tuck D P. MSVM-RFE: Extensions of SVM-RFE for multiclass gene selection on DNA microarray data. Bioinformatics, 2007, 23(9): 1106-1114.
9Maldonado S, Weber R. A wrapper method for feature selection using support vector machines. Information Sciences, 2009, 179(13): 2208-2217.
10Somol P, Novovicova J. Evaluating stability and comparing output of feature selectors that optimize feature subset cardinality. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2010, 32(11): 1921-1939.

共引文献70

1李欣,俞卫琴.基于改进GS-XGBoost的个人信用评估[J].计算机系统应用,2020,29(11):145-150. 被引量：7
2盖超会,王成刚.基于改进布谷鸟算法与SVM的矿用变压器故障诊断[J].煤炭工程,2019,51(11):134-137. 被引量：6
3李敏,章国豪,陈梓樑,郭志勇,胡晓敏.基于差分进化的多目标粒子群特征选择算法[J].计算机应用研究,2020,37(1):76-79. 被引量：8
4张文杰,蒋烈辉.一种基于遗传算法优化的大数据特征选择方法[J].计算机应用研究,2020,37(1):50-52. 被引量：20
5刘艳芳,叶东毅.基于邻域保持学习的无监督特征选择算法[J].模式识别与人工智能,2018,31(12):1096-1102. 被引量：8
6谢娟英,高红超.基于统计相关性与K-means的区分基因子集选择算法[J].软件学报,2014,25(9):2050-2075. 被引量：56
7张钰莎,蒋盛益.Clementine软件功能缺陷分析[J].信阳师范学院学报（自然科学版）,2015,28(3):450-453. 被引量：2
8毛文涛,徐文涛,薛天宇,何玲.一种基于特征子集区分度优化的分组特征选择算法[J].小型微型计算机系统,2015,36(8):1827-1831. 被引量：3
9杨昙,冯翔,虞慧群.基于多群体公平模型的特征选择算法[J].计算机研究与发展,2015,52(8):1742-1756. 被引量：5
10马国富,马胜利,王子贤,李双印,程雨丝.数据恢复在电子数据取证与司法鉴定中的应用[J].河北大学学报（自然科学版）,2015,35(5):538-545. 被引量：8

引证文献1

1严菲,王晓栋.基于自适应局部保持投影的无监督特征选择方法[J].中国科学技术大学学报,2018,48(4):290-297. 被引量：1

二级引证文献1

1周婉莹,马盈仓,郑毅,杨小飞.稀疏回归和流形学习的无监督特征选择算法[J].计算机应用研究,2020,37(9):2634-2639. 被引量：2

1董社勤,洪先龙,黄钢,顾均.基于新约束图模型的布图规划和布局算法(英文)[J].软件学报,2001,12(11):1586-1594. 被引量：4
2郭玉华,李军,靳肖闪,景宁,廖巍.复杂约束对地观测卫星成像调度技术研究[J].电子学报,2009,37(10):2326-2332. 被引量：2
3王钧,李军,景宁,郭玉华.基于约束满足的多目标对地观测卫星成像调度[J].国防科技大学学报,2007,29(4):66-71. 被引量：6

计算机工程与科学

2015年第12期

浏览历史

内容加载中请稍等...

基于稀疏图表示的特征选择方法研究被引量：1

参考文献2

二级参考文献49

共引文献70

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于稀疏图表示的特征选择方法研究 被引量：1

参考文献2

二级参考文献49

共引文献70

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于稀疏图表示的特征选择方法研究被引量：1