高维数据的1-范数支持向量机集成特征选择被引量：4

Ensemble Feature Selection Based on 1-Norm Support Vector Machine for High-Dimensional Data

下载PDF

导出

摘要特征选择是机器学习和模式识别领域的关键问题之一。随着模式识别与数据挖掘的深入,研究对象越来越复杂,对象的特征维数也越来越高,此时特征选择的稳定性也显得尤为重要。分析了1-范数支持向量机,用该方法对高维数据进行特征选择,并对特征选择的结果进行集成;提出了一种针对高维数据的稳定性度量方法;在基因表达数据上的实验结果表明,集成特征选择可以有效提高算法的稳定性。 Feature selection is one of the key issues in the field of machine learning and pattern recognition. With pattern recognition and data mining becoming increasingly deeper, the target of research becoming more and more complex and the dimension of feature becoming higher and higher, the stability of feature selection is particularly important. Based on the sparse SVM （support vector machine） model, this paper analyzes L1SVM （1-norm support vector machine）, applies this method to feature selection on high-dimensional data and integrates the results of feature selection according to ensemble learning principle of feature selection. Moreover, the paper designs a new stability measure for high-dimensional data. The experimental results on the gene expression data demonstrate that ensemble feature selection is able to effectively improve the stability of feature selection.

作者鲍捷杨明刘会东

机构地区南京师范大学计算机科学与技术学院

出处《计算机科学与探索》 CSCD 2012年第10期948-953,共6页 Journal of Frontiers of Computer Science and Technology

基金国家自然科学基金 No.61003116 江苏省自然科学基金重点项目 No.BK2011005 江苏省自然科学基金 Nos.BK2011782 BK2010263~~

关键词特征选择高维数据稳定性 1-范数支持向量机集成 feature selection high-dimensional data stability 1-norm support vector machine （LSVM） ensemble

分类号 TP302.7 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献21

1Guyon I, ElisseeffA. An introduction to variable and feature selection[J]. Journal of Machine Learning Research, 2003, 3: 1157-1182.
2Saeys Y, Inza I, Larranaga P. A review of feature selection techniques in bioinformatics[J]. Bioinformatics, 2007, 23 (19): 2507-2517.
3Saeys Y, Abeel T, van de Peer Y. Robust feature selection using ensemble feature selection teehniques[C]//Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases (ECML PKDD '08). Berlin, Heidelberg: Springer-Verlag, 2008:313-325.
4Zhao Zheng. Spectral feature selection for mining ultrahigh dimensional data[D]. Arizona State University, 2010: 1-119.
5Abeel T, Helleputte T, van de Peer Y, et al. Robust biomarker identification for cancer diagnosis with ensemble feature selection methods[J]. Bioinformatics, 2010, 26(3): 392-398.
6Yang Feng, Mao K Z. Robust feature selection for microarray data based on multicriterion fusion[J]. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2011, 8(4): 1080-1092.
7Loscalzo S, Yu Lei, Ding C. Consensus group stable feature selection[C]//Proceedings of the 15th ACM S1GKDD International Conference on Knowledge Discovery and Data Mining (KDD '09). New York, NY, USA: ACM, 2009: 567-576.
8Yu Lei, Ding C, Loscalzo S. Stable feature selection via dense feature groups[C]//Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '08). New York, NY, USA: ACM, 2008: 803-811.
9Pal M, Foody G M. Feature selection for classification of hyperspectral data by SVM[J]. IEEE Transactions on Geoscience and Remote Sensing, 2010, 48(5): 2297-2307.
10Kujala J, Aho T, Elomaa T. A walk from 2-norm SVM to 1-norm SVM[C]//Proceedings of the 2009 9th IEEE International Conference on Data Mining (ICDM '09). Washington, DC, USA: IEEE Computer Society, 2009: 836-841.

同被引文献27

1刘杰,姚海林,任建喜.地铁车站基坑围护结构变形监测与数值模拟[J].岩土力学,2010,31(S2):456-461. 被引量：167
2张海,王尧,常象宇,徐宗本.L_(1/2)正则化[J].中国科学：信息科学,2010,40(3):412-422. 被引量：15
3黄亚东,张土乔,俞亭超,吴小刚.公路软基沉降预测的支持向量机模型[J].岩土力学,2005,26(12):1987-1990. 被引量：14
4孙德山,李海清.基于线性规划的支持向量聚类算法[J].计算机工程与设计,2010,31(6):1305-1307. 被引量：2
5吴崇明,王晓丹,白冬婴,张宏达.利用KKT条件与类边界包向量的SVM增量学习算法[J].计算机工程与设计,2010,31(8):1792-1794. 被引量：10
6杨敏,卢俊义.基坑开挖引起的地面沉降估算[J].岩土工程学报,2010,32(12):1821-1828. 被引量：57
7丁世飞,齐丙娟,谭红艳.支持向量机理论与算法研究综述[J].电子科技大学学报,2011,40(1):2-10. 被引量：913
8彭光金,司海涛,俞集辉,杨蕴华,李世勉,谭柯.改进的支持向量机算法及其应用[J].计算机工程与应用,2011,47(18):218-221. 被引量：18
9宋杰.线性规划ν-支持向量机的牛顿法[J].计算机工程与应用,2011,47(26):32-34. 被引量：1
10李淑,张顶立,房倩,卢伟.北京地铁车站深基坑地表变形特性研究[J].岩石力学与工程学报,2012,31(1):189-198. 被引量：144

引证文献4

1崔铁军,马云东.基于差异进化支持向量机的坑外土体沉降预测[J].中国安全科学学报,2013,23(1):83-89. 被引量：26
2冀素琴,石洪波,吕亚丽,郭珉.基于粒化-融合的海量高维数据特征选择算法[J].模式识别与人工智能,2016,29(7):590-597. 被引量：4
3陈晨,陈琴,苏一丹,朱茜.路径跟踪线性规划向量机[J].计算机工程与设计,2017,38(8):2132-2136.
4杨鹤标,刘芳,胡惊涛.基于PSO的小样本特征选择优化算法研究[J].江苏科技大学学报（自然科学版）,2021,35(1):76-81. 被引量：3

二级引证文献33

1赫飞,刘剑,崔铁军.基于结构元直接模糊集和GA算法的爆破方案选择[J].中国安全科学学报,2013,23(12):60-65. 被引量：12
2崔铁军,马云东,王来贵.基于PFC3D的露天矿边坡爆破过程模拟及稳定性研究[J].应用数学和力学,2014,35(7):759-767. 被引量：41
3陈炜,路世昌,崔铁军.基于AHP可拓综合方法的公路隧道安全等级判定研究[J].中国安全生产科学技术,2014,10(7):158-163. 被引量：16
4郭超,宋卫华,魏威.基于网格搜索-支持向量机的采场顶板稳定性预测[J].中国安全科学学报,2014,24(8):31-36. 被引量：17
5张彬,金珠,崔铁军.基于快速图解评价法的地铁隧道施工方式选择[J].中国安全生产科学技术,2014,10(9):140-145.
6陈炜,路世昌,崔铁军.基于安全考虑的地铁隧道施工方式选择研究[J].中国安全生产科学技术,2014,10(10):112-117. 被引量：6
7刘文生,吴作启,崔铁军,冯亚林.基于改进灰色系统的深基坑变形预测方法研究[J].中国安全生产科学技术,2014,10(11):21-26. 被引量：9
8陈善乐,刘剑,崔铁军,耿晓伟.基于DILSSVM的岩石强度预测研究[J].工程地质学报,2014,22(6):1071-1076.
9王会敏,李腾飞,刘洋,崔铁军.基于MDE和分形优化SVM的周期来压预测[J].中国安全生产科学技术,2015,11(1):77-83. 被引量：1
10崔铁军,马云东.考虑点和线的有向无环网络连通可靠性研究[J].计算机应用研究,2015,32(11):3315-3318. 被引量：12

1季薇,李云.基于局部能量的集成特征选择[J].南京大学学报（自然科学版）,2012,48(4):499-503. 被引量：2
2姚旭,王晓丹,张玉玺,薛爱军.基于正则化互信息和差异度的集成特征选择[J].计算机科学,2013,40(6):225-228. 被引量：3
3李霞,王连喜,蒋盛益.面向不平衡问题的集成特征选择[J].山东大学学报（工学版）,2011,41(3):7-11. 被引量：5
4马超,陈西宏,徐宇亮,王光明.广义邻域粗集下的集成特征选择及其选择性集成算法[J].西安交通大学学报,2011,45(6):34-39. 被引量：6
5孙亮,韩崇昭,沈建京,戴宁.集成特征选择的广义粗集方法与多分类器融合[J].自动化学报,2008,34(3):298-304. 被引量：10
6季金胜,郭艺友,霍宏,方涛.考虑稳定性要求的特征选择方法[J].高技术通讯,2014,24(11):1203-1209.
7孙建文,刘三(女牙),杨宗凯,王佩.采用集成特征选择的网络书写纹识别研究[J].小型微型计算机系统,2012,33(5):1108-1112.
8孟军,尉双云.基于近邻传播聚类的集成特征选择方法[J].计算机科学,2015,42(3):241-244. 被引量：6
9周丰,王未央.基于最小最大模块化集成特征选择的改进[J].计算机技术与发展,2016,26(9):149-153. 被引量：2
10周国静,李云.基于最小最大策略的集成特征选择[J].南京大学学报（自然科学版）,2014,50(4):457-465. 被引量：7

计算机科学与探索

2012年第10期

浏览历史

内容加载中请稍等...

高维数据的1-范数支持向量机集成特征选择被引量：4

参考文献21

同被引文献27

引证文献4

二级引证文献33

相关作者

相关机构

相关主题

浏览历史

高维数据的1-范数支持向量机集成特征选择 被引量：4

参考文献21

同被引文献27

引证文献4

二级引证文献33

相关作者

相关机构

相关主题

浏览历史

高维数据的1-范数支持向量机集成特征选择被引量：4