融合特征选取和学习策略的支持向量机研究

Support vector machine study by combining feature selection and learn strategy

下载PDF

导出

摘要支持向量机是重要的机器学习方法之一,已成功解决了许多实际的分类问题。围绕如何提高支持向量机的分类精度与训练效率,以分类过程为主线,主要综述了在训练支持向量机之前不同的特征选取方法与学习策略。在此基础上,比较了不同的特征选取方法SFS,IWSS,IWSSr以及BARS的分类精度,分析了主动学习策略与支持向量机融合后获得的分类器在测试集上的分类精度与正确率/召回率平衡点两个性能指标。实验结果表明,包装方法与过滤方法相结合的特征选取方法能有效提高支持向量机的分类精度和减少训练样本量;在标签数据较少的情况下,主动学习能达到更好的分类精度,而为了达到相同的分类精度,被动学习需要的样本数量必须要达到主动学习的6倍。 Support Vector Machine （SVM） is one of the important machine learning methods and applied successfully to solve many classifying problems in real life. Aiming to improve the classification accuracy and training efficient of SVM, this paper reviews different feature selection algorithms and learning strategies before training SVM according to classification procedure. At the same time, this paper compares the classification accuracy of different feature selection method such as SFS, IWSS, IWSSr and BARS, and analyzes two performance measures on classification accuracy and precision/recall breakeven point when active learning strategy and SVM are combined to obtain a classifier. Experimental results indicate that the accuracy could be significantly improved and the number of training sample could be dramatically reduced by integrating the filtering method into the wrapper method; and when labeled training sample size is too small, active learning obtains better accuracy, however, if passive learning wants to have the same accuracy as active learning, passive learning must have the six times training samples than active learning.

作者吕品钟珞蔡敦波

机构地区武汉理工大学计算机科学与技术学院武汉工程大学计算机科学与工程学院武汉工程大学智能机器人湖北省重点实验室

出处《计算机工程与应用》 CSCD 2012年第32期140-146,共7页 Computer Engineering and Applications

基金国家自然科学基金青年基金项目(No.61103136) 湖北省智能机器人重点实验室开放基金(No.200906)

关键词支持向量机特征选取学习策略优化方法阈值 support vector machine feature selection learning strategy optimization methodologies threshold

分类号 TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献20

1Ogura H,Amano H,Kondo M.Feature selection with a measure of deviations from Poisson in text categoriza-tion[J].Expert Systems with Applications,2009,36:6826-6832.
2Polat K,Gunes S.A new feature selection method on classification of medical datasets:kernel F-score feature selection[J].Expert Systems with Applications,2009,36:10367-10373.
3Chen Hui-Ling,Yang Bo,Liu Jie,et al.A support vector machine classifier with rough set-based feature selec-tion for breast cancer diagnosis[J].Expert Systems with Applications,2011,38:9014-9022.
4Yang Jieming,Liu Yuanning,Liu Zhen,et al.A new feature selection algorithm based on binomial hypothesis testing for spam filtering[J].Knowledge-Based Systems,2011,24:904-914.
5Liu Yi,Zheng Y F.FS_SFS:a novel feature selection method for support vector machines[J].Pattern Recognition,2006,39:1333-1345.
6Li Shijin,Wu Hao,Wan Dingsheng,et al.An effective feature selection method for hyperspectral image classi-fication based on genetic algorithm and support vector machine[J].Knowledge-Based Systems,2011,24:40-48.
7Nguyen M H,de la Torre F.Optimal feature selection for support vector machines[J].Pattern Recognition,2010,43:584-591.
8Maldonado S,Weber R,Basak J.Simultaneous feature selection and classification using kernel-penalized support vector machines[J].Information Sciences,2011,181:115-128.
9Huang Cheng-Lung,Wang Chieh-Jen.A GA-based feature selection and parameters optimization for support vector machines[J].Expert Systems with Applications,2006,31:231-240.
10Lin Shih-Wei,Ying Kuo-Ching,Chen Shih-Chieh,et al.Particle swarm optimization for parameter determination and feature selection of support vector machines[J].Ex-pert Systems with Applications,2008,35:1817-1824.

1富宇,李春生,高雅田.基于语义缓存的数据库查询优化研究[J].计算机工程与设计,2009,30(19):4432-4435. 被引量：2
2艾默生.一种“新鲜”的包装方法[J].现代制造,2012(27):80-80.
3刘杰,金弟,杜惠君,刘大有.一种新的混合特征选择方法RRK[J].吉林大学学报（工学版）,2009,39(2):419-423. 被引量：7
4岳千钧.基于XML的网络课件共享研究[J].长沙电力学院学报（自然科学版）,2004,19(4):21-25. 被引量：3
5阿斯亚.浅谈塑料包装上的条码制作技术[J].条码与信息系统,2003(2):17-18.
6纸箱结构设计中防震技术的运用[J].全球瓦楞工业,2009(4):65-66.
7Lu Jinbu,Jiang Hongying (Gansu University of Technology, Lanzhou, PR.China 730050).Simulative Measurement of Besetting Bars for Interior Decoration[J].Computer Aided Drafting,Design and Manufacturing,2000,10(1):36-40.
8王伟,韩银和,胡瑜,李晓维,张佑生.SoC测试中低成本、低功耗的芯核包装方法[J].计算机辅助设计与图形学学报,2006,18(9):1397-1402. 被引量：4
9曹海斌,夏春和,潘红莲,朱建鹏.基于目录服务的区域网络性能测量框架NPMI设计与实现[J].计算机工程与应用,2003,39(21):164-167.
10陈鸿皖,周国祥,石雷.一种基于Ext.NET的Web控件二次包装方法研究[J].合肥工业大学学报（自然科学版）,2016,39(8):1066-1071. 被引量：1

计算机工程与应用

2012年第32期

浏览历史

内容加载中请稍等...

融合特征选取和学习策略的支持向量机研究

参考文献20

相关作者

相关机构

相关主题

浏览历史