Support vector machine via consensus centre adjustment on random features


Abstract: Support vector machines (SVMs) are among the most popular classification tools, but on very large datasets they demand large amounts of memory and training time, so large-scale SVMs are usually trained on computer clusters or supercomputers. This paper presents a novel parallel SVM algorithm, RF-CCASVM, that solves large-scale SVM problems in a resource-limited computing environment. Using the random Fourier map, a low-dimensional explicit feature mapping uniformly approximates the infinite-dimensional implicit feature mapping of the Gaussian kernel, so that a linear SVM uniformly approximates the Gaussian kernel SVM. The algorithm is parallelized with a consensus centre adjustment strategy. Concretely, the dataset is partitioned into several subsets, and multiple processes independently train SVMs in parallel, each on its own subset. When the optimal hyperplanes on the subsets are nearly found, the current solution of each SVM is replaced by the consensus centre solution obtained from all subsets, and training continues on the subsets until the consensus centre solution is optimal on every subset. Comparative experiments on benchmark datasets verify the correctness and effectiveness of RF-CCASVM.
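The two ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' RF-CCASVM implementation: the abstract does not define the consensus centre or the stopping test, so the sketch assumes the consensus centre is the arithmetic mean of the per-subset weight vectors, replaces "nearly optimal" with a fixed number of local subgradient steps, and simulates the parallel workers sequentially. All function names and parameter values (`make_rff_map`, `local_svm_steps`, `consensus_train`, `gamma`, `lam`, `lr`) are hypothetical.

```python
import numpy as np

def make_rff_map(dim, n_features, gamma, rng):
    """Return z such that z(x) . z(y) approximates exp(-gamma * ||x - y||^2)."""
    # Frequencies follow the Fourier transform of the Gaussian kernel: N(0, 2*gamma*I).
    W = rng.normal(scale=np.sqrt(2.0 * gamma), size=(dim, n_features))
    b = rng.uniform(0.0, 2.0 * np.pi, size=n_features)

    def z(X):
        return np.sqrt(2.0 / n_features) * np.cos(X @ W + b)

    return z

def local_svm_steps(w, Z, y, lam, lr, n_steps):
    """A few full-batch subgradient steps on the L2-regularised hinge loss."""
    for _ in range(n_steps):
        active = y * (Z @ w) < 1.0  # points that violate the margin
        grad = lam * w - (Z[active].T @ y[active]) / len(y)
        w = w - lr * grad
    return w

def consensus_train(Z, y, n_workers=4, rounds=30, lam=1e-3, lr=0.5, n_steps=20):
    """Assumed consensus scheme: train locally, then restart every worker from the mean."""
    parts = np.array_split(np.arange(len(y)), n_workers)
    w = np.zeros(Z.shape[1])
    for _ in range(rounds):
        # Each list entry stands in for one process working on its own subset.
        local = [local_svm_steps(w.copy(), Z[p], y[p], lam, lr, n_steps) for p in parts]
        w = np.mean(local, axis=0)  # assumed consensus centre
    return w

rng = np.random.default_rng(0)

# Check the kernel approximation on a few random points.
X_small = rng.uniform(-1.0, 1.0, size=(50, 2))
z_big = make_rff_map(dim=2, n_features=2000, gamma=1.0, rng=rng)
sq_dists = ((X_small[:, None, :] - X_small[None, :, :]) ** 2).sum(axis=-1)
kernel_err = np.abs(z_big(X_small) @ z_big(X_small).T - np.exp(-sq_dists)).max()
print("max kernel approximation error:", kernel_err)

# XOR-style data: not linearly separable in the input space.
X = rng.uniform(-1.0, 1.0, size=(600, 2))
y = np.where(X[:, 0] * X[:, 1] > 0, 1.0, -1.0)
z = make_rff_map(dim=2, n_features=300, gamma=1.0, rng=rng)
w = consensus_train(z(X), y)
train_acc = np.mean(np.sign(z(X) @ w) == y)
print("training accuracy of the consensus solution:", train_acc)
```

The explicit map keeps every worker's problem linear and of fixed dimension, so only one weight vector per subset has to be exchanged in each round, which is what makes this kind of scheme plausible on limited hardware.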

Authors: Liao Shizhong, Lu Wei

Affiliation: School of Computer Science and Technology, Tianjin University

Source: Computer Engineering and Applications (《计算机工程与应用》), CSCD, 2014, No. 17, pp. 44-48, 55 (6 pages)

Funding: National Natural Science Foundation of China (No. 61170019); Natural Science Foundation of Tianjin (No. 11JCYBJC00700)

Keywords: parallel support vector machines (SVM); large-scale datasets; limited resources; random Fourier features; consensus centre adjustment

Classification code: TP181 [Automation and Computer Technology: Control Theory and Control Engineering]


