摘要
针对标准支持向量机方法需要存储、计算和处理核矩阵而学习效率很低,不能有效处理较大规模数据挖掘的问题,提出一种基于近邻边缘检测的支持向量机方法 (SVM Method Based on Neighbor Edge Detection,ED_SVM)。该方法将近邻边缘检测技术引入SVM的训练过程,即首先对数据进行划分,选择混合类样本,通过边缘检测技术提取其中位于近似最优分类边界附近的含有较多重要支持向量信息的样本,构成新的小规模训练集,以在压缩训练集的同时保持原始支持向量信息的分布特性;并在新构成的训练集上训练标准SVM,在提高SVM学习效率的同时得到优秀的泛化性能。实验结果表明,本文提出的ED_SVM方法能够同时获得较高的测试精度和学习效率。
This paper presents a Support Vector Machine( SVM) method based on neighbor edge detection,called Support Vector Machine based on Neighbor Edge Detection( ED_SVM),in order to solve the problem that there is low training efficiency and it can not solve the large scale data mining problems of normal SVM,because it needs save,compute and solve the large kernel matrix. By dividing data and obtaining the mixed clusters,this method extracts the important samples near the approximate optimal hyperplane by introducing neighbor edge detection technology into the SVM training process,which have the most important support vector information. The new training samples set is constructed by these new important samples to keep the distribution feature of original support vectors and compress the size of training dataset. Then the normal SVM is trained on these new training samples and the good generalization performance can be obtained with high learning efficiency synchronously. The experiment results demonstrate that the proposed ED_SVM model can obtain the high learning efficiency and testing accuracy simultaneously.
出处
《计算机与现代化》
2015年第3期15-19,25,共6页
Computer and Modernization
关键词
支持向量机
边缘检测
支持向量
泛化性能
学习效率
ED_SVM算法
support vector machine
edge detection
support vector
generalization performance
learning efficiency
ED_SVM algorithm