摘要
为了提高支持向量数据描述(SVDD)的分类精度,引入局部疏密度提出了改进的SVDD算法。该算法提高了分类精度,但增加了计算复杂度。为此,先用K-means聚类将整个数据集划分为k个簇,再用改进的SVDD算法并行训练k个簇,最后再对获得的k个局部支持向量集训练,即得到最终的全局决策边界。由于采用了分而治之并行计算的方法,提高了算法的效率。对合成数据(200个)和实际数据的实验结果表明,所提算法较SVDD算法,训练时间降低为原来的10%,分类错误率较原来的降低了近一半。因此,所提算法提高了分类精度和算法效率。
This paper proposed an improved SVDD algorithm by introducing a local density degree for each data point in order to improve the support vector data description(SVDD) classification accuracy. Proved to improve the classification accuracy, but the increase of computational complexity. To this end, first divided the whole data set into k clusters using K-means cluste- ring algorithm. Then, trained the k clusters in parallel by improved SVDD. Finally, trained the k obtained local support vector sets and got the final overall decision border. As a result of divide and conquer method and parallel computing, improved the efficiency of the algorithm. Synthetic data and real data experimental results show that the proposed method than SVDD algorithm, training time is reduced to 10% and classification error rate lower than the original by almost half. Therefore, the proposed algorithm improves the classification accuracy and algorithm efficiency.
出处
《计算机应用研究》
CSCD
北大核心
2010年第3期883-886,共4页
Application Research of Computers
基金
国家自然科学基金资助项目(60773049)
江苏大学高级人才启动基金资助项目(09JDG041)
关键词
单值分类
支持向量数据描述
K—means聚类
局部疏密度
one-class classification
support vector data description
K-means clustering
local density degree