基于半监督支持向量机的数据分类(英文)

Semi-Supervised Support Vector Machines for Data Classification

下载PDF

导出

摘要为解决支持向量机在分类识别前需要利用已知训练集进行训练的问题，本文提出了一种基于k均值的对无标识数据进行分类的支持向量机分类算法。首先利用k均值算法将未知数据划分成某个数量的子集，然后对新数据进行支持向量机训练得到决策边界与支持矢量，最后对无标识数据进行分类。模拟结果表明：训练时消耗的CPU时间为1.8280秒，支持向量个数为60时，分类错误率小于2％。 To solve the problem that the support vector machines (SVMs) must use a selected training set classified in advance, a SVMs classifier based on k-means clustering algorithm is presented for the classification of unlabeled data. The new algorithm is to firstly divide unlabeled data into many subsets with a new label by k-means clustering , then train the SVMs using the new data set to get decision boundary and support vectors, at last use the SVMs classifier to classify the unlabeled data. The simulations show that the classification error is less than 2% when the CPU training time is 1. 8280seconds and the number of support vector is 60.

作者李茂宽赵洪海

机构地区青岛海军潜艇学院研究生队

出处《青岛大学学报（自然科学版）》 CAS 2004年第4期44-48,共5页 Journal of Qingdao University(Natural Science Edition)

关键词支持向量机数据分类 K-均值 support vector machines data classification k-means clustering

分类号 TP301.6 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献6

1Joachims T. Text Categorization with Support Vector Machines[R]. University of Dortmund.
2Osuna E, Freund R, Girosi F. Training Support Vector Machines: An Application to Face Detection [J]. Computer Vision and Pattern Recognition, 1997,130-136.
3Bian Z G. Pattern recognition[M]. 2nd edition, Beijing: Tsinghua University Press, 2000.
4Vapnik V. The Nature of Statistical Learning Theory[M]. New York Springer-Verlag , 1995.
5Bradley P S, Mangasarian O L. Massive data discrimination via linear support vector rachines[J]. Optimization methods and software,2000 13,1-10.
6Mangasarian O L, Musicant D R. Data Discrimination via Nonlinear Generalized Support Vector Machines[R]. Technical Report 99-03,Computer Sciences Department, University of Wisconsin, 1999.

1唐少先,蔡文君.基于无监督聚类混合遗传算法的入侵检测方法[J].计算机应用,2008,28(2):409-411. 被引量：10
2武妍,王守觉.基于多层感知机和RBF转换函数的混合神经网络[J].计算机工程,2006,32(6):25-27. 被引量：2
3程凤伟.一种基于决策树的SVM算法[J].太原学院学报（自然科学版）,2017,35(1):33-36. 被引量：3
4冼广铭,曾碧卿,李星丽.半监督SVM的工作集样本预选取方法[J].计算机工程与应用,2008,44(20):172-175. 被引量：1
5王佩玮.无线射频识别标签防碰撞算法比较分析[J].物联网技术,2017,7(4):21-24. 被引量：4
6尚雷雪,郑惠莉.物联网信息安全面面观[J].通信企业管理,2014(2):81-83. 被引量：1
7胡雷,胡茑庆,秦国军.双阈值单类支持矢量机在线故障检测算法及应用[J].机械工程学报,2009,45(3):169-173. 被引量：6
8刘术杰,丁宏.基于聚类的未标识数据的入侵检测[J].计算机应用研究,2005,22(9):140-141.
9郝红卫,苏荣伟.基于K近邻决策边界的特征提取[J].模式识别与人工智能,2007,20(5):649-653. 被引量：3
10符保龙.基于背景知识和主动学习的文本挖掘技术研究[J].计算机应用与软件,2013,30(5):275-278. 被引量：1

青岛大学学报（自然科学版）

2004年第4期

浏览历史

内容加载中请稍等...

基于半监督支持向量机的数据分类(英文)

参考文献6

相关作者

相关机构

相关主题

浏览历史