一种基于交叉验证思想的半监督分类方法被引量：9

A Semi-supervised Classification Algorithm Based on the Idea of Cross Validation

下载PDF

导出

摘要为了提高半监督分类的有效性,提出一种基于交叉验证思想的半监督分类方法(CV-S3VM)。通过对未标记样本进行伪标记,将伪标记后的样本加入到标记样本集中,参与交叉验证,选取能使SVM分类器误差最小的标记作为最终的标记,实现对未标记样本进行标记。依次挖掘未标记样本的隐含信息,增加标记样本的数目。使用UCI数据集模拟半监督分类实验环境,结果表明CV-S3VM具有较高的分类率,在标记样本较少的情况下效果更为明显。 In order to improve the performance of semi - supervised classifier, a kind of semi - supervisedclassification algorithm CV - S3VM based on the idea of cross validation was proposed. Unlabeled sampleswere labeled and added to the labeled sample set to participate in cross validation. The labels which makeSVM classifier error minimum were selected as the final lables to mark the unlabeled samples. In this waythe information embedded in the unlabeled samples were mined and the number of labeled samples wasexpanded. Finally, the UCI dataset was used to simulate the semi -supervised classification experimentalenvironment. The results show that CV - S3VM has a higher classification rate. In the case of few labeledsamples, the effect is more obvious.

作者赵建华

机构地区西北工业大学计算机学院商洛学院计算机科学系

出处《西南科技大学学报》 CAS 2014年第1期34-38,48,共6页 Journal of Southwest University of Science and Technology

基金陕西省教育厅科研计划项目资助(12JK0748)

关键词机器学习半监督分类交叉验证支持向量机 Machine learning Semi - supervised classification Cross validation Support vector machine

分类号 TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献18

1吴伟宁,刘扬,郭茂祖,刘晓燕.基于采样策略的主动学习算法研究进展[J].计算机研究与发展,2012,49(6):1162-1173. 被引量：33
2ZHU X J. Semi -supervised Learning Literature Survey [ R]. Madison : University of Wisconsin, 2008.
3李昆仑,曹铮,曹丽苹,张超,刘明.半监督聚类的若干新进展[J].模式识别与人工智能,2009,22(5):735-742. 被引量：50
4CH APELLE O, ZIEN A. Semi -supervised Classifica- tion by Low Density Separation [ C ]. Proceedings of the 10th International Workshop on Artificial Intelligence and Statistics, Barbados, 2005. 57 -64.
5ZHOU Z H , LI M. Tri -training: exploiting unlabeled data using three classifiers [ J ] . IEEE Transactions on Knowl- edge and Data Engineering , 2005, 17(11) :1529-1542.
6Zhang M L, ZHOU Z H. CoTrade: Confident co -train- ing with data editing[J]. IEEE Transactions on Systems, Man, and Cybernetics - Part B: Cybernetics, 2011, 41 (6) : 1612 - 1626.
7赵建华,李伟华.一种协同半监督分类算法Co-S3OM[J].计算机应用研究,2013,30(11):3237-3239. 被引量：12
8WANG Yun - yun, CHEN Song - cai, ZHOU Zhi - hua. New semi - supervised classification method based on modified cluster assumption [ J ]. IEEE Transactions on Neural Networks and Learning Systems, 2012, 23 (5): 689 - 702.
9LI Y F, KWOK J T, ZHOU Z H. Cost - Sensitive Semi - supervised Support Vector Machine [ A ]. In : Proceed- ings of the 24th AAAI Conference on Artificial Intelli- gences (AAAI10) [ C]. Atlanta, GE, 2010, 500 - 505.
10MENG Jun, WU Li - xia, WANG Xiu - kun. Granulation -based symbolic representation of time series and semi -supervised classification [ J ]. Computers and Mathe- matics with Applications, 2011, 62 (9) : 3581 - 3590.

二级参考文献188

1Olivier C, Bernhard S, Alexander Z. Semi-Supervised Learning. Cambridge, USA : MIT Press, 2006 : 3 - 10.
2Blum A, Mitchell T. Combining Labeled and Unlabeled Data with Co-Training//Proe of the 11th Annual Conference on Computational Learning Theory. Madison, USA, 1998 : 92 - 100.
3Zhong Shi. Semi-Supervised Model-Based Document Clustering: A Comparative Study. Machine Learning, 2006, 65 ( 1 ) : 3 - 29.
4Wagstaff K, Cardie C, Rogers S, et al. Constrained K-means Clustering with Background Knowledge // Proc of 18th International Conference on Machine Learning. San Francisco, USA, 2001:577 -584.
5Wagstaff K, Cardie C. Clustering with Instance-Level Constraints// Proc of the 17th International Conference on Machine Learning. SanFrancisco, USA, 2000:1103 - 1110.
6Huang Desheng, Pan Wei. Incorporating Biological Knowledge into Distance-Based Clustering Analysis of Micro Array Gene Expression Data. Bioinformatics, 2006, 22 (10) : 1259 - 1268.
7Tari L, Baral C, Kim S. Fuzzy C-Means Clustering with Prior Biological Knowledge. Journal of Biomedical Informatics, 2009, 42 (1): 74-81.
8Ceccarelli M, Maratea A. Improving Fuzzy Clustering of Biological Data by Metric Learning with Side Information. International Journal of Approximate Reasoning, 2008, 47 ( 1 ) : 45 - 57.
9Huang Ruizhang, Lam W. An Active Learning Framework for Semi Supervised Document Clustering with Language Modeling. Data & Knowledge Engineering, 2008, 68 ( 1 ) : 49 - 67.
10Erman J, Mahanti A, Arlitt M, et al. Offline/Realtime Traffic Classification Using Semi-Supervised Learning. Performance Evaluation, 2007, 64(9/10/11/12): 1194- 1213.

共引文献183

1张智韬,台翔,杨宁,张珺锐,黄小鱼,陈钦达.不同植被覆盖度下无人机多光谱遥感土壤含盐量反演[J].农业机械学报,2022,53(8):220-230. 被引量：11
2文辉,徐永林,于敬.基于主动学习的领域知识多模式抽取框架[J].新一代信息技术,2022,5(6):137-143.
3潘章明.半监督的自动聚类[J].计算机应用,2010,30(10):2614-2617. 被引量：2
4潘俊,孔繁胜,王瑞琴.加权成对约束投影半监督聚类[J].浙江大学学报（工学版）,2011,45(5):934-940. 被引量：2
5徐晓丹.基于半监督学习的中文多文档子主题划分[J].浙江师范大学学报（自然科学版）,2011,34(3):302-305. 被引量：1
6计华,张化祥,孙晓燕.基于最近邻原则的半监督聚类算法[J].计算机工程与设计,2011,32(7):2455-2458. 被引量：7
7蔡世玉,夏战国,张文涛.时间序列相似性半监督谱聚类[J].计算机工程与应用,2011,47(31):116-118. 被引量：1
8潘章明.半监督的人工免疫网络聚类[J].计算机系统应用,2011,20(12):99-104.
9王亮,王士同.基于成对约束的动态加权半监督模糊核聚类[J].计算机工程,2012,38(1):148-150. 被引量：2
10赵卫中,马慧芳,李志清,史忠植.一种结合主动学习的半监督文档聚类算法[J].软件学报,2012,23(6):1486-1499. 被引量：30

同被引文献92

1赵卓翔,王轶彤,田家堂,周泽学.社会网络中基于标签传播的社区发现新算法[J].计算机研究与发展,2011,48(S3):8-15. 被引量：37
2彭宏,吴铁峰,张东娜.粗糙模糊模型及其在入侵检测中的应用[J].西华大学学报（自然科学版）,2005,24(3):1-3. 被引量：2
3吴庆涛,邵志清.入侵检测研究综述[J].计算机应用研究,2005,22(12):11-14. 被引量：19
4王翼,刘兴伟.基于免疫算法的入侵检测系统[J].西华大学学报（自然科学版）,2006,25(5):48-50. 被引量：2
5赵悦,穆志纯.基于QBC的主动学习研究及其应用[J].计算机工程,2006,32(24):23-25. 被引量：5
6吴青,刘三阳,郑巍.基于乘性规则的支持向量机[J].智能系统学报,2007,2(2):74-77. 被引量：3
7ZHU X J. Semi-supervised Learning Literature Survey[ R/OL]. University of Wisconsin, Madison Department of Computer Sciences ,2012 -03 - 15 [ 2014 - 03 - 15 ] http://diqital, library, wisc. edu/ 1793/60444.
8Zhou Z H, Li M. Semi-supervised Learning by Disagreement [ J ]. Knowledge and Information Systems, 2010, 24 (3) : 415 - 439.
9Blum A, Mitchell T. Combining Labeled and Unlabeled Data with Co-training. [ C ]//Proceedings of the 11 th Annual Conference on Computational Learning Theory ( COLT' 98 ). Wisconsin, USA: ACM, 1998:92 - 100.
10Zhou Z H, Li M. Tri-training : Exploiting Unlabeled Data using Three Classifiers [ J ]. IEEE Transactions on Knowledge and Data Engi- neering,2005,17 ( 11 ) : 1529 - 1541.

引证文献9

1赵建华.一种安全的基于分歧的半监督分类算法[J].西华大学学报（自然科学版）,2014,33(5):1-6. 被引量：2
2刘宁.一种半监督网络入侵检测系统SSIDS-CV[J].计算机与数字工程,2015,43(4):648-651.
3赵建华,刘宁.结合主动学习策略的半监督分类算法[J].计算机应用研究,2015,32(8):2295-2298. 被引量：7
4刘宁,赵建华.一种多分类器协同的半监督分类算法SSC_MCC[J].河南科学,2015,33(9):1554-1558.
5赵建华,刘宁.结合主动学习和半监督学习的网络入侵检测算法[J].西华大学学报（自然科学版）,2015,34(6):53-57. 被引量：5
6尚耐丽,王骁力,沈鹍霄,卢玉领,马晓普,兰义华.半监督分类方法的研究[J].计算机应用与软件,2015,32(11):162-166. 被引量：4
7张晶慧,唐洪.基于体表心音的左心室血压预测方法研究[J].生物医学工程学杂志,2017,34(3):335-341.
8赵建华,刘宁.面向高维数据的安全半监督分类算法[J].计算机系统应用,2019,28(5):178-184. 被引量：2
9赵建华,刘宁.一种基于样本选择的安全半监督分类算法[J].系统仿真技术,2020,16(1):7-11.

二级引证文献20

1赵建华.基于SOM神经网络的半监督分类算法[J].西华大学学报（自然科学版）,2015,34(1):36-40. 被引量：7
2周碧英.基于模式匹配的网络安全协处理器优化研究[J].渭南师范学院学报,2016,31(16):59-63. 被引量：1
3刘宁,赵建华,冯骜骜.基于主动学习的有监督在线多核学习算法[J].河南科学,2016,34(9):1423-1427. 被引量：2
4张鹏,刘寅,栾国强,刘行,丁晓玉,程根.基于图约束和预聚类的主动学习算法在威胁情景感知中的研究[J].计算机应用研究,2017,34(5):1544-1547. 被引量：1
5王军,刘三民,刘涛.面向概念漂移的数据流分类研究分析[J].绵阳师范学院学报,2017,36(5):80-89.
6贾伟,华庆一,张敏军,陈锐,姬翔,王博.改进极限学习机的移动界面模式半监督分类[J].计算机工程与应用,2018,54(2):11-19. 被引量：7
7陈娟,朱福喜.结合半监督与主动学习的时间序列PU问题分类[J].计算机工程与应用,2018,54(11):116-121.
8张敏,陈锻生.结合情感词典的主动贝叶斯文本情感分类方法[J].华侨大学学报（自然科学版）,2018,39(4):623-626.
9崔颖,徐凯,陆忠军,刘述彬,王立国.主动学习策略融合算法在高光谱图像分类中的应用[J].通信学报,2018,39(4):91-99. 被引量：7
10周丽娟.基于小生镜和RBF-ELMAN网络的入侵检测方法[J].山西大同大学学报（自然科学版）,2018,34(6):27-30. 被引量：1

1尚耐丽,王骁力,沈鹍霄,卢玉领,马晓普,兰义华.半监督分类方法的研究[J].计算机应用与软件,2015,32(11):162-166. 被引量：4
2陈少利.CV中使用ADO调用带参数的存储过程[J].电脑技术信息,2000(12):18-18.
3徐晨,曹辉,赵晓.基于SVM的说话人识别参数选择方法[J].计算机工程,2012,38(21):175-177. 被引量：5
4张月琴,胡斌.基于遗传神经网络的图像分割[J].电脑开发与应用,2011,24(2):16-18. 被引量：9
5丁剑,韩萌.基于交叉验证的神经网络实现[J].大连民族学院学报,2008,10(5):422-424. 被引量：7
6丁红军,蔡鸿杰,邢克礼.遗传神经网络在图像分割中应用研究[J].自动化技术与应用,2010,29(3):8-12. 被引量：6
7邹玉梅,范敬雅,张鹏程.基于主成分分析的支持向量机对购房意愿的分类研究[J].技术与创新管理,2016,37(5):544-546. 被引量：1
8张昀.数据挖掘技术研究[J].软件导刊,2009,8(9):171-172. 被引量：9
9吴飞.基于手机传感器的左右手识别[J].现代计算机（中旬刊）,2017(2):26-30.
10马元良,裴生雷.基于改进遗传算法的SVM参数优化研究[J].计算机仿真,2010,27(8):150-152. 被引量：12

西南科技大学学报

2014年第1期

浏览历史

内容加载中请稍等...

一种基于交叉验证思想的半监督分类方法被引量：9

参考文献18

二级参考文献188

共引文献183

同被引文献92

引证文献9

二级引证文献20

相关作者

相关机构

相关主题

浏览历史

一种基于交叉验证思想的半监督分类方法 被引量：9

参考文献18

二级参考文献188

共引文献183

同被引文献92

引证文献9

二级引证文献20

相关作者

相关机构

相关主题

浏览历史

一种基于交叉验证思想的半监督分类方法被引量：9