摘要
在kNN算法分类问题中,k的取值一般是固定的,另外,训练样本中可能存在的噪声能影响分类结果。针对以上存在的两个问题,本文提出一种新的基于稀疏学习的kNN分类方法。本文用训练样本重构测试样本,其中,l_1-范数导致的稀疏性用来对每个测试样本用不同数目的训练样本进行分类,这解决了kNN算法固定k值问题;l_(21)-范数产生的整行稀疏用来去除噪声样本。在UCI数据集上进行实验,本文使用的新算法比原来的kNN分类算法能取得更好的分类效果。
The value of k is usually fixed in the issue of k Nearest Neighbors (kNN) classification. In addition, there may be noise in train samples which affect the results of classification. To solve these two problems, a sparse-based k Nearest Neighbors (kNN) classification method is proposed in this paper. Specifically, the proposed method reconstructs each test sample by the training data. During the reconstruction process,ι1-norm is used to generate the sparsity and different k values are used for different test samples, which solves the issue of fixed value of k. And ι21-norm is used to generate row sparsity which can remove noisy training samples. The experimental results on UCI datasets show that the proposed method outperforms the existing kNN classification method in terms of classification performance.
作者
宗鸣
龚永红
文国秋
程德波
朱永华
ZONG Ming GONG Yonghong WEN Guoqiu CHENG Debo ZHU Yonghua(College of Computer Science and Information Technology,Guangxi Normal University,Guilin Guangxi 541004,China Guangxi Collaborative Innovation Center of Multi-source Information Integration and Intelligent Processing, Guigang Guangxi 537000,China Department of Information Engineering,Guilin University of Aerospace Technology, Guilin Guangxi 541004,China School of Computer, Electronics and Information, Guangxi University, Nanning Guangxi 530004,China)
出处
《广西师范大学学报(自然科学版)》
CAS
北大核心
2016年第3期39-45,共7页
Journal of Guangxi Normal University:Natural Science Edition
基金
国家自然科学基金资助项目(61450001
61263035
61573270)
国家973计划项目(2013CB329404)
中国博士后科学基金资助项目(2015M570837)
广西自然科学基金资助项目(2012GXNSFGA060004
2015GXNSFCB139011
2015GXNSFAA139306)