
Classification Model Based on the Mean Update (一种基于均值更新的分类模型)

Cited by: 1
Abstract: The minimum distance and nearest neighbor classifiers are among the simplest, fastest, and most effective classification methods, but they are sensitive to noise and perform poorly when the training samples are few or lie far from the class centers. To address this problem, a classification model based on mean update (MU) is proposed, which improves the classification of test data by continually enlarging the training set and updating the class mean centers. On this basis, an MU-based minimum distance (MU-MD) classification model is further proposed: the mean of each class is recomputed from the MU classification results, and all test samples are then re-partitioned with the minimum distance method to determine their final class assignments. This can partially correct misclassifications made during the MU process and further improve the classification performance.
Source: Computer Systems & Applications (《计算机系统应用》), 2012, No. 8, pp. 123-126, 135 (5 pages)
Keywords: minimum distance classification; mean update; training samples; test samples
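The two-phase procedure described in the abstract can be sketched as follows. This is a minimal illustration of the idea only, not the authors' implementation: the batch size, absorption order, and all function names (`mu_md_classify`, `mean`, `dist`) are assumptions introduced here for clarity.

```python
import math

def mean(points):
    """Component-wise mean of a list of feature vectors."""
    d = len(points[0])
    return [sum(p[i] for p in points) / len(points) for i in range(d)]

def dist(a, b):
    """Euclidean distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def mu_md_classify(train, test, batch=1):
    """Hypothetical sketch of the MU / MU-MD idea from the abstract.

    train: dict mapping class label -> list of initial training vectors
    test:  list of vectors to classify
    Returns a list of predicted labels for `test`.
    """
    # --- MU phase: repeatedly absorb the closest test samples into the
    # training pools and update the class means after each absorption.
    pools = {c: list(v) for c, v in train.items()}
    remaining = list(range(len(test)))
    labels = [None] * len(test)
    while remaining:
        means = {c: mean(v) for c, v in pools.items()}
        # Rank unassigned samples by distance to their nearest class mean.
        scored = sorted(
            remaining,
            key=lambda i: min(dist(test[i], m) for m in means.values()),
        )
        # Absorb the most confident samples, then recompute the means.
        for i in scored[:batch]:
            c = min(means, key=lambda c: dist(test[i], means[c]))
            labels[i] = c
            pools[c].append(test[i])
            remaining.remove(i)
    # --- MD phase: recompute the class means from MU's final result,
    # then re-partition ALL test samples by minimum distance. This is
    # what lets MU-MD partially correct MU's earlier misclassifications.
    means = {c: mean(v) for c, v in pools.items()}
    return [min(means, key=lambda c: dist(x, means[c])) for x in test]
```

For example, with a single training sample per class and two well-separated clusters, the MU phase grows each pool one sample at a time, and the final MD pass reassigns every test sample against the updated means.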
