Ensemble classification method based on sparse reconstruction residuals and random forest (cited by: 1)


Abstract: Traditional image classification algorithms based on sparse representation usually make their decision from the l2 norm of each class's residual vector after sparse reconstruction. In complex cases the l2 norms of the class residual vectors may not differ noticeably, which leads the classifier to a wrong decision. This paper proposes an ensemble classification method based on sparse representation and random forest: the image is reconstructed over a sparse-representation dictionary, the l2 norms of the class residual images are assembled into a feature vector, and a random forest makes the classification decision, which effectively improves the decision capability based on class residual vectors. Experiments on the MNIST handwritten-digit database show that, when few training samples are available, the proposed method achieves a clearly higher recognition rate than the mainstream SVM and random forest methods and is robust.

Based on the sparse representation computed by l2-minimization and on ensemble learning, we propose a general algorithm for image classification. This framework provides new insights into two crucial issues in image classification: feature extraction and classification accuracy. Since it was proposed, random forest has become a well-known data analysis method and has been applied in a wide variety of scientific areas. Because random forest classification performs well and is highly stable, we choose random forest as the ensemble learning classifier in this paper. A classifier based on sparse representation classifies a test sample by computing the l2 norm of the residual vector between its actual values and its reconstructed values. In some cases, however, the differences between the class residuals are very small, so it is hard to decide which class the test sample belongs to. We propose to extract image features with a sparse-representation reconstruction algorithm and to classify the images with a random forest classifier. First, a learned dictionary is obtained from the training image data set. We generate a sparse vector over the over-complete dictionary and then calculate the residuals between the actual values and the reconstructed values of the training samples. The residual vectors are used as the training samples of the random forest classifier. Finally, the image is classified by the trained random forest classifier. Random forests are constructed from the residuals, and the classification result is decided by a voting strategy. Our experiments use the standard handwritten-digit database MNIST as the image recognition database. The recognition rate of the proposed method is clearly higher than that of other popular classification methods, such as SVM. The experiments were carried out in MATLAB. The experimental results indicate that the proposed method performs better than methods based on random forest alone or on sparse representation alone. In addition, the method gives stable classification results and is robust to noise.
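The pipeline described in the abstract (sparse coding over a dictionary, class-wise l2 residual norms as a feature vector, then a random forest) can be sketched roughly as follows. This is a minimal illustration under several assumptions that are not stated in the source: it uses Python with scikit-learn rather than the authors' MATLAB code, it uses orthogonal matching pursuit (`orthogonal_mp`) as the sparse solver, it takes unit-norm training samples directly as dictionary atoms instead of a learned dictionary, and it runs on small synthetic data rather than MNIST. The names `residual_features` and `draw` are hypothetical helpers.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import orthogonal_mp


def residual_features(D, atom_labels, X, n_nonzero=3):
    """Map each row of X to its vector of class-wise l2 reconstruction residuals."""
    classes = np.unique(atom_labels)
    # Sparse-code every sample over the whole dictionary (OMP is an assumption).
    codes = orthogonal_mp(D, X.T, n_nonzero_coefs=n_nonzero).reshape(D.shape[1], -1)
    feats = np.empty((X.shape[0], classes.size))
    for j, c in enumerate(classes):
        mask = atom_labels == c
        # Reconstruct with class-c atoms only, then measure what is left over.
        recon = D[:, mask] @ codes[mask, :]
        feats[:, j] = np.linalg.norm(X.T - recon, axis=0)
    return feats


# Synthetic stand-in for image data: each class lies near its own 3-D subspace.
rng = np.random.default_rng(0)
dim, n_classes, n_per_split = 20, 3, 20
bases = [rng.normal(size=(dim, 3)) for _ in range(n_classes)]


def draw(n):
    """Draw n noisy unit-norm samples per class, together with their labels."""
    X = np.vstack([(B @ rng.normal(size=(3, n))).T for B in bases])
    X += 0.05 * rng.normal(size=X.shape)
    X /= np.linalg.norm(X, axis=1, keepdims=True)
    return X, np.repeat(np.arange(n_classes), n)


X_dict, y_dict = draw(n_per_split)    # samples used as dictionary atoms
X_train, y_train = draw(n_per_split)  # held-out samples that train the forest
X_test, y_test = draw(n_per_split)

D = X_dict.T  # columns are unit-norm atoms; y_dict gives each atom's class

# The forest learns a decision rule on residual vectors instead of taking argmin.
forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(residual_features(D, y_dict, X_train), y_train)

test_feats = residual_features(D, y_dict, X_test)
accuracy = forest.score(test_feats, y_test)
print(f"test accuracy: {accuracy:.3f}")
```

The forest is trained on samples that are not themselves dictionary atoms. A sample that is also an atom would reconstruct itself almost perfectly, so its own-class residual would be uninformative. Whether the authors split their data this way is not stated in the abstract.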

Authors: Cao Dongyin (曹冬寅), Wang Qiong (王琼), Zhang Xinggan (张兴敢)

Affiliation: School of Electronic Science and Engineering, Nanjing University

Source: Journal of Nanjing University (Natural Science), indexed in CAS, CSCD, and the Peking University Core Journals list; 2016, No. 6, pp. 1127-1132 (6 pages)

Funding: Open Project of the State Key Laboratory of Millimeter Waves (K201514)

Keywords: sparse representation; image classification algorithm; reconstruction algorithm; random forest

Classification code: TP181 [Automation and Computer Technology: Control Theory and Control Engineering]


