基于双流混合变换CNN特征的图像分类与识别被引量：2

IMAGE CLASSIFICATION AND RECOGNITION BASED ON DEEP TWO STREAM MIXED CNN FEATURES

下载PDF

导出

摘要具有表达能力及可辨别性更强的特征是图像分类与识别技术的关键。深度CNN特征经过多次中间非线性变换,特征鲁棒性更强,在图像分类与识别领域已取得重大进展。但传统的CNN模型只增加变换层次,下层变换依赖于上层输出结果,因此其中间特征冗余度较低,最终得到的特征向量信息丰富程度不够。本文提出一种基于双流混合变换的CNN模型——DTM-CNN。该模型首先使用不同大小的感受野卷积核提取图像不同的中间特征,然后在多次深度变换时,对中间特征进行混合流动,经过多次混合变换,最终得到1024维的特征向量,并使用Softmax回归函数对其分类。实验结果表明,该模型经过多次卷积、池化及激活变换,提取的特征更加抽象、语义及结构信息更加丰富,对图像具有更强的表达能力及辨别性,因此图像分类及识别性能优越。 It is very important for image classification and recognition that the feature is more discriminative and has power representation ability. The deep CNN feature is more robust than other features because of its more non-linear transformation,and great breakthrough has obtained in the field of image classification and recognition based on the CNN. However,in the traditional CNN model,there just increase the transformation layers,and the posterior layer relies on the prior layer. As a result,the intermediate feature has low redundancy,and there is no enough information in the feature. In this paper,we propose a novel CNN model based on two stream and mixed transform. In this model,the intermediate feature is extracted via using different convolution kernels firstly. And then,the mixed feature is generated and flows forward when the deep transform is executed. Finally,we get a 1024 D feature vector and classify it with the Softmax regression function. The experiment demonstrates that the feature extracted by the model is more abstract and has richer structural and semantic information via convolution,pooling and activation transformation repeatedly. And so,it has better performance for classification and recognition than other same models.

作者汤鹏杰谭云兰李金忠谭彬

机构地区井冈山大学数理学院井冈山大学电子与信息工程学院

出处《井冈山大学学报（自然科学版）》 2015年第5期53-59,共7页 Journal of Jinggangshan University (Natural Science)

基金江西省教育厅科技计划项目(GJJ14561) 井冈山大学科研基金项目(JZ14012)

关键词图像分类识别双流混合 CNN image classification recognition two stream mixed transformation CNN

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献19

1Ojala T, Pietikainen M, Harwood D. A comparative study of texture measures with classification based on feature distributions[C]. Pattern Recognition. 1996:51- 59.
2Dalai N, Triggs B. Histograms of oriented gradients for human detection[C]. IEEE Computer Society Conference on Computer Vision and Pattern Recognition(CVPR), 2005:886-893.
3Lowe D C~ Distinctive Image Features from Scale-lnvariant Keypoints[J]. International Journal of Computer Vision, 2004, 60(2):91-110.
4Grauman K, Darrell T. The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features[C]. Proceedings of IEEE Computer Society, 2005:1458-1465.
5Perronnin F, Shnchez J, Mensink T. Improving the Fisher Kernel for Large-Scale Image Classification[J]. Lecture Notes in Computer Science, 2010, 6314:143-156.
6Lazebnik S. et al. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories[C]. IEEE Computer Society Conference on Computer Vision and Pattern Recognition(CVPR), 2006:2169-2178.
7Krizhevsky A, Sutskever I, Hinton G E. ImageNet Classification with Deep Convolutional Neural Networks[C]. Advances in Neural Information Processing Systems(NIPS), 2012:2012.
8Zeiler M D, Fergus R. Visualizing and Understanding Convolutional Networks[J]. Lecture Notes in Computer Science, 2014:818-833.
9Schmidhuber J. Deep Learning in Neural Networks: An Overview[J]. Neural Networks the Official Journal of the International Neural Network Society, 2014, 61:85-117.
10LeCun Y, Bottou L, Bengio Y, et al. Gradient-based learning applied to document recognition[C]. Proceedings of the IEEE, 1998, 86(11):2278 - 2324.

同被引文献7

1彭勃宇,王崴,周诚,刘晓卫.面向增强现实的SUSAN-SURF快速匹配算法[J].计算机应用研究,2015,32(8):2538-2542. 被引量：12
2李朋,余谅.一种改进的小波阈值去噪方法[J].现代计算机,2016,22(5):72-75. 被引量：7
3陈剑虹,韩小珍.结合FAST-SURF和改进k-d树最近邻查找的图像配准[J].西安理工大学学报,2016,32(2):213-217. 被引量：17
4张二磊,马骏,王晓田.一种改进的SURF彩色遥感图像配准算法[J].液晶与显示,2017,32(2):144-152. 被引量：6
5刘冰,刘雪梅.基于连续小波阈值去噪算法的目标检测研究[J].现代计算机（中旬刊）,2017(5):64-68. 被引量：4
6吴鹏,徐洪玲,宋文龙.结合小波金字塔的快速NCC图像匹配算法[J].哈尔滨工程大学学报,2017,38(5):791-796. 被引量：28
7张焕龙,张秀娇,贺振东,张建伟.基于布谷鸟搜索的图像匹配方法研究[J].郑州大学学报（理学版）,2017,49(4):51-56. 被引量：13

引证文献2

1黄异嵘,李汶隆,刘川杰.基于双流网络的视频图像去噪算法[J].中国新技术新产品,2020(16):16-17. 被引量：1
2宋大伟,马凤娟,赵华.基于相似度模型耦合角度制约规则的图像匹配算法[J].井冈山大学学报（自然科学版）,2019,40(2):39-44.

二级引证文献1

1李汶隆,刘念林,柳春青.基于时空预测模型的视频码率控制算法[J].电视技术,2021,45(6):115-118. 被引量：1

1郭宏,郝俊杰.一种基于离散余弦和小波混合变换的数字水印算法[J].现代计算机,2008,14(8):62-64.
2王晶,王昊.融合局部特征和全局特征的视频拷贝检测[J].清华大学学报（自然科学版）,2016,56(3):269-272. 被引量：1
3吕承民,马宇峰.一种基于混合变换域的双重扩频盲水印算法[J].计算机工程与科学,2010(1):83-86.
4许文丽,李磊,王育民.抗噪声、几何失真和JPEG压缩攻击的鲁棒数字水印方案[J].电子与信息学报,2008,30(4):933-936. 被引量：15
5孟庆波.浅议计算机病毒的分类及识别方法[J].吉林省教育学院学报（下旬）,2009,25(5):55-56.
6余坤勇,刘健,缪丽娟,张江河.基于地物特征的遥感图像融合效果[J].福建农林大学学报（自然科学版）,2010,39(2):196-201. 被引量：1
7毛启容,赵小蕾,白李娟,王治锋,詹永照.结合过完备字典与PCA的小样本语音情感识别方法[J].江苏大学学报（自然科学版）,2013,34(1):60-65. 被引量：5
8赵专政,李云翔.聚类加权和CS-LSSVM的文本分类[J].计算机工程与应用,2013,49(16):124-128. 被引量：4
9孙翟.光电传感器信息融合技术的实际应用研究[J].科技传播,2013,5(19):209-209. 被引量：1
10姜卓,解成俊.三维混合变换彩色图像无损信息隐藏算法研究[J].计算机工程与设计,2014,35(7):2305-2311. 被引量：1

井冈山大学学报（自然科学版）

2015年第5期

浏览历史

内容加载中请稍等...

基于双流混合变换CNN特征的图像分类与识别被引量：2

参考文献19

同被引文献7

引证文献2

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于双流混合变换CNN特征的图像分类与识别 被引量：2

参考文献19

同被引文献7

引证文献2

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于双流混合变换CNN特征的图像分类与识别被引量：2