期刊文献+

基于双流混合变换CNN特征的图像分类与识别 被引量:2

IMAGE CLASSIFICATION AND RECOGNITION BASED ON DEEP TWO STREAM MIXED CNN FEATURES
下载PDF
导出
摘要 具有表达能力及可辨别性更强的特征是图像分类与识别技术的关键。深度CNN特征经过多次中间非线性变换,特征鲁棒性更强,在图像分类与识别领域已取得重大进展。但传统的CNN模型只增加变换层次,下层变换依赖于上层输出结果,因此其中间特征冗余度较低,最终得到的特征向量信息丰富程度不够。本文提出一种基于双流混合变换的CNN模型——DTM-CNN。该模型首先使用不同大小的感受野卷积核提取图像不同的中间特征,然后在多次深度变换时,对中间特征进行混合流动,经过多次混合变换,最终得到1024维的特征向量,并使用Softmax回归函数对其分类。实验结果表明,该模型经过多次卷积、池化及激活变换,提取的特征更加抽象、语义及结构信息更加丰富,对图像具有更强的表达能力及辨别性,因此图像分类及识别性能优越。 It is very important for image classification and recognition that the feature is more discriminative and has power representation ability. The deep CNN feature is more robust than other features because of its more non-linear transformation,and great breakthrough has obtained in the field of image classification and recognition based on the CNN. However,in the traditional CNN model,there just increase the transformation layers,and the posterior layer relies on the prior layer. As a result,the intermediate feature has low redundancy,and there is no enough information in the feature. In this paper,we propose a novel CNN model based on two stream and mixed transform. In this model,the intermediate feature is extracted via using different convolution kernels firstly. And then,the mixed feature is generated and flows forward when the deep transform is executed. Finally,we get a 1024 D feature vector and classify it with the Softmax regression function. The experiment demonstrates that the feature extracted by the model is more abstract and has richer structural and semantic information via convolution,pooling and activation transformation repeatedly. And so,it has better performance for classification and recognition than other same models.
出处 《井冈山大学学报(自然科学版)》 2015年第5期53-59,共7页 Journal of Jinggangshan University (Natural Science)
基金 江西省教育厅科技计划项目(GJJ14561) 井冈山大学科研基金项目(JZ14012)
关键词 图像分类 识别 双流混合 CNN image classification recognition two stream mixed transformation CNN
  • 相关文献

参考文献19

  • 1Ojala T, Pietikainen M, Harwood D. A comparative study of texture measures with classification based on feature distributions[C]. Pattern Recognition. 1996:51- 59.
  • 2Dalai N, Triggs B. Histograms of oriented gradients for human detection[C]. IEEE Computer Society Conference on Computer Vision and Pattern Recognition(CVPR), 2005:886-893.
  • 3Lowe D C~ Distinctive Image Features from Scale-lnvariant Keypoints[J]. International Journal of Computer Vision, 2004, 60(2):91-110.
  • 4Grauman K, Darrell T. The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features[C]. Proceedings of IEEE Computer Society, 2005:1458-1465.
  • 5Perronnin F, Shnchez J, Mensink T. Improving the Fisher Kernel for Large-Scale Image Classification[J]. Lecture Notes in Computer Science, 2010, 6314:143-156.
  • 6Lazebnik S. et al. Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories[C]. IEEE Computer Society Conference on Computer Vision and Pattern Recognition(CVPR), 2006:2169-2178.
  • 7Krizhevsky A, Sutskever I, Hinton G E. ImageNet Classification with Deep Convolutional Neural Networks[C]. Advances in Neural Information Processing Systems(NIPS), 2012:2012.
  • 8Zeiler M D, Fergus R. Visualizing and Understanding Convolutional Networks[J]. Lecture Notes in Computer Science, 2014:818-833.
  • 9Schmidhuber J. Deep Learning in Neural Networks: An Overview[J]. Neural Networks the Official Journal of the International Neural Network Society, 2014, 61:85-117.
  • 10LeCun Y, Bottou L, Bengio Y, et al. Gradient-based learning applied to document recognition[C]. Proceedings of the IEEE, 1998, 86(11):2278 - 2324.

同被引文献7

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部