融合显著信息的层次特征学习图像分类被引量：15

Image Classification Using Hierarchical Feature Learning Method Combined with Image Saliency

下载PDF

导出

摘要高效的图像特征表示是计算机视觉的基础.基于图像的视觉显著性机制及深度学习模型的思想,提出一种融合图像显著性的层次稀疏特征表示用于图像分类.这种层次特征学习每一层都由3个部分组成:稀疏编码、显著性最大值汇聚(saliency max pooling)和对比度归一化.通过在图像层次稀疏表示中引入图像显著信息,加强了图像特征的语义信息,得到图像显著特征表示.相比于手工指定特征,该模型采用无监督数据驱动的方式直接从图像中学习到有效的图像特征描述.最后采用支持向量机(support vector machine,SVM)分类器进行监督学习,实现对图像进行分类.在2个常用的标准图像数据集(Caltech 101和Caltech 256)上进行的实验结果表明,结合图像显著性信息的层次特征表示,相比于基于局部特征的单层稀疏表示在分类性能上有了显著提升. Efficient feature representations for images are essential in many computer vision tasks.In this paper,a hierarchical feature representation combined with image saliency is proposed based on the theory of visual saliency and deep learning,which builds a feature hierarchy layer-by-layer.Each feature learning layer is composed of three parts：sparse coding,saliency max pooling and contrast normalization.To speed up the sparse coding process,we propose batch orthogonal matching pursuit which differs from the traditional method.The salient information is introduced into the image sparse representation,which compresses the feature representation and strengthens the semantic information of the feature representation.Simultaneously,contrast normalization effectively reduces the impact of local variations in illumination and foreground-background contrast,and enhances the robustness of the feature representation.Instead of using hand-crafted descriptors,our model learns an effective image representation directly from images in an unsupervised data-driven manner.The final image classification is implemented with a linear SVM classifier using the learned image representation.We compare our method with many state-of-the-art algorithms including convolutional deep belief networks,SIFT based single layer or multi-layer sparse coding methods,and some kernel based feature learning approaches.The experimental results on two commonly used benchmark datasets Caltech 101 and Caltech 256 show that our method consistently and significantly improves the performance.

作者祝军赵杰煜董振宇

机构地区宁波大学信息科学与工程学院

出处《计算机研究与发展》 EI CSCD 北大核心 2014年第9期1919-1928,共10页 Journal of Computer Research and Development

基金国家自然科学基金项目(61175026) 科技部国际科技合作专项(2013DFG12810) 国家"十二五"科技支撑计划基金项目(2012BAF12B11) 浙江省国际科技合作专项(2013C24027)

关键词特征学习层次稀疏表示图像显著性图像分类显著性最大值汇聚 feature learning hierarchical sparse coding image saliency image classification saliency max pooling

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献31

1Jarrett K, Kavukcuoglu K, Ranzato M, et al. What is the best multi stage architecture for object recognition? [C] // Proc of the 12th IEEE Int Conf on Computer Vision. Piscataway, NJ: IEEE, 2009:2146-2153.
2Achanta R, Hemami S, Estrada F, et al. Frequency-tuned salient region detection [C]//Proc of the 22nd IEEE Int Conf on Computer Vision and Pattern Recognition. Piseataway, NJ: IEEE, 2009: 1597-1604.
3韩冰,杨辰,高新波.融合显著信息的LDA极光图像分类[J].软件学报,2013,24(11):2758-2766. 被引量：20
4Gemert J C, Geusebroek J M, Veenman C J, et al. Kernel codebooks for scene categorization [C] //Proc of the 10th European Conf on Computer Vision. New York: ACM, 2008:696-709.
5Csurka G, Dance C R, Fan Lixin, et al. Visual categorization with bags of keypoints [C] //Proc of the 8th European Conf on Computer Vision. Berlin: Springer, 2004: 1-22.
6Lazebnik S, Schmid C, Ponce J. Beyond bags of featuresz Spatial pyramid matching for recognizing natural scene categories [C] //Proc of the 19th Computer Society Conf on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2006:2169-2178.
7Yang Jianchao, Yu Kai, Gong Yihong, etal. Linear spatial pyramid matching using sparse coding for image classification [C] //Proc of the 22nd Computer Society Conf on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2009:1794-1801.
8Wang Jinjun, Yang Jianchao, Yu Kai, et al. Locality constrained linear coding for image classification [C] //Proc of the 23rd IEEE Conf on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2010:3360-3367.
9Yu Kai, Lin Yuanqing, Lafferty J. Learning image representations from the pixel level via hierarchical sparse coding [C]//Proc of the 24th IEEE Conf on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2011:1713-1720.
10余凯,贾磊,陈雨强,徐伟.深度学习的昨天、今天和明天[J].计算机研究与发展,2013,50(9):1799-1804. 被引量：610

二级参考文献16

1MarkoffJ. How many computers to identify a cat?[NJ The New York Times, 2012-06-25.
2MarkoffJ. Scientists see promise in deep-learning programs[NJ. The New York Times, 2012-11-23.
3李彦宏.2012百度年会主题报告:相信技术的力量[R].北京:百度,2013.
410 Breakthrough Technologies 2013[N]. MIT Technology Review, 2013-04-23.
5Rumelhart D, Hinton G, Williams R. Learning representations by back-propagating errors[J]. Nature. 1986, 323(6088): 533-536.
6Hinton G, Salakhutdinov R. Reducing the dimensionality of data with neural networks[J]. Science. 2006, 313(504). Doi: 10. 1l26/science. 1127647.
7Dahl G. Yu Dong, Deng u, et a1. Context-dependent pre?trained deep neural networks for large vocabulary speech recognition[J]. IEEE Trans on Audio, Speech, and Language Processing. 2012, 20 (1): 30-42.
8Jaitly N. Nguyen P, Nguyen A, et a1. Application of pretrained deep neural networks to large vocabulary speech recognition[CJ //Proc of Interspeech , Grenoble, France: International Speech Communication Association, 2012.
9LeCun y, Boser B, DenkerJ S. et a1. Backpropagation applied to handwritten zip code recognition[J]. Neural Computation, 1989, I: 541-551.
10Large Scale Visual Recognition Challenge 2012 (ILSVRC2012)[OLJ.[2013-08-01J. http://www. image?net.org/challenges/LSVRC/2012/.

共引文献628

1贾彦哲.论人工智能研发者过失犯的注意义务[J].华中师范大学研究生学报,2020(2):40-46.
2毕思文,Henri Jaffrès,Chandra Sekhar Roychoudhuri.量子遥感发展新态势——世界首次量子遥感国际会议评述[J].全球变化数据学报（中英文）,2019,3(4):317-325. 被引量：1
3范敏,胥小波,聂小明.基于字符级扩张卷积网络的Web攻击检测方法[J].计算机应用研究,2020,37(S02):234-237. 被引量：4
4孟威,尉永清,刘文锋.基于CRT机制混合神经网络的特定目标情感分析[J].计算机应用研究,2020,37(2):360-364. 被引量：1
5华夏,王新晴,马昭烨,王东,邵发明.基于递归神经网络的视频多目标检测技术[J].计算机应用研究,2020,37(2):615-620. 被引量：8
6刘树霄,衣立,张苏平,时晓曚,薛允传.基于全卷积神经网络方法的日间黄海海雾卫星反演研究[J].海洋湖沼通报,2019(6):13-22. 被引量：11
7王海涛.自主无人系统——概念、体系架构和设计要素[J].电信快报,2021(5):6-9.
8郭龙银,扎西多吉,尚慧杰,旦增.基于LSTM的藏语语音识别[J].电脑知识与技术,2020,0(4):154-155. 被引量：2
9李佳意,董万鹏,任梦,张吉超,弓成美琪.新时代计算机智能制造模式的研究进展[J].智能计算机与应用,2021,11(3):98-105. 被引量：1
10唐公田.杏砧杏快速育苗新技术[J].科技致富向导,2000(4):26-26.

同被引文献103

1纪传俊,刘作涛,产文,周向东.一个基于语义上下文建模的图像自动标注系统[J].计算机研究与发展,2011,48(S3):441-445. 被引量：2
2邢晓芬,徐向民,黄晓泓,黄建敬.基于内容的医学图像分类研究[J].科学技术与工程,2007,7(1):85-90. 被引量：9
3董立岩,苑森淼,刘光远,贾书洪.基于贝叶斯分类器的图像分类[J].吉林大学学报（理学版）,2007,45(2):249-253. 被引量：30
4陈丹,李京华,黄根全,许家栋.基于证据理论的战场被动声多目标识别研究[J].系统仿真学报,2007,19(6):1323-1325. 被引量：3
5Changren Zhu, Hui Zhou, Runsheng Wang, et al. A novel hierarchical method of ship detection from space borne optical image based on shape and texture features [J]. IEEE Transactions on Geosciences and Remote Sensing, 2010,48 (9) : 3446-3456.
6张铮,王艳萍,等.数字图像处理与机器视觉-VisualC++与Matlab实现[M].北京:人民邮电出版社,2012:178-180.
7J. X. Sun. Modern pattern recognition[M]. Second e- dition. Beijing: Higher Education Publishing Company, 2008 : 252-259.
8Anagnostopoulos G C. SVM-based target recognition from synthetic aperture radar images using target re- gion outline descriptors[J]. Nonlinear Analysis,2009, 71(12) : e2934-e2939.
9Hwang W S, Weng J Y. Hierarchical discriminant re- gression[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2000,22(11) : 1277-1293.
10Li Feifei, Perona P. A Bayesian hierarchical model for learning natural scene categories[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recog- nition(CVPR), Washington, USA, 2005 : 524-531.

引证文献15

1宿勇.基于图像处理的舰船目标识别研究[J].计算机与数字工程,2015,43(7):1207-1211. 被引量：4
2张旭东,吕言言,缪永伟,郝鹏翼,陈佳舟.结合区域协方差分析的图像显著性检测[J].中国图象图形学报,2016,21(5):605-615. 被引量：12
3赵永威,周苑,李弼程.基于词典优化与空间一致性度量的目标检索[J].计算机研究与发展,2016,53(5):1043-1052. 被引量：1
4邓江洪,赵领.多特征筛选与支持向量机相融合的图像分类模型[J].吉林大学学报（理学版）,2016,54(4):862-866. 被引量：6
5黄金国.基于云计算的图像分类算法[J].现代电子技术,2017,40(5):63-65.
6王慧,宋淑蕴.基于KCPA提取特征和RVM的图像分类[J].吉林大学学报（理学版）,2017,55(2):357-362. 被引量：4
7刘尚旺,胡剑兰,崔艳萌.改进HFT模型及其在图像分类中的应用[J].小型微型计算机系统,2017,38(5):1111-1115. 被引量：1
8张露,王华彬,陶亮,周健.基于分类距离分数的自适应多模态生物特征融合[J].计算机研究与发展,2018,55(1):151-162. 被引量：7
9王林,张晓锋.卷积深度置信网络的场景文本检测[J].计算机系统应用,2018,27(6):231-235. 被引量：2
10吴雪.粒子群优化算法选择特征的运动图像分类[J].现代电子技术,2017,40(17):47-50.

二级引证文献57

1张宁,姜春字,林嘉昊.海上舰船目标识别研究[J].中国水运（下半月）,2021,21(1):1-4.
2马燕,余海军,钟发生,刘丰林.基于残差编解码网络的CT图像金属伪影校正[J].仪器仪表学报,2020,41(8):160-169. 被引量：17
3鲍光海,林善银,徐林森.基于改进型卷积网络的汽车高度调节器缺陷检测方法[J].仪器仪表学报,2020,41(2):157-165. 被引量：13
4单凯强,桑海峰.基于全景视频下标记点特征的停车位检测技术研究[J].电子测量与仪器学报,2022,36(2):203-210. 被引量：4
5赖欣,王储,陈航.低照度下人脸检测MSRCR光频分段滤波增强算法[J].电子测量与仪器学报,2022,36(2):96-106. 被引量：5
6王夏霖,阚秀,孙维周,曹乐,范艺璇.焦炭显微光学组织自动检测与提取方法研究[J].电子测量与仪器学报,2022,36(2):32-39. 被引量：1
7臧苏莹.MVI分块标识颜色特征快速检索仿真[J].计算机仿真,2019,36(1):458-461. 被引量：1
8郭春梅,陈恳,李萌,李斐.融合显著度时空上下文的超像素跟踪算法[J].模式识别与人工智能,2017,30(8):728-739. 被引量：1
9包姣.多维视觉图像敏感区域智能标记方法仿真[J].计算机仿真,2017,34(11):324-327.
10崔玲玲,许金兰,徐岗,吴卿.融合双特征图信息的图像显著性检测方法[J].中国图象图形学报,2018,23(4):583-594. 被引量：14

1杨晓敏,严斌宇,李康丽,苏冰山.基于金字塔模型的图像分类[J].计算机与数字工程,2015,43(4):704-706.
2华骅,杨晓敏,严斌宇.基于视觉显著度及金字塔模型的图像分类[J].数字技术与应用,2015,33(3):51-53.
3胡湘萍.基于多核学习的多特征融合图像分类研究[J].计算机工程与应用,2016,52(5):194-198. 被引量：9
4张国怀.我们测试数码相机的规则考验数码相机的实力[J].电子测试,2002,15(12):60-61.
5优派VX2268wm——120Hz刷新率、SRS音箱[J].家庭电子,2009(10):45-45.
6徐新文,李国辉,甘亚莉.基于MPEG-7框架的交互式图像层次化描述工具(IIHDT)的设计与实现[J].计算机工程与科学,2004,26(12):30-33.
7Zhou Wujie,Jiang Gangyi,Yu Mei.NEW VISUAL PERCEPTUAL POOLING STRATEGY FOR IMAGE QUALITY ASSESSMENT[J].Journal of Electronics(China),2012,29(3):254-261. 被引量：2
8程东阳,蒋兴浩,孙锬锋.基于稀疏编码和多核学习的图像分类算法[J].上海交通大学学报,2012,46(11):1789-1793. 被引量：6
9李博,张凌.基于视觉显著性的监控视频动态目标跟踪[J].信息技术,2014,38(4):60-65. 被引量：2
10杨长春,王俊,袁敏,雷晨阳.基于weight-pooling词向量的上下文广告推荐算法[J].计算机应用与软件,2016,33(12):224-229. 被引量：1

计算机研究与发展

2014年第9期

浏览历史

内容加载中请稍等...

融合显著信息的层次特征学习图像分类被引量：15

参考文献31

二级参考文献16

共引文献628

同被引文献103

引证文献15

二级引证文献57

相关作者

相关机构

相关主题

浏览历史

融合显著信息的层次特征学习图像分类 被引量：15

参考文献31

二级参考文献16

共引文献628

同被引文献103

引证文献15

二级引证文献57

相关作者

相关机构

相关主题

浏览历史

融合显著信息的层次特征学习图像分类被引量：15