期刊文献+

图像物体分类与检测算法综述 被引量:188

A Review on Image Object Classification and Detection
下载PDF
导出
摘要 图像物体分类与检测是计算机视觉研究中的两个重要的基本问题,也是图像分割、物体跟踪、行为分析等其他高层视觉任务的基础.该文从物体分类与检测问题的基本定义出发,首先从实例、类别、语义三个层次对物体分类与检测研究中存在的困难与挑战进行了阐述.接下来,该文以物体检测和分类方面的典型数据库和国际视觉算法竞赛PASCAL VOC竞赛为主线对近年来物体分类与检测的发展脉络进行了梳理与总结,指出表达学习和结构学习在于物体分类与检测中占有重要的地位.最后文中对物体分类与检测的发展方向进行了思考和讨论,探讨了图像物体识别中下一步研究可能的方向. Image object classification and detection are two of the most essential problems in computer vision.They are the basis of many other complex vision problems,such as segmentation,tracking,and action analysis.In this paper,we try to give an analysis of object classification and detection algorithms based on PASCAL VOC challenge,which is generally acknowledged as a public evaluation for object recognition.We first discuss the importance of object classification and detection; next we summarize the difficulties and challenges in the development of basic object recognition.Then we review the yearly achievements in the study of object classification and detection.Finally we discuss the development directions of object classification and detection,from the view of representations learning and structure learning.
出处 《计算机学报》 EI CSCD 北大核心 2014年第6期1225-1240,共16页 Chinese Journal of Computers
基金 国家"九七三"重点基础研究发展规划项目基金(2012CB316302) 国家自然科学基金(61322209) 国家科技支撑计划(2012BAH07B01)资助~~
关键词 物体分类 物体检测 计算机视觉 特征表达 结构学习 object classification object detection computer vision feature representations structural learning
  • 相关文献

参考文献67

  • 1Marr D.Vision:A Computational Investigation Into the Human Representation and Processing of Visual Information.Cambridge:The MIT Press,2010.
  • 2LeCun Y,Bottou L,Bengio Y,Haffner P.Gradient-based learning applied to document recognition.Proceedings of the IEEE,1998,86(11):2278-2324.
  • 3Ferrari V,Jurie F,Schmid C.From images to shape models for object detection.International Journal of Computer Vision,2009,87(3):284-303.
  • 4Latecki L J,Lakamper R,Eckhardt U.Shape descriptors for non rigid shapes with a single closed contour//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Hilton Head,USA,2000,1:424-429.
  • 5Krizhevsky A.Learning Multiple Layers of Features from Tiny Images[M.S.dissertation].University of Toronto,2009.
  • 6Torralba A,Fergus R,Freeman W T.80 million tiny images:A large dataset for non-parametric object and scene recognition.IEEE Transactions on Pattern Analysis and Machine Intelligence,2008,30(11):1958-1970.
  • 7Li FebFei,Fergus R,Perona P.Learning generative visual models from few training examples:An incremental Bayesian approach tested on 101 object categories//Proceedings of the Computer Vision and Pattern Recognition (CVPR),Workshop on Generative-Model Based Vision.Washington,USA,2004:178.
  • 8Griffin G,Holub A D,Perona P.The Caltech 256.Caltech Technical Report CNS-TR-2007-001.
  • 9Lazebnik S,Schmid C,Ponce J.Beyond bags of features:Spatial pyramid matching for recognizing natural scene categories//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).New York,USA,2006:2169-2178.
  • 10Li Fei-Fei,Perona P.A Bayesian hierarchical model for learning natural scene categories//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Washington,USA,2005:524-531.

二级参考文献55

  • 1黄凯奇,吴镇扬,王桥.色彩恒常性在彩色图像增强中的应用[J].应用科学学报,2004,22(3):322-326. 被引量:10
  • 2汪云九,齐翔林.初级视觉的Gabor函数模型的研究进展[J].生物物理学报,1993,9(3):508-513. 被引量:7
  • 3Borji A, Itti L. State-of-the-Art in Visual Attention Modeling. IEEE Trans on Pattern Analysis and Machine Intelligence, 2013, 35 ( 1 ) : 185-207.
  • 4TenenbaumJ B, Griffiths T L, Kemp C. Theory-Based Bayesian Models of Inductive Learning and Reasoning. Trends in Cognitive Sciences, 2006,10(7): 309-318.
  • 5Roberts L. Machine Perception of Three-Dimensional Solids. Ph. 0 Dissertation. Cambridge, USA: Massachusetts Institute of Technolo?gy, 1963.
  • 6Marr D. Vision: A Computational Investigation into the Human Re?presentation and Processing of Visual Information. San Francisco, USA: Freeman, 1982.
  • 7Aloimonos Y. What I Have Learned. CVGIP: Image Understand?ing, 1994, 60(1) : 74-85.
  • 8Neisser U. Cognition and Reality. San Francisco, USA: Freeman, 1976.
  • 9Wandell B A. Foundations of Vision. Sunderland, USA: Sinauer Associates, 1995.
  • 10Gonzalez R C, Woods R E. Digital Image Processing. 2nd Edition. Upper Saddle River, USA: Prentice Hall, 2002.

共引文献11

同被引文献1612

引证文献188

二级引证文献2158

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部