期刊文献+

面向目标检测与姿态估计的联合文法模型 被引量:7

A Combined Grammar for Object Detection and Pose Estimation
下载PDF
导出
摘要 针对部件模型在描述目标上的局限性,提出了一种判别化的视觉文法模型.该模型利用文法的可描述性和可扩展性能够对通用目标类别进行描述并且处理一般化的识别任务.根据目标检测和姿态估计的特点将文法模型实例化为两个单任务文法,同时对比了文法的异同.通过分析检测与姿态估计在应用背景和研究方法上的互补性,进一步提出了一种联合识别文法.联合文法由一组判别符号合并两个单任务文法,其特点是实现了并行化的目标检测与姿态估计,而且能同时提升检测和估计性能.鉴于参数训练所面临的弱监督环境,引入带隐变量的结构化学习框架优化文法参数.实验分别在单任务和多任务场景下对比了部件模型与提出的联合文法.实验结果说明联合文法在性能上优于当前主流的检测模型和姿态估计模型. Consider that the limitation of part-based models on the description of object categories,we propose a discriminative grammar model.The model,which has powerful description ability and extensibility,can represent general objects and deal with common recognition tasks.We define two instantiations of the grammar model for object detection and pose estimation and then discuss the differences and similarities between them.Viewed from application background and current research methods,there is great complementarity in object detection and pose estimation.This paper further introduces a novel grammar that is constructed by combining two single-task grammars using a set of discriminative symbols.There are two characteristics for the combined grammar.First,it supports joint detection and pose estimation.Second,it can improve the detection performance of both tasks.For learning grammar parameters with weak supervision we utilize a structural SVM with latent variables.We compare the combined grammar with part-based models in single-task scenario and multiple-task scenario.The evaluated results demonstrate that the proposed grammar outperforms the state-of-the-art detection models and pose estimation models.
出处 《计算机学报》 EI CSCD 北大核心 2014年第10期2206-2217,共12页 Chinese Journal of Computers
基金 国家自然科学基金(60873047 61173036)资助
关键词 视觉文法 部件模型 目标检测 姿态估计 基于隐变量的结构化SVM 计算机视觉 visual grammar part-based models object detection pose estimation structural SVM with latent variables computer vision
  • 相关文献

参考文献23

  • 1Felzenszwalb P F,Huttenlocher D P.Pictorial structures for object recognition.International Journal of Computer Vision,2005,61(1):55-79.
  • 2Felzenszwalb P F,Girshick R B,McAllester D,et al.Object detection with discriminatively trained part-based models.IEEE Transactions on Pattern Analysis and Machine Intelli gence,2010,32(9):1627-1645.
  • 3Zh S C,Mumford D.A stochastic grammar of images.Foundations and Trends in Computer Graphics and Vision,Boston:Now Publishers Inc.,2006.
  • 4Purdy E.Grammatical methods in computer vision[Ph.D.dissertation].The University of Chicago,Chicago,2013.
  • 5Girshick R B,Felzenszwalb P F,Mcallester D A.Object detection with grammar models//Proceedings of the 25th Annual Conference on Neural Information Processing Systems.Granada,Spain,2011:442-450.
  • 6Xi Song,Wu Tian Fu,Jia Yun De,et al.Discriminatively trained andor tree models for object detection//Proceedings of the 26th IEEE Conference on Computer Vision and Pattern Recognition.Portland,USA,2013:3278-3285.
  • 7Joo S W,Chellappa R.Attribute grammar-based event recognition and anomaly detection//Proceedings of the 19th IEEE Conference on Computer Vision and Pattern Recognition Workshop.New York,USA,2006:107-107.
  • 8Lin Liang,Wu Tian Fu,Porway J,Xu Zi-Jian.A Stochastic Graph Grammar for Compositional Object Representation and Recognition.Pattern Recognition,2009,42 (7):1297-1307.
  • 9Lin Liang,Wang Xiao-Long,Yang Wei,Lai Jian-Huang.Learning contour-fragment-based shape model with AndOr tree representation//Proceedings of the 25th IEEE Conference on Computer Vision and Pattern Recognition.Providence,USA,2012:135-142.
  • 10Wang Xiao-Long,Lin Liang.Dynamical and-or graph learning for object shape modeling and detection//Proceedings of the Advances in Neural Information Processing Systems.Lake Tahoe,USA,2012:242-250.

二级参考文献26

  • 1Felzenszwalb P F, Girshick R B, McAllester D, et al. Object detection with discriminatively trained part-based models[J]. IEEE Trans on Pattern Analysis and Machine Intelligence, 2010, 32(9): 1627-1645.
  • 2Pirsiavash H, Ramanan D. Steer able part models[C] IIproc of the 25th IEEE Conf on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2012: 3226-3233.
  • 3Ott P. Everingham M. Shared parts for deformable part?based models[C] //Proc of the 24th IEEE Conf on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2011: 1513-1520.
  • 4Schnitzspan P. Roth S. Schiele B. Automatic discovery of meaningful object parts with latent CRFs[CJ IIProc of the 23rd IEEE Conf on Computer Vision and Pattern Recognition. Piscataway. NJ: IEEE. 2010: 121-128.
  • 5Mottaghi R. Augmenting deformable part models with irregular-shaped object patches[CJ IIProc of the 25th IEEE Conf on Computer Vision and Pattern Recognition. Piscataway. NJ: IEEE. 2012: 3116-3123.
  • 6Felzenszwalb P F. Huttenlocher D P. Pictorial structures for object recognition[J].Journal of Computer Vision. 2005. 61 (1): 55-79.
  • 7Azizpour H. Laptev 1. Object detection using strongly?supervised deformable part models[CJ IIProc of the 12th European Conf on Computer Vision. Berlin: Springer. 2012: 836-849.
  • 8Branson S. Perona P. Belongie S. Strong supervision from weak annotation: Interactive training of deformable part models[CJ IIProc of the 24th IEEE Int Conf On Computer Vision. Piscataway. NJ: IEEE. 2011: 1832-1839.
  • 9Lin Z. Hua G. Davis L S. Multiple instance feature for robust part-based object detection[CJ IIProc of the 22nd IEEE Conf on Computer Vision and Pattern Recognition. Piscataway. NJ: IEEE. 2009: 405-412.
  • 10Parizi S N. OberlinJ G. Felzenszwalb P F. Reconfigurable models for scene recognition[CJ IIProc of the 25th IEEE Conf on Computer Vision and Pattern Recognition. Piscataway. NJ: IEEE. 2012: 2775-2782.

共引文献3

同被引文献52

  • 1崔智高,李艾华,冯国彦.采用多组单应约束和马尔可夫随机场的运动目标检测算法[J].计算机辅助设计与图形学学报,2015,27(4):621-632. 被引量:6
  • 2Mukherjee D,Wu Q M J,Nguyen T M.Multiresolution Based Gaussian Mixture Model for Background Suppression[J].IEEE Transactions on Image Processing,2013,22(12):5022-5035.
  • 3Maddalena L,Petrosino A.A Self-organizing Approach to Background Subtraction for Visual Surveillance Applications[J].IEEE Transactions on Image Processing,2008,17(7):1168-1177.
  • 4Marco P,Andrea V,Jordi G,et al.A Coarse-to-fine Approach for Fast Deformable Object Detection[J].Pattern Recognition,2015,48(5):1844-1853.
  • 5Xiao Jinwen,Wei Hui.Scale-invariant Contour Segment Context in Object Detection[J].Image and Vision Computing,2014,32(12):1055-1066.
  • 6Ashish G,Ajoy M,Susmita G.Moving Object Detection Using Markov Random Field and Distributed Differential Evolution[J].Applied Soft Computing,2014,15(2):121-136.
  • 7Prasad R,Murthy C R,Rao B D.Joint Approximately Sparse Channel Estimation and Data Detection in OFDM Systems Using Sparse Bayesian Learning[J].IEEE Transactions on Signal Processing,2014,62(14):3591-3603.
  • 8丁莹,李文辉,范静涛,杨华民.基于Choquet模糊积分的运动目标检测算法[J].电子学报,2010,38(2):263-268. 被引量:13
  • 9甘明刚,陈杰,刘劲,王亚楠.一种基于三帧差分和边缘信息的运动目标检测方法[J].电子与信息学报,2010,32(4):894-897. 被引量:74
  • 10陈明生,梁光明,孙即祥,刘东华,赵键.复杂背景下H.264压缩域运动目标检测算法[J].通信学报,2011,32(3):91-97. 被引量:3

引证文献7

二级引证文献29

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部