面向目标检测与姿态估计的联合文法模型被引量：7

A Combined Grammar for Object Detection and Pose Estimation

下载PDF

导出

摘要针对部件模型在描述目标上的局限性,提出了一种判别化的视觉文法模型.该模型利用文法的可描述性和可扩展性能够对通用目标类别进行描述并且处理一般化的识别任务.根据目标检测和姿态估计的特点将文法模型实例化为两个单任务文法,同时对比了文法的异同.通过分析检测与姿态估计在应用背景和研究方法上的互补性,进一步提出了一种联合识别文法.联合文法由一组判别符号合并两个单任务文法,其特点是实现了并行化的目标检测与姿态估计,而且能同时提升检测和估计性能.鉴于参数训练所面临的弱监督环境,引入带隐变量的结构化学习框架优化文法参数.实验分别在单任务和多任务场景下对比了部件模型与提出的联合文法.实验结果说明联合文法在性能上优于当前主流的检测模型和姿态估计模型. Consider that the limitation of part-based models on the description of object categories,we propose a discriminative grammar model.The model,which has powerful description ability and extensibility,can represent general objects and deal with common recognition tasks.We define two instantiations of the grammar model for object detection and pose estimation and then discuss the differences and similarities between them.Viewed from application background and current research methods,there is great complementarity in object detection and pose estimation.This paper further introduces a novel grammar that is constructed by combining two single-task grammars using a set of discriminative symbols.There are two characteristics for the combined grammar.First,it supports joint detection and pose estimation.Second,it can improve the detection performance of both tasks.For learning grammar parameters with weak supervision we utilize a structural SVM with latent variables.We compare the combined grammar with part-based models in single-task scenario and multiple-task scenario.The evaluated results demonstrate that the proposed grammar outperforms the state-of-the-art detection models and pose estimation models.

作者陈耀东李仁发李实英黄鑫谢国琪

机构地区湖南大学信息科学与工程学院

出处《计算机学报》 EI CSCD 北大核心 2014年第10期2206-2217,共12页 Chinese Journal of Computers

基金国家自然科学基金(60873047 61173036)资助

关键词视觉文法部件模型目标检测姿态估计基于隐变量的结构化SVM 计算机视觉 visual grammar part-based models object detection pose estimation structural SVM with latent variables computer vision

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献23

1Felzenszwalb P F,Huttenlocher D P.Pictorial structures for object recognition.International Journal of Computer Vision,2005,61(1):55-79.
2Felzenszwalb P F,Girshick R B,McAllester D,et al.Object detection with discriminatively trained part-based models.IEEE Transactions on Pattern Analysis and Machine Intelli gence,2010,32(9):1627-1645.
3Zh S C,Mumford D.A stochastic grammar of images.Foundations and Trends in Computer Graphics and Vision,Boston:Now Publishers Inc.,2006.
4Purdy E.Grammatical methods in computer vision[Ph.D.dissertation].The University of Chicago,Chicago,2013.
5Girshick R B,Felzenszwalb P F,Mcallester D A.Object detection with grammar models//Proceedings of the 25th Annual Conference on Neural Information Processing Systems.Granada,Spain,2011:442-450.
6Xi Song,Wu Tian Fu,Jia Yun De,et al.Discriminatively trained andor tree models for object detection//Proceedings of the 26th IEEE Conference on Computer Vision and Pattern Recognition.Portland,USA,2013:3278-3285.
7Joo S W,Chellappa R.Attribute grammar-based event recognition and anomaly detection//Proceedings of the 19th IEEE Conference on Computer Vision and Pattern Recognition Workshop.New York,USA,2006:107-107.
8Lin Liang,Wu Tian Fu,Porway J,Xu Zi-Jian.A Stochastic Graph Grammar for Compositional Object Representation and Recognition.Pattern Recognition,2009,42 (7):1297-1307.
9Lin Liang,Wang Xiao-Long,Yang Wei,Lai Jian-Huang.Learning contour-fragment-based shape model with AndOr tree representation//Proceedings of the 25th IEEE Conference on Computer Vision and Pattern Recognition.Providence,USA,2012:135-142.
10Wang Xiao-Long,Lin Liang.Dynamical and-or graph learning for object shape modeling and detection//Proceedings of the Advances in Neural Information Processing Systems.Lake Tahoe,USA,2012:242-250.

二级参考文献26

1Felzenszwalb P F, Girshick R B, McAllester D, et al. Object detection with discriminatively trained part-based models[J]. IEEE Trans on Pattern Analysis and Machine Intelligence, 2010, 32(9): 1627-1645.
2Pirsiavash H, Ramanan D. Steer able part models[C] IIproc of the 25th IEEE Conf on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2012: 3226-3233.
3Ott P. Everingham M. Shared parts for deformable part?based models[C] //Proc of the 24th IEEE Conf on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2011: 1513-1520.
4Schnitzspan P. Roth S. Schiele B. Automatic discovery of meaningful object parts with latent CRFs[CJ IIProc of the 23rd IEEE Conf on Computer Vision and Pattern Recognition. Piscataway. NJ: IEEE. 2010: 121-128.
5Mottaghi R. Augmenting deformable part models with irregular-shaped object patches[CJ IIProc of the 25th IEEE Conf on Computer Vision and Pattern Recognition. Piscataway. NJ: IEEE. 2012: 3116-3123.
6Felzenszwalb P F. Huttenlocher D P. Pictorial structures for object recognition[J].Journal of Computer Vision. 2005. 61 (1): 55-79.
7Azizpour H. Laptev 1. Object detection using strongly?supervised deformable part models[CJ IIProc of the 12th European Conf on Computer Vision. Berlin: Springer. 2012: 836-849.
8Branson S. Perona P. Belongie S. Strong supervision from weak annotation: Interactive training of deformable part models[CJ IIProc of the 24th IEEE Int Conf On Computer Vision. Piscataway. NJ: IEEE. 2011: 1832-1839.
9Lin Z. Hua G. Davis L S. Multiple instance feature for robust part-based object detection[CJ IIProc of the 22nd IEEE Conf on Computer Vision and Pattern Recognition. Piscataway. NJ: IEEE. 2009: 405-412.
10Parizi S N. OberlinJ G. Felzenszwalb P F. Reconfigurable models for scene recognition[CJ IIProc of the 25th IEEE Conf on Computer Vision and Pattern Recognition. Piscataway. NJ: IEEE. 2012: 2775-2782.

共引文献3

1陈耀东,李仁发.一种层次化的联合识别模型[J].计算机研究与发展,2015,52(11):2431-2440. 被引量：1
2贾丽娟.融合SORM背景模型和DTCNN阈值模型的运动目标检测[J].计算机工程,2016,42(1):220-224. 被引量：3
3林灏昶,秦云川,蔡宇辉,李肯立,唐卓.基于目标检测的图形用户界面控件识别方法[J].南京大学学报（自然科学版）,2022,58(6):1012-1019. 被引量：3

同被引文献52

1崔智高,李艾华,冯国彦.采用多组单应约束和马尔可夫随机场的运动目标检测算法[J].计算机辅助设计与图形学学报,2015,27(4):621-632. 被引量：6
2Mukherjee D,Wu Q M J,Nguyen T M.Multiresolution Based Gaussian Mixture Model for Background Suppression[J].IEEE Transactions on Image Processing,2013,22(12):5022-5035.
3Maddalena L,Petrosino A.A Self-organizing Approach to Background Subtraction for Visual Surveillance Applications[J].IEEE Transactions on Image Processing,2008,17(7):1168-1177.
4Marco P,Andrea V,Jordi G,et al.A Coarse-to-fine Approach for Fast Deformable Object Detection[J].Pattern Recognition,2015,48(5):1844-1853.
5Xiao Jinwen,Wei Hui.Scale-invariant Contour Segment Context in Object Detection[J].Image and Vision Computing,2014,32(12):1055-1066.
6Ashish G,Ajoy M,Susmita G.Moving Object Detection Using Markov Random Field and Distributed Differential Evolution[J].Applied Soft Computing,2014,15(2):121-136.
7Prasad R,Murthy C R,Rao B D.Joint Approximately Sparse Channel Estimation and Data Detection in OFDM Systems Using Sparse Bayesian Learning[J].IEEE Transactions on Signal Processing,2014,62(14):3591-3603.
8丁莹,李文辉,范静涛,杨华民.基于Choquet模糊积分的运动目标检测算法[J].电子学报,2010,38(2):263-268. 被引量：13
9甘明刚,陈杰,刘劲,王亚楠.一种基于三帧差分和边缘信息的运动目标检测方法[J].电子与信息学报,2010,32(4):894-897. 被引量：74
10陈明生,梁光明,孙即祥,刘东华,赵键.复杂背景下H.264压缩域运动目标检测算法[J].通信学报,2011,32(3):91-97. 被引量：3

引证文献7

1贾丽娟.融合SORM背景模型和DTCNN阈值模型的运动目标检测[J].计算机工程,2016,42(1):220-224. 被引量：3
2易唐唐.基于时空与或图模型的视频人体动作识别方法[J].控制工程,2017,24(9):1792-1797. 被引量：6
3罗莎,夏国恩,朱新琰.改进Adaboost算法的人体步态识别方法[J].控制工程,2018,25(7):1312-1317. 被引量：11
4任文.基于姿态估计的运动辅助训练系统研究[J].电子设计工程,2019,27(18):149-152. 被引量：4
5牟丽莎,杨建,刘述木,彭莉娟.信息集理论结合SVM的步态识别[J].控制工程,2020,27(11):2038-2043. 被引量：1
6徐晓华,钱平,王一达,周昕悦,徐汉麟,徐李冰.面向电力系统的多粒度隐患检测方法[J].北京航空航天大学学报,2021,47(3):520-530.
7褚真,米庆,马伟,徐士彪,张晓鹏.部位级遮挡感知的人体姿态估计[J].计算机研究与发展,2022,59(12):2760-2769. 被引量：4

二级引证文献29

1文政颖,王旭辉,于海鹏.一种融合视觉不变矩参数表征的动态手势识别方法[J].智能计算机与应用,2021,11(12):7-11.
2牛瑞,王昱.用VC++6.0实现图像浏览器功能[J].电脑编程技巧与维护,2000(5):85-87.
3唐洪良,黄颖,黄淮,杨成顺,黄宵宁.改进的自适应高斯混合模型运动目标检测算法[J].现代电子技术,2017,40(11):65-67. 被引量：5
4张明媛,曹天卓,赵雪峰.基于ANN识别施工人员跌落险兆事故的研究[J].安全与环境学报,2018,18(5):1703-1710. 被引量：11
5栾庆磊,朱广,赵为松,汪方斌,毕晓华,薛海波.动态场景下运动目标检测方法研究[J].安徽建筑大学学报,2018,26(6):61-65. 被引量：1
6蔡冠蓝.柔性姿态估计和时空特征结合的乒乓球动作视频片段关键帧提取[J].科学技术与工程,2019,19(25):268-272. 被引量：5
7陆兴华,蔡韬.基于CNN的安防监控步态特征提取研究[J].计算机技术与发展,2019,29(11):123-127. 被引量：3
8耿君.基于轮廓图像空频域特征的舞蹈翻腾姿态识别模型[J].现代电子技术,2019,42(24):146-149. 被引量：1
9孙桂煌.基于机器学习的人体动作深度信息识别方法研究[J].佳木斯大学学报（自然科学版）,2020,38(1):37-40. 被引量：4
10孙桂煌.基于机器学习的人体动作深度信息识别方法研究[J].长春大学学报,2020,30(4):16-20. 被引量：2

1袁兴梅,杨明.一种面向不平衡数据的结构化SVM集成算法[J].南京师大学报（自然科学版）,2010,33(4):123-127. 被引量：4
2袁兴梅,杨明,杨杨.一种面向不平衡数据的结构化SVM集成分类器[J].模式识别与人工智能,2013,26(3):315-320. 被引量：21
3章东平,钱乐义.基于结构化学习的小群体主导者检测方法[J].中国计量大学学报,2017,28(1):57-62.
4孙洲伟,赵长林.提高应用程序交付的性能[J].网管员世界,2009(20):27-27.
5张淮峰,何祥健,吴强.通用目标检测算法研究进展与评述[J].云南民族大学学报（自然科学版）,2006,15(4):261-267. 被引量：1
6董新华,李瑞轩,周湾湾,王聪,薛正元,廖东杰.Hadoop系统性能优化与功能增强综述[J].计算机研究与发展,2013,50(S2):1-15. 被引量：69
7王文剑,王亚贝.基于结构化支持向量机的中文句法分析[J].山西大学学报（自然科学版）,2011,34(1):66-70. 被引量：2
8付兴,王冰,李健,刘庆龙.现代电力系统自动化技术[J].山东工业技术,2016(1):168-168. 被引量：1
9罗毅辉,熊曙初,王四春,范强.无监督环境下基于聚类集成的特征选择[J].微计算机信息,2008,24(9):265-267. 被引量：2
10程藜,吴谨,朱磊.基于结构标签学习的显著性目标检测[J].液晶与显示,2016,31(7):726-732. 被引量：2

计算机学报

2014年第10期

浏览历史

内容加载中请稍等...

面向目标检测与姿态估计的联合文法模型被引量：7

参考文献23

二级参考文献26

共引文献3

同被引文献52

引证文献7

二级引证文献29

相关作者

相关机构

相关主题

浏览历史

面向目标检测与姿态估计的联合文法模型 被引量：7

参考文献23

二级参考文献26

共引文献3

同被引文献52

引证文献7

二级引证文献29

相关作者

相关机构

相关主题

浏览历史

面向目标检测与姿态估计的联合文法模型被引量：7