三维模板跟踪的基准合成数据集构建及算法评估被引量：4

A Synthetic Dataset and Performance Evaluation for 3D Template Tracking

下载PDF

导出

摘要三维模板跟踪旨在将预先构建的三维CAD模型与输入图像中的相应目标进行精确配准,在增强现实、机器人等领域具有重要的应用,也是计算机视觉领域的关键问题之一.近年来,三维模板跟踪的准确率和稳定性都得到了持续提升,但仅有少量的工作关注三维模板跟踪数据集的构建.随着深度学习的普及,各领域中大规模数据集的构建越来越被重视,为算法的训练、测试和评估奠定了基础,极大地推动了相关领域的发展.以往的三维模板跟踪数据集大多存在规模有限,画面不够自然、真实,多样性不足等问题.基于此,本文创建了一个大规模的基于真实感渲染的三维模板跟踪数据集(Render Dataset for Object Tracking,简称RDOT),其包含多种不同结构和材质的物体、复杂的运动模式,并且在场景、光照、噪声、运动模糊和遮挡等方面有丰富细致的设置,是目前三维模板跟踪领域最大的数据集,满足三维模板跟踪算法评估的各种需求.针对现有三维模板跟踪算法测评时使用的数据集不统一,测评结果难以客观全面地反映算法性能的问题,本文基于所构建的数据集,利用平均边缘距离、平均表面距离和重初始化率三种度量标准全面评估了目前主流的三维模板跟踪算法,并对评测结果进行了深入的分析讨论,给出了全面的分析报告和技术展望.此外,基于所构建的数据集,本文提出了对跟踪结果建立误差分析模型,并对结果进行校正的方法,有效改善了三维模版跟踪算法的准确率. 3D template tracking aims to accurately align pre-constructed 3D CAD models with the corresponding targets in the input images,and has important applications in augmented reality and robotics.It is also one of the key problems in the field of computer vision.In recent years,various approaches have been proposed to improve the accuracy and robustness of 3D template tracking,but only a small amount of work has contributed to the construction of 3D template tracking datasets.With the development and wide applications of deep learning,the construction of large-scale datasets in various fields has been paid more and more attention,laying the foundation for the training,testing and evaluation of algorithms,which has greatly promoted the development of related fields.Previous datasets for 3D template tracking are acquired by either video capture or computer rendering.Video-captured datasets are realistic,but since the pose is computed based on hand-crafted markers,the accuracy of the ground-truth pose is not guaranteed and the size of these datasets are also limited due to the time-consuming labelling process.Computer-rendered datasets could be synthesized massively,but the quality of rendered image sequences is limited by the adopted render techniques.Altogether,previous datasets suffer from problems such as limited scale,inaccurate ground-truth poses,unrealistic images and insufficient diversity of model settings,therefore it is meaningful and challenging to construct a high-quality and large-scale dataset for 3D template tracking.In this paper,we propose to construct a large-scale 3D template tracking dataset RDOT(Render Dataset for Object Tracking)based on photorealistic rendering.RDOT is rendered with photorealistic rendering method.The model set contains tens of objects with different physical structures and realistic materials,it also allows the camera and objects to move in pre-defined complex motion modes.Moreover,compared with previous datasets,RDOT takes more accurate control of settings of rendering scenes,it offers various detailed settings of lighting,noise,motion blur and occlusion in different degrees of difficulty.To the best of our knowledge,RDOT is currently the largest 3D template tracking dataset which meets the demands of performance evaluation.Based on RDOT,we evaluated previous 3D template tracking methods in an objective and fair way.Previous approaches have been evaluated on different datasets that suffer the aforementioned problems.In our evaluation,the tracking methods are evaluated with three precision metrics,including ADE(Average Edge Distance),ASD(Average Surface Distance)and RR(Reinitialization Rate).We analyze the evaluation results from multiple aspects considering structures of objects,materials of objects and different settings of rendering scenes.In addition,since RGB-based 3D tracking method usually produce significant errors in the depth direction due to the missing of depth constraint,we propose a statistical model of tracking errors that can be computed based on the accurate ground-truth pose of RDOT.By applying the error model to compensate the resulting object pose parameters,the tracking accuracy can be improved significantly.Finally,we discuss the disadvantages of different tracking approaches,and give an overall conclusion and perspective for future 3D template tracking approaches.

作者何弦李佳宸金立刘力钟凡秦学英 HE Xian;LI Jia-Chen;JIN Li;LIU Li;ZHONG Fan;QIN Xue-Ying(Department of Software,Shandong University,Jinan 250101;Engineering Research Center of Digital Media Technology,Ministry of Education,Shandong University,Jinan 250101;Shichen Information Technology(Shanghai)Co.,Ltd,Shanghai 201203;Department of Computer Science and Technology,Shandong University,Qingdao,Shandong 266237)

机构地区山东大学软件学院数字媒体技术教育部工程研究中心视辰信息科技(上海)有限公司山东大学计算机科学与计算学院

出处《计算机学报》 EI CAS CSCD 北大核心 2022年第3期585-600,共16页 Chinese Journal of Computers

基金国家自然科学基金项目(62172260,61907026) 工信部2019年工业互联网创新发展工程项目之江实验室项目(2020NB0AB02) 山东省高等学校科学技术计划项目(J18KA392)资助

关键词三维模板跟踪数据集构建算法测评增强现实真实感渲染 3D template tracking dataset construction algorithm evaluation augmented reality photorealistic rendering

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1黄鸿,钟凡,秦学英.基于自适应特征融合的无纹理3D目标跟踪[J].计算机辅助设计与图形学学报,2018,30(5):833-841. 被引量：3

二级参考文献1

1李成龙,钟凡,秦学英.基于3维模型的单视图不规则物体定位[J].计算机辅助设计与图形学学报,2015,27(1):68-75. 被引量：6

共引文献2

1黄鸿,钟凡,秦学英.基于时间一致性局部颜色特征的无纹理3D物体实时跟踪[J].计算机辅助设计与图形学学报,2020,32(1):99-111.
2许靖添.基于图像轮廓检测的航天器目标跟踪控制系统设计[J].计算机测量与控制,2021,29(2):67-70. 被引量：1

同被引文献29

1李纪三,蔡文彬,耿利祥,刘溶,任渊.旋转相控阵雷达变数据率目标跟踪算法[J].系统工程与电子技术,2021,43(3):676-683. 被引量：5
2谭芳,穆平安,马忠雪.基于YOLOv3检测和特征点匹配的多目标跟踪算法[J].计量学报,2021,42(2):157-162. 被引量：19
3赵林锁,马瑞强,姜天,宋宝燕,潘一山.两级回归的流式大数据事件自适应预警方法[J].计算机工程与应用,2021,57(7):88-94. 被引量：1
4尤勇,汪浩,任天,顾胜晖,孙佳林.一种监控系统的链路跟踪型日志数据的存储设计[J].软件学报,2021,32(5):1302-1321. 被引量：11
5赵怡,高淑萍,何迪.基于深度学习的眼动跟踪数据融合算法[J].计算机工程与应用,2021,57(10):211-217. 被引量：2
6赵楚楚,王子微,丁冠华,孙进平.基于模糊逻辑的改进自适应IMM跟踪算法[J].信号处理,2021,37(5):724-734. 被引量：9
7余列冰,向隆刚,孙尚宇,关雪峰,吴华意.面向分布式列式存储的轨迹大数据k近邻查询[J].武汉大学学报（信息科学版）,2021,46(5):736-745. 被引量：9
8刘金文,任卫红,田建东.融合人群密度的自适应深度多目标跟踪算法[J].模式识别与人工智能,2021,34(5):385-397. 被引量：5
9Jia-Chen Li,Fan Zhong,Song-Hua Xu,Xue-Ying Qin.3D Object Tracking with Adaptively Weighted Local Bundles[J].Journal of Computer Science & Technology,2021,36(3):555-571. 被引量：2
10侯晓双,张俊.图数据流上时间尊重图模式匹配算法研究[J].计算机应用研究,2021,38(7):1988-1992. 被引量：1

引证文献4

1陈东升.基于AR技术的财会专业在线教学系统设计[J].中国新技术新产品,2022(12):31-35. 被引量：1
2宋修强,金立,宋婧,李佳宸,孟祥旭,秦学英.基于单目RGB数据的三维模板物体跟踪算法综述[J].计算机辅助设计与图形学学报,2024,36(1):1-13.
3陈鹏,白勇,孙翰翔.面向抓取检测的位姿估计数据集自动采集标注系统[J].工程科学学报,2024,46(8):1458-1468.
4刘梓健,陈超鸿.可编程逻辑器件间的大数据自适应跟踪系统设计[J].电子设计工程,2024,32(19):119-123.

二级引证文献1

1刘强.基于候鸟算法的机电一体化在线课程系统设计[J].自动化与仪器仪表,2023(2):181-184.

1杨金铎,王林波,王元峰,兰雯婷,吴显峰.YOLOv3及模板跟踪在电力AR远程作业指导中的应用[J].电工技术,2020(17):47-49.
2赵炫,刘雨田,张旭,刘庆伟,崔涵.基于FAT-AI测试的实时人脸比对技术分析[J].警察技术,2021(6):4-7. 被引量：1
3刘育含,翟玉莹.园艺植物组培育苗技术探析[J].广东蚕业,2022,56(1):82-84. 被引量：1
4佘文学,刘晓鹏,刘凯.桑格尔空天飞行器技术途径分析与思考[J].火箭推进,2021,47(6):11-20. 被引量：4
5李江涛,史慧.浅谈无人机技术智能化应用及展望[J].中国设备工程,2022(4):31-32. 被引量：2
6黄勤超.烟叶烘烤技术研究进展与智能烘烤技术展望[J].江西农业,2022(2):12-13.
7林然.应用于飞机火灾的无人化消防技术展望[J].中国民用航空,2022(1):56-58.
8单月华.高中化学作业设计的优化[J].数理化解题研究,2022(3):113-115. 被引量：1
9蔡圣杰,郑成勇,陈伟杰.基于残差网络的树叶分类[J].五邑大学学报（自然科学版）,2022,36(1):21-27.
10崔宗勇,王晓雅,施君南,曹宗杰,杨建宇.基于中心点回归的大场景SAR图像舰船检测方法[J].电波科学学报,2022,37(1):153-161. 被引量：6

计算机学报

2022年第3期

浏览历史

内容加载中请稍等...

三维模板跟踪的基准合成数据集构建及算法评估被引量：4

参考文献1

二级参考文献1

共引文献2

同被引文献29

引证文献4

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

三维模板跟踪的基准合成数据集构建及算法评估 被引量：4

参考文献1

二级参考文献1

共引文献2

同被引文献29

引证文献4

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

三维模板跟踪的基准合成数据集构建及算法评估被引量：4