Multi-task learning andjoint refinement between camera localizationand objecttdetection

导出

摘要 Visual localization and object detection both play important roles in various tasks.In many indoor application scenarios where some detected objects have fixed positions,the two techniques work closely together.However,few researchers consider these two tasks simultaneously,because of a lack of datasets and the little attention paid to such environments.In this paper,we explore multi-task network design and joint refinement of detection and localization.To address the dataset problem,we construct a medium indoor scene of an aviation exhibition hall through a semi-automatic process.The dataset provides localization and detection information,and is publicly available at https://drive.google.com/drive/folders/1U28zk0N4_I0db zkqyIAK1A15k9oUKOjI?usp=sharing for benchmarking localization and object detection tasks.Targeting this dataset,we have designed a multi-task network,JLDNet,based on YOLO v3,that outputs a target point cloud and object bounding boxes.For dynamic environments,the detection branch also promotes the perception of dynamics.JLDNet includes image feature learning,point feature learning,feature fusion,detection construction,and point cloud regression.Moreover,object-level bundle adjustment is used to further improve localization and detection accuracy.To test JLDNet and compare it to other methods,we have conducted experiments on 7 static scenes,our constructed dataset,and the dynamic TUM RGB-D and Bonn datasets.Our results show state-of-the-art accuracy for both tasks,and the benefit of jointly working on both tasks is demonstrated.

作者 Junyi Wang Yue Qi

机构地区 State Key Laboratory of Virtual Reality Technology and Systems Peng Cheng Laboratory Qingdao Research Institute of Beihang University School of Computer Science and Technology

出处《Computational Visual Media》 SCIE EI CSCD 2024年第5期993-1011,共19页 计算可视媒体（英文版）

基金 supported by the National Natural Science Foundation of China(No.62072020) Key-Area Research and the Leading Talents in Innovation and Entrepreneurship of Qingdao(No.19-3-2-21-zhc).

关键词 visual localization object detection joint optimization multi-task learning

分类号 TP3 [自动化与计算机技术—计算机科学与技术]

引文网络
相关文献

1Zou Yating,Wu Suhua.China Agricultural Museum:Showcasing the Rich and Diverse Agricultural Civilization[J].China & The World Cultural Exchange,2024(5):44-48.
2Carmen Morales,Alfonso Sanchez-Paus Diaz,Daniel Dionisio,Laura Guarnieri,Giulio Marchi,Danae Maniatis,Danilo Mollicone.Earth Map:A Novel Tool for Fast Performance of Advanced Land Monitoring and Citation:Morales C,DiazAS,Climate Assessment Climate Assessment[J].Journal of Remote Sensing,2023(1):1-18.
3赵晓娜,贺晓娇,包雯心.数字媒体技术下“红色文化”融入学生党员教育的路径研究——以辽宁理工学院为例[J].创新教育研究,2024,12(10):252-260.

Computational Visual Media

2024年第5期

浏览历史

内容加载中请稍等...

Multi-task learning andjoint refinement between camera localizationand objecttdetection

相关作者

相关机构

相关主题

浏览历史