Cross-Domain Object Detection Algorithm for Complex End-to-End Scene Understanding


Abstract: Deep learning applications often assume that the deployment scenario shares a similar visual-domain feature distribution with the training data. This assumption does not always hold in complex end-to-end scenarios, which makes it difficult to meet the demands of intelligent detection services in open environments. To address this, an object detection algorithm based on artificial intelligence closed-loop ensemble theory and cross-visual-domain detection is proposed. Multi-scale convolutional layers are introduced into the detection framework to build the backbone network and the bottleneck layer network. A visual domain discriminator with long-range dependency attention serves as a secondary detection head to refine the detection results. A background focusing module based on spatial reconstruction attention units performs focused learning on pseudo-background images, thereby improving the accuracy of cross-visual-domain object detection. Experimental results show that, in cross-visual-domain scenarios, the proposed algorithm improves average detection precision by 6.9% over two-stage algorithms and by 9.0% over single-stage algorithms.

Authors: CHEN Aoran; HUANG Hai; ZHU Yueyan; XUE Junsheng (School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China)

Affiliation: School of Information and Communication Engineering, Beijing University of Posts and Telecommunications

Source: Journal of Beijing University of Posts and Telecommunications, 2024, No. 4, pp. 57-62 (6 pages). Indexed in: EI, CAS, CSCD, Peking University Core Journals.

Funding: National Key Research and Development Program of China (2021YFF0900700).

Keywords: holistic artificial intelligence; computer vision; neural network; object detection

Classification number: TP391.4 [Automation and Computer Technology: Computer Application Technology]

