基于多任务学习的超分辨率辅助小目标检测

Super-Resolution-Aided Small-Target Detection Based on Multi-Task Learning

下载PDF

导出

摘要小目标通常具有低分辨率和模糊不清的特点,并容易受到遮挡和背景的影响,导致难以实现准确且实时的小目标检测。为提升检测效果,提出一种基于多任务学习的超分辨率辅助小目标检测算法Multi-YOLO。首先,引入一个超分辨率辅助分支引导主干网络提取有效特征,减少小目标信息丢失;其次,采用Anchor based协同监督Anchor free的双检测头训练方法来辅助提升检测准确性,另外,在骨干网络尾部使用CTR3模块加强目标信息与位置感知的关联性;最后,在推理阶段仅使用检测分支进行推理以保证推理速度。实验结果表明,Multi-YOLO相对于基准网络在VEDAI、COCO MiniTrain和SPCD数据集上均取得了一定的性能提升,其中在VEDAI数据集上,Multi-YOLO实现了10.9%的平均精度均值(mAP)提升,且与基准模型大小相近。同时,与主流的单阶段目标检测网络相比,Multi-YOLO在小目标检测方面表现出色,并在精度和速度之间取得了平衡。 Small targets often exhibit low resolution and blurriness and are easily affected by occlusions and background interference,making accurate and real-time detection of small targets challenging.In this study,to enhance the detection performance,a super-resolution-aided small-target detection algorithm based on multi-task learning called Multi-YOLO is proposed.First,a super-resolution auxiliary branch is introduced to guide the main network in extracting effective features,thereby reducing the loss of information for small targets.Second,a collaborative supervision method is employed by combining Anchor based and Anchor free detection heads to improve the detection accuracy.Additionally,a CTR3 module is used at the end of the backbone network to strengthen the correlation between the target information and position awareness.Finally,during the inference stage,only the detection branch is used to maintain the speed of inference.Experimental results show that,compared with the baseline network,Multi-YOLO achieves performance improvement on the VEDAI,COCO MiniTrain,and SPCD datasets.Specifically,on the VEDAI dataset,this method achieves a 10.9%improvement in mean Average Precision(mAP)improvement while maintaining a model size similar to that of the baseline model.Moreover,compared with mainstream single-stage object detection networks,Multi-YOLO excels in small-target detection,maintaining a remarkable balance between accuracy and speed.

作者张天鹏韩晶吕学强 ZHANG Tianpeng;HAN Jing;LÜXueqiang(Beijing Key Laboratory of Internet Culture and Digital Dissemination Research,Beijing Information Science and Technology University,Beijing 100101,China)

机构地区北京信息科技大学网络文化与数字传播北京市重点实验室

出处《计算机工程》 CAS CSCD 北大核心 2024年第9期304-312,共9页 Computer Engineering

基金国家自然科学基金(62171043) 北京市自然科学基金(4232025) 北京市教委科研计划科技一般项目(KM202311232003)。

关键词深度学习小目标检测多任务学习超分辨率注意力机制 deep learning small-target detection multi-task learning super resolution attention mechanism

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1李德舜,郑台,张翼,杨云丽,金雅昭.基于单分类自编码器的烘丝机设备健康度评估方法[J].设备管理与维修,2024(12):30-32.
2龚轩,郭中华,丁荣荣,顾旭璐,闫梓旭.结合双重注意力机制的遥感图像道路分割[J].传感器与微系统,2024,43(9):140-143.
3杜特,宋扬.基于特征金字塔网络与树莓派的护理床智能控制方法研究[J].计算机测量与控制,2024,32(9):206-212.
4王欣,江涛,魏玉梅,马珍,白金燕.基于改进YOLOv5的遥感图像小目标检测算法[J].计算机与数字工程,2024,52(7):2050-2054.
5徐辛超,孟祥柯,于佳琪.基于改进YOLOv5s的遥感影像小目标检测[J].测绘科学,2024,49(6):143-153.
6张艺婷.开拓创新、能闯敢拼,护航高质量就业之路[J].教育家,2024(35):14-15.
7郭月飞,阳旭,葛晨阳.改进YOLOv5的轻量化RGB-IR融合小目标检测[J].办公自动化,2024,29(17):65-68.
8张浩.多自由度工业机器人末端执行器碰撞位置预测研究[J].办公自动化,2024,29(17):69-71.
9杨石含,曾涛,陈晓军.公立医院纪检、审计和行风协同监督的探索与实践[J].卫生法学,2024,32(5):107-113.
10张延军,陈博.基于HigherHRNet的煤矿井下人体姿态估计快速网络研究[J].矿业安全与环保,2024,51(4):35-40.

计算机工程

2024年第9期

浏览历史

内容加载中请稍等...

基于多任务学习的超分辨率辅助小目标检测

相关作者

相关机构

相关主题

浏览历史