多尺度特征融合轻量化夜间红外行人实时检测被引量：13

Multi-Scale Feature Fusion Lightweight Real-Time Infrared Pedestrian Detection at Night

导出

摘要针对辅助驾驶中夜间小目标红外行人检测精度低、网络模型占用内存空间大、检测速度难以满足实时检测要求等问题,提出了一种轻量化的夜间红外图像行人检测神经网络YOLO-Person。首先提出一种以MobileNetV3轻量化网络为骨干网络,以多尺度融合目标检测层为预测模块的网络模型,以解决网络模型大、推理速度慢的问题,大幅减少了模型计算量,初步实现轻量化;然后通过在网络中添加空间金字塔池化模块与更小感受野的检测层,增强网络输出特征图的表征能力,解决数据集中行人目标尺度大小不均衡的问题,提高模型的红外行人检测精度;最后应用通道剪枝对模型进行剪枝,减少特征图的通道数,获得最终网络模型YOLO-Person。通过Jetson Nano移动开发平台,在夜间红外图像行人数据集上验证YOLO-Person轻量化模型,结果表明:与YOLOv3网络模型相比,提出的YOLO-Person网络模型更适于移动端的夜间红外行人检测,平均检测精度达到了92.2%,检测速度由26frame/s提高到了69frame/s,模型大小也由246MB减少到了11.7MB。 Objective Poor lighting conditions lead to a high accident rate during night driving.In order to reduce the incidence of night traffic accidents,various auxiliary driving technologies such as ultrasonic ranging,millimeter wave radar and visual auxiliary driving are widely used.Infrared thermal imaging technology based on the thermal radiation of object and reflection imaging with certain penetrability is less affected by the weather and light conditions at night.Human targets within the vision field can be accurately captured by infrared thermal imaging technology,which is convenient for pedestrian detection.In addition,the cost of infrared imaging equipment has been decreased in recent years,making it possible to be mounted on vehicles.Therefore,the fusion of infrared thermal imaging technology and pedestrian target detection algorithm based on deep learning is of great research significance and with a broad market application prospective in vehicle auxiliary driving.In this paper,a pedestrian detection model based on night infrared image is proposed for night driving,which can detect pedestrians on the night road in real time.This study can be applied to the field of auxiliary driving for early warning and active braking provided to drivers,reducing the probability of night driving accidents and providing higher security for vehicles and pedestrians.Methods Aiming at the problems of low accuracy in infrared pedestrian detection for small targets at night,large committed memory of network model,and the difficulty of real-time detection in auxiliary driving due to the low model detection speed,a lightweight pedestrian detection neural network called YOLO-Person is proposed for night infrared images.Firstly,the MobileNetV3 lightweight network is used as the backbone network,while the multi-scale fusion target detection layer is used as the prediction module to solve the problem of large model size and slow inference speed,which greatly reduces the amount of model calculation and obtains a preliminary lightweight network model.Furthermore,by adding the spatial pyramid pooling module and the detection layer with smaller receptive field in the network,the representation ability is enhanced to solve the problem of unbalanced pedestrian target scale in the dataset and improve the infrared pedestrian detection accuracy.Finally,channel pruning is used to reduce the number of channels in the feature map,and the final network model YOLO-Person is obtained.The lightweight model YOLO-Person is verified on the pedestrian dataset of night infrared images based on Jetson Nano mobile development platform.Results and Discussions A lightweight model YOLO-Person is proposed for night infrared pedestrian detection(Fig.1).Firstly,MobileNetV3 lightweight network is used as the backbone network,and the multi-scale fusion detection layer is used as the prediction module.Although the accuracy is reduced by 1.2%,the speed is increased by 34 frame/s,and the model size is reduced by 151 MB(Table 1),which indicates that the lightweight of the night infrared pedestrian detection model is preliminarily realized.Secondly,aiming at the problem of unbalanced pedestrian target scale in dataset,spatial pyramid pooling module(Fig.2)and small receptive field detection layer are added in the network,through which the accuracy is improved by 3.3%,the speed is reduced by 23 frame/s,and the model size is increased by 5.1 MB(Table 2).Moreover,the model is pruned(Fig.3)to reduce a large number of redundant channels(Fig.6).When the pruning rate is 95%,the number of model channels,accuracy and model size achieve balance and optimization(Table 3).In addition,the model is fine-tuned to obtain the final lightweight model YOLO-Person,which reaches the accuracy of 92.2%,the speed of 69 frame/s,and the model size of 11.7 MB(Table 4).Finally,the model is deployed on the Jetson Nano mobile development platform to verify the detection effect(Fig.7),and the test results of three networks are compared.The lightweight model YOLO-Person gets the best results:the accuracy of 92.2%,the speed of 12 frame/s,and the model size of 11.7 MB(Table 5).Conclusions A lightweight model YOLO-Person for night infrared pedestrian detection is proposed in this paper.Firstly,MobileNetV3 lightweight network is used as the backbone network,and the multi-scale fusion detection layer is used as the prediction module to achieve the preliminary model lightweight.Secondly,spatial pyramid pooling module and small receptive field detection layer are added to improve the detection accuracy of small targets.Finally,the model parameters are greatly reduced through channel pruning,and the final lightweight model YOLO-Person is obtained.The experimental results show that the detection accuracy and speed of YOLO-Person model reach 92.2%and 69 frame/s,respectively,meeting the requirements of real-time pedestrian detection.The YOLO-Person network model is deployed on the Jetson Nano mobile development platform,where the detection speed of 12 frame/s exceeds that of YOLOv3 and approaches that of YOLOv3-tiny,which further verifies the superiority of the proposed method.By optimizing the network structure and increasing the effective functional network layer,the detection accuracy of the model will be further improved in the future research.

作者何自芬陈光晨陈俊松张印辉 He Zifen;Chen Guangchen;Chen Junsong;Zhang Yinhui(Faculty of Mechanical and Electrical Engineering,Kunming University of Science and Technology,Kunming 650500,Yunnan,China)

机构地区昆明理工大学机电工程学院

出处《中国激光》 EI CAS CSCD 北大核心 2022年第17期115-124,共10页 Chinese Journal of Lasers

基金国家自然科学基金(62171206,61761024,62061022)。

关键词成像系统夜间红外行人检测多尺度融合 MobileNetV3网络模型剪枝 imaging systems infrared pedestrian detection at night multi-scale fusion MobileNetV3 network model pruning

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献8

1王一同,周宏强,闫景逍,合聪,黄玲玲.基于深度学习算法的计算光学研究进展[J].中国激光,2021,48(19):255-277. 被引量：14
2邹梓吟,盖绍彦,达飞鹏,李昱.基于注意力机制的遮挡行人检测算法[J].光学学报,2021,41(15):149-157. 被引量：24
3刘学,李范鸣,刘士建.改进的SSD红外图像行人检测算法[J].电光与控制,2020,27(1):42-46. 被引量：15
4赵斌,王春平,付强,陈一超.基于深度注意力机制的多尺度红外行人检测[J].光学学报,2020,40(5):41-52. 被引量：21
5赵亮,胡杰,刘汉,安永鹏,熊宗权,王宇.基于语义分割的深度学习激光点云三维目标检测[J].中国激光,2021,48(17):171-183. 被引量：34
6苗壮,张湧,陈瑞敏,李伟华.基于关键点的快速红外目标检测方法[J].光学学报,2020,40(23):130-138. 被引量：9
7于博,马书浩,李红艳,李春庚,安居白.远红外车载图像实时行人检测与自适应实例分割[J].激光与光电子学进展,2020,57(2):286-296. 被引量：9
8李玉华,刘全程,李天华,吴彦强,牛子孺,侯加林.基于Jetson Nano处理器的大蒜鳞芽朝向调整装置设计与试验[J].农业工程学报,2021,37(7):35-42. 被引量：10

二级参考文献66

1金诚谦,袁文胜,吴崇友,张敏.大蒜播种时鳞芽朝向对大蒜生长发育影响的试验研究[J].农业工程学报,2008,24(4):155-158. 被引量：64
2荐世春,赵峰,李青,李福欣.旋转式蒜瓣单粒定向取种器的研究设计[J].农业装备与车辆工程,2009,47(2):18-20. 被引量：12
3荐世春,刘云东.大蒜播种机蒜瓣自动定向控制装置的试验研究[J].农业装备与车辆工程,2009,47(10):28-29. 被引量：33
4杨清明,李娟玲,何瑞银.基于图像处理的大蒜蒜瓣朝向识别[J].浙江农业学报,2010,22(1):119-123. 被引量：16
5苏晓倩,孙韶媛,戈曼,谯帅,谷小婧.车载红外图像的行人检测与跟踪技术[J].激光与红外,2012,42(8):949-953. 被引量：15
6赵丽清,马志勇.大蒜播种机装盘系统蒜瓣定向识别算法的研究[J].农机化研究,2013,35(6):163-166. 被引量：10
7彭志勇,王向军,卢进.窗口热辐射下基于视觉显著性的红外目标检测方法[J].红外与激光工程,2014,43(6):1772-1776. 被引量：3
8许茗,于晓升,陈东岳,吴成东,贾同,茹敬雨.复杂热红外监控场景下行人检测[J].中国图象图形学报,2018,23(12):1829-1837. 被引量：14
9孙宁,陈梁,韩光,李晓飞.深度分类网络研究及其在智能视频监控系统中的应用[J].电光与控制,2015,22(9):77-82. 被引量：6
10刘让,王德江,贾平,周达标,丁鹏.红外图像弱小目标探测技术综述[J].激光与光电子学进展,2016,53(5):38-46. 被引量：20

共引文献122

1侯加林,李超,娄伟,李天华,李玉华,周凯.大蒜联合收获机按压式切根装置设计与试验[J].农业机械学报,2022,53(10):167-174.
2田珊.基于D-S证据理论的红外图像行人检测[J].科技通报,2021,37(7):52-56. 被引量：1
3张业,徐婧.基于语义点云的巡航系统移动目标轨迹识别[J].北京测绘,2023,37(8):1115-1120.
4孙永选.古人名字中多音字的读音[J].语文建设,2000(2):23-23. 被引量：1
5张程发,韩竺秦,陈益锋,高兆鑫,谢清河,陈武钿.基于深度学习的机器视觉儿童智能安防系统[J].电子质量,2020,0(4):40-43. 被引量：1
6史健婷,张贵强.改进的YOLOv3红外图像行人检测算法[J].黑龙江科技大学学报,2020,30(4):442-447. 被引量：5
7周舟,韩芳,王直杰.改进SSD算法在中国手语识别上的应用[J].计算机工程与应用,2021,57(3):156-161. 被引量：5
8卜德飞,孙韶媛,黄荣,王宇岚,刘致驿.基于改进SSD的无人驾驶夜间目标检测[J].东华大学学报（自然科学版）,2021,47(1):63-69. 被引量：1
9高刘雅,孙冬,卢一相.基于轻量级注意机制的人脸检测算法[J].激光与光电子学进展,2021,58(2):122-130. 被引量：7
10尧佼,于凤芹.基于候选区域定位与HOG-CLBP特征组合的行人检测[J].激光与光电子学进展,2021,58(2):157-164. 被引量：6

同被引文献106

1Jinpu Lin,Florian Haberstroh,Stefan Karsch,Andreas Döpp.Applications of object detection networks in high-power laser systems and experiments[J].High Power Laser Science and Engineering,2023,11(1):52-60. 被引量：9
2胡良梅,高隽,何柯峰.图像融合质量评价方法的研究[J].电子学报,2004,32(F12):218-221. 被引量：100
3张立保,章珏.基于显著性分析的自适应遥感图像融合[J].中国激光,2015,42(1):307-314. 被引量：23
4徐波,朱青松,熊艳海.视频图像去雨技术研究前沿[J].中国科技论文,2015,10(8):916-927. 被引量：7
5傅志中,王雪,李晓峰,徐进.基于视觉显著性和NSCT的红外与可见光图像融合[J].电子科技大学学报,2017,46(2):357-362. 被引量：37
6李克强,戴一凡,李升波,边明远.智能网联汽车(ICV)技术的发展现状及趋势[J].汽车安全与节能学报,2017,8(1):1-14. 被引量：401
7王战古,高松,邵金菊,谭德荣,孙亮,于杰.基于深度置信网络的多源信息前方车辆检测[J].汽车工程,2018,40(5):554-560. 被引量：6
8周浦城,周远,韩裕生.视频图像去雨技术研究进展[J].图学学报,2017,38(5):629-646. 被引量：9
9江泽涛,何玉婷.基于卷积自编码器和残差块的红外与可见光图像融合方法[J].光学学报,2019,39(10):210-218. 被引量：16
10段仲静,李少波,胡建军,杨静,王铮.深度学习目标检测方法及其主流框架综述[J].激光与光电子学进展,2020,57(12):51-66. 被引量：61

引证文献13

1吕昌,尹和,邵叶秦.基于结构重参数化的目标检测模型[J].电子测量技术,2023,46(18):114-121.
2王琳毅,白静,李文静,蒋金哲.YOLO系列目标检测算法研究进展[J].计算机工程与应用,2023,59(14):15-29. 被引量：25
3杨叶君,刘刚,肖刚,顾新杰.基于自适应特征增强和生成器路径交互的红外与可见光图像融合[J].激光与光电子学进展,2023,60(14):170-180. 被引量：2
4相敏月,涂振宇,孙逸飞,方强,马飞.基于ECA和BIFPN的低照度环境下的行人目标检测算法[J].智能计算机与应用,2023,13(9):189-193. 被引量：2
5刘珂琪,董绵绵,郜辉,吕志刚,郭宝亿,庞敏.基于光照感知权重融合的多模态行人检测算法[J].激光与光电子学进展,2023,60(16):137-147.
6杨阳,任振南,李北辰.联合卷积神经网络和转换器的红外与可见光图像融合[J].激光与光电子学进展,2023,60(16):185-195.
7刘润坤,党世杰,张洪远,牛银银,米贯勋,李三华,陈振鑫,赵凌霄,李鹏.基于改进RetinaNet的宫颈异常细胞检测算法[J].中国激光,2023,50(15):101-110.
8胡待方,仝秋红,柴国庆,王凯,穆雨薇,苏胜君.雨天车辆检测的两阶段渐进式图像去雨算法[J].激光与光电子学进展,2023,60(22):103-112.
9高小强,常侃,凌铭阳,银梦雨.多模态自适应特征融合的目标检测[J].激光与光电子学进展,2023,60(24):100-109.
10孙明正,李浩.一种基于ResPNet的光伏组件红外成像故障检测方法[J].激光与光电子学进展,2023,60(24):193-201.

二级引证文献29

1孟青云,戴佳蔚,查佳佳,熊亦可,司博宇.基于YOLOv8算法的常用手势识别[J].现代仪器与医疗,2023,29(4):12-20. 被引量：8
2江祥奎,杜遥遥,胡浩昌.一种改进YOLOv5s小目标无人机实时检测算法[J].西安邮电大学学报,2023,28(3):88-96. 被引量：2
3陈星宇,凡玉琪,刘虎涛,蒋培宗.基于改进YOLOv5n的红枣缺陷识别方法[J].信息与电脑,2023,35(14):181-186.
4孙圣明,吴秋灵,吴建明.基于改进YOLOv4-tiny的废气砣状态检测算法研究[J].燃料与化工,2023,54(6):42-44.
5张跃,陈宁,孔明,郭钢祥,郭斌,吴晓康.基于改进YOLOv4网络的手机曲面玻璃缺陷检测[J].现代电子技术,2023,46(23):103-108.
6张辉,苏国用,赵东洋.基于FBEC-YOLOv5s的采掘工作面多目标检测研究[J].工矿自动化,2023,49(11):39-45.
7罗奕凯.铁路货车装载状态图像质量客观评价方法研究[J].铁道货运,2023,41(11):47-53.
8孙歆,王晓燕,刘静,黄贺瑄.经典YOLO系列目标检测算法及其在乳腺癌检测中的应用[J].计算机系统应用,2023,32(12):52-62. 被引量：1
9杨明瑞,古玉锋.基于改进YOLOv5的无人驾驶农业车辆视觉检测[J].南方农机,2024,55(1):21-23.
10朱俊,封磊.基于声呐图像的鱼群识别与计数方法[J].南京理工大学学报,2023,47(6):782-789.

1王孝天,卢紫微,张燕.基于多尺度融合的图像超分辨率重建[J].控制工程,2022,29(9):1573-1579. 被引量：1
2马洁,吴英宾.基于Android移动开发平台的答题APP[J].电子技术与软件工程,2021(2):68-69. 被引量：3
3陈继清,韦德鹏,龙腾,罗天,王桦彬.基于卷积神经网络的害虫分类[J].中国农机化学报,2022,43(11):188-194. 被引量：2
4渠涵冰,贾振堂.轻量级高分辨率人体姿态估计研究[J].激光与光电子学进展,2022,59(18):119-126. 被引量：2
5史雨馨,朱继杰,凌志刚.基于特征增强YOLOv4的无人机检测算法研究[J].电子测量与仪器学报,2022,36(7):16-23. 被引量：8
6任胜兰,郭慧娟,黄文豪,亓慧.基于物联网和改进Yolo-v4-tiny的智能果蝇诱捕方案[J].南京理工大学学报,2022,46(5):586-593. 被引量：2
7杨栋杰,高贤君,冉树浩,张广斌,王萍,杨元维.基于多重多尺度融合注意力网络的建筑物提取[J].浙江大学学报（工学版）,2022,56(10):1924-1934. 被引量：3
8王权顺,吕蕾,黄德丰,付思琴,余华云.基于改进YOLOv4算法的苹果叶部病害缺陷检测研究[J].中国农机化学报,2022,43(11):182-187. 被引量：8
9闫雪,祝启斌,陈菊霞,夏巧桥.基于深度残差网络的轻量级生成图像压缩方法[J].激光杂志,2022,43(9):76-82. 被引量：1
10李雨诗,张才裕,赵杨珂,陈绪君.基于模型压缩的轻量化障碍物检测模型研究[J].激光杂志,2022,43(9):38-43. 被引量：2

中国激光

2022年第17期

浏览历史

内容加载中请稍等...

多尺度特征融合轻量化夜间红外行人实时检测被引量：13

参考文献8

二级参考文献66

共引文献122

同被引文献106

引证文献13

二级引证文献29

相关作者

相关机构

相关主题

浏览历史

多尺度特征融合轻量化夜间红外行人实时检测 被引量：13

参考文献8

二级参考文献66

共引文献122

同被引文献106

引证文献13

二级引证文献29

相关作者

相关机构

相关主题

浏览历史

多尺度特征融合轻量化夜间红外行人实时检测被引量：13