期刊文献+

基于改进的YOLOv4-tiny模型剪枝与量化

Pruning and Quantization Based on Improved YOLOv4-tiny
下载PDF
导出
摘要 针对YOLOv4-tiny存在计算量较大,检测精度低,无法满足嵌入式设备实时性需求的问题,论文基于MobileNetv3改进的轻量级网络YOLOv4-E,使用BN层的γ尺度因子对冗余的特征通道进行剪枝,在25%剪枝率下模型大小降低到了6.7MB,mAP仅降低了0.59%,FPS提升了8.8%。同时使用NCNN前向推理框架对剪枝后的模型进行Int8量化,在树莓派4B上检测单张图片仅需173 ms,满足了实时性需求。 Aiming at the problem that YOLOv4-tiny has a large amount of calculation and low detection accuracy,which cannot meet the real-time requirements of embedded devices,based on the improved lightweight network YOLOv4-E of MobileNetv3,this paper uses the scale factor of BN layer to prune the redundant characteristic channels.At 25% pruning rate,the size of the model is reduced to 6.7 MB,mAP is reduced by only 0.59%,and FPS is increased by 8.8%.At the same time,the NCNN forward reasoning framework is used to quantify the Int8 of the pruning model.It is only 173 ms to detect a single image on RaspberryPi 4B,which meets the real-time requirements.
作者 李秉涛 何勇 LI Bingtao;HE Yong(College of Computer Science and Technology,Guizhou University,Guiyang 550025)
出处 《计算机与数字工程》 2024年第9期2721-2725,2770,共6页 Computer & Digital Engineering
关键词 目标检测 YOLOv4-tiny 剪枝 嵌入式设备 object detection YOLOv4-tiny pruning embedded devices
  • 相关文献

参考文献4

二级参考文献18

共引文献81

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部