摘要
基于卷积神经网络的目标检测算法在追求较高精度的同时,忽略了检测速度,使得算法难以在有限算力的情况下实现实时检测。在YOLO目标检测算法的基础上,采用一系列轻量化的方法,运用Mobilenetv1网络替换Darknet53基础网络,将YOLO head部分3×3标准卷积替换为深度可分离卷积,根据灵敏度对卷积层滤波器进行排序和修剪,并在嵌入式GPU TX2平台上进行C++推理部署。在VOC数据集上的测试结果表明,改进算法在精度仅下降0.75个百分点的前提下实现了2.4倍加速,模型占用内存仅为原来的21.5%。
The existing target detection algorithms based on convolutional neural networks have achieved a high accuracy,but the accuracy gain comes at the cost of detection speed,making it difficult for the algorithms to implement real-time detection with limited computing power.To solve this problem,a series of lightweight methods are adopted based on the YOLO target detection algorithm.The methods employ Mobilenetv1 to replace the basic network of Darknet53,and depthwise separable convolutions to replace the 3×3 standard convolutions in the YOLO head part.On this basis,the convolution layer filter is sorted and pruned according to sensitivity.Finally,C++inference algorithms are deployed on the embedded GPU TX2 platform.The test results on the VOC data set show that the improved algorithm provides an acceleration of 2.4 times while the accuracy is reduced by only 0.75 percentage points.Additionally,the memory occupied by the improved model is only 21.5%of that occupied by the original model.
作者
黄靖淞
左颢睿
张建林
HUANG Jingsong;ZUO Haorui;ZHANG Jianlin(Institute of Optics and Electronics,Chinese Academy of Sciences,Chengdu 610209,China;College of Computer Science,University of Chinese Academy of Sciences,Beijing 100049,China)
出处
《计算机工程》
CAS
CSCD
北大核心
2021年第10期236-241,共6页
Computer Engineering
基金
科技委创新项目(G158207)。