Abstract
To address the difficulty of balancing parameter count, computational cost, and accuracy in general gesture recognition algorithms, a lightweight gesture recognition algorithm for basketball referees is proposed. The algorithm restructures YOLOV5s (You Only Look Once Version 5s) in four steps. First, the Involution operator replaces the convolution operator in CSP1_1 (Cross Stage Partial 1_1) to widen the range over which context information is captured and to reduce kernel redundancy. Second, a Coordinate Attention (CA) mechanism is added after the C3 module to strengthen gesture feature extraction. Third, a lightweight content-aware upsampling operator is used to improve the original upsampling module, concentrating the sampling points on the object region and ignoring the background. Finally, Ghost-Net with SiLU (Sigmoid-weighted Linear Unit) as the activation function is used for lightweight pruning. Experimental results on a self-made basketball referee gesture dataset show that the computational cost, parameter count, and model size of the proposed algorithm are 3.3 GFLOPs, 4.0×10^6, and 8.5 MB, which are 79%, 44%, and 40% lower than those of YOLOV5s, respectively. The mAP@0.5 of the proposed algorithm is 91.7%, and its detection frame rate on game video with a resolution of 1920×1280 reaches 89.3 frame/s, showing that the algorithm meets the requirements of low error, high frame rate, and light weight.
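The reported figures can be sanity-checked with a little arithmetic. The sketch below is not from the paper: it assumes the three percentages are relative reductions with respect to YOLOV5s (the reading given in the Chinese abstract) and back-computes the baseline values that reading implies, along with the per-frame latency that corresponds to 89.3 frame/s. The names `proposed`, `reduction`, and `implied_baseline` are hypothetical and used only for illustration.

```python
# Figures quoted in the abstract for the proposed model.
proposed = {"GFLOPs": 3.3, "params_1e6": 4.0, "size_MB": 8.5}

# Assumption: percentages are relative reductions versus YOLOV5s.
reduction = {"GFLOPs": 0.79, "params_1e6": 0.44, "size_MB": 0.40}


def implied_baseline(value, reduced_by):
    """Return the baseline b such that value = b * (1 - reduced_by)."""
    return value / (1.0 - reduced_by)


# Baseline YOLOV5s figures implied by the abstract's (rounded) numbers.
baseline = {k: implied_baseline(proposed[k], reduction[k]) for k in proposed}

# Per-frame latency implied by the reported frame rate.
fps = 89.3
ms_per_frame = 1000.0 / fps

for name, value in baseline.items():
    print(f"{name}: implied YOLOV5s baseline is about {value:.1f}")
print(f"latency: about {ms_per_frame:.1f} ms per frame")
```

Under this reading the implied baselines are roughly 15.7 GFLOPs, 7.1×10^6 parameters, and 14.2 MB. Because the abstract's figures are rounded, these values are approximate and are derived only from the numbers quoted above.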
Authors
LI Zhongyu (李忠雨)
SUN Haodong (孙浩东)
LI Jiao (李娇)
Affiliations: Microelectronic Research and Development Center, Shanghai University, Shanghai 200444, China; School of Mechatronic Engineering and Automation, Shanghai University, Shanghai 200444, China; Key Laboratory of Advanced Display and System Applications, Ministry of Education (Shanghai University), Shanghai 200444, China
Source
Journal of Computer Applications (《计算机应用》)
Indexed in: CSCD; Peking University Core Journals (北大核心)
2023, No. 7, pp. 2173-2181 (9 pages)
Funding
Supported by the National Natural Science Foundation of China (52107239).