
Detection model for wine grapes using MobileNetV2 lightweight network (cited by: 5)
Abstract Efficient detection of grapes in images is one of the most important technologies for automatic grape-harvesting robots. In this study, a Wine Grape Detection Model (WGDM) based on a lightweight network was proposed to improve the speed and accuracy of detecting small grape targets in field images. Firstly, the lightweight network MobileNetV2 replaced DarkNet53, the backbone of the original YOLOv3, for feature extraction; it was adopted to increase the detection speed because it offers a smaller size, faster speed, and higher accuracy in image recognition than DarkNet53. Secondly, the M-Res2Net module was added to the multi-scale detection module of YOLOv3, while some standard convolutional layers with 1×1 and 3×3 convolution kernels were removed, giving the improved model a better capability for multi-scale feature extraction and higher detection accuracy. Finally, a new localization loss function was built from the balanced loss and the intersection over union (IoU) loss, while the classification and objectness losses stayed the same as in YOLO. This gave a better balance among the objectness, classification, and localization terms during model training and thereby improved the accuracy of object localization. Five detection models were trained: the proposed WGDM, Single Shot Detector (SSD), the original YOLOv3, YOLOv4, and Faster Regions with Convolutional Neural Network (Faster R-CNN).
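The intersection over union (IoU) term of the localization loss can be illustrated with a minimal sketch. The abstract does not give the exact formula, so the form 1 - IoU, the (x1, y1, x2, y2) box layout, and the function names `iou` and `iou_loss` are assumptions for illustration, not the authors' implementation. The balanced loss term is omitted because the abstract does not define it.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    # Corners of the overlapping rectangle (may be empty).
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])
    # Clamp to zero so disjoint boxes yield no intersection area.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def iou_loss(pred_box, true_box):
    """Assumed loss form: 0 for a perfect match, 1 for boxes that do not overlap."""
    return 1.0 - iou(pred_box, true_box)
```

Under this assumed form, a predicted box that coincides with the ground-truth box gives a loss of 0, and the loss grows as the overlap shrinks, which is why an IoU-based term pushes training toward tighter localization.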
The publicly available Wine Grape Instance Segmentation Dataset (WGISD), comprising 300 wine grape images and 300 annotation files with 4432 objects, was used, and all models were evaluated under the same experimental conditions. Input images were resized from their original resolution of 2048×1365 or 2048×1536 pixels to 608×608 pixels. The experimental results showed that the proposed WGDM achieved an average precision of 81.20% on the test set of the wine grape image dataset. The F1-score (a metric that balances the precision and recall of the model) of the proposed model reached 0.8563, which was 0.0563 higher than that of SSD, 0.0054 higher than that of the original YOLOv3, 0.0417 higher than that of YOLOv4, and 0.0125 higher than that of Faster R-CNN. The network structure size of the proposed model was 44 MB, which was 50 MB smaller than that of SSD, 191 MB smaller than that of the original YOLOv3 or YOLOv4, and 83 MB smaller than that of Faster R-CNN. The average detection time per grape image for the proposed model was 6.29 ms, which was 4.91 ms shorter than that of SSD, 7.75 ms shorter than that of the original YOLOv3, 14.84 ms shorter than that of YOLOv4, and 158.20 ms shorter than that of Faster R-CNN. Moreover, the number of floating-point operations (the sum of the numbers of multiplication and addition operations) of the proposed model was only 10.14×10^9, which was 11.58% of that of SSD, 14.54% of that of the original YOLOv3, 16.05% of that of YOLOv4, and 5.48%-15.33% of that of Faster R-CNN. Therefore, the proposed WGDM recognized and located grape fruits in the field faster and more accurately, providing a feasible approach to efficient visual detection for grape-picking robots.
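The F1-score reported above is the harmonic mean of precision and recall. The sketch below shows that formula and, as plain arithmetic, recovers the baseline detection times and model sizes implied by the differences reported in the abstract. The derived baseline values are not stated in the source, and all names (`f1_score`, `WGDM_TIME_MS`, `baseline_time_ms`, and so on) are illustrative.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall; returns 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values reported in the abstract for the proposed WGDM.
WGDM_TIME_MS = 6.29
WGDM_SIZE_MB = 44

# Reported reductions of WGDM relative to each baseline (from the abstract).
TIME_REDUCTION_MS = {"SSD": 4.91, "YOLOv3": 7.75, "YOLOv4": 14.84, "Faster R-CNN": 158.20}
SIZE_REDUCTION_MB = {"SSD": 50, "YOLOv3": 191, "YOLOv4": 191, "Faster R-CNN": 83}

# Derived, not stated in the source: baseline = WGDM value + reported reduction.
baseline_time_ms = {m: round(WGDM_TIME_MS + d, 2) for m, d in TIME_REDUCTION_MS.items()}
baseline_size_mb = {m: WGDM_SIZE_MB + d for m, d in SIZE_REDUCTION_MB.items()}

if __name__ == "__main__":
    for model in TIME_REDUCTION_MS:
        print(f"{model}: about {baseline_time_ms[model]} ms, about {baseline_size_mb[model]} MB")
```

Because the harmonic mean is dominated by the smaller of its two inputs, a detector cannot reach a high F1-score by trading recall for precision or the reverse, which is why the abstract uses it as a single balanced accuracy measure.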
Authors Li Guojin (李国进); Huang Xiaojie (黄晓洁); Li Xiuhua (李修华); Ai Jiaoyan (艾矫燕) (School of Electrical Engineering, Guangxi University, Nanning 530004, China)
Source Transactions of the Chinese Society of Agricultural Engineering (《农业工程学报》), indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2021, Issue 17, pp. 168-176, F0003 (10 pages in total)
Funding National Natural Science Foundation of China (31760342); Guangxi Innovation-Driven Development Special Project (桂科AA17202032-2).
Keywords machine vision; image processing; models; grape; detection; YOLO; Res2Net