期刊文献+

采用改进YOLOv3算法检测青皮核桃 被引量:4

Detection of green walnut by improved YOLOv3
下载PDF
导出
摘要 使用机器视觉对果实检测并进行估产是实现果园智能化管理的重要途径,针对自然环境下青皮核桃与叶片颜色差异小、核桃体积较小导致青皮核桃不易检出的问题,该研究提出一种基于改进YOLOv3的青皮核桃视觉检测方法。依据数据集特征进行数据增强,引入Mixup数据增强方法,使模型从更深的维度学习核桃特征;针对核桃单种类目标检测比较不同预训练模型,选择精度提升更明显的Microsoft Common Objects in Context(COCO)数据集预训练模型;依据标注框尺寸统计对锚框进行调整,避免锚框集中,提升模型多尺度优势。在消融试验中,前期改进将平均精度均值提升至93.30%,在此基础上,引入Mobil Net-v3骨干网络替换YOLOv3算法中原始骨干网络,提升模型检测能力及轻量化。试验表明,基于改进YOLOv3的青皮核桃检测平均精度均值为94.52%,超越YOLOv3其他2个骨干网络和Faster RCNN-ResNet-50网络。改进模型大小为88.6 M,检测速度为31帧/s,检测速度是Faster RCNN-ResNet-50网络的3倍,可以满足青皮核桃实时准确检测需求。该方法可为核桃果园智能化管理中的估产、采收规划等提供技术支撑,也可为近背景颜色的小果实实时准确检测提供思路。 The aim of this research is to detect green walnuts in the natural environment using machine vision. The yield was also estimated to realize the intelligent management of orchards under fruiting detection. A number of challenges remained to identify the green-skinned walnuts at present, including the more complex environment in which the green walnuts grow, the small difference in color between green walnuts and leaves, as well as the small sizes of the green walnut fruit. An optimization strategy was then proposed to deal with these challenges during the green walnut detection. The images were also accurately captured to determine the walnut yields for the harvest planning. As such, deep learning was effectively used for the object detection of green walnut. A YOLOv3 network was selected to fully meet the actual requirements of real-time detection on the green walnut. An ablation experiment was performed on an improved YOLOv3 object detection algorithm. The quantitative and qualitative analyses were also utilized to verify the model under the optimization strategies. A pre-trained model was substantially improved to detect the green walnuts, compared with the ImageNet DET and COCO datasets. The results showed that the mean Average Precision(m AP) of green walnut detection was 83.55% after the YOLOv3 network training using the pre-trained model with the COCO dataset, and 86.11% for green walnut detection applying the COCO dataset pre-trained model network. The superior detection was achieved in the pre-training model using the COCO dataset for the single category detection. Ablation experiments demonstrated that there was an excellent performance in the green walnut detection using the data enhancement on dataset feature, with a 3.81 percentage points increase in the mAP of the model. The Mixup data enhancement was then added to promote the image data complexity for better detection performance, with a 1.24 percentage point increase in the mAP. The K-means clustering was applied to cluster the annotation boxes for the Anchor scale size in the YOLOv3 model. The Anchor was adjusted to obtain from the clustering in the ablation experiments, indicating the improved mAP of the model. The lightweight MobileNet-v3 network was also selected as the backbone network of object detection, in order to extend to the mobile terminal for the detection of green walnut. As such, the improved model quantification was optimized for less complexity, but higher detection speed. A comparison was made on the performance of YOLOv3-MobileNet-v3 with the YOLOv3-DarkNet-53, YOLOv3-ResNet-50, and Faster RCNN-ResNet-50 target detection networks. The MobileNet-v3 network was achieved in the smallest size of 88.6M, the highest mAP of 94.52%, and the fastest detection speed of 31 frames/s, indicating the best performance on green-skinned walnut detection. Finally, the walnut detection maintained the higher accuracy and detection speed in the case of small models, whereas, the strong robustness in the case of small walnut targets. The finding can provide technical support and yield estimation for intelligent management in walnut orchards. The following ideas were also gained for the small fruit detection near background color. The COCO dataset was recommended to pre-train the model in the single species tasks of target detection. The detection accuracy depended mainly on the data augmentation on the dataset feature, and the Mixup data augmentation. It was necessary to adjust the concentration of data features for the less anchor at the multiple scales. Consequently, the MobileNet-v3 backbone can be presented an excellent performance for an active network during detection. Therefore, the prediction box was simply labelled on the data for the quantitative analysis of the prediction error during detection.
作者 郝建军 邴振凯 杨淑华 杨杰 孙磊 Hao Jianjun;Bing Zhenkai;Yang Shuhua;Yang Jie;Sun Lei(College of Mechanical and Electrical Engineering,Hebei Agricultural University,Baoding 071000,China;Technology Innovation Center of Intelligent Agricultural Equipment of Hebei Province,Baoding 071001,China;Huanghua Branch of Beijing Computing Center Co.,Ltd.,Cangzhou 061000,China)
出处 《农业工程学报》 EI CAS CSCD 北大核心 2022年第14期183-190,共8页 Transactions of the Chinese Society of Agricultural Engineering
基金 河北省重点研发计划项目(21327211D) 2022年河北省博士在读研究生创新能力培养资助项目(CXZZBS2022050)。
关键词 图像处理 目标检测 算法 青皮核桃 YOLOv3 image processing object detection algorithms green walnut YOLOv3
  • 相关文献

参考文献9

二级参考文献164

共引文献162

同被引文献57

引证文献4

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部