期刊文献+

基于卷积神经网络的田间多簇猕猴桃图像识别方法 被引量:100

Image recognition method of multi-cluster kiwifruit in field based on convolutional neural networks
下载PDF
导出
摘要 为实现田间条件下快速、准确地识别多簇猕猴桃果实,该文根据猕猴桃的棚架式栽培模式,采用竖直向上获取果实图像的拍摄方式,提出一种基于Le Net卷积神经网络的深度学习模型进行多簇猕猴桃果实图像的识别方法。该文构建的卷积神经网络通过批量归一化方法,以Re LU为激活函数,Max-pooling为下采样方法,并采用Softmax回归分类器,对卷积神经网络结构进行优化。通过对100幅田间多簇猕猴桃图像的识别,试验结果表明:该识别方法对遮挡果实、重叠果实、相邻果实和独立果实的识别率分别为78.97%、83.11%、91.01%和94.78%。通过与5种现有算法进行对比试验,该文算法相对相同环境下的识别方法提高了5.73个百分点,且识别速度达到了0.27 s/个,识别速度较其他算法速度最快。证明了该文算法对田间猕猴桃图像具有较高的识别率和实时性,表明卷积神经网络在田间果实识别方面具有良好的应用前景。 China is the largest country for cultivating kiwifruit,and Shaanxi Province provides the largest production,which accounts for approximately 70% of the production in China and 33% of the global production.Harvesting kiwifruit in this region relies mainly on manual picking which is labor-intensive.Therefore,the introduction of robotic harvesting is highly desirable and suitable.The fast and effective recognition of kiwifruit in the field under natural scenes is one of the key technologies for robotic harvesting.Recently,the study on kiwifruit recognition has been limited to a single cluster and multi clusters in the field have seldom been considered.In this paper,according to growth characteristics of kiwifruit grown on sturdy support structures,an RGB(red,green,blue) camera was placed around 100 cm underneath the canopy so that kiwifruit clusters could be included in the images.We proposed a kiwifruit image recognition system based on the convolutional neural network(CNN),which has a good robustness avoiding the subjectivity and limitation of the features selection by artificial means.The CNN could be trained end to end,from raw pixels to ultimate categories,and we optimized the critical structure parameters and the training strategy.Ultimately,the network was made up of 1 input layer,3 convolutional layers,2 sub-sampling layers,1 full convolutional layer,and 1 output layer.The CNN architecture was optimized by using batch normalization(BN) method,which normalized the data distribution of the middle layer and the output data,accelerating the training convergence and reducing the training time.Therefore,the BN layers were added after the 1,3 and 5 th convolutional layer(Conv1,Conv3,and Conv5 layer) of the original Le Net network.The size of all convolutional kernels was 5×5,and that of all the sub-sampling layers was 2×2.The feature map numbers of Conv1,Conv3,and Conv5 were 6,16 and 120,respectively.After manual selection and normalizing,the RGB image of kiwifruit was transferred into a matrix with the size of 32×32 as the input of the network,stochastic gradient descent was used to train our models with mini-batch size of 100 examples,and momentum was set as 0.9.In addition,the CNN took advantages of the part connections,the weight sharing and Max pooling techniques to lower complexity and improve the training performance of the model simultaneously.The network used rectified linear units(Re LU) as activation function,which could greatly accelerate network convergence.The proposed model for training kiwifruit was represented as 32×32-6 C-2 S-16 C-2 S-120 C-2.Finally,100 images of kiwifruit in the field(including 5918 fruits) were used to test the model,and the results showed that the recognition ratios of occluded fruit,overlapped fruit,adjacent fruit and separated fruit were 78.97%,83.11%,91.01% and 94.78%,respectively.The overall recognition rate of the model reached 89.29%,and it only took 0.27 s in average to recognize a fruit.There was no overlap between the testing samples and the training samples,which indicated that the network had a high generalization performance,and the testing images were captured from 9 a.m.to 5 p.m.,which indicated the network had a good robustness to lightness variations.However,some fruits were wrongly detected and undetected,which included the fruits occluded by branches or leaves,overlapped to each other and the ones under extremely strong sunlight.Particularly,2 or more fruits overlapped were recognized as one fruit,which was the main reason to the success rate not very high.This phenomenon demands a further research.By comparing with the conventional methods,it suggested that the method proposed obtained a higher recognition rate and better speed,and especially it could simultaneously identify multi-cluster kiwifruit in the field,which provided significant support for multi-arm operation of harvesting robotic.It proves that the CNN has a great potential for recognition of fruits in the field.
出处 《农业工程学报》 EI CAS CSCD 北大核心 2018年第2期205-211,共7页 Transactions of the Chinese Society of Agricultural Engineering
基金 陕西省重点研发计划-般项目(2017NY-164) 陕西省科技统筹创新工程计划项目(2015KTCQ02-12) 国家自然科学基金资助项目(61175099) 西北农林科技大学国际合作种子基金(A213021505)
关键词 图像处理 图像识别 算法 深度学习 卷积神经网络 猕猴桃 image processing image recognition algorithms deep learning convolutional neural network kiwifruit
  • 相关文献

参考文献16

二级参考文献151

共引文献531

同被引文献994

引证文献100

二级引证文献1124

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部