基于稠密自编码器的无监督番茄植株图像深度估计模型被引量：7

Unsupervised deep estimation modeling for tomato plant image based on dense convolutional auto-encoder

下载PDF

导出

摘要深度信息获取是温室移动机器人实现自主作业的关键。该研究提出一种基于稠密卷积自编码器的无监督植株图像深度估计模型。针对因视角差异和遮挡而产生的像素消失问题,引入视差置信度预测,抑制图像重构损失产生的问题梯度,设计了基于可分卷积的稠密自编码器作为模型的深度神经网络。以深度估计误差、阈值精度等为判据,在番茄植株双目图像上开展训练和测试试验,结果表明,抑制问题梯度回传可显著提高深度估计精度,与问题梯度抑制前相比,估计深度的平均绝对误差和均方根误差分别降低了55.2%和33.0%,将网络预测的多尺度视差图接入编码器并将其上采样到输入图像尺寸后参与图像重构和损失计算的处理方式对提高预测精度是有效的,2种误差进一步降低了23.7%和27.5%;深度估计误差随空间点深度的减小而显著降低,当深度在9 m以内时,估计深度的平均绝对误差<14.1 cm,在3 m以内时,则<7 cm。与已有研究相比,该研究估计深度的平均相对误差和平均绝对误差分别降低了46.0%和26.0%。该研究可为温室移动机器人视觉系统设计提供参考。 Depth information acquisition is the key to mobile robots which realize autonomous operation in the greenhouse.This study proposed an unsupervised model that used binocular images for training and testing based on dense convolutional auto-encoder.This model enabled the neural network to perform plant image depth estimation and defined a loss function for the depth estimation with convolution feature comparison and regularization constraints.Aiming at the problem of pixel vanished due to the different perspective and occlusion,a disparity confidence prediction was introduced to suppress the problem gradient caused by the image reconstruction loss.In the meantime,a dense block was designed based on separable convolution and built a convolutional auto-encoder as the backbone network for the model.In the greenhouse of tomato planting,a large number of binocular images were collected when tomato planting was growing on an overcast,cloudy and sunny days.An unsupervised plant image depth estimation network was also designed with a Python application interface,which was implemented by adopting Microsoft Cognitive Tools(CNTK)v2.7,a deep learning computing framework.The experiments of training and testing which were used image feature similarity,depth estimation error,and threshold precision as the criteria were carried out,and the binocular images of tomato planting were also taken as examples,on Tesla K40c graphic device.The results showed that the auto-encoder based on the separable convolution dense block which was compared with the regular convolution could effectively reduce the number of network weight parameters.Compared with the other activations which included ReLU(Rectified Linear Unit),Param-ReLU,ELU(Exponential Linear Unit),and SELU(Scaled-ELU),the network model with Leaky-ReLU as the nonlinear transformation had the minimum depth error and the maximum threshold precision.Also,the results showed that the network structures had significant impacts on the accuracy of prediction disparity.The introduction of separable convolution dense block in the skip connection between the encoder and decoder of auto-encoder had a certain effect on improving the accuracy of depth estimation.Meanwhile,by making the depth estimation model predict the disparity confidence which was used to restrain the problem gradient backpropagation,the error of depth estimation was remarkably decreased,Mean Absolute Error(MAE),and Root Mean Square Error(RMSE)were reduced by 55.2%and 33.0%respectively.The accuracy of depth estimation was significantly improved by using these processing methods,such as image reconstruction,loss function calculation after up-sampling the disparity map to the input image scale and splicing the multi-scale disparity map predicted by the network to the feature map of its encoder,as well as sending the combination feature map to the next prediction module.The performance of depth estimation was improved by increasing the depth and width of the convolutional auto-encoder.The error of depth estimation decreased significantly with the reduction of the spatial point depth.When the spatial point depth was within 9 m,the MAE of the estimated depth was less than 14.1 cm.And when the depth was within 3 m,the MAE was less than 7 cm.The influence of illumination conditions on the accuracy of this study depth estimation model was not significant.The method in this study was robust to the change of the luminous environment.The highest test speed of this study model was 14.2 FPS(Frames Per Second),which was near real-time.Compared with the existing researches,the mean relative error,MAE,and Mean Range Error(MRE)of depth estimation in this study were reduced by 46.0%,26.0%,and 25.5%respectively.This research could provide a reference for the design of the vision system of greenhouse mobile robots.

作者周云成邓寒冰许童羽苗腾吴琼 Zhou Yuncheng;Deng Hanbing;Xu Tongyu;Miao Teng;Wu Qiong(College of Information and Electrical Engineering,Shenyang Agricultural University,Shenyang 110866,China)

机构地区沈阳农业大学信息与电气工程学院

出处《农业工程学报》 EI CAS CSCD 北大核心 2020年第11期182-192,共11页 Transactions of the Chinese Society of Agricultural Engineering

基金辽宁省自然科学基金(20180551102) 国家自然科学基金(31901399,31601218)。

关键词图像处理卷积神经网络算法深度估计无监督学习深度学习自编码器视差番茄 image processing convolution neural network algorithms depth estimation unsupervised learning deep learning auto-encoder disparity tomato

分类号 TP183 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献6

1翟志强,朱忠祥,杜岳峰,张硕,毛恩荣.基于Census变换的双目视觉作物行识别方法[J].农业工程学报,2016,32(11):205-213. 被引量：21
2梁喜凤,彭明,路杰,秦超.基于自适应无迹卡尔曼滤波的采摘机械手视觉伺服控制方法[J].农业工程学报,2019,35(19):230-237. 被引量：21
3姬长英,沈子尧,顾宝兴,田光兆,张杰.基于点云图的农业导航中障碍物检测方法[J].农业工程学报,2015,31(7):173-179. 被引量：16
4马驰,李光林,李晓东,黄小玉,宋杰,杨士航.丘陵山地柑橘果园多方位自动喷药装置研制[J].农业工程学报,2019,35(3):31-41. 被引量：20
5孙国祥,汪小旵,刘景娜,孙晔,丁永前,卢伟.基于相位相关的温室番茄植株多模态三维重建方法[J].农业工程学报,2019,35(18):134-142. 被引量：11
6肖珂,高冠东,马跃进.基于Kinect视频技术的葡萄园农药喷施路径规划算法[J].农业工程学报,2017,33(24):192-199. 被引量：11

二级参考文献93

1陈兵旗,何醇,马彦平,白由路.大田玉米长势的三维图像监测与建模[J].农业工程学报,2011,27(S1):366-372. 被引量：13
2何勇,赵春江,吴迪,聂鹏程,冯雷.作物-环境信息的快速获取技术与传感仪器[J].中国科学：信息科学,2010,40(S1):1-20. 被引量：19
3付为森,滕光辉.基于双目视觉技术的猪生长监测系统标定模式[J].农业机械学报,2009,40(S1):223-227. 被引量：5
4刘正东,高鹏,杨静宇.一种用于道路避障的双目视觉图像分割方法[J].计算机应用研究,2005,22(4):249-251. 被引量：7
5袁佐云,毛志怀,魏青.基于计算机视觉的作物行定位技术[J].中国农业大学学报,2005,10(3):69-72. 被引量：38
6周勇,陈超,叶庆泰.基于机器视觉的自动导引车辆障碍物检测[J].机械设计与研究,2005,21(5):74-76. 被引量：12
7赵杰,李牧,李戈,闫继宏.一种无标定视觉伺服控制技术的研究[J].控制与决策,2006,21(9):1015-1019. 被引量：8
8田有文,李天来,李成华,朴在林,孙国凯,王滨.基于支持向量机的葡萄病害图像识别方法[J].农业工程学报,2007,23(6):175-180. 被引量：84
9辛菁,刘丁,班建安.自适应卡尔曼滤波器在机器人控制中的应用[J].西安理工大学学报,2007,23(2):136-139. 被引量：8
10张磊,王书茂,陈兵旗,刘志刚.基于双目视觉的农田障碍物检测[J].中国农业大学学报,2007,12(4):70-74. 被引量：24

共引文献91

1张硕,刘禹,熊坤,翟志强,朱忠祥,杜岳峰.基于特征工程的大田作物行中心线识别方法[J].农业机械学报,2023,54(S01):18-26.
2周海燕,杨悦,刘阳春,马若飞,张峰硕,张启帆.基于激光雷达的作物收获导航线实时提取方法研究[J].农业机械学报,2023,54(S01):9-17.
3肖珂,郝毅,高冠东.果园自动变距精准施药系统设计与试验[J].农业机械学报,2022,53(10):137-145. 被引量：2
4王丹丹,石峰,翟亚芳,杜雪.基于UKF的苹果果实定位估计算法[J].昆明理工大学学报（自然科学版）,2020,45(4):50-56. 被引量：3
5张志斌,赵帅领,罗锡文,魏凤岐.基于SURF算法的绿色作物特征提取与图像匹配方法[J].农业工程学报,2015,31(14):172-178. 被引量：36
6翟志强,杜岳峰,朱忠祥,郎健,毛恩荣.基于Rank变换的农田场景三维重建方法[J].农业工程学报,2015,31(20):157-164. 被引量：10
7翟志强,朱忠祥,杜岳峰,张硕,毛恩荣.基于Census变换的双目视觉作物行识别方法[J].农业工程学报,2016,32(11):205-213. 被引量：21
8蒋郁,崔宏伟,区颖刚,马旭,齐龙,郑文汉.基于茎基部分区边缘拟合的稻株定位方法[J].农业机械学报,2017,48(6):23-31. 被引量：11
9何勇,蒋浩,方慧,王宇,刘羽飞.车辆智能障碍物检测方法及其农业应用研究进展[J].农业工程学报,2018,34(9):21-32. 被引量：51
10翟志强,朱忠祥,杜岳峰,李臻,毛恩荣.基于虚拟现实的拖拉机双目视觉导航试验[J].农业工程学报,2017,33(23):56-65. 被引量：17

同被引文献60

1袁挺,任永新,李伟,纪超,谭豫之.基于光照色彩稳定性分析的温室机器人导航信息获取[J].农业机械学报,2012,43(10):161-166. 被引量：13
2高国琴,李明.基于K-means算法的温室移动机器人导航路径识别[J].农业工程学报,2014,30(7):25-33. 被引量：105
3刘天亮,莫一鸣,徐高帮,戴修斌,朱秀昌,罗杰波.多线索非参数化融合的单目视频深度估计[J].东南大学学报（自然科学版）,2015,45(5):834-839. 被引量：1
4王丹丹,宋怀波,何东健.苹果采摘机器人视觉系统研究进展[J].农业工程学报,2017,33(10):59-69. 被引量：94
5居锦,刘继展,李男,李萍萍.基于侧向光电圆弧阵列的温室路沿检测与导航方法[J].农业工程学报,2017,33(18):180-187. 被引量：10
6何勇,蒋浩,方慧,王宇,刘羽飞.车辆智能障碍物检测方法及其农业应用研究进展[J].农业工程学报,2018,34(9):21-32. 被引量：51
7王礼,方陆明,陈珣,吴超.基于Lab颜色空间的花朵图像分割算法[J].浙江万里学院学报,2018,31(3):67-73. 被引量：9
8李良,张文爱,冯青春,王秀.温室轨道施药机器人系统设计[J].农机化研究,2016,38(1):109-112 118. 被引量：20
9杨卫国,祝铁军,王庆韧,杜胜磊.燃气轮机组压气机失速引发的不稳定振动分析[J].广东电力,2019,32(7):37-43. 被引量：9
10李阳,陈秀万,王媛,刘茂林.基于深度学习的单目图像深度估计的研究进展[J].激光与光电子学进展,2019,56(19):1-17. 被引量：23

引证文献7

1白明亮,张冬雪,刘金福,刘娇,于达仁.基于深度自编码器和支持向量数据描述的燃气轮机高温部件异常检测[J].发电技术,2021,42(4):422-430. 被引量：6
2周云成,许童羽,邓寒冰,苗腾,吴琼.基于自监督学习的温室移动机器人位姿跟踪[J].农业工程学报,2021,37(9):263-274. 被引量：11
3龙燕,高研,张广犇.基于改进HRNet的单幅图像苹果果树深度估计方法[J].农业工程学报,2022,38(23):122-129. 被引量：5
4白琳,刘林军,李轩昂,吴沙,刘汝庆.基于自监督学习的单目图像深度估计算法[J].吉林大学学报（工学版）,2023,53(4):1139-1145.
5赵露露,邓寒冰,周云成,苗腾,赵凯,杨景,张羽丰.基于自生成标签的玉米苗期图像实例分割[J].农业工程学报,2023,39(11):201-211. 被引量：1
6周云成,刘忠颖,邓寒冰,苗腾,王昌远.基于混合分组扩张卷积的玉米植株图像深度估计[J].华南农业大学学报,2024,45(2):280-292.
7蔡嘉诚,董方敏,孙水发,汤永恒.无监督单目深度估计研究综述[J].计算机科学,2024,51(2):117-134.

二级引证文献23

1李鹏.基于改进PSO-BP算法的机器人目标位姿识别方法[J].国外电子测量技术,2023,42(1):7-12. 被引量：5
2张慧波,王守相,赵倩宇,任杰,王海.考虑数据不均衡的居民用户负荷曲线分类方法[J].电力工程技术,2022,41(3):186-193. 被引量：6
3陈远浩,吴明晖.基于回波强度的AGV重定位方法研究[J].智能计算机与应用,2022,12(8):179-182.
4王政博,王红军,张翔,崔英杰,苏静雷.燃气轮机深度卷积生成对抗故障样本生成研究[J].电子测量与仪器学报,2022,36(6):82-90. 被引量：2
5纪永.考虑外部扰动的四轮移动机器人运动轨迹控制优化方法[J].机械与电子,2023,41(2):23-26.
6胡炼,彭靖怡,赖桑愉,冯达文,陈高隆,王晨阳,罗锡文.基于BDS和IMU的挖掘机铲斗位姿测量方法与试验[J].农业工程学报,2022,38(23):12-19. 被引量：1
7王一如,樊秋波,种法力.基于双目视觉技术的激光再制造机器人跟踪系统[J].激光杂志,2023,44(3):216-220.
8许未晴,冀守虎,安永伟,贾冠伟,曹鑫源,王佳,蔡茂林,吴素君.机器学习算法在重型燃气轮机健康监测的应用现状[J].液压与气动,2023,47(4):71-86. 被引量：3
9许伟明,李学敏,张祎,Maulidi Barasa,张培泽,易佑中.基于O-DAE和SVDD的汽轮机异常检测方法[J].浙江电力,2023,42(7):102-109.
10吴雄伟,周云成,刘峻渟,刘忠颖,王昌远.面向温室移动机器人的无监督视觉里程估计方法[J].农业工程学报,2023,39(10):163-174. 被引量：1

1王泽隆,徐向辉,张雷.基于仿真SAR图像深度迁移学习的自动目标识别[J].中国科学院大学学报（中英文）,2020,37(4):516-524. 被引量：8
2谢蓉.高中数学自主作业的优化研究[J].读天下（综合）,2020,0(20):0170-0170.
3赵昕.防控疫情之际智能农机迎机遇[J].农家致富,2020(7):4-5.
4李梅,郭飞,张立中,王波,张俊岭,李兆桐.基于TATLNet的输电场景威胁检测[J].工程科学学报,2020,42(4):509-515. 被引量：2
5徐欣,刘强,王少军.一种高度并行的卷积神经网络加速器设计方法[J].哈尔滨工业大学学报,2020,52(4):31-37. 被引量：6
6李绍堂.经肛全直肠系膜切除术的相关解剖及操作技巧[J].结直肠肛门外科,2020,26(3):354-357. 被引量：4
7曲海成,田小容,刘腊梅,石翠萍.多尺度显著区域检测图像压缩[J].中国图象图形学报,2020,0(1):31-42. 被引量：9
8陶志勇,胡亚磊,林森.基于改进AlexNet的手指静脉识别[J].激光与光电子学进展,2020,57(8):50-58. 被引量：11
9席珺珺.听觉与触觉感知在城市视觉系统设计中的探索[J].西部皮革,2020,42(13):39-40. 被引量：1
10陈婷婷,王晨,程金兰,朱文远,陈务平.相同集聚因子时纤维悬浮液的流动特征及机理[J].林业工程学报,2020,5(4):121-126. 被引量：2

农业工程学报

2020年第11期

浏览历史

内容加载中请稍等...

基于稠密自编码器的无监督番茄植株图像深度估计模型被引量：7

参考文献6

二级参考文献93

共引文献91

同被引文献60

引证文献7

二级引证文献23

相关作者

相关机构

相关主题

浏览历史

基于稠密自编码器的无监督番茄植株图像深度估计模型 被引量：7

参考文献6

二级参考文献93

共引文献91

同被引文献60

引证文献7

二级引证文献23

相关作者

相关机构

相关主题

浏览历史

基于稠密自编码器的无监督番茄植株图像深度估计模型被引量：7