摘要
针对现有双目场景流计算方法在大位移、运动遮挡及光照变化等复杂场景下场景流估计的准确性与鲁棒性问题,提出一种基于金字塔块匹配的双目场景流计算方法.首先对双目图像序列进行超像素分割和视差估计,得到图像初始分割结果和视差信息,然后建立基于金字塔块匹配的运动模型并采用Ransac随机一致性算法拟合刚性运动模型和最小化重投影算法估计对象运动参数.最后,本文将金字塔块匹配结果作为约束项,联合对象运动参数和超像素平面参数构建基于金字塔块匹配的双目场景流估计能量函数模型,通过最小化能量函数得到最终场景流.实验分别采用KITTI2015(Karlsruhe Institute of Technology and Toyota technological Institute 2015)和MPI-Sintel(Max-Planck Institute and Sintel)数据集测试图像对本文方法和具有代表性场景流算法进行综合对比分析,结果表明本文方法相对于其他对比方法有效提高大位移、运动遮挡以及光照变化情况下场景流估计精度和鲁棒性.
Aiming at the accuracy and robustness of existing binocular scene flow calculation methods in complex scenes such as large displacement,motion occlusion and illumination changes,this paper proposes a binocular scene flow estimation method based on pyramid block matching.Firstly,we apply the superpixel segmentation and disparity estimation to the binocular image sequence to obtain the initial image segmentation results and disparity information.Secondly,we es⁃tablish a motion model based on pyramid block matching.Then we fit the rigid motion model by using Ransac stochastic consensus algorithm and estimate the object motion parameters by minimizing the reprojection algorithm.Finally,this pa⁃per takes the matching result of the pyramid block as a constraint item,then we construct a binocular scene flow estimation energy function model based on the pyramid block matching by combines the object motion parameters and the superpixel plane parameters,and obtains the final scene flow by minimizing the energy function.The image sequences provided by the KITTI2015(Karlsruhe Institute of Technology and Toyota Technological Institute 2015)and MPI-Sintel(Max-Planck Insti⁃tute and Sintel)databases were adopted to compare and analyze the proposed method in this paper and the existing represen⁃tative scene flow method.The experimental results show that compared with other comparison methods,the method in this paper has high accuracy and robustness of scene flow estimatin,especially in large displacement,motion occlusion and lighting changes.
作者
陈震
倪晶晶
张聪炫
葛利跃
王梓歌
CHEN Zhen;NI Jing-jing;ZHANG Cong-xuan;GE Li-yue;WANG Zi-ge(Key Laboratory of Nondestructive Testing,Ministry of Education,Nanchang Hangkong University,Nanchang,Jiangxi 330063,China;School of Information Engineering,Nanchang Hangkong University,Nanchang,Jiangxi 330063,China)
出处
《电子学报》
EI
CAS
CSCD
北大核心
2022年第9期2164-2171,共8页
Acta Electronica Sinica
基金
国家重点研发计划(No.2020YFC2003800)
国家自然科学基金(No.61866026,No.61866025)
江西省优势科技创新团队计划(No.20165BCB19007)
江西省技术创新引导类计划项目(No.20212AEI91005)
江西省自然科学基金重点项目(No.20202ACB214007)
航空科学基金(No.2018ZC56008)
中国博士后科学基金(No.2019M650894)
江西省教育厅科学技术研究项目(No.GJJ210910)
江西省图像处理与模式识别重点实验室开放基金资助(No.ET202104413)。
关键词
双目场景流
金字塔块匹配
运动模型
大位移运动
超像素分割
binocular scene flow
pyramid block matching
motion model
large displacement motion
superpixel segmentation