双目立体视觉室内场景描述模型

Indoor Scene Description Model Based on Binocular Stereo Vision

下载PDF

导出

摘要为了解决室内场景中物体的准确检测与描述问题,本文设计了双目立体视觉的室内场景描述模型。系统首先使用双目摄像头模组捕捉具有视差的左右图像信息,并借助SGBM算法获得包含深度信息匹配点的视差图,然后运用基于图的密集视觉描述模型输出场景的描述内容,最后使用开源语音模块进行播报。在ScanRefer数据集的测试结果表明,模型在描述3D物体方面表现出色,在CIDEr@0.5IoU评价指标上达到了40.69%。 In order to solve the problem of accurate detection and description of objects in indoor scenes,this paper designs a binocular stereo vision indoor scene description model.The system first uses a binocular camera module to capture left and right image information with disparity,and uses the SGBM algorithm to obtain a disparity map containing depth information matching points.Then,a graph based dense visual description model is used to output the scene description content.Finally,an open-source voice module is used for broadcasting.The test results on the ScanReferr dataset show that the model performs well in describing 3D objects,with a CIDEr@0.5IoU The evaluation index reached 40.69%.

作者黄启航程昊阳王然 HUANG Qihang;CHENG Haoyang;WANG Ran(School of Computer Science,Hangzhou Dianzi University,Hangzhou,China,310018)

机构地区杭州电子科技大学计算机学院

出处《福建电脑》 2024年第7期23-28,共6页 Journal of Fujian Computer

基金浙江省大学生科技创新活动计划(新苗计划)(No.GK230701205028)资助。

关键词室内场景双目立体视觉描述模型 Indoor Scenes Binocular Stereo Vision Described Model

分类号 TP181 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

1曾雅容,戴智慧(指导).荠[J].湖北教育,2024(13):32-32.
2刘明明,刘浩,王栋,张海燕.基于全局与序列变分自编码的图像描述生成[J].计算机应用研究,2024,41(7):2215-2220.
3中国医师协会胰腺病学专业委员会,中华医学会放射学分会,国家消化系统疾病临床医学研究中心(上海),上海市医学会放射诊断质控中心,边云,王铁功,李兆申,陆建平,邵成伟,刘士远,陈敏,李汛,陈克敏.中国急性胰腺炎影像学诊断报告规范循证学指南[J].中华胰腺病杂志,2024,24(3):173-185.
4崔衡,张海涛,杨剑,杜宝昌.基于改进Transformer的多尺度图像描述生成[J].软件导刊,2024,23(7):160-166.
5中国医师协会胰腺病学专业委员会,中华医学会放射学分会,国家消化系统疾病临床医学研究中心(上海),上海市医学会放射诊断质控中心,边云,刘芳,李兆申,陆建平,邵成伟,刘士远,陈敏,李汛.中国慢性胰腺炎影像学诊断报告规范循证学指南[J].中华胰腺病杂志,2024,24(3):161-172.
6李相旭,胡超,吴国平.3D打印技术在颅颌面外科修复重建的应用进展[J].中华医学美学美容杂志,2024,30(3):244-246.
7Emran Al-Buraihy,Dan Wang.Enhancing Cross-Lingual Image Description: A Multimodal Approach for Semantic Relevance and Stylistic Alignment[J].Computers, Materials & Continua,2024,79(6):3913-3938.

福建电脑

2024年第7期

浏览历史

内容加载中请稍等...

双目立体视觉室内场景描述模型

相关作者

相关机构

相关主题

浏览历史