期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
A deep dense captioning framework with joint localization and contextual reasoning
1
作者 KONG Rui XIE Wei 《Journal of Central South University》 SCIE EI CAS CSCD 2021年第9期2801-2813,共13页
Dense captioning aims to simultaneously localize and describe regions-of-interest(RoIs)in images in natural language.Specifically,we identify three key problems:1)dense and highly overlapping RoIs,making accurate loca... Dense captioning aims to simultaneously localize and describe regions-of-interest(RoIs)in images in natural language.Specifically,we identify three key problems:1)dense and highly overlapping RoIs,making accurate localization of each target region challenging;2)some visually ambiguous target regions which are hard to recognize each of them just by appearance;3)an extremely deep image representation which is of central importance for visual recognition.To tackle these three challenges,we propose a novel end-to-end dense captioning framework consisting of a joint localization module,a contextual reasoning module and a deep convolutional neural network(CNN).We also evaluate five deep CNN structures to explore the benefits of each.Extensive experiments on visual genome(VG)dataset demonstrate the effectiveness of our approach,which compares favorably with the state-of-the-art methods. 展开更多
关键词 dense captioning joint localization contextual reasoning deep convolutional neural network
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部