期刊文献+
共找到1篇文章
< 1 >
每页显示 20 50 100
Collaborative Learning Method for Natural Image Captioning
1
作者 rongzhao wang Libo Liu 《国际计算机前沿大会会议论文集》 2022年第1期249-261,共13页
We propose a collaborative learning method to solve the natural image captioning problem.Numerous existing methods use pretrained image classification CNNs to obtain feature representations for image caption generatio... We propose a collaborative learning method to solve the natural image captioning problem.Numerous existing methods use pretrained image classification CNNs to obtain feature representations for image caption generation,which ignores the gap in image feature representations between different computer vision tasks.To address this problem,our method aims to utilize the similarity between image caption and pix-to-pix inverting tasks to ease the feature representation gap.Specifically,our framework consists of two modules:1)The pix2pix module(P2PM),which has a share learning feature extractor to extract feature representations and a U-net architecture to encode the image to latent code and then decodes them to the original image.2)The natural language generation module(NLGM)generates descriptions from feature representations extracted by P2PM.Consequently,the feature representations and generated image captions are improved during the collaborative learning process.The experimental results on the MSCOCO 2017 dataset prove the effectiveness of our approach compared to other comparison methods. 展开更多
关键词 Image captioning Pix2pix inverting Collaborative learning
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部