期刊文献+

一种改进的全局注意机制图像描述方法 被引量:6

Improved method for image caption with global attention mechanism
下载PDF
导出
摘要 针对现有基于注意机制的图像描述方法全局信息缺失问题,提出了一种改进的全局注意机制图像描述方法。该方法在注意机制的基础上,通过设计全局特征网络来模拟人类感知机制的全过程,对图像全局特征进行增强。将所提方法在相同数据集和网络超参数的情况下与目前最优网络进行实验对比,分析了全局信息对生成文本的影响。实验结果显示,文中提出的方法在更具挑战性的中文文本描述任务上客观评价指标优于目前最优的模型。同时,在主观评价中能够生成更准确的文本内容,也更具丰富性与多样性,接近自然语言描述。 Aiming at the lack of global information in existing attention based image caption methods,we propose an improved image caption method with global attention mechanism.The proposed method mimics the entire human perception process via designing aglobal feature extraction network to enhance the global context based on visual attention mechanism.This paper compares the proposed method with the existing attention based image caption technique under the same dataset and hyper parameters,and analyzes the influence of introducing the global feature.The results show that our method outperforms the existing technique in objective evaluations with the challenging Chinese caption dataset.In the subjective evaluation,in the meanwhile,the captions generated by the proposed method describes the image more accurately,vividly and diversely,and they are more close to the natural language.
作者 马书磊 张国宾 焦阳 石光明 MA Shulei;ZHANG Guobin;JIAO Yang;SHI Guangming(School of Artificial Intelligence, Xidian Univ., Xi'an 710071, China;The 27 th Research Institute of China Electronic Technology Group Corporation, Zhengzhou 450047, China)
出处 《西安电子科技大学学报》 EI CAS CSCD 北大核心 2019年第2期17-22,共6页 Journal of Xidian University
基金 国家自然科学基金(61875157 61301288)
关键词 图像描述 注意力机制 全局特征 卷积神经网络 循环神经网络 image caption attention mechanism global feature convolutional neural network recurrent neural network
  • 相关文献

参考文献2

二级参考文献20

  • 1Hajizadeh A, Farhadpour Z. An Algorithm for 3D Pore Space Reconstruction from a 2D Image Using Sequential Simulation and Gradual Deformation with the Probability Perturbation Sampler[J], Transport in Porous Media, 2012, 94 (3): 859-881.
  • 2Kumar T S, Vijai A. 3D Reconstruction of Face from 2D CT Scan Images [C]//International Conference on Communication Technology and System Design. Oxford: Elceiver Ltd, 2012: 970-977.
  • 3Ciechomski P D, Constantinescu M, Garcia J, et al. Development and Implementation of a Web-Enabled 3D Consultation Tool for Breast Augmentation Surgery Based on 3D-Image Reconstruction of 2D Pictures[J]. Journal of Medical Internet Research, 2012, 14(1): 21.
  • 4Pilu M. A Direct Method for Stereo Correspondence Based on Singular Value Decomposition C] //Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Los Alamitos: IEEE, 1997: 261-266.
  • 5Salah A A, Alpaydin E, Akarun L. A Selective Attention-based Method for Visual Pattern Recognition with Application to Handwritten Digit Recognition and Face Reeognition [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(3): 420-425.
  • 6魏立梅,张永瑞,谢维信,程相君.图像处理中边缘漏检的自动补偿[J].西安电子科技大学学报,1997,24(4):563-566. 被引量:2
  • 7洪明坚,吕建斌,杨丹,毛有力.一种新的基于互信息的图像配准方法[J].重庆大学学报(自然科学版),2009,32(6):697-700. 被引量:6
  • 8康皓,王明倩,王莹莹.图像三维重建技术[J].企业技术开发,2009,28(6):42-43. 被引量:2
  • 9高宏伟,于洋,刘晓阳.双目立体视觉三维重建实验平台研究[J].计算机工程与应用,2009,45(33):149-152. 被引量:5
  • 10吴月娥,边后琴.尺度与特征引导视觉选择性注意机制模型[J].现代电子技术,2009,32(22):84-87. 被引量:2

共引文献19

同被引文献27

引证文献6

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部