期刊文献+

深度学习图像标注与用户标注比较研究 被引量:2

Image Annotation Tags by Deep Learning and Real Users: A Comparative Study
原文传递
导出
摘要 【目的】利用用户对图像标注的标签提出用户标签框架,并通过用户标签框架总结深度学习自动标注图像的不足。【方法】统计分析从Flickr上下载的大约100万张图像数据集中的用户标签,抽取高频词进行用户标签框架匹配。将用户标签与Image Net数据库标签进行对比总结。对含有高频词的图像使用MXNet深度学习算法进行标注,分析标注结果。【结果】当前深度学习自动标注,在图像背景知识、总体描述以及人类感官描述等方面还存在缺陷。【局限】数据集的范围需要扩大,深度学习算法的种类需要增加。【结论】自动标注图像的发展,需要建立图像信息与背景知识、描述等的联系;并且深度学习未来发展还需要赋予计算机逻辑推理以及情境感知的能力。 [Objective] This paper proposes a user tagging framework and examines the limitations of tagging image with deep learning techniques, aiming to improve the performance of automatic annotation services. [Methods] We analyzed the user-added tags from one million images on flickr.com to extract the high frequency ones. Then, we mapped these tags with the proposed framework, and compared them with tags from the Image Net database. Finally, we analyzed images with high frequency tags with the deep learning algorithm-MXNet. [Results] The automatic image annotation techniques based on deep learning could not effectively understand the image's background knowledge, as well as the image's descriptions from the human perceptive. [Limitations] Our dataset needs to be expanded and analyzed with other deep learning algorithms. [Conclusions] The development of automatic image annotation, requires us to establish the association between image information, background knowledge, and description, as well as cultivate deductive reasoning and context-aware abilities.
作者 陆伟 罗梦奇 丁恒 李信 Lu Wei;Luo Mengqi;Ding Heng;Li Xin(School of Information Management, Wuhan University, Wuhan 430072, Chin;Information Retrieval and Knowledge Mining Laboratory, Wuhan University, Wuhan 430072, China)
出处 《数据分析与知识发现》 CSSCI CSCD 北大核心 2018年第5期1-10,共10页 Data Analysis and Knowledge Discovery
基金 国家自然科学基金面上项目"面向词汇功能的学术文本语义识别与知识图谱构建"(项目编号:71473183)的研究成果之一
关键词 图像标注 用户标签 自动标注 机器学习 深度学习 人工智能 Image Annotation User Tags Automatic Image Annotation Machine Learning Deep Learning Artificial Intelligence
  • 相关文献

参考文献1

二级参考文献24

  • 1周卫东,冯其波,匡萃方.图像描述方法的研究[J].应用光学,2005,26(3):27-31. 被引量:5
  • 2王晓光.“数字人文”的产生、发展与前沿[A]重庆,2009.
  • 3Vassilis Tzouvaras. Multimedia annotation interoperability framework[EB/OL].http://www.w3.org/2005/Incubator/mmsem/XGR-interoperability/,2013.
  • 4Jorgensen C. Attributes of images in describing tasks[J].{H}Information Processing and Management,1998,(2-3):161-174.
  • 5Cunningham S J,Bainbridge D,Masoodian M. How people describe their image information needs:A grounded theory analysis of visual arts queries[A].New York,USA:ACM Press,2004.47-66.
  • 6Shatford S L. Analyzing the subject of a picture:A theoretical approach[J].Cataloging & Classification Quarterly,1985,(3):39-62.
  • 7Jaims A,Jaims R,Chang S F. A conceptual framework for indexing visual information at multiple levels[A].San Jose,CA,2000.
  • 8Batley S. Visual information retrieval browsing strategies in pictorial databases[A].London,1988.373-381.
  • 9Hollink L,Schreiber A T,Wielinga B J. Classification of user image descriptions[J].{H}International Journal of Human-Computer Studies,2004,(5):601-626.
  • 10John P E. Automatic image content retrieval-Are we getting anywhere[A].De Mont fort University,Milton Keyncs:Aslib,1996.123-135.

共引文献50

同被引文献68

引证文献2

二级引证文献32

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部