期刊文献+

卷积神经网络在跨媒体检索中的应用研究 被引量:3

Application of convolutional neural network in cross media retrieval
下载PDF
导出
摘要 为了适应人们多样化的检索需求,国内外研究人员提出各种"浅层"学习方法模型来探索跨媒体数据间潜在的关联关系,但这些方法主要从手工构建的底层特征出发,并不能充分有效的学习到不同媒体间的关联。与手工构建底层特征不同,深度学习通过无监督逐层预训练与有监督的微调,从而实现区分性更强的特征描述。利用深度学习在特征学习方面的优越性,提出了一种基于深层卷积神经网络VGGNet与LDA模型相结合的跨媒体数据检索方法,该方法利用预训练的VGGNet模型提取图像视觉特征,同时使用LDA模型获取文本的主题概率分布,有效的缩减了不同模态数据间的异构鸿沟与语义鸿沟,从而更有效的实现文本与图像之间的跨媒体检索,实验证明了该方法的先进性和有效性。 In order to meet the diversified retrieval needs of people,researchers at home and abroad put forward various"shallow"learning method models to explore the potential correlation between cross-media data.However,these methods mainly start from the bottom features of manual construction,which can not fully and effectively learn the association between different media.Different from the underlying features of manual construction,deep learning is accomplished through unsupervised layer-by-layer pre-training and supervised fine-tuning,to achieve a stronger discriminative feature description.Takes advantage of the superiority of deep learning in feature learning,we proposes a cross-media data retrieval method based on deep convolutional neural network VGGNet and LDA model.The method uses the pretrained VGGNet model to extract visual features of the images and uses the LDA model to obtain the thematic probability distribution of the texts,which effectively reduces the heterogeneous and semantic gap between different modal data.Thus,the cross-media retrieval between text and image can be realized more effectively.In this paper,experiments are conducted to demonstrate the progressiveness and effectiveness of the method which I proposed.
作者 金汉均 段贝贝 Jin Hanjun;Duan Beibei(School of Computer, Central China Normal University, Wuhan 430079, Chin)
出处 《电子测量技术》 2018年第7期54-57,共4页 Electronic Measurement Technology
基金 教育部人文社科规划基金(17YJA870010)项目资助
关键词 跨媒体检索 深度学习 卷积神经网络 VGGNet cross media retrieval deep learning convolutional neural network VGGNet
  • 相关文献

参考文献6

二级参考文献107

  • 1王燕.一种改进的K-means聚类算法[J].计算机应用与软件,2004,21(10):122-123. 被引量:9
  • 2袁薇,高淼.综合语义与颜色特征的图像检索技术研究[J].微电子学与计算机,2006,23(1):36-39. 被引量:4
  • 3宋丹,王卫东,陈英.基于改进向量空间模型的话题识别与跟踪[J].计算机技术与发展,2006,16(9):62-64. 被引量:23
  • 4廖玲,文敦伟.基于改进向量空间模型的邮件分类[J].计算机与数字工程,2007,35(4):190-193. 被引量:3
  • 5Zhuang Yueting, Yang Yi, Wu Fei. Mining semantic correlation of heterogeneous multimedia data for cross - media retrieval[J]. IEEE Transactions on Multimedia, 2008,10 (2) : 221 - 229.
  • 6Ahmet Ekin, Murat Tekalp A, Rajiv Mehrotra. Integrated semantic - syntactic video modeling for search and browsing[J]. IEEE Transactions on Multimedia, 2004,6 (6) : 839 - 851.
  • 7Umapathy K, Krishnan S, Jimaa S. Multigroup classification of audio signals using time- frequency parameters[J ]. IEEE Transactions on Multimedia, 2005, 7 ( 2 ) : 308 - 315.
  • 8Salton G,Wong A, Yang C S. A Vector Space Model for Auto- matic lndexing[J]. Communications of the ACM, 1975,18: 613- 620.
  • 9Blei D, Ng A, Jordan M. Latent dirichlet allocation[J]. Journal of Machine Learning Research, 2003,3 : 993.
  • 10石晶,范猛,李万龙.基于LDA模型的主题分析[J].自动化报,2009,36:1586-1593.

共引文献143

同被引文献26

引证文献3

二级引证文献26

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部