
Cross-modal retrieval algorithm based on multi-level semantic discriminative guided hashing (基于多级语义的判别式跨模态哈希检索算法)

Cited by: 4
Abstract: Most cross-modal hashing methods represent the degree of correlation with a binary matrix, so they cannot capture the deeper semantic information in multi-label data, and they neglect to preserve both the semantic structure and the discriminability of the data features. To address these problems, a cross-modal hashing retrieval algorithm named ML-SDH (Multi-Level Semantics Discriminative guided Hashing) is proposed. The algorithm uses a multi-level semantic similarity matrix to discover deep correlations in cross-modal data, and uses equal guidance of the cross-modal hash representations to relate semantic structure and discriminative classification. As a result, multi-label data carrying high-level semantic information can be encoded, and the constructed structure, which preserves multi-level semantics, ensures that the final learned hash codes are discriminative while still preserving semantic similarity. On the NUS-WIDE dataset, with a hash code length of 32 bits, the mean Average Precision (mAP) of the proposed algorithm exceeds that of DCMH (Deep Cross-Modal Hashing), PRDH (Pairwise Relationship guided Deep Hashing) and EGDH (Equally-Guided Discriminative Hashing) by 19.48, 14.50 and 1.95 percentage points respectively on one retrieval task, and by 16.32, 11.82 and 2.08 percentage points respectively on the other.
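The contrast between a binary correlation matrix and a multi-level one can be illustrated with a small sketch. The abstract does not give the ML-SDH definition of multi-level similarity, so the cosine similarity of multi-hot label vectors used here is an assumption chosen only for illustration, and the function names and toy labels are hypothetical.

```python
import math


def binary_similarity(labels_a, labels_b):
    """Return 1 if two multi-hot label vectors share any label, else 0."""
    return 1 if any(x and y for x, y in zip(labels_a, labels_b)) else 0


def graded_similarity(labels_a, labels_b):
    """Cosine similarity of two multi-hot label vectors, in [0, 1].

    ASSUMPTION: a stand-in for a multi-level similarity measure;
    the paper's actual formula is not stated in the abstract.
    """
    dot = sum(x * y for x, y in zip(labels_a, labels_b))
    norm_a = math.sqrt(sum(x * x for x in labels_a))
    norm_b = math.sqrt(sum(y * y for y in labels_b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an unlabeled sample is treated as similar to nothing
    return dot / (norm_a * norm_b)


def similarity_matrix(image_labels, text_labels, sim):
    """Build the image-by-text similarity matrix under a given measure."""
    return [[sim(a, b) for b in text_labels] for a in image_labels]


# Toy multi-hot labels over four hypothetical classes.
image_labels = [[1, 1, 0, 0], [1, 0, 0, 0]]
text_labels = [[1, 1, 0, 0], [1, 0, 1, 0], [0, 0, 0, 1]]

binary = similarity_matrix(image_labels, text_labels, binary_similarity)
graded = similarity_matrix(image_labels, text_labels, graded_similarity)

for row_b, row_g in zip(binary, graded):
    print(row_b, [round(v, 3) for v in row_g])
```

In this toy data the binary matrix gives the first image the same score against the first two texts, because each pair shares at least one label. The graded matrix separates them: the text with identical labels scores close to 1, and the text sharing one of two labels scores about 0.5. That difference is the kind of information a binary matrix discards.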
Authors: LIU Fangming (刘芳名); ZHANG Hong (张鸿) (School of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan, Hubei 430065, China; Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System (Wuhan University of Science and Technology), Wuhan, Hubei 430065, China)
Source: Journal of Computer Applications (《计算机应用》); indexed in CSCD and the Peking University Core Journals list; 2021, Issue 8, pp. 2187-2192 (6 pages)
Funding: National Natural Science Foundation of China (61373109).
Keywords: multi-level semantics; semantic structure; discriminative hashing; semantic guidance; cross-modal retrieval
