Semantic-Consistent and Multilayer Similarity Based Cross-Modal Hashing Retrieval


Abstract: [Objective] This paper proposes a retrieval method that learns rich semantic representations from associated labels so that the hash codes retain more discriminative information. It also accounts for cross-modal semantic similarity, preserving the correlation between modalities in order to better bridge the modality gap. [Methods] Under multi-label association constraints, we mined the common semantic information shared by different modalities together with the hidden class-level semantic structure. We then adopted an asymmetric learning framework based on a joint similarity measure over high-level and low-level semantics, and quantized the result to obtain more discriminative hash codes. [Results] We ran experiments on three multi-modal benchmark datasets (MIRFlickr-25K, IAPR TC-12, and NUS-WIDE), comparing the proposed method with seven other methods. Across five code lengths, the average MAP of the proposed method exceeded the best baseline result by 2.1%, 5.8%, and 2.1% on the three datasets, respectively. [Limitations] The proposed method is better suited to multi-label datasets; its mining of semantic relevance in single-label data remains limited. [Conclusions] The proposed method keeps the sample-level and class-level semantic structures consistent, fully exploits the intrinsic features of each modality, and effectively improves retrieval performance.
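The results above are reported as MAP (mean average precision). As a rough illustration of how that metric is commonly computed for hashing retrieval, the sketch below ranks database items by Hamming distance to a query code and averages precision over the relevant hits. It is a minimal sketch under stated assumptions, not the authors' evaluation code: the function names, the toy codes and labels, and the relevance rule (two items are relevant if they share at least one label, a common convention for multi-label benchmarks) are all illustrative and do not come from the paper. In a cross-modal setting, the query codes would come from one modality (for example text) and the database codes from the other (for example images).

```python
def hamming_distance(a, b):
    """Number of positions at which two equal-length binary codes differ."""
    return sum(x != y for x, y in zip(a, b))


def average_precision(query_code, query_labels, db_codes, db_labels):
    """Average precision for one query under Hamming ranking.

    Assumption: a database item is relevant if it shares at least one
    label with the query (labels are Python sets).
    """
    # Rank database items by increasing Hamming distance; sorted() is
    # stable, so ties keep their original database order.
    order = sorted(
        range(len(db_codes)),
        key=lambda i: hamming_distance(query_code, db_codes[i]),
    )
    hits = 0
    precisions = []
    for rank, i in enumerate(order, start=1):
        if query_labels & db_labels[i]:
            hits += 1
            precisions.append(hits / rank)  # precision at this relevant hit
    return sum(precisions) / len(precisions) if precisions else 0.0


def mean_average_precision(queries, database):
    """Mean of per-query average precision.

    queries and database are lists of (code, label_set) pairs.
    """
    db_codes = [code for code, _ in database]
    db_labels = [labels for _, labels in database]
    scores = [
        average_precision(code, labels, db_codes, db_labels)
        for code, labels in queries
    ]
    return sum(scores) / len(scores) if scores else 0.0


# Toy example with hypothetical 4-bit codes and labels.
toy_database = [
    ((1, 1, 1, 1), {"sky"}),
    ((1, 1, 1, 0), {"dog"}),
    ((0, 0, 1, 1), {"sky", "dog"}),
    ((0, 0, 0, 0), {"dog"}),
]
toy_queries = [((1, 1, 1, 1), {"sky"})]
toy_map = mean_average_precision(toy_queries, toy_database)
```

In the toy example the relevant items land at ranks 1 and 3, so the average precision is (1/1 + 2/3) / 2 = 5/6. Averaging such MAP values over several code lengths gives the kind of summary figure quoted in the abstract.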

Authors: Liu Yuanyuan; Wang Xiaoyan; Zhang Yuxin; Zhu Lu (School of Information Engineering, East China Jiaotong University, Nanchang 330013, China)

Affiliation: School of Information Engineering, East China Jiaotong University

Source: Data Analysis and Knowledge Discovery (indexed in EI, CSCD, and the Peking University Core Journals list), 2024, No. 7, pp. 89-102 (14 pages)

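Funding: This work is one of the research outcomes of the Jiangxi Provincial Education Science "14th Five-Year Plan" Project (Grant No. 22YB067).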

Keywords: Cross-Modal Retrieval; Supervised Learning; Multi-Level Similarity; Hashing

Classification: TP393 [Automation and Computer Technology: Computer Application Technology]

