基于松弛Hadamard矩阵的多模态融合哈希方法被引量：1

Multimodal Fusion Hash Learning Method Based on Relaxed Hadamard Matrix

下载PDF

导出

摘要哈希作为一种有效的数据表征技术,已经在应对爆炸式增长的多媒体数据中扮演了重要的角色.它由于低存储和高效率的优势,在多媒体检索领域受到了越来越多的关注.目前多模态哈希学习方法在多媒体检索任务中得到了较好的研究和发展.然而,多数的方法通过编码特征的内积重构成对相似度来保持原始数据的结构信息,但是带来较复杂的优化问题.此外一些模型缺乏判别性使得检索性能的提升受到限制.为了克服上述问题,本文提出一种新型的多模态融合哈希方法,在类别信息的监督下利用Hadamard矩阵为数据生成目标编码,通过松弛严格的二值约束增大类间的间隔,同时采用图嵌入的方式促进类内的紧凑性.本文提出的方法既保证了模型具有很好的判别能力也简化了优化过程.在3个公开数据集上的实验结果表明,本文提出的方法在多媒体数据检索中是非常有效的,平均性能上相比最优的对比方法提高了8.47%. Hashing,as an effective data representation technology,has played an important role in dealing with the explosive growth of multimedia data.Due to the advantages of its low storage and high efficiency,it has received more and more attention in the field of multimedia retrieval.At present,multi-modal hashing methods have been well researched and developed in multimedia retrieval tasks.However,most of these methods usually use the inner product of hashing features to reconstruct larger pairwise similarity,aiming to preserve the structural information of the original data,which will bring more complex optimization problems.Besides,some models lack discriminant ability,which leads to limitations in the improvement of retrieval performance.In order to overcome the above-mentioned problems,this paper proposes a new multimodal fusion hashing method.Under the supervision of category information,Hadamard matrix is used to generate target codes for data,and the margin between categories is increased by relaxing strict binary constraints.At the same time,the graph embedding approach is used to promote compactness within the class.The proposed method in this paper not only ensures the strong discriminative ability of the model,but also simplifies the optimization process.The experimental results on three public datasets show that the method proposed in this paper is very effective in multimedia data retrieval,and the average performance is 8.47%higher than that of the optimal comparison method.

作者庾骏黄伟张晓波尹贺峰 YU Jun;HUANG Wei;ZHANG Xiao-bo;YIN He-feng(The College of Computer and Communication Engineering,Zhengzhou University of Light Industry,zhengzhou,Henan 450000,China;The School of Artificial Intelligence and Computer Science,Jiangnan University,Wuxi,Jiangsu 214000,China)

机构地区郑州轻工业大学计算机与通信工程学院江南大学计算机与人工智能学院

出处《电子学报》 EI CAS CSCD 北大核心 2022年第4期909-920,共12页 Acta Electronica Sinica

基金河南省科技攻关计划项目(No.222102210064) 郑州轻工业大学博士科研启动基金(No.2021BSJJ025) 国家自然科学基金(No.61902361)。

关键词哈希学习多模态融合 HADAMARD矩阵多媒体检索哈希中心 hash learning multimodal fusion Hadamard matrix multimedia retrieval hash centers

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献7

1李志欣,凌锋,张灿龙,马慧芳.融合两级相似度的跨媒体图像文本检索[J].电子学报,2021,49(2):268-274. 被引量：11
2李武军,周志华.大数据哈希学习:现状与趋势[J].科学通报,2015,60(5):485-490. 被引量：45
3高文.高文：“存得下查得快”拥抱多媒体大数据时代[J].创新科技,2013(6):7-7. 被引量：2
4刘昊淼,王瑞平,山世光,陈熙霖.基于离散优化的哈希编码学习方法[J].计算机学报,2019,42(5):1149-1160. 被引量：6
5姚涛,孔祥维,付海燕,TIAN Qi.基于映射字典学习的跨模态哈希检索[J].自动化学报,2018,44(8):1475-1485. 被引量：4
6刘昊鑫,吴小俊,庾骏.联合哈希特征和分类器学习的跨模态检索算法[J].模式识别与人工智能,2020,33(2):160-165. 被引量：2
7王锦荟,金露,李泽超,唐金辉.基于知识蒸馏的跨模态哈希[J].中国科学：技术科学,2022,52(5):713-726. 被引量：2

二级参考文献66

1Mayer-Sch?nberger V, Cukier K. Big Data: A Revolution That Will Transform How We Live, Work, and Think. Boston: Eamon Dolan/Houghton Mifflin Harcourt, 2013.
2Hey T, Tansley S, Tolle K. The Fourth Paradigm: Data-Intensive Scientific Discovery. Redmond: Microsoft Research, 2009.
3Bryant R E. Data-intensive scalable computing for scientific applications. Comput Sci Engin, 2011, 13: 25-33.
4周志华. 机器学习与数据挖掘. 中国计算机学会通讯, 2007, 3: 35-44.
5Zhou Z H, Chawla N V, Jin Y, et al. Big data opportunities and challenges: Discussions from data analytics perspectives. IEEE Comput Intell Mag, 2014, 9: 62-74.
6Jordan M. Message from the president: The era of big data. ISBA Bull, 2011, 18: 1-3.
7Kleiner A, Talwalkar A, Sarkar P, et al. The big data bootstrap. In: Proceedings of the 29th International Conference on Machine Learning (ICML), Edinburgh, 2012, 1759-1766.
8Shalev-Shwartz S, Zhang T. Accelerated proximal stochastic dual coordinate ascent for regularized loss minimization. In: Proceedings of the 31st International Conference on Machine Learning (ICML), Beijing, 2014, 64-72.
9Gonzalez J E, Low Y, Gu H, et al. PowerGraph: Distributed graph-parallel computation on natural graphs. In: Proceedings of the 10th USENIX Symposium on Operating Systems Design and Implementation (OSDI), Hollywood, 2012, 17-30.
10Gao W, Jin R, Zhu S, et al. One-pass AUC optimization. In: Proceedings of the 30th International Conference on Machine Learning (ICML), Atlanta, 2013, 906-914.

共引文献63

1韩会珍,刘立波.基于注意力和视觉语义推理的枸杞虫害检索[J].计算机科学,2022,49(S02):431-436.
2谭喆.多模态数据哈希检索方法综述[J].信息通信,2016,29(3):179-180.
3聂秀山,王舒婷,尹义龙.基于特征融合和曼哈顿量化的视频哈希学习方法[J].南京大学学报（自然科学版）,2016,52(4):705-713.
4刘宁,赵建华,冯骜骜.基于主动学习的有监督在线多核学习算法[J].河南科学,2016,34(9):1423-1427. 被引量：2
5王欢,屠长河.基于哈希学习的动作捕捉数据的编码与检索[J].计算机辅助设计与图形学学报,2016,28(12):2151-2158. 被引量：3
6翟俊海,王婷婷,张明阳,王耀达,刘明明.2种加速K-近邻方法的实验比较[J].河北大学学报（自然科学版）,2016,36(6):650-656. 被引量：3
7王丹,赵文兵,丁治明.大数据安全保障关键技术分析综述[J].北京工业大学学报,2017,43(3):335-349. 被引量：44
8翟俊海,张明阳,王婷婷,郝璞.基于哈希技术和MapReduce的大数据集K-近邻算法[J].计算机科学,2017,44(7):210-214. 被引量：7
9曾宪华,袁知洪,王国胤,杨洁.基于多特征多核哈希学习的大规模图像检索[J].中国科学：信息科学,2017,47(8):1109-1126. 被引量：7
10曹路,杨文强.基于离散监督哈希的相似性检索算法[J].科学技术与工程,2017,17(26):245-250. 被引量：3

同被引文献1

1顾广华,霍文华,苏明月,付灏.基于非对称监督深度离散哈希的图像检索[J].电子与信息学报,2021,43(12):3530-3537. 被引量：5

引证文献1

1庾骏,马江涛,咸阳,侯瑞霞,孙伟.半配对的多模态询问哈希方法[J].电子与信息学报,2024,46(2):481-491.

1杨妮,王志超,孙思豪.基于压缩感知的计算鬼成像技术[J].中国科技成果,2022,23(6):42-43.
2陈汗青,李菲菲,陈虬.基于三维卷积和哈希方法的视频检索算法[J].电子科技,2022,35(4):35-39. 被引量：1
3胡章芳,蹇芳,唐珊珊,明子平,姜博文.DFSMN-T:结合强语言模型Transformer的中文语音识别[J].计算机工程与应用,2022,58(9):187-194. 被引量：9
4王振,杨珺,邓佳莉,谢鸿慧,黄聪.多尺度特征自适应融合的图像语义分割算法[J].小型微型计算机系统,2022,43(4):834-840. 被引量：3
5蒋新竹,袁胜.基于单像素成像和公钥密码的图像加密技术[J].电力信息与通信技术,2022,20(5):87-94. 被引量：6

电子学报

2022年第4期

浏览历史

内容加载中请稍等...

基于松弛Hadamard矩阵的多模态融合哈希方法被引量：1

参考文献7

二级参考文献66

共引文献63

同被引文献1

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于松弛Hadamard矩阵的多模态融合哈希方法 被引量：1

参考文献7

二级参考文献66

共引文献63

同被引文献1

引证文献1

相关作者

相关机构

相关主题

浏览历史

基于松弛Hadamard矩阵的多模态融合哈希方法被引量：1