摘要
哈希作为一种有效的数据表征技术,已经在应对爆炸式增长的多媒体数据中扮演了重要的角色.它由于低存储和高效率的优势,在多媒体检索领域受到了越来越多的关注.目前多模态哈希学习方法在多媒体检索任务中得到了较好的研究和发展.然而,多数的方法通过编码特征的内积重构成对相似度来保持原始数据的结构信息,但是带来较复杂的优化问题.此外一些模型缺乏判别性使得检索性能的提升受到限制.为了克服上述问题,本文提出一种新型的多模态融合哈希方法,在类别信息的监督下利用Hadamard矩阵为数据生成目标编码,通过松弛严格的二值约束增大类间的间隔,同时采用图嵌入的方式促进类内的紧凑性.本文提出的方法既保证了模型具有很好的判别能力也简化了优化过程.在3个公开数据集上的实验结果表明,本文提出的方法在多媒体数据检索中是非常有效的,平均性能上相比最优的对比方法提高了8.47%.
Hashing,as an effective data representation technology,has played an important role in dealing with the explosive growth of multimedia data.Due to the advantages of its low storage and high efficiency,it has received more and more attention in the field of multimedia retrieval.At present,multi-modal hashing methods have been well researched and developed in multimedia retrieval tasks.However,most of these methods usually use the inner product of hashing features to reconstruct larger pairwise similarity,aiming to preserve the structural information of the original data,which will bring more complex optimization problems.Besides,some models lack discriminant ability,which leads to limitations in the improvement of retrieval performance.In order to overcome the above-mentioned problems,this paper proposes a new multimodal fusion hashing method.Under the supervision of category information,Hadamard matrix is used to generate target codes for data,and the margin between categories is increased by relaxing strict binary constraints.At the same time,the graph embedding approach is used to promote compactness within the class.The proposed method in this paper not only ensures the strong discriminative ability of the model,but also simplifies the optimization process.The experimental results on three public datasets show that the method proposed in this paper is very effective in multimedia data retrieval,and the average performance is 8.47%higher than that of the optimal comparison method.
作者
庾骏
黄伟
张晓波
尹贺峰
YU Jun;HUANG Wei;ZHANG Xiao-bo;YIN He-feng(The College of Computer and Communication Engineering,Zhengzhou University of Light Industry,zhengzhou,Henan 450000,China;The School of Artificial Intelligence and Computer Science,Jiangnan University,Wuxi,Jiangsu 214000,China)
出处
《电子学报》
EI
CAS
CSCD
北大核心
2022年第4期909-920,共12页
Acta Electronica Sinica
基金
河南省科技攻关计划项目(No.222102210064)
郑州轻工业大学博士科研启动基金(No.2021BSJJ025)
国家自然科学基金(No.61902361)。