Learning to Hash with Discrete Optimization (基于离散优化的哈希编码学习方法)

Cited by: 6


Abstract: Billions of images are uploaded to the Internet every day, making it extremely difficult to find an image of interest on demand. This paper addresses content-based image retrieval, which aims to find database images that are similar to a given query image. Because modern datasets are so large, exact nearest neighbor search cannot return results in acceptable time, so approximate nearest neighbor (ANN) search methods trade some accuracy for acceptable retrieval time. As a mainstream ANN method, hashing projects the original feature vectors of samples into very compact binary codes and is therefore highly efficient in both computation and storage. As a result, hashing has received growing research attention over the past twenty years.

However, because binary codes are discrete, directly optimizing them is an NP-hard problem, and the time needed to reach the global optimum would be unacceptable. Existing hashing methods therefore relax the binary codes to real values and optimize the real-valued counterpart of the objective function instead; the optimum found in the relaxed space is then quantized to produce the actual binary codes. There is no guarantee that the real-valued optimum remains optimal after quantization, so these methods inevitably suffer a performance drop caused by the discrepancy between the binary Hamming space and the real-valued Euclidean space.

To better address this quantization loss, this paper proposes a hash learning method named Deep Discrete Optimization Hashing (DDOH). First, the initial binary codes of all training images are obtained with one of the three binary code initialization methods proposed in the paper. Next, a discrete optimization algorithm takes the initial binary codes and the corresponding labels as input and iteratively decides whether to flip individual bits according to the Fisher criterion; the paper proves that each such step improves, or at least does not decrease, the discriminability of the binary codes as measured by that criterion. Finally, to obtain hash functions for encoding new images, a deep convolutional neural network (CNN) is trained to fit the optimized codes: each bit is treated as a binary classification problem, and the binary classifiers, which all share the same CNN feature map as input, serve as the hash functions.

Experiments on two widely studied datasets, CIFAR-10 and ImageNet-100, show that the proposed method matches the state-of-the-art retrieval performance on CIFAR-10 and improves on existing hashing methods by about 2.2% in mean Average Precision (mAP) on ImageNet-100, validating its effectiveness.
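As a rough illustration of the bit-flipping idea described in the abstract, the following Python sketch flips one bit at a time and keeps a flip only when a Fisher-style score strictly improves, so the score never decreases. The abstract does not give the exact DDOH objective or update rule, so the score used here (between-class scatter divided by within-class scatter on codes in {-1, +1}), the function names `fisher_score` and `greedy_bit_flip`, and the sweep limit are all assumptions made for illustration, not the authors' algorithm.

```python
import numpy as np


def fisher_score(codes, labels):
    """Assumed Fisher-style score: between-class / within-class scatter."""
    overall_mean = codes.mean(axis=0)
    between = 0.0
    within = 0.0
    for c in np.unique(labels):
        members = codes[labels == c]
        class_mean = members.mean(axis=0)
        between += len(members) * np.sum((class_mean - overall_mean) ** 2)
        within += np.sum((members - class_mean) ** 2)
    # Small constant avoids division by zero when every class is perfectly tight.
    return between / (within + 1e-12)


def greedy_bit_flip(codes, labels, max_sweeps=10):
    """Flip single bits of {-1, +1} codes, keeping only flips that raise the score."""
    codes = codes.copy()
    best = fisher_score(codes, labels)
    for _ in range(max_sweeps):
        improved = False
        for i in range(codes.shape[0]):
            for k in range(codes.shape[1]):
                codes[i, k] = -codes[i, k]          # tentative flip
                candidate = fisher_score(codes, labels)
                if candidate > best:
                    best = candidate                # keep the flip
                    improved = True
                else:
                    codes[i, k] = -codes[i, k]      # revert the flip
        if not improved:
            break                                   # no single flip helps any more
    return codes, best


# Toy usage: 15 samples, 3 classes, 8-bit random initial codes (hypothetical data).
rng = np.random.default_rng(0)
labels = np.repeat(np.arange(3), 5)
initial_codes = rng.choice([-1, 1], size=(15, 8))
optimized_codes, final_score = greedy_bit_flip(initial_codes, labels)
```

Because a flip is accepted only when the score rises, the optimization stays entirely in the binary space and no quantization step is needed afterwards. The sketch recomputes the full score for every candidate flip, which is only practical for toy inputs.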

Authors: LIU Hao-Miao (刘昊淼); WANG Rui-Ping (王瑞平); SHAN Shi-Guang (山世光); Xilin Chen (陈熙霖) (Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190; School of Computer Science and Technology, University of Chinese Academy of Sciences, Beijing 100049)

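Affiliations: Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences; School of Computer Science and Technology, University of Chinese Academy of Sciences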

Source: Chinese Journal of Computers (《计算机学报》), indexed in EI, CSCD, and the Peking University Core Journals list; 2019, No. 5, pp. 1149-1160 (12 pages)

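Funding: Supported by the National Basic Research Program of China (973 Program) (2015CB351802), the National Natural Science Foundation of China (61772500), the Key Research Program of Frontier Sciences, Chinese Academy of Sciences (QYZDJ-SSW-JSC009), and the Youth Innovation Promotion Association of the Chinese Academy of Sciences (2015085).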

Keywords: approximate nearest neighbor search; high-dimensional feature indexing; hash learning; discrete optimization; convolutional neural network

Classification code: TP311 [Automation and Computer Technology: Computer Software and Theory]

