Journal Article

哈夫曼编码乘积量化的图像哈希检索方法 (Cited by: 4)

Hashing method for image retrieval based on product quantization with Huffman coding
Abstract

Objective: Hashing is one of the most popular approaches to content-based image retrieval. The main idea is to learn a fixed-length binary code for each image and then use the Hamming distance to measure image similarity. An effective hashing method should have at least three properties. First, the learned codes should be short, so that large numbers of images can be stored in a small amount of memory. Second, images that are perceptually or semantically similar should be mapped to binary strings with a small Hamming distance. Third, learning the code parameters and encoding a new test image should be efficient. Most hashing approaches achieve binary coding in two steps: projection and quantization. For projection, most approaches apply principal component analysis (PCA) to reduce the dimensionality of the raw data, whereas the quantization strategies differ considerably across methods. In the quantization stage, traditional hashing methods usually allocate the same number of bits to every data subspace. However, the information content differs from subspace to subspace, so uniform quantization leads to inefficient codes and high quantization distortion, especially when the data carry unbalanced information. To address this problem, this study proposes an effective coding method based on product quantization that uses Huffman coding.

Method: Like most hashing approaches, the proposed method uses PCA to reduce the dimensionality of the raw data in the projection stage, and a vector quantization scheme is then designed for the quantization stage. The approach first applies product quantization to the dimension-reduced data so as to preserve the data distribution of the original space. For each subspace, the variance is computed directly as the measure of its information content. To be effective, subspaces with high information content should receive more bits for binary coding, and vice versa. To this end, the reciprocals of the variance proportions are used to build a Huffman tree, which then generates the Huffman codes; a different number of bits and different code values are thereby assigned to each subspace. In other words, more bits are allocated to subspaces with large variance and fewer to subspaces with small variance. Because the variance is easy to calculate, the proposed approach is simple and efficient for binary coding, and the experiments show that the resulting Huffman coding is effective for image retrieval.

Result: The proposed approach is evaluated on three public datasets: MNIST, NUS-WIDE, and 22K LabelMe. For each image, a 512-D GIST descriptor is extracted as the input of the hashing approach. To verify its performance, the proposed approach is compared with four related approaches: the original product quantization method, a PCA-based product quantization method, iterative quantization, and transform coding (TC). Results are reported in terms of quantization distortion, mean average precision, recall, and training time. Compared with the existing product quantization method, the proposed approach reduces the average quantization distortion by approximately 49% and increases the mean average precision of the retrieval results by approximately 19%. The training time is also compared with that of TC for code lengths from 32 bits to 256 bits on MNIST, where the proposed approach reduces the training time by 22.5 s on average.

Conclusion: This study proposes Huffman coding for the product quantization stage of image retrieval. According to the information content of each subspace, the Huffman-based product quantization scheme allocates a different number of bits to each data subspace, which effectively increases coding efficiency and quantization accuracy. The proposed approach is tested on three public datasets and compared with four related approaches. Experimental results demonstrate that it outperforms several state-of-the-art algorithms for image retrieval in terms of mean average precision and recall. The proposed approach is not an exact coding method; future work will therefore focus on precise hashing methods for effective image retrieval.
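The Method paragraph above describes the variance-driven bit allocation only in prose. The following Python sketch illustrates one way the idea could be implemented; it is an illustration based on the abstract, not the authors' released code, and the helper names (huffman_code_lengths, allocate_bits), the use of NumPy, and the even column split into subspaces are all assumptions. It builds a Huffman tree from the reciprocals of the per-subspace variance proportions, so that high-variance subspaces end up with longer codes, i.e. more quantization bits.

```python
# Minimal sketch (assumed implementation, not the authors' code) of the
# variance-driven Huffman bit allocation described in the abstract:
# subspaces with larger variance receive longer Huffman codes (more bits).
import heapq
import itertools
import numpy as np


def huffman_code_lengths(weights):
    """Return the Huffman code length for each symbol given its weight."""
    counter = itertools.count()  # tie-breaker so heapq never compares lists
    heap = [(w, next(counter), [i]) for i, w in enumerate(weights)]
    lengths = [0] * len(weights)
    heapq.heapify(heap)
    while len(heap) > 1:
        w1, _, s1 = heapq.heappop(heap)
        w2, _, s2 = heapq.heappop(heap)
        for i in s1 + s2:  # each merge adds one bit to every merged symbol
            lengths[i] += 1
        heapq.heappush(heap, (w1 + w2, next(counter), s1 + s2))
    return lengths


def allocate_bits(X_pca, n_subspaces):
    """Split PCA-projected data into subspaces and allocate bits per subspace.

    Large-variance subspaces get longer codes because their Huffman weight is
    the reciprocal of their variance proportion (small weight -> deep leaf).
    """
    subspaces = np.array_split(X_pca, n_subspaces, axis=1)
    variances = np.array([s.var(axis=0).sum() for s in subspaces])
    proportions = variances / variances.sum()
    weights = 1.0 / proportions  # reciprocal of the variance proportion
    return subspaces, huffman_code_lengths(weights)


if __name__ == "__main__":
    # Toy usage on synthetic PCA-like data with decaying per-dimension variance.
    rng = np.random.default_rng(0)
    X_pca = rng.normal(size=(1000, 32)) * np.linspace(3.0, 0.5, 32)
    subspaces, bits = allocate_bits(X_pca, n_subspaces=8)
    print("bits per subspace:", bits, "total code length:", sum(bits))
```

A per-subspace quantizer with 2**bits centroids (e.g. k-means run independently on each subspace, as in standard product quantization) would then complete the encoder; that step is omitted here for brevity.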
Authors: Luan Tingting (栾婷婷), Zhu Jihua (祝继华), Xu Siyu (徐思雨), Wang Jiaxing (王佳星), Shi Xuan (时璇), Li Yaochen (李垚辰) (The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou 310003, China; School of Software Engineering, Xi'an Jiaotong University, Xi'an 710049, China)
Source: Journal of Image and Graphics (《中国图象图形学报》), indexed in CSCD and the Peking University Core Journal list, 2019, No. 3, pp. 389-399 (11 pages)
Funding: National Natural Science Foundation of China (61573273, 61603289)
Keywords: hashing; image retrieval; approximate nearest neighbor search; product quantization; bit allocation; coding efficiency
