期刊文献+

密集网络图像哈希检索

Image Hash retrieval with DenseNet
原文传递
导出
摘要 目的 为提取可充分表达图像语义信息的图像特征,减少哈希检索中的投影误差,并生成更紧致的二值哈希码,提出一种基于密集网络和改进的监督核哈希方法.方法 用训练优化好的密集网络提取图像的高层语义特征;先对提取到的图像特征进行核主成分分析投影,充分挖掘图像特征中隐含的非线性信息,以减少投影误差,再利用监督核哈希方法对图像特征进行监督学习,将特征映射到汉明空间,生成更紧致的二值哈希码.结果 为验证提出方法的有效性、可拓展性以及高效性,在Paris6K和LUNA16(lung nodule analysis 16)数据集上与其他6种常用哈希方法相比,所提方法在不同哈希码长下的平均检索精度均较高,且在哈希码长为64 bit时,平均检索精度达到最高,分别为89.2%和92.9%;与基于卷积神经网络的哈希算法(convolution neural network Hashing,CNNH)方法相比,所提方法的时间复杂度有所降低.结论 提出一种基于密集网络和改进的监督核哈希方法,提高了图像特征的表达能力和投影精度,具有较好的检索性能和较低的时间复杂度;且所提方法的可拓展性也较好,不仅能够有效应用到彩色图像检索领域,也可以应用在医学灰度图像检索领域. Objective To extract image features that can fully express image semantic information,reduce projection errors in Hash retrieval,and generate more compact binary Hash codes,a method based on dense network and improved supervised Hashing with kernels is proposed. Method The pre-processed image data set is used to train the dense network. To reduce the over-fitting phenomenon,L2 regularization term is added into the cross entropy as a new loss function. When the dense network model is training,batch normalization (BN) algorithm and root mean square prop (RMSProp) optimization algorithm are used to improve the accuracy and robustness of the model. High-level semantic features of images with trained and optimized dense network model are removed to enhance the ability of image features to express image information and build an image feature library of the image dataset. The kernel principal component analysis projection is then performed on the extracted image features. The nonlinear information implicit in the image features is fully exploited to reduce the projection error. The supervised kernel Hash method is also used to supervise the image features,enhance the resolution of the linear inseparable image feature data,and map the features to the Hamming space. According to the correspondence between the inner product of Hash code and Hamming distance and the semantic similarity monitoring matrix composed of image label information,the Hamming distance is optimized to generate a more compact binary Hash code. Next,the image feature Hash code library of the image dataset is constructed. Finally,the same operation is performed on the input query image to obtain the Hash code of the query image. The Hamming distance between the Hash code of the query image and the Hash code of the image feature in the image dataset is compared to measure the similarity. The retrieved similar images are returned in ascending order. Result To verify the effectiveness,expansion,and efficiency of the proposed method,our method is used respectively in Paris6 K and lung nodule analysis 16 (LUNA16) datasets. It is also compared with other six commonly used Hashing methods. The average retrieval accuracy is compared in 12,24,32,48,64,and 128 bits of code length. Experimental results show that the average retrieval accuracy increases with the increase of Hash code length. When the Hash code length increases to a certain value,the average retrieval accuracy decreases. The average retrieval accuracy of the proposed method is always higher than that of the other six Hash methods. Except for the semantic Hashing method,the average retrieval accuracy value reaches the maximum when the Hash code length is 48 bits. Other Hash methods,including the proposed method,have the maximum average retrieval accuracy value when the Hash code length is 64 bits,and the retrieval accuracy is better. When the Hash code length is 64 bits,the average retrieval accuracy value of the proposed method is as high as 89. 2% and 92. 9% in the Paris6 K and LUNA16 datasets,respectively. The time complexity of the proposed method and the convolutional neural network (CNN) Hashing method is compared in the Paris6 K and LUNA16 data sets when the Hash code length is 12,24,32,48,64,and 128 bits. Results show that the time complexity of the proposed method is reduced under different Hash code lengths and is efficient to a certain degree. Conclusion A method based on dense network and improved supervised Hashing with kernels is proposed. This method improves the expression ability of image features and projection accuracy and is superior to other similar methods in average retrieval accuracy,recall rate,and precision rate. It improves the retrieval performance to some extent. It has a lower time complexity of algorithm than the method of CNN Hashing method. In addition,the proposed method has better extensibility,which can be used not only in the field of color image retrieval but also in the field of medical gray scale image retrieval.
作者 王亚鸽 康晓东 郭军 李博 张华丽 刘汉卿 Wang Yage;Kang Xiaodong;Guo Jun;Li bo;Zhang Huali;Liu Hanqing(School of Medical Imaging,Tianjin Medical University,Tianjin 300203,China)
出处 《中国图象图形学报》 CSCD 北大核心 2020年第5期900-912,共13页 Journal of Image and Graphics
基金 京津冀协同创新项目(17YEXTZC00020)。
关键词 密集卷积网络(DenseNet) 监督核哈希 图像特征 投影误差 核主成分分析 dense convolutional network(DenseNet) supervised Hashing with Kernels image features projection error kernel principal component analysis(KPCA)
  • 相关文献

参考文献2

二级参考文献53

  • 1Mayer-Sch?nberger V, Cukier K. Big Data: A Revolution That Will Transform How We Live, Work, and Think. Boston: Eamon Dolan/Houghton Mifflin Harcourt, 2013.
  • 2Hey T, Tansley S, Tolle K. The Fourth Paradigm: Data-Intensive Scientific Discovery. Redmond: Microsoft Research, 2009.
  • 3Bryant R E. Data-intensive scalable computing for scientific applications. Comput Sci Engin, 2011, 13: 25-33.
  • 4周志华. 机器学习与数据挖掘. 中国计算机学会通讯, 2007, 3: 35-44.
  • 5Zhou Z H, Chawla N V, Jin Y, et al. Big data opportunities and challenges: Discussions from data analytics perspectives. IEEE Comput Intell Mag, 2014, 9: 62-74.
  • 6Jordan M. Message from the president: The era of big data. ISBA Bull, 2011, 18: 1-3.
  • 7Kleiner A, Talwalkar A, Sarkar P, et al. The big data bootstrap. In: Proceedings of the 29th International Conference on Machine Learning (ICML), Edinburgh, 2012, 1759-1766.
  • 8Shalev-Shwartz S, Zhang T. Accelerated proximal stochastic dual coordinate ascent for regularized loss minimization. In: Proceedings of the 31st International Conference on Machine Learning (ICML), Beijing, 2014, 64-72.
  • 9Gonzalez J E, Low Y, Gu H, et al. PowerGraph: Distributed graph-parallel computation on natural graphs. In: Proceedings of the 10th USENIX Symposium on Operating Systems Design and Implementation (OSDI), Hollywood, 2012, 17-30.
  • 10Gao W, Jin R, Zhu S, et al. One-pass AUC optimization. In: Proceedings of the 30th International Conference on Machine Learning (ICML), Atlanta, 2013, 906-914.

共引文献47

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部