期刊文献+

面向视觉搜索的空间局部敏感哈希方法 被引量:4

Locality-sensitive hashing approach based on semantic space for visual retrieval
原文传递
导出
摘要 目的视觉检索需要准确、高效地从大型图像或者视频数据集中检索出最相关的视觉内容,但是由于数据集中图像数据量大、特征维度高的特点,现有方法很难同时保证快速的检索速度和较好的检索效果。方法对于面向图像视频数据的高维数据视觉检索任务,提出加权语义局部敏感哈希算法(weighted semantic locality-sensitive hashing,WSLSH)。该算法利用两层视觉词典对参考特征空间进行二次空间划分,在每个子空间里使用加权语义局部敏感哈希对特征进行精确索引。其次,设计动态变长哈希码,在保证检索性能的基础上减少哈希表数量。此外,针对局部敏感哈希(locality sensitive hashing,LSH)的随机不稳定性,在LSH函数中加入反映参考特征空间语义的统计性数据,设计了一个简单投影语义哈希函数以确保算法检索性能的稳定性。结果在Holidays、Oxford5k和DataSetB数据集上的实验表明,WSLSH在DataSetB上取得最短平均检索时间0.03425 s;在编码长度为64位的情况下,WSLSH算法在3个数据集上的平均精确度均值(mean average precision,mAP)分别提高了1.2%~32.6%、1.7%~19.1%和2.6%~28.6%,与几种较新的无监督哈希方法相比有一定的优势。结论通过进行二次空间划分、对参考特征的哈希索引次数进行加权、动态使用变长哈希码以及提出简单投影语义哈希函数来对LSH算法进行改进。由此提出的加权语义局部敏感哈希(WSLSH)算法相比现有工作有更快的检索速度,同时,在长编码的情况下,取得了更为优异的性能。 Objective Visual retrieval methods need to accurately and efficiently retrieve the most relevant visual content from large-scale images or video datasets. However, due to a large amount of image data and high feature dimensionality in the dataset, existing methods face difficulty in ensuring fast retrieval speed and good retrieval results. Hashing is a widely studied solution for approximate nearest neighbor search, which aims to convert high-dimensional data items into a low-dimensional representation or a hash code consisting of a set of bit sequences. Locality-sensitive hashing(LSH) is a data-independent, unsupervised hashing algorithm that provides asymptotic theoretical properties, thereby ensuring performance. LSH is considered as one of the most common methods for fast nearest-neighbor search in high-dimensional space. Nevertheless, if the number of hash functions k is set too small, it leads to too many data items falling into each hash bucket, thus increasing the query response time. By contrast, if k is set too large, the number of data items in each hash bucket is reduced. Moreover, to achieve the desired search accuracy, LSH usually needs to use long hash codes, thereby reducing the recall rate. Although the use of multiple hash tables can alleviate this problem, it significantly increases memory cost and query time. Besides, due to the semantic gap between the visual semantic space and metric space, LSH may not obtain good search performance. Method For visual retrieval of high-dimensional data, we first propose a hash algorithm called weighted semantic locality-sensitive hashing(WSLSH), which is based on feature space partitioning, to address the aforementioned drawbacks of LSH. While building the indices, WSLSH considers the distance relationship between reference and query features, divides the reference feature space into two subspaces by a two-layer visual dictionary, and employs weighted-semantic locality sensitive hashing in each subspace to index, thereby forming a hierarchical index structure. The proposed algorithm can rapidly converge the target to a small range in the process of large-scale retrieval and make accurate queries, which greatly improves the retrieval speed. Then, dynamic variable-length hashing codes are applied in a hashing table to retrieve multiple hashing buckets, which can reduce the number of hashing tables and improve the retrieval speed based on guaranteeing the retrieval performance. Through these two improvements, the retrieval speed can be greatly improved. In addition, to solve the random instability of LSH, statistical information reflecting the semantics of reference feature space is introduced into the LSH function, and a simple projection semantic-hashing function is designed to ensure the stability of the retrieval performance. Result Experiment results on Holidays, Oxford5 k, and DataSetB datasets show that the retrieval accuracy and retrieval speed are effectively improved in comparison with the representative unsupervised hash methods. WSLSH achieves the shortest average retrieval time(0.034 25 s) on DataSetB. When the encoding length is 64 bit, the mean average precision(mAP) of the WSLSH algorithm is improved by 1.2%~32.6%,1.7%~19.1%, and 2.6%~28.6%. WSLSH is not highly sensitive to the size change of the reference feature subset involved, so the retrieval time does not change significantly, which reflects the retrieval advantage of WSLSH for large-scale datasets. With the increase of encoding length, the performance of the WSLSH algorithm is improved gradually. When the encoding length is 64 bit, the WSLSH algorithm obtains the highest precision and recall on the three datasets, which is superior to other compared methods. Conclusion The LSH algorithm is improved by performing feature space division twice, weighting the number of hash indexes of reference features, dynamically using variable-length hash codes, and introducing a simple-projection semantic-hash function. Thus, the proposed WSLSH algorithm has faster retrieval speed. In the case of long encoding length, WSLSH achieves better performance than the compared works and shows high application value for large-scale image datasets.
作者 黄小燕 孙彬 杨展源 朱映映 田奇 Huang Xiaoyan;Sun Bin;Yang Zhanyuan;Zhu Yingying;Tian Qi(College of Computer Science and Software Engineering,Shenzhen University,Shenzhen 518000,China;Huawei Technologies Co.,Ltd.,Shenzhen 518000,China)
出处 《中国图象图形学报》 CSCD 北大核心 2021年第7期1568-1582,共15页 Journal of Image and Graphics
基金 国家自然科学基金项目(62072318) 广东省自然科学基金项目(2021A1515012014) 深圳市科技计划基础研究面上项目(JCYJ20190808172007500)。
关键词 特征空间划分 局部敏感哈希(LSH) 动态变长哈希码 视觉搜索 最近邻搜索 feature space partitioning locality-sensitive hashing(LSH) dynamic variable-length hashing code visual retrieval nearest neighbor search
  • 相关文献

参考文献3

二级参考文献34

  • 1蒋凯,武港山.基于Web的信息检索技术综述[J].计算机工程,2005,31(24):7-9. 被引量:20
  • 2http://venturebeat.com/2008/07/25/google-finds-that-the-web-has-over-1-trillion-unique-urls.
  • 3/http://www.kullin.net/2010/09/flickr-5-billion-photos/.
  • 4Arya S, Mount DM. Approximate nearest neighbor queries in fixed dimensions. In: Proc. of the 4th Annual ACM/SIGACT-SIAM Symp. on Discrete Algorithms. New York: ACM/SIAM, 1993. 271-280.
  • 5Gionis A,Indyk P, Motwani R. Similarity search in high dimensions via hashing, In: Proc. of the 25th Int'l Conf. on Very Large Data Bases. San Francisco: Morgan Kaufmann Publishers, 1999.518-529.
  • 6Weiss Y, Torralba A, Fergus R. Spectral hashing. In: Proc. of the 22th Annual Conf. on Neural Information Processing System, New York: Curran Associates Inc., 2008. 1753-1760.
  • 7Torralba A, Fergus R, Freeman WT. 80 million tiny images: A large dataset for non-parametric object and scene recognition, IEEE Trans. on Pattern Analysis and Machine Intelligence, 2008,30(11):1958-1970. [doi: 10,1109/TPAMI,2008.128].
  • 8Torralba A, Fergus R, Weiss Y. Small codes and large databases for recognition, In: Proc, of the IEEE Computer Society Conf. on Computer Vision and Pattern Recognition, Washington: IEEE Computer Society, 2008. 1-8. [doi: 10,1109/CVPR.2008.4587633].
  • 9Kulis B, Jain P, Grauman K. Fast similarity search for learned metric. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2009,31(12):2143-2157. [doi: 10.1109/TPAMI.2009,151].
  • 10Xu H, Wang JD, Li Z, Zeng G, Li SP, Yu NH, Complementary hashing for approximate nearest neighbor search. In: Proc. of the IEEE Int'l Conf. on Computer Vision, New York: IEEE, 2011. 1631-1638. [doi: 10.1109/ICCV.2011.6126424].

共引文献22

同被引文献30

引证文献4

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部