期刊文献+

一种基于子空间学习的图像语义哈希索引方法 被引量:8

Semantic Hashing with Image Subspace Learning
下载PDF
导出
摘要 随着数据量的不断增加,快速而准确的索引算法对信息检索而言变得十分重要.针对上述问题,提出了一种基于子空间学习的索引算法.首先,利用部分有标签的数据进行子空间学习,在学习过程中,为了保证语义相同的样本在索引后保持局部性,以样本近邻间的距离衡量类内聚合度;同时,为了保证不同语义的样本在索引后增强判别性,以不同语义样本中心之间的距离衡量类间离散度.通过放松限制,用类似线性判别分析的方法进行子空间学习,将子空间作为哈希函数的投影向量.利用学习到的投影向量进一步计算偏移量,得到哈希函数.分别在数据集MNIST和CIFAR-10上进行编码判别性实验和局部性保留实验,并与相关方法进行比较,得到了较好的效果.实验结果表明该方法是有效的. With the increasing amount of data being collected, developing fast indexing methods with high accuracy becomes important for information retrieval tasks. To address this issue, this paper proposes an indexing method based on hashing mechanism with subspace learning. Firstly, the subspace is learned on a set of labeled data. To guarantee the locality preserving characteristics in the original space for the samples with similar semantic labels, the distances between the nearest neighbors are computed to measure the intra-class scatter. Besides, the distances between the centers of samples with dissimilar semantic labels are also computed to measure the inter-class scatter in order to enhance the discriminative power of the codes. The projections of the hash functions are then learned by relaxing the constraint of the formula. The biases are further learned based on the projections. Finally, the proposed method is evaluated on the datasets MNIST and CIFAR-10 to compare with the state-of-the-art methods. Experimental results show that the proposed method achieves significant performance and high effectiveness in searching semantically similar neighbors.
出处 《软件学报》 EI CSCD 北大核心 2014年第8期1781-1793,共13页 Journal of Software
基金 国家自然科学基金(61273257 61321491 61035003) 国家重点基础研究发展计划(973)(2010CB327903) 教育部新世纪优秀人才计划(NCET-11-0213) 江苏省六大人才高峰计划(2013-XXRJ-018) 江苏省自然科学基金(BK2011005)
关键词 哈希函数 子空间 偏移量 局部性保留 判别性 hash function subspace bias locality preserving discriminant
  • 相关文献

参考文献24

  • 1http://venturebeat.com/2008/07/25/google-finds-that-the-web-has-over-1-trillion-unique-urls.
  • 2/http://www.kullin.net/2010/09/flickr-5-billion-photos/.
  • 3Arya S, Mount DM. Approximate nearest neighbor queries in fixed dimensions. In: Proc. of the 4th Annual ACM/SIGACT-SIAM Symp. on Discrete Algorithms. New York: ACM/SIAM, 1993. 271-280.
  • 4Gionis A,Indyk P, Motwani R. Similarity search in high dimensions via hashing, In: Proc. of the 25th Int'l Conf. on Very Large Data Bases. San Francisco: Morgan Kaufmann Publishers, 1999.518-529.
  • 5Weiss Y, Torralba A, Fergus R. Spectral hashing. In: Proc. of the 22th Annual Conf. on Neural Information Processing System, New York: Curran Associates Inc., 2008. 1753-1760.
  • 6Torralba A, Fergus R, Freeman WT. 80 million tiny images: A large dataset for non-parametric object and scene recognition, IEEE Trans. on Pattern Analysis and Machine Intelligence, 2008,30(11):1958-1970. [doi: 10,1109/TPAMI,2008.128].
  • 7Torralba A, Fergus R, Weiss Y. Small codes and large databases for recognition, In: Proc, of the IEEE Computer Society Conf. on Computer Vision and Pattern Recognition, Washington: IEEE Computer Society, 2008. 1-8. [doi: 10,1109/CVPR.2008.4587633].
  • 8Kulis B, Jain P, Grauman K. Fast similarity search for learned metric. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2009,31(12):2143-2157. [doi: 10.1109/TPAMI.2009,151].
  • 9Xu H, Wang JD, Li Z, Zeng G, Li SP, Yu NH, Complementary hashing for approximate nearest neighbor search. In: Proc. of the IEEE Int'l Conf. on Computer Vision, New York: IEEE, 2011. 1631-1638. [doi: 10.1109/ICCV.2011.6126424].
  • 10Strecha C, Bronstein AM, Bronstein MM, Fua P. LDAHash: Improving matching with smaller descriptors. IEEE Trans. on Pattern Analysis and Machine Intelligence, 2012,34(1):66-78. [doi: 10.1109/TPAMI.2011.103].

同被引文献119

引证文献8

二级引证文献104

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部