
Fast filtering false active subspaces for efficient high dimensional similarity processing

Abstract: The query space of a similarity query is usually narrowed down by pruning inactive query subspaces, which contain no query results, and keeping active query subspaces, which may contain objects corresponding to the request. However, some active query subspaces may contain no query results at all; these are called false active query subspaces. The performance of query processing degrades in the presence of false active query subspaces. Our experiments show that this problem becomes serious when the data are high dimensional, and that the number of accesses to false active subspaces increases as the dimensionality increases. To solve this problem, this paper proposes a space mapping approach to reduce such unnecessary accesses. A given query space can be refined by filtering within its mapped space. To do so, a mapping strategy called maxgap is proposed to improve the efficiency of the refinement processing. Based on the mapping strategy, an index structure called MS-tree and algorithms for query processing are presented. Finally, the performance of MS-tree is compared with that of other competitors in terms of range queries on a real data set.
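The abstract does not spell out the maxgap mapping or the MS-tree layout, so the following is only a minimal sketch of the general idea of refining a query in a mapped space. It assumes a stand-in mapping that is not taken from the paper: each subspace stores the interval of distances from its points to a hypothetical reference point `pivot`, and the triangle inequality is used to discard subspaces that a bounding-rectangle test alone would keep. All function names here are illustrative.

```python
import math

def dist(a, b):
    """Euclidean distance between two points."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def mbr(points):
    """Minimum bounding rectangle of a subspace, as (low corner, high corner)."""
    dims = range(len(points[0]))
    lo = [min(p[i] for p in points) for i in dims]
    hi = [max(p[i] for p in points) for i in dims]
    return lo, hi

def min_dist_to_mbr(q, box):
    """Smallest possible distance from q to any location inside the rectangle."""
    lo, hi = box
    return math.sqrt(sum(max(l - x, 0.0, x - h) ** 2
                         for x, l, h in zip(q, lo, hi)))

def mapped_interval(points, pivot):
    """Assumed mapping (not maxgap): range of distances to the pivot."""
    ds = [dist(p, pivot) for p in points]
    return min(ds), max(ds)

def survives_mapped_filter(q, r, interval, pivot):
    """Any result x satisfies |d(x, pivot) - d(q, pivot)| <= r (triangle
    inequality), so a subspace whose interval misses that band holds no results."""
    dq = dist(q, pivot)
    lo, hi = interval
    return lo <= dq + r and hi >= dq - r

def range_query(nodes, q, r, pivot=None):
    """Return (results, number of subspaces actually accessed).
    A real index would precompute the rectangle and interval per node."""
    results, accessed = [], 0
    for points in nodes:
        if min_dist_to_mbr(q, mbr(points)) > r:
            continue  # inactive subspace: pruned by the original-space test
        if pivot is not None and not survives_mapped_filter(
                q, r, mapped_interval(points, pivot), pivot):
            continue  # false active subspace: pruned in the mapped space
        accessed += 1
        results.extend(p for p in points if dist(p, q) <= r)
    return results, accessed

nodes = [
    [(0.0, 4.0), (4.0, 0.0)],  # rectangle covers the query, but holds no result
    [(2.0, 2.5), (3.0, 3.0)],  # holds one true result
]
q, r = (2.0, 2.0), 1.0
plain = range_query(nodes, q, r)
refined = range_query(nodes, q, r, pivot=(0.0, 0.0))
print(plain)    # ([(2.0, 2.5)], 2)
print(refined)  # ([(2.0, 2.5)], 1)
```

In this toy case both runs return the same result, but the refined run skips the first subspace, which is active under the rectangle test yet contains nothing within the query radius. How well such a filter works depends on the choice of mapping, which is the role the abstract assigns to maxgap.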
Source: Science in China (Series F) [中国科学(F辑英文版)], 2009, No. 2, pp. 286-294 (9 pages)
Funding: Supported by the National Basic Research Program of China (Grant No. 2006CB303103), the National Natural Science Foundation of China (Grant Nos. 60873011, 60802026, 60773219, 60773021), and the High Technology Program (Grant No. 2007AA01Z192)
Keywords: high dimensional index, refining processing, false active subspace

