Similarity Query of Time Series Sub-sequences Based on LSH (基于LSH的时间子序列查询算法)

Cited by: 6
Abstract: Subsequence similarity query is an important operation on time series datasets and covers both range queries and k-nearest-neighbor (kNN) queries. Most existing algorithms are based on Euclidean distance or DTW distance, and their main weakness is low query efficiency. This paper proposes a new distance measure based on locality-sensitive hashing (LSH) that greatly improves the efficiency of similarity queries while preserving the quality of the query results. Building on this measure, it presents an index structure named DS-Index, which prunes query candidates using a lower bound on the distance, and further proposes two optimized algorithms, OLSH-Range and OLSH-kNN. Experiments on real stock exchange transaction sequence datasets show that the algorithms find similarity query results quickly and accurately.
Source: Chinese Journal of Computers (《计算机学报》), indexed in EI, CSCD, and the Peking University Core Journals list; 2012, No. 11, pp. 2228-2236 (9 pages)
Funding: Supported by the Shanghai Leading Academic Discipline Project (B114)
Keywords: similarity query; time-series databases; subsequence; Locality Sensitive Hash (LSH); index
