期刊文献+

RQIC:一种高效时序相似搜索算法 被引量:1

RQIC:An Efficient Similarity Searching Algorithm on Time Series
下载PDF
导出
摘要 索引大规模时序数据库是高效时序搜索中的关键问题.提出了一种新颖的索引方案RQI,它包括3种过滤策略:即first-k过滤、索引低边界和上边界以及三角不等式修剪.基本的思想为首先运用Haar小波变换计算每个时序的小波系数,利用前面的k个小波系数形成一个最小边界矩阵,以利用点过滤方法;然后将预先计算每个时序的低边界特征和上边界特征存放到索引当中;最后采用三角不等式来修剪不相似的序列并确保没有漏报.同时提出了一种新的低边界距离函数SLBS和聚类算法CSA.通过CSA可保持索引良好的聚类特征以提高点过滤方法的效率,从而引入了一种更好的算法RQIC.在合成数据集和实时数据集的大量对比实验表明,RQIC是有效的且具备较高的查询效率. Indexing large time series databases is crucial for efficient search of time series queries. An index scheme RQI (range query based on index) is introduced, which includes three filtering methods: first-k filtering, indexing lower bounding and upper bounding as well as triangle inequality pruning.The basic idea is calculating wavelet coefficients for each time series based on Haar wavelet transform.The first k coefficients are used to form a MBR (minimal bounding rectangle). Thus, the point filtering method can be used. In addition, lower bounding and upper bounding features of each time series are calculated in advance, which are stored into index structure. Finally, triangle inequality pruning method is adopted by calculating the distance between time series beforehand. Moreover, a new lower bounding distance function SLBS (symmetrical Iower bounding based on segment) and a novel clustering algorithm CSA (clustering based on segment approximation) are proposed. The aim is to further improve the search efficiency of point filtering method by keeping a good clustering trait of index structure, thereby obtaining a better algorithm RQIC (range query based on Index and clustering). A lot of compared experiments are conducted over both synthetic and real data sets for RQIC. The results show that RQIC provides perfect pruning ability and high search efficiency.
出处 《计算机研究与发展》 EI CSCD 北大核心 2009年第5期770-778,共9页 Journal of Computer Research and Development
基金 国家"八六三"高技术研究发展计划基金项目(2006AA01Z430 2007AA01Z309)~~
关键词 数据挖掘 算法 索引 聚类 时间序列 相似搜索 data mining algorithm indexing clustering time series similarity search
  • 相关文献

参考文献12

  • 1Sakurai Y, Faloutsos C, Yamamuro M. Stream monitoring under the time warping distanee [C] //Proc of IEEE ICDE. Los Alamitos, CA: IEEE Computer Soeiety, 2007: 1086- 1095
  • 2Cai Y H, Ng R. Indexing spatio-temporal trajectories with chebyshev polynomials [C]//Proc of ACM SIGMOD 2004. New York: ACM, 2004:599-610
  • 3Keogh E, Palpanas T, Zordan V B, et al. Indexing large human-motion databases [C] //Proc of VLDB Int Conf. San Francisco: Morgan Kaufmann, 2004:780-791
  • 4Chakrabarti K, Keogh E, Mehrotra S, et at. Locally adaptive dimensionality reduction for indexing large time series databases [J]. ACM Trans on Database Systems, 2002, 27(2): 188-228
  • 5Keogh E, Ratanamahatana C A. Exact indexing of dynamic time warping [J]. Knowledge and Information Systems, 2005, 7(3): 358-386
  • 6廖巍,景宁,钟志农,陈宏盛.面向移动对象的高效预测范围聚集查询方法[J].计算机研究与发展,2007,44(6):1015-1021. 被引量:5
  • 7陈继东,胡志智,孟小峰,王凌.一种基于城市交通网络的移动对象全时态索引[J].计算机研究与发展,2007,44(6):1008-1014. 被引量:8
  • 8Chen Q X, Chen L, Lian X, et al. Indexable PLA for efficient similarity search [C] //Proc of VLDB. New York: ACM, 2007:435-446
  • 9Liu B, Wang Z, Li J. Tight Bounds on the Estimation Distance Using Wavelet [C] //Proc of the 7th Conf on Web- Age Information Management (WAIM' 06). Berlin.. Springer, 2006:460-471
  • 10Yi B K, Faloutsos C. Fast time sequence indexing for arbitrary Lp norms [C]//Proc of VLDB. San Francisco: Morgan Kaufmaon, 2000:385-304

二级参考文献22

  • 1O Wolfson,A P Sistla,S Chamberlain,et al.Updating and querying databases that track mobile units[J],Distributed and Parallel Databases,1999,7(3):257-387
  • 2C S Jensen,D Lin,B C Ooi.Query and update efficient B^+-tree based indexing of moving objects[C].The 30th Int'l Conf on Very Large Data Base(VLDB),Toronto,Canada,2004
  • 3O Wolfson,A Prasad Sistla,B Xu,et al.Tracking moving objects using database technology in DOMINO[C].The 4th Int'l Workshop on Next Generation Information Technologies and Systems,Zikhron-Yaakov,Israel,1999
  • 4O Wolfson,S Chamberlain,S Dao,et al.Cost and imprecision in modeling the position of moving objects[C].The 14th Int'l Conf on Data Engineering(ICDE),Orlando,FL,1998
  • 5O Wolfson,B Xu,S Chamberlain,el:al.Moving objects databases:Issues and solutions[C].The 10th Int'l Conf on Scientific and Statistical Database Management(SSDBM),Capri,Italy,1998
  • 6N Beckmann,H P Kriegel,R Schneider,et al.The R^*-Tree:An efficient and robust access method for points and rectangles[C].The ACM SIGMOD Int'l Conf on Management of Data,Atlantis,NJ,1990
  • 7J Tayeb,O Ulusoy,O Wolfson.A quadtree-based dynamic attribute indexing method[J].The Computer Journal,1998,41(3):185-200
  • 8S Saltenis,C S Jensen,S T Leutenegger,et al.Indexing the positions of continuously moving objects[C].The ACMSIGMOD Int'l Conf on Management of Data,Dallas,Texas,2000
  • 9M A Nascimento,J R O Silva.Towards historical R-trees[C].The ACM Symposium on Applied Computing,Atlanta,Georgia,1998
  • 10Y Tao,D Papadias.The MV3R-Tree:A spatio-temporal access method for timestamp and interval queries[C].The 26th Int'l Conf on Very Large Data Bases(VLDB),Roma,Italy,2001

共引文献10

同被引文献6

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部