摘要
时间序列数据主要依据采集时间进行排序,时间序列上相邻的数据具有一定的关联性,当用户读取时间序列数据时不只是读取一条数据,而是连续读取一段时间序列数据。针对时间序列的局部性特点,提出一种基于动态分段的时间序列索引DSI,通过设置差值及差值等级对时间序列数据进行动态分段,使用区间树快速查找不同长度的数据分段块,并利用层次聚类算法优化查询结果集合。实验结果表明,DSI索引的查询效率优于现有时间序列查询索引。
Adjacent time series data is correlated to some extent,as it is ordered by collection time.When extracting data from a time series,users tend to read multiple successive data points rather than a single data point.Based on the data locality of time series,this paper proposes a time series index based on dynamic segmentation,called DSI.DSI sets difference and difference levels to dynamically segment time series data,and uses interval tree to quickly query segmented data blocks of unequal length.The query result set is optimized by using the hierarchical clustering algorithm.Experimental results show that DSI has higher query efficiency than existing time series query indexes.
作者
周骑骏
王鹏
汪卫
ZHOU Qijun;WANG Peng;WANG Wei(School of Computer Science,Fudan University,Shanghai 201203,China)
出处
《计算机工程》
CAS
CSCD
北大核心
2020年第2期88-95,共8页
Computer Engineering
基金
国家自然科学基金(61672163,U1509213)
关键词
索引
范围查询
时间序列
区间树
层次聚类
index
range query
time series
interval tree
hierarchical clustering