期刊文献+

一种关于数据流区间Disjoint查询的快速处理算法 被引量:1

A Fast Processing Algorithm on Section Disjoint Query of Data Stream
下载PDF
导出
摘要 在关于数据流子序列相似性匹配的研究中,Disjoint查询是很重要的一类,在传感网络和数据挖掘等方面都发挥着非常重要的作用.但现有的研究并没有关注到定长区间上的Disjoint查询问题.直接对每个区间内成员使用Spring算法是解决该问题的NAIVE算法,但是因为NAIVE算法不具有增量计算的特点,所以存在冗余运算.针对NAIVE算法冗余运算的处理问题,提出了边界路径技术.边界路径技术很好地使用了Spring算法在相邻前一区间上的执行结果,使得Spring算法无需对当前区间上每个成员执行,就可以得到Disjoint查询在该区间的查询结果.使用该技术对NAIVE算法进行改造,设计并实现了快速区间Disjoint查询处理算法(fast section Disjoint query processing algorithm,FSDQ),该算法具有增量计算的特点.实验证明FSDQ算法可以有效减少NAIVE算法所具有的冗余运算,是处理数据流上区间Disjoint查询的有效方法. Disjoint query has played a very important role in the research about similarity match over subsequence of data stream, such as sensor network, data mining and so on. But the existing research did not focus on the disjoint query which is executed for elements of data stream in the fixed-length section. Running Spring algorithm for all the elements in every section directly is the NAIVE algorithm. But this algorithm does not have the characteristic of incremental computation, so it has redundant operation. The boundary path technology is proposed in this paper to deal with this redundant operation. The NAIVE algorithm can get the disjoint query results of current section based on the ones of previous section and need not performing for every element in current section by using this technology, so it has the characteristic of incremental computation. The FSDQ algorithm, which has the characteristic of incremental computation, is proposed by improving NAIVE algorithm with the boundary path technology. Extensive experiments on real and synthetic data sets show that the FSDQ algorithm is an effective way to solve the problem of section disjoint query over data stream, and it can reduce redundant operations the NAIVE algorithm has and meet the requirements of practical application.
出处 《计算机研究与发展》 EI CSCD 北大核心 2014年第5期1136-1148,共13页 Journal of Computer Research and Development
基金 国家自然科学基金项目(60903159,61173153) 中央高校基本科研业务费专项资金项目(N110818001,N100218001) 沈阳市科技计划项目(1091176-1-00)
关键词 Disjoint查询 欧氏距离 动态时间弯曲 数据流 Spring算法 Disjoint query Euclidean distance dynamic time warping (DTW) data stream Springalgorithm
  • 相关文献

参考文献3

二级参考文献23

  • 1钱江波,徐宏炳,王永利,刘学军,董逸生.多数据流滑动窗口并发连接方法[J].计算机研究与发展,2005,42(10):1771-1778. 被引量:10
  • 2潘定,沈钧毅.时态数据挖掘的相似性发现技术[J].软件学报,2007,18(2):246-258. 被引量:41
  • 3Franky Kin-Pong C, Ada Wai-chee F, Clement Y. Haar Wavelets for Efficient Similarity Search of Time-series:With and Without Time Warping [J]. IEEE Trans. on Knowl. and Data Eng., 2003,15 (3): 686-705.
  • 4Popivanov I, Miller R J. Similarity Search Over Time-Series Data Using Wavelets[C]//Proceedings of the 18th International Conference on Data Engineering. IEEE Computer Society, 2002: 212-216.
  • 5Liabotis I, Theodoulidis B, Saraaee M. Improving Similarity Search in Time Series Using Wavelets[J]. International Journal of Data Warehousing and Mining, 2006,2 (2).
  • 6Yingyi B, Lei C, Ada Wai-Chee F, et al. Efficient anomaly monitoring over moving object trajectory streams[C]//Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. Paris, France, ACM, 2009: 159-168.
  • 7Hao-Ping H, Ming-Syan C. Efficient range-constrained similarity search on wavelet synopses over multiple streams[C]//Proceedings of the 15th ACM International Conference on Information and Knowledge Management. Arlington, Virginia, USA, ACM, 2006 : 327-336.
  • 8Mayur D, Aristides G, Piotr I, et al. Maintaining stream statistics over sliding windows [C]// Proceedings of the Thirteenth Annual ACM-SIAM Symposium on Discrete Algorithms. San Francisco, California, Society for Industrial and Applied Mathematics, 2002 : 635-644.
  • 9Lukasz G, David D, Erik D D, et al. Identifying frequent items in sliding windows over on-line packet streams[C]//Proceedings of the 3rd ACM SIGCOMM Conference on Internet Measurement. Miami Beach, FL, USA, ACM, 2003: 173-178.
  • 10Hua-Fu L, Suh-Yin L. Mining frequent itemsets over data streams using efficient window sliding techniques[J]. Expert Syst. Appl. ,2009,36(2) : 1466-1477.

共引文献8

同被引文献5

引证文献1

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部