An LSH Based Time Subsequence Matching Algorithm

Cited by: 1

Abstract: This paper proposes LSHSM, a method that applies locality sensitive hashing (LSH) to the time subsequence matching problem. Unlike the FRM and Dual Match methods, LSHSM does not apply a feature transformation such as the DFT or DWT to the time series. Instead, it treats each sequence directly as a high-dimensional data point and relies on the ability of LSH to handle high-dimensional data to find similar time subsequences. Experiments on three different time series datasets, with a linear scan as the baseline, show that the algorithm is effective and delivers a large performance improvement.
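The abstract describes the approach only at a high level, so the following is a minimal sketch rather than the authors' LSHSM implementation. It assumes fixed-length sliding windows, Euclidean distance, and the standard p-stable (Gaussian) hash h(v) = floor((a·v + b) / r). The class name `SubsequenceLSH` and the parameters `num_tables`, `hashes_per_table`, and `bucket_width` are hypothetical and chosen for illustration. A `linear_scan` function is included as the kind of baseline the abstract mentions.

```python
import math
import random
from collections import defaultdict


def sliding_windows(series, w):
    """Yield (offset, window) for every length-w subsequence of series."""
    for i in range(len(series) - w + 1):
        yield i, series[i:i + w]


def euclidean(a, b):
    """Euclidean distance between two equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


class SubsequenceLSH:
    """Illustrative p-stable LSH index over sliding windows (assumed scheme)."""

    def __init__(self, w, num_tables=8, hashes_per_table=4,
                 bucket_width=4.0, seed=0):
        rng = random.Random(seed)
        self.w = w
        self.bucket_width = bucket_width
        # Each table concatenates several hashes h(v) = floor((a.v + b) / r).
        self.params = [
            [([rng.gauss(0.0, 1.0) for _ in range(w)],
              rng.uniform(0.0, bucket_width))
             for _ in range(hashes_per_table)]
            for _ in range(num_tables)
        ]
        self.tables = [defaultdict(list) for _ in range(num_tables)]
        self.series = []

    def _key(self, table_idx, window):
        """Bucket key of a window in one table: a tuple of integer hashes."""
        return tuple(
            math.floor((sum(ai * xi for ai, xi in zip(a, window)) + b)
                       / self.bucket_width)
            for a, b in self.params[table_idx]
        )

    def build(self, series):
        """Hash every length-w window of the series into every table."""
        self.series = list(series)
        for offset, window in sliding_windows(self.series, self.w):
            for t, table in enumerate(self.tables):
                table[self._key(t, window)].append(offset)

    def query(self, q, eps):
        """Return sorted offsets of colliding windows within distance eps of q."""
        candidates = set()
        for t, table in enumerate(self.tables):
            candidates.update(table.get(self._key(t, q), ()))
        # Verify candidates with the true distance to discard false positives.
        return sorted(
            o for o in candidates
            if euclidean(self.series[o:o + self.w], q) <= eps
        )


def linear_scan(series, q, eps):
    """Baseline: compare q against every window of the same length."""
    return [o for o, win in sliding_windows(series, len(q))
            if euclidean(win, q) <= eps]
```

Because candidates are verified against the true distance, this sketch returns no false positives, but it can miss true matches that never collide with the query in any table. Raising `num_tables` or `bucket_width` trades more candidate checks for higher recall.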

Authors: Liu Genping (刘根平), Chen Yefang (陈叶芳), Du Chengtou (杜呈透), Qian Jiangbo (钱江波)

Affiliation: College of Information Science and Engineering, Ningbo University

Source: Telecommunications Science (《电信科学》), Peking University Core Journal, 2015, Issue 8, pp. 63-71 (9 pages)

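Funding: National Natural Science Foundation of China (No. 61472194); Zhejiang Provincial Natural Science Foundation (No. LY13F020040); Ningbo Natural Science Foundation (No. 2014A610023, No. 2015A610119); Open Fund of the Zhejiang Provincial Top Key Discipline of "Information and Communication Engineering"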

Keywords: time subsequence, locality sensitive hashing (LSH), match searching

Classification: TP301.6 [Automation and Computer Technology: Computer System Architecture]

References (22)

1. Fu T. A review on time series data mining. Engineering Applications of Artificial Intelligence, 2011, 24(1): 164-181.
2. Faloutsos C, Ranganathan M, Manolopoulos Y. Fast subsequence matching in time-series databases. Proceedings of ACM SIGMOD Conference, Minneapolis, USA, 1994.
3. Moon Y S, Whang K Y, Loh W K. Duality-based subsequence matching in time-series databases. Proceedings of the 17th International Conference on Data Engineering, Heidelberg, Germany, 2001: 263-272.
4. Buaba R, Homaifar A, Gebril M, et al. Satellite image retrieval using low memory locality sensitive hashing in Euclidean space. Earth Science Informatics, 2011, 4(1): 17-28.
5. Yu Y, Crucianu M, Oria V, et al. Local summarization and multi-level LSH for retrieving multi-variant audio tracks. Proceedings of the 17th ACM International Conference on Multimedia, Beijing, China, 2009: 341-350.
6. Paisitkriangkrai S, Mei T, Zhang J, et al. Scalable clip-based near-duplicate video detection with ordinal measure. Proceedings of the ACM International Conference on Image and Video Retrieval, Xi'an, China, 2010: 121-128.
7. Liu Z, Liu T, Gibbon D, et al. Effective and scalable video copy detection. Proceedings of the 11th ACM International Conference on Multimedia Information Retrieval, Philadelphia, USA, 2010: 119-128.
8. Agrawal R, Faloutsos C, Swami A. Efficient similarity search in sequence databases. Proceedings of the 4th International Conference FODO, Chicago, USA, 1993: 69-84.
9. Indyk P, Motwani R. Approximate nearest neighbors: towards removing the curse of dimensionality. Proceedings of the 30th Annual ACM Symposium on Theory of Computing, Dallas, USA, 1998: 604-613.
10. Gionis A, Indyk P, Motwani R. Similarity search in high dimensions via hashing. Proceedings of the 25th International Conference on Very Large Data Bases (VLDB), Edinburgh, UK, 1999: 518-529.

