期刊文献+

基于形态的时间序列相似性度量研究 被引量:33

Research on Shape-Based Time Series Similarity Measure
下载PDF
导出
摘要 时间序列重新描述和相似性度量是时间序列数据挖掘的研究基础,对提高挖掘任务的效率和准确性至关重要。该文提出了一种新的基于形态的时间序列符号描述,并给出相应的距离公式,以度量时间序列的相似性。该方法直观简洁,对数据的平移、伸缩不敏感,能够反映序列趋势变化的程度、去除噪声的影响,满足时间多分辨率要求。仿真结果表明,该方法具有较好的聚类性能,可以在不同分辨率下有效度量时间序列的形态相似性。 The representation and similarity measure of time series are the basis of time series research, which is quite important to improving the efficiency and accuracy of the time series data mining. This paper proposes a shape-based discrete symbolic representation and its corresponding distance measure to measure the similarity between time series. The present method is intuitive and compact, and not sensitive to the shifting, amplitude scaling, compression and stretch of data. The method can reflect the degree of the dynamic change of the tendency and erase the influence of the noises, and it has multi-scale characterization. The experimental results show that the approach has good effect in clustering, which can measure the shape-similarity of time series effectively under various analyzing frequency.
出处 《电子与信息学报》 EI CSCD 北大核心 2007年第5期1228-1231,共4页 Journal of Electronics & Information Technology
基金 国家自然科学基金(60275020)资助课题
关键词 时间序列 数据挖掘 相似性度量 重新描述 Time series Data mining Similarity measure Representation
  • 相关文献

参考文献8

  • 1Chung Fu-Lai,Fu Tak-Chung,Ng V,and Luk Rt W P.An evolutionary approach to pattern-based time series segmentation.IEEE Trans.on Evolutionary Computation,2004,8(5):471-489.
  • 2Shatkay H and Zkonik S B.Approximate queries and representations for large data sequences.in Proc.Int.Conf.Data Engeering.Los Alamitos,CA:IEEE Computer Society Press,1996:536-545.
  • 3Das G,Lin K I,and Mannila H.Rule discovery from time series,in Proc.ACM SIGKDD Int.Conf.Knowledge Discovery Data Mining,New York City,1998:16-22.
  • 4Kai O Y,Jia W,Zhou P,and Meng X.A new approach to transforming time series into symbolic sequences,in Proc.1st Joint BMES/EMBS Conf.,Atlanta,GA,USA Oct.1999,vol.2:974.
  • 5王达,荣冈.时间序列的模式距离[J].浙江大学学报(工学版),2004,38(7):795-798. 被引量:40
  • 6Keogh E and Pazzani M.An enhanced representation of time series which allows fast and accurate classification,clustering and relevance feedback.In proceedings of the 4th Int'l Conference on Knowledge Discovery and Data Mining.New York,NY,Aug 27-31.1998:239 241.
  • 7姜宁,史忠植.文本聚类中的贝叶斯后验模型选择方法[J].计算机研究与发展,2002,39(5):580-587. 被引量:21
  • 8Hyndman R J.http://www-personal.buseco.monash.edu.au/~hyndman/tsdl/data/FVD1.dat.2002-12.

二级参考文献17

  • 1[1]H H Bock.Probabilistic models in cluster analysis.Computational Statistics & Data Analysis,1996,23:5~28
  • 2[2]Chris Fraley,Adrian E Raftery.Model-based clustering,discriminate analysis,and density estimation.Department of Statistics,University of Washington,Tech Rep:380,2000
  • 3[3]Petri T Kontkanen,Petri J Myllymaki,Henry R Tirri.Comparing Bayesian model class selection criteria by discrete finite mixtures.In:D L Dowl,K B Korb,J J Oliver eds.Information,Statistics and Induction in Science (Proc of the ISIS'96 Conf in Melbourne.Australia,1996).Singapore:World Scientific,1996.364~374
  • 4[4]An Introduction to Cluster Analysis for Data Mining.http://www.cs.umn.edu/classes/Spring-2000/csci5980-dm/cluster-survey.pdf
  • 5[5]高等数理统计.超星数字图书馆.http://www.ssreader.com.cn.442~444(Advanced Mathematical Statistics (in Chinese),Superstar Digital Library.http://www.ssreader.com.cn.442~444)
  • 6[6]Jeff A Bilmes.A gentle tutorial of the EM algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models.Computer Science Division Department of Electrical Engineering and Computer Science,U C Berkeley,Tech Rep:TR-97-021,1998
  • 7[7]R E Kass,A E Raftery.Bayesian factors and model uncertainly.Department of Statistics,Carnegie-Mellon University,Tech Rep:571,1993
  • 8[8]I J Good.Weight of evidence:A brief survey.In:J M Bernade ed.Bayesian Statistics 2.New York:Elsevier,1985.249~269
  • 9[9]贝叶斯统计推断.超星数字图书馆.http://www.ssreader.com.cn(Bayesian Inferential Statistics (in Chinese).Superstar Digital Library.http://www.ssreader.com.cn)
  • 10[10]P Cheeseman,J Stutz.Bayesian Classification (AutoClass):Theory and results.In:U M Tayyad ed.Knowledge Discovery in Data Bases II.AAAI Press /The MIT Press,1995.153~180

共引文献59

同被引文献316

引证文献33

二级引证文献158

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部