SAX结合Adaboost算法的时间序列分类问题

Research on Time Series Data Classification Combine SAX and AdaBoost Algorithm

下载PDF

导出

摘要 SAX是一种典型的符号化特征表示方法.该方法在时间序列特征表示中不仅可以有效地降维、降噪,而且具有简单、直观等特点.时间序列长度不一、特征表示过程中信息损失等问题的存在,使得常规的分类算法难以很好地完成分类任务.在对时间序列数据进行基于SAX符号化的BOP表示方法的基础上,提出了结合集成学习中AdaBoost算法进行分类的新方法,实验结果表明,该方法不仅能很好地处理SAX符号化表示中的信息损失问题,而且与已有方法相比,在分类准确度方面也有了显著的提高. Symbolic Aggregate approXimation （SAX） is a typical symbolic representation method, which is straight-for- ward and very simple, and it efficiently converts time series data to a symbolic representation with dimension reduction. The is- sues of time series data such as variable in length, and information lose during the representation, making many traditional clas- sification methods unable to apply directly. This paper focus on the SAX discretization method coupled with the Bag of Patterns （BOP） representation in classification task, and proposed the new approach by use AdaBoost Algorithm to remedy the informa- tion loss by SAX representation. The experimental results show that, the approach improved the classification accuracy obvi- ously.

作者宋玉高明磊宋伟

机构地区郑州大学信息工程学院

出处《河南师范大学学报（自然科学版）》 CAS 北大核心 2015年第3期155-160,共6页 Journal of Henan Normal University(Natural Science Edition)

基金国家自然科学基金(61202207) 河南省教育厅科学技术研究重点项目(13A520453)

关键词时间序列分类 SAX BOP ADABOOST time series classification SAX BOP AdaBoost

分类号 TP311 [自动化与计算机技术—计算机软件与理论]

引文网络
相关文献

参考文献16

1Fu T. A review on time series data mining[J]. Engineering Applications of Artificial Intelligence, 2011,24 (1): 164-181.
2李海林,郭崇慧.时间序列数据挖掘中特征表示与相似性度量研究综述[J].计算机应用研究,2013,30(5):1285-1291. 被引量：65
3Lin J, Keogh E, Lonardi S, et al. A symbolic representation of time series, with implications for streaming algorithms[C]. Proc of the 8th ACM SIGMOD Workshop on Research issues in data mining and knowledge discovery(DMKD '03) ,San Diego,2003.
4Lin J, Keogh E,Wei L, et al. Experiencing SAX: a novel symbolic representation of time series[J]. Data Mining and Knowledge Discover- y, 2007,15(2) : 107-144.
5Junejo I, Aghbari Z. Using SAX representation for human action recognition[J]. Journal of Visual Communication and Image Represen ration,2012,23(6) : 853-861.
6Afroni M, Sutanto D, Stirling D. Analysis of Nonstationary Power-Quality Waveforms Using Iterative Hilbert Huang Transform and SAX Algorithm[J]. IEEE Transactions on Power Delivery, 2013,28(4) : 2134-2144.
7Oates T,Mackenzie C,Stein D,et al. Exploiting Representational Diversity for Tirne Series Classification[C]. Proc of llth Int Conf on Machine Learning and Applications(ICMLA '12), Boca Raton, 2012.
8Freund Y,Schapire R. A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting[J]. Journal of Computer and System Sciences, 1997, 55(1) : 119-139.
9Keogh E, Pazzani M. Derivative dynamic time warping[-C]. Proc of 1st Int Conf on Data Mining, Chicago,2001.
10Chen L,Ng R. On the marriage of lp-norms and edit distance[C]. Proe of 30th Int Conf on Very Large Data Bases(VLDB 04), Morgan Kaufmann,2004.

二级参考文献76

1李爱国,覃征.在线分割时间序列数据[J].软件学报,2004,15(11):1671-1679. 被引量：27
2肖辉,胡运发.基于分段时间弯曲距离的时间序列挖掘[J].计算机研究与发展,2005,42(1):72-78. 被引量：59
3李爱国,覃征.大规模时间序列数据库降维及相似搜索[J].计算机学报,2005,28(9):1467-1475. 被引量：20
4HAN J W,KAMBER M,PEI J. Data mining:concepts and techniques [ M]. 3rd ed. San Francisco:Morgan Kanfmann Publishers, 2011.
5P.ENG C K, HAVLIN S, STANLEY H E, et al. Quantification of scaling exponents and crossover phenomena in nonstationary heartbeat time series [ J ]. Chaos, 1995,5 ( 1 ) :83- 88.
6YANG Qiang,WU Xin-dong. 10 challenging problems in data mining research[ J]. Intemational Journal of Information Technology & Decision Making,2006,5(4) :597-604.
7FU T C. A review on time series data mining[J]. Engineering Appli- cations of Artificial Intelligence,2011,24( 1 ) : 164-181.
8RATANAMAHATANA C, KEOGH E, BAGNALL T, et al. A novel bit level time series representation with implications for similarity search and clustering [ C]//Proc of the 9th Pacific-Asia Conference on Knowledge Discovery and Data Mining. 2005:771-777.
9KEOGH E, LIN J, FU A. Hot SAX:efficiently finding the most unusu- al time series subsequence[ C]//Proc of the 5th IEEE International Conference on Data Mining. 2005:226-233.
10AGRAWAL R, FALOUTSOS C, SWAMI A. Efficient similarity search in sequence databases[ C ]//Proc of the 4th International Conference on Foundations of Data Organization and Algorithms. Washington DC : IEEE Computer Society, 1993:69- 84.

共引文献64

1刘春柳,张征.城市用水量曲线聚类算法的研究与实现[J].中国科技论文在线精品论文,2020(2):212-220.
2林莽.林莽散文选[J].岁月,2000(7):27-29.
3杨悦,杨永安,胡绍林.逐段回归近似的卫星遥测数据挖掘算法与仿真[J].计算机仿真,2013,30(8):109-112. 被引量：6
4贾永锋,闫宏图,阎红灿.EMD-BP神经网络预测模型及应用[J].计算机时代,2014(2):1-4. 被引量：1
5尚军,陈莉,汤宏胜,张苍松,李华.基于IRST的谱图相似性查找方法研究[J].计算机与应用化学,2014,31(3):333-336.
6郑宝芬,苏宏业,罗林.无监督特征选择在时间序列数据挖掘中的应用[J].仪器仪表学报,2014,35(4):834-840. 被引量：15
7吴学雁,莫赞.基于分层动态时间弯曲的序列相似性度量方法研究[J].计算机应用研究,2014,31(5):1370-1373. 被引量：2
8李海林.基于动态弯曲的时间序列异步相关性分析[J].计算机应用研究,2014,31(7):1976-1979. 被引量：6
9郑旭,盛立辉,崔宵语.基于小波熵的时间序列分段聚合近似表示[J].计算机仿真,2015,32(1):411-415. 被引量：7
10吴明辉,许爱强,孙伟超,裘璐光.多属性单一趋势结构时序数据的聚类模型[J].计算机工程与设计,2015,36(4):1058-1062. 被引量：1

1张庆吉,左启民.应用计算机技术实现火电厂辅助系统的监控[J].东北电力技术,2006,27(10):36-38. 被引量：1
2李浩,王晓东,张佩响.提高发电厂BOP鲁棒性的策略研究[J].电气传动自动化,2014,36(5):25-28.
3宋伟,张帆,叶阳东,韩鹏,范明.基于SAX方法的时间序列分类问题的多阶段改进研究[J].计算机工程与科学,2016,38(5):988-996. 被引量：5
4耿晓峰,刘卫国.辅助车间集中控制网在发电厂中的应用[J].浙江电力,2010,29(11):52-54. 被引量：3
5十大恶劣天气.《火线狂飙——天堂》最新消息[J].大众软件,2008(21):90-90.
6付奎.大型火力发电厂辅助车间系统控制方式及网络结构的研究[J].科技创新与应用,2016,6(34):16-17. 被引量：1
7刘博,郭建胜.改进的多元时间序列符号化表示方法研究[J].计算机仿真,2015,32(1):314-317. 被引量：3
8江松.Netlinx网络构架在大型火力发电厂BOP中的应用[J].电子世界,2014(14):56-57. 被引量：1
9吴永存.高度集中的辅控网在国华宁海电厂的实现[J].自动化博览,2007,24(3):70-72. 被引量：9
10范小平,徐锦林.印刷企业应适时构建自己的BCP[J].印刷杂志,2006(3):40-42.

河南师范大学学报（自然科学版）

2015年第3期

浏览历史

内容加载中请稍等...

SAX结合Adaboost算法的时间序列分类问题

参考文献16

二级参考文献76

共引文献64

相关作者

相关机构

相关主题

浏览历史