
Application and Practice of ELK and LSTM in System Log Fault Prediction (cited by: 1)
Abstract As business systems grow in scale, their structure becomes increasingly complex. Conventional rule-based methods struggle to diagnose compound faults that arise from interactions among multiple systems, and they are also poorly suited to predicting potential faults. For a complex environment with multiple business systems, this paper uses the ELK platform to manage logs centrally, sorts out the relationships between the logs and the business systems, hosts, and processes, and filters out the log files directly related to faults. These large volumes of log data are then used to train an LSTM model in the deep learning framework TensorFlow, enabling real-time fault prediction for the system.
Authors XU Zhi-Bin (徐志斌), YE Han (叶晗), WANG Han (王晗), GAO Yi-Hao (郜义浩) (Beijing Capital Highway Development Group Co. Ltd., Beijing 100161, China; Beijing Yunxingyu Traffic Technology Co. Ltd., Beijing 100078, China)
Source Computer Systems & Applications (《计算机系统应用》), 2020, No. 7, pp. 264-267 (4 pages)
Keywords ELK; LSTM; failure prediction; deep learning; TensorFlow
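The abstract describes filtering fault-related logs and using them to train an LSTM, but this record does not say how the logs are turned into training samples. A common approach for sequence models is a sliding window over encoded log events, labelled by whether a fault follows the window. The sketch below illustrates that idea only; the log format, the `EVENT_IDS` vocabulary, the choice of `FATAL` as the fault marker, and the `window` and `horizon` sizes are all hypothetical and are not taken from the paper.

```python
from typing import Iterable, List, Tuple

# Hypothetical vocabulary: log level token -> integer event ID.
EVENT_IDS = {"INFO": 0, "WARN": 1, "ERROR": 2, "FATAL": 3}
# Assumption for illustration: a FATAL line marks a fault.
FAULT_EVENTS = {"FATAL"}


def encode(lines: Iterable[str]) -> List[Tuple[int, bool]]:
    """Map each log line to (event_id, is_fault); skip lines with no known level."""
    encoded = []
    for line in lines:
        tokens = line.split()
        for name, event_id in EVENT_IDS.items():
            if name in tokens:
                encoded.append((event_id, name in FAULT_EVENTS))
                break
    return encoded


def make_windows(events: List[Tuple[int, bool]], window: int = 3,
                 horizon: int = 1) -> List[Tuple[List[int], int]]:
    """Build (sequence, label) pairs.

    The label is 1 if any of the `horizon` events right after the window
    is a fault, otherwise 0.
    """
    samples = []
    for start in range(len(events) - window - horizon + 1):
        seq = [event_id for event_id, _ in events[start:start + window]]
        future = events[start + window:start + window + horizon]
        label = int(any(is_fault for _, is_fault in future))
        samples.append((seq, label))
    return samples


# Hypothetical log lines, e.g. as exported from a centralized log store.
LOGS = [
    "t1 host-a INFO service started",
    "t2 host-a WARN slow disk",
    "t3 host-a ERROR io timeout",
    "t4 host-a ERROR io timeout",
    "t5 host-a FATAL service down",
    "t6 host-a INFO service restarted",
]

samples = make_windows(encode(LOGS), window=3, horizon=1)
print(samples)  # [([0, 1, 2], 0), ([1, 2, 2], 1), ([2, 2, 3], 0)]
```

Each `(sequence, label)` pair could then be fed to a sequence classifier such as an LSTM; a larger `horizon` trades earlier warning for noisier labels.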
