摘要
本论文针对传统的人工运维方式的不足,提出依据现有的专家经验和以往积累的已有经结合,实现自动化运维的方式。本文首先介绍自动化运维的整体架构,通过zookeeper消息发送实现指标采集入库,发现指标异常进行告警分析,通过任务调度平台调用executor执行器进行对应的python脚本自动化运维诊断,提升运维效率,降低成本。
In view of the shortcomings of the traditional manual operation and maintenance methods, this paper proposes a way to realize automatic operation and maintenance based on the existing expert experience and the existing accumulated experience. This paper first introduces the overall architecture of the automated operation and maintenance, through the zookeeper sending message to achieve the metrics collection and storage, find the indicators abnormal alarm analysis, then by calling the executor through the task scheduling platform to carry out the corresponding python script automatic operation and maintenance diagnosis, improve operation and maintenance efficiency, cut costs.
作者
程聪
周品秀
CHENG Cong;ZHOU Pin-xiu(Nanrui Group Co.,Ltd.,NanjingJiangsu 211000;Nanjing Cornerstone Data Technology Co.,Ltd.,Nanjing Jiangsu 210093)
出处
《数字技术与应用》
2018年第7期190-191,共2页
Digital Technology & Application