
A Multi-Dimensional Root Cause Localization Algorithm for Microservices (cited by: 1)
Abstract: As container virtualization technologies such as Docker have matured, their scalability and flexibility have proven a natural fit for the microservice architecture. Industry increasingly deploys microservice applications in container-based cloud environments and uses container orchestration tools such as Kubernetes to manage the full application life cycle. In such a complex architecture, using artificial intelligence techniques to detect anomalies efficiently and locate their root causes becomes a top priority. First, this article summarizes the main challenges and existing key technologies for anomaly detection and root cause localization in microservice systems. Then, to address the incomplete coverage of existing anomaly detection techniques, it proposes a multi-dimensional anomaly detection method based on unsupervised learning. The method builds on call chain Trace data and combines it with service and machine resource utilization data, so that it detects service response time anomalies and also identifies service resource utilization anomalies and environmental anomalies. Finally, for the case where anomalies are already known, and in order to shorten root cause localization time, widen the localization scope, and refine its granularity, it proposes a lightweight method based on an anomaly propagation subgraph, which unifies data from two dimensions, service interfaces and machine nodes, into the anomaly propagation subgraph for root cause localization. Experiments show that, compared with existing methods, the proposed method has a shorter localization time, broadens the root cause localization scenarios, and noticeably improves accuracy.
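The abstract does not give the algorithm itself, so the following is only a minimal sketch of the general idea of an anomaly propagation subgraph, not the authors' method. It assumes that anomalies propagate from a dependency to whatever depends on it, that the anomalous set has already been produced by a detector, and that service interfaces and machine nodes share one dependency graph. All names (`depends_on`, `root_cause_candidates`, the service and node labels) are hypothetical.

```python
from collections import defaultdict

# Hypothetical dependency graph: each key depends on the entries in its set.
# Service interfaces and machine nodes live in the same graph (assumption).
depends_on = {
    "frontend:/checkout": {"order:/create", "node-1"},
    "order:/create": {"payment:/pay", "node-2"},
    "payment:/pay": {"node-3"},
    "cart:/get": {"node-1"},
}


def build_propagation_subgraph(depends_on, anomalous):
    """Keep only the dependency edges whose two endpoints are both anomalous."""
    sub = defaultdict(set)
    for src, dsts in depends_on.items():
        if src not in anomalous:
            continue
        for dst in dsts:
            if dst in anomalous:
                sub[src].add(dst)
    return sub


def root_cause_candidates(depends_on, anomalous):
    """Return anomalous vertices that have no anomalous dependency.

    Under the propagation assumption, such vertices cannot have inherited
    their anomaly from elsewhere in the subgraph, so they are candidates.
    """
    sub = build_propagation_subgraph(depends_on, anomalous)
    return sorted(n for n in anomalous if not sub.get(n))


if __name__ == "__main__":
    detected = {"frontend:/checkout", "order:/create", "node-2"}
    print(root_cause_candidates(depends_on, detected))
```

With the hypothetical inputs above, the anomalous interfaces `frontend:/checkout` and `order:/create` both have an anomalous dependency, so only the machine node `node-2` remains as a candidate. A real system would also need to rank multiple candidates, which this sketch does not attempt.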
Authors: SHI Yuan (施园); LI Yang (李杨); ZHAN Mengqi (詹孟奇) (Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China; School of Cyber Security, University of Chinese Academy of Sciences, Beijing 100049, China)
Source: Netinfo Security (《信息网络安全》), CSCD, PKU Core, 2023, No. 3, pp. 73-83 (11 pages)
Funding: National Key Research and Development Program of China [2019YFB1005200, 2019YFA1005201].
Keywords: container; microservices; Kubernetes; anomaly detection; root cause localization