摘要
利用深度学习模型训练和运行维护过程产生的海量日志信息,进行模型的优化与故障排查,是当前人工智能运维的研究热点.针对现有工作缺少模型工作流分析的问题,提出面向模型服务的日志异常可视分析方法ModelLogVis.该方法采用日志异常检测方法定位模型工作流中的潜在故障,帮助用户聚焦主要的故障类型;支持用户从数据流、状态、实例性能和原始日志等多个角度对工作流中的事件进行交互式可视化与分析,快速、准确地排查问题.通过真实的模型服务数据的案例研究和专家访谈,证明ModelLogVis方法可高效地辅助用户快速挖掘日志中的异常信息.
Recently it is a hot topic to utilize massive log information of deep learning models for model optimization and troubleshooting in artificial intelligence operation.To address the challenge of model workflow analysis,we propose ModelLogVis,a visual analysis approach for diagnosing log abnormality in model services.Our approach employs a log anomaly detection method to locate the potential faults in the model workflow,guiding users to focus on the significant fault types.We integrated visual interface illustrates events of the workflow from multiple perspectives,including dataflow,status,instance performance,and original logs,and supports users to progressively analyze the faults in the workflow.Case studies of real datasets and expert interviews demonstrate that our approach is highly efficient in helping users quickly uncover anomalous information in logs.
作者
卢裕弘
朱琳
封颖超杰
王斯加
林正轩
潘嘉铖
陈为
Lu Yuhong;Zhu Lin;Feng Yingchaojie;Wang Sijia;Lin Zhengxuan;Pan Jiacheng;Chen Wei(State Key Laboratory of CAD&CG,Zhejiang University,Hangzhou 310058;Zhejiang Lab,Hangzhou 311121)
出处
《计算机辅助设计与图形学学报》
EI
CSCD
北大核心
2024年第7期1106-1114,共9页
Journal of Computer-Aided Design & Computer Graphics
基金
国家自然科学基金(62132017,61972122).
关键词
可视分析
日志可视化
异常检测
visual analysis
log visualization
anomaly detection