摘要
软件监测是评价大规模分布式系统行为是否符合预期的重要手段,也是进一步实施在线调整和演化活动的前提和基础。给出了一个名为Mega的分布式软件系统运行时监测框架,可以支持具备监测能力的分布式软件系统的开发和运行。一方面,该框架通过引入一种专门的监测需求描述语言,在分布式系统中实现了监测逻辑和功能逻辑的分离,使得监测逻辑可以高效开发和独立维护。另一方面,该框架通过引入为监测数据处理而设计的事件模型,使得大规模系统的监测具备可伸缩和可扩展性。在阐述Mega框架特点的基础上,进一步给出了基于这一框架的若干监测设计模式,可以有效支持具备监测能力的大规模系统开发。
Software monitoring is an important means to check whether a large-scale distributed sys- tem complies with its intended behavior, which is the foundation of further online adjustment and evolu- tion activities. This paper proposes a framework named Mega that supports the development and run ning of monitoring-enabled distributed software systems. On the one hand, a dedicated language is in- troduced into the framework to describe where and what should be monitored at runtime, which sepa- rates the monitoring logic and the functional logic and enables the former to be developed efficiently and maintained independently. On the other hand, an event-based model for monitoring data processing is introduced, which is scalable and extensible in large-scale distributed systems. Based on the description of the feature of the Mega framework, this paper presents a set of monitoring design patterns as well, which can support the development of large-scale distributed systems with monitoring capability.
出处
《计算机工程与科学》
CSCD
北大核心
2013年第6期24-29,共6页
Computer Engineering & Science
基金
国家自然科学基金资助项目(90818028
61202117)
关键词
运行时监测
框架
分布式软件
runtime monitoring
software framework ~ distributed software