摘要
机群监测系统是用来管理机群,方便用户使用机群的软件系统.针对当前已有的机群管理系统在时效性、健壮性等方面的不足,提出了一种基于异步时钟的服务器监测技术,通过把指定的服务器集合组成一个具有反馈机制的环形队列的方法,使得用户能够将机群作为一个整体来进行管理.该系统能够透明地加入和删除服务器节点,自动重新配置以达到高可用性.本文采用的线程池技术和I/O多路转换技术能够有效提高机群监测系统的反应时间.实际的测试结果证明,该系统能够根据传输延迟、丢包率、系统繁忙程度、服务器或网络故障等情况采取适当的对策,在短时间内发现和排除故障,可以较好地应用于邮件服务器、WEB服务器等事务性机群处理系统中.
Cluster monitor system is a management software system, which makes the control of the cluster system very easy. At current time, most of the cluster monitor systems have some faults in real-time and robust. In this paper, a new mechanism to improve the cluster monitor system based on asynchronous clock is proposed. It connects the specified servers into a feed- back queue, so the users can treat the cluster system as one. The monitor system can add or delete certain nodes in the cluster system transparently. Furthermore, the new monitor system can reconfigure automatically to achieve high availability. In order to reduce the latency of the cluster monitor system, two technologies are used. One is the thread poll, the other is the I/O multiplexing. The results show that the new cluster monitor system can find an appropriate way to deal with the situations such as latency, ratio of the data losing, CPU utilization, fail of the server or network. The new cluster monitor system is very suited for the cluster system such as mail server, WEB server or other businesslike systems.
出处
《小型微型计算机系统》
CSCD
北大核心
2005年第9期1617-1620,共4页
Journal of Chinese Computer Systems
基金
国家"八六三"重点项目(2001AA11110
2004AA111120)资助.
关键词
机群
异步时钟
成员协议
高可用
远程过程调用
cluster t asynchronous clock
membership protocol
credibility t remote procedure call