期刊文献+

自适应大规模服务器集群监控系统的构建 被引量:5

Design and Implementation of an Adaptive Monitoring System for Large-Scale Server Clusters
下载PDF
导出
摘要 针对大规模服务器集群监控系统域大小固定而不能动态适应集群规模变化的缺点,提出了自适应动态域算法.通过计算监控引入的负载,根据负载容忍度要求,动态改变域的个数和大小,以适应集群规模变化.同时,提出了浮动域监控节点的方法,以应对域监控节点负载过重的情况,即在域监控节点负载较重时,自适应地选择新的域监控节点,并完成节点角色的自动更新.采用自适应的线程池和数据库连接池可降低监控任务的开销,加快执行任务的响应时间.监控CPU利用率的测试结果表明,系统执行单任务的响应时间为128 ms,20个任务的响应时间为242 ms;数据库连接池的性能测试结果表明,在300 s内,采用了自适应池、不采用自适应池方案所耗处理器资源分别为1.80%和7.53%,处理任务数分别为12 364和2 769. An adaptive dynamic domain algorithm has been proposed to solve the issue that the fixed domain can not adapt to the variation of the cluster scale. By computing the intrusive load obtained by the monitoring system, the algorithm can control the numbers and the sizes of monitoring domains in an acceptable level according to the intrusion tolerance degree. The fixed domain monitoring node is not efficient as its load is high. A floating domain monitoring node method has been proposed to re-select the domain monitoring node and automatically update the roles of the nodes when the load of the node is high. To reduce the overhead incurred by monitoring system and improve the response time of executing monitoring tasks, an adaptive thread pool and a database connection pool have been designed. The measurements on monitoring CPU usage show that the response time of monitoring one node is 128ms, and that of monitoring 20 nodes is 242ms. A comparative performance measurement has been performed' between the solutions with an adaptive database connection pool and that without a pool. The results show that the average intrusion on CPU usage in 300 seconds is 1.80% with a pool and 7.53% without a pool, and the numbers of completed tasks are 12 364 and 2 769, respectively.
出处 《西安交通大学学报》 EI CAS CSCD 北大核心 2008年第4期399-403,共5页 Journal of Xi'an Jiaotong University
基金 国家自然科学基金资助项目(60773118) 国家高技术研究发规划资助项目(2004AA111110,2006AA01A109)
关键词 服务器集群 监控系统 自适应池 server cluster monitoring system adaptive pool
  • 相关文献

参考文献5

  • 1ALMASI G, BACHEGA L, BELLOFATTO R, et al. System management in the ,bluegene/L supereomputer [C]//Parallel and Distributed Processing Symposium. Los Alamitos, USA:IEES Computer SocietY, 2003:1- 8.
  • 2BUYYA R. PARMON: a portable and scalable monitoring system for Cluster[J]. Software PraCtice and Experience, 2000, 30(7) :723-739.
  • 3SOTTILE MJ, MINNICH R G. ‘Supermon: a high- Speed cluster' monitoring system [C]//2002 IEEE International Conference on Cluster Computing. Piscataway, USA: IEEE, 2002:39-46.
  • 4MASSIE M L, CHUN B N, CULLER D E. The ganglia distrlbuted monitoring system: design, implementation, arid experience[J]: Parallel Computing,2004, 30(7):817-840.
  • 5TURCK F D, VANHASTEL S, VOLCKAERT B, et al. A generic middleware-based platform for Scalable cluster computing [J]. Future Generation Computer Systems, 2002, 18 (4) : 549-560.

同被引文献46

  • 1易昭华,金正操,杜晓黎.大规模机群监控系统数据采集通信模型和通信协议的研究[J].计算机工程与应用,2004,40(35):116-118. 被引量:2
  • 2何丽萍,刘立程.改进的基于Ganglia的网格监控系统[J].广东工业大学学报,2006,23(1):85-89. 被引量:9
  • 3王霓虹,张涛.基于Java的数据库连接池设计方案的研究[J].信息技术,2006,30(3):102-106. 被引量:5
  • 4陈燕晖,罗宇.Linux内核数据采集的一种有效方法[J].计算机应用与软件,2006,23(6):51-52. 被引量:3
  • 5任建基,胡延平,陈俊峰,穆林涛.基于WMI技术的局域网计算机设备的监测[J].计算机工程与应用,2006,42(25):134-136. 被引量:24
  • 6中国软件行业协会数学软件分会,国家863高性能计算机评测中心,中国计算机学会高性能计算专业委员会.2008年中国高性能计算机性能TOP100排行榜[EB/OL].(2009-03-13)[2009-03-28].http://www.samss.org.cn/sites/shuxue/ndhyC.jsp?contentId=2473512102846.
  • 7CHANA K H, LI Ligang, LIAO Xinhao. Modelling the core convection using finite element and finite difference methods [J]. Phys Earth Planet Interiors, 2006, 157(2): 124-138.
  • 8LI Ligang, LIAO Xinhao, ZHANG Keke. Linear and nonlinear instabilities in rotating cylindrical Rayleigh-Benard convection[J/OL]. Physical Review: E, 2008, 78(5):12[2009-02-20]. http://link, aps. org/doi/10. 1103/PbysRevE. 78. 056303.
  • 9I.I Ligang, LIAO Xinhao, ZHANG Keke. Countertraveling waves in rotating Rayleigh 13enard convection [J/OL]. Physical Review: E, 2008, 77(2):4[2009-02 -18]. http://link, aps. org/doi/10. 1103/PhysRevE. 77. 027301.
  • 10李力刚.球壳内行星流体动力学方程组的有限差分法[R]//国家863高效能计算机及网格服务环境项目技术报告.北京:中国科学院,2008:10-20.

引证文献5

二级引证文献34

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部