Comprehensive Efficient HPCS Monitoring System
Abstract: This paper analyzes the monitoring requirements of high performance computing systems (HPCS), chiefly the Lenovo Deepcomp 6800 at the supercomputing center, and compares a large number of existing monitoring techniques. It presents evaluation features and criteria for monitoring systems, then proposes an improved strategy that integrates system, performance, application, and process monitoring. Finally, it discusses how information pipelining, filtering, and a double-mode transfer scheme reduce the volume of monitoring data transferred, lower the resource overhead of monitoring, and improve the performance and efficiency of integrated monitoring.
Source: Application Research of Computers (《计算机应用研究》), indexed in CSCD and the Peking University Core Journals list, 2007, No. 7, pp. 24-27 (4 pages)
Funding: National Natural Science Foundation of China (60533020); National "973" Program of China (2005-CB321702)
Keywords: high performance computing systems (HPCS); monitoring; Clumon+; filtering; double-mode transfer
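The abstract names filtering as one way to cut the volume of monitoring data, but it does not say how the filter works. As a minimal sketch under that caveat, the snippet below shows one common approach: a sample is forwarded only when it differs enough from the last value sent, or when nothing has been sent for too long. The class `ChangeFilter`, the parameters `value_threshold` and `max_silence`, and the `cpu_load` samples are all hypothetical illustrations, not taken from the paper.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Sample = Tuple[str, float, float]  # (metric name, value, timestamp in seconds)


@dataclass
class ChangeFilter:
    """Assumed filter: send a sample only if it changed enough or went stale."""
    value_threshold: float  # hypothetical: minimum absolute change worth sending
    max_silence: float      # hypothetical: seconds after which we send regardless
    _last_sent: Dict[str, Tuple[float, float]] = field(
        default_factory=dict, init=False
    )

    def should_send(self, metric: str, value: float, now: float) -> bool:
        previous = self._last_sent.get(metric)
        if previous is not None:
            last_value, last_time = previous
            changed = abs(value - last_value) >= self.value_threshold
            stale = (now - last_time) >= self.max_silence
            if not (changed or stale):
                return False  # suppress: nothing new to report
        # First sample, significant change, or stale: record it and send.
        self._last_sent[metric] = (value, now)
        return True


def filter_samples(samples: List[Sample], flt: ChangeFilter) -> List[Sample]:
    """Return only the samples that the filter lets through."""
    return [s for s in samples if flt.should_send(*s)]


# Made-up readings from a single node.
samples = [
    ("cpu_load", 0.50, 0.0),
    ("cpu_load", 0.51, 1.0),
    ("cpu_load", 0.52, 2.0),
    ("cpu_load", 0.90, 3.0),
    ("cpu_load", 0.91, 70.0),
]
sent = filter_samples(samples, ChangeFilter(value_threshold=0.1, max_silence=60.0))
print(f"sent {len(sent)} of {len(samples)} samples")
```

The staleness timeout is a design choice in this sketch: it lets a receiver tell a quiet but healthy node apart from one that has stopped reporting.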