期刊文献+

事务处理型容错计算机可用性评测系统设计与实现

Design and implementation of an availability assessment system for transaction processing-oriented fault tolerant computers
下载PDF
导出
摘要 针对事务处理型容错计算机可用性测试中存在的目标系统数量少、测试时长有限的问题,设计了一种可用性评测方法,并实现了相应的可用性评测系统,用于评测事务处理型容错计算机的可用性指标。评测系统由多层次故障注入平台、模拟应用负载、可用性评测套件组成。多层次故障注入平台实现自动化故障注入,针对目标系统施加多批量、多种类故障负载;模拟应用负载能够针对目标系统施加事务处理型工作负载;可用性评测套件用于测试目标系统中各功能子系统和现场可更换部件(FRU)的可靠性连接关系,测试各类FRU的冗余程度,测试各类FRU的平均修复时间,以及测试验证目标系统是否满足指定的可用性设计要求。针对HP Superdome容错服务器进行的评测结果与官方文档一致,证明了该评测系统的有效性。本研究对于事务处理型容错计算机研制商预测系统可用性以及终端用户应用选型具有重要作用。 To overcome the limitation in sample system number and test period during the availability test for a transaction processing-oriented fault tolerant computer, an availability assessment method was proposed and a corresponding as- sessment system was realized. The availability assessment system consists of a multi-level fault injection platform, an application workloads simulator and an availability assessment toolkit. The fault injection platform is designed for automatically injecting various fault-loads into target systems in batches. The application workloads simulator can generate transactions launched by end-users and send them to target systems as workloads. The availability assess- ment toolkit is designed for several tests, including reliability relationship test among functional subsystems, relia- bility relationship test among field replaceable units ( FRUs), redundancy test of different kind of FRUs, mean time to recovery (MTFR) test, and availability validation test. The evaluation results of the tests on HP Superdome fault-tolerant server accord with official documents, which proves the effectiveness of the assessment system. This research is important for computer manufacturers to predict availability metric and it is also important for end-users to verify system availability.
出处 《高技术通讯》 CAS CSCD 北大核心 2012年第9期912-917,共6页 Chinese High Technology Letters
基金 863计划(2008AA01A204)资助项目.
关键词 容错 事务处理 可用性评测 故障注入 fault-tolerant, transaction processing, availability assessment, fault injection
  • 相关文献

参考文献11

  • 1Shooman M L. Reliability of Computer Systems and Net-works :Fault Tolerance, Analysis, and Design. NewYork, USA: John Wiley & Sons, Inc, 2002. 183-186.
  • 2Trivedi K. SHARPE 2002; symbolic hierarchical automa-ted reliability and performance evaluator. In : Proceedingsof International Conference on Dependable Systems andNetworks. Washington DC, USA,2002. 544-544.
  • 3Hirel C,Truffin B, Trivedi K. SPNP: stochastic petrinets version 6. 0. In: Proceedings of 11th InternationalConference on TOOLS. Schaumburg, USA, 2000. 354-357.
  • 4W aller M, Siegle M,Bode A. OpenSESAME: the simplebut extensive, structured availability modeling environ-ment. Reliability Engineering and System Safety,2008,93(6) : 857-873.
  • 5Deavours D D,Clark G,Courtney T, et al. The Mobiusframework and its implementation. IEEE Transactions onSoftware Engineering, 2002,28( 10) : 956-969.
  • 6Zhu J, Mauro J, Pramanick I. R-Cubed ( R3) : rate, ro-bustness ,and recovery-an availability benchmark frame-work :[Technical Report ] , Series # TR-2002-109.Mountain View, CA, USA: Sun Microsystems, Inc,2002. 1-22.
  • 7Kanoun K,Spainhower L. Dependability Benchmarkingfor Computer Systems. New York, USA: John Wiley &Sons, Inc, 2008. 63-79.
  • 8Mukherjee A, Siewiorek D P. Measuring software de-pendability by robustness benchmarking. IEEE Transac-tions of Software Engineering, 1997,23(6) : 366-378.
  • 9左德承,张展,董剑,刘宏伟,杨孝宗.面向事务处理的容错计算机系统结构设计与实现[J].高技术通讯,2008,18(2):111-115. 被引量:3
  • 10Schroeder B,Gibson G A. A large-scale study of failuresin high-performance computing systems. In: Proceedingsof International Conference on Dependable Systems andNetworks, Philadelphia, USA, 2006. 249-258.

二级参考文献6

  • 1董剑,左德承,刘宏伟,杨孝宗.一种基于QoS的自适应网格失效检测器[J].软件学报,2006,17(11):2362-2372. 被引量:12
  • 2Tsai T K, Iyer R K, Jewitt D. An approach towards benchmarking of fault-tolerant commercial systems. In: Proceedings of 26th IEEE International Symposium on Fault Tolerant Computing(FTCS-26), Sendai, Japan, 1996. 314-323.
  • 3Costa D, Carreira J, Silva J G. WinFT: Using off-the-shelf computer on industrial environments. In: Proceedings of the 6th IEEE International Conference on Emerging Technologies and Factory Automation (ETFA' 97 ), Los Angeles, USA, 1997. 39-44.
  • 4Russinovich M, Segall Z. Fault-tolerance for off-the-shelf applications and hardware. In: Proceedings of the 25th International Symposium on Fault-Tolerant Computing, Pasadena, Canada, 1995. 67-71.
  • 5Muller G, Banatre M, Peyrouze N, et al. Lessons from FTM: an experiment in the design & implementation of a low-cost fault-tolerant computer. IEEE Transactions on Reliability, 1996, 45(2): 332-340.
  • 6Hsueh M C, Iyer R K. Dependability and performance measurement. In: IEEE Intenational Workshop on Computer-Aided Design, Test, and Evaluation for Dependability, Beijing, China, 1996. 11-22.

共引文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部