ArchSim: A System-Level Parallel Simulation Platform for the Architecture Design of High Performance Computer (cited by: 4)
Authors: 黄永勤, 李宏亮, 谢向辉, 钱磊, 郝子宇, 过锋, 张昆. Journal of Computer Science & Technology (SCIE, EI, CSCD), 2009, No. 5, pp. 901-912 (12 pages)
A high performance computer (HPC) is a large, complex system whose architecture design faces increasing difficulties and risks. Traditional methods, such as theoretical analysis, component-level simulation, and sequential simulation, are not applicable to system-level simulation of HPC systems. Even parallel simulation on large-scale parallel machines faces many difficulties in scalability, reliability, generality, and efficiency. To meet the current needs of HPC architecture design, this paper proposes a system-level parallel simulation platform, ArchSim. We first introduce the architecture of the ArchSim simulation platform, which is composed of a global server (GS), local server agents (LSA), and entities. Second, we describe key techniques of ArchSim, including the synchronization protocol, the communication mechanism, and the distributed checkpointing/restart mechanism. We then evaluate the main performance indices of ArchSim with the phold benchmark and analyze the extra overhead introduced by ArchSim. Finally, based on ArchSim, we construct a parallel event-driven interconnection network simulator and a system-level simulator for a small-scale HPC system with 256 processors. The results of the performance test and the HPC system simulations demonstrate that ArchSim achieves a high speedup ratio and high scalability on a parallel host machine and supports system-level simulation for the architecture design of HPC systems.
Keywords: high performance computer architecture; system-level parallel simulation; synchronization protocol; message communication; distributed checkpointing/restart