期刊文献+
共找到10篇文章
< 1 >
每页显示 20 50 100
Challenges and reflections on exascale computing
1
作者 Yang Xuejun 《Engineering Sciences》 EI 2014年第3期17-22,共6页
This paper introduces the development of the exascale (10^18) computing. Though exascalc computing is a hot research direction worldwide, we are facing many challenges in the areas of memory wall, communica- tion wa... This paper introduces the development of the exascale (10^18) computing. Though exascalc computing is a hot research direction worldwide, we are facing many challenges in the areas of memory wall, communica- tion wall, reliability wall, power wall and scalability of parallel computing. According to these challenges, some thoughts and strategies are proposed. 展开更多
关键词 exascale COMPUTING CENTRAL processing unit (CPU) storage WALL HETEROGENEOUS PROCESSOR
下载PDF
Exascale Computer百亿亿次计算机
2
《兵器材料科学与工程》 CAS CSCD 北大核心 2020年第2期19-19,共1页
2020年,中国有望赢得建造世界上首台百亿亿次计算机的竞赛,该计算机每秒运算能力为100亿亿次。至于具体哪台超级计算机将成为第一代超级计算机,目前仍不确定,因为在中国已经有3家机构展开了竞争,分别是位于天津和济南的国家超级计算中心... 2020年,中国有望赢得建造世界上首台百亿亿次计算机的竞赛,该计算机每秒运算能力为100亿亿次。至于具体哪台超级计算机将成为第一代超级计算机,目前仍不确定,因为在中国已经有3家机构展开了竞争,分别是位于天津和济南的国家超级计算中心,以及中科曙光(Sugon)。 展开更多
关键词 exascale COMPUTER
下载PDF
Brief Introduction of TianHe Exascale Prototype System 被引量:5
3
作者 Ruibo Wang Kai Lu +7 位作者 Juan Chen Wenzhe Zhang Jinwen Li Yuan Yuan Pingjing Lu Libo Huang Shengguo Li Xiaokang Fan 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2021年第3期361-369,共9页
Facing the challenges of the next generation exascale computing,National University of Defense Technology has developed a prototype system to explore opportunities,solutions,and limits toward the next generation Tianh... Facing the challenges of the next generation exascale computing,National University of Defense Technology has developed a prototype system to explore opportunities,solutions,and limits toward the next generation Tianhe system.This paper briefly introduces the prototype system,which is deployed at the National Supercomputer Center in Tianjin and has a theoretical peak performance of 3.15 Pflops.A total of 512 compute nodes are found where each node has three proprietary CPUs called Matrix-2000+.The system memory is 98.3 TB,and the storage is 1.4 PB in total. 展开更多
关键词 TianHe exascale system PROTOTYPE proprietary CPU Matrix-2000+
原文传递
Storage wall for exascale supercomputing 被引量:3
4
作者 Wei HU Guang-ming LIU +2 位作者 Qiong LI Yan-huang JIANG Gui-lin CAI 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2016年第11期1154-1175,共22页
The mismatch between compute performance and I/O performance has long been a stumbling block as supercomputers evolve from petaflops to exaflops. Currently, many parallel applications are I/O intensive,and their overa... The mismatch between compute performance and I/O performance has long been a stumbling block as supercomputers evolve from petaflops to exaflops. Currently, many parallel applications are I/O intensive,and their overall running times are typically limited by I/O performance. To quantify the I/O performance bottleneck and highlight the significance of achieving scalable performance in peta/exascale supercomputing, in this paper, we introduce for the first time a formal definition of the ‘storage wall' from the perspective of parallel application scalability. We quantify the effects of the storage bottleneck by providing a storage-bounded speedup,defining the storage wall quantitatively, presenting existence theorems for the storage wall, and classifying the system architectures depending on I/O performance variation. We analyze and extrapolate the existence of the storage wall by experiments on Tianhe-1A and case studies on Jaguar. These results provide insights on how to alleviate the storage wall bottleneck in system design and achieve hardware/software optimizations in peta/exascale supercomputing. 展开更多
关键词 围住存储的加速 存储墙 高效计算 exascale 计算 TP338.6
原文传递
Force10发布新型交换机ExaScale E系列
5
《机械工业信息与网络》 2009年第2期36-36,共1页
Force10网络公司本周发布了一个全新的交换机/路由器产品线,旨在改善虚拟化数据中心和云计算环境的性能、管理和成本效益。
关键词 exascale E FORCE10 数据中心 NEXUS 网密度 可用数据 交换架构
原文传递
A Parallel Hybrid Testing Technique for Tri-Programming Model-Based Software Systems
6
作者 Huda Basloom Mohamed Dahab +3 位作者 Abdullah Saad AL-Ghamdi Fathy Eassa Ahmed Mohammed Alghamdi Seif Haridi 《Computers, Materials & Continua》 SCIE EI 2023年第2期4501-4530,共30页
Recently,researchers have shown increasing interest in combining more than one programming model into systems running on high performance computing systems(HPCs)to achieve exascale by applying parallelism at multiple ... Recently,researchers have shown increasing interest in combining more than one programming model into systems running on high performance computing systems(HPCs)to achieve exascale by applying parallelism at multiple levels.Combining different programming paradigms,such as Message Passing Interface(MPI),Open Multiple Processing(OpenMP),and Open Accelerators(OpenACC),can increase computation speed and improve performance.During the integration of multiple models,the probability of runtime errors increases,making their detection difficult,especially in the absence of testing techniques that can detect these errors.Numerous studies have been conducted to identify these errors,but no technique exists for detecting errors in three-level programming models.Despite the increasing research that integrates the three programming models,MPI,OpenMP,and OpenACC,a testing technology to detect runtime errors,such as deadlocks and race conditions,which can arise from this integration has not been developed.Therefore,this paper begins with a definition and explanation of runtime errors that result fromintegrating the three programming models that compilers cannot detect.For the first time,this paper presents a classification of operational errors that can result from the integration of the three models.This paper also proposes a parallel hybrid testing technique for detecting runtime errors in systems built in the C++programming language that uses the triple programming models MPI,OpenMP,and OpenACC.This hybrid technology combines static technology and dynamic technology,given that some errors can be detected using static techniques,whereas others can be detected using dynamic technology.The hybrid technique can detect more errors because it combines two distinct technologies.The proposed static technology detects a wide range of error types in less time,whereas a portion of the potential errors that may or may not occur depending on the 4502 CMC,2023,vol.74,no.2 operating environment are left to the dynamic technology,which completes the validation. 展开更多
关键词 Software testing hybrid testing technique OpenACC OPENMP MPI tri-programming model exascale computing
下载PDF
西班牙将开发首个ARM架构CPU/GPU混合型超级计算机
7
《自动化信息》 2011年第12期87-87,共1页
西班牙巴塞隆纳超级运算中心(Barcelona Supercomputing Center;BSC)宣布将开发一款混合型超级计算机,其中将首次使用Nvidia公司的Tegra ARM CPU及其支持CUDA的Tesla GPU,期望能够实现达百万兆级(Exascale)的运算性能。
关键词 TEGRA ARM TESLA exascale 超级计算机
下载PDF
A Pipelining Loop Optimization Method for Dataflow Architecture 被引量:2
8
作者 Xu Tan Xiao-Chun Ye +6 位作者 Xiao-Wei Shen Yuan-Chao Xu Da Wang Lunkai Zhang Wen-Ming Li Dong-Rui Fan Zhi-Min Tang 《Journal of Computer Science & Technology》 SCIE EI CSCD 2018年第1期116-130,共15页
With the coming of exascale supercomputing era, power efficiency has become the most important obstacle to build an exascale system. Dataflow architecture has native advantage in achieving high power efficiency for sc... With the coming of exascale supercomputing era, power efficiency has become the most important obstacle to build an exascale system. Dataflow architecture has native advantage in achieving high power efficiency for scientific applications. However, the state-of-the-art dataflow architectures fail to exploit high parallelism for loop processing. To address this issue, we propose a pipelining loop optimization method (PLO), which makes iterations in loops flow in the processing element (PE) array of dataflow accelerator. This method consists of two techniques, architecture-assisted hardware iteration and instruction-assisted software iteration. In hardware iteration execution model, an on-chip loop controller is designed to generate loop indexes, reducing the complexity of computing kernel and laying a good f(mndation for pipelining execution. In software iteration execution model, additional loop instructions are presented to solve the iteration dependency problem. Via these two techniques, the average number of instructions ready to execute per cycle is increased to keep floating-point unit busy. Simulation results show that our proposed method outperforms static and dynamic loop execution model in floating-point efficiency by 2.45x and 1.1x on average, respectively, while the hardware cost of these two techniques is acceptable. 展开更多
关键词 dataflow model control-flow model loop optimization exascale computing scientific application
原文传递
E级超算下应用软件发展的挑战与趋势(英文) 被引量:1
9
作者 Guang-wen YANG Hao-huan FU 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2018年第10期1267-1272,共6页
在未来三到五年,多台百亿亿次超级计算机系统将在全球多个国家部署。面对前所未有的超强计算能力与超大并行规模,如何设计和实现相匹配的应用软件,成为当前高性能计算软件发展的重要挑战。以中国神威太湖之光超级计算机系统(峰值计算能... 在未来三到五年,多台百亿亿次超级计算机系统将在全球多个国家部署。面对前所未有的超强计算能力与超大并行规模,如何设计和实现相匹配的应用软件,成为当前高性能计算软件发展的重要挑战。以中国神威太湖之光超级计算机系统(峰值计算能力12.5亿亿次)及其相关应用挑战和解决方案为出发点,根据我们目前的经验,讨论未来十年全球超级计算机的软硬件发展方向,并提出未来高性能计算应用软件开发中将会遇到的潜在挑战和可能的发展趋势。 展开更多
关键词 SUPERCOMPUTING exascale APPLICATION
原文传递
后E级时代高性能处理器架构的探索(英文)
10
作者 Xiang-hui XIE Xun JIA 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2018年第10期1224-1229,共6页
科学计算与工程应用对高性能日益增长的需求将推动高性能计算进入后E级时代。高性能处理器作为超级计算系统核心部件,其架构设计对提高系统性能至关重要。首先介绍后E级时代高性能处理器架构设计的3个目标,即性能有效扩展、资源高效利... 科学计算与工程应用对高性能日益增长的需求将推动高性能计算进入后E级时代。高性能处理器作为超级计算系统核心部件,其架构设计对提高系统性能至关重要。首先介绍后E级时代高性能处理器架构设计的3个目标,即性能有效扩展、资源高效利用和适应多种应用。其次,提出标量运算众核主芯片连接应用加速从芯片的Massa处理器架构,通过计算资源分布和应用定制硬件的结合,满足后E级时代高性能处理器架构设计的目标。最后,讨论了Massa架构未来需要重点研究的若干问题。 展开更多
关键词 HIGH-PERFORMANCE computing Beyond the exascale PROCESSOR architecture Application-customized HARDWARE Distributed computational RESOURCES
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部