10 articles found
Lenovo and Dawning Battle over "Supercomputers"
1
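Author: Cao Zengguang (曹增光). 《IT时代周刊》 (IT Time Weekly), 2004, No. 01S, p. 21 (1 page)
Dawning always seems to be half a beat behind. In 2002 Lenovo moved first and won the PetroChina supercomputer order that was said to belong to Dawning. On December 9, 2003, while Dawning's supercomputer rated at 10 trillion operations per second had yet to make its debut, Lenovo again stepped ahead of Dawning by launching a high-performance computer rated at 4 trillion operations per second. "Dawning has once again taken a silent loss in supercomputing systems," one industry expert commented.
Keywords: Lenovo Group; "supercomputer"; high-performance computer; high cost-performance ratio; Dawning; industry competition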
Programming for scientific computing on peta-scale heterogeneous parallel systems (Cited by: 1)
2
Authors: Yang Canqun (杨灿群), Wu Qiang (吴强), Tang Tao (唐滔), Wang Feng (王锋), Xue Jingling (薛京灵). Journal of Central South University (SCIE, EI, CAS), 2013, No. 5, pp. 1189-1203 (15 pages)
Peta-scale high-performance computing systems are increasingly built with heterogeneous CPU and GPU nodes to achieve higher power efficiency and computation throughput. While providing unprecedented capabilities to conduct computational experiments of historic significance, these systems are presently difficult to program. The users, who are domain experts rather than computer experts, prefer to use programming models closer to their domains (e.g., physics and biology) rather than MPI and OpenMP. This has led to the development of domain-specific programming that provides domain-specific programming interfaces but abstracts away some performance-critical architecture details. Based on experience in designing large-scale computing systems, a hybrid programming framework for scientific computing on heterogeneous architectures is proposed in this work. Its design philosophy is to provide a collaborative mechanism for domain experts and computer experts so that both domain-specific knowledge and performance-critical architecture details can be adequately exploited. Two real-world scientific applications have been evaluated on TH-1A, a peta-scale CPU-GPU heterogeneous system that is currently the 5th fastest supercomputer in the world. The experimental results show that the proposed framework is well suited for developing large-scale scientific computing applications on peta-scale heterogeneous CPU/GPU systems.
Keywords: heterogeneous parallel system; programming framework; scientific computing; GPU computing; molecular dynamics
Review of the Low-Latency Optical Interconnect Technologies for Peta-Scale Computing
3
Authors: S. J. B. Yoo, Yin Yawei. China Communications (SCIE, CSCD), 2012, No. 8, pp. 16-28 (13 pages)
This paper reviews the recently developed optical interconnect technologies designed for scalable, low-latency and high-throughput communications within datacenters or high performance computers. Three typical architectures are discussed in detail: the broadcast-and-select based Optical Shared Memory Supercomputer Interconnect System (OSMOSIS) switch, the deflection routing based Data Vortex switch, and the arrayed waveguide grating based Low-latency Interconnect Optical Network Switch (LIONS). In particular, we investigate the various loopback buffering technologies in LIONS and present a proof-of-principle testbed demonstration showing the feasibility of the LIONS architecture. Moreover, the performance of LIONS, Data Vortex and OSMOSIS is compared, in terms of throughput and latency, with that of a traditional state-of-the-art electrical switching network based on the Flattened-ButterFly (FBF) architecture. The simulation-based performance study shows that the latency of LIONS is almost independent of the number of input ports and does not saturate even at very high input load.
Keywords: optical interconnects; switches; WDM; data centers; high performance computing
Scalable group management in large-scale virtualized clusters
4
Authors: Zhou Wei, Wang Lei, Meng Dan, Yuan Lin, Zhan Jianfeng. High Technology Letters (EI, CAS), 2011, No. 3, pp. 263-271 (9 pages)
To save cost, more and more users choose to provision resources at the granularity of virtual machines in cluster systems, especially data centres. Maintaining a consistent member view is the foundation of reliable cluster management, and it also raises several challenging issues for large-scale cluster systems deployed with virtual machines (which we call virtualized clusters). In this paper, we introduce our experience in the design and implementation of scalable member view management on large-scale virtual clusters. Our research contributions include three aspects: 1) we propose a scalable and reliable management infrastructure that combines a peer-to-peer structure and a hierarchy structure to maintain a consistent member view in virtual clusters; 2) we present a lightweight group membership algorithm that can reach the consistent member view within a single round of message exchange; 3) we design and implement a scalable membership service that can provide virtual machines and maintain a consistent member view in virtual clusters. Our work is verified on Dawning 5000A, which ranked No. 10 on the TOP500 list of supercomputers in November 2008.
Keywords: virtualized clusters; consistent member view; scalable group membership; SGMS
Performance Analysis of Parallel Eigensolvers of Two Libraries on BlueGene/P
5
Authors: Inge Gutheil, Tommy Berg, Johannes Grotendorst. Journal of Mathematics and System Science, 2012, No. 4, pp. 231-236 (6 pages)
Many applications in computational science and engineering require the computation of eigenvalues and eigenvectors of dense symmetric or Hermitian matrices. For example, in DFT (density functional theory) calculations on modern supercomputers, 10% to 30% of the eigenvalues and eigenvectors of huge dense matrices have to be calculated. Therefore, the performance and parallel scaling of the eigensolvers used are of utmost interest. In this article, different routines of the linear algebra packages ScaLAPACK and Elemental for the parallel solution of the symmetric eigenvalue problem are compared with respect to their performance on the BlueGene/P supercomputer. Parameters for performance optimization are adjusted for the different data distribution methods used in the two libraries. It is found that for all test cases the new library Elemental, which uses a two-dimensional element-by-element distribution of the matrices to the processors, shows better performance than the old ScaLAPACK library, which uses a block-cyclic distribution.
Keywords: numerical linear algebra; eigensolvers; ScaLAPACK; Elemental; performance analysis
Yang Guangwen: Builder of the Most Powerful Computer
6
Author: Lu Rucai. China Today, 2016, No. 10, pp. 22-24 (3 pages)
At the International Supercomputing Conference held in Frankfurt, Germany on June 20, 2016, TOP500.org published the latest supercomputer rankings. China’s Sunway TaihuLight took pole position. This is the seventh time in a row that China’s supercomputers have topped the TOP500 rankings, published biannually since 1993.
Keywords: imported; Jiangsu; sector; visitors; currently; director; aerospace; hopes; expand; institutions
Sci-Tech
7
China Today, 2014, No. 10, p. 13 (1 page)
China’s Supercomputer Helps Construct "Smart Cities." Developers of China’s Tianhe-1A, one of the world’s fastest supercomputers, are tapping into the digital brain’s higher functions, moving it beyond animation and Internet financing to help in the construction of new "smart cities." The Tianhe-1A can digitize the planning, design, construction, ...
Keywords: animation; Construct; fastest; smart; financing; Tianjin; Sci-Tech; authority; has been; jointly
64-qubit quantum circuit simulation (Cited by: 7)
8
Authors: Zhao-Yun Chen, Qi Zhou, Cheng Xue, Xia Yang, Guang-Can Guo, Guo-Ping Guo. Science Bulletin (SCIE, EI, CSCD), 2018, No. 15, pp. 964-971 (8 pages)
Classical simulations of quantum circuits are limited in both space and time when the qubit count is above 50, the realm where quantum supremacy reigns. However, recently, for low-depth circuits with more than 50 qubits, several simulation methods have been proposed by teams at Google and IBM. Here, we present a simulation scheme which can extract a large number of measurement outcomes within a short time, achieving a 64-qubit simulation of a universal random circuit of depth 22 using a 128-node cluster, and 56- and 42-qubit circuits on a single PC. We also estimate that a 72-qubit circuit of depth 23 can be simulated in about 16 h on a supercomputer identical to that used by the IBM team. Moreover, the simulation processes are exceedingly separable, hence parallelizable, involving just a few inter-process communications. Our work enables simulating more qubits with less hardware burden and provides a new perspective for classical simulations.
Keywords: Simulation of quantum circuits; Universal random circuit; Quantum supremacy; Partitioning; Parallel computing
High performance computing of DGDFT for tens of thousands of atoms using millions of cores on Sunway TaihuLight (Cited by: 4)
9
Authors: Wei Hu, Xinming Qin, Qingcai Jiang, Junshi Chen, Hong An, Weile Jia, Fang Li, Xin Liu, Dexun Chen, Fangfang Liu, Yuwen Zhao, Jinlong Yang. Science Bulletin (SCIE, EI, CSCD), 2021, No. 2, pp. 111-119, M0003 (10 pages)
High performance computing (HPC) is a powerful tool to accelerate Kohn–Sham density functional theory (KS-DFT) calculations on modern heterogeneous supercomputers. Here, we describe a massively parallel implementation of the discontinuous Galerkin density functional theory (DGDFT) method on the Sunway TaihuLight supercomputer. The DGDFT method uses adaptive local basis (ALB) functions generated on the fly during the self-consistent field (SCF) iteration to solve the KS equations with high precision comparable to that of a plane-wave basis set. In particular, the DGDFT method adopts a two-level parallelization strategy that deals with various types of data distribution, task scheduling, and data communication schemes, and combines it with the master–slave multi-thread heterogeneous parallelism of the SW26010 processor, resulting in large-scale HPC KS-DFT calculations on the Sunway TaihuLight supercomputer. We show that the DGDFT method can scale up to 8,519,680 processing cores (131,072 core groups) on the Sunway TaihuLight supercomputer for studying the electronic structures of two-dimensional (2D) metallic graphene systems that contain tens of thousands of carbon atoms.
Keywords: Density functional theory; Tens of thousands of atoms; High performance computing; Sunway TaihuLight
Speedup in classical simulation of Gaussian boson sampling (Cited by: 1)
10
Authors: Bujiao Wu, Bin Cheng, Fei Jia, Jialin Zhang, Man-Hong Yung, Xiaoming Sun. Science Bulletin (SCIE, EI, CAS, CSCD), 2020, No. 10, pp. 832-841, M0004 (11 pages)
Gaussian boson sampling is an alternative model for demonstrating quantum computational supremacy, where squeezed states are injected into every input mode, instead of applying single photons as in the case of standard boson sampling. Here, by numerically analyzing the computational costs, we establish a lower bound for achieving quantum computational supremacy for a class of Gaussian boson sampling problems. Specifically, we propose a more efficient method for calculating the transition probabilities, leading to a significant reduction of the simulation costs. In particular, our numerical results indicate that one can simulate up to 18 photons for Gaussian boson sampling at the output subspace on a normal laptop, 20 photons on a commercial workstation with 256 cores, and about 30 photons on supercomputers. These numbers are significantly smaller than those in standard boson sampling, suggesting that Gaussian boson sampling could be experimentally friendly for demonstrating quantum computational supremacy.
Keywords: Gaussian boson sampling; Classical simulation; Hafnian; Probability distribution; Marginal distribution; Quantum optics