期刊文献+
共找到9篇文章
< 1 >
每页显示 20 50 100
船体曲面三角网格划分的并行性计算方法
1
作者 陈宾康 余金洲 赵成璧 《船海工程》 2001年第5期1-5,共5页
采用前沿生成法对船体曲面进行三角网格划分 ,运用信号灯Petri网对划分过程进行控制和分析 ,使前沿结点的生成及其适应性计算并行处理 ,充分利用系统的分布、并发、同步异步等特性 ,以提高三角网格划分的效率 ;前沿结点的适应性计算采... 采用前沿生成法对船体曲面进行三角网格划分 ,运用信号灯Petri网对划分过程进行控制和分析 ,使前沿结点的生成及其适应性计算并行处理 ,充分利用系统的分布、并发、同步异步等特性 ,以提高三角网格划分的效率 ;前沿结点的适应性计算采用改进的演化算法。 展开更多
关键词 船体 曲面 并行性计算方法 三角网格划分 前沿生成法 PETRI网 演化计算
下载PDF
GRCC:一种通用可重构协处理器 被引量:1
2
作者 朱文 付宇卓 谢憬 《微电子学与计算机》 CSCD 北大核心 2009年第6期154-158,共5页
描述了一种改进型可重构处理器——GRCC(General Reconfigurable Coprocessor).该处理器能够使用一般通用RISC处理器的协处理器接口,通过与通用处理器的协处理器指令通信,达到辅助主处理器进行大规模密集计算的目的.着重介绍了DCT算法在... 描述了一种改进型可重构处理器——GRCC(General Reconfigurable Coprocessor).该处理器能够使用一般通用RISC处理器的协处理器接口,通过与通用处理器的协处理器指令通信,达到辅助主处理器进行大规模密集计算的目的.着重介绍了DCT算法在GRCC中的映射与实现,仿真结果显示,GRCC能达到6倍以上于通用处理器的性能,并在实现复杂度、运行效率与通用性中达到了一个权衡. 展开更多
关键词 可重构协处理器 并行性计算 DCT
下载PDF
一种循环流水阵列架构
3
作者 杨超 谢憬 毛志刚 《信息技术》 2010年第2期23-27,共5页
描述了一种基于循环流水计算的阵列架构(PLAA),该阵列架构能够工作在基于AHB协议的总线接口上,通过与ARM处理器指令通信,达到辅助主处理器进行大规模密集计算的目的。描述了这一处理器的结构,并着重介绍了二维DCT算法在PLAA中的映射与... 描述了一种基于循环流水计算的阵列架构(PLAA),该阵列架构能够工作在基于AHB协议的总线接口上,通过与ARM处理器指令通信,达到辅助主处理器进行大规模密集计算的目的。描述了这一处理器的结构,并着重介绍了二维DCT算法在PLAA中的映射与实现。仿真结果显示,PLAA能达到7倍以上于通用处理器的性能,并在实现复杂度、运行效率与通用性中达到一个权衡。 展开更多
关键词 可重构阵列 循环流水 并行性计算
下载PDF
A preliminary evaluation of high-performance advanced regional eta-coordinate model(H-AREM)
4
作者 CHENG Yu-Feng XU You-Ping +1 位作者 LI Li-Juan WANG Bin 《Atmospheric and Oceanic Science Letters》 CSCD 2017年第1期1-8,共8页
This paper preliminarily evaluates the speedup,scalability,and prediction skill of the highperformance advanced regional eta coordinate model(H-AREM),which is based on several parallel processing methods and decompo... This paper preliminarily evaluates the speedup,scalability,and prediction skill of the highperformance advanced regional eta coordinate model(H-AREM),which is based on several parallel processing methods and decomposition strategies.Results show that the parallel version of the model that is based on a modular parallel framework and a multidimensional domain decomposition strategy performs better overall,e.g.it is faster and more scalable than the version based on a message passing interface and a one-dimensional decomposition strategy.In particular,the scalability of the H-AREM with a resolution of 8 km approaches 8099 cores.Moreover,in the H-AREM,higher resolutions result in more realistic precipitation predictions without remarkable increases in simulation time. 展开更多
关键词 AREM parallel computation SPEED-UP SCALABILITY
下载PDF
Parallel Optical Interconnect Technology: Combination of Higher Performance and Lower Energy Consumption
5
作者 Qiao Yaojun Gu Rentao Ji Yuefeng 《China Communications》 SCIE CSCD 2010年第3期99-106,共8页
This paper analyzes the physical potential, computing performance benefi t and power consumption of optical interconnects. Compared with electrical interconnections, optical ones show undoubted advantages based on phy... This paper analyzes the physical potential, computing performance benefi t and power consumption of optical interconnects. Compared with electrical interconnections, optical ones show undoubted advantages based on physical factor analysis. At the same time, since the recent developments drive us to think about whether these optical interconnect technologies with higher bandwidth but higher cost are worthy to be deployed, the computing performance comparison is performed. To meet the increasing demand of large-scale parallel or multi-processor computing tasks, an analytic method to evaluate parallel computing performance ofinterconnect systems is proposed in this paper. Both bandwidth-limit model and full-bandwidth model are under our investigation. Speedup and effi ciency are selected to represent the parallel performance of an interconnect system. Deploying the proposed models, we depict the performance gap between the optical and electrically interconnected systems. Another investigation on power consumption of commercial products showed that if the parallel interconnections are deployed, the unit power consumption will be reduced. Therefore, from the analysis of computing influence and power dissipation, we found that parallel optical interconnect is valuable combination of high performance and low energy consumption. Considering the possible data center under construction, huge power could be saved if parallel optical interconnects technologies are used. 展开更多
关键词 optical interconnects high performance computing power dissipation
下载PDF
Modeling and Generating Realistic Background Traffic by Hybrid Approach 被引量:2
6
作者 QIAN Yaguan GUAN Xiaohui +1 位作者 JIANG Ming CEN Gang 《China Communications》 SCIE CSCD 2015年第10期147-157,共11页
One of the key challenges in largescale network simulation is the huge computation demand in fine-grained traffic simulation.Apart from using high-performance computing facilities and parallelism techniques,an alterna... One of the key challenges in largescale network simulation is the huge computation demand in fine-grained traffic simulation.Apart from using high-performance computing facilities and parallelism techniques,an alternative is to replace the background traffic by simplified abstract models such as fluid flows.This paper suggests a hybrid modeling approach for background traffic,which combines ON/OFF model with TCP activities.The ON/OFF model is to characterize the application activities,and the ordinary differential equations(ODEs) based on fluid flows is to describe the TCP congestion avoidance functionality.The apparent merits of this approach are(1) to accurately capture the traffic self-similarity at source level,(2) properly reflect the network dynamics,and(3) efficiently decrease the computational complexity.The experimental results show that the approach perfectly makes a proper trade-off between accuracy and complexity in background traffic simulation. 展开更多
关键词 network simulation background traffic ON/OFF models fluid flows self-similarity
下载PDF
Implementation Study of Dynamic Load Balancing Algorithm of Parallel Tree Computation on Clusters of Heterogeneous Workstation
7
作者 Mohammed A.M. Ibrahim M.SaifMokbel 《Journal of Donghua University(English Edition)》 EI CAS 2005年第2期81-86,共6页
The rapid growth of interconnected high performance workstations has produced a new computing paradigm called clustered of workstations computing. In these systems load balance problem is a serious impediment to achie... The rapid growth of interconnected high performance workstations has produced a new computing paradigm called clustered of workstations computing. In these systems load balance problem is a serious impediment to achieve good performance. The main concern of this paper is the implementation of dynamic load balancing algorithm, asynchronous Round Robin (ARR), for balancing workload of parallel tree computation depth-first-search algorithm on Cluster of Heterogeneous Workstations (COW) Many algorithms in artificial intelligence and other areas of computer science are based on depth first search in implicitty defined trees. For these algorithms a load-balancing scheme is required, which is able to evenly distribute parts of an irregularly shaped tree over the workstations with minimal interprocessor communication and without prior knowledge of the tree’s shape. For the (ARR) algorithm only minimal interprocessor communication is needed when necessary and it runs under the MPI (Message passing interface) that allows parallel execution on heterogeneous SUN cluster of workstation platform. The program code is written in C language and executed under UNIX operating system (Solaris version). 展开更多
关键词 cluster of workstations parallel tree computation dynamic load balancing performance metrics
下载PDF
Research and Analysis of the Parallel Computer Network Reliability
8
作者 Xiaodan ZHANG 《International Journal of Technology Management》 2015年第3期113-114,共2页
With the continuous development of network communication technology and computer technology, parallel computer network applications becoming more widely, its reliability has attracted more attention on researcher. Thi... With the continuous development of network communication technology and computer technology, parallel computer network applications becoming more widely, its reliability has attracted more attention on researcher. This paper gives a introduction to a simple computer network, given the reliability of the design criteria for computer network analysis, and finally through the examples to illustrate the computer network hardware and software reliability. 展开更多
关键词 Computer network RELIABILITY Design criteria Example illustrations.
下载PDF
High performance computing of DGDFT for tens of thousands of atoms using millions of cores on Sunway TaihuLight 被引量:4
9
作者 Wei Hu Xinming Qin +9 位作者 Qingcai Jiang Junshi Chen Hong An Weile Jia Fang Li Xin Liu Dexun Chen Fangfang Liu Yuwen Zhao Jinlong Yang 《Science Bulletin》 SCIE EI CSCD 2021年第2期111-119,M0003,共10页
High performance computing(HPC)is a powerful tool to accelerate the Kohn–Sham density functional theory(KS-DFT)calculations on modern heterogeneous supercomputers.Here,we describe a massively parallel implementation ... High performance computing(HPC)is a powerful tool to accelerate the Kohn–Sham density functional theory(KS-DFT)calculations on modern heterogeneous supercomputers.Here,we describe a massively parallel implementation of discontinuous Galerkin density functional theory(DGDFT)method on the Sunway Taihu Light supercomputer.The DGDFT method uses the adaptive local basis(ALB)functions generated on-the-fly during the self-consistent field(SCF)iteration to solve the KS equations with high precision comparable to plane-wave basis set.In particular,the DGDFT method adopts a two-level parallelization strategy that deals with various types of data distribution,task scheduling,and data communication schemes,and combines with the master–slave multi-thread heterogeneous parallelism of SW26010 processor,resulting in large-scale HPC KS-DFT calculations on the Sunway Taihu Light supercomputer.We show that the DGDFT method can scale up to 8,519,680 processing cores(131,072 core groups)on the Sunway Taihu Light supercomputer for studying the electronic structures of twodimensional(2 D)metallic graphene systems that contain tens of thousands of carbon atoms. 展开更多
关键词 Density functional theory Tens of thousands of atoms High performance computing Sunway TaihuLight
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部