期刊文献+
共找到163篇文章
< 1 2 9 >
每页显示 20 50 100
Optimization Techniques for GPU-Based Parallel Programming Models in High-Performance Computing
1
作者 Shuntao Tang Wei Chen 《信息工程期刊(中英文版)》 2024年第1期7-11,共5页
This study embarks on a comprehensive examination of optimization techniques within GPU-based parallel programming models,pivotal for advancing high-performance computing(HPC).Emphasizing the transition of GPUs from g... This study embarks on a comprehensive examination of optimization techniques within GPU-based parallel programming models,pivotal for advancing high-performance computing(HPC).Emphasizing the transition of GPUs from graphic-centric processors to versatile computing units,it delves into the nuanced optimization of memory access,thread management,algorithmic design,and data structures.These optimizations are critical for exploiting the parallel processing capabilities of GPUs,addressingboth the theoretical frameworks and practical implementations.By integrating advanced strategies such as memory coalescing,dynamic scheduling,and parallel algorithmic transformations,this research aims to significantly elevate computational efficiency and throughput.The findings underscore the potential of optimized GPU programming to revolutionize computational tasks across various domains,highlighting a pathway towards achieving unparalleled processing power and efficiency in HPC environments.The paper not only contributes to the academic discourse on GPU optimization but also provides actionable insights for developers,fostering advancements in computational sciences and technology. 展开更多
关键词 Optimization Techniques GPU-Based Parallel Programming Models high-performance computing
下载PDF
Towards Auction-Based HPC Computing in the Cloud
2
作者 Moussa Taifi Justin Y. Shi Abdallah Khreishah 《Computer Technology and Application》 2012年第7期499-509,共11页
Cloud computing is expanding widely in the world of IT infrastructure. This is due partly to the cost-saving effect of economies of scale. Fair market conditions can in theory provide a healthy environment to reflect ... Cloud computing is expanding widely in the world of IT infrastructure. This is due partly to the cost-saving effect of economies of scale. Fair market conditions can in theory provide a healthy environment to reflect the most reasonable costs of computations. While fixed cloud pricing provides an attractive low entry barrier for compute-intensive applications, both the consumer and supplier of computing resources can see high efficiency for their investments by participating in auction-based exchanges. There are huge incentives for the cloud provider to offer auctioned resources. However, from the consumer perspective, using these resources is a sparsely discussed challenge. This paper reports a methodology and framework designed to address the challenges of using HPC (High Performance Computing) applications on auction-based cloud clusters. The authors focus on HPC applications and describe a method for determining bid-aware checkpointing intervals. They extend a theoretical model for determining checkpoint intervals using statistical analysis of pricing histories. Also the latest developments in the SpotHPC framework are introduced which aim at facilitating the managed execution of real MPI applications on auction-based cloud environments. The authors use their model to simulate a set of algorithms with different computing and communication densities. The results show the complex interactions between optimal bidding strategies and parallel applications performance. 展开更多
关键词 Auction-based cloud computing fault tolerance cloud hpc (high performance computing
下载PDF
Web-Based Computing and Property Database Portlet by Using HPC Portal Development Platform
3
作者 Chien-Heng Wu 《通讯和计算机(中英文版)》 2011年第12期1023-1032,共10页
关键词 开发平台 性能计算 PORTLET hpc Web 属性数据库 门户 企业应用程序
下载PDF
MatDEM-fast matrix computing of the discrete element method 被引量:7
4
作者 Chun Liu Hui Liu Hongyong Zhang 《Earthquake Research Advances》 CSCD 2021年第3期1-7,共7页
Discrete element method can effectively simulate the discontinuity,inhomogeneity and large deformation and failure of rock and soil.Based on the innovative matrix computing of the discrete element method,the highperfo... Discrete element method can effectively simulate the discontinuity,inhomogeneity and large deformation and failure of rock and soil.Based on the innovative matrix computing of the discrete element method,the highperformance discrete element software MatDEM may handle millions of elements in one computer,and enables the discrete element simulation at the engineering scale.It supports heat calculation,multi-field and fluidsolid coupling numerical simulations.Furthermore,the software integrates pre-processing,solver,postprocessing,and powerful secondary development,allowing recompiling new discrete element software.The basic principles of the DEM,the implement and development of the MatDEM software,and its applications are introduced in this paper.The software and sample source code are available online(http://matdem.com). 展开更多
关键词 Discrete element method high-performance MatDEM Matrix computing
下载PDF
Creep experimental test and analysis of high-performance concrete in bridge 被引量:1
5
作者 陈志华 袁健 《Journal of Central South University》 SCIE EI CAS 2008年第S1期577-581,共5页
Factors that have effect on concrete creep include mixture composition,curing conditions,ambient exposure conditions,and element geometry.Considering concrete mixtures influence and in order to improve the prediction ... Factors that have effect on concrete creep include mixture composition,curing conditions,ambient exposure conditions,and element geometry.Considering concrete mixtures influence and in order to improve the prediction of prestress loss in important structures,an experimental test under laboratory conditions was carried out to investigate compression creep of two high performance concrete mixtures used for prestressed members in one bridge.Based on the experimental results,a power exponent function of creep degree for structural numerical analysis was used to model the creep degree of two HPCs,and two series of parameters of this function for two HPCs were calculated with evolution program optimum method.The experimental data was compared with CEB-FIP 90 and ACI 209(92) models,and the two code models both overestimated creep degrees of the two HPCs.So it is recommended that the power exponent function should be used in this bridge structure analysis. 展开更多
关键词 CEMENT CONCRETE CREEP high-performance concrete(hpc)
下载PDF
Experimental study on creep of high-performance concretes
6
作者 陈志华 马忠武 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 2009年第4期576-580,共5页
In order to investigate the compression creep of two kinds of high-performance concrete mixtures used for prestressed members in a bridge,an experimental test under laboratory conditions was carried out.Based on the e... In order to investigate the compression creep of two kinds of high-performance concrete mixtures used for prestressed members in a bridge,an experimental test under laboratory conditions was carried out.Based on the experimental results,a power exponent function was used to model the creep degree of these high-performance concretes(HPCs) for structural numerical analysis,and two series parameters of this function for the HPCs were given with the optimum method of evolution program.The experimental data were compared with CEB-FIP 90 and ACI 92 models.Results show that the two code models both overestimate the creep degree of two HPCs,so it is recommended that the power exponent function should be used for the creep analysis of bridge structure. 展开更多
关键词 CEMENT CONCRETE CREEP high-performance concrete (hpc
下载PDF
Evaluation of the Application Benefit of Meteorological High Performance Computing Resources
7
作者 Min Wei Bin Wang 《Journal of Geoscience and Environment Protection》 2017年第7期153-160,共8页
The meteorological high-performance computing resource is the support platform for the weather forecast and climate prediction numerical model operation. The scientific and objective method to evaluate the application... The meteorological high-performance computing resource is the support platform for the weather forecast and climate prediction numerical model operation. The scientific and objective method to evaluate the application of meteorological high-performance computing resources can not only provide reference for the optimization of active resources, but also provide a quantitative basis for future resource construction and planning. In this paper, the concept of the utility value B and index compliance rate E of the meteorological high performance computing system are presented. The evaluation process, evaluation index and calculation method of the high performance computing resource application benefits are introduced. 展开更多
关键词 high-performance computing RESOURCES RESOURCE Application BENEFIT EVALUATION BENEFIT Value
下载PDF
API Development Increases Access to Shared Computing Resources at Boston University
8
作者 George Jones Amanda E. Wakefield +4 位作者 Jeff Triplett Kojo Idrissa James Goebel Dima Kozakov Sandor Vajda 《Journal of Software Engineering and Applications》 2022年第6期197-207,共11页
Within the last few decades, increases in computational resources have contributed enormously to the progress of science and engineering (S & E). To continue making rapid advancements, the S & E community must... Within the last few decades, increases in computational resources have contributed enormously to the progress of science and engineering (S & E). To continue making rapid advancements, the S & E community must be able to access computing resources. One way to provide such resources is through High-Performance Computing (HPC) centers. Many academic research institutions offer their own HPC Centers but struggle to make the computing resources easily accessible and user-friendly. Here we present SHABU, a RESTful Web API framework that enables S & E communities to access resources from Boston University’s Shared Computing Center (SCC). The SHABU requirements are derived from the use cases described in this work. 展开更多
关键词 API Framework Open Source high-performance computing Software Architecture Science and Engineering
下载PDF
基于MPI的鲲鹏CPU核间通信研究
9
作者 周岩 王鹏 王琨予 《西南民族大学学报(自然科学版)》 CAS 2024年第3期328-335,共8页
核间通信延时是影响高性能计算系统整体运行效率的重要因素.国产鲲鹏CPU在高性能计算领域应用日益广泛,针对鲲鹏CPU的缓存架构及多核间接口互联进行分析,研究影响鲲鹏CPU核间通信延时的因素.在消息传递接口(MPI)环境下进行节点内核间通... 核间通信延时是影响高性能计算系统整体运行效率的重要因素.国产鲲鹏CPU在高性能计算领域应用日益广泛,针对鲲鹏CPU的缓存架构及多核间接口互联进行分析,研究影响鲲鹏CPU核间通信延时的因素.在消息传递接口(MPI)环境下进行节点内核间通信实验,对包括跨三级缓存、跨物理CPU通信等不同模式下通信延时进行对比,发现通信数据包大于500 KB后,跨L3 Cache TAG的通信延时反优于共享L3 Cache TAG的通信延时.针对通信数据包在64 KB大小时的通信延迟异常,分析得出是MPI的Eager模式和Rendezvous模式的默认切换阈值所造成.对这两种模式进行实验对比,验证不同大小的通信数据包在不同模式下和跨核通信时的延时特征,Eager模式更适合低延时的小消息发送.在实际应用中可根据通信数据包大小调整两种模式的默认切换阈值,以达到更好的传输效果.实验结果表明由于鲲鹏CPU存在复杂的多核结构,在并行计算程序设计时可以进行针对性优化,以提升程序的运行效率. 展开更多
关键词 鲲鹏CPU 核间通信 消息传递接口 高性能计算 共享缓存
下载PDF
广域网协议在PC集群系统HPC中的应用分析 被引量:1
10
作者 王文义 赵少林 王若雨 《郑州大学学报(工学版)》 CAS 2006年第1期67-71,共5页
鉴于绝大多数PC集群系统都使用了TCP/IP协议,着重分析了作为分布式进程间通信手段的socket通信机制在Linux中的实现以及传统通信开销中影响性能的主要因素,并针对通信瓶颈,对传统的通信协议栈进行了改进,对广域网协议在PC集群系统中应... 鉴于绝大多数PC集群系统都使用了TCP/IP协议,着重分析了作为分布式进程间通信手段的socket通信机制在Linux中的实现以及传统通信开销中影响性能的主要因素,并针对通信瓶颈,对传统的通信协议栈进行了改进,对广域网协议在PC集群系统中应用的不足之处,提出了改进集群网络性能的方法. 展开更多
关键词 集群系统 高性能计算 TCP/IP
下载PDF
HPCC:面向存储访问模型的基准测试—一种可能替代TOP500 HPL的测试方法 被引量:1
11
作者 王晓英 李三立 《小型微型计算机系统》 CSCD 北大核心 2006年第5期950-955,共6页
高性能计算机系统的性能评价历来是本领域所关注的重要问题.TOP500排名所采用的标准测试HPL(HigPerformanceLinpack)并不能真实的反映系统各方面的性能,尤其是存储访问方面.HPCChallenge基准测试则着重于各种存储访问模型,在HPL的基础... 高性能计算机系统的性能评价历来是本领域所关注的重要问题.TOP500排名所采用的标准测试HPL(HigPerformanceLinpack)并不能真实的反映系统各方面的性能,尤其是存储访问方面.HPCChallenge基准测试则着重于各种存储访问模型,在HPL的基础之上又整合了多个有代表性的核心测试程序,很有可能在未来取代现在TOP500采用的的HPL测试.本文首先简单介绍HPCChallenge诞生的背景,解释基准测试的基本概念和原理,从存储访问模型的角度对各项测试进行了描述,并根据实际的测试结果进行比较和分析.最后给出结论以及将来的工作. 展开更多
关键词 hpc CHALLENGE 基准测试 高性能计算 局部性
下载PDF
Static Analysis Techniques for Fixing Software Defects in MPI-Based Parallel Programs
12
作者 Norah Abdullah Al-Johany Sanaa Abdullah Sharaf +1 位作者 Fathy Elbouraey Eassa Reem Abdulaziz Alnanih 《Computers, Materials & Continua》 SCIE EI 2024年第5期3139-3173,共35页
The Message Passing Interface (MPI) is a widely accepted standard for parallel computing on distributed memorysystems.However, MPI implementations can contain defects that impact the reliability and performance of par... The Message Passing Interface (MPI) is a widely accepted standard for parallel computing on distributed memorysystems.However, MPI implementations can contain defects that impact the reliability and performance of parallelapplications. Detecting and correcting these defects is crucial, yet there is a lack of published models specificallydesigned for correctingMPI defects. To address this, we propose a model for detecting and correcting MPI defects(DC_MPI), which aims to detect and correct defects in various types of MPI communication, including blockingpoint-to-point (BPTP), nonblocking point-to-point (NBPTP), and collective communication (CC). The defectsaddressed by the DC_MPI model include illegal MPI calls, deadlocks (DL), race conditions (RC), and messagemismatches (MM). To assess the effectiveness of the DC_MPI model, we performed experiments on a datasetconsisting of 40 MPI codes. The results indicate that the model achieved a detection rate of 37 out of 40 codes,resulting in an overall detection accuracy of 92.5%. Additionally, the execution duration of the DC_MPI modelranged from 0.81 to 1.36 s. These findings show that the DC_MPI model is useful in detecting and correctingdefects in MPI implementations, thereby enhancing the reliability and performance of parallel applications. TheDC_MPImodel fills an important research gap and provides a valuable tool for improving the quality ofMPI-basedparallel computing systems. 展开更多
关键词 high-performance computing parallel computing software engineering software defect message passing interface DEADLOCK
下载PDF
HPC海量存储系统Pass-Through访问策略研究 被引量:2
13
作者 朱平 《计算机研究与发展》 EI CSCD 北大核心 2013年第8期1667-1673,共7页
为了解决海量信息处理中实时访问中的"I/O墙"的问题,提高海量信息分布式存储系统的性能,提出了一种基于高性能计算(high performance computing,HPC)的存储部件新型访问策略.首先,分析传统访问模型存在的问题;其次,研究存储... 为了解决海量信息处理中实时访问中的"I/O墙"的问题,提高海量信息分布式存储系统的性能,提出了一种基于高性能计算(high performance computing,HPC)的存储部件新型访问策略.首先,分析传统访问模型存在的问题;其次,研究存储部件直通路模式的工作机理,建立存储系统分解为多层次、分布式的模型,根据不同的层次和映射策略实现存储空间物理地址、缓存地址、存储系统逻辑空间地址的连续映射;第三,分析直通路访问模式下的存储路径时间开销;第四,在模拟环境下存储部件访问的性能测试,在实际采用该策略的应用系统中进行验证.通过验证测试结果表明,该方法能有效提高存储系统的性能,能够不断满足海量信息处理实时需要. 展开更多
关键词 高性能计算 海量存储系统 存储部件直通路 存储层次映射 存储策略
下载PDF
HPC的最新发展情况及发展趋势 被引量:1
14
作者 李卫平 《安阳工学院学报》 2006年第2期59-62,共4页
本文介绍了高性能计算机在国际和国内的最新发展情况,对目前比较热门的计算机机群和网格计算机进行了理性分析和客观评价,并对以后高性能计算机的发展进行了前瞻性探讨,以期对我国高性能计算的发展有所借鉴。
关键词 TOP500 高性能计算机 网格计算 机群
下载PDF
基于GPU的LBM迁移模块算法优化
15
作者 黄斌 柳安军 +3 位作者 潘景山 田敏 张煜 朱光慧 《计算机工程》 CAS CSCD 北大核心 2024年第2期232-238,共7页
格子玻尔兹曼方法(LBM)是一种基于介观模拟尺度的计算流体力学方法,其在计算时设置大量的离散格点,具有适合并行的特性。图形处理器(GPU)中有大量的算术逻辑单元,适合大规模的并行计算。基于GPU设计LBM的并行算法,能够提高计算效率。但... 格子玻尔兹曼方法(LBM)是一种基于介观模拟尺度的计算流体力学方法,其在计算时设置大量的离散格点,具有适合并行的特性。图形处理器(GPU)中有大量的算术逻辑单元,适合大规模的并行计算。基于GPU设计LBM的并行算法,能够提高计算效率。但是LBM算法迁移模块中每个格点的计算都需要与其他格点进行通信,存在较强的数据依赖。提出一种基于GPU的LBM迁移模块算法优化策略。首先分析迁移部分的实现逻辑,通过模型降维,将三维模型按照速度分量离散为多个二维模型,降低模型的复杂度;然后分析迁移模块计算前后格点中的数据差异,通过数据定位找到迁移模块的通信规律,并对格点之间的数据交换方式进行分类;最后使用分类的交换方式对离散的二维模型进行区域划分,设计新的数据通信方式,由此消除数据依赖的影响,将迁移模块完全并行化。对并行算法进行测试,结果显示:该算法在1.3×10^(8)规模网格下能达到1.92的加速比,表明算法具有良好的并行效果;同时对比未将迁移模块并行化的算法,所提优化策略能提升算法30%的并行计算效率。 展开更多
关键词 高性能计算 格子玻尔兹曼方法 图形处理器 并行优化 数据重排
下载PDF
Parallel Inference for Real-Time Machine Learning Applications
16
作者 Sultan Al Bayyat Ammar Alomran +3 位作者 Mohsen Alshatti Ahmed Almousa Rayyan Almousa Yasir Alguwaifli 《Journal of Computer and Communications》 2024年第1期139-146,共8页
Hyperparameter tuning is a key step in developing high-performing machine learning models, but searching large hyperparameter spaces requires extensive computation using standard sequential methods. This work analyzes... Hyperparameter tuning is a key step in developing high-performing machine learning models, but searching large hyperparameter spaces requires extensive computation using standard sequential methods. This work analyzes the performance gains from parallel versus sequential hyperparameter optimization. Using scikit-learn’s Randomized SearchCV, this project tuned a Random Forest classifier for fake news detection via randomized grid search. Setting n_jobs to -1 enabled full parallelization across CPU cores. Results show the parallel implementation achieved over 5× faster CPU times and 3× faster total run times compared to sequential tuning. However, test accuracy slightly dropped from 99.26% sequentially to 99.15% with parallelism, indicating a trade-off between evaluation efficiency and model performance. Still, the significant computational gains allow more extensive hyperparameter exploration within reasonable timeframes, outweighing the small accuracy decrease. Further analysis could better quantify this trade-off across different models, tuning techniques, tasks, and hardware. 展开更多
关键词 Machine Learning Models computational Efficiency Parallel computing Systems Random Forest Inference Hyperparameter Tuning Python Frameworks (TensorFlow PyTorch Scikit-Learn) high-performance computing
下载PDF
基于64位的HPC应用研究
17
作者 王占杰 陈科 《微计算机信息》 2010年第29期10-11,17,共3页
针对64位高性能计算,本文简述了如何配置64位编程环境,并对64位SIMD指令作了介绍。通过一个实例验证了利用64位SIMD指令可以较好地提高任务处理速度。
关键词 64位处理器 单指令多数据 高性能计算
下载PDF
New prospects for computational hydraulics by leveraging high-performance heterogeneous computing techniques 被引量:3
18
作者 Qiuhua LIANG Luke SMITH Xilin XIA 《Journal of Hydrodynamics》 SCIE EI CSCD 2016年第6期977-985,共9页
In the last two decades, computational hydraulics has undergone a rapid development following the advancement of data acquisition and computing technologies. Using a finite-volume Godunov-type hydrodynamic model, this... In the last two decades, computational hydraulics has undergone a rapid development following the advancement of data acquisition and computing technologies. Using a finite-volume Godunov-type hydrodynamic model, this work demonstrates the promise of modern high-performance computing technology to achieve real-time flood modeling at a regional scale. The software is implemented for high-performance heterogeneous computing using the OpenCL programming framework, and developed to support simulations across multiple GPUs using a domain decomposition technique and across multiple systems through an efficient implementation of the Message Passing Interface (MPI) standard. The software is applied for a convective storm induced flood event in Newcastle upon Tyne, demonstrating high computational performance across a GPU cluster, and good agreement against crowd- sourced observations. Issues relating to data availability, complex urban topography and differences in drainage capacity affect results for a small number of areas. 展开更多
关键词 computational hydraulics high-performance computing flood modeling shallow water equations shock-capttLring hydrodynamic model
原文传递
The use of high-performance and high-throughput computing for the fertilization of digital earth and global change studies 被引量:2
19
作者 Yong Xue Dominic Palmer-Brown Huadong Guo 《International Journal of Digital Earth》 SCIE 2011年第3期185-210,共26页
The study of global climate change seeks to understand:(1)the components of the Earth’s varying environmental system,with a particular focus on climate;(2)how these components interact to determine present conditions... The study of global climate change seeks to understand:(1)the components of the Earth’s varying environmental system,with a particular focus on climate;(2)how these components interact to determine present conditions;(3)the factors driving these components;(4)the history of global change and the projection of future change;and(5)how knowledge about global environmental variability and change can be applied to present-day and future decision-making.This paper addresses the use of high-performance computing and high-throughput computing for a global change study on the Digital Earth(DE)platform.Two aspects of the use of high-performance computing(HPC)/high-throughput computing(HTC)on the DE platform are the processing of data from all sources,especially Earth observation data,and the simulation of global change models.The HPC/HTC is an essential and efficient tool for the processing of vast amounts of global data,especially Earth observation data.The current trend involves running complex global climate models using potentially millions of personal computers to achieve better climate change predictions than would ever be possible using the supercomputers currently available to scientists. 展开更多
关键词 high-performance computing(hpc) high-throughput computing(HTC) digital earth global change climate change Earth observation grid computing
原文传递
基于Microsoft HPC的Magic迭代计算软件的并行化 被引量:1
20
作者 侯佳正 张绍阳 陈博远 《应用科技》 CAS 2020年第3期100-105,共6页
Magic软件可以通过馈入参数仿真计算输出功率。为了确定最优的参数,实际工作中通过采用Magic迭代计算软件中的遗传算法迭代计算确定参数,但每代计算需要同时启动多个Magic程序,耗时长、单机计算效率低。本文提出了基于Windows环境,使用... Magic软件可以通过馈入参数仿真计算输出功率。为了确定最优的参数,实际工作中通过采用Magic迭代计算软件中的遗传算法迭代计算确定参数,但每代计算需要同时启动多个Magic程序,耗时长、单机计算效率低。本文提出了基于Windows环境,使用现有的工作站,利用Microsoft HPC工具包搭建一个并行计算集群,实现将每代启动的Magic程序放到多台计算机上进行并行计算的解决方案。首先利用HPC Pack进行集群的搭建,然后设置共享文件夹进行数据的存储与访问,最后使用Microsoft HPC Pack SDK中的API改写Magic迭代计算软件,实现将每代启动的Magic程序放到多台计算机上进行并行计算。通过测试表明加速比大约为2,能够有效提高计算速度。 展开更多
关键词 hpc集群 MAGIC软件 Magic迭代计算软件 Window环境 遗传算法 多机并行 共享存储 计算时间
下载PDF
上一页 1 2 9 下一页 到第
使用帮助 返回顶部