期刊文献+
共找到6,822篇文章
< 1 2 250 >
每页显示 20 50 100
Joint computation offloading and parallel scheduling to maximize delay-guarantee in cooperative MEC systems
1
作者 Mian Guo Mithun Mukherjee +3 位作者 Jaime Lloret Lei Li Quansheng Guan Fei Ji 《Digital Communications and Networks》 SCIE CSCD 2024年第3期693-705,共13页
The growing development of the Internet of Things(IoT)is accelerating the emergence and growth of new IoT services and applications,which will result in massive amounts of data being generated,transmitted and pro-cess... The growing development of the Internet of Things(IoT)is accelerating the emergence and growth of new IoT services and applications,which will result in massive amounts of data being generated,transmitted and pro-cessed in wireless communication networks.Mobile Edge Computing(MEC)is a desired paradigm to timely process the data from IoT for value maximization.In MEC,a number of computing-capable devices are deployed at the network edge near data sources to support edge computing,such that the long network transmission delay in cloud computing paradigm could be avoided.Since an edge device might not always have sufficient resources to process the massive amount of data,computation offloading is significantly important considering the coop-eration among edge devices.However,the dynamic traffic characteristics and heterogeneous computing capa-bilities of edge devices challenge the offloading.In addition,different scheduling schemes might provide different computation delays to the offloaded tasks.Thus,offloading in mobile nodes and scheduling in the MEC server are coupled to determine service delay.This paper seeks to guarantee low delay for computation intensive applica-tions by jointly optimizing the offloading and scheduling in such an MEC system.We propose a Delay-Greedy Computation Offloading(DGCO)algorithm to make offloading decisions for new tasks in distributed computing-enabled mobile devices.A Reinforcement Learning-based Parallel Scheduling(RLPS)algorithm is further designed to schedule offloaded tasks in the multi-core MEC server.With an offloading delay broadcast mechanism,the DGCO and RLPS cooperate to achieve the goal of delay-guarantee-ratio maximization.Finally,the simulation results show that our proposal can bound the end-to-end delay of various tasks.Even under slightly heavy task load,the delay-guarantee-ratio given by DGCO-RLPS can still approximate 95%,while that given by benchmarked algorithms is reduced to intolerable value.The simulation results are demonstrated the effective-ness of DGCO-RLPS for delay guarantee in MEC. 展开更多
关键词 Edge computing computation offloading parallel scheduling Mobile-edge cooperation Delay guarantee
下载PDF
Analysis on intersections between fractures by parallel computation 被引量:10
2
作者 Zhiyu Li Mingyu Wang +1 位作者 Jianhui Zhao Xiaohui Qiao 《International Journal of Coal Science & Technology》 EI CAS 2014年第3期356-363,共8页
The discrete fracture network model is a powerful tool for fractured rock mass fluid flow simulations and supports safety assessments of coal mine hazards such as water inrush.Intersection analysis,which identifies al... The discrete fracture network model is a powerful tool for fractured rock mass fluid flow simulations and supports safety assessments of coal mine hazards such as water inrush.Intersection analysis,which identifies all pairs of intersected fractures(the basic components composing the connectivity of a network),is one of its crucial procedures.This paper attempts to improve intersection analysis through parallel computing.Considering a seamless interfacing with other procedures in modeling,two algorithms are designed and presented,of which one is a completely independent parallel procedure with some redundant computations and the other is an optimized version with reduced redundancy.A numerical study indicates that both of the algorithms are practical and can significantly improve the computational performance of intersection analysis for large-scale simulations.Moreover,the preferred application conditions for the two algorithms are also discussed. 展开更多
关键词 Fracture intersections Discrete fracture network-Intersection analysis parallel computing
下载PDF
A UNIVERSAL ALGORITHM FOR PARALLEL CRC COMPUTATION AND ITS IMPLEMENTATION 被引量:5
3
作者 Xu Zhanqi Yi Kechu Liu Zengji 《Journal of Electronics(China)》 2006年第4期528-531,共4页
Derived from a proposed universal mathematical expression, this paper investigates a novel algo-rithm for parallel Cyclic Redundancy Check (CRC) computation, which is an iterative algorithm to update the check-bit seq... Derived from a proposed universal mathematical expression, this paper investigates a novel algo-rithm for parallel Cyclic Redundancy Check (CRC) computation, which is an iterative algorithm to update the check-bit sequence step by step and suits to various argument selections of CRC computation. The algorithm proposed is quite suitable for hardware implementation. The simulation implementation and performance analysis suggest that it could efficiently speed up the computation compared with the conventional ones. The algorithm is implemented in hardware at as high as 21Gbps, and its usefulness in high-speed CRC computa-tions is implied, such as Asynchronous Transfer Mode (ATM) networks and 10G Ethernet. 展开更多
关键词 Cyclic Redundancy Check (CRC) parallel computation Multi-bit divider
下载PDF
A Hybrid Parallel Strategy for Isogeometric Topology Optimization via CPU/GPU Heterogeneous Computing
4
作者 Zhaohui Xia Baichuan Gao +3 位作者 Chen Yu Haotian Han Haobo Zhang Shuting Wang 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第2期1103-1137,共35页
This paper aims to solve large-scale and complex isogeometric topology optimization problems that consumesignificant computational resources. A novel isogeometric topology optimization method with a hybrid parallelstr... This paper aims to solve large-scale and complex isogeometric topology optimization problems that consumesignificant computational resources. A novel isogeometric topology optimization method with a hybrid parallelstrategy of CPU/GPU is proposed, while the hybrid parallel strategies for stiffness matrix assembly, equationsolving, sensitivity analysis, and design variable update are discussed in detail. To ensure the high efficiency ofCPU/GPU computing, a workload balancing strategy is presented for optimally distributing the workload betweenCPU and GPU. To illustrate the advantages of the proposedmethod, three benchmark examples are tested to verifythe hybrid parallel strategy in this paper. The results show that the efficiency of the hybrid method is faster thanserial CPU and parallel GPU, while the speedups can be up to two orders of magnitude. 展开更多
关键词 Topology optimization high-efficiency isogeometric analysis CPU/GPU parallel computing hybrid OpenMPCUDA
下载PDF
Parallel computation of unified finite-difference time-domain for underwater sound scattering 被引量:2
5
作者 冯玉田 王朔中 《Journal of Shanghai University(English Edition)》 CAS 2008年第2期120-125,共6页
In this work, we treat scattering objects, water, surface and bottom in a truly unified manner in a parallel finitedifference time-domain (FDTD) scheme, which is suitable for distributed parallel computing in a mess... In this work, we treat scattering objects, water, surface and bottom in a truly unified manner in a parallel finitedifference time-domain (FDTD) scheme, which is suitable for distributed parallel computing in a message passing interface (MPI) programming environment. The algorithm is implemented on a cluster-based high performance computer system. Parallel computation is performed with different division methods in 2D and 3D situations. Based on analysis of main factors affecting the speedup rate and parallel efficiency, data communication is reduced by selecting a suitable scheme of task division. A desirable scheme is recommended, giving a higher speedup rate and better efficiency. The results indicate that the unified parallel FDTD algorithm provides a solution to the numerical computation of acoustic scattering. 展开更多
关键词 parallel computation finite-difference time-domain (FDTD) message passing interface (MPI) object scattering.
下载PDF
Parallel Computation of Shallow-water Model on Workstations Cluster 被引量:2
6
作者 Song Junqiang Sun An-clang, Li Xiaomei(epartment Of CO,mp’uter Science, Changsha Institute of Technology Hunan 410073, P.R. of China) 《Wuhan University Journal of Natural Sciences》 CAS 1996年第Z1期522-525,共4页
ParallelComputationofShallow-waterModelonWorkstationsClusterSongJunqiang;SunAn-clang,;LiXiaomei(epartmentOfC... ParallelComputationofShallow-waterModelonWorkstationsClusterSongJunqiang;SunAn-clang,;LiXiaomei(epartmentOfCO,mp'uterScience,... 展开更多
关键词 parallel computation of Shallow-water Model on Workstations Cluster
下载PDF
Optimization Techniques for GPU-Based Parallel Programming Models in High-Performance Computing
7
作者 Shuntao Tang Wei Chen 《信息工程期刊(中英文版)》 2024年第1期7-11,共5页
This study embarks on a comprehensive examination of optimization techniques within GPU-based parallel programming models,pivotal for advancing high-performance computing(HPC).Emphasizing the transition of GPUs from g... This study embarks on a comprehensive examination of optimization techniques within GPU-based parallel programming models,pivotal for advancing high-performance computing(HPC).Emphasizing the transition of GPUs from graphic-centric processors to versatile computing units,it delves into the nuanced optimization of memory access,thread management,algorithmic design,and data structures.These optimizations are critical for exploiting the parallel processing capabilities of GPUs,addressingboth the theoretical frameworks and practical implementations.By integrating advanced strategies such as memory coalescing,dynamic scheduling,and parallel algorithmic transformations,this research aims to significantly elevate computational efficiency and throughput.The findings underscore the potential of optimized GPU programming to revolutionize computational tasks across various domains,highlighting a pathway towards achieving unparalleled processing power and efficiency in HPC environments.The paper not only contributes to the academic discourse on GPU optimization but also provides actionable insights for developers,fostering advancements in computational sciences and technology. 展开更多
关键词 Optimization Techniques GPU-Based parallel Programming Models High-Performance computing
下载PDF
Parallel Computation of the Noncooperative Equilibria for Dynamic Games
8
作者 卢子芳 达庆利 徐南荣 《Journal of Southeast University(English Edition)》 EI CAS 1996年第2期88-93,共6页
A method of the parallel computation of the linear quadratic non cooperative dynamic games problem is proposed. The Lyapunov function is introduced, through which the form adapted to parallel computation of the open ... A method of the parallel computation of the linear quadratic non cooperative dynamic games problem is proposed. The Lyapunov function is introduced, through which the form adapted to parallel computation of the open loop Nash equilibrium strategies is gi 展开更多
关键词 parallel computation dynamic GAMES linear QUADRATIC STYLE
下载PDF
A PARALLEL COMPUTATION SCHEME FOR IMPLICIT RUNGE-KUTTA METHODS AND THE ITERATIVELY B-CONVERGENCE OF ITS NEWTON ITERATIVE PROCESS
9
作者 赵双锁 王昌银 《Numerical Mathematics A Journal of Chinese Universities(English Series)》 SCIE 1994年第1期54-66,共13页
In this paper, based on the implicit Runge-Kutta(IRK) methods, we derive a class of parallel scheme that can be implemented on the parallel computers with Ns(N is a positive even number) processors efficiently, and di... In this paper, based on the implicit Runge-Kutta(IRK) methods, we derive a class of parallel scheme that can be implemented on the parallel computers with Ns(N is a positive even number) processors efficiently, and discuss the iteratively B-convergence of the Newton iterative process for solving the algebraic equations of the scheme, secondly we present a strategy providing initial values parallelly for the iterative process. Finally, some numerical results show that our parallel scheme is higher efficient as N is not so large. 展开更多
关键词 IMPLICIT Range-Kutta methods NEWTON ITERATIVE process parallel computation iteratively B-CONVERGENCE
下载PDF
Preconditioned method in parallel computation
10
作者 Wu Ruichan Wei Jianing 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2006年第1期220-222,共3页
The grid equations in decomposed domain by parallel computation are soled, and a method of local orthogonalization to solve the large-scaled numerical computation is presented. It constructs preconditioned iteration m... The grid equations in decomposed domain by parallel computation are soled, and a method of local orthogonalization to solve the large-scaled numerical computation is presented. It constructs preconditioned iteration matrix by the combination of predigesting LU decomposition and local orthogonalization, and the convergence of solution is proved. Indicated from the example, this algorithm can increase the rate of computation efficiently and it is quite stable. 展开更多
关键词 grid equations parallel computation PRECONDITION LU decomposition local orthogonalization.
下载PDF
CLUSTER OF WORKSTATIONS BASED ON DYNAMIC LOAD BALANCING FOR PARALLEL TREE COMPUTATION DEPTH-FIRST-SEARCH
11
作者 加力 陆鑫达 张健 《Journal of Shanghai Jiaotong university(Science)》 EI 2002年第1期26-31,共6页
The real problem in cluster of workstations is the changes in workstation power or number of workstations or dynmaic changes in the run time behavior of the application hamper the efficient use of resources. Dynamic l... The real problem in cluster of workstations is the changes in workstation power or number of workstations or dynmaic changes in the run time behavior of the application hamper the efficient use of resources. Dynamic load balancing is a technique for the parallel implementation of problems, which generate unpredictable workloads by migration work units from heavily loaded processor to lightly loaded processors at run time. This paper proposed an efficient load balancing method in which parallel tree computations depth first search (DFS) generates unpredictable, highly imbalance workloads and moves through different phases detectable at run time, where dynamic load balancing strategy is applicable in each phase running under the MPI(message passing interface) and Unix operating system on cluster of workstations parallel platform computing. 展开更多
关键词 cluster of WORKSTATIONS parallel TREE computation DFS task migration dynamic load balancing strategy and TERMINATION detection algorithm
下载PDF
PARALLEL COMPUTATION OF SUPER-SONIC BLUNT BODY VISCOUS FLOWFIELDS IN PVM
12
作者 Yang Xiaohui, Wang Zhenghua, Wang Chengyao (Dept. 1, National University of Defense Technology, Changsha, 410073, China) 《Chinese Journal of Aeronautics》 SCIE EI CAS CSCD 1997年第4期26-30,共5页
Supersonic viscous flows past blunt bodies is calculated with TVD difference scheme and implicit Lower Upper Symmetric Gauss Seidel (LU SGS) method, and parallel programming designing software platform PVM is used b... Supersonic viscous flows past blunt bodies is calculated with TVD difference scheme and implicit Lower Upper Symmetric Gauss Seidel (LU SGS) method, and parallel programming designing software platform PVM is used based on message passing to distribute a large task according to some patching strategies to a large number of processors in the network. These processors accomplish this large task together. The marked improvement of computational efficiency in networks, especially in MPP system, demonstrates the potential vitality of CFD in engineering design. 展开更多
关键词 computational fluid dynamics supersonic speed blunt bodies viscous flow PVM patching strategy parallel mode SPEEDUP networks
下载PDF
Realistic Efficiency Evaluations for Parallel Computations under Workstation Cluster
13
作者 Mo Zeyao Li Xiaomei(Dept. of Computer, Changsha institute of Technology Changsha, China, 410073) 《Wuhan University Journal of Natural Sciences》 CAS 1996年第Z1期329-336,共8页
In recent years, high performance scientific computing under workstation cluster connected by local area network is becoming a hot point. Owing to both the longer latency and the higher overhead for protocol processin... In recent years, high performance scientific computing under workstation cluster connected by local area network is becoming a hot point. Owing to both the longer latency and the higher overhead for protocol processing compared with the powerful single workstation capacity, it is becoming severe important to keep balance not only for numerical load but also for communication load, and to overlap communications with computations while parallel computing. Hence,our efficiency evaluation rules must discover these capacities of a given parallel algorithm in order to optimize the existed algorithm to attain its highest parallel efficiency. The traditional efficiency evaluation rules can not succeed in this work any more. Fortunately, thanks to Culler's detail discuss in LogP model about interconnection networks for MPP systems, we present a system of efficiency evaluation rules for parallel computations under workstation cluster with PVM3.0 parallel software framework in this paper. These rules can satisfy above acquirements successfully. At last, two typical synchronous,and asynchronous applications are designed to verify the validity of these rules under 4 SGIs workstations cluster connected by Ethernet. 展开更多
关键词 parallel efficiency evaluation workstation cluster PVM. network parallel computations.
下载PDF
Combination Method for Parallel Computation in ODEs
14
作者 Song Xiaoqiu(Beijing Institute of Computer Application and Simulation Technology ) 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 1996年第4期19-26,共8页
In this paper, a 3rd order combination method with three processes and a 4th order combination method with five processes for solving ODEs are discussed. These methods are the Runge-Kutta method combined with a linear... In this paper, a 3rd order combination method with three processes and a 4th order combination method with five processes for solving ODEs are discussed. These methods are the Runge-Kutta method combined with a linear multistep method, which overcomes the defect of the 3rd order parallel Runge-Kutta method discussed in [1]. 展开更多
关键词 SOFTWARE RELIABILITY Numerical analysis Combination method parallel computation ODEs.
下载PDF
A Class of Stable Difference Schemes For Linear Elliptic PDEs And Their Asynchronous Parallel Computation
15
作者 Wen Shangmeng Li Xiaomei(Dept. of Compactor, Changsha I.nstitutc of TechnologyChangsha, Hunan 410073, P.R. of China) 《Wuhan University Journal of Natural Sciences》 CAS 1996年第Z1期553-556,共4页
This paper improves and generalizes the two difference schemes presented in paper [1] and gives a new difference scheme for second order linear elliptic partial differential equations, its difference matrix is a matri... This paper improves and generalizes the two difference schemes presented in paper [1] and gives a new difference scheme for second order linear elliptic partial differential equations, its difference matrix is a matrix and because of the stability of the M-matrix, it is convergent by the asynchronous iterative method on multiprocessors. Then this paper gives a class of differeifce schemes for linear elliptic PDEs so that their difference matrixes are all M-matrixes and their asynchronous parallel computation are convergent. 展开更多
关键词 difference scheme partial differential equation parallel computation.
下载PDF
A Parallel Computational Scheme on Hybrid Methods
16
作者 Zhao ShuangsuoDepartment of Mathematics, Lanzhou University, Lanzhou 730000Wang ChangyinInst. of Mech. & Elec. Eng., Gansu University of Technology, Lanzhou 730050 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 1994年第4期8-18,共11页
Based on the efficient hybrid methods for solving initial value problems of stiff ODEs, this paper derives a parallel scheme that can be used to solve the problems on parallel computers with N processors, and discusse... Based on the efficient hybrid methods for solving initial value problems of stiff ODEs, this paper derives a parallel scheme that can be used to solve the problems on parallel computers with N processors, and discusses the iteratively B-convergence of the Newton iterative process, finally, the paper provides some numberical results which show that the parallel scheme is highly efficient as N is not too large. 展开更多
关键词 Hybrid methods parallel computation Iteratively B-convergence.
下载PDF
Development of high performance casting analysis software by coupled parallel computation
17
作者 Sang Hyun CHO Jeong Kil CHOI 《China Foundry》 SCIE CAS 2007年第3期215-219,共5页
Up to now,so much casting analysis software has been continuing to develop the new access way to real casting processes. Those include the melt flow analysis,heat transfer analysis for solidification calculation,mecha... Up to now,so much casting analysis software has been continuing to develop the new access way to real casting processes. Those include the melt flow analysis,heat transfer analysis for solidification calculation,mechanical property predictions and microstructure predictions. These trials were successful to obtain the ideal results comparing with real situations,so that CAE technologies became inevitable to design or develop new casting processes. But for manufacturing fields,CAE technologies are not so frequently being used because of their difficulties in using the software or insufficient computing performances. To introduce CAE technologies to manufacturing field,the high performance analysis is essential to shorten the gap between product designing time and prototyping time. The software code optimization can be helpful,but it is not enough,because the codes developed by software experts are already optimized enough. As an alternative proposal for high performance computations,the parallel computation technologies are eagerly being applied to CAE technologies to make the analysis time shorter. In this research,SMP (Shared Memory Processing) and MPI (Message Passing Interface) (1) methods for parallelization were applied to commercial software "Z-Cast" to calculate the casting processes. In the code parallelizing processes,the network stabilization,core optimization were also carried out under Microsoft Windows platform and their performances and results were compared with those of normal linear analysis codes. 展开更多
关键词 parallel computation message passing interface casting analysis SMP performance improvement
下载PDF
Parallel Computation of Fourier Transform on Distributed Memory Computer System
18
作者 Yihui Yan Qingfeng Hu Xinfang He 《Wuhan University Journal of Natural Sciences》 CAS 1996年第Z1期557-560,共4页
Multicomputer systems(distributed memory computer systems) are becoming more and more popular and will be wildly used in scientific researches. In this paper, we present a parallel algorithm of Fourier Transform of a ... Multicomputer systems(distributed memory computer systems) are becoming more and more popular and will be wildly used in scientific researches. In this paper, we present a parallel algorithm of Fourier Transform of a vector of complex numbers on multicomputer system and give its computing times and its speedup in parallel environment supported by EXPRESS system on the multicomputer system which consists of four SGI workstations. Our analysis shows that the results is ideal and this scheme is suitable to multicomputer systems. 展开更多
关键词 Fourier Transform Distributed Memory computer System parallel computing
下载PDF
Optimizing photoacoustic image reconstruction using cross-platform parallel computation
19
作者 Tri Vu Yuehang Wang Jun Xia 《Visual Computing for Industry,Biomedicine,and Art》 2018年第1期12-17,共6页
Three-dimensional(3D)image reconstruction involves the computations of an extensive amount of data that leads to tremendous processing time.Therefore,optimization is crucially needed to improve the performance and eff... Three-dimensional(3D)image reconstruction involves the computations of an extensive amount of data that leads to tremendous processing time.Therefore,optimization is crucially needed to improve the performance and efficiency.With the widespread use of graphics processing units(GPU),parallel computing is transforming this arduous reconstruction process for numerous imaging modalities,and photoacoustic computed tomography(PACT)is not an exception.Existing works have investigated GPU-based optimization on photoacoustic microscopy(PAM)and PACT reconstruction using compute unified device architecture(CUDA)on either C++or MATLAB only.However,our study is the first that uses cross-platform GPU computation.It maintains the simplicity of MATLAB,while improves the speed through CUDA/C++−based MATLAB converted functions called MEXCUDA.Compared to a purely MATLAB with GPU approach,our cross-platform method improves the speed five times.Because MATLAB is widely used in PAM and PACT,this study will open up new avenues for photoacoustic image reconstruction and relevant real-time imaging applications. 展开更多
关键词 Photoacoustic computed tomography Graphics processing units parallel computation Focal-line backprojection algorithm MATLAB Optical imaging
下载PDF
Energy-efficient task allocation for reliable parallel computation of cluster-based wireless sensor network in edge computing
20
作者 Jiabao Wen Jiachen Yang +2 位作者 Tianying Wang Yang Li Zhihan Lv 《Digital Communications and Networks》 SCIE CSCD 2023年第2期473-482,共10页
To efficiently complete a complex computation task,the complex task should be decomposed into subcomputation tasks that run parallel in edge computing.Wireless Sensor Network(WSN)is a typical application of parallel c... To efficiently complete a complex computation task,the complex task should be decomposed into subcomputation tasks that run parallel in edge computing.Wireless Sensor Network(WSN)is a typical application of parallel computation.To achieve highly reliable parallel computation for wireless sensor network,the network's lifetime needs to be extended.Therefore,a proper task allocation strategy is needed to reduce the energy consumption and balance the load of the network.This paper proposes a task model and a cluster-based WSN model in edge computing.In our model,different tasks require different types of resources and different sensors provide different types of resources,so our model is heterogeneous,which makes the model more practical.Then we propose a task allocation algorithm that combines the Genetic Algorithm(GA)and the Ant Colony Optimization(ACO)algorithm.The algorithm concentrates on energy conservation and load balancing so that the lifetime of the network can be extended.The experimental result shows the algorithm's effectiveness and advantages in energy conservation and load balancing. 展开更多
关键词 Wireless sensor network parallel computation Task allocation Genetic algorithm Ant colony optimization algorithm ENERGY-EFFICIENT Load balancing
下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部