Journal articles
56 articles found
On the Communication-Efficiency of Parallel Systems (Cited by: 1)
1
Authors: Lin Hong, Chen Guoliang, Wang Min (Department of Computer Science, USTC, Hefei, Anhui, PRC; Department of Electronic Engineering and Information Science, USTC, Hefei, Anhui, PRC). Wuhan University Journal of Natural Sciences (CAS), 1996, Issue Z1, pp. 325-328 (4 pages)
This paper defines the communication-efficiency, which is directly related to the cost-efficiency, and studies the relationship between the communication-efficiency and the processor-efficiency when they are applied to scalability analysis. An example algorithm is given to analyze some typical architectures.
Keywords: parallel processing ALGORITHM ARCHITECTURE EFFICIENCY SCALABILITY
Efficient Partially Asynchronous Parallel Simulation on Multicomputer Systems: Research and Practice
2
Authors: Chen, Delai; Hong, Bo; Xie, Zhiwu; Weng, Shilie. Journal of Systems Engineering and Electronics (SCIE, EI, CSCD), 1998, Issue 2, pp. 40-47 (8 pages)
This paper presents partially asynchronous parallel simulation of continuous systems (PAPSoCS) and some approaches to its implementation on a multicomputer system. To guarantee correct simulation results and speed up the simulation, a scheme for efficient PAPSoCS is proposed, and a virtual star topology is constructed to match the message-passing paths, addressing the algorithm-architecture adequation problem. Because the messages frequently passed between processors are short, typically within several 4 bytes, asynchronous communication mode is employed to reduce the communication ratio. Experimental results show that asynchronous parallel simulation has much higher efficiency than its synchronous counterpart.
Keywords: parallel processing Asynchronous computation Virtual topology Multicomputer system SIMULATION
Fourier hologram method using Gerchberg-Saxton algorithm for parallel femtosecond laser processing
3
Authors: 汪金礼, 苏亚辉, 崔洪涛, 杨亮, 方志伟. Journal of Measurement Science and Instrumentation (CAS), 2012, Issue 3, pp. 304-306 (3 pages)
In order to improve femtosecond laser throughput, a parallel processing system using a liquid crystal on silicon (LCOS) device as the spatial light modulator is put forward. A method is described for displaying a Fourier hologram on the LCOS, and high uniformity of several diffraction peaks in the computer reconstruction is achieved. Application of this method to parallel femtosecond laser processing is also demonstrated: two intersecting rings and three tangent rings are each fabricated in the photoresist in a single step.
Keywords: femtosecond laser parallel processing liquid crystal on silicon (LCOS) Fourier hologram Gerchberg-Saxton (GS) algorithm
Parallel Processing Design for LTE PUSCH Demodulation and Decoding Based on Multi-Core Processor
4
Authors: Zhang Ziran, Li Jun, Li Changxiao (ZTE Corporation, Shenzhen 518057, P.R. China). ZTE Communications, 2009, Issue 1, pp. 54-58 (5 pages)
The Long Term Evolution (LTE) system imposes strict requirements on dispatching delay. Moreover, the very high air interface rate of LTE demands strong processing capability from the devices that process the baseband signals. Consequently, a single-core processor cannot meet the requirements of an LTE system. This paper analyzes how to use multi-core processors to achieve parallel processing of uplink demodulation and decoding in LTE systems and designs an approach to parallel processing. The test results show that this approach works quite well.
Keywords: CORE LTE parallel processing Design for LTE PUSCH Demodulation and Decoding Based on Multi-Core Processor Design
Efficient Parallel Processing of k-Nearest Neighbor Queries by Using a Centroid-based and Hierarchical Clustering Algorithm
5
Author: Elaheh Gavagsaz. Artificial Intelligence Advances, 2022, Issue 1, pp. 26-41 (16 pages)
The k-Nearest Neighbor method is one of the most popular techniques for both classification and regression purposes. Because of its operation, the application of this classification may be limited to problems with a certain number of instances, particularly when run time is a consideration. However, the classification of large amounts of data has become a fundamental task in many real-world applications. It is logical to scale the k-Nearest Neighbor method to large-scale datasets. This paper proposes a new k-Nearest Neighbor classification method (KNN-CCL) which uses a parallel centroid-based and hierarchical clustering algorithm to separate the sample of the training dataset into multiple parts. The introduced clustering algorithm uses four stages of successive refinements and generates high-quality clusters. The k-Nearest Neighbor approach subsequently makes use of them to predict the test datasets. Finally, sets of experiments are conducted on the UCI datasets. The experimental results confirm that the proposed k-Nearest Neighbor classification method performs well with regard to classification accuracy and performance.
Keywords: CLASSIFICATION k-Nearest Neighbor Big data CLUSTERING parallel processing
Parallel Iterative FEM Solver with Initial Guess for Frequency Domain Electromagnetic Analysis
6
Authors: Woochan Lee, Woobin Park, Jaeyoung Park, Young-Joon Kim, Moonseong Kim. Intelligent Automation & Soft Computing (SCIE), 2023, Issue 5, pp. 1585-1602 (18 pages)
The finite element method is a key player in computational electromagnetics for designing RF (Radio Frequency) components such as waveguides. Frequency-domain analysis is fundamental to identifying the characteristics of the components. For conventional frequency-domain electromagnetic analysis using FEM (Finite Element Method), the system matrix is complex-valued as well as indefinite. Iterative solvers can be faster than a direct solver when convergence is guaranteed and reached in a few steps. However, it is hard to exploit the merits of iterative solvers for such complex-valued, indefinite systems. It is also hard to benefit from matrix factorization techniques because parts of the system matrix vary with frequency. Overall, it is hard to adopt conventional iterative solvers even though the system matrix is sparse. In this paper, a new parallel iterative FEM solver for frequency-domain analysis is implemented for inhomogeneous waveguide structures. In this implementation, the previous solution of the preconditioned iterative solver of Matlab (Matrix Laboratory) is used as the initial guess for the next step's solution process. An overlapped parallel stage using Matlab's Parallel Computing Toolbox is also proposed to alleviate the cold start, which ruins the convergence of the early steps in each parallel stage. Numerical experiments based on waveguide structures have demonstrated the accuracy and efficiency of the proposed scheme.
Keywords: Computational electromagnetics numerical simulation finite element method parallel processing iterative solvers
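Note: the warm-start idea in the abstract above (reusing the previous solution as the initial guess for the next solve) can be illustrated with a minimal Python sketch. This is not the paper's Matlab solver or preconditioner: it assumes a plain Jacobi iteration on a tiny real, diagonally dominant system, and `system_matrix`, `sweep`, and the shift values are hypothetical stand-ins for a frequency sweep.

```python
def jacobi(a, b, x0, tol=1e-10, max_iter=10_000):
    """Solve a x = b by Jacobi iteration; return (solution, iterations used)."""
    n = len(b)
    x = list(x0)
    for iteration in range(1, max_iter + 1):
        new_x = [
            (b[i] - sum(a[i][j] * x[j] for j in range(n) if j != i)) / a[i][i]
            for i in range(n)
        ]
        change = max(abs(new_x[i] - x[i]) for i in range(n))
        x = new_x
        if change < tol:
            return x, iteration
    raise RuntimeError("Jacobi iteration did not converge")


def system_matrix(shift):
    """Hypothetical stand-in for a frequency-dependent system matrix."""
    return [[4.0 + shift, 1.0], [1.0, 3.0 + shift]]


def sweep(shifts, b, warm_start):
    """Solve one system per shift; optionally reuse the previous solution as x0."""
    x = [0.0] * len(b)
    iterations = []
    for shift in shifts:
        x0 = x if warm_start else [0.0] * len(b)
        x, used = jacobi(system_matrix(shift), b, x0)
        iterations.append(used)
    return x, iterations
```

Calling `sweep([0.0, 0.01, 0.02], [1.0, 2.0], warm_start=True)` and comparing its iteration counts with `warm_start=False` shows the effect: after the first step, the warm-started solves begin much closer to the answer and stop sooner.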
Design of ANN Based Non-Linear Network Using Interconnection of Parallel Processor
7
Authors: Anjani Kumar Singha, Swaleha Zubair, Areej Malibari, Nitish Pathak, Shabana Urooj, Neelam Sharma. Computer Systems Science & Engineering (SCIE, EI), 2023, Issue 9, pp. 3491-3508 (18 pages)
Suspicious mass traffic constantly evolves, making network behaviour tracing and structure more complex. Neural networks yield promising results by considering a sufficient number of processing elements with strong interconnections between them. They offer efficient computational Hopfield neural network models and optimization constraints, using a good amount of parallelism to yield optimal results. An artificial neural network (ANN) offers optimal solutions in classifying and clustering the various reels of data, and the results obtained depend purely on identifying a problem. In this research work, the design of optimized applications is presented in an organized manner. In addition, this research work examines theoretical approaches to achieving optimized results using ANN. It mainly focuses on designing rules. The optimizing design approach analyzes the internal process of the neural networks. Practices in developing the network are based on the interconnections among the hidden nodes and their learning parameters. The methodology is shown to work best for nonlinear resource allocation problems with a suitable design and for complex issues. The ANN proposed here considers approximately 46k nodes hidden inside 49 million connections, deployed on full-fledged parallel processors. The proposed ANN offered optimal results in real-world application problems, and the results were obtained using MATLAB.
Keywords: Artificial neural network (ANN) MULTIPROCESSOR hidden node nonlinear optimization parallel processing
Development and Application of High-efficiency, Interactive Processing Functions of SP2 Parallel Computer
8
China Oil & Gas (CAS), 2000, Issue 2, p. 44 (1 page)
Keywords: Development and Application of High-efficiency Interactive processing Functions of SP2 parallel Computer SP
APPLICATION OF MVP IN REAL TIME IMAGE PROCESSING
9
Authors: 戴擎宇, 杨占昕, 何佩琨. Chinese Journal of Aeronautics (SCIE, EI, CSCD), 2000, Issue 1, pp. 30-33 (4 pages)
MVP is a digital signal processor with a MIMD structure that is suited to multimedia applications. It contains several processors, and its operation is characterized by parallelism and pipelining; therefore, real-time signal processing can be done on it. This paper presents an image processing system based on MVP, explains the principles of parallel task assignment and hardware pipeline design, and gives examples of target tracking and edge detection.
Keywords: Computer hardware Edge detection Image processing MIM devices Multimedia systems parallel processing systems Random access storage
Parallel Algorithm Case Study Using Virtual Machine
10
Author: Zhang, Gongqing. Journal of Systems Engineering and Electronics (SCIE, EI, CSCD), 1997, Issue 3, pp. 51-59 (9 pages)
This paper takes the Sobel operator as an example to study the parallelization of a sequential algorithm on a shared-memory multiprocessor by using a virtual machine. Several different parallel algorithms using function decomposition and/or data decomposition methods are compared, and their performance is analyzed in terms of processor utilization, data traffic, shared memory access, and synchronization overhead. The analysis is validated through a simulation experiment on a virtual machine with 64 parallel processors. Conclusions are presented at the end of the paper.
Keywords: Computer simulation Data processing Data storage equipment Image processing parallel processing systems SYNCHRONIZATION Telecommunication traffic
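Note: the data decomposition mentioned in the abstract above can be sketched in Python as follows. This is only an assumed illustration, not the paper's algorithm or its virtual machine: the image is split into contiguous bands of rows, each band is handed to a worker, and the gradient magnitude is approximated as |gx| + |gy|. The names `sobel_rows` and `sobel_parallel` are hypothetical, and threads in CPython show the structure rather than a real speed-up.

```python
from concurrent.futures import ThreadPoolExecutor

# Standard 3x3 Sobel kernels for horizontal and vertical gradients.
GX = ((-1, 0, 1), (-2, 0, 2), (-1, 0, 1))
GY = ((-1, -2, -1), (0, 0, 0), (1, 2, 1))


def sobel_rows(image, row_start, row_end):
    """Return |gx| + |gy| for interior pixels of rows in [row_start, row_end)."""
    width = len(image[0])
    band = []
    for r in range(row_start, row_end):
        out_row = []
        for c in range(1, width - 1):
            gx = gy = 0
            for dr in range(3):
                for dc in range(3):
                    pixel = image[r + dr - 1][c + dc - 1]
                    gx += GX[dr][dc] * pixel
                    gy += GY[dr][dc] * pixel
            out_row.append(abs(gx) + abs(gy))
        band.append(out_row)
    return band


def sobel_parallel(image, workers=4):
    """Data decomposition: one contiguous band of interior rows per task."""
    height = len(image)
    interior = height - 2
    if interior < 1:
        return []
    workers = max(1, min(workers, interior))
    step = -(-interior // workers)  # ceiling division
    bounds = [(s, min(s + step, height - 1)) for s in range(1, height - 1, step)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        bands = pool.map(lambda b: sobel_rows(image, *b), bounds)
        # Bands come back in submission order, so rows stay in image order.
        return [row for band in bands for row in band]
```

Each band only reads the shared input image and writes its own output rows, so in this sketch the workers need no synchronization beyond the final join.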
Operation of three parallel AN/AO processes to enrich denitrifying phosphorus removing bacteria for low strength wastewater treatment (Cited by: 6)
11
Authors: XIA Si-qing, LIU Hong-bo. Journal of Environmental Sciences (SCIE, EI, CAS, CSCD), 2006, Issue 3, pp. 433-438 (6 pages)
Three parallel anaerobic-anoxic/anaerobic-aerobic (AN/AO) processes were developed to enrich denitrifying phosphorus removal bacteria (DPB) for low strength wastewater treatment. The main body of the parallel AN/AO process consists of an AN (anaerobic-anoxic) process and an AO (anaerobic-aerobic) process. In the AO process, the common phosphorus accumulating organisms (PAOs) were dominant, while in the AN process, DPB were dominant. The volume ratio of anaerobic zone (Vana) : anoxic zone (Vano) : aerobic zone (Vaer) for the parallel AN/AO process is 1:1:1, in contrast with a Vana:Vaer and Vano:Vaer of 1:2 and 1:4 for a traditional biological nutrient removal (BNR) process. Process 3 performed best among the three processes in terms of COD, TN and TP removal. Over 4 months of operation, the effluent COD concentration of process 3 did not exceed 60 mg/L, the effluent TN concentration was lower than 15 mg/L, and the effluent TP concentration was lower than 1 mg/L.
Keywords: parallel AN/AO process denitrifying phosphorus removing bacteria (DPB) low strength wastewater biological nutrient removal (BNR)
A Neuron-Oriented Programming System (Cited by: 3)
12
Author: 李涛. High Technology Letters (EI, CAS), 2001, Issue 1, pp. 70-73 (4 pages)
A neuron-oriented programming system based on parallel neural information processing is presented. Built upon 4 to 8 processing elements (TMS C30), the system provides users with a high-speed, general-purpose, large-scale platform for developing neural network applications.
Keywords: Neural networks parallel processing Programming system
Assigning Task by Parallel Genetic Algorithm Based on PVM (Cited by: 1)
13
Authors: Zheng Zhi jun, Dong Xiao she, Zheng Shou qi (Department of Computer Science and Technology, Xi'an Jiaotong University, Xi'an 710049, China). Wuhan University Journal of Natural Sciences (CAS), 2001, Issue Z1, pp. 579-584 (6 pages)
Genetic algorithms have been proposed to solve the problem of task assignment. However, they have some drawbacks: for example, they often take a long time to find an optimal solution, and the success rate is low. To overcome these problems, a new coarse-grained parallel genetic algorithm with a central migration scheme is presented, which exploits isolated subpopulations. The new approach has been implemented in the PVM environment and evaluated on a workstation network for solving the task assignment problem. The results show that it not only significantly improves result quality but also finds the best solution faster.
Keywords: task assignment genetic algorithm parallel process PVM
The inversion of density structure by graphic processing unit (GPU) and identification of igneous rocks in Xisha area (Cited by: 1)
14
Authors: Lei Yu, Jian Zhang, Wei Lin, Rongqiang Wei, Shiguo Wu. Earthquake Science, 2014, Issue 1, pp. 117-125 (9 pages)
Organic reefs, the targets of deep-water petroleum exploration, are widely developed in the Xisha area. However, there are concealed igneous rocks under the sea, whose wave impedance is nearly equal to that of organic reefs. Because of their similar seismic reflection characteristics, the igneous rocks interfere with future exploration. Yet the density and magnetism of organic reefs are very different from those of igneous rocks, so gravity and magnetic data have obvious advantages for distinguishing the two. First, frequency decomposition was applied to the free-air gravity anomaly in the Xisha area to obtain the 2D subdivision of the gravity anomaly and magnetic anomaly in the vertical direction. The distribution of igneous rocks in the horizontal direction can thus be obtained from the high-frequency field, the low-frequency field, and their physical properties. Then, 3D forward modeling of the gravitational field was carried out to establish the density model of this area, with reference to the physical properties of rocks from earlier research. Furthermore, 3D inversion of the gravity anomaly by a genetic algorithm with graphic processing unit (GPU) parallel processing was applied to the Xisha target area, and the 3D density structure of this area was obtained. In this way, the igneous rocks can be confined to a certain depth according to their density. The frequency decomposition and the GPU-parallel genetic-algorithm 3D inversion of the gravity anomaly proved to be a useful method for locating igneous rocks in their 3D geological position. Organic reefs and igneous rocks can therefore be distinguished, which provides advance information for further exploration.
Keywords: Xisha area Organic reefs and igneous rocks Frequency decomposition of potential field 3D inversion of the graphic processing unit (GPU) parallel processing
A Granularity-Aware Parallel Aggregation Method for Data Streams
15
Authors: WANG Yong-li, XU Hong-bing, XU Li-zhen, QIAN Jiang-bo, LIU Xue-jun. Wuhan University Journal of Natural Sciences (EI, CAS), 2006, Issue 1, pp. 133-137 (5 pages)
This paper focuses on the parallel aggregation processing of data streams based on the shared-nothing architecture. A novel granularity-aware parallel aggregation model is proposed. It employs parallel sampling and linear regression to describe the characteristics of the data quantity in the query window in order to determine the partition granularity of tuples, and uses an equal-depth histogram to implement partitioning. This method can avoid data skew and reduce communication cost. Experimental results on both synthetic and real data show that the proposed method is efficient, practical, and suitable for processing time-varying data streams.
Keywords: data streams parallel processing linear regression AGGREGATION data skew
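Note: the equal-depth histogram partitioning named in the abstract above can be sketched in Python as follows. This covers only the histogram step under simple assumptions (a small in-memory sample, numeric keys); the paper's parallel sampling and linear regression are not reproduced, and the function names are hypothetical.

```python
from bisect import bisect_right


def equal_depth_boundaries(sample, partitions):
    """Return partitions - 1 cut points so each bucket holds about the same
    number of sampled values (an equal-depth histogram)."""
    if partitions < 1 or not sample:
        raise ValueError("need at least one partition and a non-empty sample")
    ordered = sorted(sample)
    n = len(ordered)
    return [ordered[(i * n) // partitions] for i in range(1, partitions)]


def assign_partition(value, boundaries):
    """Route a value to the bucket whose range contains it."""
    return bisect_right(boundaries, value)


def partition_counts(values, boundaries):
    """Count how many values land in each bucket."""
    counts = [0] * (len(boundaries) + 1)
    for value in values:
        counts[assign_partition(value, boundaries)] += 1
    return counts
```

On a skewed sample, equal-width buckets would send most tuples to one node, whereas cut points taken from the sorted sample keep the buckets roughly balanced. Heavy duplication of a single key would still overload one bucket in this simple version.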
Key-Value Store Coupled with an Operating System for Storing Large-Scale Values
16
Authors: Jeonghwan Im, Hyuk-Yoon Kwon. Computers, Materials & Continua (SCIE, EI), 2022, Issue 11, pp. 3333-3350 (18 pages)
The key-value store provides flexibility of data types because it does not need the data types to be specified in advance and can store any type of data as the value of a key-value pair. Various studies have been conducted to improve the performance of the key-value store while maintaining its flexibility. However, research on storing large-scale values such as multimedia data files (e.g., images or videos) in the key-value store has been limited. In this study, we propose a new key-value store, WR-Store++, which aims to store large-scale values stably. Specifically, it provides a new design that separates data and index by working with the built-in data structure of the Windows operating system and the file system. Using the built-in data structure of the Windows operating system makes the key-value store efficient, and using the file system significantly extends the limited storage space. We also present chunk-based memory management and parallel processing in WR-Store++ to further improve the performance of the GET operation. Through experiments, we show that WR-Store++ can store datasets at least 32.74 times larger than the existing baseline key-value store, WR-Store, which is limited in storing large-scale datasets. Furthermore, in terms of processing efficiency, we show that WR-Store++ outperforms not only WR-Store but also other state-of-the-art key-value stores (LevelDB, RocksDB, and BerkeleyDB) for individual key-value operations and mixed workloads.
Keywords: Key-value stores large-scale values chunk-based memory management parallel processing
PARALLEL COMPUTATIONAL ALGORITHM OF SUBSTRUCTURE METHOD OF LARGE-SCALE STRUCTURE ANALYSIS
17
Author: 张汝清. Applied Mathematics and Mechanics (English Edition) (SCIE, EI), 1991, Issue 1, pp. 93-100 (8 pages)
In this paper, according to the parallel environment of the ELXSI computer, a parallel solution process for the substructure method in static and dynamic analyses of large-scale, complex structures is put forward, and the corresponding parallel computational program has been developed.
Keywords: Computer Programming ALGORITHMS Computer systems Digital parallel processing
Fast simulation of flexible multibody dynamics with electric-hydraulic drive system
18
Authors: Etsujiro Imanishi, Takao Nanjo. Theoretical & Applied Mechanics Letters (CAS), 2013, Issue 1, pp. 19-24 (6 pages)
Hardware-in-the-loop simulation (HILS), which combines multibody dynamics (MBD) simulation with the actual mechanical system, has been investigated in the field of MBD. Fast simulation is necessary for a HILS system because it requires real-time simulation. This paper presents a fast simulation technique using the domain decomposition method with iteration for a flexible multibody system in which a flexible linkage system and an electro-hydraulic drive system are coupled with each other. © 2013 The Chinese Society of Theoretical and Applied Mechanics. [doi:10.1063/2.1301301]
Keywords: multibody dynamics SIMULATION construction machinery parallel processing
A High Speed Signal Processing Machine - Its Architecture, Language and Compiler
19
Authors: Wang Yufei and Yu Shiqi (Beijing Institute of Data Processing Technology, P.O. Box 3927, Beijing 100039, China). Journal of Systems Engineering and Electronics (SCIE, EI, CSCD), 1991, Issue 1, pp. 119-128 (10 pages)
A systolic array architecture computer (FXCQ) has been designed for signal processing. It can handle floating-point data at very high speed. It is composed of 16 processing cells and a cache that are connected linearly and form a ring structure. All processing cells are identical and programmable. Each processing cell has a peak performance of 20 million floating-point operations per second (20 MFLOPS). The machine therefore has a peak performance of 320 MFLOPS. It is integrated as an attached processor into a host system through a VME bus interface. Programs for FXCQ are written in a high-level language, the B language, which is supported by a parallel optimizing compiler. This paper describes the architecture of FXCQ, the B language, and its compiler.
Keywords: parallel processing Systolic array processor parallel language Compiler.
Seismic 3D Prestack Time Migration on Parallel Computers
20
Authors: Liu Ping & Chen Yunhe (Wuhan Digital Engineering Institute, Wuchang P.O. Box 74223, P.R. China 430074). Journal of Systems Engineering and Electronics (SCIE, EI, CSCD), 1995, Issue 3, pp. 49-55 (7 pages)
Parallel versions of the prestack Kirchhoff 3D integral migration algorithm, which is suitable for seismic data processing, are described in this paper. Firstly, the inherent parallel characteristics of seismic data processing are analyzed. Then some principles of algorithm partitioning are discussed. Based on these analyses and on the system architecture and communication mechanism, the algorithm is divided into four subtasks allocated to four nodes of 990 STAR-l. Then we describe in detail a module-partitioning method: I/O processing and communication are separated from the computation process; the process that includes I/O processing and communication is allocated to the transputer T805, and the other is allocated to the processor i860. These two processes are synchronized by shared memory and a memory-lock mechanism, while communication between different nodes is implemented through transputer links. Load balancing among the four processor modules is performed dynamically. Finally, we discuss the speed-up of the parallel versions of the prestack Kirchhoff 3D integral migration algorithm running on four nodes. Some directions for further research are also mentioned.
Keywords: parallel algorithm 3D prestack migration parallel processing