Funding: Program for New Century Excellent Talents in University (No. NCET-08-0118); Specialized Research Fund for the Doctoral Program of Higher Education (No. 20090092110049)
Abstract: A finite-element model of the thermosetting epoxy asphalt mixture (EAM) microstructure is developed to simulate the indirect tension test (IDT). Image techniques are used to capture the EAM microstructure, which is divided into two phases: aggregates and mastic. A viscoelastic constitutive relationship, obtained from the results of a creep test, is used to represent the mastic phase at intermittent temperatures. The simulated stiffness modulus in the IDT compares favorably with experimental data. Different loading directions and velocities are applied to assess their influence on the modulus and on the localized stress of the microstructure model. The modulus is not consistent when the loading direction changes, because the internal structure of the mixture is heterogeneously distributed, and the loading velocity affects the localized stress as a result of the viscoelasticity of the mastic. The results provide a theoretical basis for the finite-element method, which can be extended to numerical simulations of the micromechanical behavior of asphalt mixtures.
Funding: Supported by the National Natural Science Foundation of China (Nos. 41104083 and 40804024) and the Fundamental Research Funds for the Central Universities (No. 2011YYL022)
Abstract: The most popular hardware for parallel depth migration is the PC cluster, but its application is limited by its large space requirements and high power consumption. In this paper, we introduce a new hardware architecture on which finite difference (FD) wavefield-continuation depth migration can be conducted using the Graphics Processing Unit (GPU) as a CPU coprocessor. We describe the program module and three key optimization steps for implementing FD depth migration (memory, thread structure, and instruction optimizations) and consider methods for evaluating the degree of optimization. 2D and 3D models are used to test depth migration on the GPU. The test results show that the general-purpose GPU greatly increases the computational efficiency of depth migration, by at least 25 times compared to the AMD 2.5 GHz CPU.
Funding: Projects (61170049, 60903044) supported by the National Natural Science Foundation of China; Project (2012AA010903) supported by the National High Technology Research and Development Program of China
Abstract: The particle-in-cell (PIC) method has benefited greatly from GPU-accelerated heterogeneous systems. However, the performance of PIC is constrained by the interpolation operations in the weighting process on the GPU (graphics processing unit). To address this problem, a fast weighting method for PIC simulation on GPU-accelerated systems was proposed that avoids atomic memory operations during the weighting process. The method was implemented by taking advantage of the GPU's thread synchronization mechanism and by dividing the problem space properly. Moreover, software-managed shared memory on the GPU was employed to buffer the intermediate data. The experimental results show that the method achieves speedups of up to 3.5 times compared to previous work, and runs 20.08 times faster on one NVIDIA Tesla M2090 GPU than on a single core of an Intel Xeon X5670 CPU.
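The abstract above does not say how the problem space is divided, so the following is only a rough serial sketch of one way a weighting step can avoid conflicting writes: particles are binned by cell, each cell accumulates private partial sums (standing in for a thread block with a shared-memory buffer), and each grid node is then written exactly once. The function name `deposit_by_cell`, the 1D grid, and the linear (cloud-in-cell) weights are assumptions made for illustration and are not taken from the paper.

```python
def deposit_by_cell(positions, ncell, charge=1.0):
    """Deposit particle charge onto ncell + 1 grid nodes with linear weights.

    Positions are in grid units, 0 <= x < ncell. This is a serial stand-in
    for a GPU scheme; the decomposition is an assumption, not the paper's.
    """
    # Phase 1: bin particles by the cell that contains them.
    cells = [[] for _ in range(ncell)]
    for x in positions:
        if not 0 <= x < ncell:
            raise ValueError("position outside the grid")
        cells[int(x)].append(x)

    # Phase 2: each cell sums its own particles into private buffers.
    # On a GPU this loop body would be one thread block using shared memory.
    left = [0.0] * ncell   # contribution of cell i to node i
    right = [0.0] * ncell  # contribution of cell i to node i + 1
    for i, members in enumerate(cells):
        for x in members:
            frac = x - i
            left[i] += charge * (1.0 - frac)
            right[i] += charge * frac

    # Phase 3 (after a synchronization point): every node gathers from its
    # two neighbouring cells, so each node is written exactly once.
    nodes = []
    for j in range(ncell + 1):
        value = left[j] if j < ncell else 0.0
        if j > 0:
            value += right[j - 1]
        nodes.append(value)
    return nodes


if __name__ == "__main__":
    print(deposit_by_cell([0.25, 0.5, 1.75, 2.0, 2.5], 3))
```

Because every output location has a single writer in phases 2 and 3, no atomic addition is needed; the cost is the extra binning pass.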
Funding: Project (61372136) supported by the National Natural Science Foundation of China
Abstract: The design, analysis, and parallel implementation of the particle filter (PF) were investigated. First, to tackle the particle degeneracy problem in the PF, an iterated importance density function (IIDF) was proposed, in which a new term associated with the current measurement information (CMI) was introduced into the expression of the sampled particles. Through repeated use of the least squares estimate, the CMI can be integrated into the sampling stage in an iterative manner, which greatly improves the sampling quality. Running the IIDF yields an iterated PF (IPF). Subsequently, a parallel resampling (PR) scheme was proposed for the parallel implementation of the IPF; its main idea is the same as that of systematic resampling (SR), but it is performed differently. The PR directly uses the integer part of the product of the particle weight and the particle number as the number of times a particle is replicated, and it simultaneously eliminates the particles with the smallest weights; these are the two key differences from the SR. Finally, the detailed procedures for implementing the PR-based IPF on the graphics processing unit were presented. The performance of the IPF, the PR, and their parallel implementations is illustrated via a one-dimensional numerical simulation and a practical application to passive radar target tracking.
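The replication rule described in the abstract above can be sketched in a few lines. The first step (copies equal to the integer part of weight times particle count) follows the abstract; how the leftover slots are filled is not stated there, so the sketch hands them to the particles with the largest fractional remainders, which is purely an assumption. The function name `parallel_resample_counts` is hypothetical.

```python
import math


def parallel_resample_counts(weights):
    """Return how many copies of each particle survive resampling."""
    n = len(weights)
    total = sum(weights)
    if n == 0 or total <= 0:
        raise ValueError("need at least one particle with positive weight")

    # From the abstract: copies = integer part of (normalized weight * N).
    # Each entry depends only on its own weight, so this step is parallel.
    scaled = [n * w / total for w in weights]
    counts = [math.floor(s) for s in scaled]

    # ASSUMPTION (not from the abstract): the slots left over after flooring
    # go to the particles with the largest fractional remainders.
    deficit = n - sum(counts)
    by_remainder = sorted(range(n), key=lambda i: scaled[i] - counts[i],
                          reverse=True)
    for i in by_remainder[:deficit]:
        counts[i] += 1
    return counts


if __name__ == "__main__":
    # Four particles; the lightest one ends up with zero copies.
    print(parallel_resample_counts([0.5625, 0.25, 0.125, 0.0625]))
```

Particles whose count is zero are the ones eliminated, which in this example is the particle with the smallest weight.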
Abstract: This paper presents an optimization of the shadow volume algorithm that allows rendering in real time. The technique is based on previous work that makes it possible to obtain shadows in real time, although in that work the calculation of the silhouette requires preprocessing of the geometry on the CPU (Central Processing Unit). Using the latest generation of GPU (Graphics Processing Unit), the authors propose to implement the silhouette calculation on the GPU with a Geometry Shader. The authors present the steps that led to a concrete implementation of this algorithm and the modifications that were made, as well as a comparative study of the results, followed by a discussion of these results and of the implementation choices.
Funding: The National Natural Science Foundation of China (No. 59975057)
Abstract: A B-spline active contour model based on the finite element method is presented. It combines the advantages of a B-spline active contour, namely its fewer parameters and its smoothness, with the reduced computational complexity and better numerical stability of the finite element method. In this model, a cubic B-spline segment is taken as an element, and the finite element method is adopted to solve the energy minimization problem of the B-spline active contour and thereby perform image segmentation. Experimental results verify that the method is efficient for B-spline active contours and attains stable, accurate, and faster convergence.
Funding: Supported by the National High Technology Research and Development Program ("863" Program) of China (No. 863-306-ZD13-03-06)
Abstract: Mutual information (MI)-based image registration is effective in registering medical images, but it is computationally expensive. This paper accelerates MI-based image registration by dividing the computation of mutual information into spatial transformation and histogram-based calculation, and by performing the 3D spatial transformation and trilinear interpolation on the graphics processing unit (GPU). The 3D floating image is downloaded to the GPU as a flat 3D texture, and then fetched and interpolated for each new voxel location in a fragment shader. The transformed results are rendered to textures using the frame buffer object (FBO) extension, and then read back to main memory for the remaining computation on the CPU. Experimental results show that the GPU-accelerated method can achieve a speedup of about an order of magnitude, with a better registration result, compared with the software implementation on a single-core CPU.
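The histogram-based half of the computation described above, the part the abstract leaves on the CPU, can be illustrated with the standard definition of mutual information over a joint histogram. This is a minimal sketch, not the paper's implementation: it assumes the two images are already resampled onto the same grid and flattened to equal-length sequences of discrete intensity bins, and the function name `mutual_information` is hypothetical.

```python
import math
from collections import Counter


def mutual_information(fixed, moving):
    """Mutual information (in nats) of two equal-length intensity sequences."""
    if len(fixed) != len(moving) or not fixed:
        raise ValueError("inputs must be non-empty and of equal length")
    n = len(fixed)

    # Joint histogram over intensity pairs, plus the two marginal histograms.
    joint = Counter(zip(fixed, moving))
    hist_fixed = Counter(fixed)
    hist_moving = Counter(moving)

    # MI = sum over occupied bins of p(a, b) * log(p(a, b) / (p(a) * p(b))).
    mi = 0.0
    for (a, b), count in joint.items():
        p_ab = count / n
        p_a = hist_fixed[a] / n
        p_b = hist_moving[b] / n
        mi += p_ab * math.log(p_ab / (p_a * p_b))
    return mi


if __name__ == "__main__":
    aligned = [0, 0, 1, 1]
    print(mutual_information(aligned, aligned))       # log(2) for a perfect match
    print(mutual_information(aligned, [0, 1, 0, 1]))  # 0.0 for independent images
```

A registration loop would evaluate this value after each candidate spatial transformation and keep the transformation that maximizes it.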
Abstract: The paper presents the implementation of a parallel version of the FDK (Feldkamp, Davis, and Kress) algorithm using graphics processing units. Some elements of computed tomography scanning and of the FDK algorithm are briefly discussed, and some ideas about GPUs (Graphics Processing Units) and their use in general-purpose computing are presented. The paper shows a computational implementation of the FDK algorithm and the process of parallelizing this implementation. The parallel version of the algorithm is compared with the sequential version, using speedup as the performance metric. The performance of the parallel version was evaluated on two GPUs, a GeForce 9400GT (16 cores), a low-capacity GPU, and a Quadro 2000 (192 cores), a medium-capacity GPU; a speedup of 3.37 was reached.
Abstract: A modular flat-screen liquid crystal television display is described. The picture elements of the modules may be emissive, reflective, or transmissive. The flat-screen liquid crystal television also comprises an electrical control circuit capable of categorizing incoming television picture signals according to the modules in the array and directing the electrical signals to the drive circuits of each module according to the portion of the television picture to be reproduced by the picture elements of that module. The picture elements are preferably formed in a light-modulating film composed of a liquid crystal dispersion in a polymeric binder. A color display was also produced by placing a patterned red-green-blue filter adjacent to the active matrix so that each picture element could also be coordinated with the color components of a color video signal.
Funding: Supported by the National Natural Science Foundation of China (Grant No. 11162010) and the Inner Mongolia Natural Science Foundation (Grant No. 2009MS0701)
Abstract: The characteristics of asphalt mixtures are associated with the key features of the mixed material when it is not damaged. Two-dimensional (2D) microstructure images of an asphalt mixture bending beam specimen were captured by a CCD camera. After image processing, including noise elimination, boundary identification, image binarization, and vectorization, the images were imported into finite element (FE) software to set up the micromechanical FE model. The simulation results show that the displacement contour spectrum is not a smooth curve, since the mixed material is heterogeneous. The largest strain value occurs at the bottom of the specimen between two coarse aggregates, and this is the point where the fracture starts. The stress values of the aggregates are larger than those of the asphalt matrix. Unlike the strain of the asphalt matrix, the strain of the aggregates is close to zero because the aggregates have a higher capability to resist deformation. The difference in deformation between the aggregate and the asphalt matrix can ultimately lead to an interface crack. All these results can be improved by a three-point bending test of the asphalt mixture beam.
Funding: Supported by the China Postdoctoral Science Foundation (Grant No. 2013M540772) and the Young Scientists Fund of the National Natural Science Foundation of China (Grant Nos. 61203233, 51101124, 51101125)
Abstract: Phase field simulation has been actively studied as a powerful method for investigating microstructural evolution during solidification. However, it is a great challenge to perform phase field simulations at large length and time scales. Graphics processing unit (GPU) computation is applied to the phase field simulation, greatly accelerating the calculation. The results show that the computation with the GPU is about 36 times faster than that with a single Central Processing Unit (CPU) core. This demonstrates the feasibility of GPU-accelerated phase field simulation on a desktop computer. The GPU-accelerated strategy opens new opportunities for the application of phase field simulation.