期刊文献+

Fast division-free parallel structure for convolution perfectly matched layer in finite difference time domain method

Fast division-free parallel structure for convolution perfectly matched layer in finite difference time domain method
原文传递
导出
摘要 Parallel acceleration of convolution perfectly matched layer (CPML) algorithm suffers from massive division operation which is widely accepted as one of the most expensive operations for the equipment such as graphic processing unit (GPU), field programmable gate array (FPGA) etc. In pursuit of higher efficiency and lower power consumption, this article revisited the CPML theory and proposed a new fast division-free parallel CPML structure. By optimally rearranging the CPML inner iteration process, all the division operators can be eliminated and replaced by recalculating the related field updating coefficients offline. Experiments show that the proposed division-free structure can save more than 50% arithmetic instructions and 25% execution time of the traditional parallel CPML structure without any accuracy loss. Parallel acceleration of convolution perfectly matched layer (CPML) algorithm suffers from massive division operation which is widely accepted as one of the most expensive operations for the equipment such as graphic processing unit (GPU), field programmable gate array (FPGA) etc. In pursuit of higher efficiency and lower power consumption, this article revisited the CPML theory and proposed a new fast division-free parallel CPML structure. By optimally rearranging the CPML inner iteration process, all the division operators can be eliminated and replaced by recalculating the related field updating coefficients offline. Experiments show that the proposed division-free structure can save more than 50% arithmetic instructions and 25% execution time of the traditional parallel CPML structure without any accuracy loss.
出处 《The Journal of China Universities of Posts and Telecommunications》 EI CSCD 2015年第1期72-76,82,共6页 中国邮电高校学报(英文版)
基金 sponsored by the National Natural Science Foundation of China (30870577)
关键词 division elimination convolution perfectly matched layer finit difference time domain parallel computing graphic processing unit division elimination, convolution perfectly matched layer, finit difference time domain, parallel computing, graphic processing unit
  • 相关文献

参考文献1

二级参考文献10

  • 1李康,孔凡敏,郭毅峰,王俊泉,梅良模.MRTD和高阶FDTD算法的数值色散特性的分析[J].系统仿真学报,2005,17(9):2089-2091. 被引量:12
  • 2葛德彪,杨利霞.各向异性介质FDTD分析及其并行计算[J].系统工程与电子技术,2006,28(4):483-485. 被引量:4
  • 3Yee K S.Numerical solution of initial boundary value problems involving Maxwell's equations in isotropic media[J].IEEE Trans on Antennas and Propagation, 1996, 14: 302-307.
  • 4Adams S, Payne J, Boppana R.Finite Difference Time Domain (FDTD) simulations using graphics processors[C]//2007 DoD High Performance Computing Modernization Program Users Group Conference, Pittsburgh, 2007: 334-338.
  • 5Valcarce A,De La Roche G,Jie Z.A GPU approach to FDTD for radio coverage prediction[C]//11th IEEE Singapore Interna- tional Conference on Communication Systems,Guangzhou,2008: 1585-1590.
  • 6Roden J, Gedney S D.Convolution PML (CPML) :an efficient FDTD implementation of the CFS-PML for arbitrary medium[J]. Microwave and Optical Technology Letters, 2000,27: 334-339.
  • 7Gandey S D.An anisotropic perfectly matched layer-absorbing medium for the truncation of FDTD lattices[J].IEEE Transac- tions on Antennas and Propagation, 1996,44(12) : 1630-1639.
  • 8Nvidia Corporation Technical Staff.NVIDIA CUDA program- ming guide 2.0[M].[S.l.] : NVIDIA Corporation, 2008 : 13-71.
  • 9Inman M J, Elsherbeni A Z.Programming video cards for com- putational electromagnetic applications[J].Antermas and Propaga- tion Magazine,IEEE,2005,47(6) : 71-78.
  • 10Du Liuge,Li Kang,Kong Fanmin.Parallel 3D finite difference time domain simulations on graphics processors with cuda[C]// Proceedings of the Computational Intelligence and Software Engineering, Wuhan, 2009:1-4.

共引文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部