期刊文献+

基于GPU的低密度奇偶校验码译码加速技术 被引量:1

Low density parity check code decoding acceleration technology based on GPU
下载PDF
导出
摘要 随着通信技术的发展,通信终端逐渐采用软件的方式来兼容多种通信制式和协议。针对以计算机中央处理器(CPU)作为运算单元的传统软件无线电架构,无法满足高速无线通信系统如多进多出(MIMO)等宽带数据的吞吐率要求问题,提出了一种基于图形处理器(GPU)的低密度奇偶校验(LDPC)码译码器的加速方法。首先,根据GPU并行加速异构计算在GNU Radio 4G/5G物理层信号处理模块中的加速表现的理论分析,采用了并行效率更高的分层归一化最小和(LNMS)算法;其次,通过使用全局同步策略、合理分配GPU内存空间以及流并行机制等方法减少了译码器的译码时延,同时配合GPU多线程并行技术对LDPC码的译码流程进行了并行优化;最后,在软件无线电平台上对提出的GPU加速译码器进行了实现与验证,并分析了该并行译码器的误码率性能和加速性能的瓶颈。实验结果表明,与传统的CPU串行码处理方式相比,CPU+GPU异构平台对LDPC码的译码速率可提升至原来的200倍左右,译码器的吞吐量可以达到1 Gb/s以上,特别是在大规模数据的情况下对传统译码器的译码性有着较大的提升。 With the development of communication technology,communication terminals gradually adopt software to be compatible with multiple communication modes and protocols.As in the traditional software radio architecture with a Central Processing Unit(CPU)of computer as an arithmetic unit,the wideband data throughput of high-speed wireless communication systems such as Multiple-Input Multiple-Output(MIMO)is not be satisfied,an acceleration method of Low Density Parity Check(LDPC)code decoder based on Graphics Processing Unit(GPU)was proposed.Firstly,according to the theoretical analysis of the acceleration performance of GPU parallelly accelerated heterogeneous computing in GNU Radio 4G/5G physical layer signal processing module,a more parallelly efficient Layered Normalized Min-Sum(LNMS)algorithm was adopted.Then,the decoding delay of the decoder was reduced by using the methods such as global synchronization strategy,reasonably allocation of GPU memory space and stream parallelism mechanism.At the same time,the LDPC code decoding process was optimized in parallel with the multi-threaded parallel technology in GPU.Finally,the GPU accelerated decoder was implemented and verified on the software radio platform,and the bit error rate performance and acceleration performance bottlenecks of the parallel decoder were analyzed.Experimental results show that compared with the traditional CPU serial code processing method,CPU+GPU heterogeneous platform has the decoding rate for LDPC codes increased to about 200 times,and the throughput of decoder can reach more than 1 Gb/s,especially in the case of large-scale data,the decoding performance is greatly improved compared with traditional decoder.
作者 徐启迪 刘争红 郑霖 XU Qidi;LIU Zhenghong;ZHENG Lin(Guangxi Key Laboratory of Wireless Wideband Communication and Signal Processing(Guilin University of Electronic Technology),Guilin Guangxi 541004,China)
出处 《计算机应用》 CSCD 北大核心 2022年第12期3841-3846,共6页 journal of Computer Applications
基金 广西自然科学基金资助项目(2020GXNSFAA159067) 无线宽带通信与信号处理重点实验室基金资助项目(GXKL06160112) 认知无线电重点实验室项目(CRKL200102)。
关键词 图形处理器 计算统一设备架构 低密度奇偶校验码 并行计算 信道译码 Graphic Processing Unit(GPU) Compute Unified Device Architecture(CUDA) Low Density Parity Check(LDPC)code parallel computing channel decoding
  • 相关文献

参考文献3

二级参考文献7

共引文献5

同被引文献11

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部