一种基于SRT-8算法的SIMD浮点除法器的设计与实现

Design and implementation of a SIMD floating-point divider based on SRT-8

下载PDF

导出

摘要在科学计算、数字信号处理、通信和图像处理等应用中,除法运算是常用的基本操作之一。基于SRT-8除法算法,设计一个SIMD结构的IEEE-754标准浮点除法器,在同一硬件平台上能够实现双精度浮点除法和两个并行的单精度浮点除法。通过优化SRT-8迭代除法结构,提出商选择和余数加法的并行处理,并采用商数字存储技术降低迭代除法的计算延时,提高频率。同时,采用复用策略减少硬件资源开销,节省面积。实验表明,在40nm工艺下,本设计综合cell面积为18 601.968 1μm2,运行频率可达2.5GHz,相对传统的SRT-8实现关键延迟减少了23.81%。 In the area of scientific computing, digital signal processing, communication and image processing, division is one of the widely used basic operations. Based on SRT-8 algorithm, a SIMD floating-point divider is designed,which is compatible to IEEE-754 standard. The divider supports one double precision floating point division and two parallel single precision floating point division on the same hardware platform. It reduces the iterative division calculation time delay and improves the frequen- cy by optimizing the SRT-8 iterative division structure,choosing parallel processing of quotient and resi- due addition, and adopting rapid storage technique. Besides, it reduces hardware resources and saves area by adopting reuse strategy. Experiments show that the synthesized cell area is 18 601. 968 1μm2 and the frequency reaches up to 2.5 GHz with 40 nm technology library,and the latency of operation is reduced by 23.81% in comparison to the traditional implementation based on SRT-8.

作者邓子椰陈书明彭元喜雷元武

机构地区国防科学技术大学计算机学院

出处《计算机工程与科学》 CSCD 北大核心 2014年第5期797-803,共7页 Computer Engineering & Science

关键词 SRT-8 浮点除法器双精度浮点 SIMD单精度浮点 SRT-8 SIMD floating-point division double precision floating-point SIMD single precisionfloating-point

分类号 TP303 [自动化与计算机技术—计算机系统结构]

引文网络
相关文献

参考文献10

1Gerwig G, Wetter H, Schwarz E M, et al. High perform- ance troating point unit with 116 bit wide divider[C]//Proe of the 16th Symposim on Computer Arithmetic, 2003:87- 94.
2Harris D I., Oberman S F, Horowirtz M A. SRT division architectures and implementations[C]//Proc of the 13th IEEE Symposium, 1997:18-25.
3Fandrianto J. Algorithm for high speed shared radix-4 divi- sion and radix-4 square-root [C]///Proc of the 8th IEEE Symposium on Computer Arithmetic, 1987:73-79.
4Oberman S F. Floating-point division and square root algo- rithms and implementation in the AMD-K7 microprocessor [C]//Proe of the 14th Symposium on Computer Arithmetic, 1999:106-115.
5Burgess N, Hinds C N. Design of the ARM VFPll divide and square root synthesisable macrocell[C]//Proc of the 18th IEEE Symposim on Computer Arithmetic, 2007 : 87-96.
6NVIDIA. Fermi: NVIDIA' s next generation CUDA compute architecture[EB/OL]. [2009-10-10]. http: //www. nvidia.com /content/PDF/fermi _ white_ papers/NVIDIA _ Fermi_ Compute_ Archit ecture Whit epaper, pdf.
7Oberman S F, Flynn M. Design issues in division and other floating-point operations[J]. IEEE Transactions on Comput- ers, 1997, 46(2):154-161.
8王县,倪晓强,邢座程.浮点除法算法的分析与研究[C]//计算机工程与工艺,2008:282-283.
9Liu W, Nannarelli A. Power efficient division square root u- nit[J]. IEEE Transactions on Computers, 2012, 61 (8): 1059-1070.
10Baliga H, Cooray N, Gamsaragan E, et al. Improvements in the Intel Core2 Penryn processor family architecture and mi- eroarchitecture[J]. Intel Technology Journal, 2008, 12(3) : 179-192.

1何婷婷,彭元喜,雷元武.基于Goldschmidt算法的高性能双精度浮点除法器设计[J].计算机应用,2015,35(7):1854-1857. 被引量：3
2GPU/MIC双剑合璧浪潮推出新旗舰异构超算服务器[J].电脑与电信,2015,0(8):12-12.
3张民选,李晓梅.软硬结合的迭代除法方案及其精度分析[J].国防科技大学学报,1989,11(1):28-34.
4刘志刚,汪旭东,郑关东.基于SRT算法的单精度浮点除法器[J].电子技术应用,2007,33(10):56-58. 被引量：4
5钟强,刘鹏飞,刘宝军,胡宗进,秦绪栋.基于FPGA的浮点除法器的研究与实现[J].中国集成电路,2016,25(9):43-46.
6王景悟.单双精度浮点运算加法器的实现[J].电脑知识与技术,2016,0(11):231-232. 被引量：2
7白永强,沈绪榜,罗旻,靳战鹏.一种高阶除法器的设计与实现[J].微电子学与计算机,2006,23(1):64-66. 被引量：4
8徐群,金翊,沈云付,李卫民,韩越兴.三值光学计算机的MSD迭代除法算法和实现技术[J].中国科学：信息科学,2016,46(4):539-550. 被引量：5
9汪小志,高珍冉,刘志刚.单精度浮点除法器的优化设计与仿真[J].电工文摘,2012(2):50-52. 被引量：3
10程晋,严承华,樊攀星.基于CBR的信息化装备故障案例推理复用技术研究[J].计算机与数字工程,2013,41(8):1308-1311. 被引量：3

计算机工程与科学

2014年第5期

浏览历史

内容加载中请稍等...

一种基于SRT-8算法的SIMD浮点除法器的设计与实现

参考文献10

相关作者

相关机构

相关主题

浏览历史