摘要
浮点单元是高性能处理器的速度瓶颈之一,基于广泛应用的开源RISC-V浮点单元原型,设计了一种面向RISC-V处理器的高速浮点单元。对该原型中时序最差的浮点融合乘加、整数转浮点、除法开方子模块分别进行静态时序分析,并定位其中需要优化的关键模块。针对该浮点单元原型中存在的问题,提出基于算法优化和流水线优化的设计思路,设计基4 Booth-Wallace乘法模块替代原有多位宽乘法模块,设计基于二叉树的并行前导零检测模块替代原有串行前导零检测模块,增加了部分子模块的流水线级数。基于SMIC 55 nm工艺对优化设计前后的RISC-V浮点单元原型进行了性能评估,优化后的工作频率达到820 MHz,提升了39.46%,而面积开销增加了15.14%。
Floating-point unit is one of the speed bottlenecks of high-performance processor.Based on the widely used open-source RISC-V floating-point unit prototype,a high-speed floating-point unit for RISC-V processor is designed.Static timing analyses are performed for the submodules which have worst timing in prototype,including the floating-point fused multiply-add,integer-to-floating-point,floating-point divider and sqrt submodules.Next,the critical modules most in need of optimization are located.Then,the design methods are proposed based on algorithm optimization and pipeline optimization.A radix-4 Booth-Wallace multiplication module is designed to replace the original multi-bit wide multiplication algorithm.A parallel leading zero detection algorithm based on binary tree is designed to replace the original serial leading zero detection algorithm,and the pipeline stages of some submodules are increased.Based on the SMIC 55 nm technology,the performance of RISC-V floating-point unit prototype before and after the optimization are evaluated.The operating frequency reaches 820 MHz after optimization,increased by 39.46%,and the area is increased by 15.14%.
作者
常龙鑫
虞致国
钟啸宇
顾晓峰
CHANG Longxin;YU Zhiguo;ZHONG Xiaoyu;GU Xiaofeng(Engineering Research Centre of IoT Technology Applications(Ministry of Education),Jiangnan University,Wuxi Jiangsu 214122,China)
出处
《电子器件》
CAS
北大核心
2022年第6期1289-1295,共7页
Chinese Journal of Electron Devices
基金
江苏省重点研发计划(BE2019003-2)
中央高校基本科研业务费专项资金资助(JUSRP51510)。