期刊文献+

一种高效双精度浮点乘法器 被引量:2

An Efficient Double-precision Floating-point Multiplier
下载PDF
导出
摘要 浮点乘法器(FPM)是中央处理器的关键部件之一,因此其性能是处理器的关键影响因素之一,高性能浮点乘法器是研究人员的追求;基于此需求,提出了一种高速双精度浮点乘法器,该设计采用了有别于传统基2Booth算法,即基4Booth算法产生部分积,在此基础上用优化的Wallace树阵列结构进行部分积的累加得到和序列和进位序列,进而对和序列和进位序列采用部分和并行相加得到最后尾数结果;采用了优化的10级流水线结构的设计在Cyclone II EP2C15AF484C6器件上实现后运行频率可达138.77MHz;在同等优化努力下,相比于Altera IP核运行速度提高大约67.77%;类似的,在Xilinx Virtex2 xc2v6000上的实现比现存的设计频率提高约102.2%;实验结果显示了所设计FPM结构的有效性。 Floating point multiplier (FPM) is one key part for CPU, its performance plays a great role in CPU performance, thus a high performance FPM attracts much attention all over the world. To considerably satisfy the requirement, a high speed double precision FPM is presented. The proposed design adopted radix-4 Booth coder to obtain partial products, then an optimal Wallace tree compression architec-ture is exploited on these partial products to obtain a pseudo-sum and pseudo-carry, which are partially accumulated in a parallel approach to generate mantissa of the product. Its radix-4 Booth coder is different from the conventionally adopted radix-2 Booth coder. The design with 10-stage pipeline architecture can achieve up to 138.77MHz targeted at a Cyclone II EP2C15AF484C6 device. By taking the same opti-mization efforts and the same stage pipeline, the design reaches about 67.77% enhancement on operating frequency over an Altera IP core. Similarly, running frequency targeted at Xilinx Virtex2 xe2v6000 increased about 102.2% when compared to an existing design. The experi-mental results demonstrate the efficiency of the proposed FPM architecture.
作者 夏炜 肖鹏
出处 《计算机测量与控制》 北大核心 2013年第4期1017-1020,共4页 Computer Measurement &Control
基金 国家自然科学基金(61072135 60788402) 武汉市科技攻关计划项目(201110921295)
关键词 基4Booth编码 双精度浮点数 浮点乘法器 并行结构 流水线结构 WALLACE树 double precision floating point number floating point multiplier Parallel architecture pipeline architecture Radix--4Booth coder Wallace tree
  • 相关文献

参考文献5

  • 1周洪,樊晓桠,赵丽丽.基于CUDA的稀疏矩阵与矢量乘法的优化[J].计算机测量与控制,2010,18(8):1906-1908. 被引量:6
  • 2Manolopoulos K,Reisis D,Chouliaras V A. An efficient multipleprecision floating-point multiplier [AJ. Electronics,Circuits andSystems(ICECS) [C]. 2011.
  • 3Gong Renxi,Zhang Shangjun,Zhang Hainan. Hardware implemen-tation of a High Speed Floating Point Multiplier Based on FPGA[A]. Proceedings of 2009 4th International Conference on ComputerScience Education [C]. 2009.
  • 4旷捷,毛雪莹,彭俊淇,黄启俊,常胜.基于FPGA的单精度浮点数乘法器设计[J].电子技术应用,2010,36(5):17-19. 被引量:3
  • 5Venishetti S K,Akoglu A. A Highly Parallel FPGA based IEEE-754 Compliant Double-Precision Binary Floating-Point Multipli-cation Algorithm [ A ]. Field-Programmable Technology[C]. 2007.

二级参考文献15

  • 1赵忠民,林正浩.一种改进的Wallace树型乘法器的设计[J].电子设计应用,2006(8):113-116. 被引量:12
  • 2GONG Ren Xi. Hardware implementation of a high speed floating point multiplier based on FPGA.Proceedings of 2009 4th International Conference on Computer Science and Education, ICCSE 2009,2009.
  • 3BEWICK G W.Fast multiplication algorithms and implementation.Stanford University, 1994(2).
  • 4ANSI/IEEE Std.754-1985.IEEE Standard for binary Float- ing-Point Arithmetic.IEEE, Inc, New York. 1985.
  • 5BOOTH A D.A signed binary multiplication technique. Quarterly Journal of Mechanics and Applied Mathematics. 1951,4(2).
  • 6WALLACE C S.A suggestion for a fast muhiplier.IEEE Transactions on Electronic Computers, 1964,13(2).
  • 7SHIVALING S M.High performance low power array multiplier using temporal tiling.IEEE Transactions on Very Large Scale Integration(VLSI) Systems, 1999,7(1).
  • 8Garland M. Sparse Matrix Computations on Manycore GPU's [R]. DAC, June. 2008.
  • 9Bell N, Garland M, Efficient Sparse Matrix--Vector Multiplication on CUDA [R]. NVIDIA Technical Report NVR- 2008 -004, Dec. 2008.
  • 10NVIDIA CUDA Programming Guide Version 2. 1[EB], http: // www. nvidia. com, 2008.

共引文献7

同被引文献8

引证文献2

二级引证文献5

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部