期刊文献+

分段三次多项式逼近初等函数的硬件实现 被引量:2

Hardware Implementation of Elementary Function Approximation by Using a Piecewise Cubic Polynomial Interpolator
下载PDF
导出
摘要 针对分段二次多项式逼近初等函数需要较大的查找表面积和电路面积的问题,提出基于极大极小分段三次多项式逼近单精度浮点初等函数的算法,实现了单精度浮点倒数、平方根、平方根倒数、指数、对数和三角函数的逼近运算.首先缩小参数范围到一个特定的区间,并对该区间进行均匀分段,在每一分段区间上采用极大极小分段三次多项式逼近;然后在对应分段上综合考虑各种误差,在满足精度要求的情况下,通过多次Remes算法迭代优化出多项式系数的最优截取位宽,使查找表的面积最小;再对乘法器、平方器和立方器的输出位宽进行最优截取,使电路的面积最小;最后设计出硬件电路的整体架构.实验结果表明,与分段二次多项式逼近相比较,在同等精度要求下,该算法能够使电路时延减少17.25%,同时使查找表的面积减少53.60%、电路的总面积减少19.73%. The elementary function approximation using piecewise quadratic polynomial interpolation requires larger area of the look-up table(LUT) and circuit. To solve the problem, this paper presents an algorithm for elementary function approximation in single-precision floating-point format, which is based on Minimax piecewise cubic polynomial approximation. The algorithm can efficiently achieve the approximation of reciprocal, square root, square root reciprocal, exponentials, logarithm, and trigonometric function in single-precision floating-point format. At the beginning of the algorithm, the range of parameters is narrowed to a specific interval, and this interval is evenly segmented. The Minimax piecewise cubic polynomial approximation is then used in each segment. The errors in corresponding segments are synthetically considered, and the optimal truncated bit width of coefficients is obtained by successive optimization using Remes algorithm, so as to reduce the area of LUT and circuit. Also, the output bit width of multiplier, squarer unit and cubic unit is truncated, in order to reduce the area of circuit. At last, the overall framework of hardware circuit is designed. The analysis and experimental results show that compared with piecewise quadratic polynomial approximation in the same precision, the circuit delay is reduced by 17.25%, the area of LUT is decreased by 53.60% and total area of circuits is reduced by 19.73%.
出处 《计算机辅助设计与图形学学报》 EI CSCD 北大核心 2016年第1期180-187,共8页 Journal of Computer-Aided Design & Computer Graphics
基金 国家自然科学基金(61136002) 陕西省科技发展计划(2011k06-47) 西安市科技发展计划(CXY1440(10))
关键词 三次多项式逼近 初等函数 单精度 低成本 MAPLE cubic polynomial approximation elementary functions single precision low cost Maple
  • 相关文献

参考文献3

二级参考文献29

  • 1吴恩华,柳有权.基于图形处理器(GPU)的通用计算[J].计算机辅助设计与图形学学报,2004,16(5):601-612. 被引量:226
  • 2Muller J M.Elementary Functions:Algorithms and Imple-mentations[M].2nd ed.Boston,USA:Birkhauser,2006.
  • 3Pineiro J A,Oberman S F,Muller J M,et al.High-speedFunction Approximation Using a Minimax Quadratic Inter-polator[J].IEEE Trans.on Computers,2005,54(3):304-318.
  • 4Caro D,Petra N,Strollo A G M.High-performance SpecialFunction Unit for Programmable 3-D Graphics Processors[J].IEEE Trans.on Circuit and System,2009,56(9):1968-1978.
  • 5Pi eiro J A.Algorithms and Architectures for ElementaryFunction Computation[D].Santiago,USA:University ofSantiago de Compostela,2003.
  • 6Strollo A G M,Petra N,Caro D.Dual-tree Error Compen-sation for High Performance Fixed-width Multipliers[J].IEEETrans.on Circuit and System,2005,52(8):501-506.
  • 7Rajput R P,Swamy M N S.High Speed Modified BoothEncoder Multiplier for Signed and Unsigned Numbers[C]//Proc.of the 14th International Conference on Modeling andSimulation.Cambridge,UK:Emmanuel College Press,2012.
  • 8Erez S,Even G.An Improved Micro-architecture for FunctionApproximation Using Piecewise Quadratic Interpolation[C]//Proc.of IEEE International Conference on Computer Design.[S.l.]:IEEE Press,2008:422-426.
  • 9Walters E G,Schulte M J.Efficient Function ApproximationUsing Truncated Multipliers and Squarers[C]//Proc.of the17th IEEE Symposium on Computer Arithmetic.[S.l.]:IEEEPress,2005:232-239.
  • 10ShreinerD,WooM,NeiderJ,etal.OpenGL编程指南[M].徐波,译.6版.北京:机械工业出版社,2008.

共引文献18

同被引文献4

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部