期刊文献+

对数函数ln(x)在SDTA指令集结构上的实现

Implementation of the Natural Logarithm Function ln(x) on the SDTA Instruction Set Architecture
下载PDF
导出
摘要 SDTA指令集体系结构是一种基于传输触发的VLIW体系结构。本文结合SDTA指令集结构的特点,经过循环展开和循环化简、强度消弱、过程集成、机器方言和指令归并等指令调度优化技术,高效实现了自然对数函数ln(x)。实验结果表明,在Neuron处理器上,ln(x)不但数据精度高,而且运行周期数只有gcc3.2.2数学库中自然对数函数运行周期数的33%左右。 Based on the transport triggering architecture, the SDTA (Synchronized Data Triggering Architecture) instruction set architecture is a VLIW(Very Long Instruction Word) architecture. According to the SDTA instruction set architecture, taking advantage of the instruction scheduling and optimization techniques,such as loop unrolling and loop sim- plification, strength reduction, procedure integration, machine idiom and instruction merge, ln(x) is implemented successfully in this paper. The experimental results show that the running circles of ln(x) is decreased to about 33%, compared with the natural logarithm function in the mathematical library of gcc3.2. 2 on the Neuron microprocessor, while the precision of ln(x) is adequately high.
出处 《计算机工程与科学》 CSCD 北大核心 2009年第2期88-92,共5页 Computer Engineering & Science
基金 国家自然科学基金资助项目(60773024)
关键词 对数函数ln(x) SDTA指令集 Neuron处理器 指令调度优化 natural logarithm function ln(x) SDTA instruction set architecture Neuron microprocessor instructions scheduling and optimization
  • 相关文献

参考文献12

  • 1Volder J E. The CORDIC Trigonometric Computing Technique[J]. IRE Trans on Electronic Computing, 1959, 8(3): 330-334.
  • 2Walther J S. A Unified Algorithm for Elementary Functions [C]//Proc of AFIPS Spring Joint Computer Conf, 1997: 379-385.
  • 3Bajard J C, Kla S, Muller J. BKM: A New Hardware Algorithm for Complex Elementary Functions[J]. IEEE Trans on Computers, 1994, 43(8) : 955-963.
  • 4Tang P T P. Table-Lookup Algorithms for Elementary Functions and Their Error Analysis[C] // Proc of the 10th Symp on Computer Arithmetic, 1991 : 232-236.
  • 5Tang P T P. Table-Driven Implementation of the Logarithm Function in IEEE Floating-Point Arithmetic[J]. ACM Trans on Mathematical Software, 1990, 4(16) : 378-400.
  • 6Tang P T P. Table-Driven Implementation of the Exponential Function in IEEE Float-Point Arithmetic[J]. ACM Trans on Mathematical Software, 1989, 15(2) : 144-157.
  • 7OpenGL ES 2. 0 Specification[EB/OL]. [2007-07-10]. http://www. khronos.org/opengles/2_X.
  • 8OpenGL ES 2. 0 Shading language[EB/OL].[2007-07-10]. http://www. khronos. org/opengles/2_X.
  • 9Hoogerbrugge J, Corporaal H. Transport-Triggering vs. Operation-Triggering[C]//Proc of the Int'l Conf on Compiler Construction, 1994.
  • 10Muchnick S.高级编译器设计与实现[M].赵克佳,沈志宇译.北京:机械工业出版社,2005.

共引文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部