期刊文献+

80位浮点运算的编译实现与优化 被引量:1

Implementation and Optimization of the 80-Bit Floating-Point Arithmetic
下载PDF
导出
摘要 本文介绍了80位浮点运算的编译实现技术,在IA-64平台上针对一套科学计算测试程序进行了性能瓶颈分析。利用IA-64体系结构特点,改进和实现了用户定义函数的自动内联、高级循环变换、数据预取、80位浮点数学库函数内联扩展四种编译优化。测试结果表明,这些优化手段显著提高了80位浮点运算的串行性能和并行性能。 In this paper we present the implementation of the 80-bit floating-point arithmetic, and perform a bottleneck analysis of the IA-64 system with a suite of scientific computing benchmarks. Then we improve and implement four optimizations by utilizing the architecture features of IA-64, namely the automatic inlining of user-defined functions, high-level loop transformations, data prefetching, and expanding of math libraries, We show that on IA-64 these improvements have significantly improved the perform- ance of both serial and parallel 80-bit floating-point arithmetic of the scientific computing benchmarks.
出处 《计算机工程与科学》 CSCD 北大核心 2009年第1期154-158,共5页 Computer Engineering & Science
基金 国家自然科学基金重点资助项目(60633050)
关键词 80位浮点运算 IA-64体系结构 GCC编译器 优化 80-bit floating-point arithmetic IA-64 architecture GCC corapiler optimization
  • 相关文献

参考文献15

  • 1周毓麟,袁国兴.关于科学计算用数字电子计算机字长问题[J].计算机工程与科学,2005,27(10):1-2. 被引量:7
  • 2Bailey D H. High-Precision Floating-Point Arithmetic in Scientitle Computation[J]. Computing Science and Engineering, 2005,7(3) :54-61.
  • 3Hauser J R. SoftFloat[EB/OL].[2007-07-12]. http://www. jhauser.us/arithmetic/softfloat. html.
  • 4Sehulte M J, Swartzlander E E, A Family of Variable-Precision Interval Arithmetic Processors[J]. IEEE Trans on Computers,2000,49 (5) :387-397.
  • 5Intel Coporation. Intel 64 and Intel IA-32 Architectures Software Developer's Manual[M]. Intel Corporation, 2007.
  • 6Intel Coporation. Intel Itanium^TM Architecture Software Developer's Manual[M]. Intel Corporation, 2001.
  • 7Khan W. Lecture Notes on the Status of IEEE Standard 754 for Binary Floating-Point Arithmetic[R]. University of California Berkeley, 1996.
  • 8Stallman R M. Using and Porting GNU CK2C[M]. Free Software Foundation, Inc, 1994.
  • 9Jarp S. A Methodology for Using the Itanium 2 Performance Counters for Bottleneck Analysis [ EB/OL].[2007-03-25]. http://www.gelato.org/pdUPerformance_ counters_ final. pdf.
  • 10Arnold M, Fink S, Sarkar V, et al. A Comparative Study of Static and Profile-Based Heuristics for Inlining[C]//Proc of ACM SIGPLAN Workshop on Dynamic and Adaptive Compilation and Optimization, 2000 : 52-64.

二级参考文献2

共引文献6

同被引文献14

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部