
Implementation, Optimization and Evaluation of Scientific Applications on the FT64 Stream Processor
Abstract: The stream architecture is an emerging architecture that adapts well to advances in VLSI technology, and whether it is effective for scientific applications is a question of wide interest. In this paper, we select MG, a data-intensive program from the NAS Parallel Benchmarks developed at NASA, and study its implementation and optimization on FT64, the first implementation of a 64-bit stream processor designed for scientific computing. Measurements on FT64 show that, after optimizations targeting the on-chip memory hierarchy, FT64 achieves performance comparable to that of the Itanium 2 processor.
Source: Computer Engineering & Science (《计算机工程与科学》), CSCD, 2008, No. 9, pp. 107-110 (4 pages)
Funding: National Natural Science Foundation of China (grants 60621003, 60633050)
Keywords: FT64; stream processor; memory hierarchy; performance evaluation

References (10)

  • [1] Rixner S. Stream Processor Architecture[M]. Kluwer Academic Publishers Group, 2002.
  • [2] Dally W J, Hanrahan P, Erez M, et al. Merrimac: Supercomputing with Streams[C]//Proc of Supercomputing Conf, 2003.
  • [3] Yang X, Yan X, Xing Z, et al. A 64-bit Stream Processor Architecture for Scientific Applications[C]//Proc of the 34th Annual Int'l Symp on Computer Architecture, 2007.
  • [4] Owens J, Kapasi U, Mattson P, et al. Media Processing Applications on the Imagine Stream Processor[C]//Proc of the 20th IEEE Int'l Conf on Computer Design, 2002:295-302.
  • [5] Mattson P. A Programming System for the Imagine Media Processor[D]. Ph.D. Thesis, Stanford University, 2002.
  • [6] Ahn J H, Dally W J, Khailany B, et al. Evaluating the Imagine Stream Architecture[C]//Proc of the 31st Annual Int'l Symp on Computer Architecture, 2004:14-25.
  • [7] Griem G, Oliker L. Transitive Closure on the Imagine Stream Processor[C]//Proc of the 5th Workshop on Media and Streaming Processors, 2003.
  • [8] Erez M, Ahn J H, Garg A, et al. Analysis and Performance Results of a Molecular Modeling Application on Merrimac[C]//Proc of Supercomputing Conf, 2004:263-272.
  • [9] Allen R, Kennedy K. Optimizing Compilers for Modern Architectures: A Dependence-Based Approach[M]. Elsevier Science, 2001.
  • [10] Allen R, Kennedy K. Vector Register Allocation[J]. IEEE Trans on Computers, 1992, 41(10):1290-1317.
