期刊文献+

基于四阶段人工优化的软件流水技术 被引量:1

Software Pipelining Technique Based on Four-Phase Manual Optimization
下载PDF
导出
摘要 代码体积是优化存储资源有限的嵌入式系统的重要因素之一。针对该特点,使用oprofile性能分析工具,以EEMBC基准程序集作为工作负载,提出四阶段人工优化软件流水方法(FPMO)。电信类的自相关程序实验结果表明,FPMO以2.04%的代码增量为代价换来40.678%的性能提升,而单纯的编译器自动优化则以33.35%的体积膨胀换来38.33%的性能提升。 For embedded systems with very limited memory resources, code size becomes one of the most important optimization concerns. Using the oprofile profiling tool, this paper focuses on the Four-Phase Manual Optimization(FPMO) for the software pipelining technique when running the EEMBC benchmark. Experimental result of telecom-autocorrelation shows the FPMO method gets 40.678% performance promotion by increasing 2.04% code size but the pure compiler automatic optimization trades 38.33% performance improvements by 33.35% code size expansion.
出处 《计算机工程》 CAS CSCD 北大核心 2009年第5期40-43,共4页 Computer Engineering
基金 国家“863”计划基金资助项目(2007AA01Z114) 国家“863”计划基金资助重点项目“低成本先进计算机单机”(2006AA010201) 国家自然科学基金资助项目(60703017)
关键词 软件流水 循环展开 性能分析 四阶段人工优化 software pipelining loop unrolling performance analysis Four-Phase Manual Optimization(FPMO)
  • 相关文献

参考文献4

  • 1Zhuge Qingfeng, Xiao Bin. Code Size Reduction Technique and Implementation for Software-pipelined DSP Applications[J]. ACM Trans. on Embedded Computing Systems, 2003, 2(4): 590-613.
  • 2Sanchez F, Cortadella J, Badia R M. Optimal Exploration of the Unrolling Degree for Software Pipelining[J]. Journal of Systems Architecture: the EUROMICRO Journal, 1999, 45(6): 1-16.
  • 3Sarkar V. Optimized Unrolling of Nested Loops[C]//Proc. of the 14th International Conference on Supercomputing. New Mexico, USA: [s. n.], 2000.
  • 4李文龙,刘利,汤志忠.软件流水中的循环展开优化[J].北京航空航天大学学报,2004,30(11):1111-1115. 被引量:16

二级参考文献9

  • 1Allen V H, Jones R B, Lee R M, et al . Software pipelining [J]. ACM Computing Surveys, 1995,27(3):367-432
  • 2Weiss S, Smith J E. A study of scalar compilation techniques for pipelined supercomputers[J]. ACM Transactions on Mathematical Software, 1990, 16(3):223-245
  • 3Rau B R. Iterative modulo scheduling[R]. HPL-94-115, 1994
  • 4Mowry T C, Lam M S, Gupta A. Design and evaluation of a compiler algorithm for prefetching[A]. In: Proceeding of the Fifth International Conference on Architectural Support for Programming Languages and Operating Systems [C]. Massachusetts: ACM Press, 199
  • 5Roy J, Sun C, Wu C Y. Open research compiler for itanium processor family(IPF) [A]. In:MICRO-34 Tutorial [C]. Texas: ACM Press, 2001
  • 6Intel Corporation. Intel IA-64 architecture software developer's manual. Volume 3: Instruction set reference [M]. Intel Corp, 2000
  • 7Intel Corporation. Intel IA-64 architecture software developer's manual. Volume 1: IA-64 application architecture [M]. Intel Corp, 2000
  • 8Sanchez F, Cortadella J,Badia R M. Optimal exploration of the unrolling degree for software pipelining [R]. UPC-DAC-1996-41, 1996
  • 9Vivek Sarkar. Optimized unrolling of nested loops [A]. In: Proceedings of the 14th International Conference on Supercomputing[C]. New Mexico: ACM Press, 2000. 153-166

共引文献15

同被引文献2

引证文献1

二级引证文献3

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部