期刊文献+

一种面向媒体处理器有效生成并行指令的方法 被引量:2

An Approach to Generate Parallel Instructions Effectively for Media Processors
下载PDF
导出
摘要 随着数字信号处理器(DSP)的迅速发展,面向DSP设计可重定目标的C编译器成为必然。由GCC编译器移植面向媒体处理器的C编译器,利用指令模板匹配不能有效的生成并行指令。本文结合32位具有RISC-DSP结构的媒体处理器MD32特有的体系结构特点,提出C编译器支持的,在汇编级通过指令调度和调整指令操作数及其类型的代码优化方法,实现输出高效的并行指令,同时提高了代码的运行效率和代码的密度。统计数据表明:代码执行效率平均可以提高14%,而代码密度平均提高11.75%。这种方法比树模式匹配方法更容易实现,并适用于支持并行指令的其他DSP处理器。 With the development of digital signal processing (DSP) processors, the design of retargetable C compiler is necessary. C compiler for media processors is build by porting GCC, all instructions including parallel instructions are generated through instruction pattern matching, which is very difficult to generate effective parallel instructions. Meeting with the features in architecture of RISC-DSP processor designed by us, this paper proposes a compiler supported code generation at assemble level, implemented by instruction scheduling and operand and its type modifying. Therefore, this can generate effective parallel instructions and improve the performance and density of object code. The statistic data show that the performance of object code may be improved by 14% average while the code density may be improved by 11.75% average. This method is easier than tree pattern matching, and can be used in other DSP processors supporting parallel instructions.
出处 《科技通报》 2005年第4期453-459,470,共8页 Bulletin of Science and Technology
基金 国家高技术研究发展计划(863)基金项目(No.2002AAlZll40) 浙江省自然科学基金项目(No.602031)
关键词 计算机体系结构 代码生成 代码优化 编译器 媒体处理器 computer architecture code generation code optimization compiler media processor
  • 相关文献

参考文献11

  • 1Lei Wang, Boying Lu, Li Zhang. The study and implementation of architecture-dependent optimization in GCC[A]. High Performance Computing in the Asia-Pacific Region, 2000. Proceedings[C]. The Fourth International Conference/Exhibition on, Volume: 1 , 14-17 May 2000,253 -255 vol.1.
  • 2Alves J, Held M, Glesner M. A code generator for an application specific pipelined processor [A]. Electrotechnical Conference, 1994. Proceedings [C].7th Mediterranean, vol,1, 12-14 April 1994, 306-308.
  • 3Kuroda I, Hirano A, Nishitani T. A knowledge-based compiler enhancing DSP internal parallelism [A]. Circuits and Systems, 1991 [C]. IEEE International Sympoisum on, vol.1, 11-14 June 1991, 236 -239.
  • 4Yin-Tsung Hwang, Jer-Sho Hwang. Efficient code generation for digital signal processors with parallel and pipelined instructions [A]. Signal Processing Systems,1997. SIPS 97 - Design and Implementation[C]. 1997 IEEE Workshop on, 3-5 Nov. 1997. 243-252.
  • 5Hatcher P.J TuBer J,W. Efficient retargetable compiler code generation[A]. Computer Languages, 1988. Proceedings [C]. International Conference on, 9-13 Oct.1988, 25-30.
  • 6Marwedel P. Code generation for core processors[A]. Design Automation Conference[C], 1997. Proceedings of the 34th.June 9-13. 1997. 232 -237.
  • 7Baudendistel K, McClellan JH. Code generation for the AT&T DSP32[A]. Acoustics, Speech, and Signal Processing, 1990. ICASSP-90[C]. 1990 International Conference on, vol.2, 3-6 April 1990, 1073 -1076.
  • 8Tzer-Shyong Chen, Feipei Lai, Rung-Ji Shang. A simple tree pattern matching algorithm for code generator [A].Computer Software and Applications Conference [C].1995. COMPSAC 95. Proceedings., Nineteenth Annual International, 9-11 Aug. 1995, 162-167.
  • 9Leupers R. Code generation for embedded processors[A].System Synthesis, 2000. Proceedings[C]. The 13th International Symposium on, 20-22 Sept. 2000, 173 -178.
  • 10Richard M Stallman. Using and porting the GNU compiler collection for GCC 3.1. [M]. Free Software Foundation, Last updated 22 June 2001.

同被引文献8

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部