期刊文献+

基于65nm工艺的高性能低功耗处理器设计

Design of High Performance and Lower Power Processor Based on 65 nm Technology
下载PDF
导出
摘要 研究并设计一款RISC处理器,从架构设计、电路设计、芯片后端设计多个层次保证其高性能、低功耗的特点。在架构设计层面,通过扩展寄存器堆来提升数据交互的局部性并降低对存储器的访问次数。在电路设计层面,利用动态门控时钟技术对乘除法模块和寄存器堆进行高效的时钟控制。在芯片后端设计层面,分析并比较TSMC 65 nm中GP和LP 2种工艺库,采用多阈值设计流程进一步提高处理器的速度并降低功耗。测试结果表明,与其他平台下的性能结果相比,该处理器可以将RS前向纠错解码算法的吞吐率提高4倍~70倍。 This paper presents the design of a high performance lower power RISC processor from architecture level,circuit level and backend design level.In architecture level,the processor is cache-free and register file extended to improve data locality and reduce memory accesses.In circuit level,gating clock technology is used to control MDU and register file to reduce power,and in backend level,this paper analyzes and compares GP and LP processes in TSMC 65 nm,and uses multi-threshold flow to increase speed and reduce power.Test results show that the processor can make throughput rate of RS forward error correction decoding increase 4 times^70 times,compared with other platforms.
出处 《计算机工程》 CAS CSCD 2012年第19期250-253,共4页 Computer Engineering
基金 国家自然科学基金资助项目(61103008) 国家科技重大专项基金资助项目(2011ZX03003-003-03) 上海市科委集成电路专项基金资助项目(10706200300) 上海市青年科技启明星基金资助项目(11QA1400500)
关键词 高性能低功耗处理器 扩展寄存器 门控时钟 65nm工艺 多阈值 high performance and lower power processor extended register gating clock 65 nm technology multi-threshold
  • 相关文献

参考文献12

  • 1周俊,林正浩.微处理器的低功耗芯片设计技术[J].单片机与嵌入式系统应用,2007(1):8-10. 被引量:5
  • 2Dias W P, Colonese E. Performance Analysis of Cache and Scratchpad Memory in an Embedded High Performance Processor[C]//Proc. of the 5th International Conference on Information Technology. Las Vegas, USA: IEEE Computer Society, 2008: 657-661.
  • 3李晓潮,焦李成,洪新华,卢潇.基于嵌入式SIMD处理器的音频解码优化[J].计算机工程,2009,35(1):13-16. 被引量:1
  • 4MIPS Technologies Company. MIPS32 4KE? Processor Core Family Software User’s Manual[Z]. 2004.
  • 5Quan Heng, Xiao Ruijin, You Kaidi, et al. A Novel Vector/SIMD Multiply-accumulate Unit Based on Reconfigurable Booth Array[C]//Proc. of the 10th IEEE International Conference on Solid-state and Integrated Circuit Technology. [S. 1.]: IEEE Press, 2010: 524-526.
  • 6Xiao Ruijin, Quan Heng, You Kaidi, et al. A Novel Multi-core Processor for Communication Applications[C]//Proc. of the 10th IEEE International Conference on Solid-state and Integrated Circuit Technology. [S. 1.]: IEEE Press, 2010: 236-238.
  • 7Tavel B, Duriez B, Gwoziecki R, et al. 65 nm LP/GP Mix Low Cost Platform for Multi-media Wireless and Consumer Applications[C]//Proc. of ESSDERC’05. Grenoble, France: [s. n.], 2005.
  • 8Taiwan Semiconductor Manufacturing Company. 90/65/45 nm Standard Cell Library Application Note[Z]. 2011.
  • 9Yu Zhiyi, You Kaidi, Xiao Ruijin, et al. An 800 MHz 320 mW 16-core Processor with Message-passing and Shared-memory Inter-core Communication Mechanisms[C]//Proc. of ISSCC’12. Shanghai, China: [s. n.], 2012: 14-15.
  • 10Quan Heng, Xiao Ruijin, You Kaidi. A Simple High-efficient Inter- core Communication Mechanism for Multi-core Systems[C]//Proc. of Asia-pacific Signal & Information Processing Association Annual Summit & Conference. [S. 1.]: IEEE Press, 2011.

二级参考文献19

  • 1Bhargava R, John L K, Evans B L, et al. Evaluating MMX Technology Using DSP and Multimedia Applications[C]// Proceedings of the 31st Annual ACM/IEEE International Symposium on Microarchitecture. [S. l.]: IEEE Press, 1998: 37-46.
  • 2Simunic T, Benini L, Micheli G D, et al. Source Code Optimization and Profiling of Energy Consumption in Embedded Systems[C]// Proceedings of the 13th International Symposium on System Synthesis. NY, USA: [s. n.], 2000: 193-198.
  • 3Wragg B, Carpenter E An Optimised Software Solution for an ARM Powered MP3 Decoder[M]. Cambridge, UK: ARM Corporation, 2000.
  • 4Gurkhe V. Optimization of a MP3 Decoder on the ARM Processor[C]//Proc. of TENCON'03. NY, USA: [s. n.], 2003: 1475-1478.
  • 5Anguita M, Martfnez-Lechado J M. MP3 Optimization Exploiting Processor Architecture and Using Better Algorithms[J]. IEEE Micro, 2005, 25(3): 81-92.
  • 6Bhaskaran V, Konstantinides K, Lee R B, et al. Algorithmic and Architectural Enhancements for Real-time MPEG-1 Decoding on a General Purpose RISC Workstation[J]. IEEE Trans. on Circuits and Systems for Video Technology, 1995, 5(5): 380-386.
  • 7Hans M C, Bhaskaran V. A Compliant MPEG-1 Layer Ⅱ Audio Decoder with 16-B Arithmetic Operations[J]. IEEE Signal Processing Letters, 1997, 4(5): 121-122.
  • 8Parthasarathy V, Bharathi S A, Uthariaraj V R. Performance Analysis of Embedded Media Applications in Newer ARM Architectures [C]//Proceedings of the 2005 International Conference on Parallel Processing Workshops. NY, USA: [s. n.], 2005: 210-214.
  • 9Lee B G. A New Algorithm to Compute the Discrete Cosine Transform[J]. IEEE Trans. on Acoustic Speech Signal Processing, 1984, 32(6): 1243-1245.
  • 10ISO/IEC 13818-3 Information Technology-generic Coding of Moving Pictures and Associated Audio[S]. 1994.

共引文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部