Hardware-Software Partition Technology of Loops for Reconfigurable Processor (循环在可重构处理器上的软硬件划分技术)

Cited by: 2
Abstract: To address the problem of mapping large loops onto a reconfigurable processor, this paper proposes a heuristic hardware-software partitioning algorithm. The algorithm splits a loop into a software part that executes on the processor and a hardware part that executes on the reconfigurable array, and it minimizes the amount of data transferred between the two parts. In tests, the technique reduces loop execution time by 13% to 29% compared with existing methods for handling large loops. The partitioning technique was verified on an FPGA verification system using several multimedia kernel algorithms, including motion estimation in H.264 and the IDCT in MPEG-2. With the technique applied, the verification system achieves an average 3.5x performance improvement over similar architectures without any increase in hardware scale.
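Source: Video Engineering (《电视技术》), PKU Core Journal, 2009, No. 10, pp. 21-23, 40 (4 pages)
Funding: National Natural Science Foundation of China (60676012); National "863" Program Key Project in the Information Technology Field (2009AA011700)
Keywords: reconfigurable processor; reconfigurable array; loop mapping; hardware-software partition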

References (9)

  • 1 HARTENSTEIN R. A decade of reconfigurable computing: A visionary retrospective[C]//2001 Design, Automation and Test in Europe Conference and Exposition (DATE 2001). Munich, Germany: IEEE Press, 2001: 642-649.
  • 2 SINGH H, LEE M-H, LU Guangming. MorphoSys: An integrated reconfigurable system for data-parallel and computation-intensive applications[J]. IEEE Trans. Computers, 2000, 49(5): 465-481.
  • 3 BEREKOVIC M. Mapping of video compression algorithms on the ADRES coarse-grain reconfigurable array[C]//MSP7 Workshop on Multimedia and Stream Processors. Barcelona, Spain: [s.n.], 2005: 47-52.
  • 4 XPP-III processor overview[EB/OL]. [2009-04-01]. http://www.pactxpp.com/main/download/XPP-III_overview_WP.pdf.
  • 5 BONDALAPATI K. Parallelizing DSP nested loops on reconfigurable architectures using data context switching[C]//Proceedings of the 38th Annual Design Automation Conference. New York: ACM Press, 2001: 273-276.
  • 6 VIKRAM K N, VASUDEVAN V. Mapping data-parallel tasks onto partially reconfigurable hybrid processor architectures[J]. IEEE Trans. VLSI Systems, 2006, 14(9): 1010-1023.
  • 7 CUI Yansong, DUAN Dagao, DENG Zhongliang. Fast search algorithm for multiple macroblock modes and multiple reference frames[J]. Journal of Beijing University of Posts and Telecommunications, 2005, 28(4): 37-40. (in Chinese) Cited by: 1
  • 8 HU Jiakai, LIANG Liwei, JIANG Jianguo, QI Meibin. Fast implementation of the H.264 integer transform on the TMS320C64x DSP[J]. Video Engineering, 2005, 29(4): 17-19. (in Chinese) Cited by: 5
  • 9 Texas Instruments Inc. TMS320C6000 assembly benchmarks at Texas Instruments: C64X DSP benchmarks[EB/OL]. [2009-04-05]. http://www.ti.com/sc/docs/products/dsp/c6000/benchmarks162x.htm.
