期刊文献+

基于FPGA的嵌入式多核处理器及SUSAN算法并行化 被引量:5

Multi-Core Embedded Processor Based on FPGA and Parallelization of SUSAN Algorithm
下载PDF
导出
摘要 给出了四核心嵌入式并行处理器FPEP的结构设计并建立了FPGA验证平台.为了对多核处理器平台性能进行评测,提出了基于OpenMP的3种可行的图像处理领域的经典算法SUSAN算法的并行化方法:直接并行化SUSAN、图像分块处理和多图像并行处理,并对这3种并行算法在Intel四核心平台和FPEP的FPGA验证平台上进行性能测试.实验表明,3种并行算法在两种四核心平台下均可获得接近3.0的加速比,多图像并行处理在FPEP的FPGA验证平台可以获得接近4.0的加速比. Structure of 4-core embedded parallel processor is presented and FPGA validation platform that is called FPEP is built.To evaluate performance of multi-core processor platform,three feasible parallel algorithms with OpenMP on SUSAN that is a classic image processing algorithm: direct parallel SUSAN,image block processing and parallel processing of multiple images are proposed,and performance testing of these three parallel algorithms execute on Intel 4-core platform and FPGA validation platform of FPEP.Experimental results demonstrate that speedup ratio of three parallel algorithms on both 4-Core platforms is approximate 3.0,and speedup ratio of parallel processing of multiple images on FPGA validation platform of FPEP is approximate 4.0.
出处 《计算机学报》 EI CSCD 北大核心 2008年第11期1995-2004,共10页 Chinese Journal of Computers
基金 山东省科技发展计划项目基金(2007GG10001020)资助
关键词 SUSAN FPGA OPENMP 多核处理器 图像处理 SUSAN FPGA OpenMP multi-core processor image processing
  • 相关文献

参考文献17

  • 1McNairy C, Bhatia R. Montecito: A dual-core, dual-thread itanium processor. IEEE Micro, 2005, 25(2): 10-20
  • 2Kalla R. IBM power5 chip: A dual-core multithreaded processor. IEEE Micro, 2004, 24(2): 40-47
  • 3Mladen B, Hans J S, Peter P. Multicore system-onchip architecture for MPEG-4 streaming video. IEEE Transactions on Circuits and Systems for Video Technology, 2002, 12(8) : 688-699
  • 4Lee Trong-Yen, Fan Yang-Hsin, Cheng Yu-Min et al. Hardware-oriented partition for embedded multiprocessor FPGA systems//Proceedings of the 2th International Conference on Innovative Computing, Information and Control (ICICIC 2007). Kumamoto, Japan, 2007
  • 5Toledo F, Martinez J, Ferrandez J. FPGA-based platform for image and video processing embedded systems//Proceedings of the 2007 3rd Southern Conference on Programmable Logic (SPL'07). Mar del Plata, Argentina, 2007:171-176
  • 6Hofmann Andreas, Waldschmidt Klaus. SDVMR: A scalable firmware for FPGA-based multi-core Systems-on-Chip// Proceedings- IEEE Computer Society Annual Symposium on VLSI: Trends in VLSI Technology and Design, ISVLSI. Montpellier, France, 2008:387-392
  • 7Noseworthy Joshua, Leeser Miriam. Efficient communication between the embedded processor and the reconfigurable logic on an FPGA. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2008, 16(8) : 1083-1090
  • 8Kleinosowski A, Lilja D J. MinneSPEC: A new SPEC benchmark workload for simulation-based computer architecture research. University of Minnesota ARCTIC Lab, 2002-10
  • 9Levy Markus. Keynote 1: Using EEMBC benchmarks to understand processor behavior in embedded applications//Proceedings of the 1st International Conference on High Performance Embedded Architectures and Compilers (HiPEAC 2005). Lecture Notes in Computer Science 3793. Barcelona, Spain, 2005:3-4
  • 10Guthaus M R, Ringenberg J S, Ernst D et al. MiBench: A free, commercially representative embedded benchmark suite//Proceedings of the IEEE International Workshop on Workload Characterization. Los Alamitos, CA: IEEE Computer Society Press, 2000: 266-277

同被引文献44

  • 1张丽,陈志强,高文焕,康克军.均值加速的快速中值滤波算法[J].清华大学学报(自然科学版),2004,44(9):1157-1159. 被引量:54
  • 2颜露新,张天序,邹胜,钟胜.用FPGA实现互联的多DSP并行系统结构[J].系统工程与电子技术,2005,27(10):1757-1759. 被引量:4
  • 3朱捷,朱小娟,贺明.基于FPGA的实时图像中值滤波设计[J].计算机测量与控制,2007,15(6):798-800. 被引量:13
  • 4Barriga A, Sanchez-Solano S, Brox P, et al. Modeling and implementation of fuzzy systems based on VHDL[ J]. International Journal of Approximate Reasoning,2004,41 (2) : 164 - 178.
  • 5张卫国,杨向忠.模糊控制理论与应用[M].西安:西北工业大学出版社,2004.
  • 6BhaskerJ.VHDL教程[M].刘雷波,孟一聪,译.北京:机械工业出版社,2006.
  • 7董付国,原达,王金鹏.中值滤波快速算法的进一步思考[J].计算机工程与应用,2007,43(26):48-49. 被引量:17
  • 8Park S H,Park J W,Jeong J M,et al.A mixed flash translation layer structure for SLC-MLC combined flash memory system. Proc of SPEED’’08 . 2008
  • 9Alexander D D,Mavis D G.Design issues for radiation tolerant microcircuits in space. Proc of IEEE Nuelear and Space Radiation Eeffets Conf . 1996
  • 10Lu Q,Alias C,Bondhugula U.Data layout transformation for enhancing data locality on NUCA chip multiprocessors. Proc of the18th Int Conf on PACT . 2009

引证文献5

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部