Parallel Multi-Pipeline Design of Embedded Heterogeneous AI Computing Systems
Abstract Embedded AI computing systems operate under tight power budgets and must process data from multiple sensors intelligently in real time, which places severe demands on both the energy efficiency of the hardware platform's AI compute and the degree of parallelism of its intelligent computing workloads. The DSP+FPGA digital signal processing architecture commonly used in traditional embedded computing systems is not suited to accelerating inference for multiple neural network models. Based on the ARM+DLP+SRIO embedded heterogeneous intelligent computing architecture, this paper proposes a parallel multi-pipeline design method that exploits the multi-chip, multi-core, and multi-memory-channel characteristics of the deep learning processor (DLP). The method accounts for the time cost of data transmission, copy, inference, and result feedback in an intelligent computing workload, and allocates AI compute resources to the different neural network models so as to maximize end-to-end throughput. Experimental results show that DLP utilization with the parallel multi-pipeline design is on average about 25.2% higher than with a single pipeline and about 30.7% higher than without pipelining. The design meets the real-time intelligent processing requirements of multimodal imagery such as visible-light, infrared, and SAR images, and is of practical value.
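The effect of overlapping the four stages named in the abstract can be illustrated with a small timing model. The sketch below is not the authors' method: the stage times, the function names, and the assumption that parallel pipelines on separate cores do not contend for shared resources are all hypothetical, chosen only to show why pipelining raises the share of time spent on inference.

```python
from typing import Sequence

# Stage order follows the abstract: data transmission, copy, inference, result feedback.
STAGES = ("transfer", "copy", "infer", "feedback")


def serial_makespan(times: Sequence[float], frames: int) -> float:
    """No pipelining: a frame runs through every stage before the next frame starts."""
    return frames * sum(times)


def pipelined_makespan(times: Sequence[float], frames: int) -> float:
    """Single pipeline: a frame enters stage s once it has left stage s-1
    and the previous frame has left stage s."""
    finish = [0.0] * len(times)  # finish time of the most recent frame at each stage
    for _ in range(frames):
        prev_stage_done = 0.0
        for s, t in enumerate(times):
            start = max(prev_stage_done, finish[s])
            finish[s] = start + t
            prev_stage_done = finish[s]
    return finish[-1]


def inference_utilization(times: Sequence[float], frames: int, makespan: float) -> float:
    """Fraction of the makespan during which the inference stage is busy."""
    return frames * times[STAGES.index("infer")] / makespan


def device_utilization(per_core: Sequence[float], total_cores: int) -> float:
    """Average utilization over all cores; cores with no pipeline count as idle.
    Assumes (hypothetically) that pipelines on different cores do not interfere."""
    return sum(per_core) / total_cores


if __name__ == "__main__":
    times = (2.0, 1.0, 4.0, 1.0)  # hypothetical per-frame stage times in ms
    frames = 100
    serial = serial_makespan(times, frames)
    piped = pipelined_makespan(times, frames)
    u_serial = inference_utilization(times, frames, serial)
    u_piped = inference_utilization(times, frames, piped)
    print(f"no pipeline:     {u_serial:.3f}")
    print(f"single pipeline: {u_piped:.3f}")
    # One pipeline on a 4-core device versus one pipeline per core.
    print(f"1 of 4 cores:    {device_utilization([u_piped], 4):.3f}")
    print(f"4 of 4 cores:    {device_utilization([u_piped] * 4, 4):.3f}")
```

In this model a single pipeline is bounded by its slowest stage, and running one pipeline per core keeps the remaining cores from sitting idle. Contention on a shared link or memory channel, which the paper's method accounts for, is deliberately left out of the sketch.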
Authors ZHAO Er-hu (赵二虎); WU Ji-wen (吴济文); XIAO Si-ying (肖思莹); JIN Zhen-jie (晋振杰); XU Yong-jun (徐勇军) (Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China)
Source Acta Electronica Sinica (《电子学报》); indexed in EI, CAS, CSCD, and the Peking University Core Journals list; 2023, No. 11, pp. 3354-3364 (11 pages)
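Funding Technical Support Talent Program of the Chinese Academy of Sciences; Open Fund of the Qianjiang Laboratory, Hangzhou Innovation Institute of Beihang University (No. 2020-Y8-A-023).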
Keywords embedded AI computing systems; heterogeneous computing architecture; neural network model; parallel multi-pipeline; deep learning processor (DLP)