期刊文献+

面向微控制器的卷积神经网络加速器设计

Design of Convolutional Neural Network Accelerator for Microcontroller
下载PDF
导出
摘要 针对目前嵌入式微控制器的性能难以满足实时图像识别任务的问题,提出一种适用于微控制器的卷积神经网络加速器。该加速器在卷积层设计了无阻塞的行并行乘法-加法树结构,获得了更高的硬件利用率;为了满足行并行的数据吞吐量,设计了卷积专用SRAM存储器。加速器将池化和激活单元融入数据通路,有效减少数据重复存取带来的时间开销。FPGA原型验证表明加速器的性能达到92.2 GOPS@100 MHz;基于TSMC 130 nm工艺节点进行逻辑综合,加速器的动态功耗为33 mW,面积为90 764.2μm^(2),能效比高达2 793 GOPS/W,比FPGA加速器方案提高了约100倍。该加速器低功耗、低成本的特性,有利于实现嵌入式系统在目标检测、人脸识别等机器视觉领域的广泛应用。 Aiming at the problem that the performance of embedded microcontroller is difficult to meet the task of real-time image recog-nition,a convolutional neural network accelerator suitable for microcontroller is proposed.The accelerator has a non blocking row paral-lel multiplier adder unit structure in the convolutional layer.It has higher hardware utilization.In order to meet the throughput of row parallel data,a special convolution SRAM memory is designed.The accelerator integrates pooling and activation units into the data path,effectively reducing the time overhead caused by repeated data access.Through FPGA prototype verification,the performance of the accelerator can reach 92.2 GOPS@100 MHz.The accelerator is synthesized based on TSMC 130 nm process.The dynamic power consumption of the accelerator is 33 mW,the area is 90764.2μm^(2),and the energy efficiency ratio is 2793 GOPS/W,which is about a hundred times higher than that of FPGA accelerator.The accelerator has the characteristics of low power and cost,which is conducive to the wide application of embedded systems in the field of machine vision,such as object detection,face recognition and so on.
作者 乔建华 吴言 栗亚宁 雷光政 QIAO Jianhua;WU Yan;LI Yaning;LEI Guangzheng(School of Electronic Information Engineering,Taiyuan University of Science and Technology,Taiyuan Shanxi 030024,China)
出处 《电子器件》 CAS 2024年第1期48-54,共7页 Chinese Journal of Electron Devices
基金 山西省研究生教育改革研究课题项目(2021YJJG247)。
关键词 卷积神经网络 并行计算 流水线 硬件加速器 专用集成电路 convolutional neural network parallel computing pipeline hardware accelerator application specific integrated circuit
  • 相关文献

参考文献6

二级参考文献27

共引文献83

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部