摘要
针对卷积神经网络对于运算资源需求的不断增长,和传统的硬件卷积加速方案在功耗、面积敏感的边缘计算领域难以应用的问题,设计并实现了一个低功耗嵌入式卷积神经网络加速处理器.目标处理器基于RISC-V指令集架构,内核扩展4条自定义神经网络指令,并在硬件层面实现加速处理.该卷积神经网络处理器最大程度的复用了原RISC-V的数据通路和功能模块,减小了额外的功耗和芯片面积等资源开销.目标处理器通过RISC-V官方标准测试集验证,并对MNIST手写数据集进行识别测试,正确率达到97.23%.在TSMC 40nm标准数字工艺下,目标处理器面积仅为0.34 mm^(2,),动态功耗仅为11.1μw/MHz,与同期处理器相比,面积和功耗方面均具有一定优势.
A low power embedded convolutional neural network acceleration processor is designed and implemented in response to the growing demand for computational resources in convolutional neural networks and the difficulty of applying traditional hardware convolution acceleration schemes in power-and area-sensitive edge computation applications.The target processor based on the RISC-V instruction set architecture,extend four custom neural network instructions,and accelerates the processing on the hardware architecture.The convolutional neural network processor maximizes the reuse of the original RISC-V data path and functional modules,reducing additional resourceoverhead.The target processor is verified by the RISC-V official standard test set,and the MNIST handwritten data set is identified and tested,with a correct rate of 97.23%.The target processor occupies 0.34 mm^2 area and consume 11.1μw/MHz dynamic power using TSMC 40 nm technology.Compared with the correlation processor,it has certain advantages in area and power consumption.
作者
傅思扬
陈华
郁发新
FU Si-yang;CHEN Hua;YU Fa-xin(college of Aeronautics and Astronautics,Zhejiang University,Hangzhou,310058,China)
出处
《微电子学与计算机》
北大核心
2020年第4期49-54,共6页
Microelectronics & Computer
基金
国家自然科学基金(61604128)
中央高校基本科研业务费专项项目(2017QN81002)。