摘要
随着近年来硬件的飞速发展,深度学习又一次成为了研究的热门领域,其中卷积神经网络在多个方面显示了突出的表现.卷积层是卷积神经网络中最重要的组成部分,具有大量乘加计算.针对该特点,提出了流水线式的FPGA卷积层并行加速模块.该电路可以在一个周期内获得一个计算结果.在相同结构和数据集的情况下,FPGA的计算效率分别是CPU,GPU的近7倍和5倍,而功耗只有GPU的28.87%.
With the development of hardware, deep learning has been a hot area again, in which Convolutional Neural Network (CNN) shows excellent performance in several aspects. Convolution layer is the most important part of CNN, and has lots of multiplications and additions. For this [eature, a FPGA-based accelerator with pipelineis designed for convolution layer. The designed circuit can compute one result in a single clock cycle. Under the same framework and dataset, FPGA has nearly 7x and 5x computational efficiency of CPU and GPU, and has only 28. 87% power consumption of GPU.
作者
陈煌
祝永新
田犁
汪辉
封松林
CHEN Huang;ZHU Yong-xin;TIAN Li;WANG Hui;FENG Song-lin(Shanghai Advanced Research Institute,Chinese Academy of Sciences,CIS Lab,Shanghai 201210,China;University of Chinese Academy of Sciences,Beijing 100049,China;ShanghaiTech University,School of Information Science & Technology,Shanghai 201210,China)
出处
《微电子学与计算机》
CSCD
北大核心
2018年第10期85-88,共4页
Microelectronics & Computer
基金
国家重点研发计划(2017YFA206104)
上海市科学技术委员会科研计划项目(16511108701)
张江管委会公共服务平台项目(2016-14)
关键词
卷积神经网络
FPGA
卷积层
convolutional neural network
FPGA
convolution layer