摘要
针对当前神经网络加速器难以高效实现目标跟踪边框后处理的问题,提出一种高效的目标跟踪专用加速器.引入神经网络架构,用于提取输入视图特征并生成边框置信度与偏移量集合.随后针对目标跟踪的边框处理设计了专用于边框的回归、惩罚以及提取操作的加速模块,通过同步神经网络加速器与专用加速模块间的数据,以流水结构并行执行特征提取与边框操作,实现基于深度学习目标跟踪的端到端处理.该加速器在40 nm工艺下消耗面积3.64mm^(2),获得了5.71 Tops/W能效比.实验结果表明:与现有加速方案相比,该目标跟踪加速器获得了1.53倍加速,可实现实时的视频处理(31 fps).其中仅针对跟踪过程的后处理任务,专用加速模块相对RISC处理器可实现3.2倍的加速比.
Since the current nerual network accelerator couldn t efficiently accelerate the post-processing of object tracking»a dedicated object trackeris proposed.A neural network architecture is introduced to extract the features of the input feature map.At the meanwhile,it generates thebounding box confidence and position offset sets.Adedicated acceleration module is designed for the anchor regression,penalty calculation and extraction.By synchronizing the data between the neural network accelerator and the dedicated module,a new pipelined structure is proposed to execute the feature extraction and anchor regression in parallel.Therefore,the end-to-end processing of the object tracking is efficiently achieved.The accelerator consumes an area of 3.64 mm^(2)under the SMIC 40nm process,and achieves 5.71 Tops/W energy efficiency.Experimental results show that,compared with the current accleration solutions,the object tracking accelerator achieves 1.53 times acceleration,and it could realize real-time video processing(31 fps).For the post-processing task of the tracking,the processing speeds of the proposed dedicated module is improved by 3.2 times than the RISC processor.
作者
李倍
闵丰
杨军
梁科
李国峰
LI Bei;MIN Feng;YANG Jun;LIANG Ke;LI Guofeng(Tianjin Key Laboratory of Optoelectronic Sensor and Sensing Network Technology,Integrated Circuit and System Integration Laboratory of Nankai University,Tianjin 300350,China;Institute of Computing Technology,Chinese Academy of Sciences,Beijing 100190,China)
出处
《微电子学与计算机》
2021年第8期53-58,共6页
Microelectronics & Computer
基金
国家自然科学基金项目(62004198)
北京市自然科学基金资助项目(4194092)
国家重点研发计划(2018AAA0102505)。
关键词
深度学习目标跟踪
后处理
专用模块
硬件加速器
DNN object tracking
post-processing
dedicated module
hardware accelerator