摘要
实时场景下的小脸检测存在检出率低而且回归精度差的问题。通过融合更底层特征进行多尺度级联预测。根据实时场景下的人脸特点生成不同大小和比例的预测框以更好地适应人脸形状。在预测阶段提出了基于IOU判别的soft and hard nms算法,对冗余预测框进行抑制,设置两个阈值将网络生成的预测框划分为低中高三段,对不同段的预测框采取不同的处理以达到精准筛选的目的。最优架构可在两张NVIDIA GTX 1080显卡下的实时视频检测和摄像头检测中获得45f/s的速度,并且在Wider Face总体验证集上取得82.6%的平均精度。
Tiny face detection in real-time scenes has a low detection rate and poor regression accuracy. This paper further integrates the lower-level feature maps for multi-scale prediction. According to the characteristics of face in real-time scene detection, predicted boxes of different scales are generated to better adapt to human face shape. In the prediction stage, a soft and hard nms algorithm based on Intersection of Union(IOU)discrimination is proposed to suppress the redundant prediction boxes. Two thresholds are set to divide the prediction frame generated by the network into three segments of low, medium and high, and different segments of the prediction boxes are treated differently to achieve accurate suppress. The optimal architecture of the paper can obtain 45 frame per second in real-time video detection and camera detection under two NVIDIA GTX 1080 graphics cards, and achieves an average accuracy of 82.6% on the Wider Face overall validation set.
作者
叶锋
赵兴文
宫恩来
杭丽君
YE Feng;ZHAO Xingwen;GONG Enlai;HANG Lijun(College of Automation,Hangzhou Dianzi University,Hangzhou 310018,China)
出处
《计算机工程与应用》
CSCD
北大核心
2019年第12期162-168,共7页
Computer Engineering and Applications
关键词
深度学习
小脸检测
实时检测
计算机视觉
deep learning
tiny face detection
real-time detection
computer vision