Abstract
To address the loss of counting accuracy that image background noise and perspective distortion cause in crowd counting networks, a new network based on background suppression and context awareness is proposed. A VGG-16 network extracts image features, which are fed into a Density Map Generation (DMG) module and a Background Noise Suppression (BNS) module to generate a density feature map and a spatial attention map. The BNS module refines the density feature map into a primary density map, suppressing background noise in the image and increasing the feature weight of crowd regions. To reduce the influence of perspective distortion on crowd density estimation, a Weight Enhancement-Context Aware Network (WE-CAN) refines the primary density map and generates the predicted density map. Experimental results on three public datasets, ShanghaiTech, UCF-CC-50, and UCF-QNRF, show that the network counts more accurately than Multi-Column Convolutional Neural Network (MCNN), Switching Convolutional Neural Network (SwitchCNN), Congested Scene Recognition Network (CSRNet), and other networks. On UCF-QNRF in particular, its Mean Absolute Error (MAE) and Mean Square Error (MSE) are 85.8 and 146.0, which are up to 69.0% and 67.2% lower, respectively, than those of the other networks. The network suppresses image background noise, reduces the error caused by perspective distortion, and shows good generalization ability and robustness.
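The pipeline and metrics named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the spatial attention map is applied, so element-wise multiplication is assumed here, and all function names are hypothetical. The sketch also assumes the convention common in crowd-counting work, where the figure reported as MSE is the square root of the mean squared count error; the abstract does not confirm which definition the paper uses.

```python
import math


def suppress_background(density_features, attention_map):
    """Weight each density value by its attention value (assumed element-wise product).

    Attention near 0 marks background, attention near 1 marks crowd regions.
    """
    return [
        [d * a for d, a in zip(d_row, a_row)]
        for d_row, a_row in zip(density_features, attention_map)
    ]


def crowd_count(density_map):
    """The predicted count is the sum over all cells of the density map."""
    return sum(sum(row) for row in density_map)


def mae(predicted, actual):
    """Mean absolute error between predicted and ground-truth counts."""
    return sum(abs(p - t) for p, t in zip(predicted, actual)) / len(actual)


def root_mse(predicted, actual):
    """Square root of the mean squared count error (assumed 'MSE' convention)."""
    return math.sqrt(
        sum((p - t) ** 2 for p, t in zip(predicted, actual)) / len(actual)
    )


def relative_reduction(baseline_error, new_error):
    """Percentage by which new_error is lower than baseline_error."""
    return (baseline_error - new_error) / baseline_error * 100.0


# Toy 2x2 example with made-up values: the attention map zeroes out
# the two cells treated as background.
features = [[0.5, 0.25], [0.25, 1.0]]
attention = [[1.0, 0.0], [0.0, 1.0]]
primary_map = suppress_background(features, attention)
print(crowd_count(primary_map))
```

The percentage figures in the abstract are relative reductions of this kind, computed against the error of each compared network; the baseline values themselves are not given in the abstract, so none are reproduced here.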
Authors
HUANG Yiqiu (黄奕秋)
HU Xiao (胡晓)
YANG Jiaxin (杨佳信)
OU Jiamin (欧嘉敏)
School of Electronics and Communication Engineering, Guangzhou University, Guangzhou 510006, China; School of Mechanical and Electrical Engineering, Guangzhou University, Guangzhou 510006, China
Source
《计算机工程》
CAS
CSCD
Peking University Core Journals (北大核心)
2022, No. 9, pp. 314-320 (7 pages)
Computer Engineering
Funding
National Natural Science Foundation of China (62076075).
Keywords
crowd counting
deep learning
density map
background noise
context awareness