基于双重注意力机制的人群计数方法

Crowd counting method based on dual attention mechanism

下载PDF

导出

摘要针对复杂场景下人群计数问题中的尺度变化、背景干扰和部分遮挡等问题,在空洞卷积操作的基础上,提出一种基于双重注意力机制的空洞上下文卷积神经网络(DA-DCCNN)。首先,将VGG16中的卷积层作为特征提取器,获取人群图像抽象、深层的特征图;其次,利用空洞卷积构造空洞上下文模块(DCM)对不同层获取的特征进行连接,并引入空间注意力模块(SAM)和通道注意力模块(CAM)获取上下文信息;最后,组合欧氏距离和交叉熵构造损失函数,对网络预测注意力图和真实注意力图之间的差异进行度量。在ShanghaiTech、UCF_CC_50和UCF-QNRF 3个公开数据集上的实验结果表明,DA-DCCNN在有效获取图像的多尺度特征的同时,增强了对图像中重要区域和通道的感知能力,平均绝对误差(MAE)取得了相对最优的结果。基于双重注意力机制的特征融合网络能有效感知图像中的空间结构和局部特征,从而使得生成的密度图能更准确地对人群区域进行预测和计数。 In response to challenges such as scale variation,background interference,and partial occlusion in crowd counting within complex scenes,a DA-DCCNN(Dual Attention based Dilated Contextual Convolutional Neural Network)was proposed.Firstly,the convolutional layers from VGG16 were utilized as feature extractors to obtain abstract and deeplevel feature maps of the crowd image.Subsequently,by employing dilated convolutions,a Dilated Context Module(DCM)was constructed to connect features obtained from different layers.The Spatial Attention Module(SAM)and Channel Attention Module(CAM)were introduced to acquire contextual information.Finally,a loss function was formulated by combining the Euclidean distance and cross entropy to measure the disparity between the predicted attention map and the ground truth attention map.Experimental results on three publicly available datasets—ShanghaiTech,UCF_CC_50 and UCF-QNRF demonstrate that DA-DCCNN can effectively capture multi-scale features in the image and enhance the perception of important regions and channels within the image,achieving the optimal Mean Absolute Error(MAE).The feature fusion network based on dual attention mechanism can efficiently recognize spatial structures and local features in images so that by using the generated density maps,the crowd regions can be predicted and counted more accurately.

作者赵志强马培红黑新宏 ZHAO Zhiqiang;MA Peihong;HEI Xinhong(School of Computer Science and Engineering,Xi’an University of Technology,Xi’an Shaanxi 710048,China;Shaanxi Key Laboratory of Network Computing and Security Technology(Xi’an University of Technology),Xi’an Shaanxi 710048,China)

机构地区西安理工大学计算机科学与工程学院陕西省网络计算与安全技术重点实验室(西安理工大学)

出处《计算机应用》 CSCD 北大核心 2024年第9期2886-2892,共7页 journal of Computer Applications

基金国家自然科学基金资助项目(61976177) 陕西省重点研发计划项目(2023-YBGY-222)。

关键词空洞卷积上下文特征双重注意力机制密度图人群计数 dilated convolution contextual feature dual attention mechanism density map crowd counting

分类号 TP183 [自动化与计算机技术—控制理论与控制工程] TP391.1 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献2

1余鹰,朱慧琳,钱进,潘诚,苗夺谦.基于深度学习的人群计数研究综述[J].计算机研究与发展,2021,58(12):2724-2747. 被引量：14
2覃勋辉,王修飞,周曦,刘艳飞,李远钱.多种人群密度场景下的人群计数[J].中国图象图形学报,2013,18(4):392-398. 被引量：31

二级参考文献18

1Li M, Zhang Z X, Huang K Q. Estimating the number of people in crowded scenes by MID based foreground segmentation and head-shoulder detection[ C ]// Proceedings of the 19th Interna- tional Conference on Pattern Recognition. Florida ,USA: IEEE, 2008 1-4.
2Wu B, Nevatia R. Detection of multiple, partially occluded hu- mans in a single image by bayesian combination of edgelet part detectors[ C ]// Proceedings of the 10th IEEE International Con- ference on Computer Vision. Beijing, China: IEEE, 2005:90- 97.
3Zhao T, Nevatia R, Wu B. Segmentation and tracking of multi- ple humans in crowded environments [ J ]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008, 30(7) :1198- 1211.
4Choudri S, Ferryman J M, Badii A. Robust background model for pixel based people counting using a single unealibrated camera [ CI//Proceedings of the 12th IEEE International Workshop on Performance Evaluation of Tracking and Surveillance. Snowbird, Utah : IEEE, 2009 : 1-8.
5Hou Y L, Pang G K. PeopLe counting and human detection in a challenging situation[J]. IEEE Transactions on Systems Man and Cybernetics, 2011, 41 ( 1 ) :24-33.
6Celik I-I, Hanjalic A, Flendriks E A. Towards a robust solution to people counting[ C] // Proceedings of IEEE International Con- ference on hnage Processing. Atlanta, USA : IEEE, 2006 : 2401- 2404.
7Conte D, Foggia P, Percannella G. A method for counting people in crowded scenes[ C]//Proceedings of the Seventh IEEE Inter- national Conference on Advanced Video and Signal based Surveil- lance. Klagenfurt, Austria :IEEE, 2011:111-118.
8Conte D, Foggia P, Percannella G. Counting moving people in videos by salient points detection [ C]// Proceedings of the 20th International Conference on Pattern Recognition. Istanbu, Turkey : IEEE, 2010 : 1743-1746.
9Wu X Y, Liang G Y, Lee K K. Crowd density estimation using texture analysis and learning [ C]// Proceedings of the IEEE International Contrence on Robotics and Biomimetics. Kunming, China : 1EEE,2006:214-219.
10Chan A B, Liang Z S, Vasconcelos N. Privacy preserving crowd monitoring counting people without people models or tracking[ C]//Proceedings of IEEE Conference on Computer Vision and Pattern Recognition. Florida, USA : IEEE, 2008 : 1-7.

共引文献43

1常庆龙,夏洪山,黎宁.一种基于归一化前景和角点信息的复杂场景人数统计方法[J].电子与信息学报,2014,36(2):312-317. 被引量：6
2胡正平,武丽丽,李朝辉.交通场景中采用有监督序学习拥挤度排序算法[J].信号处理,2014,30(12):1464-1472.
3侯鹏鹏.基于GLCM纹理特征分析的人群密度估计方法实现[J].中国安防,2014,0(12):88-90. 被引量：3
4黄铁军,郑锦,李波,傅慧源,马华东,薛向阳,姜育刚,于俊清.多媒体技术研究:2013——面向智能视频监控的视觉感知与处理[J].中国图象图形学报,2014,19(11):1539-1562. 被引量：26
5徐麦平,张二虎,陈亚军.融合像素与纹理特征的人群人数统计方法研究[J].西安理工大学学报,2015,31(3):340-346. 被引量：2
6丁艺,陈树越,刘金星,戴永惠,朱双双.基于归一化目标像素的人群密度估计方法[J].计算机应用与软件,2016,33(4):212-214. 被引量：5
7安曦宁.基于改进混合高斯模型的人群密度估计研究[J].电子科技,2017,30(5):180-183. 被引量：6
8成金庚,计科峰.结合群组动量特征与卷积神经网络的人群行为分析[J].科学技术与工程,2017,17(14):79-85. 被引量：2
9陈思秦.基于全卷积神经网络的人群计数[J].电子设计工程,2018,26(2):75-79. 被引量：2
10张君军,石志广,李吉成.人数统计与人群密度估计技术研究现状与趋势[J].计算机工程与科学,2018,40(2):282-291. 被引量：27

1黄路,李泽平,杨文帮,赵勇.融合自适应感受野的目标检测算法[J].计算机工程与设计,2024,45(9):2669-2675.
2王曼铃,侯典峰.2024年九省联考第5题的四个求解策略[J].中学生数学,2024(17):41-43.
3陈泽纯,林富生,张庆,宋志峰,刘泠杉,余联庆.基于改进YOLOv7的织物疵点小目标检测算法[J].棉纺织技术,2024,52(10):26-32.
4孙文赟,车嘉航,金忠.基于全局上下文注意力特征融合金字塔网络的遥感目标检测[J].计算机系统应用,2024,33(9):114-122.
5程诗蕾,程国建.基于YOLOv8的井场设施安全实时监测新算法[J].石油工业技术监督,2024,40(9):45-50.
6黄淑英,夏钰锟,杨勇,万伟国,邱根莹.基于暗通道先验引导的图像去雾网络[J].北京航空航天大学学报,2024,50(9):2717-2726.
7樊翔宇,代琦.基于改进YOLOv5的菌落计数算法研究[J].软件工程,2024,27(10):34-38.
8王磊,张斌,吴奇鸿.RCSA-YOLO:改进YOLOv8的SAR舰船实例分割[J].计算机工程与应用,2024,60(18):103-113.
9祁居攀.几类新数列求和问题解法剖析[J].高中数学教与学,2024(10):18-20.
10李紫娟,常光耀,贾永娜.基于循环特征推理的大间距缺失地震数据重建方法[J].煤田地质与勘探,2024,52(9):176-183.

计算机应用

2024年第9期

浏览历史

内容加载中请稍等...

基于双重注意力机制的人群计数方法

参考文献2

二级参考文献18

共引文献43

相关作者

相关机构

相关主题

浏览历史