基于动态多任务平衡方法的行人属性识别深度学习网络

Deep Learning Network for Pedestrian Attribute Recognition Based on Dynamic Multi-Task Balancing

下载PDF

导出

摘要深度学习网络是计算机视觉和人工智能系统的研究热点之一,行人属性识别提供了结构化的行人特征,为安防计算机视觉识别中行人检索提供了重要的信息.基于深度学习网络,提出了一种端到端的多属性识别方法,在R*CNN的基础上设计了一个端到端的行人属性识别网络,使用候选区域提取网络代替Selective Search提取第二重要的区域,建立属性识别与辅助区域提取一体化的网络,提升局部及细节属性识别的准确率;其次,为增加辅助区域的作用,将人体感兴趣区域按比例划分为整体、头、肩膀到腰及腰到脚4个部分,每个部分对应了不同属性,在任务分支层分出4个分支,使用主要区域预测对应属性的同时,分别从RPN中学习到对应的第二重要区域辅助预测;最后,提出了基于损失梯度的损失权值自动更新方法,即权重与损失的梯度逆相关,防止某个任务训练的过快或过慢.通过在行人属性数据库进行实验,整体提升了属性预测的准确率,大大缩短了识别时间. Person attribute recognition extracts structured feature of person,which plays a vital role in intelligent video surveillance,such as person re-identification.Firstly,based on R*CNN,we design an end-to-end multi-attribute recognition method based on deep learning network.The region proposal network(RPN)rather than selective search is employed to extract auxiliary regions.An unified network for auxiliary region extraction and attribute recognition is constructed to improve locally attributes.Secondly,in order to enhance the effects of auxiliary region,we split the body ROI into four regions proportionately,such as whole body,head,torso and leg.Each region is in charge of different attributes.And the network splits into four branches at the prediction stage.The primary regions and the second important auxiliary regions are exploited to predict attributes simultaneously.At last,the dynamic adapting loss weighting has the ability to balance the contribution of every task and achieve an optimum performance.That is,the loss weights are inversely correlated with the gradient of loss function,which is to avoiding a certain task is training too fast or too slow.The comparison experiments are elaborated on the Berkeley Attributes of People dataset,an optimum mean average precision(mAP)more than 92%is obtained when compared with state-of-the-art methods.

作者孙志勇叶俊勇汪同庆雷莉连捷李阳 Sun Zhiyong;Ye Junyong;Wang Tongqing;Lei Li;Lian Jie;Li Yang(Key Laboratory of Optoelectronic Technology and Systems of the Ministry of Education,Chongqing University,Chongqing 400044;School of Electronic Information Engineering,Yangtze Normal University,Chongqing 408100;Beijing Dilusense Technology Co,Ltd,Beijing 100083;Nanjing Pioneer Awareness Information Technology Co.,Ltd,Nanjing 210019)

机构地区重庆大学光电技术教育部重点实验室长江师范学院电子信息工程学院北京的卢深视科技有限公司南京派光智慧感知信息技术有限公司

出处《计算机辅助设计与图形学学报》 EI CSCD 北大核心 2019年第12期2144-2151,共8页 Journal of Computer-Aided Design & Computer Graphics

基金中央高校基本科研业务费专项资金资助(2018CDXYGD0017) 重庆市基础研究与前沿探索专项(cstc2018jcyj AX0633)

关键词深度学习属性识别动态多任务损失函数 deep learning attribute recognition multi-task learning loss function

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献1

1余永维,殷国富,殷鹰,杜柳青.基于深度学习网络的射线图像缺陷识别方法[J].仪器仪表学报,2014,35(9):2012-2019. 被引量：75

二级参考文献18

1LIM T Y,RATNAM M M,KHALID M A.Automatic classification of weld defects using simulated data and an MLP neural network[J].Insight,2007,49 (3):154-159.
2VILAR R,ZAPATA J,RUIZ R.An automatic system of classification of weld defects in radiographic images[J].NDT and E International,2009,42(5):467-476.
3ZAPATA J,VILAR R,RUIZ R.An adaptive-networkbased fuzzy inference system for classification of welding defects[J].NDT & E International,2010,43 (3):191-199.
4ZAPATA J,VILAR R,RUIZ R.Performance evaluation of an automatic inspection system of weld defects in radiographic images based on neuroclassifiers[J].Expert Systems with Applications,2011,38 (7):8812-8824.
5MIRAPEIX J,GARCíA-ALLENDE P B,COBO A,et al.Real-time arc-welding defect detection and classification with principal component analysis and artificial neural networks[J].NDT & E International,2007,40 (4):315-323.
6ALAKNANDA,ANAND R S,KUMAR P,et al.Flaw detection in radiographic weldment images using morpho logical watershed segmentation technique[J].NDT&E International,2009,42(1):2-8.
7VINCENT P,LAROCHELLE H,LAJOIE I,et al.Stacked denoising autoencoders:learning useful representations in a deep network with a local denoising criterion[J].Journal of Machine Learning Research,2010,11 (12):3371-3408.
8BENGIO Y.Learning deep architectures for AI[J].Foundations and Trends in Machine Learning,2009,2 (1):1-127.
9申清明,高建民,李成.焊缝缺陷类型识别方法的研究[J].西安交通大学学报,2010,44(7):100-103. 被引量：18
10吴一全,尹丹艳,吴诗婳.基于NSCT、KFCM和多模型LS-SVM的红外小目标检测[J].仪器仪表学报,2011,32(8):1704-1709. 被引量：7

共引文献74

1吕枫,王义,阮胡林,秦毅,王平.深度嵌入关系空间下齿轮箱标记样本扩充及其半监督故障诊断方法[J].仪器仪表学报,2021,42(2):55-65. 被引量：12
2徐啸顺,任建,林立,高雪婷.基于深度学习机器视觉对于动力总成制造防错应用的研究[J].传动技术,2020,0(1):3-10. 被引量：3
3敦宏丽,袁晔.基于卷积神经网络的3D服用人体特征识别[J].北京服装学院学报（自然科学版）,2018,38(3):54-61. 被引量：3
4高强,阳武,李倩.DBN层次趋势研究及其在航拍图像故障识别中的应用[J].仪器仪表学报,2015,36(6):1267-1274. 被引量：16
5赵凯旋,何东健.基于卷积神经网络的奶牛个体身份识别方法[J].农业工程学报,2015,31(5):181-187. 被引量：97
6谢旻旻.无人机降落地点的智能障碍图像识别方法仿真[J].计算机仿真,2015,32(7):84-87. 被引量：4
7高强,靳其兵,程勇.基于卷积神经网络探讨深度学习算法与应用[J].电脑知识与技术,2015,0(5):169-170. 被引量：11
8冯通.基于深度学习的航空飞行器故障自助检测研究[J].计算机仿真,2015,32(11):119-122. 被引量：7
9白丰,张明路,张小俊,孙凌宇.快速优化筛选多尺度矩形域的二进制描述[J].中国图象图形学报,2016,21(3):303-313. 被引量：1
10傅天驹,郑嫦娥,田野,丘启敏,林斯俊.复杂背景下基于深度卷积神经网络的森林火灾识别[J].计算机与现代化,2016(3):52-57. 被引量：33

1张程.网络生态中混乱与黑暗的一面[J].检察风云,2019,0(22):10-12.
2刘学刚,孟晖.无泄漏泵轴向力产生的原因及几种通用解决办法[J].通用机械,2019,0(11):24-25. 被引量：2
3王欣.综合护理干预方法对视网膜脱离患者术前焦虑抑郁状态的干预效果[J].医学信息,2019,32(S2):351-351.
4褚明.人工智能时代下对会计行业的思考[J].中国集体经济,2020,0(1):137-138. 被引量：10
5严欣.创设乐学情境在眼科临床护理教学中的应用效果评价[J].实用临床护理学电子杂志,2019,4(36):192-192. 被引量：3
6张立霞.统编小学语文六年级上册教科书编排思路与教学建议[J].小学语文,2019,0(9):24-30. 被引量：2
7赵立.浅析井下掘进巷道施工技术[J].中国石油和化工标准与质量,2019,39(19):140-140.
8无.美德、规则与实践智慧[J].中国哲学年鉴,2016(1):336-337.
9岳昊,武栓虎,徐金东,郑强,殷茹.基于机器视觉的医用瓶盖质检系统设计[J].仪表技术与传感器,2019(10):83-87. 被引量：6
10王素.必维集团：聚焦安全，支持中国高质量发展[J].进出口经理人,2019,0(10):61-61.

计算机辅助设计与图形学学报

2019年第12期

浏览历史

内容加载中请稍等...

基于动态多任务平衡方法的行人属性识别深度学习网络

参考文献1

二级参考文献18

共引文献74

相关作者

相关机构

相关主题

浏览历史