Abstract
Pedestrian attribute recognition is an important problem in video surveillance and video forensics. Traditional methods assume the pedestrian attributes are independent and design handcrafted features for each one. In this paper, we propose a joint hierarchical multi-task learning algorithm that learns the relationships among attributes to better recognize pedestrian attributes in still images using convolutional neural networks (CNNs). We divide the attributes into local and global ones according to their spatial and semantic relations, and then learn semantic attributes through a hierarchical multi-task CNN model in which each CNN in the first layer predicts one group of local attributes and the CNN in the second layer predicts the global attributes. Our multi-task learning framework allows each CNN model to simultaneously share visual knowledge among different groups of attribute categories. Extensive experiments are conducted on two popular and challenging surveillance benchmarks, the PETA and RAP pedestrian attribute datasets. On both benchmarks, our framework outperforms state-of-the-art methods, achieving 88.2% on PETA and 83.25% on RAP.
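The two-layer structure described in the abstract can be sketched as follows. This is a minimal illustration, not the paper's implementation: the feature dimension, the number of attribute groups, and the linear "heads" standing in for the per-group CNNs are all hypothetical, and the hierarchy is reduced to its data flow (per-group local predictions feeding a global predictor).

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions (not from the paper): one shared feature vector,
# three local attribute groups, and a set of global attributes.
feat_dim, group_dims, n_global = 128, [4, 6, 5], 3

# First layer: one predictor per local attribute group. Linear maps stand in
# for the per-group CNNs used in the actual model.
group_heads = [rng.standard_normal((feat_dim, d)) * 0.01 for d in group_dims]

# Second layer: predicts the global attributes from the concatenated
# outputs of all local attribute groups.
global_head = rng.standard_normal((sum(group_dims), n_global)) * 0.01

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def predict(features):
    # Local predictions: one score vector per attribute group.
    local = [sigmoid(features @ W) for W in group_heads]
    # Global predictions conditioned on all local group outputs,
    # which is where the inter-attribute relationships are learned.
    global_scores = sigmoid(np.concatenate(local) @ global_head)
    return local, global_scores

local, global_scores = predict(rng.standard_normal(feat_dim))
```

In the paper's framework the first-layer predictors are CNNs trained jointly under a multi-task objective, so the shared layers carry visual knowledge across attribute groups; the sketch above only shows how local group outputs feed the global-attribute predictor.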
Funding
Supported by the National Key R&D Program of China (No. 2017YFC0803700)
National Natural Science Foundation of China (No. U1736206)
National Natural Science Foundation of China (No. 61671336)
National Natural Science Foundation of China (No. 61671332)
Technology Research Program of the Ministry of Public Security (No. 2016JSYJA12)
Hubei Province Technological Innovation Major Project (No. 2016AAA015)
Hubei Province Technological Innovation Major Project (No. 2017AAA123)
National Key Research and Development Program of China (No. 2016YFB0100901)
Natural Science Foundation of Jiangsu Province (No. BK20160386)