期刊文献+

基于深度卷积神经网络的红外图像行人检测 被引量:10

Human detection in infrared image based on deep convolution neural network
下载PDF
导出
摘要 红外图像中的行人检测一直是计算机视觉领域的研究热点与难点。针对传统的红外行人检测方法需要人工设计目标表达特征的弊端,本文从深度学习的角度出发,提出一种可以自动构建目标表达特征的红外行人检测卷积神经网络。在对卷积神经网络的实现原理进行分析的基础上,设计了红外行人检测卷积神经网络的初始结构,然后通过实验对初始结构进行调整,得到最终的检测神经网络。对实拍红外人体数据库进行行人检测的实验结果表明,该方法在保持低虚警率的同时可以对红外图像中的行人进行稳健检测,优于传统方法。 Pedestrian detection in infrared images has been a hot and difficult research topic in computer version.Traditional methods of pedestrian detection mainly depend on the manual feature for the expression of human body and the results largely relies on the feature representation.Designing artificial features is time-consuming and labor intensive,requires heuristic expertise and experience.Deep learning model based on convolution neural network can automatically learn feature representation from the original images,while avoiding the drawbacks of artificial features.Its difficulty is the choice of network parameters.In this paper,we propose to use deep learning method based on convolution neural network in the process of pedestrian detection.In addition,we analyze the impact of network layers,convolution kernel sizes and feature maps to pedestrian detection in infrared images are.The results demonstrate the superiority of our method over traditional methods in detection rate and alarm rate.
作者 单巍 王江涛 陈得宝 李素文 SHAN Wei;WANG Jiang-tao;CHEN De-bao;LI Su-wen(School of Physics and Electronic Information,Huaibei Normal University,Huaibei 235000,China)
出处 《激光与红外》 CAS CSCD 北大核心 2020年第5期634-640,共7页 Laser & Infrared
基金 国家自然科学基金项目(No.61572224) 安徽省自然科学研究项目(No.KJ2018B10)资助。
关键词 行人检测 深度学习 卷积神经网络 红外图像 pedestrian detection deep learning convolution neural network infrared image
  • 相关文献

参考文献8

二级参考文献137

  • 1郏东耀,丁天怀.皮棉杂质透射检测及成像目标增强方法[J].物理学报,2005,54(9):4058-4064. 被引量:7
  • 2胡坤,周富强,张广军.一种快速结构光条纹中心亚像素精度提取方法[J].仪器仪表学报,2006,27(10):1326-1329. 被引量:83
  • 3高建坡,王煜坚,杨浩,吴镇扬.一种基于KL变换的椭圆模型肤色检测方法[J].电子与信息学报,2007,29(7):1739-1743. 被引量:15
  • 4余伶俐,蔡自兴,陈明义.语音信号的情感特征分析与识别研究综述[J].电路与系统学报,2007,12(4):76-84. 被引量:27
  • 5董士海,王横.人机交互.北京:北京大学出版社,2003.
  • 6Dahland G E, Yu Dong, Deng u, Acero A. Context?dependent pre- trained deep neural networks for large?vocabulary speech recognition. IEEE Transactions on Audio, Speech & Language Processing, 2012, 200): 30-42.
  • 7Federico M, Bertoldi N, Cettolo M. Irstlm , An open source toolkit for handling large scale language models/ /Proceedings of the Annual Conference of the International Speech Communication Association (Interopeech), Brisbane, Australia, 2008: 1618-1621.
  • 8Mohri M, Pereira F, Riley M. Weighted finite-state trans?ducers in speech recognition. Computer Speech &. Language, 2002, 16(1): 69-88.
  • 9Senior A, Lei Xin. Fine context, low-rank, softplus deep neural networks for mobile speech recognition/ /Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal ProcessingCICASSP). Florence, Italy, 2014.
  • 10Zen Hei-Ga, Tokuda K, Black A W. Statistical parametric speech synthesis. Speech Communication, 2009, 51(11): 1039-1064.

共引文献256

同被引文献84

引证文献10

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部