级联型P-RBM神经网络的人脸检测被引量：11

Cascaded probability state-restricted Boltzmann machine for face detection

导出

摘要目的针对非理想条件下快速准确的人脸检测问题,提出一种基于概率态多层受限玻尔兹曼机(RBM)级联神经网络的检测方法。方法它采用RBM中神经元的概率态表征来模拟人脑神经元连续分布的激活状态,并且利用多层P-RBM(概率态RBM)级联来仿真人脑对视觉的层次学习模式,又以逐层递减隐藏层神经元数来控制网络规模,最后采用分层训练和整体优化的机制来缓解鲁棒性和准确性的矛盾。结果在LFW、FERET、PKUSVD-B以及CAS-PEAL数据集上的测试都实现了优于现有典型算法的检测性能。对于单人脸检测,相比于Adaboost算法,将漏检率降低了2.92%;对于多人脸检测,相比于结合肤色的Adaboost算法,将误检率降低了14.9%,同时漏检率降低了5.0%,检测时间降低了50%。结论无论是静态单张人脸,还是复杂条件下视频多人脸检测,该方法不仅在误检率和漏检率上表现更好,而且具有较快的检测速度,同时对于旋转人脸检测具有较强的鲁棒性。针对基于肤色的多人脸检测研究,该方法能显著降低误检率。 Objective Face detection is constantly an active research subject in computer vision and pattern recognition. Face detection is also a constituent part of pattern recognition, artificial intelligence, information security, and many other disciplines. With video network coverage widely increasing in recent years, face detection has been increasingly used in the field of video surveillance. However, many factors require consideration in face detection, such as the complex environ- ments, multiple faces, and face rotation angles. In view of these interference problems in nonideal condition, a cascaded neuron network based on a multi-layer probability state-restricted Bohzmann machine （P-RBM） is proposed in this study to overcome the challenge of accurately and rapidly detecting faces. Method The neurons of RBM only have two states, name- ly, activated and nonactivated; this state mode can inhibit the interference in the learning result induced by the inadequate active information, while it simultaneously increases the likelihood that the learning network falls into a local optimum caused by the shielding of relatively weak information. To solve this contradiction, the proposed method uses the probability state of neurons in RBM as their activation degree, which better models the activity state＇ s continuous distribution of the neurons in the human brain. Using the probability state not only retains the weak active information but further decreases the effect caused by the former layer＇ s miscalculation. Simultaneously, this method simulates the hierarchical learning mode in the human brain by cascading multiple P-RBMs. This cascaded network can achieve multi-layer nonlinear mapping and obtain the semantic feature of the input date by extracting the input data＇ s separate level features. Furthermore, this cascaded network can learn the relationship hiding within the data to make the learned features be more promotional and ex- pressive. Simultaneously, the number of the hidden layer＇ s neurons decreases layer-by-layer to control the network＇ s scale and enhance the robustness. Finally, the proposed method uses the layered training and the entire optimization to balance robustness and accuracy. The greedy layer-wise learning is used in the layered training to avoid the training error transfer- ring in layers, thereby solving the problem of the multi-layer network easily falling into the local optimum. Furthermore, a preprocessing layer is used to detect the skin color area to reduce the number of neurons in the detection network and speed up the detection speed. Result Testing the single face detection performance in the LFW and FERET, the proposed method nearly achieves entirely accurate detection. Testing the video face detection in the PKU-SVD-B database, the missing de- tection rate and the false detection rate of the proposed method are all lower than that of the state-of-the-art methods, such as Adaboost and Adaboost combined with skin color detection, and its detection speed is faster. Moreover, the proposed method has a good detection performance for the face with a large rotation, which is tested in the CAS-PEAL database. Conclusion Experimental results show that regardless of whether a static single face or video multi-face detection occurs under complicated conditions, apart from the faster detection speed and robustness against face rotation, the proposed method possesses lower false detection rate and lower missing detection rate. Aiming at the multi-face detection based on skin color, this method can significantly reduce the false detection rate.

作者叶学义陈雪婷陈华华顾亚风吕秋云

机构地区杭州电子科技大学模式识别与信息安全实验室

出处《中国图象图形学报》 CSCD 北大核心 2016年第7期875-885,共11页 Journal of Image and Graphics

基金国家自然科学基金项目(60802047 60702018)~~

关键词人脸检测受限玻尔兹曼机(RBM) 概率态受限玻尔兹曼机(P-RBM) 神经网络 face detection restricted Boltzmann machine （RBM） probability state-restricted Bohzmann machine （ P- RBM） neural network

分类号 TP391.4 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献23

1Yang G Z, Huang T S. Human face detection in a complex back-ground[J]. Pattern Recognition, 1994, 27(1 ): 53-63.
2梁路宏,艾海舟,何克忠,张钹.基于多关联模板匹配的人脸检测[J].软件学报,2001,12(1):94-102. 被引量：47
3Dai Y, Nakano Y. Recognition of facial images with low resolu- tion using a Hopfield memory model [ J ]. Pattern Recognition, 1998, 31 (2) : 159-167.
4Lin S H, Kung S Y, Lin L J. Face recognition/detection by probabilistic decision-based neural network [ J ]. IEEE Transac- tions on Neural Networks, 1997, 8 ( 1 ) : 114-132.
5余凯,贾磊,陈雨强,徐伟.深度学习的昨天、今天和明天[J].计算机研究与发展,2013,50(9):1799-1804. 被引量：590
6郑胤,陈权崎,章毓晋.深度学习及其在目标和行为识别中的新进展[J].中国图象图形学报,2014,19(2):175-184. 被引量：144
7Jaitly N, Nguyen P, Senior A, et al. Application of pretrained deep neural networks to large vocabulary speech recognition[ R ]. Toronto : University of Toronto, 2012.
8Hayat M, Bennamoun M, An S. Deep reconstruction models for image set classification[ J ]. IEEE Transactions on Pattern Analy- sis and Machine Intelligence, 2015, 37 (4) : 713-727.
9Sarikaya R, Hinton G E, Deoras A. Application of deep belief networks for natural language understanding [ J ]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2014, 22 (4) : 778-784.
10Smolcnsky P. Neural and conceptual interpretations of parallel distributed proeessing models[ M]//McClelland J L, Rumelhart D E, the PDP Research Group. Parallel Distributed Processing: Explorations in the Mierostrueture of Cognition. Volume 2 : Psy- chological and Biological Models. Cambridge, MA, USA : MIT Press, 1986.

二级参考文献110

1何光宏,潘英俊,吴芳.基于肤色特征和动态聚类的彩色人脸检测[J].光电工程,2004,31(11):47-50. 被引量：4
2刘玉颖,窦硕星,王鹏业,谢平,王渭池.应用分子梳技术对DNA与组蛋白相互作用的研究[J].物理学报,2005,54(2):622-627. 被引量：12
3乔晓艳,李刚,贺秉军,林凌.弱激光对神经细胞膜延迟整流钾通道电流特性的影响[J].中国激光,2006,33(9):1288-1293. 被引量：5
4孙宁,邹采荣,赵力.人脸检测综述[J].电路与系统学报,2006,11(6):101-108. 被引量：39
5Hyeon Bae, Sungshin Kim. Real-time face detection and recognition using hybrid-information extracted from face space and facial features [J]. Image and Vision Computing(S0262-8856), 2005, 23(13): 1181-1191.
6Phimoltares S, Lursinsap C, Chamnongthai K. Face detection and facial feature localization without considering the appearance of image context [J]. Image and Vision Computing(S0262-8856), 2007, 25(5): 741-753.
7Hsiuao-Ying Chen, Chung-Lin Huang, Chih-Ming Fu. Hybrid-boost learning for multi-pose face detection and facial expression recognition [J]. Pattern Recognition(S0031-3203), 2008, 41(3): 1173-1185.
8Tat-SengChua, Yunlong Zhao, Mohan S Kankanhalli. Detection of human Faces in a compressed domain for video stratification [J]. The VisualComputer(S0178-2789), 2002, 18(2): 121-133.
9Paul Voila, Michael Jones. Robust real-time face detection [J]. International Journal of Computer Vision(S 1573-1405), 2004, 57(2): 137-154.
10Vezhnevets V,Sazonov V,and Andreeva A.A survey on pixel-based skin color detection techniques.In Graphicon,Moscow,Russia,2003:85-92.

共引文献799

1贾彦哲.论人工智能研发者过失犯的注意义务[J].华中师范大学研究生学报,2020(2):40-46.
2谈咏东,王永雄,陈姝意,缪银龙.(2+1)D多时空信息融合模型及在行为识别的应用[J].信息与控制,2019,48(6):715-722. 被引量：3
3毕思文,Henri Jaffrès,Chandra Sekhar Roychoudhuri.量子遥感发展新态势——世界首次量子遥感国际会议评述[J].全球变化数据学报（中英文）,2019,3(4):317-325. 被引量：1
4张常泉.基于深度学习的智能视频图像分析研究[J].计算机产品与流通,2019,0(12):177-177.
5范敏,胥小波,聂小明.基于字符级扩张卷积网络的Web攻击检测方法[J].计算机应用研究,2020,37(S02):234-237. 被引量：4
6孟威,尉永清,刘文锋.基于CRT机制混合神经网络的特定目标情感分析[J].计算机应用研究,2020,37(2):360-364. 被引量：2
7华夏,王新晴,马昭烨,王东,邵发明.基于递归神经网络的视频多目标检测技术[J].计算机应用研究,2020,37(2):615-620. 被引量：7
8刘树霄,衣立,张苏平,时晓曚,薛允传.基于全卷积神经网络方法的日间黄海海雾卫星反演研究[J].海洋湖沼通报,2019(6):13-22. 被引量：10
9王海涛.自主无人系统——概念、体系架构和设计要素[J].电信快报,2021(5):6-9.
10郭龙银,扎西多吉,尚慧杰,旦增.基于LSTM的藏语语音识别[J].电脑知识与技术,2020,0(4):154-155. 被引量：2

同被引文献73

1周敬利,吴桂林,余胜生.基于BP神经网络的人脸检测算法[J].计算机工程,2004,30(11):34-36. 被引量：20
2潘志庚,邹鹏程,梁荣华.基于特征人脸和肤色统计的人脸检测[J].系统仿真学报,2004,16(6):1346-1349. 被引量：14
3刘向东,陈兆乾.基于支持向量机方法的人脸识别研究[J].小型微型计算机系统,2004,25(12):2261-2263. 被引量：6
4洪子泉,杨静宇.基于奇异值特征和统计模型的人像识别算法[J].计算机研究与发展,1994,31(3):60-65. 被引量：49
5李杰,郝晓莉.一种基于椭圆肤色模型的人脸检测方法[J].计算机测量与控制,2006,14(2):170-171. 被引量：12
6文学志,方巍,郑钰辉.一种基于类Haar特征和改进AdaBoost分类器的车辆识别算法[J].电子学报,2011,39(5):1121-1126. 被引量：86
7黄艳国,赵书玲,许伦辉.基于纹理特征和颜色匹配的车牌定位方法[J].微电子学与计算机,2011,28(9):123-126. 被引量：19
8余龙华,王宏,钟洪声.基于隐马尔科夫模型的人脸识别[J].计算机技术与发展,2012,22(2):25-28. 被引量：15
9王智文,蔡启先,陈劲飙,王乃嵩.利用肤色分割和自适应模版匹配的人脸检测[J].广西工学院学报,2013,24(1):1-8. 被引量：10
10贾伟,王正勇,张杰,李伟.一种基于改进的CS-LBP算子纹理图像自适应检索方法[J].微电子学与计算机,2013,30(9):75-78. 被引量：3

引证文献11

1张海涛,李美霖,董帅含.两层级联卷积神经网络的人脸检测[J].中国图象图形学报,2019,24(2):203-214. 被引量：15
2孙雅琪,邹祎,赵辉煌.基于Java的人脸检测系统设计与开发[J].信息系统工程,2018,0(3):34-35. 被引量：1
3刘小芳,魏伟波,谭璐.智能录播系统中站立人脸检测定位的实现[J].青岛大学学报（自然科学版）,2018,31(1):85-91.
4崔凯,才华,陈广秋,谷欣超,孙俊喜.基于多纹理CS-LBP特征的多视角人脸检测算法[J].吉林大学学报（理学版）,2018,56(3):610-616. 被引量：1
5朱善玮,李玉惠.基于Haar-like和AdaBoost的车脸检测[J].电子科技,2018,31(8):66-68. 被引量：4
6蒋阿娟,张文娟.人脸识别综述[J].电脑知识与技术,2019,15(1Z):173-174. 被引量：8
7余飞,甘俊英,张雨晨,曾军英.多级联卷积神经网络人脸检测[J].五邑大学学报（自然科学版）,2018,32(3):49-56.
8衣柳成,魏伟波,刘小芳.基于GoogLeNet的智能录播系统中站立人脸的检测与定位[J].青岛大学学报（自然科学版）,2019,32(4):91-95. 被引量：3
9周涛,陆惠玲,霍兵强.深度信念网络研究进展[J].计算机工程与应用,2020,56(9):24-32. 被引量：9
10孙灏.基于深度学习的人脸识别系统在智慧农业领域的应用研究[J].智慧农业导刊,2021,1(2):36-39. 被引量：1

二级引证文献51

1欧琪,王剑雄,孙歌,李宗阳,李晨昊.基于OpenCV的距离估计的数据可视化研究[J].河北建筑工程学院学报,2022,40(4):180-184. 被引量：1
2马垠飞,王力.融合D-S证据理论的DBN电路故障诊断算法[J].辽宁工程技术大学学报（自然科学版）,2021,40(5):448-453. 被引量：2
3郑平平.浅谈人与超级计算机的区别[J].山东青年,2019,0(11):189-190.
4范少地,许建中,唐康来,李起鸿.缓慢牵伸肢体延长周围神经亚临床损害修复过程的观察[J].第三军医大学学报,2000,22(5):470-473. 被引量：6
5胡石,王彬,吴志光.基于多通信机制与机器视觉的智慧小区视频监控系统[J].井冈山大学学报（自然科学版）,2019,40(2):52-57.
6干书祥.基于深度学习的靶蛋白药物重定位研究[J].信息与电脑,2019,31(13):37-41.
7蔡凤翔,龚仁彬,李群,柴永财.基于物联网技术的油气生产管理系统的设计与实现[J].信息系统工程,2019,0(11):34-35. 被引量：6
8汪浩,吴云树.融合神经网络与瞬时自相关分区特征的自动调制分类方法研究[J].国外电子测量技术,2019,38(11):52-56. 被引量：4
9何松华,章阳.基于快速检测和AdaBoost的车辆检测[J].计算机工程与设计,2020,41(1):203-207. 被引量：6
10何伟鑫,邓建球,刘爱东,丛林虎.MTCNN和RESNET的人脸识别弹库门禁系统研究[J].单片机与嵌入式系统应用,2020,20(4):51-54. 被引量：2

1吕运君.基于VHDL的级联型IIR数字滤波器的设计[J].硅谷,2011,4(14):195-195.
2张登奇,周婷,李斌.基于MATLAB的数字滤波器结构实现与仿真[J].湖南理工学院学报（自然科学版）,2008,21(3):19-22. 被引量：3
3胡慧,刘国荣.模糊神经网络控制及其学习方法的研究[J].湖南工程学院学报（自然科学版）,2006,16(3):1-4. 被引量：5
4盛积德,张延炘,常胜江,陈戍.前馈型神经网络中隐藏层神经元的研究[J].光电子．激光,2001,12(6):620-622. 被引量：3
5田雪.基于级联型组合分类器的人脸识别研究[J].嘉兴学院学报,2005,17(6):64-67.
6孙丹,秦贵和,董劲男,陈虹.基于最小二乘支持向量机的网络控制系统建模[J].吉林大学学报（理学版）,2014,52(6):1277-1283. 被引量：1
7姜洋,罗贵明.Petri网模型的扩展与检测[J].计算机应用,2007,27(1):183-185.
8无忧.查询Windows XP的激活状态[J].计算机应用文摘,2004(15).
9兔子.借助手机来快乐上网[J].电脑爱好者（普及版）,2007,0(12):75-75.
1017期E博士悬赏问题答案揭晓[J].网友世界,2009(18):79-79.

中国图象图形学报

2016年第7期

浏览历史

内容加载中请稍等...

级联型P-RBM神经网络的人脸检测被引量：11

参考文献23

二级参考文献110

共引文献799

同被引文献73

引证文献11

二级引证文献51

相关作者

相关机构

相关主题

浏览历史

级联型P-RBM神经网络的人脸检测 被引量：11

参考文献23

二级参考文献110

共引文献799

同被引文献73

引证文献11

二级引证文献51

相关作者

相关机构

相关主题

浏览历史

级联型P-RBM神经网络的人脸检测被引量：11