
Cascaded probability state-restricted Boltzmann machine for face detection
Cited by: 11
Abstract Objective To detect faces quickly and accurately under non-ideal conditions, a detection method based on a cascaded neural network of multi-layer probability-state restricted Boltzmann machines (RBMs) is proposed. Method The method uses the probability state of RBM neurons to model the continuously distributed activation states of neurons in the human brain, cascades multiple P-RBMs (probability-state RBMs) to emulate the brain's hierarchical learning of visual input, reduces the number of hidden-layer neurons layer by layer to control network size, and uses layer-wise training followed by global optimization to ease the trade-off between robustness and accuracy. Result On the LFW, FERET, PKU-SVD-B, and CAS-PEAL datasets, the method outperforms existing typical algorithms. For single-face detection, it lowers the missed detection rate by 2.92% relative to Adaboost. For multi-face detection, relative to Adaboost combined with skin color, it lowers the false detection rate by 14.9%, the missed detection rate by 5.0%, and the detection time by 50%. Conclusion For both static single faces and multiple faces in video under complex conditions, the method achieves lower false and missed detection rates, runs faster, and is robust to rotated faces. For skin-color-based multi-face detection, it significantly reduces the false detection rate.

Objective Face detection has long been an active research subject in computer vision and pattern recognition, and it is a component of pattern recognition, artificial intelligence, information security, and many other disciplines. As video network coverage has expanded in recent years, face detection has been increasingly used in video surveillance. However, many factors complicate face detection, such as complex environments, multiple faces, and face rotation angles. To address these sources of interference under non-ideal conditions, this study proposes a cascaded neural network based on a multi-layer probability state-restricted Boltzmann machine (P-RBM) to detect faces accurately and rapidly.

Method The neurons of an RBM have only two states, activated and nonactivated. This binary state mode suppresses the interference that insufficiently active information introduces into the learning result, but by masking relatively weak information it also makes the learning network more likely to fall into a local optimum. To resolve this contradiction, the proposed method uses the probability state of the neurons in the RBM as their degree of activation, which better models the continuously distributed activity of neurons in the human brain. Using the probability state not only retains weakly active information but also reduces the effect of miscalculation in the preceding layer.
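The difference between the binary state and the probability state can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function names, the toy weights, and the use of the logistic function for the hidden-unit probability are assumptions based on the standard RBM formulation.

```python
import math
import random


def sigmoid(x):
    """Logistic function mapping a real input to the interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def hidden_probabilities(visible, weights, hidden_bias):
    """Return p(h_j = 1 | v) for every hidden unit j.

    weights[i][j] connects visible unit i to hidden unit j.
    """
    probs = []
    for j in range(len(hidden_bias)):
        total = hidden_bias[j]
        for i, v in enumerate(visible):
            total += v * weights[i][j]
        probs.append(sigmoid(total))
    return probs


def binary_states(probs, rng):
    """Conventional RBM output: each unit is sampled as 0 or 1."""
    return [1 if rng.random() < p else 0 for p in probs]


def probability_states(probs):
    """Probability-state output: the probabilities are passed on unchanged,
    so weak activations are kept instead of being rounded away."""
    return list(probs)


# Toy example (hypothetical numbers): three visible units, two hidden units.
visible = [1.0, 0.0, 1.0]
weights = [[0.5, -1.0], [0.3, 0.8], [-0.5, 0.2]]
hidden_bias = [0.0, 0.0]

probs = hidden_probabilities(visible, weights, hidden_bias)
print("probability states:", probability_states(probs))
print("binary states:", binary_states(probs, random.Random(0)))
```

In the binary case a weakly activated unit is usually sampled as 0 and its information is lost to the next layer, whereas the probability state forwards the graded value.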
At the same time, the method simulates the hierarchical learning mode of the human brain by cascading multiple P-RBMs. The cascaded network performs multi-layer nonlinear mapping and obtains semantic features of the input data by extracting features at separate levels. It can also learn relationships hidden within the data, which makes the learned features more generalizable and expressive. The number of hidden-layer neurons decreases layer by layer to control the scale of the network and enhance robustness. Finally, the method uses layered training and overall optimization to balance robustness and accuracy. Greedy layer-wise learning is used in the layered training to keep training error from propagating between layers, which mitigates the tendency of multi-layer networks to fall into local optima. In addition, a preprocessing layer detects skin-color areas to reduce the number of neurons in the detection network and speed up detection.

Result In single-face detection tests on the LFW and FERET databases, the proposed method achieves nearly fully accurate detection. In video face detection tests on the PKU-SVD-B database, both the missed detection rate and the false detection rate of the proposed method are lower than those of state-of-the-art methods such as Adaboost and Adaboost combined with skin color detection, and detection is faster. Moreover, tests on the CAS-PEAL database show that the proposed method performs well on faces with large rotations.

Conclusion Experimental results show that, for both static single-face detection and video multi-face detection under complicated conditions, the proposed method offers faster detection and robustness to face rotation, as well as lower false detection and missed detection rates.
For multi-face detection based on skin color, the method significantly reduces the false detection rate.
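The cascade with shrinking hidden layers and greedy layer-wise training described in the abstract can be sketched as follows. This is a toy illustration under stated assumptions: the class `TinyRBM`, the layer sizes, the learning rate, and the mean-field form of one-step contrastive divergence (probabilities used throughout, no sampling) are all hypothetical choices, and the overall optimization stage and the skin-color preprocessing layer are omitted.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class TinyRBM:
    """Hypothetical minimal RBM whose units carry probabilities, not sampled bits."""

    def __init__(self, n_visible, n_hidden, rng):
        self.w = [[rng.gauss(0.0, 0.1) for _ in range(n_hidden)]
                  for _ in range(n_visible)]
        self.b_vis = [0.0] * n_visible
        self.b_hid = [0.0] * n_hidden

    def up(self, v):
        """p(h = 1 | v) for each hidden unit."""
        return [sigmoid(self.b_hid[j] + sum(v[i] * self.w[i][j] for i in range(len(v))))
                for j in range(len(self.b_hid))]

    def down(self, h):
        """p(v = 1 | h) for each visible unit."""
        return [sigmoid(self.b_vis[i] + sum(h[j] * self.w[i][j] for j in range(len(h))))
                for i in range(len(self.b_vis))]

    def train(self, data, epochs=20, lr=0.1):
        """One-step contrastive divergence, simplified to use probabilities only."""
        for _ in range(epochs):
            for v0 in data:
                h0 = self.up(v0)   # positive phase
                v1 = self.down(h0)  # reconstruction
                h1 = self.up(v1)   # negative phase
                for i in range(len(v0)):
                    for j in range(len(h0)):
                        self.w[i][j] += lr * (v0[i] * h0[j] - v1[i] * h1[j])
                    self.b_vis[i] += lr * (v0[i] - v1[i])
                for j in range(len(h0)):
                    self.b_hid[j] += lr * (h0[j] - h1[j])


def train_cascade(data, layer_sizes, rng):
    """Greedy layer-wise training: each RBM is trained on the output of the one below."""
    layers = []
    inputs = data
    for n_visible, n_hidden in zip(layer_sizes, layer_sizes[1:]):
        rbm = TinyRBM(n_visible, n_hidden, rng)
        rbm.train(inputs)
        inputs = [rbm.up(v) for v in inputs]  # probability states feed the next layer
        layers.append(rbm)
    return layers


def encode(layers, v):
    """Propagate one input through the whole cascade."""
    for rbm in layers:
        v = rbm.up(v)
    return v


# Toy data (hypothetical): two 6-pixel patterns; hidden sizes shrink 6 -> 4 -> 2.
patterns = [[1, 1, 0, 0, 0, 0], [0, 0, 0, 0, 1, 1]]
cascade = train_cascade(patterns, [6, 4, 2], random.Random(0))
print(encode(cascade, patterns[0]))
```

Each layer is trained in isolation before the next one sees its output, which is the sense in which training error is kept from propagating between layers.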
Source: Journal of Image and Graphics (《中国图象图形学报》), indexed in CSCD and the Peking University Core Journals list, 2016, No. 7, pp. 875-885 (11 pages)
Funding: National Natural Science Foundation of China (Grants 60802047, 60702018)
Keywords: face detection; restricted Boltzmann machine (RBM); probability state-restricted Boltzmann machine (P-RBM); neural network
