Funding: Supported by the National Natural Science Foundation of China (60903126, 60872145), the Doctoral Fund of the Ministry of Education of China (20090203120011), and the Basic Science Research Fund of Xidian University (72105470).
Abstract: Pose manifold and tensor decomposition are used to represent the nonlinear changes of multi-view faces for pose estimation, which cannot be handled well by principal component analysis or multilinear analysis methods. A pose manifold generation method is introduced to describe the nonlinearity in the pose subspace, and a nonlinear kernel-based method is used to build a smooth mapping from the low-dimensional pose subspace to the high-dimensional face image space. Tensor decomposition is then applied to the nonlinear mapping coefficients to build an accurate multi-pose face model for pose estimation. More importantly, the paper gives a proper distance measure on the pose manifold space for the nonlinear mapping and pose estimation. Experiments on face images of unseen identities show that the proposed method increases pose estimation rates by 13.8% and 10.9% over principal component analysis and multilinear analysis based methods, respectively. Thus, the proposed method can be used to estimate a wide range of head poses.
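The kernel-based mapping from a low-dimensional pose subspace to image space can be illustrated with a small sketch. It is not the paper's implementation: it assumes the pose manifold is a unit circle parameterized by yaw, uses a Gaussian radial basis function as the kernel, and leaves out the tensor decomposition step. The names `embed_pose`, `rbf_features`, `fit_mapping`, and `synthesize`, and the values of `gamma` and `ridge`, are hypothetical.

```python
import numpy as np

def embed_pose(yaw_deg):
    """Place yaw angles on a unit circle (an assumed 1-D pose manifold)."""
    t = np.deg2rad(np.atleast_1d(np.asarray(yaw_deg, dtype=float)))
    return np.stack([np.cos(t), np.sin(t)], axis=-1)

def rbf_features(points, centers, gamma=2.0):
    """Gaussian kernel between every embedded pose and every center."""
    diff = points[:, None, :] - centers[None, :, :]
    return np.exp(-gamma * np.sum(diff ** 2, axis=-1))

def fit_mapping(train_yaw, images, gamma=2.0, ridge=1e-8):
    """Solve for mapping coefficients so the kernel expansion reproduces
    the training images at the training poses (ridge keeps it stable)."""
    centers = embed_pose(train_yaw)
    K = rbf_features(centers, centers, gamma)
    coeffs = np.linalg.solve(K + ridge * np.eye(len(K)), images)
    return centers, coeffs

def synthesize(yaw_deg, centers, coeffs, gamma=2.0):
    """Map one or more yaw angles to image vectors via the learned mapping."""
    return rbf_features(embed_pose(yaw_deg), centers, gamma) @ coeffs

# Toy data: five training poses, each with a 16-pixel "image" (random values).
rng = np.random.default_rng(0)
train_yaw = [-90, -45, 0, 45, 90]
images = rng.normal(size=(5, 16))
centers, coeffs = fit_mapping(train_yaw, images)
novel_view = synthesize(20.0, centers, coeffs)  # image vector at an unseen yaw
```

In this sketch `coeffs` plays the role of the nonlinear mapping coefficients. In a multi-identity setting, one such coefficient matrix per identity would be the input to a later decomposition step.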
Funding: Supported by the National Key Technologies R&D Program of China (No. 2016YFC0800501) and the National Natural Science Foundation of China (No. 61672481).
Abstract: This paper presents a joint head pose and facial landmark regression method that takes depth images as input for real-time applications. Our main contributions are as follows. First, we propose a joint optimization method to estimate head pose and facial landmarks: the pose regression result provides supervised initialization for cascaded facial landmark regression, while the landmark regression result in turn helps refine the head pose at each stage. Second, we partition the head pose space into 9 sub-spaces and use a cascaded random forest with a global shape constraint to train facial landmarks in each sub-space. This classification-guided method can effectively handle large pose changes and occlusion. Lastly, we have built a 3D face database containing 73 subjects, each with 14 expressions in various head poses. Experiments on challenging databases show that our method achieves state-of-the-art performance on both head pose estimation and facial landmark regression.
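The abstract does not say how the 9 sub-spaces are defined. As one possible reading, the sketch below splits yaw and pitch into three bins each, which yields a 3 x 3 grid of sub-spaces. The function name `pose_subspace` and the 15-degree threshold are assumptions made for illustration only.

```python
def pose_subspace(yaw_deg, pitch_deg, threshold=15.0):
    """Return a sub-space index in 0..8 for a head pose.

    Assumed layout: three bins per angle (below -threshold, within
    +/- threshold, above +threshold). The index is 3 * pitch_bin + yaw_bin.
    """
    def bin_index(angle):
        if angle < -threshold:
            return 0
        if angle > threshold:
            return 2
        return 1

    return 3 * bin_index(pitch_deg) + bin_index(yaw_deg)

# A frontal pose falls into the central sub-space.
frontal = pose_subspace(0.0, 0.0)
```

A per-sub-space landmark regressor could then be selected with this index, for example `regressors[pose_subspace(yaw, pitch)]`, where `regressors` is a hypothetical list of nine trained models.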
Funding: Supported by the National Key Scientific Instrument and Equipment Development Project of China (No. 2013YQ49087903) and the National Natural Science Foundation of China (No. 61202160).
Abstract: Accurate head poses are useful for many face-related tasks such as face recognition, gaze estimation, and emotion analysis. Most existing methods estimate head poses that are included in the training data (i.e., previously seen head poses). To predict head poses that are not seen in the training data, some regression-based methods have been proposed. However, they focus on estimating continuous head pose angles and thus do not systematically evaluate performance on unseen head poses. In this paper, we use a dense multivariate label distribution (MLD) to represent the pose angle of a face image. By incorporating both seen and unseen pose angles into the MLD, the head pose predictor can estimate unseen head poses with an accuracy comparable to that for seen head poses. On the Pointing'04 database, the mean absolute errors for yaw and pitch are 4.01° and 2.13°, respectively. In addition, experiments on the CAS-PEAL and CMU Multi-PIE databases show that the proposed dense MLD-based head pose estimation method achieves state-of-the-art performance compared with some existing methods.
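A label distribution over a yaw and pitch grid can be sketched as follows. This is a minimal illustration, not the paper's method: it assumes a 2-D Gaussian centered on the true pose, a 15-degree grid, and a pose readout by expectation. The names `label_distribution`, `expected_pose`, and `mean_absolute_error`, the grids, and `sigma` are all hypothetical.

```python
import math

# Assumed grid of candidate pose angles in degrees (not taken from the paper).
YAW_GRID = list(range(-90, 91, 15))
PITCH_GRID = list(range(-90, 91, 15))

def label_distribution(yaw, pitch, yaw_grid, pitch_grid, sigma=15.0):
    """Spread one pose label over the grid as a normalized 2-D Gaussian.

    Rows follow pitch_grid and columns follow yaw_grid. The true pose does
    not have to lie on the grid, so poses between grid points still get a label.
    """
    weights = [
        [math.exp(-((y - yaw) ** 2 + (p - pitch) ** 2) / (2 * sigma ** 2))
         for y in yaw_grid]
        for p in pitch_grid
    ]
    total = sum(sum(row) for row in weights)
    return [[w / total for w in row] for row in weights]

def expected_pose(dist, yaw_grid, pitch_grid):
    """Read a continuous (yaw, pitch) estimate off a distribution."""
    yaw = sum(dist[i][j] * yaw_grid[j]
              for i in range(len(pitch_grid)) for j in range(len(yaw_grid)))
    pitch = sum(dist[i][j] * pitch_grid[i]
                for i in range(len(pitch_grid)) for j in range(len(yaw_grid)))
    return yaw, pitch

def mean_absolute_error(predicted, actual):
    """Average absolute angular error in degrees."""
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)

# A pose halfway between two grid points (yaw 22.5 is not on the grid).
dist = label_distribution(22.5, 0.0, YAW_GRID, PITCH_GRID)
yaw_hat, pitch_hat = expected_pose(dist, YAW_GRID, PITCH_GRID)
```

In a trained system the distribution would come from a predictor rather than from the ground truth. The readout and error computation would stay the same.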
Abstract: Considerable progress has been made recently on 2D human pose tracking with tracking-by-detection approaches. However, several challenges remain in this area, owing to self-occlusion and confusion between the left and right limbs during tracking. In this work, a head orientation detection step is introduced into the tracking framework as a complementary tool to assist human pose estimation. With the face orientation determined, the system can decide whether the left or right side of the human body is fully visible and infer the state of its symmetric counterpart. By granting higher priority to the completely visible side, the system can largely avoid double counting when inferring body poses. The proposed framework is evaluated on the HumanEva dataset. The results show that it greatly reduces the occurrence of double counting and distinguishes the left and right sides consistently.
Abstract: Facial expression recognition (FER) has numerous applications in computer security, neuroscience, psychology, and engineering. Owing to its non-intrusiveness, it is considered a useful technology for combating crime. However, FER faces several challenges, the most serious of which is its poor prediction accuracy in severe head poses. The aim of this study, therefore, is to improve recognition accuracy in severe head poses by proposing a robust 3D head-tracking algorithm based on an ellipsoidal model, an advanced ensemble of AdaBoost, and a saturated vector machine (SVM). The FER features are tracked from one frame to the next using the ellipsoidal tracking model, and the visible expressive facial key points are extracted using Gabor filters. The ensemble algorithm (Ada-AdaSVM) is then used for feature selection and classification. The proposed technique is evaluated using the Bosphorus, BU-3DFE, MMI, CK+, and BP4D-Spontaneous facial expression databases. The overall performance is outstanding.
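For readers unfamiliar with Gabor filters, the sketch below builds the real part of a 2-D Gabor kernel and a small bank of orientations. It only illustrates the filter itself; the abstract does not give the filter parameters, so `size`, `sigma`, `wavelength`, `gamma`, `psi`, and the number of orientations are assumed values, and the function names are hypothetical.

```python
import math

def gabor_kernel(size, sigma, theta, wavelength, gamma=0.5, psi=0.0):
    """Real part of a 2-D Gabor filter as a size x size nested list.

    size is assumed to be odd so the kernel has a central pixel.
    theta is the orientation in radians, wavelength is in pixels.
    """
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotate coordinates into the filter's orientation.
            xr = x * math.cos(theta) + y * math.sin(theta)
            yr = -x * math.sin(theta) + y * math.cos(theta)
            envelope = math.exp(-(xr ** 2 + (gamma * yr) ** 2) / (2 * sigma ** 2))
            carrier = math.cos(2 * math.pi * xr / wavelength + psi)
            row.append(envelope * carrier)
        kernel.append(row)
    return kernel

def gabor_bank(size=9, sigma=2.0, wavelength=4.0, orientations=4):
    """Kernels at evenly spaced orientations in [0, pi)."""
    return [gabor_kernel(size, sigma, k * math.pi / orientations, wavelength)
            for k in range(orientations)]

bank = gabor_bank()
```

Convolving an image patch around a facial key point with each kernel in `bank` gives one response per orientation, which can serve as a feature vector for a downstream classifier.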
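Abstract: To address the high labor cost and strong subjectivity of traditional exam invigilation, an intelligent invigilation model based on face recognition, head pose estimation, and object detection is constructed. The model verifies examinee identity with a face recognition algorithm, detects peeking with a head pose estimation method that incorporates an attention mechanism (channel and spatial-aware wide head pose estimation network, CS-WHENet), and combines deep learning with traditional methods to jointly determine whether examinees are passing notes. Experimental results show that, in an environment simulating a real examination room, the model achieves high accuracy in both identity verification and abnormal behavior detection, and can run in real time with GPU support. Validation shows that the model can effectively reduce the workload of invigilators and support fairness in exam invigilation.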