期刊文献+

多核支持向量域描述在基于图像集合匹配的人脸识别中的应用 被引量:3

Multi-kernel support vector domain description and its application in facial recognition based on image set matching
原文传递
导出
摘要 目的图像集匹配是当前模式识别领域研究的一个热点,其核心问题是如何对图像集合建模并度量两个模型的相似性,为此提出一种基于支持向量域描述的人脸识别的方法。方法支持向量域描述是一种基于支持向量机学习的数据描述方法,可以用于图像集合建模,但是单一的核函数不能准确地描述具有多中心分布的数据。本文通过多核学习扩展了支持向量域描述,提高其对多中心分布数据的表达能力。进一步借助与位置相关的方法对样本动态加权,解决全局权重参数所带来的问题。结果在公开的基于集合的人脸识别数据库上进行测试,在Honda/UCSD、CMU MoBo和YouTube数据库上,本文方法的识别率分别达到100%、98.72%和62.34%。结论实验结果表明,在光照条件受控制的监控环境中,本文方法是有效的,并取得了优于其他基于集合匹配的人脸识别算法。 Objective Image set matching has attracted increasing attention in the field of pattern recognition. For set-based image matching, the key issues can be categorized on the basis of the processes of representing the image set and measuring the similarity between two sets. Method Support vector domain description (SVDD) is a recently developed method based on support vector machine learning. SVDD is a boundary one-class learning method that maximizes the availability of sam- pies that do not belong to the target class in refining its decision boundary, and can be used to describe a set of objects. Accordingly, each image set is described with a hypersphere, and the problem of image set matching is converted into the measure of the clistance between two hyperspheres. Using support vector machine learning, each image set from the original input space is mapped into a high-dimensional feature space and modeled with support vector domain to handle the underly- ing non-linearity in the data space. In the feature space, a hypersphere encloses most of the mapped data. Thereafter, a novel metric is proposed based on domain - domain distance in a high-dimensional feature space; the distance between twoimage sets is then converted into the distance between pair-wise domains. However, the SVDD model has a disadvanta- geously simple form with only a single kernel information. Selecting the best kernel parameters is difficult and the construc- ted hypersphere is considerably sensitive to the trade-off parameter. Multiple kernel learning methods apply multiple kernels instead of merely one specific kernel function and its corresponding parameters. Recent developments in composition kernel learning for classification motivated us to apply a position-based weighting instead of the same global trade-off parameter to discriminate the importance of samples. Furthermore, considering the SVDD model' s disadvantageousiy simple form with only one kernel and the difficulty of selecting the best kernel parameters, we propose a muhi-kemel SVDD model, which can flexibly describe the data distribution boundary in the feature space after analyzing the space of multi-kernel mapping. This study utilizes the nearest neighbor classifier to obtain the class label. Result This study' s experimental settings reach 100% , 98.72% , and 62. 34% recognition rate in the public HondafUCSD, CMU MoBo, and YouTube video database, respectively. Conclusion Given that multi-kernel learning can improve the efficiency of kernel selection and automatically evaluate the relative importance of the candidate kernels, the multi-kernel SVDD model flexibly describes the data distribu- tion boundary in the feature space and provides a considerably accurate data description for the multifaceted context of the multi-model data set. Experiments conducted on public data sets demonstrate that the multi-kernel SVDD improves predic- tion accuracy and assists in characterizing the properties of complex data.
作者 曾青松
出处 《中国图象图形学报》 CSCD 北大核心 2016年第8期1021-1027,共7页 Journal of Image and Graphics
基金 广东省自然科学基金项目(2015A030313807)~~
关键词 集合匹配 模式识别 人脸识别 支持向量域描述 距离度量 多核学习 set matching pattern recognition face recognition support vector domain description distance measure multiple kernel learning
  • 相关文献

参考文献5

二级参考文献112

  • 1Zhou Yatong Zhang Taiyi Li Xiaohe.MULTI-SCALE GAUSSIAN PROCESSES MODEL[J].Journal of Electronics(China),2006,23(4):618-622. 被引量:4
  • 2Chowdhury A, Chellappa R. Face reconstruction from monocular video using uncertainty analysis and a generic model.Computer Vision and Image Understanding, 2003, 91 (1) : 188-213
  • 3Choudhury A, Clarkson B, Jebara T, Penland A. Multimodal person recognition using unconstrained audio and video// Proceedings of the Conference on Audio- and Video-based Biometric Person Authentication. Washington D. C, 1999: 176-180
  • 4Zhang Z Y, Liu Z C, Adler D, Cohen M F, Hanson E, Shan Y. Robust and rapid generation of animated faces from video images: A model-based modeling approaeh. International Journal of Computer Vision, 2004, 58(2) : 93-119
  • 5Zhou X, Bhanu B. Integrating face and gait for human recognition at a distance in video. IEEE Transactions on Systems, Man and Cybernetics, Part B, 2007, 37(5):1119-1137
  • 6JingXY, Yao Y F, Zhang D, YangJ Y, Li M. Face and palmprint pixel level fusion and kernel DCV-RBF classifier for small sample biometric recognition. Pattern Recognition, 2007, 40(11): 3209-3324
  • 7Yan Y, Zhang Y J. Multimodal biometrics fusion using correlation filter bank//Proceedings of the 19th IAPR International Conference on Pattern Recognition. Tampa, 2008, MoBTT. 3(1-4)
  • 8McKenna S, Gong S, Raja Y. Face recognition in dynamic scenes//Proceedings of the British Machine Vision Conference. Colchester, 1997: 140-151
  • 9Park U, Jain A K, Ross A. Face recognition in video: Adaptive fusion of multiple matchers//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Minneapolis, 2007:1-8
  • 10Wolf L, Shashua A. Kernel principal angles for classification machines with applications to image sequence interpretation//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Madison, 2003:635-642

共引文献270

同被引文献26

引证文献3

二级引证文献25

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部