期刊文献+

基于PS-Level Set的嘴唇几何形状定位模型 被引量:6

Detection Model of Geometric Shape of Lip Based on PS-Level Set
下载PDF
导出
摘要 针对面向唇读的水平集模型在嘴唇分割中存在边界过收敛和过早收敛的问题,文中提出了一种改进的基于先验知识的水平集模型(简称为PS-Level Set)来进行嘴唇几何形状的定位.PS-Level Set模型利用改进的差值能量函数引入嘴唇形状的先验信息.在曲线演化过程中,反复比较演化曲线和先验曲线的差距,使曲线的演化形状逐渐逼近先验模型形状,从而更精确地收敛于目标物体实际轮廓.实验表明,用PS-Level Set模型定位嘴唇几何形状的准确率比用水平集模型提高了8.38%. In order to overcome the overconvergence and the premature convergence of lip boundary caused by the level set model for geometric shape detection, an improved level set model based on the prior shape ( PS-Level Set) is proposed. In this model, the prior shape information of lip is incorporated into an improved differential energy function, and the differences between the evolution shape curve and the prior shape curve are repeatedly compared during the curve-evolving process, which enables the evolution shape to gradually approach the prior one and to converge to the target object more accurately. Experimental results show that, as compared with the conventional level set model, the proposed model improves the detection accuracy by 8.38%.
出处 《华南理工大学学报(自然科学版)》 EI CAS CSCD 北大核心 2010年第2期121-125,共5页 Journal of South China University of Technology(Natural Science Edition)
基金 国家自然科学基金资助项目(60572141 60602014)
关键词 唇读 形状定位 水平集模型 曲线演化 lip reading shape detection level set model curve evolution
  • 相关文献

参考文献12

  • 1McGurk J M H. Hearing lips and seeing voices [ J ]. Nature, 1976,264 : 746-748.
  • 2Potamianos G, Neti C, Gravier G, et al. Recent advances in the automatic recognition of audiovisual speech [ J ]. IEEE Signal Processing Magazine, 2003,91 (9) : 1 306- 1 323.
  • 3Zhang X, Mersereau R M, Clements M. Visual speech feature extraction for improved speech recognition [ C ] //Proc of IEEE International Conference on Acoustics, Speech and Signal Processing. Orlando : IEEE, 2002 : 1993-1 996.
  • 4Nefian A, Liang L, Pi X, et al. A couple HMM for audiovisual speech recognition [ C ]//Proc of IEEE International Conference on Acoustics, Speech and Signal Processing. Orlando: IEEE,2002:2013-2016.
  • 5Werda S, Mahdi W, Tmak M, et al. A life:automatic lip feature extraction:a new approach for speech recognition application [ C ]///Proc of IEEE International Conference on Information and Communication Technologies. Damasus:IEEE,2006:2953-2968.
  • 6Dumitras A, Venetsanopoulos N A. Angular map driven snakes with application to object shape description in color images [ J ]. IEEE Transactions on Image Processing, 2001,10(12) : 1 851-1 859.
  • 7Osher S, Sethian J A. Level sets and the fast marching method: evolving interfaces in computational geometry [ M ] // Fluid Mechanics, Computer Vision and Materials Science. Cambridge : Cambridge University Press, 1999.
  • 8Li C, Xu C, Gui C, et al. Level set evolution without reinitialization : a new variational formulation [ C ] // Proc of IEEE Computer Society Conference on Computer Vision and Pattern Recognition. San Diego : IEEE, 2005 : 430- 436.
  • 9Xu C, Prince J L. Snakes, shapes and gradient vector flow [ J ]. IEEE Transactions on Image Processing, 1998,7 ( 3 ) : 359-369.
  • 10Cremers D, Rousson M, Deriche R. A review of statistical approaches to level set segmentation: integrating color, texture, motion and shape [J]. International Journal of Computer Vision, 2007,72 ( 2 ) : 195- 215.

同被引文献60

  • 1宋怀波,齐关锋,钱程.基于YUV颜色空间的脸部区域特征点定位方法[J].吉林大学学报(工学版),2013,43(S1):39-42. 被引量:3
  • 2丁爱玲,周秦武,郑春红.基于直方图均衡与小波变换的超声图像增强[J].长安大学学报(自然科学版),2004,24(6):84-87. 被引量:3
  • 3郝颖明,朱枫.2维Otsu自适应阈值的快速算法[J].中国图象图形学报(A辑),2005,10(4):484-488. 被引量:121
  • 4胡振涛,刘先省.一种实用的数据融合算法[J].自动化仪表,2005,26(8):7-9. 被引量:25
  • 5王晓平,郝玉峰,付德刚,袁春伟.一种自动的唇部定位及唇轮廓提取、跟踪方法[J].模式识别与人工智能,2007,20(4):485-491. 被引量:7
  • 6MI Faraj, J Bigun. S ynergy of lip-motion and acoustic features in biometric speech and speaker recognition[ J]. IEEE Transac- tions on Computer,2007,56(9): 1169- 1175.
  • 7S Kumagal, K Doman, et al. Detection of inconsistency between subject and speaker based on the co-occurrence of lip motion and voice towards speech scene extraction from news videos [ A]. IEEE International Symposium on Multimedia[ C]. Cali- fornia: IEEE,2011.311 - 318.
  • 8M Slaney,M Covell. Facesync:A linear operator for measuring synchronization of video facial images and audio track [ A ].Neural Information Processing Systems [ C ]. Denver: NIPSF, 2000. 814 - 820.
  • 9N Eveno, L Besacier. A speaker independent "liveness" test for audio-visual biomelrics [ A ]. Nineth European Conference on Speech Communication and Technology [ C ]. Lisbon: ISCA, 2005. 3081 - 3084.
  • 10G ChoUet, R Landais, et al. Some experiments in audio-visual speech processing [A ]. Non-Linear Speech Processing 2007 [ C]. Paris-ISCA, 2007.28 - 56.

引证文献6

二级引证文献25

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部