期刊文献+

用于口型识别的实时唇定位方法 被引量:10

Real-Time Lip Locating Method for Lip-Movement Recognition
下载PDF
导出
摘要 在许多应用于有噪声环境下的语音识别系统中 ,唇读技术能有效地降低噪声的影响 ,通过视觉通道来补充仅取决于听觉通道的信息量 ,从而提高语音识别系统的识别率 .该文提出了一种有效和稳健的唇定位跟踪方法 ,以满足不用特殊标识物和规范性照明就能对信息进行有效提取的应用需求 .该方法首先用肤色模型查找脸 ;然后用迭代算法搜索脸部区域内的眼睛 ;再根据眼睛的位置来确定脸的大小和位置 ,并对脸的下半部分采用彩色坐标变换法将唇从肤色中明显地区分出来 ;最后 ,用可变模板将上下唇的内外轮廓描述出来 . For speech recognition systems under noisy environment, lip reading technique can effectively reduce the influence of noise and improve the accurate rate of speech recognition system by adding visual information to acoustic channel. In this paper, an effective and robust approach for lip and mouth locating and tracking is presented to enable the information extraction under abnormal illumination and without special marks. This approach first locates face region with skin color model, then finds the eyes from the face region with iterative algorithm, modifies the position and size of face according to the position of eyes, transforms the lower part of face by specific color coordinators to clearly distinguish lip color from skin color, and finally describes the outline of upper lip and lower lip with deformable template.
出处 《软件学报》 EI CSCD 北大核心 2000年第8期1126-1132,共7页 Journal of Software
基金 国家自然科学基金! (No.6 978930 1) 国家 86 3高科技项目基金! (No.86 3- 30 6 - ZT0 3- 0 1- 2 )资助
关键词 口型识别 唇定位 语音识别系统 模式识别 Lip reading, lip movement, skin color model, optical flow, deformable template.
  • 相关文献

参考文献1

  • 1Kin Manlam,Pattern Recognition,1996年,29卷,5期,771页

同被引文献58

  • 1梁毅雄,龚卫国,潘英俊,李伟红,刘嘉敏,张红梅.基于奇异值分解的人脸识别方法[J].光学精密工程,2004,12(5):543-549. 被引量:40
  • 2李小红.基于积分投影的人脸图像的特征提取[J].计算机仿真,2004,21(12):189-191. 被引量:12
  • 3洪晓鹏,姚鸿勋,徐铭辉.基于句子级的唇读语料库及其切分算法[J].计算机工程与应用,2005,41(3):174-177. 被引量:7
  • 4张欣,杜利民,陈柯,赵向阳.汉语语音视觉合成研究数据库CVSS1.0[J].微计算机应用,2007,28(3):260-265. 被引量:3
  • 5李刚,王蒙军,林凌.面向残疾人的汉语可视语音数据库[J].中国生物医学工程学报,2007,26(3):355-360. 被引量:3
  • 6Stork D G, Wolff G J,Levine E P. Neural Network Lipreading System for Improved Speech Recognition. In: Proc. Intl. Joint Conf. on Neural Networks, 1992,2: 289~295
  • 7Hennecke M E,Stork D G,Prasad K V. Visionary Speech: Looking ahead to Practical Speechreading Systems. In: David G. Stork and Marcus E. Hennecke,eds. Speechreading by Humans and Machines, Springer and Systems Sciences. 1996. 331 ~ 350
  • 8Gao W,Liu M B. A Hierarchical Approach to Human Face Detection in Complex Background. the First International Conference on Multimodal Interface, Beijing, 1996
  • 9N A Fox,R B Reilly.Audio-Visual Speaker Identification Based on the Use of Dynamic Audio and Visual Features[C].In:Proceedings of the 4th Int.Conf.on Audio-and Video-Based Biometric Person Authentication,AVBPA,Guildford,UK,2003:743~751
  • 10S Lucey,T Chen.Improved audio-visual speaker recognition via the use of a hybrid combination strategy[C].In:Conf of Audio-and VideoBased Person Authentication(AVBPA),Guildford U K,2003

引证文献10

二级引证文献23

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部