摘要
在许多应用于有噪声环境下的语音识别系统中 ,唇读技术能有效地降低噪声的影响 ,通过视觉通道来补充仅取决于听觉通道的信息量 ,从而提高语音识别系统的识别率 .该文提出了一种有效和稳健的唇定位跟踪方法 ,以满足不用特殊标识物和规范性照明就能对信息进行有效提取的应用需求 .该方法首先用肤色模型查找脸 ;然后用迭代算法搜索脸部区域内的眼睛 ;再根据眼睛的位置来确定脸的大小和位置 ,并对脸的下半部分采用彩色坐标变换法将唇从肤色中明显地区分出来 ;最后 ,用可变模板将上下唇的内外轮廓描述出来 .
For speech recognition systems under noisy environment, lip reading technique can effectively reduce the influence of noise and improve the accurate rate of speech recognition system by adding visual information to acoustic channel. In this paper, an effective and robust approach for lip and mouth locating and tracking is presented to enable the information extraction under abnormal illumination and without special marks. This approach first locates face region with skin color model, then finds the eyes from the face region with iterative algorithm, modifies the position and size of face according to the position of eyes, transforms the lower part of face by specific color coordinators to clearly distinguish lip color from skin color, and finally describes the outline of upper lip and lower lip with deformable template.
出处
《软件学报》
EI
CSCD
北大核心
2000年第8期1126-1132,共7页
Journal of Software
基金
国家自然科学基金! (No.6 978930 1)
国家 86 3高科技项目基金! (No.86 3- 30 6 - ZT0 3- 0 1- 2 )资助
关键词
口型识别
唇定位
语音识别系统
模式识别
Lip reading, lip movement, skin color model, optical flow, deformable template.