摘要
1引言自然人机交互方式使得人同计算机的交流不再局限于键盘、鼠标等外设.而是通过语言及手势、表情、唇动等形体语言来进行,从而使得人机交互变得像人与人之间的交流一样轻松自如.唇读通常被视为说话过程中伴随的辅助信息,它有助于对说话者提供信息的更准确理解.减弱噪音干扰.
This paper has put forward a concept of mouth-shape basic unit,and described an approach of obtaining basic units by mouth-shape images classifying and clustering. The basic units are the gist of breakaway different states during continuous speech recognition or sequence images speechreading,which can definitely measure off statuses for mouth shape changing in sequence images. The method based on mouth-shape basic unit,compared with the approach based on feature vector directly,can rather reduce the number of state space branch,shrink searching space and expedite convergence rate. This paper introduces the preprocessing of mouth shape classification,the approach of real-time lip movement detection and classification,and gives experiment results of how to select the number of original clustering center in order to fit mouth shape classification and how to select features to be propitious to mouth shape classification, then gets a conclusion.
出处
《计算机科学》
CSCD
北大核心
2002年第2期130-133,共4页
Computer Science
基金
国家863计划项目(863-306-QN99-4)
国家863计划项目(863-306-ZT03-01-2)
国家自然科学基金(69789301)
中科院百人计划的资助
关键词
唇读识别
口型分类
语音识别
计算机
Speechreading, Clustering analysis ,Automatic speech recognition