期刊文献+

唇读识别中的基本口型分类 被引量:3

Basic Mouth Shape Classification for Speechreading
下载PDF
导出
摘要 1引言自然人机交互方式使得人同计算机的交流不再局限于键盘、鼠标等外设.而是通过语言及手势、表情、唇动等形体语言来进行,从而使得人机交互变得像人与人之间的交流一样轻松自如.唇读通常被视为说话过程中伴随的辅助信息,它有助于对说话者提供信息的更准确理解.减弱噪音干扰. This paper has put forward a concept of mouth-shape basic unit,and described an approach of obtaining basic units by mouth-shape images classifying and clustering. The basic units are the gist of breakaway different states during continuous speech recognition or sequence images speechreading,which can definitely measure off statuses for mouth shape changing in sequence images. The method based on mouth-shape basic unit,compared with the approach based on feature vector directly,can rather reduce the number of state space branch,shrink searching space and expedite convergence rate. This paper introduces the preprocessing of mouth shape classification,the approach of real-time lip movement detection and classification,and gives experiment results of how to select the number of original clustering center in order to fit mouth shape classification and how to select features to be propitious to mouth shape classification, then gets a conclusion.
出处 《计算机科学》 CSCD 北大核心 2002年第2期130-133,共4页 Computer Science
基金 国家863计划项目(863-306-QN99-4) 国家863计划项目(863-306-ZT03-01-2) 国家自然科学基金(69789301) 中科院百人计划的资助
关键词 唇读识别 口型分类 语音识别 计算机 Speechreading, Clustering analysis ,Automatic speech recognition
  • 相关文献

参考文献5

  • 1Stork D G, Wolff G J,Levine E P. Neural Network Lipreading System for Improved Speech Recognition. In: Proc. Intl. Joint Conf. on Neural Networks, 1992,2: 289~295
  • 2Hennecke M E,Stork D G,Prasad K V. Visionary Speech: Looking ahead to Practical Speechreading Systems. In: David G. Stork and Marcus E. Hennecke,eds. Speechreading by Humans and Machines, Springer and Systems Sciences. 1996. 331 ~ 350
  • 3Gao W,Liu M B. A Hierarchical Approach to Human Face Detection in Complex Background. the First International Conference on Multimodal Interface, Beijing, 1996
  • 4姚鸿勋,高文,李静梅,吕雅娟,王瑞.用于口型识别的实时唇定位方法[J].软件学报,2000,11(8):1126-1132. 被引量:10
  • 5姚鸿勋,刘明宝,高文,范旭彤,张洪明,吕雅娟.基于彩色图像的色系坐标变换的面部定位与跟踪法[J].计算机学报,2000,23(2):158-165. 被引量:54

二级参考文献6

  • 1Gao W,Proceedings of the First International Conference on Multimodal Interface,1996年,289页
  • 2Dai Y,Pattern Recognition,1996年,29卷,6期,1007页
  • 3Kin Manlam,Pattern Recognition,1996年,29卷,5期,771页
  • 4Yang J,Carnegia Mellon University:Technical Report CMU-CS-95 -2 10,1995年
  • 5Yang G,Pattern Recognition,1994年,27卷,1期,53页
  • 6Kin Manlam,Pattern Recognition,1996年,29卷,5期,771页

共引文献59

同被引文献23

  • 1洪晓鹏,姚鸿勋,徐铭辉.基于句子级的唇读语料库及其切分算法[J].计算机工程与应用,2005,41(3):174-177. 被引量:7
  • 2张欣,杜利民,陈柯,赵向阳.汉语语音视觉合成研究数据库CVSS1.0[J].微计算机应用,2007,28(3):260-265. 被引量:3
  • 3李刚,王蒙军,林凌.面向残疾人的汉语可视语音数据库[J].中国生物医学工程学报,2007,26(3):355-360. 被引量:3
  • 4余志龙,等.GooSeAndroidSDK开发范例大全[M].2版.北京:人民邮电出版社,2010.
  • 5M Ying-jie,H Ying-jie,Z Hai-yah,et al.Feature mouth shapes extraction based on contour of internal lips[C]//2010 International Conference on Wireless Communications Networking and Mobile Computing(WiCOM).IEEE press,2010:1-5.
  • 6Z Yi,L Quan-jie,L Yan-hua,et al.Intelligent wheelchair multimodal human-machine interfaces in lip contour extraction based on PMM[C]//2009 IEEE International Conference on Robotics and Biomimetics(ROBIO).IEEE,2009:2108-2113.
  • 7Kass M,Witkin A,Terzopoulos D.Snakes:Active contour modds[J].International Journal of Computer Vision,1988,1(4):321-331.
  • 8Feng X,He Q,Wang W.An improved GAC model for lip contour detection[C]//9th International Conference on Signal Processing,2008(ICSP 2008).IEEE,2008:1215-1218.
  • 9Saha S,Pandey P C.Estimation of the area of mouth opening during speech production[C]//Proceedings of the Eighth Indian Conference on Computer Vision,Graphics and Image Processing.ACM,2012:27.
  • 10Viola P,Jones M J.Robust real-time face detection[J].International Journal of Computer Vision,2004,57(2):137-154.

引证文献3

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部