期刊文献+

一种基于共振峰分析的语音驱动人脸动画方法 被引量:1

An Approach of Speech-driven Facial Animation Based on Formants Analysis
下载PDF
导出
摘要 快速、高效地实现语音驱动下的唇形自动合成,以及优化语音与唇动的同步是语音驱动人脸动画的重点。提出了一种基于共振峰分析的语音驱动人脸动画的方法。对语音信号进行加窗分帧,DFT变换,再对短时音频信号的频谱进行第一、第二共振峰分析,将分析结果映射为一组控制序列,并对控制序列进行去奇异点等后处理。设定三维人脸模型的动态基本口形,以定时方式将控制序列导入模型,完成人脸动画驱动。实验结果表明,该方法简单快速,有效实现了语音和唇形的同步,动画效果连贯自然,可广泛用于各类虚拟角色的配音,缩短虚拟人物的制作周期。 Automatic synthesis of lip animation driven by speech and lip synchronization is the key issues in speech driven facial animation system. A new approach of speech-driven facial animation based on formants analysis is presented. The input audio signal is divided into partly overlapped frames and multiplied by a Hamming window, and then a DFT (Discrete Fourier Transformation) is used. In the frequency domain of the short time signal, the 1st and 2nd formant are analyzed in order to form a control sequence. Several basic dynamic mouth shapes of the 3D facial model are defined, and the control sequence is used to drive the facial movements. The results show that the input speech and facial lip animation is synchronized precisely in this way, and the effect of the animation is fluent and looks real.
作者 潘晋 杨卫英
出处 《电声技术》 2009年第5期62-65,共4页 Audio Engineering
关键词 语音驱动 共振峰分析 人脸动画 语音唇形同步 speech driving formants analysis lip animation speech-lip synchronization
  • 相关文献

参考文献7

  • 1MASSARO D W, BESKOW J, COHEN M M. Picture my voice: Audio to visual speech synthesis using artificial neural networks[C]// Proceedings of the 4th Annual Auditory-Visual Speech Processing Conference (AVSP'99). Santa Cruz : [s.n.], 1999 : 105-111.
  • 2BRAND M. Voice puppetry[C]// Proceedings of the SIGGRAPH'99. Los Angeles:[s.n.], 1999 : 21-28.
  • 3陈益强,高文,王兆其,姜大龙.基于机器学习的语音驱动人脸动画方法[J].软件学报,2003,14(2):215-221. 被引量:20
  • 4林鑫,陈桦,王开志,王继成.语音驱动唇形自动合成算法[J].计算机工程,2007,33(17):237-238. 被引量:6
  • 5KAKUMANU P,GUTIERREZ-OSUNA R, ESPOSITO A, et al. Speech driven facial animation [C]//Proceedings of the 2001 Workshop on Perceptive User Interfaces. Florida : [s.n.], 2001,15 : 1-5.
  • 6涂欢,周经野,刘军发,崔国勤,谢晨.一种语音和文本联合驱动的卡通人脸动画方法[J].小型微型计算机系统,2007,28(12):2238-2241. 被引量:1
  • 7王炳锡,屈丹,彭煊.实用语音识别基础[M].北京:国防工业出版社,2004:264-286.

二级参考文献23

  • 1[1]Beskow J. Rule-Based visual speech synthesis. In: Proceedings of the 4th European Conference on Speech Communication and Technology. 1995. 299~302. http://www.speech.kth.se/~beskow/papers/es95rul.pdf.
  • 2[2]Waters K, Levergood, TM. DECface : an automatic lip-synchronization algorithm for synthetic face. Technical Report, CRL 93-4, Digital Equipment Corporation, Cambridge Research Laboratory, 1993. ftp://crl.dec.com/pub/DEC/CRL/tech-reports/93.4.ps.Z.
  • 3[3]Hong PY, Wen Z, Huang TS. IFACE: a 3D synthetic talking face. International Journal of Image and Graphics, 2001,1(1):1~8.
  • 4[4]Ezzat T, Poggio, T. Visual speech synthesis by morphing visemes. International Journal of Computer Vision, 2000,38(1):45~57.
  • 5[5]Yehia H, Kuratate T, Vatikiotis-Bateson E. Using speech acoustics to drive facial motion. In: Proceedings of the 14th international congress of phonetic sciences (ICPhS'99). 1999. 631~634. http://trill.berkeley.edu/ICPhS/frameless/acceptance.html.
  • 6[6]Massaro DW, Beskow J, Cohen MM. Picture my voice: audio to visual speech synthesis using artificial neural networks. In: Proceedings of the 4th Annual Auditory-Visual Speech Processing Conference (AVSP'99). 1999. 105~111. http://mambo.ucsc.edu/ pdf/avsp9922.pdf.
  • 7[7]Brand M. Voice puppetry. In: Proceedings of the SIGGRAPH'99. 1999. 21~28. http://www.cs.cmu.edu/~ph/869/papers/Brand- sigg99.pdf.
  • 8[8]Ostermann J. Animation of synthetic faces in MPEG-4. Computer Animation, 1998. 49~51. http://www.research.att.com/projects/ AnimatedHead/pimages/companim3.pdf.
  • 9[9]Zhen B, Wu XH, Liu ZM, Chi HS. An enhanced RASTA processing for speaker identification, In: Huang TY, ed. Proceedings of the International Symposium of Chinese Spoken Language Processing. Beijing: China Military Friendship Publish,2000. 251~255.
  • 10[10]Wang AH, Bao HQ, Chen JY. Primary research on the viseme system in standard Chinese, In: Huang TY, ed. Proceedings of the International Symposium of Chinese Spoken Language Processing. Beijing: China Military Friendship Publish, 2000. 215~218.

共引文献33

同被引文献3

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部