
A Survey of the Synthesis of Visual Co-Articulation (可视化协同发音合成研究综述)
Abstract: Visual speech, applied in teaching, communication, e-commerce, and other fields, can further improve the friendliness and convenience of human-computer interaction, and it has attracted wide attention in recent years. The synthesis of visual co-articulation is one of the important parts of visual speech research. This paper explains the basic concept of visual co-articulation, introduces representative descriptive methods, and reviews the current state of research on the two main synthesis approaches: image-based methods and model-based methods. It then summarizes the advantages and disadvantages of the two approaches and discusses directions for their future development.
Authors: 吴翠娟 (Wu Cuijuan), 赵晖 (Zhao Hui)
Source: Modern Computer (《现代计算机》), 2014, No. 9, pp. 9-14 (6 pages)
Funding: National Natural Science Foundation of China (No. 61261037)
Keywords: Visual Co-Articulation; Lip Synchronization; Speech Animation

