
How to gradually decrease the dependence on specific speech materials in speech processing?
Abstract: Speech processing depends on large amounts of specific speech material, and handling that material is laborious. To address this, the paper first analyzes the fundamental goal of speech processing and the current state of speech technology, and identifies the root of the dependence. It then discusses the inevitability of speech variability and the relative nature of acoustic invariance, showing that speech variation is not entirely unknowable. The key to the problem is therefore to understand the regularities of speech variation fully and to exploit them jointly within the processing system. Finally, the paper proposes a strategy whose basic principle is to improve speech database construction so that phonetic knowledge and speech material are integrated, with relational acoustic invariants gradually taking over the role of specific speech material. A preliminary proposal for constructing such a database is also presented.
Author: CAO Jianfen (曹剑芬)
Source: Journal of Tsinghua University (Science and Technology), 2009, No. S1, pp. 1380-1387 (8 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
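Funding: Chinese Academy of Social Sciences research fund for senior scholars; cooperative project between the Institute of Linguistics, Chinese Academy of Social Sciences, and the Toshiba (China) research center.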
Keywords: variability of speech sound; acoustic invariance; speech database construction