Most of the information in the digital world is accessible only to the few who can read or understand a particular language. Speech corpus acquisition is an essential part of all spoken language technology systems. The quality and volume of the speech data in a corpus directly affect the accuracy of the system. There is, however, considerable scope for developing speech technology systems for Hindi, which is spoken primarily in India. To achieve such an ambitious goal, the collection of a standard database is a prerequisite. This paper summarizes the Hindi corpora and lexical resources being developed by various organizations across the country.
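This paper briefly analyzes the overall process of Chinese speech synthesis, reports preliminary research and practice, and proposes a program implementation of speech synthesis based on a speech database. After simple text processing and phonetic annotation, speech data are read from the speech database and concatenated; the synthesized speech is then packaged in Wave format and passed to a playback program. The implementation is written in C#, calls Windows system API functions, and stores the speech database in SQL Server 2005.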
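The pipeline described in the abstract above (text processing, phonetic annotation, lookup in a speech database, concatenation, Wave packaging) can be illustrated with a minimal sketch. It is not the authors' C#/SQL Server 2005 implementation: Python is used here as a stand-in, the `SPEECH_DB` and `PINYIN` tables are hypothetical two-entry dictionaries, and sine tones replace recorded syllables. The sample rate and all function names are assumptions made for illustration.

```python
import io
import math
import struct
import wave

SAMPLE_RATE = 16000  # Hz; assumed, the abstract does not state a rate


def tone(freq_hz, duration_ms=200):
    """Stand-in for a recorded syllable: 16-bit mono PCM sine tone."""
    n = SAMPLE_RATE * duration_ms // 1000
    return b"".join(
        struct.pack("<h", int(12000 * math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE)))
        for i in range(n)
    )


# Hypothetical speech database: pinyin syllable -> raw PCM bytes.
# A real system would load recorded units from a database instead.
SPEECH_DB = {"ni3": tone(220), "hao3": tone(330)}

# Hypothetical annotation table: Chinese character -> pinyin with tone number.
PINYIN = {"你": "ni3", "好": "hao3"}


def annotate(text):
    """Text processing and phonetic annotation: drop unknown characters
    (for example punctuation) and map the rest to pinyin syllables."""
    return [PINYIN[ch] for ch in text if ch in PINYIN]


def synthesize(text):
    """Concatenate the stored unit for each syllable and wrap it as a WAV file."""
    pcm = b"".join(SPEECH_DB[syllable] for syllable in annotate(text))
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)           # mono
        w.setsampwidth(2)           # 16-bit samples
        w.setframerate(SAMPLE_RATE)
        w.writeframes(pcm)
    return buf.getvalue()


wav_bytes = synthesize("你好")
```

The resulting `wav_bytes` can be written to a `.wav` file or handed to any audio player. Plain concatenation of units, as done here, ignores prosody and boundary smoothing, which a practical synthesizer would need to address.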