Abstract
The prosody model is an essential part of a text-to-speech (TTS) system and plays a crucial role in the naturalness of synthesized speech. This paper combines artificial neural networks with a unit selection algorithm and applies them, respectively, to the generation of duration and of the fundamental frequency (pitch) contour in the prosody model. The duration model uses a three-layer back-propagation network, while the pitch model uses a unit selection algorithm that minimizes the sum of distances over the whole utterance.
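The abstract does not spell out how the minimum distance sum is computed, so the following is only a sketch of one common way to select units over a whole utterance: dynamic programming over per-syllable candidates. The names `select_units`, `target_costs`, and `join_cost`, and the split into target and join distances, are assumptions made for illustration and are not taken from the paper.

```python
from typing import Callable, List, Sequence


def select_units(
    target_costs: Sequence[Sequence[float]],
    join_cost: Callable[[int, int, int], float],
) -> List[int]:
    """Pick one candidate per syllable so the summed distance is minimal.

    target_costs[i][k]: assumed distance between candidate k and the
        desired context of syllable i.
    join_cost(i, j, k): assumed distance between candidate j of syllable
        i - 1 and candidate k of syllable i.
    Returns the chosen candidate index for every syllable.
    """
    n = len(target_costs)
    if n == 0:
        return []

    # best[i][k]: minimal accumulated distance of any path ending in
    # candidate k of syllable i; back[i][k]: predecessor on that path.
    best = [list(target_costs[0])]
    back = [[-1] * len(target_costs[0])]

    for i in range(1, n):
        row, ptr = [], []
        prev = best[i - 1]
        for k, tc in enumerate(target_costs[i]):
            j_best = min(
                range(len(prev)),
                key=lambda j: prev[j] + join_cost(i, j, k),
            )
            row.append(prev[j_best] + join_cost(i, j_best, k) + tc)
            ptr.append(j_best)
        best.append(row)
        back.append(ptr)

    # Trace back from the cheapest final candidate.
    k = min(range(len(best[-1])), key=best[-1].__getitem__)
    path = [k]
    for i in range(n - 1, 0, -1):
        k = back[i][k]
        path.append(k)
    path.reverse()
    return path
```

Minimizing over the whole utterance differs from choosing the closest candidate for each syllable in isolation: a candidate with a slightly worse local match can be preferred when it joins more smoothly with its neighbours.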
Source
Application Research of Computers (《计算机应用研究》)
CSCD
Peking University Core Journals (北大核心)
2006, No. 6, pp. 79-81, 104 (4 pages)
Funding
National Natural Science Foundation of China (Grant No. 60435020)
Keywords
Text-to-Speech (TTS)
Prosody Model
Neural Network
Unit Selection