This paper describes a method for recognizing Chinese tones in continuous speech. The first and second order differentials of the fundamental frequency logarithmically converted are used as feature parameters. A left-...This paper describes a method for recognizing Chinese tones in continuous speech. The first and second order differentials of the fundamental frequency logarithmically converted are used as feature parameters. A left-to-right hidden Markov modeling with five states, each of which is modeled by a single Gaussian distribution, expresses each of Chinese tones. Non-voiced portions are coded by random values normally distributed to uniformly deal with all the time frames in an utterance. Speaker dependent tone recognition was conducted for ten speakers. The average rate of 81.8% was obtained for these speakers.展开更多
文章分析了经典隐马尔可夫模型(Hidden Markov Model,HMM)齐次假设的理论缺陷,以及两种非齐次HMM。语音识别对比实验表明,经验性的惩罚概率法是稳健的、且更有效的补偿方法。实验结果还指出在最优惩罚概率下,经典HMM达到了与非齐...文章分析了经典隐马尔可夫模型(Hidden Markov Model,HMM)齐次假设的理论缺陷,以及两种非齐次HMM。语音识别对比实验表明,经验性的惩罚概率法是稳健的、且更有效的补偿方法。实验结果还指出在最优惩罚概率下,经典HMM达到了与非齐次的基于段长分布的HMM(Duration Distribution Based HMM,DDBHMM)几乎相同的识别率,证明了齐次假设并不影响经典HMM在实用中的重要性。文章提出了一种改进Baum-Welch重估算法的初值的经验方法,用于HMM参数的估计,在汉语连续语音识别实验中一致性地降低了音节误识率。展开更多
文摘This paper describes a method for recognizing Chinese tones in continuous speech. The first and second order differentials of the fundamental frequency logarithmically converted are used as feature parameters. A left-to-right hidden Markov modeling with five states, each of which is modeled by a single Gaussian distribution, expresses each of Chinese tones. Non-voiced portions are coded by random values normally distributed to uniformly deal with all the time frames in an utterance. Speaker dependent tone recognition was conducted for ten speakers. The average rate of 81.8% was obtained for these speakers.
文摘文章分析了经典隐马尔可夫模型(Hidden Markov Model,HMM)齐次假设的理论缺陷,以及两种非齐次HMM。语音识别对比实验表明,经验性的惩罚概率法是稳健的、且更有效的补偿方法。实验结果还指出在最优惩罚概率下,经典HMM达到了与非齐次的基于段长分布的HMM(Duration Distribution Based HMM,DDBHMM)几乎相同的识别率,证明了齐次假设并不影响经典HMM在实用中的重要性。文章提出了一种改进Baum-Welch重估算法的初值的经验方法,用于HMM参数的估计,在汉语连续语音识别实验中一致性地降低了音节误识率。