期刊文献+
共找到7篇文章
< 1 >
每页显示 20 50 100
Discriminative tone model training and optimal integration for Mandarin speech recognition
1
作者 黄浩 朱杰 《Journal of Southeast University(English Edition)》 EI CAS 2007年第2期174-178,共5页
Two discriminative methods for solving tone problems in Mandarin speech recognition are presented. First, discriminative training on the HMM (hidden Markov model) based tone models is proposed. Then an integration t... Two discriminative methods for solving tone problems in Mandarin speech recognition are presented. First, discriminative training on the HMM (hidden Markov model) based tone models is proposed. Then an integration technique of tone models into a large vocabulary continuous speech recognition system is presented. Discriminative model weight training based on minimum phone error criteria is adopted aiming at optimal integration of the tone models. The extended Baum Welch algorithm is applied to find the model-dependent weights to scale the acoustic scores and tone scores. Experimental results show that tone recognition rates and continuous speech recognition accuracy can be improved by the discriminatively trained tone model. Performance of a large vocabulary continuous Mandarin speech recognition system can be further enhanced by the discriminatively trained weight combinations due to a better interpolation of the given models. 展开更多
关键词 discriminative training minimum phone error tone modeling Mandarin speech recognition
下载PDF
Discriminative training of GMM-HMM acoustic model by RPCL learning 被引量:1
2
作者 Zaihu PANG Shikui TU +2 位作者 Dan SU Xihong WU Lei XU 《Frontiers of Electrical and Electronic Engineering in China》 CSCD 2011年第2期283-290,共8页
This paper presents a new discriminative approach for training Gaussian mixture models(GMMs)of hidden Markov models(HMMs)based acoustic model in a large vocabulary continuous speech recognition(LVCSR)system.This appro... This paper presents a new discriminative approach for training Gaussian mixture models(GMMs)of hidden Markov models(HMMs)based acoustic model in a large vocabulary continuous speech recognition(LVCSR)system.This approach is featured by embedding a rival penalized competitive learning(RPCL)mechanism on the level of hidden Markov states.For every input,the correct identity state,called winner and obtained by the Viterbi force alignment,is enhanced to describe this input while its most competitive rival is penalized by de-learning,which makes GMMs-based states become more discriminative.Without the extensive computing burden required by typical discriminative learning methods for one-pass recognition of the training set,the new approach saves computing costs considerably.Experiments show that the proposed method has a good convergence with better performances than the classical maximum likelihood estimation(MLE)based method.Comparing with two conventional discriminative methods,the proposed method demonstrates improved generalization ability,especially when the test set is not well matched with the training set. 展开更多
关键词 discriminative training hidden Markov model rival penalized competitive learning Bayesian Ying-Yang harmony learning large vocabulary continuous speech recognition
原文传递
Recognition of Speech Based on HMM/MLP Hybrid Network
3
作者 黄心晔 马小辉 +2 位作者 李想 富煜清 陆佶人 《Journal of Southeast University(English Edition)》 EI CAS 2000年第2期26-30,共5页
This paper presents a new HMM/MLP hybrid network for speech recognition. By taking advantage of the discriminative training of MLP, the unreasonable model correctness assumption on the model correctness of the ML trai... This paper presents a new HMM/MLP hybrid network for speech recognition. By taking advantage of the discriminative training of MLP, the unreasonable model correctness assumption on the model correctness of the ML training in basic HMM can be overcome, and its discriminative ability and recognition performance can be improved. Experimental results demonstrate that the discriminative ability and recognition performance of HMM/MLP is apparently better than normal HMM. 展开更多
关键词 HMM/MLP hybrid network discriminative training speech recognition
下载PDF
Experimental Study of Discriminative Adaptive Training and MLLR for Automatic Pronunciation Evaluation 被引量:3
4
作者 宋寅 梁维谦 《Tsinghua Science and Technology》 SCIE EI CAS 2011年第2期189-193,共5页
A stronger canonical model was developed to improve the performance of automatic pronunciation evaluations. Three different strategies were investigated with speaker adaptive training to normalize variations among spe... A stronger canonical model was developed to improve the performance of automatic pronunciation evaluations. Three different strategies were investigated with speaker adaptive training to normalize variations among speakers, minimum phone error training to identify easily confused phones and maximum likelihood linear regression (MLLR) adaptation to compensate for accent variations between native and non-native speakers. The three schemes were combined to improve the correlation coefficient between machine scores and human scores from 0.651 to 0.679 on the sentence level and from 0.788 to 0.822 on the speaker level. 展开更多
关键词 discriminative adaptive training (DAT) speaker adaptive training (SAT) minimum phone error(MPE) automatic pronunciation evaluation (APE)
原文传递
Are Relatives of Depression Patients More Prone to the Disease?
5
《Bulletin of the Chinese Academy of Sciences》 2015年第4期254-254,共1页
Decreased hedonic and motivational capacity,namely anhedonia,is one of the cardinal features for patients with major depressive disorder(MDD).Recent studies suggest that this reduced ability to experience pleasure i... Decreased hedonic and motivational capacity,namely anhedonia,is one of the cardinal features for patients with major depressive disorder(MDD).Recent studies suggest that this reduced ability to experience pleasure is also a marker or endophenotype for MDD and represents a genetic predisposition to this disorder.Dr. 展开更多
关键词 Depression pleasure Raymond relatives displayed marker discrimination grant training Guangzhou
下载PDF
Discriminative tonal feature extraction method in mandarin speech recognition 被引量:1
6
作者 HUANG Hao ZHU Jie 《The Journal of China Universities of Posts and Telecommunications》 EI CSCD 2007年第4期126-130,共5页
To utilize the supra-segmental nature of Mandarin tones, this article proposes a feature extraction method for hidden markov model (HMM) based tone modeling. The method uses linear transforms to project Fo(fundamen... To utilize the supra-segmental nature of Mandarin tones, this article proposes a feature extraction method for hidden markov model (HMM) based tone modeling. The method uses linear transforms to project Fo(fundamental frequency) features of neighboring syllables as compensations, and adds them to the original Fo features of the current syUable. The transforms are discriminatively trained by using an objective function termed as "minimum tone error", which is a smooth approximation of tone recognition accuracy. Experiments show that the new tonal features achieve 3.82% tone recognition rate improvement, compared with the baseline, using maximum likelihood trained HMM on the normal F0 features. Further experiments show that discriminative HMM training on the new features is 8.78% better than the baseline. 展开更多
关键词 discriminative training tone recognition feature extraction Mandarin speech recognition
原文传递
Stream Weight Training Based on MCE for Audio-Visual LVCSR 被引量:1
7
作者 刘鹏 王作英 《Tsinghua Science and Technology》 SCIE EI CAS 2005年第2期141-144,共4页
In this paper we address the problem of audio-visual speech recognition in the framework of the multi-stream hidden Markov model. Stream weight training based on minimum classification error criterion is dis... In this paper we address the problem of audio-visual speech recognition in the framework of the multi-stream hidden Markov model. Stream weight training based on minimum classification error criterion is discussed for use in large vocabulary continuous speech recognition (LVCSR). We present the lattice re- scoring and Viterbi approaches for calculating the loss function of continuous speech. The experimental re- sults show that in the case of clean audio, the system performance can be improved by 36.1% in relative word error rate reduction when using state-based stream weights trained by a Viterbi approach, compared to an audio only speech recognition system. Further experimental results demonstrate that our audio-visual LVCSR system provides significant enhancement of robustness in noisy environments. 展开更多
关键词 audio-visual speech recognition (AVSR) large vocabulary continuous speech recognition (LVCSR) discriminative training minimum classification error (MCE)
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部