Spatial correlation transformation for speech recognition (cited by: 2)

Abstract: The classical hidden Markov model (HMM) ignores the dependencies between speech signals. To address this, the paper proposes a linear feature transformation, the spatial correlation transformation, which exploits the correlation between different speech units of the same speaker (spatial correlation) to obtain new features with better discriminative power. The optimal transformation matrix is obtained under the minimum covariance criterion. The recognition system then runs the Viterbi search with the new features and their model parameters in place of the original features and model parameters. The key to implementing the transformation is computing the optimal transformation matrix, and two algorithms are proposed for this purpose. Experimental results show that, on a speaker-independent recognition system, the method outperforms adaptation methods, and that combining it with adaptation methods improves system performance further.

Authors: Su Teng-rong (苏腾荣), Wu Ji (吴及), Wang Zuo-ying (王作英)

Affiliation: Department of Electronic Engineering, Tsinghua University

Source: Journal of Tsinghua University (Science and Technology) (《清华大学学报（自然科学版）》), 2009, No. 10, pp. 1655-1659 (5 pages). Indexed in: EI, CAS, CSCD, Peking University Core (北大核心).

Keywords: speech recognition; spatial correlation; feature transformation; minimum covariance

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]

Citation network

References (6)

1. Young S. Statistical modelling in continuous speech recognition [C]// Proc International Conference on Uncertainty in Artificial Intelligence, Seattle, USA, 2001.
2. Leggetter C J, Woodland P C. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models [J]. Computer Speech and Language, 1995, 9(2): 171-185.
3. Gales M J F, Woodland P C. Mean and variance adaptation within the MLLR framework [J]. Computer Speech and Language, 1996, 10(4): 249-264.
4. Hazen T. The use of speaker correlation information for automatic speech recognition [D]. Cambridge, MA: Massachusetts Institute of Technology, Jan 1998.
5. Kuhn R, Junqua J C, Nguyen P, et al. Rapid speaker adaptation in eigenvoice space [J]. IEEE Trans on Speech and Audio Processing, 2000, 8(6): 695-707.
6. 王作英. An HMM speech recognition model based on duration distribution [C]. In: 2nd National Conference on Chinese Character and Speech Recognition (第二届全国汉字语音识别会议), Lushan, 1989.

Co-citing documents (1)

1. 鄢翔, 王作英. Application of the A* algorithm in two-stage word-graph search [J]. 计算机工程与应用, 2002, 38(22): 78-80. Cited by: 1

Co-cited documents (15)

1. Leggetter C J and Woodland P C. Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models. Computer Speech and Language, 1995, 9(2): 171-185.
2. Kuhn R, Junqua J C, Nguyen P, et al. Rapid speaker adaptation in eigenvoice space. IEEE Transactions on Speech and Audio Processing, 2000, 8(6): 695-707.
3. Anastasakos Tasos, McDonough John, and Makhoul John. Speaker adaptive training: A maximum likelihood approach to speaker normalization. Proceedings of ICASSP, Munich, Germany, 1997: 1043-1046.
4. Sinha R, Gales M J F, et al. The CU-HTK Mandarin broadcast news transcription system. Proceedings of ICASSP, Toulouse, France, 2006: 1077-1080.
5. Ng Tim, et al. Progress in the BBN 2007 Mandarin speech to text system. Proceedings of ICASSP, Las Vegas, USA, 2008: 1537-1540.
6. Su Teng-rong, Wu Ji, and Wang Zuo-ying. Spatial correlation transformation based on minimum covariance. Proceedings of ICASSP, Las Vegas, USA, 2008: 4697-4700.
7. 吕艳新, 孙书学, 顾晓辉. Classification and recognition of battlefield acoustic targets based on EMD and energy ratio [J]. 振动与冲击, 2008, 27(11): 51-55. Cited by: 17
8. 陈湘涛, 李明亮, 陈玉娟. A survey of applied research on clustering based on time-series similarity [J]. 计算机工程与设计, 2010, 31(3): 577-581. Cited by: 27
9. 曾番, 鹿光, 李国宏. Feature extraction for battlefield passive acoustic targets based on wavelet packet analysis [J]. 弹箭与制导学报, 2010, 30(2): 240-242. Cited by: 1
10. 祁瑞华, 杨德礼, 胡润波. A naive belief classification model weighted by correlation coefficients [J]. 计算机工程与设计, 2010, 31(22): 4824-4826. Cited by: 1

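Citing documents (2)

1. 苏腾荣, 吴及, 王作英. Acoustic model training based on spatial correlation transformation [J]. 电子与信息学报, 2010, 32(4): 1003-1007.
2. 李国, 韩学良, 段钢. Research on aircraft noise recognition methods and their FPGA implementation [J]. 计算机工程与设计, 2014, 35(3): 835-840.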

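1. 苏腾荣, 吴及, 王作英. Acoustic model training based on spatial correlation transformation [J]. 电子与信息学报, 2010, 32(4): 1003-1007.
2. 胡庆辉, 阮晓霞. Gaussian mixture model clustering with MCD-based initialization [J]. 桂林航天工业学院学报, 2016, 21(1): 1-6. Cited by: 4
3. 姜志威, 丁晓青, 彭良瑞. Character model optimization for segmentation-free Uyghur text line recognition [J]. 清华大学学报（自然科学版）, 2015, 55(8): 873-877. Cited by: 3
4. 张蓓, 王顺芳. A PCA face recognition algorithm based on MCD robust estimation [J]. 计算机工程与设计, 2015, 36(3): 778-782. Cited by: 11
5. 王树义. The relationship between the correlation transformation of an algorithm and the number of partition bands [J]. 大连理工大学学报, 1995, 35(3): 422-424. Cited by: 1
6. 曹玉东. Research on search strategies in speech recognition [J]. 攀枝花学院学报, 2007, 24(3): 46-49.
7. 王树义, 钱达源. Optimization strategies for the automatic synthesis of systolic array algorithms [J]. 计算机学报, 1996, 19(9): 661-667. Cited by: 3
8. 周德全, 郭耀红. Radar target recognition using a neural network classifier within the HMM framework [J]. 红外与毫米波学报, 2001, 20(2): 107-110. Cited by: 1
9. 蔡云飞, 唐振民, 张浩峰. A multi-robot cooperative hunting strategy based on Cross-EKF localization [J]. 控制与决策, 2010, 25(9): 1313-1317. Cited by: 5
10. 胡庆武, 艾明耀, 殷万玲, 袁辉. A fully automatic stitching method for UAV images with large rotation angles [J]. 计算机工程, 2012, 38(15): 152-155. Cited by: 8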
