基于Gauss混合模型的清浊音恢复改进算法被引量：1

Improved recovery algorithm for unvoiced/voiced parameters based on GMM

导出

摘要为提高子带清浊音(unvoiced/voiced,U/V)解码端恢复算法在不同能量电平下的鲁棒性,提出了一种改进型能量自适应U/V参数解码端恢复算法。通过跟踪长时能量的变化轨迹,在Gauss混合模型(Gaussian mixed model,GMM)下,用归一化的能量参数和线谱频率参数(line spec-tral frequency,LSF)对U/V参数的分布特性进行估计。测试结果表明:在较低的能量电平下,与用绝对能量对U/V参数进行恢复的算法相比,该能量自适应U/V参数恢复算法能够将清浊音误判率降低10%～25%,并将合成语音的平均意见得分(mean opinion score,MOS)提高0.03～0.09,改善了算法的性能。 The robustness of an unvoiced/voiced （U/V） speech classification recovery algorithm is improved by an energy self-adaption algorithm for the recovery of the U/V parameter. The algorithm traces the long-time changes of the energy level to estimate the statistical distribution of the U/V parameter from the normalized energy and the line spectral frequency （LSF） parameters based on the Gaussian mixed model （GMM）. Tests show that for relatively low energy levels, this energy self-adaption algorithm reduces the U/V classification error rate by 10% - 25% and improves the mean opinion score （MOS） of the synthesized speech signal by about 0.03 - 0.09 compared to the original method which uses the absolute energy value.

作者计哲徐敬德常亮崔慧娟唐昆

机构地区清华大学电子工程系

出处《清华大学学报（自然科学版）》 EI CAS CSCD 北大核心 2011年第11期1751-1755,共5页 Journal of Tsinghua University(Science and Technology)

基金国家自然科学基金资助项目(60572081)

关键词语音编码 Gauss混合模型特征参数线谱频率清浊音参数 speech coding Gaussian mixed model （GMM） characteristic parameter line spectral frequency （LSF）unvoiced/voiced （U/V） parameter

分类号 TN912.32 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献11

1Kondoz A M. Digital Speech: Coding for Low Bit Rate Communication Systems [M]. Chichester, UK: John Wiley & Sons, 2004.
2Paliwal K K, Kleijn W B. Quantization of LPC parameters[C]//Speech Coding and Synthesis. Amsterdam, the Netherlands: Elsevier Science, 1995: 433- 466.
3Paliwal K K, Atal B S. Efficient vector quantization of LPC parameters at 24 bits/frame [J]. IEEE Trans Speech Audio Processing, 1993, 1(1): 3-14.
4魏旋,党晓妍,崔慧娟,唐昆.基于Gauss混合模型的清浊音解码端恢复算法[J].清华大学学报（自然科学版）,2010,50(1):79-82. 被引量：4
5洪侃,李晔,崔慧娟,唐昆.基于子带清浊音模式的声码器增益参数抗误码算法[J].清华大学学报（自然科学版）,2008,48(10):1621-1624. 被引量：2
6Ovens M J, Ponting K M, Turner M E, et al. Ultra low bit rate voice coding [C]// Speech Coding for Algorithms for Radio Channels. London, UK: IEE, 2000: 97- 111.
7李晔.低速率语音编码技术与算法研究[D].北京:清华大学,2009.
8Theodoridis S, Koutroumbas K. Pattern Recognition [M]. 4th Ed. London, UK: Academic Press, 2008.
9李军林.低速率语音编码算法研究[D].北京:清华大学,2004.
10Plante F, Meyer G F. A pitch extraction reference database [C]// European Conf on Speech Communication and Technology. Madrid, Spain, 1995:837-840.

二级参考文献13

1Farvardin N. A study of vector quantization for noisy channels [J]. IEEE Trans Inform Theory, 1993, 39(3)I 799 - 809,
2Farvardin N. On the performance and complexity of channel-optimized vector quantizers[J].IEEE Trans Inform Theory, 1991, 37(1) : 155 - 160.
3De Marca J R B, Jayant N S. An algorithm for assigning binary indices to the code vectors of multi-dimensional quantizer[C]//IEEE Int Comm Conf Seattle. WA: IEEE, 1987: 1128- 1132.
4Ovens M J, Ponting K M, Turner, M E. Ultra low bit rate voice coding [C] // Speech Coding for Algorithms :for Radio Channels, IEE Seminar, London, UK, 2000: 97- 111.
5Wei X, Dang X, Cui H, et al. Voiced/unvoiced classification recovery in the speech decoder based on GMM [C]//ICSP, IEEE, 2008: 546-548.
6McCree V, Barnwell T. A mixed excitation LPC vocoder model for low bit rate speech coding [J]. IEEE Trans on Speech Audio Processing, 1995, 3(4) : 242 - 250.
7Deng H, O'Shaughnessy D. Voiced-unvoiced-silence speech sound classification based on unsupervised learning [C] // International Conf on Multimedia Expo. Beijing: IEEE, 2007: 176-179.
8Theodoridis S, Koutroumbas K. Pattern Recognition (Third Edition) [M]. Beijing: China Machine Press, 2006.
9Plante F, Meyer G F. A pitch extraction reference database [C] // European Conf on Speech Communication and Technology. Madrid, 1995 : 837 - 840.
10李晔,洪侃,王童,崔慧娟,唐昆.声码器基音周期参数抗差错算法[J].清华大学学报（自然科学版）,2008,48(1):82-84. 被引量：2

共引文献10

1李晔,洪侃,王童,崔慧娟,唐昆.正弦激励线性预测声码器子带清浊音模糊判决[J].清华大学学报（自然科学版）,2008,48(7):1101-1103. 被引量：4
2崔慧娟,李晔,洪侃,唐昆.基音周期与带通浊音度参数联合量化算法[J].清华大学学报（自然科学版）,2008,48(10):1594-1596.
3龚利衡,盛玉霞,唐昆,崔慧娟.数字对讲机语音编解码算法改进与优化[J].通信技术,2009,42(5):77-79. 被引量：4
4计哲,李晔,崔慧娟,唐昆.SELP 2.4kb/s语音编码算法跳跃帧判决及处理[J].清华大学学报（自然科学版）,2009(8):1152-1155. 被引量：1
5唐昆,李晔,徐敬德,崔慧娟.基音周期矢量量化中权重系数的计算[J].清华大学学报（自然科学版）,2010,50(4):569-571.
6孙致钊,李晔,徐敬德,崔慧娟,唐昆.SELP声码器参数抗差错恢复算法[J].清华大学学报（自然科学版）,2010,50(5):780-783. 被引量：1
7徐敬德,常亮,计哲,崔慧娟,唐昆.基于码字特征的多模式多级矢量量化算法[J].清华大学学报（自然科学版）,2011,51(2):172-175. 被引量：2
8计哲,徐敬德,崔慧娟,唐昆.基于SELP声码器的连续丢包隐藏算法[J].清华大学学报（自然科学版）,2010,50(12):2003-2006.
9常亮,徐敬德,崔慧娟,唐昆.基于SELP的150b／s语音压缩编码算法[J].清华大学学报（自然科学版）,2013,53(7):967-971. 被引量：2
10徐静云,赵晓群,蔡志端,王培良.基于胞腔均匀度的清浊模式码书设计算法[J].计算机应用,2016,36(12):3374-3377. 被引量：1

同被引文献12

1赵铭,崔慧娟,唐昆,杜文.谱包络参数的平滑算法[J].清华大学学报（自然科学版）,2005,45(4):448-451. 被引量：5
2李哗.低速率语音编码技术与算法研究[D].北京:清华大学,2009.
3Tsao C, Gray R M. Matrix quantizer design for LPC speech using the generalized Lloyd algorithm [J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1985, 33(3): 537-545.
4Zhao M, Tang K, Cui H. Mode-based quantization of LP parameters for very low bit rate vocoder [C]// International conference on Communications, Circuits and Systems and West Sino Expositions. Chengdu, China: IEEE Press, 2002 : 28 - 31.
5Eriksson T, Linden J, Skoglund J. Intcrframe LSF quantization for noisy channels [J]. IEEE Transactions on Speech and Audio Processing, 1999, 7(5) : 495 - 509.
6JIANG Hao, CUI Huijuan, TANG Kun. Sinusoidal excitation LPC vocoder [J]. Chinese Journal of Electronics, 1998, 7(3), 296-300.
7Theodoridis S, Koutroumbas K. Pattern Recognition [M]. 3rd ED. Beiiing: China Machine Press, 2006.
8何洪华.超低速率语音编码算法研究[D].北京:清华大学,2011.
9李晔,彭坦,许明,计哲,崔慧娟,唐昆.带有帧间级间预测的线谱频率参数多级矢量量化[J].清华大学学报（自然科学版）,2009(7):981-983. 被引量：9
10魏旋,党晓妍,崔慧娟,唐昆.基于Gauss混合模型的清浊音解码端恢复算法[J].清华大学学报（自然科学版）,2010,50(1):79-82. 被引量：4

引证文献1

1常亮,徐敬德,崔慧娟,唐昆.基于SELP的150b／s语音压缩编码算法[J].清华大学学报（自然科学版）,2013,53(7):967-971. 被引量：2

二级引证文献2

1杨亚涛,张松涛,马潇.基于AMBE-1000的无线数字语音通信系统设计[J].北京电子科技学院学报,2016,24(4):66-72. 被引量：1
2孙凤梅,薛颜,李克靖.基于TMS320F28335的声码器设计与实现[J].电子设计工程,2018,26(20):183-187. 被引量：2

1计哲,高圣翔,唐昆,金鑫.能量参数解码端HMM估计算法[J].清华大学学报（自然科学版）,2013,53(6):869-872.
2赵永刚,唐昆,崔慧娟.预测自适应Gauss混合模型线谱频率的量化[J].清华大学学报（自然科学版）,2007,47(4):530-533.
3陈亮1,陈亮2,郑静华,张翼鹏,庞亮.改进的语音子带清浊音参数量化算法[J].军事通信技术,2013(4):49-53.
4“触手司及”的5X悬疑视频揭秘OPPO拍照“黑科技”[J].新潮电子,2017,0(3):9-9.
5魏旋,党晓妍,崔慧娟,唐昆.基于Gauss混合模型的清浊音解码端恢复算法[J].清华大学学报（自然科学版）,2010,50(1):79-82. 被引量：4
6IP电话的通话质量评价[J].通信工程,2004(2):49-49.
7彭坦,龚晨,李晔,洪侃,崔慧娟,唐昆.语音编码抗信道误码保护算法[J].高技术通讯,2008,18(5):452-457.
8计哲,徐敬德,崔慧娟,唐昆.基于SELP声码器的连续丢包隐藏算法[J].清华大学学报（自然科学版）,2010,50(12):2003-2006.
9刘鑫,鲍长春.基于耳蜗滤波器倒谱参数的音频频带扩展方法[J].清华大学学报（自然科学版）,2013,53(6):913-916. 被引量：1
10李晔,彭坦,许明,计哲,崔慧娟,唐昆.带有帧间级间预测的线谱频率参数多级矢量量化[J].清华大学学报（自然科学版）,2009(7):981-983. 被引量：9

清华大学学报（自然科学版）

2011年第11期

浏览历史

内容加载中请稍等...

基于Gauss混合模型的清浊音恢复改进算法被引量：1

参考文献11

二级参考文献13

共引文献10

同被引文献12

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

基于Gauss混合模型的清浊音恢复改进算法 被引量：1

参考文献11

二级参考文献13

共引文献10

同被引文献12

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

基于Gauss混合模型的清浊音恢复改进算法被引量：1