Bandwidth Expansion of Speech Based on Vector Quantization

Cited by: 4

Abstract: This paper improves the traditional vector quantization (VQ) based bandwidth expansion method, in which the high-band spectral envelope is predicted by codebook mapping. A re-quantization method is proposed for training the codebook, and the gain is adjusted using the codebook together with the voicing degree. First, the narrowband input signal is classified into three classes (unvoiced, voiced, and silence) according to voicing degree and energy. Each class then uses its own codebook, and the narrowband spectral envelope is converted into a high-band spectral envelope by VQ: the high-band envelope is taken from the high-band code vector that best matches, in shape, the spectral envelope of the narrowband input frame under analysis. Next, an excitation signal (Gaussian white noise) and the reconstructed high-band spectral envelope are used to synthesize the high-band speech. Finally, the high-band signal is added to the original narrowband signal to form the wideband output. Simulations and comparisons with other methods indicate that the proposed method has a low computational cost and is suitable for real-time signal processing.
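The pipeline described in the abstract (classify the frame, map its envelope through a class-specific codebook, then shape white noise with the mapped envelope) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not define the voicing-degree measure, so energy plus zero-crossing rate stand in for it; the high-band envelope is assumed to be stored as all-pole (LPC) coefficients; and the function names and thresholds (`classify_frame`, `map_envelope`, `synthesize_highband`, `silence_energy`, `voiced_zcr`) are hypothetical. The re-quantization training step and the voicing-based gain adjustment are not shown.

```python
import math
import random


def classify_frame(frame, silence_energy=1e-4, voiced_zcr=0.25):
    """Label a frame as 'silence', 'voiced', or 'unvoiced'.

    Assumption: mean energy and zero-crossing rate stand in for the
    paper's voicing-degree measure; both thresholds are made up.
    """
    energy = sum(x * x for x in frame) / len(frame)
    if energy < silence_energy:
        return "silence"
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a < 0) != (b < 0))
    zcr = crossings / (len(frame) - 1)
    return "voiced" if zcr < voiced_zcr else "unvoiced"


def map_envelope(nb_envelope, codebook):
    """Codebook mapping: find the narrowband code vector nearest to the
    input envelope (squared Euclidean distance) and return the high-band
    vector stored alongside it.

    `codebook` is a list of (narrowband_vector, highband_vector) pairs.
    """
    def dist2(u, v):
        return sum((a - b) ** 2 for a, b in zip(u, v))

    _, highband = min(codebook, key=lambda pair: dist2(pair[0], nb_envelope))
    return highband


def synthesize_highband(lpc, gain, n_samples, rng):
    """Shape Gaussian white noise with the all-pole filter 1/A(z),
    where A(z) = 1 + sum_k lpc[k-1] * z^-k (assumed representation)."""
    out = []
    for _ in range(n_samples):
        y = gain * rng.gauss(0.0, 1.0)  # white-noise excitation sample
        for k, a in enumerate(lpc, start=1):
            if len(out) >= k:
                y -= a * out[-k]  # feedback from earlier output samples
        out.append(y)
    return out


# Usage: one codebook per class, selected by the frame label.
codebooks = {
    "voiced": [([0.0, 0.0], [0.5]), ([1.0, 0.0], [-0.5])],
    "unvoiced": [([0.0, 1.0], [0.2])],
}
frame = [math.sin(2 * math.pi * 100 * n / 8000) for n in range(160)]
label = classify_frame(frame)
if label != "silence":
    hb_lpc = map_envelope([0.9, 0.1], codebooks[label])
    highband = synthesize_highband(hb_lpc, 0.1, len(frame), random.Random(0))
```

In a full system the synthesized high band would still have to be brought to the wideband sampling rate and added to the upsampled narrowband signal; that step is omitted here.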

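Authors: Lang Yue (郎玥), Zhao Shenghui (赵胜辉), Kuang Jingming (匡镜明)

Affiliation: Department of Electronic Engineering, School of Information Science and Technology, Beijing Institute of Technology

Source: Transactions of Beijing Institute of Technology (《北京理工大学学报》; indexed in EI, CAS, CSCD, and the Peking University Core Journals list), 2005, No. 3, pp. 260-264 (5 pages)

Funding: International cooperation project with Ericsson

Keywords: vector quantization; bandwidth expansion; speech signal processing

Classification: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]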
