Bandwidth extension of audio signals based on cochlear filter cepstral coefficients

Cited by: 1

Abstract: Audio bandwidth extension artificially reconstructs, at the decoder, the high-frequency components missing from the received wideband signal in order to improve the perceived quality of the reproduced audio. This paper proposes a blind audio bandwidth extension method based on cochlear filter cepstral coefficients. The method emulates the peripheral auditory system, extracts cochlear filter cepstral coefficients to describe the spectral information of the wideband audio, and uses a Gaussian mixture model to estimate the high-frequency spectral envelope. Combined with a nearest-neighbor matching method that recovers the high-frequency spectral fine structure, it effectively extends wideband audio to super-wideband audio. Objective and subjective tests both indicate that the reconstructed audio quality is better than that of bandwidth extension methods based on conventional audio features.
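The envelope-estimation step in the abstract can be illustrated with a minimal sketch. It assumes a joint Gaussian mixture model over the concatenated vector z = [x; y], where x is a low-band feature vector (standing in for the cochlear filter cepstral coefficients) and y is the high-band envelope, and it uses the conditional mean E[y | x] as the estimate. The abstract does not state which estimator the authors used, so the conditional-mean rule, the function name `gmm_conditional_mean`, and all parameter names here are illustrative assumptions, and the model parameters are taken as already trained.

```python
import numpy as np


def gmm_conditional_mean(x, weights, means, covs, dx):
    """Return E[y | x] under a joint GMM over z = [x; y].

    Hypothetical helper, not the paper's implementation.
    x:       (dx,) low-band feature vector
    weights: (K,) mixture weights
    means:   (K, dx + dy) joint component means
    covs:    (K, dx + dy, dx + dy) joint component covariances
    dx:      dimension of x inside the joint vector
    """
    x = np.asarray(x, dtype=float)
    weights = np.asarray(weights, dtype=float)
    means = np.asarray(means, dtype=float)
    covs = np.asarray(covs, dtype=float)

    log_resp = np.empty(len(weights))
    cond_means = []
    for k in range(len(weights)):
        mu_x, mu_y = means[k, :dx], means[k, dx:]
        s_xx = covs[k, :dx, :dx]   # low-band covariance block
        s_yx = covs[k, dx:, :dx]   # cross-covariance block
        diff = x - mu_x
        solved = np.linalg.solve(s_xx, diff)  # S_xx^{-1} (x - mu_x)
        _, logdet = np.linalg.slogdet(s_xx)
        # log of weight_k * N(x; mu_x, S_xx), kept in the log domain
        log_resp[k] = np.log(weights[k]) - 0.5 * (
            diff @ solved + logdet + dx * np.log(2.0 * np.pi)
        )
        # per-component linear regression of y on x
        cond_means.append(mu_y + s_yx @ solved)

    # normalize the component posteriors in a numerically stable way
    resp = np.exp(log_resp - log_resp.max())
    resp /= resp.sum()
    return resp @ np.array(cond_means)
```

With a single component the estimate reduces to ordinary linear regression of y on x; with several components it is a posterior-weighted blend of per-component regressions, which is what lets the mixture capture a nonlinear mapping from low-band features to the high-band envelope.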

Authors: Liu Xin, Bao Changchun

Affiliation: School of Electronic Information and Control Engineering, Beijing University of Technology

Source: Journal of Tsinghua University (Science and Technology), 2013, No. 6, pp. 913-916 (4 pages). Indexed in: EI, CAS, CSCD, PKU Core.

Funding: National Natural Science Foundation of China (61072089, 60872027); Beijing University of Technology Graduate Science and Technology Fund (ykj-2012-7001)

Keywords: audio signal processing; audio bandwidth extension; cochlear filter cepstral coefficient; Gaussian mixture model; nearest neighbor mapping

Classification: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]



