期刊文献+

基于幅度压缩滤波的清浊音分类及基音估计 被引量:4

Voiced/Unvoiced Classification and Pitch Estimation Based on Amplitude Compression Filter
下载PDF
导出
摘要 该文针对传统算法在实环境(不同噪声类型和信噪比)下容易发生清浊误判和基音估计错误问题,提出一种基于幅度压缩基音估计滤波(PEFAC)的清浊音分类及基音估计方法。首先,通过PEFAC削弱语音的低频噪声,提取出基音谐波;然后,采用基于对称平均幅度和函数的脉冲序列加权算法(SIM)确定谐波数目;最后,利用动态规划估计出基音,用基于3元素特征矢量的高斯混合模型对清浊音进行分类。仿真结果表明,在实环境下,所提方法能有效抑制清浊误判及基音估计错误现象的发生,性能优于传统方法。 A method of voiced/unvoiced classification and pitch estimation based on Pitch Estimation Filter with Amplitude Compression(PEFAC) is proposed in this paper. The method first attenuates strong noise components at the low frequencies based on PEFAC and extracts pitch harmonic from noisy speech in the log-frequency domain. Then, the harmonic number associated with the pitch harmonic is determined by Symmetric average magnitude sum function weighted Impulse-train Matching(SIM) scheme in time domain. A pitch tracking scheme using dynamic programming is applied to select the pitch candidates and a voiced speech probability is computed from the likelihood ratio of Gaussian Mixture Models(GMMs) classifiers based on 3-element feature vector. The simulated results show that the proposed method efficiently reduces voiced/unvoiced and pitch estimation error, and it is superior to some of the state-of-the–art method in the real environment.
出处 《电子与信息学报》 EI CSCD 北大核心 2016年第3期586-593,共8页 Journal of Electronics & Information Technology
基金 国家自然科学基金(61271248) 湖州市自然科学基金(2015YZ04)~~
关键词 语音信号处理 基音 幅度压缩基音估计滤波 对称平均幅度和函数 高斯混合模型 噪声语音 Speech signal processing Pitch Pitch Estimation Filter with Amplitude Compression(PEFAC) Symmetric average magnitude sum function Gaussian Mixture Model(GMM) Noise speech
  • 相关文献

参考文献20

  • 1RABINER L, CHENG M, ROSENBERG A E, et al. Acomparative performance study of several pitch detection algorithms[J]. IEEE Transactions on Acoustics, Speech andSignal Processing, 1976, 24(5): 399-418.
  • 2VEPREK P and SCORDILIS M S. Analysis, enhancement and evaluation of five pitch determination techniques[J]. Speech Communication, 2002, 37(3): 249-270.
  • 3HAN Kun and Wang Deliang. Neural network based pitch tracking in very noisy speech[J[. IEEE/ACM Transactions on Audio, speech, and Language Processing, 2014, 22(12): 2158-2168.
  • 4MOLINA E, TARDON L J, BARBANCHO A M, et al. SiPTH: Singing transcription based on hysteresis defined on the pitch-time curve[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2015, 23(2): 252-263.
  • 5DUAN Zhiyao, HAN Jinyu, and PARDO B. Multi-pitch streaming of harmonic sound mixtures[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2014, 22(1): 138-150.
  • 6CHEN Yujui, WEI Chengwen, CHIANG Yifan, et al. Neuromorphic pitch based noise reduction for monosyllable heaving aid system application[J]. IEEE Transactions on Circuits and Systems, 2014, 61(2): 463-475.
  • 7王玥,钱志鸿,张营.基于扩展谱相减的RCAF基音周期检测算法[J].电子与信息学报,2009,31(5):1161-1165. 被引量:6
  • 8SHIMAMURA T and KOBAYASHI H. Weighted autocorrelation for pitch extraction of noisy speech[J]. IEEE Transactions on Speech and Audio Processing, 2001, 9(7): 727-730.
  • 9徐敬德,常亮,崔慧娟,唐昆.基于频域和时域结合的基音周期提取算法[J].清华大学学报(自然科学版),2012,52(3):413-415. 被引量:4
  • 10SHAHNAZ C, ZHU W P, and AHMAD M O. Pitch estimation based on a harmonic sinusoidal autocorrelation model and a time-domain matching scheme[J]. IEEE Transactions on Audio, Speech, and Language Processing, 2012, 20(1): 322-335.

二级参考文献30

  • 1A.V奥本海姆 黄建国等(译).离散时间信号处理[M].北京:科学出版社,1998..
  • 2杨行逡 迟惠生 等.语音信号数字处理[M].北京:电子工业出版社,1995..
  • 3Ney H.A dynamic programming technique for nonlinear smoothing[C]//International Conf on Acoustics,Speech,and Signal Processing.Atlanta,USA:IEEE,1981:62-65.
  • 4Kumar K,Jain J.Speech pitch shifting using complex continuous wavelet transform[C]//Annual IEEE India Conference.New Delhi,India,2006:1-4.
  • 5Shelby G A,Cooper C M,Adhami R R A.Wavelet-based speech pitch detector for tone languages[C]//IEEE-SP International Symposium on Time-Frequency and Time-Scale Analysis.Beijing,China,1994:596-599.
  • 6GAO Yanhua,ZHENG Guoqiang.Speech pitch period detection algorithm based on wavelet transform and spacial correlation function[C]//Electrical and International Conference on Control Engineering.Jinan,China,2010:5613-5616.
  • 7Hasan K,Shahnaz C,Fatath S A.Determination of pitch of noisy speech using dominant harmonic frequency[C]//IEEE-SP International Symposium on Circuits and Systems(ISCAS03).Bangkok,Thailand,2003,2:556-559.
  • 8Gu Y H.HMM-based noisy speech pitch contour estimation[C]//IEEE International Conference on Acoustics,Speech,and Signal Processing.New York,USA,1992,2:21-24.
  • 9HUANG Dongyan,LIN Weisi,Rahardja S.Speech pitch detection in noisy environment using multi-rate adaptive lossless FIR filters[C]//International Symposium on Circuits and Systems.Vancouver,Canada,2004,3:429-432.
  • 10Plante F,Meyer G F.A pitch extraction reference database[C]//European Conference on Speech Communication and Technology.Madrid,Spain,1995:837-840.

共引文献61

同被引文献24

引证文献4

二级引证文献11

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部