Pitch determination algorithm for telephone speech

Cited by: 2
Abstract: Pitch determination is a critical component of speech coding systems. Because the traditional autocorrelation function (ACF) based method performs poorly on telephone speech, this paper proposes a new pitch determination algorithm that combines time-domain and frequency-domain processing. In the time domain, a new parameter, the long-time average pitch (LTAP), is introduced. Using the LTAP together with the short-time stationarity of speech, the time-domain ACF is revised to eliminate lag values that cannot be the true pitch period. In the frequency domain, a frequency-domain ACF is computed, and the frequency-domain ACF value corresponding to each pitch candidate is incorporated into that candidate's weight, which increases the weight of the true pitch period. Experiments on the Keele speech database show that, compared with the traditional ACF-based method, the proposed algorithm reduces the gross pitch error rate by 46.8% for telephone speech, significantly improving pitch determination accuracy on telephone speech, and by 31.2% for normal speech.
Source: Journal of Tsinghua University (Science and Technology), 2013, No. 11, pp. 1548-1552, 1557 (6 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (60572081)
Keywords: pitch determination algorithm; telephone speech; autocorrelation function (ACF); time-domain revision; frequency-domain weighting
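The abstract describes the time-domain step only in words, so the following is a minimal Python sketch of the general idea, not the paper's actual formulas. It computes a normalized ACF over candidate lags (the traditional baseline) and optionally discards lags that lie far from a long-time average pitch. The 8 kHz sampling rate, the lag range of 20 to 147 samples, the `tolerance` parameter, and all function names are assumptions made for illustration. The frequency-domain weighting is omitted because the abstract gives no details of how it is computed.

```python
import math


def normalized_acf(frame, lag):
    """Normalized autocorrelation of `frame` at the given lag (range -1..1)."""
    n = len(frame) - lag
    num = sum(frame[i] * frame[i + lag] for i in range(n))
    e0 = sum(frame[i] ** 2 for i in range(n))
    e1 = sum(frame[i + lag] ** 2 for i in range(n))
    denom = math.sqrt(e0 * e1)
    return num / denom if denom > 0 else 0.0


def estimate_pitch_lag(frame, min_lag, max_lag, ltap=None, tolerance=0.4):
    """Return (lag, score) for the lag with the highest normalized ACF.

    If `ltap` (a long-time average pitch, in samples) is given, lags that
    deviate from it by more than `tolerance * ltap` are skipped. This
    pruning rule is a hypothetical stand-in for the paper's time-domain
    revision, which the abstract does not specify.
    """
    best_lag, best_score = None, -1.0
    for lag in range(min_lag, max_lag + 1):
        if ltap is not None and abs(lag - ltap) > tolerance * ltap:
            continue  # lag is implausible given the long-time average
        score = normalized_acf(frame, lag)
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag, best_score


def make_frame(period, length):
    """Synthetic periodic frame: three harmonics of a `period`-sample pitch."""
    w = 2.0 * math.pi / period
    return [
        math.sin(w * t) + 0.5 * math.sin(2 * w * t) + 0.3 * math.sin(3 * w * t)
        for t in range(length)
    ]


# Assumed setup: 8 kHz sampling, 160 Hz pitch (period 50 samples), 50 ms frame.
frame = make_frame(period=50, length=400)

# Baseline ACF: lag 100 (twice the period) scores as high as lag 50,
# so the baseline cannot reliably tell them apart on this frame.
baseline_lag, baseline_score = estimate_pitch_lag(frame, 20, 147)

# With an assumed long-time average pitch of 52 samples, lag 100 is pruned.
pruned_lag, pruned_score = estimate_pitch_lag(frame, 20, 147, ltap=52)
```

On a perfectly periodic frame the ACF peaks at every multiple of the true period, which is one source of gross errors (pitch halving) in the baseline. Restricting candidates to a neighborhood of a long-time average removes such multiples from consideration, at the cost of depending on how well that average tracks the speaker.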
