一种基于动态平滑的实时基频提取算法被引量：1

A dynamic smoothing based real-time pitch detection algorithm

下载PDF

导出

摘要基频是语音信号处理中的一个基本声学特征。传统的基频提取算法为了获得较好的检测效果,需要复杂的时频域计算。对于资源受限的应用条件,例如人工耳蜗等嵌入式实时系统,很难应用计算量大的基频提取算法。语音信号的基频具有短时平稳性,根据这个特点来确定基频候选值可以提高提取的准确性。据此,提出一种基于动态平滑的基频提取算法,使用此算法对汉语声调词库进行基频提取,并与另外两种基频提取算法进行比较。实验结果表明,新算法的基频绝对平均估计误差小于3Hz,优于另两种算法,能够准确地提取基频,同时算法计算量低,适合实时应用。 Fundamental frequency is one of the most important features in speech signal processing.Traditional pitch detection algorithms（PDA） can hardly be applied in the resource-limited hardware system due to the computation complexity.A dynamic smoothing based pitch detection algorithm is proposed in this paper.As pitch is a physical quantity that does not change rapidly,using continuous speech frames to decide the best candidate of fundamental frequency can improve the accuracy.An objective experiment was carried out to compare the pitch detection accuracy of the DSPDA with two other algorithms.The experimental results show that the averaged pitch detection error is 3Hz lower than that of other algorithms.

作者胡海洋原猛冯海泓

机构地区中国科学院声学研究所东海研究站中国科学院研究生院

出处《声学技术》 CSCD 2012年第6期583-588,共6页 Technical Acoustics

基金国家自然科学基金资助项目(11104316) 上海自然科学基金资助项目(11ZR1446000)

关键词基频提取动态平滑实时处理 pitch detection dynamic smoothing real-time

分类号 N912.3 [自然科学总论]

引文网络
相关文献

参考文献16

1Milczynski Matthias, et al. Perception of Mandarin Chinese with cochlear implants using enhanced temporal pitch cues[J]. Hearing Research, 2012, 285(1-2): 1-12.
2Yuan Meng, Lee Tan. Cantonese tone recognition with enhanced temporal periodicity cues[J]. J. Acoust. Soc. Am., 2009, 126(1): 327-337.
3Alain de Cheveigne, Hideki K. YIN, a fundamental frequency estimator for speech and music[J]. J. Acoust. Soc. Am, 2002, 111(4): 1917-1930.
4Talkin David. A robust algorithm for pitch tracking (RAPT)[A]. Kleijn W. B., Paliwal K. K. Speech Coding and Synthesis[C]//Elsevier Science B.V. 1995: 495-518.
5NoU A. M. Cepstrum Pitch Determination[J]. J. Acoust. Soc. Am. 1967, 41(2): 293-309.
6Klapuri Anssi. Pitch estimation using multiple independent time-freuqncy windows[C]//New Paltz, New York: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Oct. 17-20, 1999: 115-118.
7Ney Hermann. A dynamic programming technique for nonlinear smoothing[C]// Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP 81. Apr 1981, 6: 62-65.
8Boersma Paul. Praat, a system for doing phonetics by computer[C]// Glot International 5:9/10, 2001: 341-345.
9Van Immerseel LucM., Martens Jean E Pitch and voiced/unvoiced determination with an auditory model[J]. J. Acoust. Soc. Am, 1992, 91(6): 3511-3526.
10Licldider J.C.R., Pollack I. Effects of differentiation, Intergration, and Infinite Peak Clipping upon the Intelligibility of Speech[J]. J. Acoust. Soc. Am, 1948, 20(1): 42-50.

同被引文献27

1吴玺宏.声纹识别听声辨人[J].计算机世界,2001,(8):14.
2Ding H,Soon Y,Yeo C K.A DCT-based speech enhancement system with pitch synchronous analysis.Audio,Speech,and Language Processing,IEEE Transactions on,2011;19(8):2614-2623.
3Chen J H,Kao Y A.Pitch marking based on an adaptable filter and a peak-valley estimation method.Computational Linguistics and Chinese Language Processing,2001;6(5):1-12.
4Geckinli N,Yavuz D.Algorithm for pitch extraction using zero-crossing interval sequence.Acoustics,Speech and Signal Processing,IEEE Transactions on,1977;25(6):559-564.
5俞翠华.含噪语音信号的基音提取算法研究.南京信息工程大学,2011.
6Ahmadi S,Spanias A S.Cepstrum-based pitch detection using a new statistical V/UV classification algorithm.Speech and Audio Processing,IEEE Transactions on,1999;7(3):333-338.
7Hermes D J.Measurement of pitch by subharmonic summation.The Journal of The Acoustical Society of America,1988;83(1):257-264.
8Cao C,Li M,Liu J et al.Singing melody extraction in polyphonic music by harmonic tracking.In:Proc.8th International Conference on Music Information Retrieval(ISMIR),2007:373-374.
9Jin Z,Wang D L.HMM-based multipitch tracking for noisy and reverberant speech.Audio,Speech,and Language Processing,IEEE Transactions on,2011;19(5):1091-1102.
10Ellis D P W,Poliner G E.Classification-based melody transcription.Machine Learning,2006;65(2-3):439-456.

引证文献1

1宋黎明,李明,颜永红.谐波显著度的基频提取方法[J].声学学报,2015,40(2):294-299. 被引量：5

二级引证文献5

1张宇,杨帅,黄楠木,李琳.高速摄影成像分析声带振动发声的前后不对称性[J].声学学报,2017,42(3):341-347. 被引量：1
2Tan Xinjie,Cui Jizhe.A Review of Audio Gene Recognition Copyright Protecting Technology[J].计算机科学与技术汇刊（中英文版）,2017,6(1):8-15.
3杨贵福,夏一鸣,冉华,冯永平,孙慧.基于优化能量值门限和增强倍频效应的抗噪基音检测算法[J].东北师大学报（自然科学版）,2019,51(1):63-70.
4后方帅,黎美琪,刘若伦.利用谐波显著度和语者音色特征的混合语音中目标人基频轨迹提取[J].声学技术,2019,38(4):408-413. 被引量：3
5章森,曹瑞兴,邓海刚.一种稳定、精准、实时的语音信号基频的检测与提取算法[J].图像与信号处理,2020,9(4):246-255.

1米小妍.人工耳蜗——再造的听觉器官[J].高中生（高考）,2003(2):26-29.
2彭玉灵.嵌入式实时系统及中国RTOS的发展[J].四川大学学报（自然科学版）,2004,41(z1):197-201.
3张云云.自动汉字词组扫描表生成与加载[J].陕西科技大学学报（自然科学版）,1992,11(2):26-29.
4Min HAN,Di Rong CHEN,Zhao Xu SUN.Rademacher Complexity in Neyman-Pearson Classification[J].Acta Mathematica Sinica,English Series,2009,25(5):855-868.
5裴道国.中小检验检疫机构如何搞好实验室建设[J].中国检验检疫,2008(6):18-18. 被引量：1
6高军,王华伟,赵美.系统思考中的模型与方法研究[J].系统科学学报,2016,24(1):48-52. 被引量：4
7张青贵.从观察数据确定李雅普诺夫指数谱算法的一种改进[J].系统工程与电子技术,1998,20(9):66-70. 被引量：3
8李硕,李冰洋,王蜜.小波变换及其在语音信号处理中的应用[J].哈尔滨师范大学自然科学学报,2006,22(5):21-24. 被引量：4
9王黎明,钟琦.面向科学传播的媒体监测方法研究[J].科普研究,2016,11(4):27-34. 被引量：2
10张丽,陈志强,高文焕,康克军.均值加速的快速中值滤波算法[J].清华大学学报（自然科学版）,2004,44(9):1157-1159. 被引量：54

声学技术

2012年第6期

浏览历史

内容加载中请稍等...

一种基于动态平滑的实时基频提取算法被引量：1

参考文献16

同被引文献27

引证文献1

二级引证文献5

相关作者

相关机构

相关主题

浏览历史

一种基于动态平滑的实时基频提取算法 被引量：1

参考文献16

同被引文献27

引证文献1

二级引证文献5

相关作者

相关机构

相关主题

浏览历史

一种基于动态平滑的实时基频提取算法被引量：1