摘要
基于多相滤波器组的语音基频检测方法 ,运用多相滤波器组分解语音信号频谱 ,然后利用声带震动的能量准周期性在各子带进行峰值搜索 ,并综合这些子带的搜索结果计算基音周期 ,最后根据先验知识以及一种新的清浊音判定方法对结果进行校正。基于标准
Past PDAs, to our best knowledge, were unable to predict accurately the pitch at any instant. We now propose a PDA based on multi phase filter bank that can do so. Section 1 discusses in much detail the design of multi phase filter bank. Essentially it explains how to decompose the speech spectrum for removing vocal tract effects. Fig.1 shows the prototype low pass filter. Section 1 also gives the procedure of bandwidth selection. Section 2 discusses in much detail the procedure for pitch detection. Essentially it performs peak searching in each sub band and gets decision for each sub band; it synthesizes all the sub band decisions to obtain the final detection results. Section 2 also gives the rules for correct peak searching. Section 3 proposes a new method for making voiced/unvoiced decision based essentially on the following fact: the energy spectrum of the voiced speech is different from that of the unvoiced speech. Experimental results given in section 4 on sentences chosen from TIMIT database show preliminarily that out new PDA based on multi phase filter bank can predict accurately the pitch at any instant.
出处
《西北工业大学学报》
EI
CAS
CSCD
北大核心
2003年第5期603-606,共4页
Journal of Northwestern Polytechnical University
基金
陕西省自然科学基金 (2 0 0 3CS110 1)
西北工业大学博士论文创新基金 (2 0 0 2 35 )
关键词
基音频率检测
多相滤波器组
峰值搜索
Pitch Detection Algorithm(PDA), multi phase filter bank, peak searching