
A new audio analysis/synthesis system with an adaptive window controlled by the estimated fundamental frequency  Cited by: 1
Abstract: The fundamental frequency of an audio signal's short-time spectrum varies over time, so the spacing between its harmonics varies as well, while in the time domain the signal changes sometimes quickly and sometimes slowly. The time and frequency resolution required for short-time spectral analysis therefore changes over time. A conventional fixed analysis window has a fixed time-frequency resolution, cannot meet both requirements at once, and so biases the short-time analysis. Based on the sinusoids-plus-noise (deterministic plus stochastic) model, this paper proposes an adaptive audio analysis/synthesis system in which the width of the analysis window is controlled by the estimated fundamental frequency, which effectively improves the accuracy of real-time signal analysis. On this basis, new algorithms and their theoretical justification are given for the use of the analysis window, the determination and tracking of sinusoidal components (partials), and the separation of the noise component (residual). The system provides a flexible and efficient framework for the deliberate modification of audio signals.
Authors: 杨诚 (Yang Cheng), 马永杰 (Ma Yongjie)
Source: 《信息化纵横》, 2009, No. 11, pp. 54-59 (6 pages)
Keywords: fundamental frequency estimation; partials (sinusoidal components); residuals (noise components); adaptive window; frequency tracking (peak matching); audio analysis/synthesis
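The abstract does not state the rule that maps the estimated fundamental frequency to a window width, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes the common sinusoidal-modeling rule of thumb that adjacent harmonics are resolved when the window spans at least `main_lobe_bins * sample_rate / f0` samples (about 4 for a Hamming window). The function name, the default values, and the clamping limits are all hypothetical.

```python
import math


def adaptive_window_length(f0_hz, sample_rate=44100, main_lobe_bins=4,
                           min_len=256, max_len=8192):
    """Pick an odd analysis-window length (in samples) from a frame's f0.

    Assumption (not from the paper): harmonics spaced f0 apart are resolved
    when the window main lobe, `main_lobe_bins` bins wide, is no wider than
    f0, i.e. when length >= main_lobe_bins * sample_rate / f0.
    """
    if f0_hz <= 0:
        # No usable pitch estimate: fall back to the longest window.
        n = max_len
    else:
        n = math.ceil(main_lobe_bins * sample_rate / f0_hz)
        # Keep the length inside hypothetical practical limits.
        n = max(min_len, min(n, max_len))
    # An odd length gives the window a single center sample.
    return n if n % 2 == 1 else n + 1


if __name__ == "__main__":
    for f0 in (110.0, 440.0, 2000.0, 0.0):
        print(f0, adaptive_window_length(f0))
```

Under these assumptions a low-pitched frame gets a long window (fine frequency resolution for closely spaced harmonics), and a high-pitched frame gets a short one (better time resolution), which is the trade-off the abstract describes.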








