期刊文献+

基于语音转折点检测的改进波形相似叠加时长规整算法 被引量:1

Improved Waveform Similarity Overlap-and-Add Time Warping Algorithm Based on Speech Turning Point Detection
下载PDF
导出
摘要 波形相似叠加算法忽略语音本身感知特性,对整段语音统一规整,在采样率较低或规整比例较大时处理效果不佳。为此,通过分析人耳听觉系统的预测特点,提出一种改进的波形相似叠加时长规整算法。采用子带谱熵法检测出语音的转折部分并保持其不变,以保证转折区的语音信息不受损坏,并给出一种局部补偿法以修正整体规整精度。仿真结果表明,该算法在整体规整比例不变的情况下可提高合成语音的自然度。 The Waveform Similarity Overlap-and-Add(WSOLA)algorithm neglects the perceptual characteristics of real sound speech signals,and employs uniform time scaling of the entire signal.When sampling rate is low or scaling proportion is large,the scale quality is degraded.Aiming at such problems,an enhanced WSOLA algorithm is proposed through analyzing the acoustic prediction characteristics of human auditory system.This method detects the turning points of the speech using a subband spectrum entropy measure and leaves them intact to ensure the turning points undamaged,while time scaling the remainder of the signal.A local compensate measure is further put forward to correct the whole scale accuracy.Simulation results show that the new algorithm improves the natural degree of the synthetic speech signals with the whole scale proportion unchanged.
作者 雷颖思 杨燕
出处 《计算机工程》 CAS CSCD 北大核心 2015年第10期260-264,共5页 Computer Engineering
基金 甘肃省科技厅自然科学基金资助项目(1310RJZA050)
关键词 时长规整算法 波形相似叠加算法 听觉预测 转折点检测 子带谱熵 局部补偿法 time warping algorithm Waveform Similarity Overlap-and-Add(WSOLA)algorithm acoustic prediction turning point detection subband spectrum entropy local compensation method
  • 相关文献

参考文献17

  • 1Moulines E,Laroche J.Non-parametric Techniques for Pitch-scale and Time-scale Modification of Speech[J].Speech Communication,1995,16(2):175-205.
  • 2Stylianou Y,CappéO,Moulines E.Continuous Probabilistic Transform for Voice Conversion[J].IEEE Transactions on Speech and Audio Processing,1998,6(2):131-142.
  • 3Nejime Y,Aritsuka T,Imamura T,et al.A Portable Digital Speech-rate Converter for Hearing Impairment[J].IEEE Transactions on Rehabilitation Engineer-ing,1996,4(2):73-83.
  • 4Arfib D,Verfaille V.Driving Pitch-shifting and Time-scaling Algorithms with Adaptive and Gestural Techniques[C]//Proceedings of the 6th International Conference on Digital Audio Effects.London,UK:[s.n.],2003.
  • 5Amatriain X,Bonada J,Loscos A,et al.Content-based Transformations[J].Journal of New Music Research,2003,32(1):95-114.
  • 6Roucos S,Wilgus A.High Quality Time-scale Modification for Speech[C]//Proceedings of IEEE International Conference on Acoustics,Speech,and Signal Processing.Washington D.C.,USA:IEEE Press,1985:493-496.
  • 7Griffin D,Lim J S.Signal Estimation from Modified Short-time Fourier Transform[J].IEEE Transactions on Acoustics,Speech and Signal Processing,1984,32(2):236-243.
  • 8McAulay R,Quatieri T F.Speech Analysis/Synthesis Based on a Sinusoidal Representation[J].IEEE Transactions on Acoustics,Speech and Signal Processing,1986,34(4):744-754.
  • 9叶锡恩,张巧文.基于WSOLA算法的语音时长调整研究[J].科技通报,2005,21(5):593-596. 被引量:4
  • 10周俊,高悦,谭薇,陈砚圃.语音时长规整技术的研究回溯[J].现代电子技术,2006,29(18):102-105. 被引量:6

二级参考文献34

  • 1杜守富,毛启容,詹永照.自适应同步叠加语音时长规整算法[J].通信学报,2005,26(2):136-140. 被引量:4
  • 2叶锡恩,张巧文.基于WSOLA算法的语音时长调整研究[J].科技通报,2005,21(5):593-596. 被引量:4
  • 3周俊,高悦,谭薇,陈砚圃.语音时长规整技术的研究回溯[J].现代电子技术,2006,29(18):102-105. 被引量:6
  • 4Wong P H W,Au, O C. Fast SOLA-based time-scale modification using modified envelope matching [C]//Proceedings of ICASSP 2002. Hong Kong, China:[s. n.],2002.
  • 5Makhoul J, El-jaroudi A. Time-scale modification in medium to low rate speech coding[J]. Proc ICASSP, 1986,311075-1078.
  • 6Philipos C L. Mimicking the human ear[J].IEEE Signal Processing Magazine, 1998,15(5) : 101-130.
  • 7Fmui S. On the role of spectral transition for speechperception[J].J Acoust Soc Amer, 1986, 80(4): 1016-1025.
  • 8Stevens K N. Acoustic correlates of some phonetic categories[J].J Acoust Soc Amer, 1980,68(3):836- 842.
  • 9Rabiner L, Juang B H. Fundamentals of speech recognition [M]. Englewood Cliffs, N J: Prentice-Hall, 1993: 100-117.
  • 10Deller J R, Hansen J H L, Proakis J G. Discretetime processing of speech signals[M]. New York, USA:Macmillan Publishing Company, 1993: 289-303.

共引文献9

同被引文献6

引证文献1

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部