期刊文献+

基于FDA的基频建模

Modeling pitch contour based on functional data analysis
下载PDF
导出
摘要 提出了一种利用FDA方法对语音基频包络建模的新方法.用B-样条函数对4种声调的单字基频抽取其基频样点进行数据平滑处理,得到平滑后的基频曲线,将平滑后的基频曲线进行2次时间校准处理,得到拟合后的基频曲线.将原始基频与拟合后的基频曲线进行对比,实验结果表明,文中提出的方法建立的基频模型的均方误差为6.47Hz,可应用于语音合成等语音信息处理中. A novel method for modeling pitch contour with FDA method is presented. By smoothing the pitch's samples of four kinds of Mandarin monotone with B-spline basis function, the fitted pitch contour is obtained. Comparing the pitch contours of before alignment and after alignment, the experimental results demonstrated that proposed method can accurately model the pitch contours with 6.47 Hz of mean root error. Proposed method can be applied to speech synthesis.
出处 《西北师范大学学报(自然科学版)》 CAS 北大核心 2013年第2期44-48,共5页 Journal of Northwest Normal University(Natural Science)
基金 国家自然科学基金资助项目(60875015 61263036 61262055) 甘肃省自然科学基金资助项目(1107RJZA112)
关键词 泛函数据分析(FDA) 基频曲线 基频建模 B-样条函数 FDA pitch contour pitch modeling B-spline
  • 相关文献

参考文献9

  • 1贾珈,蔡莲红,李明,张帅.汉语普通话与沈阳方言转换的研究[J].清华大学学报(自然科学版),2009(S1):1309-1315. 被引量:7
  • 2FUJISAKI H, HIROSE K, Analysis of voicefundamental frequency contours for declarativesentences of Japanese[J]. J Acoust Soc Jpn , 1984,5(4): 233-242.
  • 3XU Yi, WANG Q E. Pitch targets and theirrealization: Evidence from mandarin Chinese [ J ].Speech Communication , 2001, 33 : 319-337.
  • 4梁青青,杨鸿武,郭威彤,裴东,甘振业.利用五度字调模型实现普通话到兰州方言的转换[J].声学技术,2010,29(6):620-625. 被引量:3
  • 5RAMSAY J O,SILVERMAN B W. FunctionalData Analysis[M]. New York: Springer, 2005.
  • 6BRIGGER P. B-spline snakes: A flexible tool forparametric contour detection[J], IEEE Transactionson Image Processing -, 2000,9(9) : 1484-1496.
  • 7LEE S,BYRD D,KRIVOKAPICJ. Functional dataanalysis of prosodic effects on articulatory timing[J].The Journal of Acoustical Society of America,2006’ 119(3): 1666-1671.
  • 8RAMSAY J O, MUNHALL K G, GRACCO V L,et al. Functional data analysis of lip motion[J]. TheJournal of Acoustical Society of America,1996.99(6) : 3718-3727.
  • 9GUBIAN M, BOVES L,CANGEMI F. Jointanalysis of f0 and speech rate with functional dataanalysis[C]//JEEE ICASSP. Bragg: IEEE Press,2011: 4972-4975.

二级参考文献12

共引文献8

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部