Automatic Acquisition of Speech Sound-Target Cells Based on the DIVA Model

Abstract: The DIVA (Directions Into Velocities of Articulators) model does not account for the imbalance between the development of perceptual ability and of speech production skill: infants' perceptual abilities initially develop faster than their speech production skills. To address this, the paper proposes a method for automatically acquiring speech sound-target cells. The method models the human ear as a bank of parallel band-pass filters with different bandwidths, associated respectively with the 21-dimensional auditory storage space of the DIVA model. For the different auditory responses, it accounts separately for the masking effect of each frequency band and for the relationship between perceived loudness and frequency. While reading the speech input signal, the model obtains a good initial auditory representation, in a manner broadly consistent with infant babbling. Simulation results show that, through boundary definition, similarity comparison, and search-and-update steps, the method achieves good self-organized matching of the initial input patterns and ultimately makes speech acquisition in the DIVA model more natural.
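The filter-bank idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no filter parameters, so the band edges below are an assumption (the first 22 Zwicker critical-band edges, which yield 21 bands), the filters are idealized as rectangular bands over a DFT power spectrum, and neither the masking effect nor the loudness-frequency relationship is modeled. The names `BAND_EDGES_HZ`, `power_spectrum`, and `auditory_vector` are hypothetical.

```python
import cmath
import math

# ASSUMPTION: band edges in Hz taken from the Zwicker critical-band (Bark) scale.
# 22 edges give 21 bands; the paper's actual filter parameters are not stated.
BAND_EDGES_HZ = [20, 100, 200, 300, 400, 510, 630, 770, 920, 1080, 1270,
                 1480, 1720, 2000, 2320, 2700, 3150, 3700, 4400, 5300, 6400, 7700]


def power_spectrum(frame):
    """Return |X[k]|^2 for k = 0..N/2 using a direct DFT (adequate for short frames)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def auditory_vector(frame, sample_rate):
    """Map one signal frame to a 21-dimensional vector of per-band log energies (dB)."""
    power = power_spectrum(frame)
    bin_hz = sample_rate / len(frame)  # frequency spacing between DFT bins
    features = []
    for low, high in zip(BAND_EDGES_HZ[:-1], BAND_EDGES_HZ[1:]):
        # Idealized band-pass filter: sum the power of all bins inside [low, high).
        energy = sum(p for k, p in enumerate(power) if low <= k * bin_hz < high)
        # Small offset avoids log10(0) for bands that carry no energy.
        features.append(10.0 * math.log10(energy + 1e-12))
    return features


if __name__ == "__main__":
    sample_rate, n = 16000, 512
    # 1 kHz test tone; it should excite the 920-1080 Hz band most strongly.
    tone = [math.sin(2 * math.pi * 1000 * i / sample_rate) for i in range(n)]
    vec = auditory_vector(tone, sample_rate)
    print(len(vec), vec.index(max(vec)))
```

Each frame of the input signal is reduced to one 21-element vector, which is the kind of fixed-length representation that later steps such as similarity comparison could operate on. A closer model would replace the rectangular bands with overlapping filters and add masking and loudness weighting.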

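Authors: Zhang Shaobai (张少白), Liu Xin (刘欣)

Affiliation: College of Computer, Nanjing University of Posts and Telecommunications

Source: CAAI Transactions on Intelligent Systems (《智能系统学报》), indexed in CSCD and the Peking University Core Journals list, 2013, No. 4, pp. 305-311 (7 pages)

Funding: National Natural Science Foundation of China (grants 61073115, 61271334, 61373065)

Keywords: DIVA model; phoneme; speech sound-target cells; speech acquisition and production

Classification: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]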


