期刊文献+

基于听觉模型的汉语耳语音声调检测 被引量:5

Perceiving of Tone in Whispered Chinese Based on Auditory Model
下载PDF
导出
摘要 从听觉感知出发,分析了听觉外周模型对于语音激励的主要响应过程,采取听神经平均发放率为声调感知线索,提出了一种汉语耳语音声调的识别方法.其理论基础是听神经发放信息是听觉中枢的唯一信息来源,它是对于语音激励中声强、频谱、共振峰等多种特征的综合反应,因此适合用作耳语音的声调特征.采用BP神经网络对大量汉语元音耳语四声样本进行训练、识别,得到65.1%的平均识别率,达到了改善汉语耳语音声调识别效果的目的. Based on the analysis of the response of a peripheral auditory model for speech stimulation, the average firing rate of auditory nerves is chosen as the cue for whispered tone. Thus a method for whispered Chinese tone perceiving is proposed. The underlying principle is based on the fact that auditory nerve is the only source of information for central auditory system and it responds to several types of acoustic stimulus such as intensity, formant,etc. Therefore the average firing rate of auditory nerves is a suitable characteristic for the tone of whispered speech. The BP artificial neural network was trained by these proposed parameters to achieve tone recognition. Experiments are performed on a lot of Chinese whispered speech data and the average correct rate reaches 65.1%, which shows that the proposed method is effective for improving the performance of whispered Chinese tone perceiving.
出处 《电子学报》 EI CAS CSCD 北大核心 2009年第4期864-867,共4页 Acta Electronica Sinica
基金 国家自然基金(No.60572076) 江苏省高校自然科学基金(No.05KJB510113)
关键词 声调检测 汉语耳语音 听觉模型 听神经平均发放率 tone detection whispered Chinese auditory model the average firing rate of auditory nerves
  • 相关文献

参考文献15

二级参考文献54

  • 1栗学丽,丁慧,徐柏龄.基于熵函数的耳语音声韵分割法[J].声学学报,2005,30(1):69-75. 被引量:34
  • 2Taisuke Itoh, Kazuya Takeda, Fumitada Itakura. Acoustic Analysis and Recognition of Whispered Speech[J]. ICASSP,2002: 389-392.
  • 3Robert W. Morris, Mark A. Clements. Reconstruction of Speech from Whispers [J]. Medical Engineering & Physics, 200'2,24: 515-520.
  • 4Qian-Jie Fu,Fan-Gang Zeng. Identification of Temporal Envelope Cues in Chinese Tone Recognition [J]. Asia Pacific Journal of Speech, Language and Hearing,2000,(5) :45-57.
  • 5Man Gao. Tones in Whispered Chinese:Articulatory and PerceptualCues. [Master], University of Victoria,2002.
  • 6W Meyer Eppler. Realization of Prosodic Features in Whispered Speech [J]. Journal of Acoustical Society of America, 1957, 29( 1 ) : 104-106.
  • 7林茂灿.普通话声调的声学特性和知觉征兆[J].中国语文,1988,(2):182-193.
  • 8Ross M, Shaffe H, Cohen A, Freudberg R et al. Average magnitude difference function pitch extractor. IEEE Trans on Acoustics, Speech and Signal Processing, 1974; 22(5):353-362
  • 9Rabiner L R. On the use of autocorrelation analysis for pitch detection. IEEE Trans. ASSP, 1977; ASSP-25(1):24-33
  • 10Noll A M. Cepstrum pitch determination. J. Acoust. Soc.Am., 1967; 41(2): 293-309

共引文献43

同被引文献43

引证文献5

二级引证文献44

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部