期刊文献+

言语识别中的时域及频域信息 被引量:22

Temporal and spectral cues for speech recognition
下载PDF
导出
摘要 本文对言语识别中的声学要素从时域和频域的角度进行探讨,旨在为人工耳蜗编码策略的改善提供理论依据。声码器技术被用于一系列的实验以确定时域和频域信息对言语识别和汉语四声识别的相互作用。频域信息是由声码器中的频道数来决定,而时域信息则是由声码器的低通滤波器的截止频率来决定。听力正常成人参加了各项感知试验。结果表明,时域和频域信息都对音素识别很重要。在安静环境下,辅音和元音识别率分别在8和12频道及16Hz和4Hz的低通截止频率时达到平台成绩。在噪声环境下,元音识别受益于增高的频道数。汉语四声的识别需要256Hz的低通截止频率才达到平台成绩,这一频率比英语音素识别所需的时域信息高得多。声调识别率在本研究中最高频道数12时仍未见饱和。为了研究细微结构和时域包络对四声识别的相对重要性,我们用声嵌合技术将不同声调信号的时域包络和细微结构进行对换。感知实验结果表明,声调识别主要取决于细微结构,这一点与音乐感知的结果类似,而不象言语识别,后者主要依赖于时域包络信息。因此,增加人工耳蜗系统中有效的频道数将有助于尤其是噪声环境下的言语识别。将人工耳蜗刺激中提供更多的细微结构信息可能会提高患者声调识别的成绩。 The present study explores the temporal and spectral cues for speech recognition in an attempt to provide information for improving the speech processing strategies in cochlear implant systems. A noise-excited voeoder was used in a series of experiments to determine the relative contribution of temporal and spectral cues to phoneme recognition and lexical tone recognition. Spectral information was controlled by varying the number of channels and temporal information was controlled by varying the lowpass cutoff frequencies of the envelope extractors. Normalhearing adult subjects participated in the perceptual tests. The results demonstrated that both temporal and spectral cues are important for phoneme recognition in quiet and in noise. The plateau performance for consonant and vowel recognition in quiet was reached when the number of channels was 8 and 12, respectively and the lowpass cutoff frequency was 16 and 4 Hz, respectively. In noise conditions, vowel recognition benefited from increased spectral resolution. For Mandarin Chinese tone recognition, the lowpass cutoff frequency required for asymptotic performance was 256 Hz, much higher than that required for English phoneme recognition. Tone recognition performance had not yet reached plateau when 12 chan- nels, the highest in this study, were used. To study the relative importance of fine structure and temporal envelope in lexical tone recognition, a separate experiment using the auditory chimera technique was carded out. The perceptual results demonstrated that tone recognition relies more on the fine structure as does melody perception rather than on the temporal envelope as does English speech perception. Therefore, to improve speech recognition, especially in noise, efforts should be concentrated on providing more effective channels in the cochlear implant systems. Lexical tone recognition could benefit from fine structure information presented in the cochlear implant stimulations.
作者 徐立
机构地区 School of Hearing
出处 《中华耳科学杂志》 CSCD 2006年第4期335-342,共8页 Chinese Journal of Otology
基金 美国NIH(F32-DC00470 RO1-DC03808 R03-DC006161.) 俄亥俄大学研究基金。
关键词 人工耳蜗 言语识别 声调识别 时域信息 频域信息 Cochlear implant Speech perception Tone perception Temporal cues Spectral cues
  • 相关文献

参考文献55

  • 1[2]Friesen LM,Shannon RV,Baskent D,et al.Speech recognition in noise as a function of the number of spectral channels:Comparison of acoustic hearing and cochlear implants.J Acoust Soc Am,2001,110:1150-1163.
  • 2[3]Shannon RV.Multichannel electrical stimulation of the auditory nerve in man.I.Basic psychophysics.Hear Res,1983,11:157-189.
  • 3[4]Shannon RV.Temporal modulation transfer functions in patients with cochlear implants.J Acoust Soc Am,1992,91:2156-2164.
  • 4[5]van den Honert C,Stypulkowski PH.Physiological properties of the electrically stimulated auditory nerve.II.Single fiber recordings.Hear Res,1984,14(3):225-243.
  • 5[6]Rubinstein JT,Hong R.Signal coding in cochlear implants:Exploiting stochastic effects of electrical stimulation.Ann Otol Rhinol Laryngol,2003,112:14-19.
  • 6[7]Skinner MW,Arndt PL,Staller SJ.Nucleus 24 advanced encoder conversion study:Performance vs preference.Ear Hear,2002,23:2S-25S.
  • 7[8]Villchur E.Electronic models to simulate the effect of sensory distortions onspeech perception by the deaf.J Acoust Soc Am,1977,62:665-674.
  • 8[9]ter Keurs M,Festen JM,Plomp R.Effect of spectral envelope smearing on speech reception.I.J Acoust Soc Am,1992,91:2872-2880.
  • 9[10]ter Keurs M,Festen JM,Plomp R.Effect of spectral envelope smearing on speech reception.Ⅱ.J Acoust Soc Am,1993,93:1547-1552.
  • 10[11]Baer T,Moore BCJ.Effects of spectral smearing on the intelligibility of sentences in noise.J Acoust Soc Am,1993,94:1229-1241.

共引文献3

同被引文献172

引证文献22

二级引证文献83

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部