期刊文献+

基于AMR编码参数的语音识别 被引量:1

Speech Recognition Based on AMR Vocoder Parameters
下载PDF
导出
摘要 基于语音编码系统的语音识别 ,由于受编码的影响其识别效果在编码速率下降时显著降低。传统的识别方法从重构语音波形中提取特征参数 ,并针对该特征参数进行训练和识别。比较了基于编码语音的识别准确率和基于编码参数的识别准确率 ,并研究了编码参数对识别准确率的影响。在此基础上 ,通过选择受编码影响较小的编码参数 ,直接将 LPC参数和残差信号参数组合起来构成特征参数进行语音识别。实验结果表明 ,采用这种方法的 AMR语音识别系统 ,其识别效果接近于基于原始语音的识别效果。 Speech coding affects speech recognition performance by deteriorating recognition accuracy as the coded bit rate decreases. The conventional systems that recognize coded speech reconstruct the speech waveform from the coded parameters and then perform recognition based on the characteristic parameters of the waveform. In this paper, a comparison is made between the recognition accuracy of coded speech and the accuracy obtained when using the features derived from the coding parameters. The effects of coding on the recognition accuracy is analyzed. The cepstral streams representing the LPC parameters are combined with residual parameters to recognize directly from the coded parameters. Experiment results suggest that it is possible to obtain recognition accuracy equal to the conventional systems from reconstructed waveforms.
出处 《解放军理工大学学报(自然科学版)》 EI 2002年第5期6-9,共4页 Journal of PLA University of Science and Technology(Natural Science Edition)
关键词 编码参数 AMR声码器 语音识别 MEL频率倒谱系数 语音编码系统 编码速度 AMR vocoder speech recognition MFCC (Mel Frequency Cepstral Coefficients)
  • 相关文献

参考文献6

  • 1Haeb-Umback R. Robust speech recognition for wireless networks and mobile telephony [A]. In: Proc Eurospeech, 97'[C]. Rhodes, Greece, 1997.
  • 2MOKBEL C, MAUUARY L, JOUVET D, etc. Towards improving ASR robustness for PSN & GSMtelephone applications [A]. In: 2^nd IEEE Workshop on Interactive Voice Technology for telecommunications applications (IVTTA1994) [C]. Greece,1996.
  • 33G TS 26. 090-99. AMR speech codec; transcoding functions [S].
  • 4KONDOZ A M. Digital speech-coding for low bit rate communication systems [M]. Singapore: John Wiley sons, 1995.
  • 5ATAL B S. Effectiveness of linear prediction characteristics of the speech wave[J]. J Acoust Soc Am 1974,55(6):1 304-1312.
  • 6RABINER L, JUANG B H. Fundamentals of speechrecognition [M]. Englewood Cliffs: Prentice Hall,1993.

同被引文献9

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部