
Speech Quality Evaluation Algorithm Based on Dynamic Distortion Measurement of Perception Characteristics

Cited by: 3
Abstract: This paper implements an objective, intrusive speech quality evaluation algorithm (GFCCD_MOS) based on a dynamic-warping distortion measure over perceptual speech feature parameters. The algorithm has three steps: feature extraction, distortion measurement, and MOS mapping. Its contributions are as follows. In feature extraction, Gammatone Frequency Cepstrum Coefficient (GFCC) parameters, which the authors consider to better capture the essential character of speech, replace the traditional LPC, LPCC, MFCC, and IMFCC parameters. In distortion measurement, a Dynamic Time Warping distance replaces the traditional average Euclidean distance. In MOS mapping, the mapping function is corrected so that bad values do not degrade the algorithm's performance. The paper describes the principle of the algorithm in detail and, based on an implementation, evaluates its performance in terms of correlation and deviation error. The simulation results indicate that GFCCD_MOS performs well.
Source: Techniques of Automation and Applications (《自动化技术与应用》), 2017, No. 4, pp. 1-4, 11 (5 pages in total)
Keywords: speech quality evaluation; Gammatone Frequency Cepstrum Coefficient (GFCC); dynamic warping; MOS mapping
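
The abstract contrasts a dynamic-warping distance with an average Euclidean distance and mentions a corrected MOS mapping, but this record does not give the formulas. The following is only a minimal sketch of those two ideas, not the authors' implementation. It assumes the features (for example GFCC vectors) have already been extracted as lists of per-frame vectors. The function names, the normalisation by `n + m`, and the linear, clipped mapping in `map_to_mos` are all illustrative assumptions.

```python
import math


def frame_distance(x, y):
    """Euclidean distance between two feature vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)))


def mean_euclidean_distance(ref, deg):
    """Baseline: average frame-by-frame distance with no time alignment.

    Only the first min(len(ref), len(deg)) frames are compared.
    Assumes both sequences are non-empty.
    """
    n = min(len(ref), len(deg))
    return sum(frame_distance(ref[i], deg[i]) for i in range(n)) / n


def dtw_distance(ref, deg):
    """Classic Dynamic Time Warping distance between two frame sequences.

    Allowed steps are (i-1, j), (i, j-1) and (i-1, j-1). The accumulated
    cost is divided by len(ref) + len(deg); that normalisation is an
    assumption of this sketch. Assumes both sequences are non-empty.
    """
    n, m = len(ref), len(deg)
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = frame_distance(ref[i - 1], deg[j - 1])
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m] / (n + m)


def map_to_mos(distance, slope, intercept):
    """Hypothetical mapping: linear in the distance, clipped to the MOS range [1, 5].

    Clipping is one simple way to avoid out-of-range ("bad") values; the
    correction actually used in the paper is not given in this record.
    """
    return min(5.0, max(1.0, slope * distance + intercept))


# Toy one-dimensional "features": the degraded signal repeats its first frame,
# which shifts every later frame by one position.
reference = [[0.0], [1.0], [2.0], [3.0]]
degraded = [[0.0], [0.0], [1.0], [2.0], [3.0]]

print(mean_euclidean_distance(reference, degraded))  # 0.75
print(dtw_distance(reference, degraded))             # 0.0
```

In this toy case the unaligned average penalises the one-frame delay, while the warped distance absorbs it, which is the usual motivation for warping before measuring distortion.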