
Objective speech quality assessment by a neural network with distance measures as inputs (cited by: 2)
Abstract: A radial basis function network (RBFN) is used for objective speech quality assessment, which avoids the difficulty of choosing a specific function for regression analysis. Three distance measures, rather than the speech signal itself, are chosen as the inputs of the neural network, which greatly reduces the input dimension and simplifies the network structure. The structure of the RBFN is also modified to make it better suited to speech quality assessment. A smoothed training set is used for learning the parameters and structure of the RBFN, which effectively reduces the influence of random factors on the objective assessment results and further reduces the structural complexity of the network. The experimental results show that the resulting RBFN has a very simple structure and that the correlation coefficient between the subjective scores and the objective MOS estimates is above 0.96, which indicates that the method is reliable.
Source: Journal of Sichuan University (Natural Science Edition) (《四川大学学报(自然科学版)》), indexed in CAS, CSCD, and the Peking University Core Journals list; 2007, No. 6, pp. 1210-1214 (5 pages)
Funding: National Natural Science Foundation of China (10571127); 973 Program (2002cb312206)
Keywords: objective speech quality assessment; measure; radial basis function network
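
The approach summarized in the abstract can be illustrated with a minimal sketch: a Gaussian RBF network maps three distance measures per speech sample to an estimated MOS, and the estimate is then compared with the subjective scores through the Pearson correlation coefficient. Everything specific here is an assumption made for illustration. The abstract does not name the three measures, so the inputs are random numbers in [0, 1]; the "subjective" scores are synthetic; the centers, width, and least-squares training of the output weights are generic choices, not the authors' modified RBFN structure or their smoothed training set.

```python
import numpy as np

def rbf_design(X, centers, width):
    # Gaussian activation of every sample at every center, plus a bias column.
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    phi = np.exp(-d2 / (2.0 * width ** 2))
    return np.hstack([phi, np.ones((X.shape[0], 1))])

def fit_rbfn(X, y, centers, width):
    # Output-layer weights by linear least squares (a generic choice).
    w, *_ = np.linalg.lstsq(rbf_design(X, centers, width), y, rcond=None)
    return w

def predict_rbfn(X, centers, width, w):
    return rbf_design(X, centers, width) @ w

def pearson(a, b):
    # Pearson correlation coefficient between two 1-D arrays.
    return float(np.corrcoef(a, b)[0, 1])

rng = np.random.default_rng(0)

# Hypothetical inputs: 200 speech samples, 3 distance measures each.
X = rng.uniform(0.0, 1.0, size=(200, 3))

# Synthetic "subjective MOS": falls as distortion grows, with added noise.
mos = np.clip(4.5 - 3.0 * X.mean(axis=1) + rng.normal(0.0, 0.2, 200), 1.0, 5.0)

# Assumed hyperparameters: 8 centers drawn from the data, width 0.5.
width = 0.5
centers = X[rng.choice(200, size=8, replace=False)]

w = fit_rbfn(X, mos, centers, width)
pred = predict_rbfn(X, centers, width, w)
r = pearson(mos, pred)
print(f"correlation between synthetic MOS and RBFN estimate: {r:.3f}")
```

Because the input is only three numbers per sample, the network needs just a handful of hidden units, which is the simplification the abstract points to. The correlation printed by this sketch describes the synthetic data only and says nothing about the figure of 0.96 reported in the paper.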
