基于HMM状态聚类均值替代的宽带语音ISF参数补偿算法

ISF Parameters Concealment Algorithm Based on the HMM Clustering Means for Wideband Speech

下载PDF

导出

摘要宽带语音在Internet传输中不可避免会出现丢帧现象,由于错误传播的影响,使接收语音质量急剧下降。该文采取大型连续分布隐马尔可夫模型(LCDHMM)对宽带语音ISF参数建模,采用Viterbi算法确定丢失帧之前若干语音帧ISF参数观察值的最佳状态序列。由于状态的冗余度较大,用丢帧前最近接收的正确帧ISF参数的HMM状态对应的聚类均值和真实值的加权,代替丢失帧的ISF参数值。将采取该算法的补偿语音和采取G.722.2标准附件I所提算法的补偿语音进行比较,仿真结果表明该算法具有较好的补偿效果,其波形与谱失真更小。 There exist inevitably frame losses in the transmission of wideband speech. Due to the influence of error propagation, the quality of the received speech decays rapidly. In this paper, large hidden Markov model is adopted to model the ISF parameter in wideband speech, and the best state sequences of ISF parameters of the several speech frames before the lost frame are determined using Viterbi Algorithm. Because of the large redundancy of the HMM states, the lost ISF parameters are substituted by the weighted values between the clustering means and the real values of the ISF parameters of the nearest received frame. The speech compensated by this algorithm is compared to which by Annex I of G.722.2 specification. Simulation shows that the algorithm in this paper can result in best speech, and smaller waveform and spectrum distortion.

作者王仕奎周琳吴镇扬尤红岩

机构地区东南大学信息科学与工程学院安徽师范大学物理与电子信息学院

出处《电子与信息学报》 EI CSCD 北大核心 2009年第4期827-831,共5页 Journal of Electronics & Information Technology

基金东南大学科技基金项目(XJ0704268) 安徽省高校省级自然科学研究项目(KJ2007B088)资助课题

关键词宽带语音丢帧隐马尔可夫模型聚类补偿 Wideband speech Frame loss Hidden Markov Model（HMM） Clustering Concealment

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献11

1Ding Lijing, Radwan Ayman, El-Hennawey, Mohamed Samy, and Goubran Rafik A. Performance study of objective voice quality measures in VoIP[C]. IEEE Symposium on Computers and Communications, Aveiro, Portugal, 1-4 July 2007: 197-202.
2Thyssen J, Zopf R, Chen Juin-Hwey, and Shetty N. A candidate for the ITU-T G.722 packet loss concealment standard[C]. IEEE International Conference on Acoustics, Speech and Signal Processing, Honolulu, USA , Vol.4, 15-20 April 2007: Ⅳ-549-Ⅳ-552.
3Lee Moon-Keun, Jung Sung-Kyo, Kang Hong-Goo, Park Young-Cheol, and Youn Dae-Hee. A packet loss concealment algorithm based on time-scale modification for CELP-type speech coders [C]. Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference, Vol.1, 6-10 April 2003: Ⅰ-116-Ⅰ-119.
4Rφdbro Christoffer A, Murthi Manohar N, Andersen S V, and Jensen S H. Hidden Markov model-based packet loss concealment for voice over IP [J]. IEEE Trans. on Audio,Speech, and Language Processing, 2006, 14(5): 1609-1623.
5Telecommunication Standadization Sector of ITU. Wideband coding of speech at around 16 kbit/s using Adaptive Multi-Rate Wideband (AMR-WB) Appendix Ⅰ: Error concealment of erroneous or lost frames [S]. 2002.
6Telecommunication Standadization Sector of ITU. ITU-T Draft Rec G.722.2: Wideband coding of speech at around 16 kbit/s using Adaptive Multi-Rate Wideband (AMR-WB)[S]. 2003.
7Rabiner L and Juang Bin-Huang. Fundamentals of Speech Recognition [M]. USA, Prentice Hall, 1993: 321-389.
8Roucos S, Makhoul J, and Schwartz R. A variable-order Markov chain for coding of speech spectra [C]. ICASSP, 1982: 582-585.
9Farges E and Clements M. Hidden Markov models applied to very low bit rate coding [C]. ICASSP, Tokyo, Japan, 1986: 433-436.
10周琳,吴镇扬.基于MS估计和迭代结构的信源信道联合解码系统性能分析[J].电子与信息学报,2006,28(2):257-261. 被引量：1

二级参考文献8

1Fingscheidt T,Vary P.Soft bit speech decoding:A new approach to error concealment[J].IEEE Trans.on Speech and Audio Processing,2001,9(3):240-251.
2Adrat M,Heanel R,Vary P.On joint source-channel decoding for correlated source.In Proceedings of ICASSP-02[C],Orlando,Florida,USA,2002,3:2505-2508.
3Fingscheidt T,Vary P.Robust speech decoding:A universal approach to bit error concealment.In Proceedings of ICASSP-97[C],Munich,Germany,1997,3:1667-1670.
4Hagenauer J.Source-controlled channel decoding[J].IEEE Trans.on Communications,1995,43(9):2449-2457.
5Veaux C,Scalart P,Gilloire A.Channel decoding using inter-and intra-correlation of source encoded frames.In Proceedings of Data Compression Conference-2000[C],Snowbird,Utah,USA.2000:103-112.
6Fingscheidt T,Hindelang T,Cox R V,et al..Joint source-channel(de-)coding for mobile communications[J].IEEE Trans.on Communications,2002,50(2):200-211.
7Xu W.Repeated joint source-channel decoding in a GSM system.In Proceedings of PIMRC 2000[C],London,UK.2000,1:241-245.
8Cover T M.Thomas J A.Elements of Information Theory[M].New York,NY:John Wiley &Sons,1991:19-29.

1Scagl.,A 赖奕蓉.运用基于时频表示的参数建模对运动参数进行估计[J].空载雷达,1998(2):56-61.
2李海婷,鲍长春.宽带ISF参数的非等系数帧间预测分裂矢量量化方法[J].电子学报,2008,36(6):1214-1217. 被引量：1
3包希日莫,高光来.蒙古语声学模型状态聚类:问题集设计[J].内蒙古大学学报（自然科学版）,2013,44(1):87-92. 被引量：1
4储德寅.非线性编辑系统中视频采集丢帧问题探讨[J].中国有线电视,2003(18):78-81.
5霍玲玲.Kalman滤波器理论的研究[J].电子制作,2015,23(11Z).
6李海婷,鲍长春.宽带ISF参数的转换分类乘积码锥形矢量量化[J].电子学报,2008,36(2):362-366. 被引量：6
7王军,张连海,屈丹.一种针对ISF参数的量化算法[J].通信技术,2009,42(10):204-206.
8靳同红,莫正波,郑德亮,王胜春.时频分析技术及应用研究[J].机械科学与技术,2009,28(1):75-78. 被引量：10
9徐向华,朱杰,郭强.汉语连续语音识别中的分级聚类算法的研究和应用[J].信号处理,2004,20(5):497-500. 被引量：2
10张磊.带阻尼EMI滤波器寄生参数建模研究[J].现代雷达,2014,36(12):78-82.

电子与信息学报

2009年第4期

浏览历史

内容加载中请稍等...

基于HMM状态聚类均值替代的宽带语音ISF参数补偿算法

参考文献11

二级参考文献8

相关作者

相关机构

相关主题

浏览历史