期刊文献+

利用结构特征的语音压缩感知重建算法 被引量:6

A Reconstruction Algorithm for Speech Compressive Sensing Using Structural Features
下载PDF
导出
摘要 针对语音信号在变换域中不够稀疏使得压缩感知重建困难的问题,提出了一种利用频域结构特征的重建算法.该算法为单帧语音信号的修正离散余弦变换系数引入幅度和状态2个隐变量,并分别用高斯马尔可夫过程和马尔可夫链对幅度和状态沿频率轴的连续性建模.在此基础上用因子图表示系数及其幅度、状态的联合后验分布,在因子图上用Turbo消息传递迭代求出系数的后验均值,进而重建原始语音信号.与当前几种最新的算法相比,该算法在不同帧长、不同压缩率下均获得更高的重建精度,重建信号在时频图上的能量分布也与原始语音最为接近.可见,利用语音频域系数的连续性,以Turbo消息传递的方式可以在压缩感知中得到较高的重建精度. It is difficult to reconstruct speech signal after compressive sampling because coefficients of the signal in transforming domain aren't sparse enough.In this paper the speech signal was recovered from compressed samples in the frequency domain using structural features.Two hidden variables,amplitude and state,are defined for each modified discrete cosine transforming(MDCT)coefficient of the speech signal.The probability density function of the amplitude of the MDCT coefficient is represented using a Gaussian mixture model,and the continuity of the states along the frequency axis is modeled through a first order Markov chain,the continuity of the amplitude along the frequency axis is modeled through Gauss-Markov process.The joint posterior distribution of coefficient,amplitude and state is represented by the factor graph,on which the posterior mean of the coefficient is obtained using Turbo message passing method,and then the speech can be reconstructed.After compressive sampling the MDCT coefficients of a speech segment,we reconstructed the signal using our proposed algorithm and other state-of-the-art algorithms for comparison.The results showed that our proposed algorithm achieved best reconstruction quality under different frames and compressive ratios.The spectrogram showed that the energy distribution of reconstructed signal using our algorithm was the most similar to the original signal's energy distribution.It can be seen that better reconstruction accuracy can be obtained using the continuity along frequency axis and Turbo message passing method.
作者 贾晓立 江晓波 蒋三新 刘佩林 JIA Xiaoli JIANG Xiaobo JIANG Sanxin LIU Peilin(Shanghai Key Laboratory of Navigation and Location-Based Services, Shanghai Jiao Tong University, Shanghai 200240, Chin)
出处 《上海交通大学学报》 EI CAS CSCD 北大核心 2017年第9期1111-1116,共6页 Journal of Shanghai Jiaotong University
基金 国家自然科学基金(61171171 61401501) 华为技术有限公司研究基金资助
关键词 语音信号 压缩感知 高斯混合模型 马尔可夫链 消息传递 speech signal compressive sensing Gaussian mixture model Markov chain message passing
  • 相关文献

同被引文献56

引证文献6

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部