基于NMF和FCRF的单通道语音分离被引量：1

Single-channel speech separation with non-negative matrix factorization and factorial conditional random fields

导出

摘要近年来,非负矩阵分解(non-negative matrix factorization,NMF)被广泛应用于单通道语音分离问题。然而,标准的NMF算法假设语音的相邻帧之间是相互独立的,不能表征语音信号的时间连续性信息。为此,该文提出了一种基于NMF和因子条件随机场(factorial conditional random field,FCRF)的语音分离算法,首先将NMF和k均值聚类结合对纯净语音的频谱结构以及时间连续性进行建模,然后利用得到的模型训练FCRF模型,进而对混合语音信号进行分离。结果表明:该算法相比没有考虑语音时间连续特性的基于NMF的算法如激活集牛顿算法(active-set Newton algorithm,ASNA),在客观指标上有明显提高。 Non-negative matrix factorization （NMF） has been extensively used for single channel speech separation. However, a typical issue with the standard NMF based methods is that they assume the independency of each time frame of the speech signal and, thus, cannot model the temporal continuity of the speech signal. This paper presents an algorithm for single-channel speech separation based on NMF and the factorial conditional random field （FCRF） method. A model is developed by combining NMF with the k-means clustering method. This model can concurrently describe the spectral structure and the temporal continuity of the speech signal. Then, the model is used to train the FCRF model, which isused to separate the mixed speech signal. Tests show that this algorithm consistently improves the separation performance compared with the active-set Newton algorithm, an NMF based approach that dose not consider the temporal dynamics of the speech signal.

作者李煦屠明吴超国雁萌纳跃跃付强颜永红

机构地区中国科学院声学研究所亚利桑那州立大学

出处《清华大学学报（自然科学版）》 EI CAS CSCD 北大核心 2017年第1期84-88,共5页 Journal of Tsinghua University(Science and Technology)

基金国家自然科学基金资助项目(11461141004,91120001,61271426) 中国科学院战略性先导科技专项(XDA06030100,XDA06030500) 国家“八六三”高技术项目(2012AA012503) 中科院重点部署项目(KGZD-EW-103-2)

关键词单通道语音分离因子条件随机场非负矩阵分解 K均值聚类 single-channel speech separation factorial conditionalrandom field （FCRF） non-negative matrix factorization（NMF） k-means clustering

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

同被引文献1

1刘文举,聂帅,梁山,张学良.基于深度学习语音分离技术的研究现状与进展[J].自动化学报,2016,42(6):819-833. 被引量：70

引证文献1

1涂斌炜,吕俊.基于不确定性感知的语音分离方法[J].自动化与信息工程,2021,42(1):35-40. 被引量：1

二级引证文献1

1李计管,朱爽,姜德宏,徐华.基于集成AI技术的高速公路收费机器人系统[J].中国交通信息化,2022(5):112-115. 被引量：2

1王雨,林家骏,袁文浩,陈宁.基于改进基音跟踪算法的单通道语音分离[J].华东理工大学学报（自然科学版）,2013,39(3):338-344. 被引量：4
2邓全.CDMA网络优化中不必要的双向邻区优化初探[J].中国电子教育,2010(3):57-60. 被引量：1
3龚婕,杨士元.SAR图像的检测和分类方法[J].北京邮电大学学报,2005,28(4):99-102. 被引量：3
4蒋少西.GC—LRA全自动有线／无线转接器[J].移动通信,1993,6(4):24-26.
5陈锴,卢晶,徐柏龄.基于话者状态检测的自适应语音分离方法的研究[J].声学学报,2006,31(3):211-216. 被引量：3
6邢安昊,黎塔,颜永红.利用二重打分方法的激活词语音识别[J].声学技术,2013,32(S1):211-212.
7孟东霞,马建芬,乔永凤.一种改进的基于EASI的语音分离算法[J].计算机工程与应用,2007,43(33):214-216.
8雷震洲.开放式网路结构(ONA)[J].电信科学,1988,4(7):63-64.
9徐丽琴,何晓川.一种基于负熵的信赖区域盲分离方法[J].西安邮电学院学报,2010,15(3):14-18. 被引量：1
10中兴通讯cdma2000中标摩洛哥全国网[J].电信技术,2006(8):78-78.

清华大学学报（自然科学版）

2017年第1期

浏览历史

内容加载中请稍等...

基于NMF和FCRF的单通道语音分离被引量：1

同被引文献1

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于NMF和FCRF的单通道语音分离 被引量：1

同被引文献1

引证文献1

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于NMF和FCRF的单通道语音分离被引量：1