基于DS证据理论多特征融合模型的说话人分割聚类研究

下载PDF

导出

摘要说话人分割聚类是语音处理领域一个重要的研究课题。为提高说话人分割聚类的准确性,提出一种基于DS证据理论多特征融合模型用于提取说话人嵌入特征。该方法使用2种组合特征来更高效地表征语音,用于DenseNet网络的输入,利用DS证据理论对softmax层的输出进行融合,得到说话人的嵌入特征。分别使用单一特征与组合特征输入的DenseNet网络与该模型进行实验对比分析,结果表明,基于该模型的说话人分割聚类提取目标说话人的准确性更有优势。 Speaker diarization is an important research topic in the field of speech processing.In order to improve the accuracy of speaker diarization,a Dempster-Shafer theory based multi-feature fusion model is proposed for extracting speaker embedding features.Through this method,two combined features are used to represent the speech more efficiently,which is used for the input of the DenseNet network,and the DS evidence theory is used to fuse the output of the softmax layer to get the embedded features of the speaker.The DenseNet network with single feature input and combined feature input is used to compare with the model,and the results show that the accuracy of speaker diarization and clustering based on this model is better.

作者项羽令晓明郭亚龙

机构地区兰州交通大学光电技术与智能控制教育部重点实验室兰州交通大学国家绿色镀膜技术与装备工程技术研究中心

出处《科技创新与应用》 2023年第23期108-111,共4页 Technology Innovation and Application

关键词说话人分割聚类 DS证据理论密集卷积网络组合特征鲁棒性 speaker diarization Dempster-Shafer theory DenseNet combinatorial features robustness

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献5

1张欢,陆见光,唐向红.面向冲突证据的改进DS证据理论算法[J].北京航空航天大学学报,2020,46(3):616-623. 被引量：31
2Guangzhe Zhao,Aiguo Chen,Guangxi Lu,Wei Liu.Data Fusion Algorithm Based on Fuzzy Sets and D-S Theory of Evidence[J].Tsinghua Science and Technology,2020,25(1):12-19. 被引量：19
3李稀敏,洪青阳,黄晓丹.基于说话人的音频分割与聚类[J].心智与计算,2010,0(2):139-147. 被引量：5
4马勇,鲍长春.说话人分割聚类研究进展[J].信号处理,2013,29(9):1190-1199. 被引量：7
5朱唯鑫,郭武.采用长度规整MAP的说话人分割聚类[J].信号处理,2016,32(7):859-865. 被引量：1

二级参考文献97

1付中华,张艳宁.在线无监督说话人检索中稳健的模型自举算法[J].软件学报,2007,18(3):608-616. 被引量：3
2..http://www.itl.nist.gov/iad/mig/tests/rt/,.
3S.E.Tranter,D.A.Reynolds.An overview of automatic speaker diarization systems[J].IEEE Tram on Audio,Speech,and Language for Processing.2006,14(5):1557-1565.
4M.Kotti,V.Moschou,C.Kotropoulos.Speaker segmentation and clustering.Signal Processing 2008(88):1091-1124.
5T.Stafylakis and V.Katsouros.A review of recent advances in speaker diarization with bayesian methods.Speech and Language Technologies[M].InTech pubhshing 2011:217-240.
6X.Anguera,S.Bozonnet,N.Evans,C.Fredouille,G.Friedland,O.Vinyals.Speaker diarization:a review of recent research[J].IEEE Trans on Audio,Speech,and Language for Processing.2012,20(2):356-370.
7J.Ramírez; J.M.G6rriz,J.C.Segura.Voice activity detection.Fundamentals and Speech Recognition System Robustness[M].In M.Grimm and K.Kroschel.Robust Speech Recognition and Understanding.2007:1-22.
8D.Liu and F.Kubala,Fast speaker change detection for broadcast news transcription and indexing[C].In Proc.Eur Conf.Speech Commun Technol,1999(3):1031-1034.
9Nwe,T.L,Sun,H.,Li.,H.,Rahardja,S.,Speaker diarization in meeting audio,In Proc.of ICASSP,2010:4073-4076.
10V.Gupta,P.Kenny,P.Ouellet,G.Boulianne,and P.Dumouchel.Combining gaussianized/non-gaussianized features to improve speaker diarization of telephone conversations[J].IEEE Signal processing letters.2007,14(12):1040-1043.

共引文献55

1王锦,张振明,黄乃康,杨海成.集成环境下面向产品的CAPP系统[J].计算机工程与应用,2000,36(4):32-35. 被引量：5
2冷先平.论单纯化原理与标志设计的关系[J].中国包装,2000,20(3):84-86.
3陈祝允,李艳雄,杜佳媛.基于矢量量化的时序说话人聚类方法[J].科学技术与工程,2014,22(2):41-44. 被引量：5
4马勇,鲍长春.基于稀疏神经网络的说话人分割[J].北京工业大学学报,2015,41(5):662-667. 被引量：9
5马勇,鲍长春.基于高层信息特征的重叠语音检测[J].清华大学学报（自然科学版）,2017,57(1):79-83. 被引量：3
6李敬阳,李锐,王莉,王晓笛.基于变分贝叶斯改进的说话人聚类算法[J].数据采集与处理,2017,32(1):54-61. 被引量：2
7赖松轩,李艳雄.说话人聚类的初始类生成方法[J].计算机工程与应用,2017,53(3):149-153.
8李艳妮,张二华.多人会话混合语音的说话人分割[J].计算机与数字工程,2020,48(7):1558-1563.
9朱冠霖,王兆强,王异凡,李志峰,孙崇智.基于神经网络和证据融合的液压泵故障诊断研究[J].机电工程,2020,37(12):1498-1503. 被引量：15
10袁杰,王姝,王福利,孙晓辉.基于改进主观贝叶斯方法识别电熔镁炉异常工况[J].东北大学学报（自然科学版）,2021,42(2):153-159.

1崔琳,崔晨露,刘政伟,薛凯.改进MFCC和并行混合模型的语音情感识别[J].计算机科学,2023,50(S01):156-162. 被引量：4
2张福华,刘丽,朱俊东,朱再新,余大权.基于信息熵更新权重的数据自适应聚类研究[J].电子设计工程,2023,31(16):176-179.
3张佳琳,买日旦·吾守尔,古兰拜尔·吐尔洪.低资源条件下的语音合成方法综述[J].计算机工程与应用,2023,59(15):1-16.

科技创新与应用

2023年第23期

浏览历史

内容加载中请稍等...

基于DS证据理论多特征融合模型的说话人分割聚类研究

参考文献5

二级参考文献97

共引文献55

相关作者

相关机构

相关主题

浏览历史