End-to-End Speaker Recognition Model with Joint Supervision Based on an Attention Mechanism


Abstract: The application of deep learning models to biometrics has pushed speaker recognition into a new stage. Early deep learning models for speaker recognition were mainly deep neural networks (DNNs), which improved recognition performance to some extent but left room for improvement in both training speed and recognition accuracy. To extract local features, the authors introduce a convolutional neural network (CNN), which is less complex to train, and use skip connections to address the vanishing-gradient problem that arises during training as the number of convolutional layers grows. During training, utterances are aggregated from the frame level to the segment level with an attention mechanism, and the model is trained under the joint supervision of softmax loss and center loss. The authors report that this greatly improves the performance of CNNs for speaker recognition.
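The three ingredients named in the abstract (skip connections, attention-based aggregation from frame level to segment level, and joint supervision by softmax loss and center loss) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names, the single attention vector `w`, and the weighting factor `lam` are assumptions made for illustration, and the abstract does not state the network architecture or the loss weight actually used. The center-loss term follows the commonly used form, half the squared distance between an embedding and its class center.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def skip_connection(x, transform):
    """Return transform(x) + x, so gradients can also flow through the identity path."""
    return [t + xi for t, xi in zip(transform(x), x)]


def attention_pool(frames, w):
    """Aggregate T frame-level vectors into one segment-level vector.

    frames: list of T feature vectors, each a list of D floats.
    w: hypothetical learned attention vector of length D (an assumption).
    """
    # One scalar score per frame, normalized into weights that sum to 1.
    scores = [sum(wi * xi for wi, xi in zip(w, f)) for f in frames]
    alphas = softmax(scores)
    dim = len(frames[0])
    # Weighted sum of the frames, dimension by dimension.
    return [sum(a * f[d] for a, f in zip(alphas, frames)) for d in range(dim)]


def joint_loss(embedding, logits, label, centers, lam=0.01):
    """Softmax (cross-entropy) loss plus lam times center loss, for one utterance.

    centers: one center vector per speaker class; lam is an assumed weight.
    """
    cross_entropy = -math.log(softmax(logits)[label])
    center = centers[label]
    center_term = 0.5 * sum((e - c) ** 2 for e, c in zip(embedding, center))
    return cross_entropy + lam * center_term


# Example: pool three 2-dimensional frames, then score the pooled embedding.
frames = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
segment = attention_pool(frames, w=[0.5, -0.5])
loss = joint_loss(segment, logits=[2.0, 0.5], label=0,
                  centers=[[0.5, 0.5], [1.0, 0.0]], lam=0.01)
```

In this sketch the cross-entropy term separates speaker classes, while the center term pulls each segment-level embedding toward the center of its own class; `lam` sets the balance between the two.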

Authors: Shi Wanglei; Feng Shuang (Key Laboratory of Intelligent Financial Media of Ministry of Education, Communication University of China, Beijing 100024, China)

Affiliation: Key Laboratory of Intelligent Financial Media of Ministry of Education, Communication University of China

Source: Information & Computer (《信息与电脑》), 2020, No. 4, pp. 145-147 (3 pages)

Keywords: speaker recognition; convolutional neural network; aggregation; joint supervision

Classification code: TN912.34 [Electronics and Telecommunications: Communication and Information Systems]

