Speaker Recognition Based on AOF-LCNN in Audio Playback Attack Scenarios

Abstract: To address the over-fitting problem of LCNN-based speaker recognition systems under audio playback (replay) attacks, a neural network based on AOF-LCNN is proposed. A new DNN classifier is designed as the back-end classification network and cascaded after the LCNN, forming a new end-to-end network. Because the MFM structure in the LCNN may be a cause of the over-fitting, the DNN back end uses LeakyReLU as its activation function to offset this effect. Results on the ASVspoof 2017 dataset show that the proposed method achieves an equal error rate (EER) of 3.59% on the Dev set and 13.79% on the Eval set, which is 2.12% and 3.51% lower, respectively, than the EER of the LCNN system. The proposed method alleviates the over-fitting problem to some extent, improves the robustness of the system, and reduces its EER, thereby improving recognition performance.
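The abstract names three building blocks without defining them: the MFM activation inside the LCNN, the LeakyReLU activation in the DNN back end, and the EER metric. The sketch below illustrates each in plain Python. It is not the authors' implementation. The MFM shown is the common channel-split form used in Light CNN, the slope value 0.01 is an assumed default, and the EER routine is a simple threshold scan, not the official ASVspoof scoring tool. All function names are hypothetical.

```python
from typing import List, Sequence, Tuple


def mfm(features: Sequence[float]) -> List[float]:
    """Max-Feature-Map: split the channels into two halves and keep
    the element-wise maximum, halving the channel count."""
    if len(features) % 2 != 0:
        raise ValueError("MFM needs an even number of channels")
    half = len(features) // 2
    return [max(a, b) for a, b in zip(features[:half], features[half:])]


def leaky_relu(features: Sequence[float],
               negative_slope: float = 0.01) -> List[float]:
    """LeakyReLU: pass positive values through, scale negative values
    by a small slope (0.01 here is an assumption, not from the paper)."""
    return [x if x > 0 else negative_slope * x for x in features]


def equal_error_rate(genuine_scores: Sequence[float],
                     spoof_scores: Sequence[float]) -> Tuple[float, float]:
    """Return (eer, threshold). A trial is accepted as genuine when its
    score is >= threshold. The EER is taken at the candidate threshold
    where false acceptance and false rejection rates are closest."""
    best_gap, best_eer, best_threshold = None, 0.0, 0.0
    for t in sorted(set(genuine_scores) | set(spoof_scores)):
        # False acceptance: spoofed (replayed) trials scored as genuine.
        far = sum(s >= t for s in spoof_scores) / len(spoof_scores)
        # False rejection: genuine trials scored as spoofed.
        frr = sum(s < t for s in genuine_scores) / len(genuine_scores)
        gap = abs(far - frr)
        if best_gap is None or gap < best_gap:
            best_gap, best_eer, best_threshold = gap, (far + frr) / 2, t
    return best_eer, best_threshold


if __name__ == "__main__":
    print(mfm([1.0, -2.0, 0.5, 3.0]))        # [1.0, 3.0]
    print(leaky_relu([1.0, -2.0]))
    genuine = [0.9, 0.8, 0.7, 0.4]           # toy scores, not real data
    spoof = [0.6, 0.3, 0.2, 0.1]
    print(equal_error_rate(genuine, spoof))  # (0.25, 0.6)
```

MFM discards the smaller value of every channel pair, whereas LeakyReLU keeps a scaled copy of negative inputs. That difference is consistent with the abstract's motivation for using LeakyReLU in the back end, although the paper's exact layer configuration is not given here.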

Authors: LI Bo; CAI Xiaodong; HOU Zhenzhen; CHEN Si (School of Information and Communication, Guilin University of Electronic Technology, Guilin 541004, China)

Affiliation: School of Information and Communication, Guilin University of Electronic Technology

Source: Journal of Guilin University of Electronic Technology, 2020, No. 1, pp. 13-17 (5 pages)

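Funding: Xinjiang Key Research and Development Program (2018B03022-1, 2018B03022-2); Guilin University of Electronic Technology Graduate Education Innovation Program (2017YJCX29).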

Keywords: speaker recognition; audio playback attack; AOF-LCNN

Classification: TN912.34 [Electronics and Telecommunications: Communication and Information Systems]
