基于卷积神经网络的法庭说话人识别研究

Research on Forensic Speaker Recognition Based on Convolutional Neural Networks

下载PDF

导出

摘要传统的法庭说话人识别方法存在对语音数据建模能力差、特征提取难以及容易受噪声干扰影响等问题,为了改进这些问题,提出一种基于卷积神经网络的法庭说话人识别方法。该方法以AlexNet网络为基础进行参数调整,为了弥补ReLU函数作为激活函数时易出现神经元坏死和偏移的现象,融合Tanh和ReLU函数的特性,构造一种新的TR函数作为网络的激活函数。同时,为了避免人工提取语音特征的主观性和不全面性,在实验中将语音转换成声纹图作为网络输入。实验结果表明,激活函数为TR函数时,该方法在法庭说话人识别数据集的准确率达到了92.24%,在花朵图像公开数库的准确率达到了96.13%,效果均好于Tanh和ReLU函数。 In the traditional court speaker recognition method,there are some problems such as poor modeling ability of speech data,difficulty in feature extraction and vulnerability to noise interference.In order to improve these problems,a court speaker recognition method based on convolution neural network is proposed.This method is based on AlexNet network to adjust the parameters,in order to make up for the phenomenon of neuronal necrosis and migration when ReLU function is used as activation function,the characteristics of Tanh and ReLU function are fused.A new TR function is constructed as the activation function of the network.At the same time,in order to avoid the subjectivity and incompleteness of manual extraction of speech features,in this paper,the speech is converted into sound pattern as the input of the network.The experimental results show that when the activation function is a TR function,the accuracy on the court speaker recognition data set is 92.24%,and the accuracy on the flower image open database is 96.13%,which is better than the Tanh and ReLU function.

作者南兆营 NAN Zhaoying(Criminal Investigation Police University of China,Shenyang 110854,China)

机构地区中国刑事警察学院

出处《电声技术》 2021年第2期23-27,31,共6页 Audio Engineering

关键词卷积神经网络法庭说话人识别激活函数声纹图 convolutional neural network forensic speaker recognition activation function spectrogram

分类号 TP391.41 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1稿约[J].贵州警官职业学院学报,2007,19(6).
2声明[J].中北大学学报（社会科学版）,2021,37(2):99-99.
3声明[J].中北大学学报（社会科学版）,2021,37(3):143-143.
4南兆营.基于参数迁移和C-LSTM的说话人识别研究[J].电声技术,2020,44(11):37-41. 被引量：1
5颜琪.花卉图像识别的设计与研究[J].电脑知识与技术,2021,17(2):175-176. 被引量：2
6《宇航材料工艺》征订启事[J].宇航材料工艺,2020,50(6):54-54.
7朱龙珠,田诺,张全.基于语义分析的语音情感在线识别方法研究[J].电子设计工程,2021,29(11):151-154. 被引量：1
8曹蠡馨,董秤均,杨燕,梁雨晴,罗榆希,谢知言,王洁梅,夏波,伍文彬.黄连解毒汤对Aβ_(1-42)诱导AD大鼠学习记忆能力及胆碱能系统的影响[J].中国实验方剂学杂志,2021,27(10):23-30. 被引量：12
9孙建威,杨新明,张瑛.丙戊酸钠联合甲强龙对大鼠脊髓损伤的影响及其机制[J].中华解剖与临床杂志,2021,26(2):214-222. 被引量：3

电声技术

2021年第2期

浏览历史

内容加载中请稍等...

基于卷积神经网络的法庭说话人识别研究

相关作者

相关机构

相关主题

浏览历史