Research on Methods for Modeling Within-Speaker Variability in Forensic Voice Comparison (cited by: 2)

Study on Modeling Method of Inter-Speaker Variability in Forensic Voice Comparison
Abstract: To address the shortage of voice samples from the person under examination in forensic speaker recognition, this paper proposes an alternative method for modeling a speaker's own (within-speaker) variability, together with a corresponding variance control algorithm. The method uses a reference database recorded under the same conditions to build multiple same-speaker score models for the recognition system. These stand in for the score models that would otherwise be obtained by comparing multiple non-contemporaneous voice samples from the person under examination, and they yield a statistical model that reflects the speaker's own variability. Within the current likelihood-ratio framework for evaluating the strength of forensic evidence, the method is validated using MFCC (Mel Frequency Cepstral Coefficients) and GFCC (Gammatone Frequency Cepstral Coefficients) features, which are also combined by feature-level and decision-level fusion. Experimental results show that the method achieves a high recognition rate and good stability in both clean and noisy speech conditions, and that feature-level fusion further improves the performance of the recognition system.
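As a rough illustration of the likelihood-ratio framework the abstract refers to, the sketch below computes a score-based likelihood ratio from two sets of comparison scores. It is not the authors' method: the Gaussian score models, the function names, and the score values are all assumptions made for illustration, and the abstract does not describe the actual score model or the variance control algorithm.

```python
import math
from statistics import mean, stdev


def gaussian_pdf(x, mu, sigma):
    """Density of a normal distribution N(mu, sigma^2) at x."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def likelihood_ratio(score, same_scores, diff_scores):
    """Score-based likelihood ratio.

    Assumption: same-speaker and different-speaker scores are each
    modeled by a single Gaussian fitted to reference-database scores.
    """
    mu_s, sd_s = mean(same_scores), stdev(same_scores)
    mu_d, sd_d = mean(diff_scores), stdev(diff_scores)
    return gaussian_pdf(score, mu_s, sd_s) / gaussian_pdf(score, mu_d, sd_d)


# Hypothetical scores, invented for this example only.
same_scores = [2.1, 1.8, 2.4, 2.0, 1.7]     # same-speaker comparisons
diff_scores = [-1.0, -0.4, 0.2, -0.7, -0.1]  # different-speaker comparisons

lr_high = likelihood_ratio(2.0, same_scores, diff_scores)
lr_low = likelihood_ratio(-0.5, same_scores, diff_scores)
```

A ratio above 1 supports the same-speaker hypothesis and a ratio below 1 supports the different-speaker hypothesis; here `lr_high` falls in the first case and `lr_low` in the second.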
Authors: WANG Huapeng (王华朋); JIANG Nan (姜囡); LIU En (刘恩); CHAO Yadong (晁亚东) (Department of Audio-Visual Data Inspection Technology, Criminal Investigation Police University of China, Shenyang 110854, China)
Source: Computer Engineering and Applications (《计算机工程与应用》), indexed in CSCD and the Peking University Core Journals list; 2019, No. 8, pp. 110-115, 214 (7 pages in total)
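Funding: Key Project of the National Social Science Fund of China, 2016 (No. 16AYY015); Liaoning Province Key Research and Development Program (No. 2017231006); Ministry of Public Security Project on Public Security Theory and Soft Science (No. 2017231006)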
Keywords: likelihood ratio; evidence strength; modeling; Mel Frequency Cepstral Coefficients (MFCC); Gammatone Frequency Cepstral Coefficients (GFCC)
