摘要
因与原始语音具有高度相似性,经高保真设备回放的翻录语音常被不法分子用于对说话人认证(ASV)系统进行攻击,以达到非法认证的目的。为提高系统抵抗翻录语音攻击的顽健性,通过研究原始语音与翻录语音产生的实际过程,发现两者在频率域相位上有明显差异,并在此基础上提出了一种基于相位谱的翻录语音检测方法。分析讨论了FFT和不同偷录、回放设备对翻录语音检测率的影响。实验结果表明,该方法能够准确地判断待测语音是否为翻录语音,其检测率达到了99.04%。并且,将该算法加载到说话人识别系统中,使系统的等错误概率(EER)降低了约22%,有效提高了系统抵抗翻录语音攻击的性能。
Due to a high similarity between the recaptured voice recorded by high-fidelity ripping equipment and the original voice, the automatic speaker verification(ASV)system used to be attacked illegally by the recaptured voice. In order to improve the ability of resisting the attack, a recaptured voice detection method was proposed based on the difference of phase spectrum between original and recaptured voices for the ASV system. In addition, the effects of different recording and replay devices, the FFT were discussed. Experimental results show that the proposed method can accurately recognize the recording voice, of which detection rate is 99.04%.Meanwhile, the equal error rate(EER) of the ASV system has dropped about 22% with this method being integrated, which indicates that the system's ability of resisting playback attack is enhanced.
出处
《电信科学》
北大核心
2017年第8期145-154,共10页
Telecommunications Science
基金
国家自然科学基金资助项目(No.61672302
No.61300055)
浙江省自然科学基金资助项目(No.LZ15F020010
No.Y17F020051)
宁波大学科研基金资助项目(No.XKXL1405
No.XKXL1420
No.XKXL1509
No.XKXL1503)
宁波大学王宽诚幸福基金资助项目~~