期刊文献+

音色变换音频信号的篡改检测技术研究 被引量:3

Study on tamper detection technology for voice transformation
下载PDF
导出
摘要 针对音色变换软件带来的社会安全问题,提出一种音频信号篡改检测方法。首先根据语音信号的混沌特性和人耳的听觉特性,利用美尔频率倒谱系数(Mel frequency cepstral coefficients,MFCC)特征提取原理,提取待测音频的杜芬频率倒谱系数(Duffing frequency cepstral coefficients,DFCC),然后将特征参数的幅度进行提升,利用支持向量机(SVM)将特征参数与语料库里的特征进行分类。分类成功的情况下,根据幅度提升的大小判断待测音频信号是否经过篡改;同时根据幅度提升的大小和待测音频的性别判断说话人的真实性别。大量的实验结果表明,该方法在音频信号的篡改检测和音频信号说话人的真实性别判断方面均具有较高的准确率,并且性能稳定。 To address the social security issue brought up by voice transformation software, a method for sound signal tamper detection is proposed. Firstly, with the extraction method of Mel frequency cepstral coefficients (MFCC),Duffing frequency cepstral coefficients(DFCC) characteristic parameters of audio signals are extracted based on the human hearing characteristics and chaos characteristics of speech signal. Then, the amplitude of characteristic parameters is enhanced and support vector machine (SVM) is used to classify the characteristic parameters and characteristics in corpus. In case of successful classification, the audio signal will be judged whether it is tampered as per the size of the amplitude enhanced. Meanwhile, the speaker gender will be judged according to the size of the amplitude enhanced and the gender of the audio. Through a large number of experiments, it shows that the method has stable performance and high accuracy both in the audio signal tampering detection and audio speaker real gender judgement.
作者 何朝霞 潘平 罗辉 HE Zhaoxia;PAN Ping;LUO Hui(College of Technology & Engineering,Yangtze University,Jingzhou 434023,China;Computer Science and Information Institute,Guizhou University,Guiyang 550025,China;School of Computer Science and Technology,Harbin Institute of Technology,Harbin 150001,China)
出处 《中国测试》 CAS 北大核心 2017年第2期98-103,共6页 China Measurement & Test
基金 贵州省科学技术基金项目(黔科合J字[2012]2132) 贵阳市科技计划项目(筑科合同[2011101]1-2) 长江大学工程技术学院科学研究发展基金(15j0401)
关键词 篡改检测 特征提取 杜芬频率倒谱系数 支持向量机 tamper detection feature extraction DFCC SVM
  • 相关文献

参考文献6

二级参考文献53

  • 1栗学丽,丁慧,徐柏龄.基于熵函数的耳语音声韵分割法[J].声学学报,2005,30(1):69-75. 被引量:34
  • 2杨莉莉,林玮,徐柏龄.汉语耳语音孤立字识别研究[J].应用声学,2006,25(3):187-192. 被引量:8
  • 3蒋明哲,廖建新,朱晓民,杨波.基于智能网的变声彩话业务的设计与实现[J].电信工程技术与标准化,2006,19(5):84-87. 被引量:1
  • 4孙卓,岳振军.一种汉语语音变换技术[J].电声技术,2007,31(6):37-40. 被引量:1
  • 5LIEBERMAN P, BLUMSTEIN S E. Speech physiology, speech perception, and acoustic phonetics (Cambridge studies in speech science and communication) [M]. London : Cambridge University Press, 1988 : 356-363.
  • 6YAIR M Y, CHAZAN E. Super resolution pitch deterruination of speech signals [J]. IEEE Trans. on Signal Processing, 1991.39 ( 1 ): 40-48.
  • 7MOUI.INES E,CHARPENTIER F. Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones[J]. Speech Communication, 1990,9 (5/6) : 453-467.
  • 8Stylianou Y. Voice transformation: a survey[C]// IEEE International Conference on Acoustics, Speech and Signal Processing. Taipei, Taiwan: IEEE Press, 2009 : 3585-3588.
  • 9Atal B S, Hanauer S L. Speech analysis and synthesis by linear prediction of the speech wave[J]. The Journal of the Acoustical Society of America, 1971,50(2):637-655.
  • 10Desai S, Raghavendra E V, Yegnanarayana B, et al. Voice conversion using artificial neural networks [C]//IEEE International Conference on Acoustics, Speech and Signal Processing. Taipei, Taiwan: IEEE Press, 2009 : 3893-3896.

共引文献25

同被引文献25

引证文献3

二级引证文献13

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部