采用局部相位量化的合成语音检测方法

A method for synthetic speech detection using local phase quantization

下载PDF

导出

摘要由于语音合成的便利性,合成伪装语音对说话人认证系统的安全构成了很大的威胁。为了进一步提升说话人认证系统的伪装语音检测能力,提出了一种利用语谱图频域信息的合成语音检测方法,它通过局部相位量化算法对语谱图频域信息进行描述。首先,将语谱图分为若干子块,然后对每个子块进行局部相位量化,经直方图统计分析后获得局部相位量化特征向量并将该特征向量作为随机森林分类器的输入特征,实现合成语音检测。实验结果表明,该方法进一步降低了合成语音检测系统的串联检测代价数值,并且具有更强的泛化能力。 Due to the convenience of speech synthesis,synthesized disguised speech poses a great threat to the secu-rity of speaker verification systems.In order to further enhance the ability of detecting the camouflage to the speaker verification system,a method of synthetic speech detection was put forward using the information in spectral domain of the synthetic speech spectrogram.The method employed the local phase quantization(LPQ)algorithm to describe frequency domain information in the speech spectrogram.Firstly,the spectrogram was divided into several sub-blocks,and then the LPQ was performed on each sub-block.After the histogram statistical analysis,the LPQ feature vector was obtained and used as the input feature of the random forest classifier to realize the synthetic speech detection.The experimental results demonstrate that the proposed method further reduces tandem detection cost func-tion(t-DCF)and has better generalization ability.

作者徐嘉简志华金宏辉杨曼 XU Jia;JIAN Zhihua;JIN Honghui;YANG Man(School of Communication Engineering,Hangzhou Dianzi University,Hangzhou 310018,China;Key Laboratory of Data Storage and Transmission Technology of Zhejiang Province,Hangzhou 310018,China)

机构地区杭州电子科技大学信工程学院浙江省数据存储传输及应用技术研究重点实验室

出处《电信科学》北大核心 2024年第2期63-71,共9页 Telecommunications Science

基金国家自然科学基金资助项目(No.61201301,No.61772166)。

关键词说话人认证伪装攻击合成语音检测局部相位量化 speaker verification spoofing attack synthetic speech detection LPQ

分类号 TP391.42 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献5

1朱长水,丁勇,袁宝华,曹红根.融合LBP和LPQ的人脸识别[J].南京师大学报（自然科学版）,2015,38(1):104-107. 被引量：7
2刘琳岚,高声荣,舒坚.基于随机森林的链路质量预测[J].通信学报,2019,40(4):202-211. 被引量：9
3陈佳,章坚武,张浙亮.基于上下文信息与注意力特征的欺骗语音检测[J].电信科学,2023,39(2):92-102. 被引量：2
4徐嘉,简志华,金宏辉,吴超,游林,吴迎笑.基于中心对称局部二值模式的合成伪装语音检测方法[J].电信科学,2023,39(1):72-78. 被引量：2
5徐剑,简志华,于佳祺,金易帆,游林,汪云路.采用完整局部二进制模式的伪装语音检测[J].电信科学,2021,37(5):91-99. 被引量：5

二级参考文献23

1Zhao W,Chellappa R, Phillips P J, et ai. Face recognition : a literature survey [ J ]. Acm Computing Surveys ( CSUR), 2003, 35 (4) : 399-458.
2Ahonen T, Hadid A, Pietikainen M. Face description with local binary patterns : Application to face recognition [ J ]. IEEE Transactions on Pattern An'Mysis and Machine Intelligence,2006,28(12) :2 037-2 041.
3Ahonen T, Rahtu E, Ojansivu V, et al. Recognition of blurred faces using local phase quamization [ C ]//19th International Conference on Pattern Recognition. Tampa: IEEE, 2008 : 1-4.
4Ahonen T,Pietikainen M. Image description using joint distribution of filter bank responses[ J ]. Pattern Recognition Letters, 2009,30(4) :368-376.
5Heikkila M, Pietikainen M, Schmid C. Description of interest regions with local binary patterns [ J ]. Pattern Recognition, 2009,42(3) :425-436.
6Zhang B,Gao Y,Zhao S,et al. Local derivative pattern versus local binary pattern:face recognition with high-order local pat- tern descriptor[ J]. IEEE Transactions on Image Processing,2010,19(2) :533-544.
7Guo Z,Zhang L,Zhang D. Rotation invariant texture classification using LBP variance(LBPV) with global matching[ J ]. Pat- tern Recognition, 2010,43 ( 3 ) : 706-719.
8Ojansivu V, Heikkila J. Blur Insensitive Texture Classification Using Local Phase Quantization [ M ]//Image and Signal Pro- cessing. Heidelberg, Berlin: Springer, 2008 : 236- 243.
9Heikkila J, Ojansivu V, Rahtu E. Improved blur insensitivity for decorrelated local phase quantization [ C ]//20th International Conference on Pattern Recognition. Istanbul : IEEE, 2010: 818- 821.
10Lei Z, Ahonen T, Pietikainen M, et al. Local frequency descriptor for low-resolution face recognition [ C ]//IEEE International Conference on Automatic Face & Gesture Recognition and Workshops. Shanghai, China:IEEE,2011:161-166.

共引文献20

1李荣.利用异或运算和编码约束的降维LDP人脸识别方法[J].计算机测量与控制,2017,25(10):171-175.
2杨恢先,唐金鑫,陶霞,姜德财,颜微.基于韦伯梯度方向直方图的人脸识别算法[J].计算机工程与应用,2017,53(15):200-205. 被引量：3
3梅星宇,李新华,鲍文霞,张东彦,梁栋.基于复频域纹理特征的植物叶片识别算法[J].江苏农业学报,2019,35(6):1334-1339. 被引量：5
4李巨虎,范睿先,陈志泊.基于颜色和纹理特征的森林火灾图像识别[J].华南理工大学学报（自然科学版）,2020,48(1):70-83. 被引量：46
5夏宇,刘伟,罗嵘,胡顺仁.基于推理模型与指数加权卡尔曼滤波的链路质量估计[J].计算机工程,2020,46(5):216-223. 被引量：1
6王烽,李泽平,林川,王忠德,黄初华.基于随机森林的带宽预测算法研究与实现[J].计算机工程与设计,2020,41(7):1892-1898. 被引量：4
7林枫,蔡延光,蔡颢,王建成.基于改进果蝇算法优化的随机森林模型[J].常熟理工学院学报,2020,34(5):51-57. 被引量：8
8舒坚,高素,陈宇斌.基于自适应广义回归神经网络的链路质量评估[J].计算机研究与发展,2020,57(12):2662-2672. 被引量：5
9安昳,曲珍,许宁,尼玛扎西.面部动态特征描述的抑郁症识别[J].中国图象图形学报,2020,25(11):2415-2427. 被引量：4
10马立川,彭佳怡,裴庆祺,朱浩瑾.高效的决策树隐私分类服务协议[J].通信学报,2021,42(8):80-89. 被引量：4

1熊枫情,罗芊芊,蒋汶秦,吕萧羽,徐平华.基于GPQ半监督神经网络的织物图像检索[J].纺织高校基础科学学报,2024,37(1):42-48.
2陈结,陈换过,肖志奇,解超.基于工况辨识的风电机组主传动系统运行状态监测[J].太阳能学报,2024,45(2):77-85.

电信科学

2024年第2期

浏览历史

内容加载中请稍等...

采用局部相位量化的合成语音检测方法

参考文献5

二级参考文献23

共引文献20

相关作者

相关机构

相关主题

浏览历史