Research on Speaker Verification With Half-Decoded Parameters of Vocoder (cited by: 2)

Abstract: A feature extraction method is proposed for the linear-prediction vocoders widely used in communications. Instead of going through the usual "decode, then extract features" pipeline, the method takes speaker features directly from the transmitted bit stream. A text-independent speaker verification system is built for the Adaptive Multi-Rate Wideband (AMR-WB) vocoder. The system uses the probabilistic GMM-UBM framework as the speaker model, takes the LPC cepstrum as the main speaker feature vector, and adds pitch-derived parameters to improve verification performance. Experiments show that the half-decoded system runs about an order of magnitude (roughly ten times) faster than a system based on reconstructed speech while reaching similar performance, and that it is robust to AMR-WB bit-rate mismatch.
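The two building blocks named in the abstract, LPC cepstrum features and GMM-UBM scoring, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the LPC coefficients have already been recovered from the bit stream (AMR-WB transmits quantized immittance spectral pairs, so a conversion step would come first), it uses diagonal-covariance Gaussians, and it omits the pitch-derived parameters. The names `lpc_to_cepstrum`, `gmm_loglik`, and `llr_score` are hypothetical.

```python
import math


def lpc_to_cepstrum(a, n_ceps):
    """Convert LPC coefficients to LPC cepstrum via the standard recursion.

    Assumes the all-pole model H(z) = 1 / A(z) with
    A(z) = 1 + a_1 z^-1 + ... + a_p z^-p, and a = [a_1, ..., a_p].
    Returns [c_1, ..., c_n_ceps] (the gain term c_0 is not computed).
    """
    p = len(a)
    c = []
    for n in range(1, n_ceps + 1):
        # Sum of k * c_k * a_{n-k} over the k for which a_{n-k} exists.
        acc = sum(k * c[k - 1] * a[n - k - 1] for k in range(max(1, n - p), n))
        a_n = a[n - 1] if n <= p else 0.0
        c.append(-a_n - acc / n)
    return c


def gmm_loglik(x, weights, means, variances):
    """Log-likelihood of one feature vector under a diagonal-covariance GMM."""
    logs = []
    for w, mu, var in zip(weights, means, variances):
        ll = math.log(w)
        for xi, m, v in zip(x, mu, var):
            ll += -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        logs.append(ll)
    top = max(logs)  # log-sum-exp for numerical stability
    return top + math.log(sum(math.exp(l - top) for l in logs))


def llr_score(frames, speaker_gmm, ubm):
    """Average per-frame log-likelihood ratio: speaker model vs. UBM.

    Each model is a (weights, means, variances) tuple. The claimed identity
    is accepted when the score exceeds a threshold chosen on held-out data.
    """
    total = sum(gmm_loglik(x, *speaker_gmm) - gmm_loglik(x, *ubm) for x in frames)
    return total / len(frames)
```

In a GMM-UBM system the speaker model is usually obtained by adapting the UBM to the enrollment data rather than training it from scratch; that adaptation step is left out here to keep the sketch short.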

Authors: Li Xiaoxian (李晓先), Dai Beiqian (戴蓓蒨), Li Hui (李辉)

Affiliation: Department of Electronic Science and Technology, University of Science and Technology of China

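Source: Journal of University of Science and Technology of China (JUSTC), 2005, No. 4, pp. 523-529 (7 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.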

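Funding: Supported by the National Natural Science Foundation of China (6027039) and the Natural Science Foundation of Anhui Province (01042205).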

Keywords: speaker verification; half-decoded parameter; pitch frequency; GMM-UBM; AMR-WB codec

Classification: TN912.34 [Electronics and Telecommunications: Communication and Information Systems]
