期刊文献+

基于DSP开集说话人识别系统的实时实现 被引量:2

Realtime Implementation of Open-Set Speaker-Recognition System by DSP
下载PDF
导出
摘要 为了给说话人识别系统的应用提供一个较为重要的技术途径,利用美国TI公司生产的TMS320VC5402DSP作为CPU开发的DSP(D igital S ignal Processor)系统,实时实现了一个基于说话人自适应的开集说话人识别系统。为了提高系统的处理速度和识别的准确性,系统采用少量的语音数据产生说话人模型,在改进的矢量量化方法的基础上,利用一种说话人自适应的阈值处理算法,有效地提高了系统的识别率。同时对降低算法的计算量、数据的存储量进行了较深入的研究。从说话人识别的响应时间、训练时间等综合方面考虑,使真正意义上的说话人识别系统在DSP芯片上实现成为可能。实验表明,该系统在普通机房条件下,可以取得较好的实验效果,系统识别时间小于1 s,完全满足实时性的要求。 In order to provide an important method for the practical applications of a speaker-recognition system, this paper presents an open-set speaker-recognition Real-time system based on speaker adaptive dynamic threshold, which has realized with TMS320VC5402 digital signal processor. In order to improve the processing speed and the recognition precision, it uses the little speech data to get the speaker's voice model, and based on the revised vector quantization algorithm, it presents a dynamic threshold method, which can improve the recognition accuracy greatly. At the same time, the research of the decreasing the amount of operation and storage has been conducted thoroughly. On the consideration of some factors, such as the respond time and train time of the system, it is possible to realize a real speaker recognition system by Digital Signal Processor. Experiment results show that the recognition rate of this system is satisfied, and the recognition time of the system is less than 1 second, which can meet the requirement of real-time system.
出处 《吉林大学学报(信息科学版)》 CAS 2006年第3期252-258,共7页 Journal of Jilin University(Information Science Edition)
基金 长春市科技计划基金资助项目(05GG18)
关键词 说话人识别 开集 说话人自适应阈值 MEL倒谱系数 数字信号处理器 speaker recognition system open-set speaker adaptive dynamic threshold Mel-frequency cepstral coefficients digital signal processor
  • 相关文献

参考文献16

  • 1DENG Jiu-qing,HU Qi-xiu.Open Set Text-Independent Speaker Recognition Based on Set-Score Pattern Classification [ C ]//IEEE International Conference on Acoustics,Speech,and Signal Processing.Hong Kong:IEEE Press,2003,2:73-76.
  • 2REYNOLDS D A.An Overview of Automatic Speaker Recognition Technology [ C ] // IEEE International Conference on Acoustics,Speech,and Signal Processing.Orlando,Florida:IEEE Press,2002:4072-4075.
  • 3PRZYBOCKI MARK,ALVIN MARTIN.The NIST Speaker Recognition Evaluation-Overview,Methodology,Systems,Results,Perspective [J].Speech Communications,2000,31:225-254.
  • 4REAL E C,BAUMANN A H.Open Set Classification Using Tolerance Intervals [ C ] // Signals,Systems and Computers,Conference Record of the Thirty-Fourth Asilomar Conference.Pacific Grove,USA:IEEE Press,2000,(2):1217-1221.
  • 5SOVKA P,POLLAK P,KIBIC J.Extended Spectral Subtraction [ C ] // Signal Processing Ⅷ:Theories and Applications (Proceedings of EUSIPCO-96).Trieste,Italy:European Association for Signal Processing (EURASIP),1996:963-966.
  • 6何致远,胡起秀,徐光祐.两级决策的开集说话人辨认方法[J].清华大学学报(自然科学版),2003,43(4):516-520. 被引量:12
  • 7王让定,柴佩琪.语音倒谱特征的研究[J].计算机工程,2003,29(13):31-33. 被引量:50
  • 8李霄寒,戴蓓倩,方绍武,刘鸣.高阶MFCC的话者识别性能及其噪声鲁棒性[J].信号处理,2001,17(2):124-129. 被引量:14
  • 9邵央,刘丙哲,李宗葛.基于MFCC和加权矢量量化的说话人识别系统[J].计算机工程与应用,2002,38(5):127-128. 被引量:34
  • 10FURUI SADAOKI.Recent Advances in Speaker Recognition [J].Pattern Recognition Letters,1997 (18):859-872.

二级参考文献26

  • 1何致远.说话人确认和辨认的研究与实现[D].北京:清华大学,2002.
  • 2何致远 胡起秀 姚志宏.基于HMM的数字串提示文本的说话人确认[A]..第九届全国多媒体技术学术会议论文集[C].北京,2000.215—219.
  • 3SI Luo, HU Qixiu. Two-stage speaker identification system based on VQ and NBDGMM [A]. Proc of the Sixth Inter Conf on Spoken Language Processing [C]. Beijing, 2000.
  • 4Fakotakis N, Sirigos J. A high performance text independent speaker recognition system based on vowel spotting and neural nets [A]. Proc Inter Conf on Acoustics, Speech and Signal Processing[C]. Atlanta, USA. 1996. 661-664.
  • 5Furui S. Recent advances in speaker recognition [J]. Lecture Notes in Computer Science, 1997, 1206:237-252.
  • 6Li Qi, Juang Biinghwang, Lee Chinhui, et al. Recent advancements in automatic speaker authentication [J]. IEEE Robotics and Automation Magazine, 1999, 3:24 - 34.
  • 7Furui S. Cepstral analysis technique for automatic speaker verification [J]. IEEE Trans on Acoustics, Speech and Signal Processing, 1981, 29(2) : 254 - 272.
  • 8JIN Qin, SI Luo, HU Qixiu. A high-performance text-independent speaker identification system based on BCDM [A]. Proc of the Fifth Inter Conf on Spoken Language Processing[C]. Sydney, Australia. 1998.
  • 9Huang Xuedong, Acero A, Hon H W. Spoken Language Processing.Prentice Hall,2001.
  • 10Young S, Kershaw D, Odell J, et al. The HTK Book.Microsoft Corporation &CUED,2000.

共引文献111

同被引文献6

引证文献2

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部