
Method of open-set speaker identification with two-level decision strategy (cited by: 12)
Abstract: To reduce the amount of speech data required and to improve processing speed and recognition accuracy, this paper proposes a new method for open-set speaker identification that makes a two-level decision using a public codebook, individual hidden Markov models (HMMs), and individual rejection thresholds. In the system implementation, an improved speech segmentation algorithm is used to extract the usable speech data from input utterances, and speaker recognition is fused with face recognition for identity verification. Experiments show that this fusion effectively reduces the equal error rate of recognition to 1%.
Source: Journal of Tsinghua University (Science and Technology), 2003, No. 4, pp. 516-520 (5 pages). Indexed in EI, CAS, CSCD, and the Peking University Core Journals list.
Funding: National "863" High-Tech Program (863-306-ZT03-01-1); National Education Revitalization Plan
Keywords: speaker recognition; speaker identification; speech segmentation; hidden Markov model; two-level decision; speech recognition

