期刊文献+

一种三层判决的说话人索引算法 被引量:1

Speaker Index Algorithm of Three-layer Criterion
下载PDF
导出
摘要 为提高说话人索引准确率,提出一种三层判决的说话人索引算法。第1层使用惩罚距离公式对说话人改变进行检测,第2层采用说话人模型自举法进行初次说话人辨认,第3层采用GMM说话人超级矢量进行判决,解决说话人模型自举法中产生的数据不匹配问题。实验结果表明,采用惩罚距离公式,与贝叶斯信息判决方法相比不需调整参数,与DISTBIC方法相比F1值提高2%,使用GMM说话人超级矢量,在说话人索引准确率和数量准确率方面分别提高8.95%、18.25%。 To improve the precision of speaker index,a speaker indexing algorithm of three-layer criterion is proposed.In the first layer,penalty distance is proposed to judge whether speaker changes.In the second layer,speaker model bootstrapping is used to identify speaker first time.In the third layer,GMM Speaker Supervector(GMMSS) is used to identify speaker further in order to settle the problem of data mismatch in speaker model bootstrapping.Experimental results show that,it is no need to tune penalty factor compared to BIC and F1 can improve 2% compared to DISTBIC;speaker indexing accuracy can improve 8.95% and the accuracy on the number of speaker can improve 18.25% by using GMMSS in speaker identification.
出处 《计算机工程》 CAS CSCD 2012年第2期184-185,共2页 Computer Engineering
基金 东莞市2010年高等院校科研机构科技计划基金资助项目(201010814014)
关键词 三层判决 说话人索引 惩罚距离 模型自举法 GMM说话人超级矢量 three-layer criterion speaker index penalty distance model bootstrapping method GMM Speaker Supervector(GMMSS)
  • 相关文献

参考文献10

  • 1Narayanan K S. Unsupervised Speaker Indexing Using Generic Models[J]. IEEE Trans. on Speech and Audio Processing, 2005, 13(5): 1004-1013.
  • 2Chen S S, Gopalakrishnan P C. Speaker, Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion[C] //Proc. of DARPA Broadcast News Transcription & Understanding Workshop. New Your, USA: [s. n.] , 1998: 127-132.
  • 3Kotti M, Moschou V, Kotropoulos C. Speaker Segmentation and Clustering[J]. Signal Processing, 2008, 88(5): 1091-1124.
  • 4Delacourt P, Wellekens. DISTBIC: A Speaker-based Segmentation for Audio Data Indexing[J]. Speech Communication, 2000, 32(1/2): 111-126.
  • 5付中华,张艳宁.在线无监督说话人检索中稳健的模型自举算法[J].软件学报,2007,18(3):608-616. 被引量:3
  • 6Kenny P, Boulianne G. Speaker and Session Variability in GMM- based Speaker Verification[J]. IEEE Trans. on Audio, Speech and Language Processing, 2007, 15(4): 1448-1460.
  • 7Chu S M, Tang Hao. Fishervoice and Semi-supervised Speaker Clustering[C] //Proc. of ICASSP’09. [S. 1.] : IEEE Press, 2009: 4089-4092.
  • 8He Q H, Yang J C. Combining GMM, Jenson’s Inequality and BIC for Speaker Indexing[J]. Electronics Letters, 2010, 46(9): 654-655.
  • 9郑继明,张萍.改进的BIC说话人分割算法[J].计算机工程,2010,36(17):240-242. 被引量:7
  • 10Nishida M, Kawahara T. Speaker Model Selection Based on Bayesian Information Criterion Applied to Unsupervised Speaker Model Indexing[J]. IEEE Trans. on Speech and Audio Processing, 2005, 13(4): 583-592.

二级参考文献7

共引文献7

同被引文献4

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部