期刊文献+

改进的T^2-BIC说话人二级分割算法 被引量:1

Improved Two-stage T^2-BIC Algorithm for Speaker Segmentation
下载PDF
导出
摘要 针对传统T2-BIC算法累积误差较大、召回率不高的问题,提出一种改进的T2-BIC说话人二级分割算法。第1级采用改进的滑动窗口检测搜索窗中的T2统计量峰值,利用贝叶斯信息准则(BIC)对峰值进行确认,第2级利用分步解决的思想处理由于BIC可信度过低而漏选的分割点。实验结果表明,与同类算法相比,该算法分割效果较好,准确率、召回率和综合性能都有所提高。 This paper proposes an improved two-stage T2-BIC algorithm for speaker segmentation,because traditional T2-BIC algorithm has the problems of a bigger accumulated error and a lower recall ratio.In the first stage,the peak position of T2 statistic in search window is detected by using improved sliding variable-size analysis window,and Bayesian Information Criterion(BIC) algorithm is used to acknowledge the peaks.In the second stage,the idea of divide-and-conquer is used to detect the missed turns because of low BIC reliability.Experimental result shows that compared with other algorithms,the improved algorithm achieves better performance,and improves the precision,recall and F measure.
出处 《计算机工程》 CAS CSCD 北大核心 2011年第6期291-292,F0003,共3页 Computer Engineering
基金 重庆市教育委员会科学技术研究基金资助项目(KJ080524)
关键词 T2统计量 贝叶斯信息准则 T2-BIC算法 分步解决 T2 statistic Bayesian Information Criterion(BIC) T2-BIC algorithm divide-and-conquer
  • 相关文献

参考文献6

  • 1Chcn Shaobing, Gopalakrishnan R. Speaker Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion[C]//Proc. of DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, USA: [s. n.], 1998.
  • 2Delacourt D A. Wellekens C J. DISTBIC: A Speaker-based Segmentation for Audio Data Indexing[J]. Speech Communication, 2000. 32(1/2): 1 l 1-126.
  • 3Cheng Shi-sian, Wang Hsin-min, Fu Hsin-chia. BIC-based Speaker Segmentation Using Divide-and-conquer Strategies with Application to Speaker Diarization[J]. IEEE Trans. on Audio, Speech and Language Ploccssin, 2010. 18(1): 141-157.
  • 4张世磊,张树武,徐波.一种两层次无监督的音频分割算法[J].中文信息学报,2007,21(2):106-111. 被引量:5
  • 5Zhou Bowen, Hansen J H L. Efficient Audio Stream Segmentation via the Combined T^2 Statistic and Bayesian Information Criterion[J]. IEEE Trans. on Speech and Audio Processing, 2005, 13(4): 467-474.
  • 6余小清,谭海英.一种改进型BIC话者改变检测算法[J].上海大学学报(自然科学版),2007,13(4):403-408. 被引量:2

二级参考文献25

  • 1NIST Spoken Language Technology Evaluations: Benchmark Tests [EB/OL]. http://www. nist. gov/speech/tests/index. htm.
  • 2Zhou B, Hansen J. Efficient audio stream segmentation via T2 statistic based Bayesian information criterion[J]. IEEE Transactions on Speech Audio Process,2005, 13(4): 467-474.
  • 3Chen S, Gopalakrishnan P. Speaker, environment and channel change detection and clustering via the Bayesian information criterion [A]. DARPA Broadcast News Trans. and Under [C]. Workshop, 1998.8.
  • 4Delacourt P, Wellekens CJ. DISTBIC: a speaker-based segmentation for audio data indexing [J].Speech Communication, 2000, 32: 111-126.
  • 5Lu L, Zhang HJ. Real-Time Unsupervised Speaker Change Detection [A]. In: Proceedings of ICPR (2)2002 [C]. Quebec, Canada, 2002: 358-361.
  • 6Cheng S, Wang H. METRIC-SEQDAC: A Hybrid Approach for Audio Segmentation [A]. In: Proceedings of ICSLP2004 [C]. Jeju Island, Korea, 2004:1617-1620.
  • 7Cheng S, Wang H. A Sequential Metric-based Audio Segmentation Method via The Bayesian Information Criterion [A]. In: Proceedings of Eurospeech2003[C]. Geneva, Switzerland, 2003: 945-948.
  • 8Zhou B, Hansen J. Unsupervised Audio Stream Segmentation and Clustering Via the Bayesian Information Criterion [A]. In: Proceedings of ICSLP2000[C]. China, 2000:714-717.
  • 9J. Ajmera. Robust Audio Segmentation [D]. Ph. D.Thesis, 2004.
  • 10KEMP T,SCHMIDT M,WESTPHAL M,et al.Strategies for automatic segmentation of audio data[C]// Proceedings of the ICASSP,Istanbul,Turkey.2000:1423-1426.

共引文献5

同被引文献9

引证文献1

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部