改进的T^2-BIC说话人二级分割算法被引量：1

Improved Two-stage T^2-BIC Algorithm for Speaker Segmentation

下载PDF

导出

摘要针对传统T2-BIC算法累积误差较大、召回率不高的问题,提出一种改进的T2-BIC说话人二级分割算法。第1级采用改进的滑动窗口检测搜索窗中的T2统计量峰值,利用贝叶斯信息准则(BIC)对峰值进行确认,第2级利用分步解决的思想处理由于BIC可信度过低而漏选的分割点。实验结果表明,与同类算法相比,该算法分割效果较好,准确率、召回率和综合性能都有所提高。 This paper proposes an improved two-stage T2-BIC algorithm for speaker segmentation,because traditional T2-BIC algorithm has the problems of a bigger accumulated error and a lower recall ratio.In the first stage,the peak position of T2 statistic in search window is detected by using improved sliding variable-size analysis window,and Bayesian Information Criterion（BIC） algorithm is used to acknowledge the peaks.In the second stage,the idea of divide-and-conquer is used to detect the missed turns because of low BIC reliability.Experimental result shows that compared with other algorithms,the improved algorithm achieves better performance,and improves the precision,recall and F measure.

作者郑继明司可宁

机构地区重庆邮电大学数理学院重庆邮电大学计算机科学与技术学院

出处《计算机工程》 CAS CSCD 北大核心 2011年第6期291-292,F0003,共3页 Computer Engineering

基金重庆市教育委员会科学技术研究基金资助项目(KJ080524)

关键词 T2统计量贝叶斯信息准则 T2-BIC算法分步解决 T2 statistic Bayesian Information Criterion（BIC） T2-BIC algorithm divide-and-conquer

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献6

1Chcn Shaobing, Gopalakrishnan R. Speaker Environment and Channel Change Detection and Clustering via the Bayesian Information Criterion[C]//Proc. of DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, USA: [s. n.], 1998.
2Delacourt D A. Wellekens C J. DISTBIC: A Speaker-based Segmentation for Audio Data Indexing[J]. Speech Communication, 2000. 32(1/2): 1 l 1-126.
3Cheng Shi-sian, Wang Hsin-min, Fu Hsin-chia. BIC-based Speaker Segmentation Using Divide-and-conquer Strategies with Application to Speaker Diarization[J]. IEEE Trans. on Audio, Speech and Language Ploccssin, 2010. 18(1): 141-157.
4张世磊,张树武,徐波.一种两层次无监督的音频分割算法[J].中文信息学报,2007,21(2):106-111. 被引量：5
5Zhou Bowen, Hansen J H L. Efficient Audio Stream Segmentation via the Combined T^2 Statistic and Bayesian Information Criterion[J]. IEEE Trans. on Speech and Audio Processing, 2005, 13(4): 467-474.
6余小清,谭海英.一种改进型BIC话者改变检测算法[J].上海大学学报（自然科学版）,2007,13(4):403-408. 被引量：2

二级参考文献25

1NIST Spoken Language Technology Evaluations: Benchmark Tests [EB/OL]. http://www. nist. gov/speech/tests/index. htm.
2Zhou B, Hansen J. Efficient audio stream segmentation via T2 statistic based Bayesian information criterion[J]. IEEE Transactions on Speech Audio Process,2005, 13(4): 467-474.
3Chen S, Gopalakrishnan P. Speaker, environment and channel change detection and clustering via the Bayesian information criterion [A]. DARPA Broadcast News Trans. and Under [C]. Workshop, 1998.8.
4Delacourt P, Wellekens CJ. DISTBIC: a speaker-based segmentation for audio data indexing [J].Speech Communication, 2000, 32: 111-126.
5Lu L, Zhang HJ. Real-Time Unsupervised Speaker Change Detection [A]. In: Proceedings of ICPR (2)2002 [C]. Quebec, Canada, 2002: 358-361.
6Cheng S, Wang H. METRIC-SEQDAC: A Hybrid Approach for Audio Segmentation [A]. In: Proceedings of ICSLP2004 [C]. Jeju Island, Korea, 2004:1617-1620.
7Cheng S, Wang H. A Sequential Metric-based Audio Segmentation Method via The Bayesian Information Criterion [A]. In: Proceedings of Eurospeech2003[C]. Geneva, Switzerland, 2003: 945-948.
8Zhou B, Hansen J. Unsupervised Audio Stream Segmentation and Clustering Via the Bayesian Information Criterion [A]. In: Proceedings of ICSLP2000[C]. China, 2000:714-717.
9J. Ajmera. Robust Audio Segmentation [D]. Ph. D.Thesis, 2004.
10KEMP T,SCHMIDT M,WESTPHAL M,et al.Strategies for automatic segmentation of audio data[C]// Proceedings of the ICASSP,Istanbul,Turkey.2000:1423-1426.

共引文献5

1常辽豫,余小清,万旺根,李昌莲,许雪琼.MP3压缩域中语音分割的研究与实现[J].计算机应用,2009,29(4):1188-1192. 被引量：3
2王志明,周序生.基于定长窗分层检测的音频分割算法[J].计算机仿真,2009,26(9):350-354. 被引量：1
3高福友,陈雁翔.一种基于说话者的无监督语音分割算法[J].合肥工业大学学报（自然科学版）,2010,33(5):683-686. 被引量：3
4郑继明,张萍.改进的BIC说话人分割算法[J].计算机工程,2010,36(17):240-242. 被引量：7
5陈国艳,张颖,梁德群.基于BIC准则的图像分割算法[J].辽宁工程技术大学学报（自然科学版）,2016,35(11):1359-1362. 被引量：1

同被引文献9

1Taras Butko,Climent Nadeu. Audio segmentation of broadcast news in the Albayzin-2010 evaluation: Overview, results, and discussion [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2011 (1): 1-10.
2Sebastien Lefevre, Nicole Vincent. A two level strategy for au- dio segmentation[J]. Journal of Digital Signal Processing, 2010, 21 (2): 270-277.
3Dalibor Mitrovic, Matthias Zeppelzauer, Christian Breithene- der. Features for content-based audio retrieval [J]. Journal of Advances in Computer, 2010, 78 (10): 71-150.
4Cheng Shisian, Wang Hsinmin, Fu Hsinchia. BIC-based au- dio segmentation by divide and conquer [C] //International Conference on Acoustics, 2008: 4841-4844.
5郑继明,俞佳.基于GLR距离和BIC的混合音频分割算法[J].计算机工程与设计,2009,30(13):3120-3123. 被引量：3
6王志明,张瑞杰,李弼程.基于分层熵检测的音频分割算法[J].科学技术与工程,2009,9(17):5012-5016. 被引量：1
7张瑞杰,李弼程,屈丹.基于可信度变化趋势的音频分割算法[J].计算机工程,2010,36(8):177-179. 被引量：3
8于俊清,胡小强,孙凯.改进的音频混合分割方法[J].计算机辅助设计与图形学学报,2010,22(7):1174-1181. 被引量：4
9郑继明,张萍.改进的BIC说话人分割算法[J].计算机工程,2010,36(17):240-242. 被引量：7

引证文献1

1冷娇娇,赵彤洲,方晖,李翔,李碧.基于方差稳定性度量的乐器音频分割算法[J].计算机工程与设计,2016,37(3):768-772. 被引量：4

二级引证文献4

1刘莹,赵彤洲,江逸琪,柴悦,李翔.基于自相关函数的钢琴乐音改进识别算法[J].武汉工程大学学报,2018,40(2):208-213. 被引量：6
2刘莹,赵彤洲,邹冲,赵娜.基于频谱包络分析的音乐推荐算法[J].软件导刊,2018,17(6):74-76. 被引量：4
3刘超.基于频谱包络的钢琴乐音仿真模型构建[J].自动化技术与应用,2021,40(6):104-108. 被引量：4
4杨静.基于三维时空域的音符信号切分识别方法研究[J].科技通报,2019,35(9):119-122. 被引量：1

1储岳中.一类基于贝叶斯信息准则的k均值聚类算法[J].安徽工业大学学报（自然科学版）,2010,27(4):409-412. 被引量：15
2赵凯,史长琼,张理阳.基于聚类分析的P2P流量识别[J].长沙理工大学学报（自然科学版）,2010,7(3):58-62. 被引量：3
3白志杰,李弼程,彭天强.基于BIC的新闻视频近似重复帧检测方法[J].计算机应用,2009,29(6):1694-1695.
4邸若海,高晓光,郭志高.基于改进BIC评分的贝叶斯网络结构学习[J].系统工程与电子技术,2017,39(2):437-444. 被引量：10
5许明,韩军伟,郭雷,尹文杰.利用模型选择确定视觉词袋模型中词汇数目[J].计算机工程与应用,2011,47(31):148-150. 被引量：3
6张端金,汪爱娟.基于改进的小波核主元分析故障检测[J].郑州大学学报（工学版）,2015,36(1):97-100. 被引量：4
7单宝明,蔡漫漫.基于摄像头的直立行走智能车控制系统设计[J].甘肃科学学报,2017,29(1):25-29. 被引量：1
8于俊清,胡小强,孙凯.改进的音频混合分割方法[J].计算机辅助设计与图形学学报,2010,22(7):1174-1181. 被引量：4
9郭鹏,李乃祥,刘同海.基于进化MCMC的DBN学习算法[J].计算机工程,2011,37(10):143-145.
10谭立球,夏利民,谷士文.基于信息瓶颈算法的图像分割[J].计算机工程,2008,34(18):215-216.

计算机工程

2011年第6期

浏览历史

内容加载中请稍等...

改进的T^2-BIC说话人二级分割算法被引量：1

参考文献6

二级参考文献25

共引文献5

同被引文献9

引证文献1

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

改进的T^2-BIC说话人二级分割算法 被引量：1

参考文献6

二级参考文献25

共引文献5

同被引文献9

引证文献1

二级引证文献4

相关作者

相关机构

相关主题

浏览历史

改进的T^2-BIC说话人二级分割算法被引量：1