Broadcast News Segmentation and Classification Based on Hidden Markov Chains (cited by: 7)

HIDDEN MARKOV MODEL BASED BROADCAST NEWS SEGMENTATION AND CLASSIFICATION

Abstract: This paper proposes an algorithm for segmenting and classifying broadcast news with hidden Markov models (HMMs), which model stochastic time-series data well. First, an HMM with two hidden semantic states coarsely segments the raw broadcast news into two parts: prelude/finale and speech. Next, three HMMs pre-classify the speech clips as anchorperson introductions, commercials, or weather forecasts by maximum likelihood. Finally, the rate of semantic change is used to identify on-the-spot news reports, which completes the fine-grained segmentation and classification.
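The maximum-likelihood step in the abstract can be illustrated with a minimal sketch. It assumes discrete (quantised) clip features and uses the scaled forward algorithm to score a non-empty observation sequence under each class HMM; the model parameters, the symbol alphabet, and the names `log_likelihood`, `classify`, and `MODELS` are hypothetical toy choices, not the features or topology used in the paper.

```python
import math


def log_likelihood(obs, start, trans, emit):
    """Scaled forward algorithm: log P(obs | HMM) for discrete symbols."""
    n = len(start)
    # Initialise with the first observation (obs is assumed non-empty).
    alpha = [start[i] * emit[i][obs[0]] for i in range(n)]
    total = 0.0
    for symbol in obs[1:] + [None]:
        # Rescale alpha to sum to 1 and accumulate the log of the scale.
        scale = sum(alpha)
        if scale == 0.0:
            return float("-inf")
        alpha = [a / scale for a in alpha]
        total += math.log(scale)
        if symbol is None:  # sentinel: all observations consumed
            break
        # Propagate one step through the transition and emission matrices.
        alpha = [
            sum(alpha[i] * trans[i][j] for i in range(n)) * emit[j][symbol]
            for j in range(n)
        ]
    return total


def classify(obs, models):
    """Return the label whose HMM gives obs the highest log-likelihood."""
    return max(models, key=lambda label: log_likelihood(obs, *models[label]))


# Hypothetical toy models: 2 hidden states, 3 quantised feature symbols.
START = [0.5, 0.5]
TRANS = [[0.9, 0.1], [0.1, 0.9]]
MODELS = {
    "anchorperson": (START, TRANS, [[0.8, 0.1, 0.1], [0.7, 0.2, 0.1]]),
    "commercial": (START, TRANS, [[0.1, 0.8, 0.1], [0.2, 0.7, 0.1]]),
    "weather": (START, TRANS, [[0.1, 0.1, 0.8], [0.1, 0.2, 0.7]]),
}

if __name__ == "__main__":
    print(classify([0, 0, 0, 1, 0], MODELS))
```

Working in log space with per-step rescaling avoids numerical underflow on long clips, which is why the sketch does not multiply raw probabilities directly.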

Authors: Zhuang Yueting (庄越挺), Mao Yi (毛祎), Wu Fei (吴飞), Pan Yunhe (潘云鹤)

Affiliation: Institute of Artificial Intelligence, Zhejiang University

Source: Journal of Computer Research and Development (《计算机研究与发展》; indexed in EI, CSCD, and the Peking University Core Journals list), 2002, No. 9, pp. 1057-1063 (7 pages)

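Funding: Doctoral Program Research Fund of the Ministry of Education (20010335049); Excellent Young Teachers Fund of the Ministry of Education; Funding Program for Key Teachers in Higher Education Institutions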

Keywords: hidden Markov model, broadcast news, audio clip features, threshold, segmentation and classification algorithm, audio signal, speech recognition, multimedia

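Classification codes: TN912.34 [Electronics and Telecommunications: Communication and Information Systems]; TP37 [Automation and Computer Technology: Computer System Architecture]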
