基于播音员识别的新闻视频故事分割方法被引量：4

Segmentation method of news video stories based on announcer identification

下载PDF

导出

摘要新闻视频的语义单元分割是基于内容的新闻视频检索和情报挖掘的重要步骤,受到众多研究者的关注。提出了一种基于播音员识别的新闻视频故事单分割的新方法,首先从新闻节目中提取各播音员的声学感知特征的作为其声纹,训练出其相应的混合高斯模型(GMM),并采用KL差异法从视频镜头中探测出各播音员和非播音员音频镜头,最后结合视频字幕帧事件和新闻节目特殊的结构知识对新闻节目进行故事单元分割。在2个多小时的CCTV和CNN新闻视频实验中获得96.02%查准率和92.58%的查全率。 As an important step of content based news video retrieving and information mining,semantic unit segmentation has attracted many researchers＇ interests.This paper focuses on a new method of news video stories segmentation which is based on the announcer identification.Firstly,the voiceprints including acoustic perception characteristics of each announcer are extracted, and their Gaussian mixture models are trained,then the audio shots of announcer and not-announcer are detected by the KL divergence method,at last the unit segmenting is carried on under the guidance of video topic caption frames and special structure knowledge of news program.Finally the 92.58% recall and the 96.02% precision are achieved during more than 2 hours＇ experiment.

作者徐新文李国辉甘亚莉

机构地区国防科技大学信息系统与管理学院

出处《计算机工程与应用》 CSCD 北大核心 2008年第19期4-7,共4页 Computer Engineering and Applications

基金国家自然科学基金(the National Natural Science Foundation of China under Grant No.60243006) 国家教育部博士点基金(No.20069998022)

关键词播音员声纹故事单元分割高斯混合模型新闻视频 voiceprint story unit segmentation Gaussian mixture model news video

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献9

1Guide lines for the TRECVID 2004 evaluation[EB/OL].(2005-02-17). http ://www-nlpir.nist.gov/proj ects/tv2004/tv2004.html#2.2.
2Hoashi K,Sugano M,Naito M,et al.Shot boundary determination on MPEG compressed domain and story segmentation experiments for TRECVID 2004[C]//TREC Video Retrieval Evaluation Foruin,2004.
3Hsu W,Chang S F.Generative,discriminative,and ensemble learning on multi-model perceptual fusion toward news video story segmentation[C]//International Conference on Multimedia and Expo,2004.
4Chaisorn L,Chua T S,Lee C H.The segmentation of news video into story units[C]//International Conference on Multimedia and Expo, 2002.
5Zhai Y,Yillnaz A,Shah M.Story segmentation in news videos using visual and text cues[C]//Leow W K.LNCS 3568:CIVR 2005,2005: 92-102.
6Rabiner L, Juang B H.Fundamentals of speech recognition[M].Englewood Cliffs, NJ: Prentice Hall, 1993.
7Reynolds D A,Rose R C.Robust text-independent speaker identification using Gaussian mixture speaker models[J].IEEE Trans Speech and Audio ProceSsing, 1995,3 ( 1 ) : 72-83.
8Reynolds D A.Speaker identification and verification using Gaussian mixture speaker models[J],Speech Communication, 1995,17 (1/2) : 91-108.
9Cover T M,Tomas J A.Elements of information theory[M],USA: John Wiley & Sons, 1991:18-19.

同被引文献33

1杨玉莲,谢磊.基于子词链的中文新闻广播故事自动分割[J].计算机应用研究,2009,26(2):583-586. 被引量：2
2谢毓湘,栾悉道,吴玲达,老松杨.NVPS：一个多模态的新闻视频处理系统[J].情报学报,2004,23(4):404-409. 被引量：5
3刘群,张华平,俞鸿魁,程学旗.基于层叠隐马模型的汉语词法分析[J].计算机研究与发展,2004,41(8):1421-1429. 被引量：197
4史迎春,王韬,周献中.基于语义的新闻视频检索研究[J].计算机工程,2004,30(16):155-157. 被引量：7
5刘华咏.基于音视频特征和文字信息自动分段新闻故事[J].系统仿真学报,2004,16(11):2608-2610. 被引量：8
6姚喜双.播音员、节目主持人的语言评价[J].语言文字应用,2005(2):2-13. 被引量：21
7傅间莲,陈群秀.自动文摘系统中的主题划分问题研究[J].中文信息学报,2005,19(6):28-35. 被引量：13
8刘文萍,李也白,张常年.视频镜头边缘检测技术[J].计算机工程与应用,2006,42(21):17-20. 被引量：10
9Janvier B,Bruno E,Marchand-Maillet S,et al.Performance Evaluation of a Contextual News Story Segmentation Algorithm[C]∥Proc of Int’l Conf on Multimedia Content Analysis,Management,and Retrieval,2006:60730X-1-60730X-10.
10Chaisorn L,Chua T S,Lee C H.The Segmentation of News Video into Story Units[C]∥Proc of Int’l Conf on Multimedia and Expo,2002:73-76.

引证文献4

1王国营,寇红召,李涛.一种多模态融合新闻视频条目分割算法[J].计算机工程与科学,2011,33(6):46-50. 被引量：1
2余骁捷,吴及,孔繁庭,李树森.多信息融合的新闻节目主题划分方法[J].中文信息学报,2012,26(2):121-127.
3买迪娜.马合木提.语义信息缺失下的新闻视频检索系统研究[J].计算机与网络,2017,43(6):73-75.
4孙奇.播音员主持人的情感方式与表达技巧探析[J].卫星电视与宽带多媒体,2022(14):160-161.

二级引证文献1

1黄锋,易嘉闻,吴健辉,何伟,李武劲,欧先锋.光流法和显著性相结合的动态背景下运动目标检测方法[J].成都工业学院学报,2020,23(1):13-18. 被引量：2

1冀中,张春田,苏育挺.新闻视频故事单元分割技术综述[J].中国图象图形学报,2007,12(11):1952-1960. 被引量：9
2吴永辉.网上招个播音员[J].计算机应用文摘,2006(18):108-108.
3屈洁,封化民.新闻视频播音员的检测与跟踪[J].北京电子科技学院学报,2009,17(4):1-9. 被引量：1
4倪早菊.记忆超人[J].奇闻怪事,2009(6):25-25.
5LabVIEW 8.20-20 Years of Innovation[J].国外电子测量技术,2006,25(11):81-84.
6王道才.让电脑成为定时播音员[J].电脑知识与技术（经验技巧）,2009(2):32-34.
7刘翔翔,杨勇平,吉鹏飞.视频实验教学系统实现与应用[J].办公自动化（综合月刊）,2011(12):14-16. 被引量：2
8王欣欣.哪种语言最有效率[J].科技新时代,2011(10):29-29.
9刘嘉琦,封化民,闫建鹏.基于多模态特征融合的新闻故事单元分割[J].计算机工程,2012,38(24):161-165. 被引量：8
10李雅灵.县级播音员的多重角色[J].创新科技,2011(9):58-58.

计算机工程与应用

2008年第19期

浏览历史

内容加载中请稍等...

基于播音员识别的新闻视频故事分割方法被引量：4

参考文献9

同被引文献33

引证文献4

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于播音员识别的新闻视频故事分割方法 被引量：4

参考文献9

同被引文献33

引证文献4

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于播音员识别的新闻视频故事分割方法被引量：4