期刊文献+

新闻故事中的关键说话人发现方法 被引量:1

Method of key speaker discovery in news story
下载PDF
导出
摘要 为了发现新闻故事中的关键说话人,用以提高多媒体检索效率,在说话人索引的基础上,提出了关键人发现方法:根据新闻故事中说话人的特点,基于说话人频率、说话人持续时间、平均每次说话人时长和说话人位置因子4个因素,综合定义了说话人关键度,用以判断说话人的重要性,把每个新闻故事中说话人关键度最大的人作为关键说话人。实验结果表明,该种算法可以找到故事中绝大部分的关键说话人,验证了该算法的有效性和可行性。 To solve the problem of key speaker discovery in News story and improve the efficiency of multimedia retrieval,on the basis of speaker index,the method of key speaker discovery is proposed.Speaker key is synthetically defined by speaker freque-ncy,speaker duration,average every time speaker length and speaker position factor,which is used to judge the speaker's importance and the biggest speaker key is regard as key speaker in every story.The experimental result shows that can find most key speaker in story and feasibility and effectiveness of the algorithm is demonstrated.
出处 《计算机工程与设计》 CSCD 北大核心 2012年第6期2353-2357,共5页 Computer Engineering and Design
基金 国家自然科学基金项目(61101160) 广东省自然科学博士启动基金项目(10451064101004651) 中央高校基本科研业务费专项基金项目(2011ZM0029)
关键词 新闻故事 关键说话人 多媒体检索 主要角色 说话人关键度 news story key character multimedia retrieval key character speaker key
  • 相关文献

参考文献10

  • 1陈予琳.关键词检索方法在科技查新中的应用研究[J].河南师范大学学报(自然科学版),2011,39(3):171-173. 被引量:16
  • 2LU L, Hanjalic. Towards optimal audio keywords detection for audio content analysis and discovery [C]. 14th Annual ACM International Conference on Multimedia, 2006: 825-834.
  • 3LU L, Hanjalic A. Audio keywords discovery for text-like audio content analysis andretrieval [J]. IEEE Transactions on Multi- media, 2008, 10 (1): 74-85.
  • 4Vijayasenan D, Valente F. An irfforamtion theoretic approach to speaker diafization of meeting data [J]. IEEE Transactions on Au- dio Speech and Lagoage Processing, 2009, 17 (7) : 1382-1393.
  • 5Barras C. ZHU Xuan. Multistage speaker diarization of broad- cast news [J]. IEEE Transactions on Audio Speech and Lan- guage Processing, 2006, 14 (5): 1505-1512.
  • 6HANK J, KIM S. Strategies to improve the robustness of ag- glomerative hierchical clustering under data source variation for speaker diarization [J]. IEEE Transactions on Audio Speech and LanguageProcessing , 2008, 16 (8): 1590-1601.
  • 7Friedlan G, Vinyals O. Prosodic and other long-term features for speaker diarization [J]. IEEE Transactions on Audio Speech and Language Processing, 2009, 17 (5): 985-993.
  • 8Nishida M, Kawahara T. Speaker model selection based on the 1Myessian information criterion applied to unsupervised speaker indexing [J]. IEEE Transactions on Speech and Audio processing, 2005, 13 (4): 583-592.
  • 9CHOU S M, TANG Hao, HUANG Thomas. Fishervoice and semi-supervised speaker clustering [C]. IEEE International Conference on Acoustics Speech and Signal Processing, 2009: 4089-4092.
  • 10杨继臣,贺前华,李艳雄,王伟凝.一种两步判决的说话人分割算法[J].电子与信息学报,2010,32(8):2006-2009. 被引量:7

二级参考文献16

  • 1李育嫦.文献检索中提高查全率与查准率的方法探讨[J].图书馆学研究,2002(11):92-93. 被引量:26
  • 2张帆,朱红涛.基于关键词的网络信息检索优化探索[J].情报科学,2005,23(6):912-916. 被引量:11
  • 3李育嫦.自然语言检索中的词汇控制研究[J].图书馆学研究,2006(4):75-78. 被引量:8
  • 4江惜春.分类与主题结合有效兼顾查新的查全率和查准率[J].农业图书情报学刊,2006,18(10):106-107. 被引量:3
  • 5Sinha R, Tranter S E, Gales M J F, and Woodland P C. The cambridge university March 2005 speaker diarisation system. In proceeding of the European Conference Speech Communication and Technology. Lisbon, Portugal, 2005: 2437-2440.
  • 6Kotti M, Benetos E, and Kotropoulos C. Computationally efficient and robust BIC-Based speaker segmentation [J]. IEEE Transactions on Speech and Audio Processing, 2008, 16(5): 920-933.
  • 7Chen S and Gopalakrishnan P S. Speaker, environment and channel change detection and clustering via the Bayesian information criterion. Proc. DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, VA Feb. 1998: 127-132.
  • 8E1-Khoury E, Senac C, and Pinquier J. Improved speaker diarization system for meetings. In ICASSP2009, Taipei, April, 2009: 4097-4100.
  • 9Christoph Boehm and Franz pernkopf. Effective metric-based speaker segmentation in the frequency domain. In ICASSP2009, Taipei, April 2009: 4081-4084.
  • 10Kwon S and Naxayanan S. Unsupervised speaker indexing using generic models [J]. IEEE Transactions on Speech and Audio Processing, 2005, 13(5): 1004-1013.

共引文献21

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部