期刊文献+

层次化话题发现与跟踪方法及系统实现 被引量:11

Hierarchical Topic Detection and Tracking and Implementation of System
下载PDF
导出
摘要 自1996年话题发现与跟踪评测启动以来,该研究受到普遍关注,取得巨大进步,也遇到诸多困难。通过分析大量话题数据,提出层次化话题与层次聚类的区别在于话题的层次是由事件的构成决定的,层次化话题应当分为三层,即微类、中类和上类。原因在于计算机自动分析产生的层次化话题必须与现实世界有客观的联系。据此提出一个面向大规模真实数据的有充分理论依据的层次化话题发现与跟踪方法,并在集群系统上予以实现。 Since 1996,topic detection and tracking has obtained extensive attention and has encountered great challenge when making great progress. By analyzing mass data, the differences between hierarchical topic and hierarchical clustering are firstly proposed, which should be decided by the construction of event and be represented as three layers, for hierarchical topic produced by computer automatically has external relation with the real world. Then an algorithm for hierarchical topic detection and tracking that can process large-scale data are proposed and implemented on our clusters computer.
出处 《广西师范大学学报(自然科学版)》 CAS 北大核心 2007年第2期157-160,共4页 Journal of Guangxi Normal University:Natural Science Edition
基金 国家863计划资助项目(2005AA147030) 国家242信息安全计划资助项目(2005A37) 北京市教育委员会科技发展计划面上项目(KM200600006002)
关键词 话题发现与跟踪 层次化话题识别 层次化话题跟踪 多层聚类 事件结构 topic detection and tracking hierarchical topic detection hierarchical topic tracking multilayered clustering event structure
  • 相关文献

参考文献5

  • 1FISCUS J G,DODDINGTON G R.Topic detection and tracking evaluation overview[C]//Allan J.Topic Detection and Tracking:Event-based Information Organization.Boston:Kluwer Academic Publishers,2002:17-31.
  • 2于满泉,骆卫华,许洪波,白硕.话题识别与跟踪中的层次化话题识别技术研究[J].计算机研究与发展,2006,43(3):489-495. 被引量:49
  • 3王会珍,朱靖波,季铎,叶娜,张斌.基于反馈学习自适应的中文话题追踪[J].中文信息学报,2006,20(3):92-98. 被引量:17
  • 4邱立坤,程葳.面向BBS的话题挖掘初探[C]//自然语言理解与大规模内容计算.北京:清华大学出版社,2005:401-407.
  • 5QU Wei-guang,CHEN Xiao-he,DONG Yu,et al.Chinese WSD based on context calculation model[J].Journal of Guangxi Normal University:Natural Science Edition,2006,24(4):179-182.

二级参考文献23

  • 1T, Brants, F, R, Chen, A, O, Farahat. A system for new event detection. In: Proc, SIGIR 2003, the 26th Annual lnt'l ACM SIGIR Conf. Research and Development in Information Retrieval.New York: ACM Press, 2003. 330-337.
  • 2R. Swan, J. Allan. Automatic generation of overview timelines.ACM SIGIR, Research and Development in Information Retrieval, Athans, Greece, 2000.
  • 3F. Fukumoto, Y. Suzuki. Event tracking based on domain dependency. ACM SIGIR, Research and Development in Information Retrieval, Athans, Greece, 2000.
  • 4David A. Smith. Detecting and browsing events in unstructured text. The 25th Annual ACM SIGIR Conf., Finland, 2002.
  • 5R. Papka. On-line new event detection, clustering and tracking:[Ph, D. dissertation]. Massachusetts: Department of Computer Science, University of Massachusetts, 1999.
  • 6Ying-Ju Chen, Hsin Hsi. NLP and IR approaches to monolingual and multilingual link detection, The 19th Int'l Conf.Computational Linguistics, Taipei, Taiwan, 2002.
  • 7J. Allan, Ao Feng, Alvaro Bolivar, Flexible intrinsic evaluation of hierarchical clustering for TDT. The 12th ACM Int'l Conf.Information and Knowledge Management (CIKM 2003 ),Louisiana, USA, 2003.
  • 8J, Allan, Topic Detection and Tracking; Event-Based Information Retrieval, Norvell, MA, USA; Kluwer Aeademic Publishers,2002.
  • 9NIST. The 2004 topic detection and tracking (TDT 2004) task definition and evaluation plan. Version 1.2, National Institute of Suandards and Technology, Teeh. Rep., 2004.
  • 10Dolf Trieschnigg, Wessel Kraaij. TNO Hierarchical topic detection report at TDT 2004. The 7th Topic Detection and Tracking Conf.(TDT2004), Gaithersbury, USA, 2004.

共引文献69

同被引文献160

引证文献11

二级引证文献60

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部