期刊文献+

面向互联网新闻的话题检测与追踪 被引量:5

Topic Detection and Tracking for News Web Pages
下载PDF
导出
摘要 为了更加准确有效地从海量的互联网网页中获取感兴趣的信息,设计并实现了一个面向互联网新闻的话题追踪与检测系统,并在该系统的基础上提出了面向海量互联网网页时话题检测中聚类算法选择策略以及一个基于多重特征的话题追踪模型,该模型能够很好地区分相似与相同的话题,并且话题追踪正确率达到了85.7%,实验结果表明文中系统能够有效地检测和追踪互联网上的话题。 In order to get information from the internet more quickly and accurately, this paper designs and implements a topic detection and tracking system over a vast amount of web pages. On the basis of the system it also proposes the algorithm selection strategy for topic detection in the intemet and a topic tracking model based on multiple features. The topic tracking model can validly distinguish the same topic from the similar topic, and the accuracy of topic tracking achieves 85.7%. The experimental results show that the system in the paper has good performance of topic detection and tracking.
作者 许志凯 徐志明 李栋 李生 XU Zhikai, XU Zhiming, LI Dong, LI Sheng (School of Computer Science & Technology, Harbin Institute of Technology, Harbin, 150001, China)
出处 《智能计算机与应用》 2011年第1X期59-61,65,共4页 Intelligent Computer and Applications
基金 基金项目:国家自然科学基金资助项目(60736044).
关键词 话题追踪与检测 聚类算法 向量空间模型 相似度计算 Topic Detection and Tracking Clustering Algorithm Vector Space Model Similarity Calculation
  • 相关文献

参考文献4

二级参考文献79

共引文献207

同被引文献38

  • 1YE Hui-min,CHENG Wei,DAI Guan-zhong.Design and Implementation of On-Line Hot Topic Discovery Model[J].Wuhan University Journal of Natural Sciences,2006,11(1):21-26. 被引量:14
  • 2LI H, WEI J. Netnews bursty hot topic detection based on bursty features[ C]//Proceedings of the 2010 International Conference on E-Business and E-Govement. Washington, DC: IEEE Computer So- ciety, 2010:1437 - 1440.
  • 3HOLZ F, TERESNIAK S. Towards automatic detection and tracking of topic change[ M]. Berlin: Springer-Verlag, 2010:327 -339.
  • 4QIANKUN Z, MITRA P, BI C. Temporal and information flow based event detection from social text streams[ C]//Proceedings of the 22nd National Conference on Artificial intelligence. Vancouver: AAAI, 2007:1501 - 1506.
  • 5ALLAN J, PAPKA R, LAVERENKO V. On-line new event detection and tracking [ C ]// Proceedings of the 21st Annual International ACM S1GIR Conference on Research and Development. New York: ACM, 1998:37 -45.
  • 6YANG Y, PIERCE T, CARBONELL J. A study of retrospective and online event detection [ C ]// Proceedings of the 21 st International ACM SIGIR Conference on Research and Development in Informa- tion. New York: ACM, 1998:28 -36.
  • 7BLEI D M, NG A Y, JORDAN M I. Latent Dirichlet allocation[ J]. Journal of Machine Learning Research, 2003, 3(5): 993-1022.
  • 8SALTON G. The SMART retrieval system experiments in automatic document processing[ M]. Englewood Cliffs, New Jersey: Prentice Hall Inc, 1971:337 -354.
  • 9CHENG X, MIAO D, WANG C, et al. Coupled term-term relation analysis for document clustering[ C]//Proceedings of the 2013 In- ternational Joint Conference on Neural Networks. Piscataway: IEEE, 2013:1 -8.
  • 10Rada Mihalcea,Paul Tarau. TextRank:Bringing Orderinto Texts[A].2004.404-411.

引证文献5

二级引证文献19

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部