
Dynamic evolvement-oriented topic detection research (面向动态演化的话题检测研究)

Cited by: 17
Abstract: Inspired by the CURE clustering algorithm and based on an analysis of how topics evolve over time, the authors propose a double-centroid topic model oriented toward dynamic evolvement, intended to reduce the effect of topic evolution on topic detection. The model dynamically establishes a division point and represents a topic by two centroids split at that point: the initial centroid and the current centroid. The initial centroid represents what the topic focused on before the division point; the current centroid represents what it has focused on from the division point to the current time. Two methods for determining the division point are proposed, one based on time and one based on term distribution density. The paper describes in detail how the division point, the initial centroid, and the current centroid are created and updated. Finally, it examines an English topic detection algorithm built on the double-centroid topic model, and experiments demonstrate the algorithm's effectiveness.
Source: Chinese High Technology Letters (《高技术通讯》), 2006, No. 12, pp. 1230-1235 (6 pages). Indexed in: CAS, CSCD, PKU Core (北大核心).
Funding: Supported by the Key Program of the National Natural Science Foundation of China (60435020) and the 863 Program (2004AA117010-08).
Keywords: topic detection, dynamic evolvement, double centroids, division point, distribution density
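
The abstract describes the double-centroid model only in outline, so the following is a minimal sketch of the idea rather than the authors' algorithm. Everything concrete in it is an assumption made for illustration: the names `DoubleCentroidTopic` and `detect`, raw term-frequency vectors, cosine similarity, scoring a story against whichever centroid is closer, a fixed time `window` as the time-based division rule, and the `threshold` value. The density-based division rule is not sketched.

```python
from collections import Counter
from dataclasses import dataclass, field
from math import sqrt


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm_a = sqrt(sum(w * w for w in a.values()))
    norm_b = sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


@dataclass
class DoubleCentroidTopic:
    """A topic held as two term-frequency centroids split at a division point."""
    window: float                 # hypothetical: time span before the division point moves
    division_time: float = 0.0    # timestamp of the current division point
    initial: Counter = field(default_factory=Counter)  # content before the division point
    current: Counter = field(default_factory=Counter)  # content since the division point

    def similarity(self, story: Counter) -> float:
        # Assumption: a story is scored against whichever centroid is closer.
        return max(cosine(story, self.initial), cosine(story, self.current))

    def add(self, story: Counter, timestamp: float) -> None:
        if not self.initial:
            # The first story seeds the initial centroid and fixes the division point.
            self.initial.update(story)
            self.division_time = timestamp
            return
        if timestamp - self.division_time > self.window:
            # Move the division point forward: everything seen so far now lies
            # before it, so the current centroid is folded into the initial one.
            self.initial.update(self.current)
            self.current = Counter()
            self.division_time = timestamp
        self.current.update(story)


def detect(stories, threshold=0.2, window=5.0):
    """Assign each (timestamp, text) story to an existing topic or open a new one."""
    topics, labels = [], []
    for timestamp, text in stories:
        vec = Counter(text.lower().split())  # whitespace tokens, English text assumed
        best, best_sim = None, 0.0
        for i, topic in enumerate(topics):
            sim = topic.similarity(vec)
            if sim > best_sim:
                best, best_sim = i, sim
        if best is None or best_sim < threshold:
            topics.append(DoubleCentroidTopic(window=window))
            best = len(topics) - 1
        topics[best].add(vec, timestamp)
        labels.append(best)
    return labels, topics
```

Calling `detect` with a time-ordered list of `(timestamp, text)` pairs returns one topic index per story together with the topic objects. Keeping two centroids lets a late story match the topic's recent vocabulary even when it shares few terms with the stories that opened the topic.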

References (8)

  • 1 The 2003 topic detection and tracking task definition and evaluation plan. http://www.nist.gov/speech/tests/tdt/tdt2003/evalplan.htm, April 2003.
  • 2 Makkonen J. Investigations on event evolution in TDT. In: Proceedings of the Student Workshop of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, Edmonton, Canada, 2003, 43-48.
  • 3 Nallapati R, Feng A, Peng F C. Event threading within news topics. In: Proceedings of the International Conference on Information and Knowledge Management, Washington, 2004, 446-453.
  • 4 Wu Pingbo, Chen Qunxiu, Ma Liang. Research on intelligent retrieval of event-relevant documents based on event frames [J]. Journal of Chinese Information Processing (中文信息学报), 2003, 17(6): 25-30. Cited by: 30
  • 5 Jia Ziyan, He Qing, Zhang Haijun, Li Jiayou, Shi Zhongzhi. An event detection and tracking algorithm based on a dynamic evolution model [J]. Journal of Computer Research and Development (计算机研究与发展), 2004, 41(7): 1273-1280. Cited by: 58
  • 6 Guha S, Rastogi R, Shim K. CURE: an efficient clustering algorithm for large databases. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, Seattle, 1998, 73-84.
  • 7 Papka R. On-line new event detection, clustering, and tracking [PhD thesis]. Department of Computer Science, University of Massachusetts, 1999.
  • 8 Allan J, Papka R, Lavrenko V. On-line new event detection and tracking. In: Proceedings of the 21st ACM-SIGIR International Conference on Research and Development in Information Retrieval, Australia, August 1998, 37-45.
