期刊文献+

在线新事件检测系统中的性能提升策略 被引量:3

Performance Improvement Strategy in Online New Event Detection System
下载PDF
导出
摘要 现有的关于在线新事件检测(ONED)系统的研究更多地关注如何提高检测的准确率而很少考虑对资源的利用率,使ONED系统在实际应用中存在性能低下的问题。该文分析了传统的事件检测系统存在的性能上的缺点,并在此基础上进行了改进,在基本不降低识别正确率的基础上,通过合理设定技术参数以及对链表索引机制进行预筛选,降低了文档比较过程中的存储和计算开销。实验结果表明,改进的系统提升了检测性能。 The existing studies focus a lot on the detection accuracy without considering efficiency, which leads a low performance in practical Online New Event Detection(ONED) domain. This paper analyzes the traditional ONED systems about the low performance limitation, and proposes an improved framework. A lot of storage and calculation overhead can be degraded based on the techniques of setting reasonable parameters and pre-filtering index linking tables, without decreasing detection accuracy. Experimental results demonstrate the enhanced performance of the ONED system.
出处 《计算机工程》 CAS CSCD 北大核心 2008年第15期72-74,共3页 Computer Engineering
基金 上海市青年科技启明星计划基金资助项目(051430)
关键词 在线新事件检测 话题识别与跟踪 信息检索 预筛选 Online New Event Detection(ONED) Topic Detection and Tracking(TDT) Information retrieval pre-filtering
  • 相关文献

参考文献4

  • 1Allan J, Papka, R Lavrenko V. On-line New Event Detection and Tracking[C]//Proc. of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. [S. l.]: ACM Press, 1998: 37-45.
  • 2Allan J, Lavrenko V, Jin H. First Story Detection in TDT is Hard[C] //Proc. of the 9th International Conference on Information and Knowledge Management. New York, USA: ACM Press, 2000: 3-5.
  • 3Braun R, Kaneshiro R. Exploiting Topic Pragmatics for New Event Detection in TDT-2004[C]//Proc. of Topic Detection and Tracking Workshop. [S. l.]: ACM Press, 2004.
  • 4Luo Gang, Tang Chunqiang, Philip S Y. Resource-adaptive Real-time New Event Detection[C]//Proc. of ACM SIGMODPODS'07. Bei.jing, China: ACM Press, 2007: 3-4.

同被引文献53

  • 1姜吉发.一种跨语句汉语事件信息抽取方法[J].计算机工程,2005,31(2):27-29. 被引量:12
  • 2周俊生,戴新宇,尹存燕,陈家骏.基于层叠条件随机场模型的中文机构名自动识别[J].电子学报,2006,34(5):804-809. 被引量:112
  • 3刘务华,罗铁坚,王文杰.文本聚类算法的质量评价[J].中国科学院研究生院学报,2006,23(5):640-646. 被引量:7
  • 4ZHANG Kuo,ZI Juan, WU Li-lang. New event detection based on in-dexing-tree and named entity [C] //Proc of the 30th Annual Interna-tional ACM SIGIR Conference on Research and Development in Infor-mation on Retrieval. New York: ACM Press,2007 ;215-222.
  • 5ALSUMAIT L S. Online topic detection,tracking and significanceranking using generative topic models [M]. Fairfax : Georage MasonUniversity, 2009.
  • 6SALTON G,BUCKLEY B. Term-weighting approach in automatictext retrieval [J]. Information Processing and Management,1988,24(5): 513-523.
  • 7LAFFERTY J, McCALLUM A, PEREIRA F. Conditional randomfields : probabilistic models for segmenting and labeling sequence data[C]//Proc .of the 18th International Conference on Machine Lear-ning. San Francisco : Morgan Kaufmann Publishers, 2001 : 282-289.
  • 8NOCEDAL J. Updating quasi Newton matrices with limited storage[J]. Mathematics of Computation,1980,35( 151) :773-782.
  • 9ALLAN J, FENG Ao, BOLIVAR A. Flexible intrintic of evaluationhierarchical clustering for TDT [C] //Proc of the 12th InternationalConference on Information and Knowledge Management. 2003 : 263-270.
  • 10ALLAN J(CARBONELL J,DODDINGTON J,et al Topic detectionand tracking pilot study final report [C] //Proc of DARPA BroadcastNews Transcription and Understanding Workshop. San Francisco :Morgan Kaufmann Publishers, 1998 ; 194-218.

引证文献3

二级引证文献7

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部