期刊文献+

一种基于PDBMLCA聚类的网络突发事件发现算法

An algorithm for discovering bursty events based on PDBMLCA clustering
下载PDF
导出
摘要 针对目前网络信息爆炸式发展的状况下,需要及时了解和掌握网上重要信息及追踪网络事件进展,给出了一种突发事件发现算法.该算法通过引入文本词语的突发度量值,考虑位置对词语权重影响等因素,提高了计算权重值的准确度.根据基于预设密度的最大链路算法,在平均半径的范围内,满足一定条件的文本集合连成一条链路,进而形成一个类簇相似的文本以类簇为类.该聚类算法在结合突发值及位置影响等因素下,能够合理的划分一段时期内的文本并归属相应的主题.实验结果表明,该算法在发现突发事件中有较好的效果. Nowadays,for the big growth of the information on network,and the need of the grasp and track the important information or events online,we develop an algorithm for discovering bursty events.The algorithm can improve the accuracy of the calculation of weight values through the introduction of burst value of text words and the consideration of the position impact for word weights.Within the range of the average radius,the texts even extend to a link under certain conditions based on maximum link preset density algorithm.Combined with factors of burst value and position impact,this clustering algorithm can divide the texts properly in a period and attribute to the appropriate topics.The experimental results show that the algorithm has a good effect in discovering bursty events.
出处 《北京交通大学学报》 CAS CSCD 北大核心 2013年第2期63-67,共5页 JOURNAL OF BEIJING JIAOTONG UNIVERSITY
基金 国家自然科学基金资助项目(60972012,61271308) 北京市自然科学基金资助项目(4112045) 高等学校博士学科点专项科研基金资助(W11C100030) 北京市科技计划资助项目(Z121100000312024)
关键词 自然语言 聚类 突发事件 权重 natural language clustering bursty event weighting
  • 相关文献

参考文献8

  • 1Brants T, Chen F,Farahat A. A system for new event de-tection[C]//Proceedings of the 26th Annual InternationalACM SIGIR Conference on Research and Development inInformaion Retrieval, ACM New York, USA, 2003: 330-337.
  • 2Corso D G, Gulli A, Romani F. Ranking a stream of news[C]// Proceedings of the 14th International Conference onWorld Wide Web, ACM New York, USA,2005: 97 —106.
  • 3Holz F, Teresniak S. Towards automatic detection andtracking of topic change[j] ? Computational Linguistics andIntelligent Text Processing, 2010,6008: 327 — 339.
  • 4Yao J, Cui B, Huang Y. Bursty event detection from col-laborative t?s[j]. World Wide Web Internet and Web In-formation Systems, 2012,15(2) : 171 - 195.
  • 5Zhang C, Fan X,Chen X. Hot topic detection on chineseshort text[J]. Communications in Computer and Informa-tion Science, 2011,176: 207 — 212.
  • 6Sudhamathy G, Venkateswaran C. Web log clustering ap-proaches a survey [ J ]. International Journal on ComputerScience and Engineering, 2011,3(7) :2896 — 2902.
  • 7Xu G,Zhang Y, Li L. Web mining and social network-ing: techniques and applications [ M]. Berlin: Springer,2010: 127-156.
  • 8Lee S,Lee S, Kim K. Bursty event detection from textstreams for disaster management[C]// Proceedings of the21st International Conference Companion on World WideWeb, ACM, New York, USA,2012: 679-682.

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部