期刊文献+

基于短文本信息流的回顾式话题识别模型 被引量:3

Retrospective Topic Identification Model for Short Text Information Flow
下载PDF
导出
摘要 近几年来,短文本信息流广泛应用于一些全民媒体,它在公开传递信息同时携带了丰富且具有极大价值的信息资源。该文提出了一种回顾式话题识别模型,改进了权值计算方法,有效提取了具有较强分辨话题能力的关键词,在聚类过程中将BIC值作为话题类别合并依据,提高了聚类的准确率。通过进行时间段分隔和去掉孤立点信息提高了算法的效率。实验结果表明,该方法有效地提高了短文本信息流的话题检测准确率和效率。 In recent years, the short text information flow has occured in some public media. For this kind of data, a retrospective topic identification model is presented with an improved weight estimation. It employes the value of BIC for clustering to improve the clustering accuracy. By dividing the time segments and removing isolated information point, the efficiency of the algorithm is further improved. The experimental results show that this method achieves good accuracy and efficiency in the topic detection of the short text information flow.
出处 《中文信息学报》 CSCD 北大核心 2015年第1期111-117,132,共8页 Journal of Chinese Information Processing
基金 河北省科技支撑计划项目(10213581) 淮安市社会发展项目(HASZ2012046) 淮安市科技支撑计划(工业)项目(HAG2012086)
关键词 短文本 信息流 话题识别 聚类 short text information flow topic identification clustering
  • 相关文献

参考文献16

  • 1Wang ZM, Zhou XS. A topic detection method based on bicharacteristic vectors [C]//Proceedings of the Int'l Conf. on Networks Security, Wireless Communi- cations and Trusted Computing. Vol. 2. Washington: IEEE Computer Society, 2009. 683-687.
  • 2Allan J, Papka R. On-line new event detection and traeking[C]//Proeeedings of the 21 st Annum Interna- tional ACM SIGIR Conference on Research and Devel-opment in Information Retrieval. Melbourne: ACM Press, 1998.37-45.
  • 3赵华,赵铁军,张姝,王浩畅.基于内容分析的话题检测研究[J].哈尔滨工业大学学报,2006,38(10):1740-1743. 被引量:20
  • 4Seo YW, Sycara K. Text clustering for topic detection [C]//Proceedings of the Pittsburgh: Robotics Institu- te, Carnegie Mellon University, 2004. 1-11.
  • 5骆卫华,于满泉,许洪波,王斌,程学旗.基于多策略优化的分治多层聚类算法的话题发现研究[J].中文信息学报,2006,20(1):29-36. 被引量:38
  • 6Sakaki Ti, Okazzki M, Matsuo Y. Earthquake Shakes Twitter User.. Real-time Event Detection Detection by Social Sensors[C]//Proceedings of the 19th Interna- tional Conference on World Wide Web, 2010. Raleigh, North Carolina .. ACM Press, 2010 : 851-861.
  • 7Petrovi S, Osborne M, Lavrenko V. Streaming First Story Detection with application to Twitter[C]//Pro- ceedings of HLTNAACL, 2010. stroudsburg, PA, USA: Association for Computational Linguistics, 2010 .. 181-189.
  • 8Liu Zitao, Yu Wenchao, Chen Wei, et al. Short Text Feature Selection for Micro-blog Mining[C]//Compu- tational Intelligence and Softeare Engineering, 2010. Wuhan, China..Wuhan Unive- sity, 2010 : 1-4.
  • 9Pelleg D, Moore A. X-means: Extending K-means with Efficient Estimation of the Number of Clusters [C]//Proceedings 17th ICML. Stanford University. 2000. 727 -734.
  • 10张小明,李舟军,巢文涵.基于增量型聚类的自动话题检测研究[J].软件学报,2012,23(6):1578-1587. 被引量:23

二级参考文献42

共引文献112

同被引文献23

引证文献3

二级引证文献26

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部