期刊文献+

基于语义相似度的论坛话题追踪方法 被引量:22

Method for BBS topic tracking based on semantic similarity
下载PDF
导出
摘要 现有的话题追踪方法大多面向新闻数据,将其应用于论坛时效果不够理想。结合论坛的特点,提出一种基于语义相似度的论坛话题追踪方法。该方法首先通过构建话题和帖子的关键词表建立其文本表示模型,然后利用知网计算两个关键词表的语义相似度并以此作为帖子与话题的相关程度,最后根据相关程度实现论坛话题追踪。该方法较好地避免了向量空间模型的缺陷。实验表明,该方法能比较有效地解决面向论坛的话题追踪问题。 To study the BBS topic tracking, the paper discovered that most of the traditional methods of topic tracking deal with news reports, and they are not suitable when they are applied to BBS. The paper utilized the characteristics of BBS and presented a topic tracking method for BBS data based on semantic similarity. This method firstly constructed keywords tables of topic and post as their representation models, and then computed the two tables' semantic similarity with the help of HowNet which is served as correlation degree between post and topic. Finally, this method used the correlation degree to realize BBS-oriented topic tracking. This method effectively avoids the disadvantage of Vector Space Model ( VSM). The experimental results show that this method can solve the problem of BBS-oriented topic tracking effectively.
出处 《计算机应用》 CSCD 北大核心 2011年第1期93-96,共4页 journal of Computer Applications
基金 国家863计划项目(2007AA01Z439)
关键词 话题追踪 论坛 关键词 语义相似度 向量空间模型 topic tracking BBS key word semantic similarity Vector Space Model (VSM)
  • 相关文献

参考文献9

  • 1MAKKONEN J. Semantic classes in topic detection and tracking [D]. Helsinki: Department of Computer Science, University of Helsinki, 2009.
  • 2焦健,瞿有利.知网的话题更新与跟踪算法研究[J].北京交通大学学报,2009,33(5):132-136. 被引量:10
  • 3ZHENG W, ZHANG Y, HONG Y, et al. Topic tracking based on keywords dependency profile[ C]// Proceedings of the 4th Asia Information Retrieval Symposium. Berlin: Springer-Verlag, 2008: 129 - 140.
  • 4任晓东,张永奎,薛晓飞.基于K-Modes聚类的自适应话题追踪技术[J].计算机工程,2009,35(9):222-224. 被引量:13
  • 5林鸿飞,宋丹,杨志豪.基于语义框架的话题追踪方法[C]//中国中文信息学会二十五周年学术会议.北京:清华大学出版社,2006:383-384.
  • 6ZHU MINGLIANG, HU WEIMING, WU OU. Topic detection and tracking for threaded discussion communities[ C]// Proceedings of the 2008 IEEE/WIC/ACM International Conferences on Web Intelli- gence and Intelligent Agent Technology. Washington: IEEE Compute Society, 2008:77-83.
  • 7杨洁,季铎,蔡东风,林晓庆,白宇.基于联合权重的多文档关键词抽取技术[J].中文信息学报,2008,22(6):75-79. 被引量:15
  • 8邱立坤,程葳.面向BBS的话题挖掘初探[C]//自然语言理解与大规模内容计算.北京:清华大学出版社,2005:401-407.
  • 9刘群 李素建.基于《知网》的词汇语义相似度的计算[A]..第三届汉语词汇语义学研讨会[C].台北,2002..

二级参考文献21

  • 1李素建,王厚峰,俞士汶,辛乘胜.关键词自动标引的最大熵模型应用研究[J].计算机学报,2004,27(9):1192-1197. 被引量:92
  • 2金珠,林鸿飞,赵晶.基于HowNet的话题跟踪及倾向性分类研究[J].情报学报,2005,24(5):555-561. 被引量:21
  • 3王会珍,朱靖波,季铎,叶娜,张斌.基于反馈学习自适应的中文话题追踪[J].中文信息学报,2006,20(3):92-98. 被引量:17
  • 4索红光,刘玉树,曹淑英.一种基于词汇链的关键词抽取方法[J].中文信息学报,2006,20(6):25-30. 被引量:88
  • 5Jilin Chen, Benyu Zhang, Dou Shen, Qiang Yang. Zheng Chen. Diverse Topic Phrase Extraction from Text Collection. Data Mining [C]//ICDM apos: 06. Sixth International Conference on Volume, Issue, Digital Object Identifier. 2006.
  • 6Blaz Fortuna, Dunja Mladenic, Marko Grobelnik . Semi-Automatic Construction of Topic Ontology[C]// ESWC 2005.
  • 7Khaled M. Hammouda, Diego N. Matute, and Mohamed S. Kamel. CorePhrase: Keyphrase Extraction for Document Clustering[C]//Machine Learning and Data Mining in Pattern Recognition. 2005: 265-274.
  • 8Neto, J., Santos, A., Kaestner, C., Freitas, A. Document clustering and text summarization [C]// Proc. 4th International Conference Practical Applications of Knowledge Discovery and Data Mining (PADD-2000), London, UK: 2000:41-55.
  • 9Salton, G. (1991): Developments in Automatic Text Retrieval[J]. Science, Vol 253, 974-979.
  • 10K.B. Khoo and M. Ishizuka. Emerging Topic Track ing System [C]//Proc. of Web Intelligent (WI 2001), LNAI 2198 (Springer), Maebashi, Japan: 2001: 125-130.

共引文献51

同被引文献208

引证文献22

二级引证文献102

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部