期刊文献+

舆情事件网页内容的词汇关联分析算法实现研究

Research on the Analysis Algorithm of Word Co-Occurrence on Public Opinion Evens
下载PDF
导出
摘要 基于舆情事件的词汇关联分析,既是面向网络舆情的情报研究中的一项关键技术,也是保证和提高网络舆情分析质量的一个重要途径。文章研究基于词跨度的关键词获取算法,对候选关键词进行权重计算。研究计算词汇之间的共现率算法,通过限定范围和结果组配的方法识别词汇间的关系。实验测试取得了良好效果,对于提高网络舆情事件分析的质量有重要意义和应用价值。 The words co-occurrence analysis on public opinions is not only a key technology in network public opinion analysis, but also an important way to ensure and improve quality of network public opinion analysis. This paper studies the acquisition algorithm based on word span keywords which filters out unrelated words. Word frequency is computed and the location of candidate key- words is marked. The computing vocabulary between the co-occurrence rate algorithm is studied through limited scope and results of group with methods to identify the relationship between words, and a set of keywords are drawn. Experimental test has achieved good results and verifies the significant application value for improving the quality of public opinion analysis.
出处 《信息工程大学学报》 2014年第1期105-110,共6页 Journal of Information Engineering University
基金 科研基金资助项目
关键词 舆情事件 关键词抽取 关联分析 public opinion event keyword extraction correlation analysis
  • 相关文献

参考文献8

  • 1郭邦财.蜜蜂群并行网页抓取系统[J].软件导刊,2011,10(1):68-70. 被引量:2
  • 2中国电子商务研究中心.中文分词算法及其比较分析[EB/OL].[2012-09-20].http://b2b.toocle.corn/detail-5095609.html.
  • 3LI Juanzi, FAN Qi'na, ZHANG Kuo. Keyword Extraction Based on tf/idf for Chinese News Document[ J l. Wuhan University Journal of Natural Sciences, 2007,12 ( 5 ) : 20-25.
  • 4马颖华,王永成,苏贵洋,张宇萌.一种基于字同现频率的汉语文本主题抽取方法[J].计算机研究与发展,2003,40(6):874-878. 被引量:48
  • 5Matsuo Y, Ishizuka M. Keyword Extraction from a Single Document Using Word Co-occurrence Statistical Information [ J ]. Journal on Artificial Intelligence Tools,2004,13 ( 1 ) : 157-169.
  • 6武磊.基于事件的词汇相关性分析及应用研究[D].上海:南京政治学院,2012.
  • 7hang k, Xu H, Tang J, Li J Z. Keyword Extraction Using Support Vector Machine[ C ]//Proceedings of the Seventh Interna- :ional Conference on Web-Age Information Management( WAIM2006). 2006:85-96.
  • 8LI Juanzi FAN Qi'na ZHANG Kuo.Keyword Extraction Based on tf/idf for Chinese News Document[J].Wuhan University Journal of Natural Sciences,2007,12(5):917-921. 被引量:24

二级参考文献4

共引文献67

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部