Topic Clues Extraction of Network News Based on Conditional Random Fields

Original title: 基于CRF模型的网络新闻主题线索发掘研究 (cited by: 6)
Abstract: To accurately trace how clues develop across a large volume of online news on the same topic, this paper proposes a topic clue mining method based on the conditional random field (CRF) model. First, features are derived from the identification rules for topic clue sentences and fed to the CRF model to extract candidate topic clue sentences. Next, raw clue chains are built in chronological order, taking lexical weight into account. Finally, semantically similar raw clue chains are merged to obtain the overall development thread of the news topic. Experimental results show that the method performs well at identifying topic clue sentences and that the resulting clue chains present the development trend of the news fairly clearly.
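The second and third steps of the abstract (chronological chain building, then merging of similar chains) can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and is not taken from the paper: the `ClueSentence` type, the `seed_terms` used to start chains, Jaccard overlap as the similarity measure, and the 0.5 threshold. The CRF labeling step is assumed to have run already, and lexical weighting is omitted.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class ClueSentence:
    """A sentence already labeled as a topic clue (e.g. by a CRF tagger, not shown)."""
    published: date
    tokens: frozenset  # hypothetical: content words after word segmentation


def _order(sentence):
    # Chronological order, with the token list as a deterministic tie-breaker.
    return (sentence.published, sorted(sentence.tokens))


def build_chains(sentences, seed_terms):
    """Build one raw chain per seed term, ordered by publication date."""
    chains = []
    for term in seed_terms:
        members = sorted((s for s in sentences if term in s.tokens), key=_order)
        if members:
            chains.append(members)
    return chains


def vocabulary(chain):
    """All tokens that occur anywhere in a chain."""
    return frozenset().union(*(s.tokens for s in chain))


def jaccard(a, b):
    """Jaccard overlap, an assumed stand-in for semantic similarity."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def merge_chains(chains, threshold=0.5):
    """Greedily merge chains whose vocabularies overlap at or above threshold."""
    merged = []
    for chain in chains:
        for i, existing in enumerate(merged):
            if jaccard(vocabulary(existing), vocabulary(chain)) >= threshold:
                # Deduplicate shared sentences, then restore chronological order.
                merged[i] = sorted({*existing, *chain}, key=_order)
                break
        else:
            merged.append(list(chain))
    return merged


# Toy data: two sentences on one story line, one on another.
s1 = ClueSentence(date(2017, 1, 1), frozenset({"earthquake", "rescue"}))
s2 = ClueSentence(date(2017, 1, 3), frozenset({"earthquake", "rescue", "aid"}))
s3 = ClueSentence(date(2017, 1, 2), frozenset({"election", "vote"}))

raw_chains = build_chains([s2, s3, s1], ["earthquake", "rescue", "election"])
timeline = merge_chains(raw_chains)
```

With this toy input, the chains seeded by "earthquake" and "rescue" contain the same sentences and collapse into one, while the "election" chain stays separate. A real system would replace the Jaccard overlap with whatever semantic similarity measure the paper uses.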
Authors: XU Jing (徐静), YANG Xiaoping (杨小平)
Source: Journal of Chinese Information Processing (《中文信息学报》), indexed in CSCD and the Peking University Core Journals list, 2017, No. 3, pp. 94-100 (7 pages)
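Funding: National Natural Science Foundation of China (71271209); Beijing Natural Science Foundation (4132067); Ministry of Education Humanities and Social Sciences Youth Fund (11YJC630268)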
Keywords: topic clue; conditional random fields; clue chain