期刊文献+

基于BiLSTM-CRF的体育新闻主题词抽取方法

Topic Word Extraction Based on BiLSTM-CRF for Sport News
下载PDF
导出
摘要 针对典型的循环神经网络方法在抽取主题词时因缺少上下文相关的句子级信息而导致识别准确率较低的问题,提出了一种基于双向长短期记忆网络条件随机场(BiLSTM-CRF)模型联合TextRank的主题词抽取方法。首先,利用TextRank对新闻文本进行主题句抽取,再使用双向长短期记忆(BiLSTM)模型获取文本的前后特征,最后使用条件随机场(CRF)完成句子级序列标注,得到主题词。在多组体育类新闻数据集上进行实验,该方法较对照组BiLSTM方法F1值提高约0.8%~5.1%,且用时更短。因此,改进的BiLSTM-CRF方法可显著提升主题词的抽取准确率和效率。 To solve the problem of low recognition accuracy caused by the lack of text context information in typical recurrent neural network for extracting topic words,we proposed a novel method for extracting topic words based on Bidirectional Long Short-Term Memory(BiLSTM)network with Conditional Random Field.Firstly,the topic sentences were extracted from news texts by the TextRank model.Then,the forward and backward characters of texts were obtained by BiLSTM network.Finally,the topic words were sequence-tagged in sentence-level by a Conditional Random Field layer.Experiments were performed on multiple sports news datasets.Compared with the control group of BiLSTM method,the F1 value increases by 0.8%-5.1%.The experimental results show that our method can significantly improve the accuracy and efficiency of topic word extraction.
作者 江逸琪 赵彤洲 柴悦 高佩东 JIANG Yiqi;ZHAO Tongzhou;CHAI Yue;GAO Peidong(School of Computer Science and Engineering,Wuhan Institute of Technology,Wuhan 430205,China)
出处 《武汉工程大学学报》 CAS 2020年第1期102-107,共6页 Journal of Wuhan Institute of Technology
基金 国家自然科学基金(61601176) 武汉工程大学中青年人才项目(Q20191510) 武汉工程大学研究生创新基金(CX2018195)。
关键词 抽取 TextRank BiLSTM-CRF sport news topic word extraction TextRank BiLSTM-CRF
  • 相关文献

参考文献8

二级参考文献52

  • 1何干俊.汉语音译词的多维考察[J].江西社会科学,2012,32(4):194-197. 被引量:4
  • 2左晓飞,刘怀亮,范云杰,赵辉.基于概念语义场的文本聚类算法研究[J].情报杂志,2012,31(5):180-184. 被引量:2
  • 3张仰森,曹元大,俞士汶.语言模型复杂度度量与汉语熵的估算[J].小型微型计算机系统,2006,27(10):1931-1934. 被引量:7
  • 4黄昌宁,赵海.中文分词十年回顾[J].中文信息学报,2007,21(3):8-19. 被引量:248
  • 5XU Wei, AlexRudnicky. Can artificial neural networks learn lan- guage modelsT[C]// Proceedings of International Conference on Spoken Language Processing. 2000.
  • 6Bengio Yoshua. A neural probabilisfic language model[J]. Journal of Machine Learning Research, 2003, 10(3): 1137-1155.
  • 7Tom'aVs Mikolov. Statistical language models based on neural networks[D]. Brno University of Technology, Czech Republic, 2012.
  • 8Mikolov T, Karafi'at M, Burget L, et al. Recurrent neural network based language model[C]// Proceedings of Interspeech, 2010: 1045-1048.
  • 9Kombrink S, Mikolov T, Karafrat M, et al. Recurrent neural network based language modeling in meeting recognition[C]// Proceedings of Interspeech, 2011: 2877-2880.
  • 10Mikolov T, Kombrink S, Burget L, et al. Extensions of recurrent neural network language model[C]// Proceedings of ICASSP, 2011: 5528-5531.

共引文献208

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部