Chinese Negation Recognition Based on BiLSTM-CRF (基于BiLSTM-CRF模型的汉语否定信息识别) Cited by: 2
Abstract: Negation recognition separates affirmative information from negated information in natural language, and it plays an important role in information retrieval, text mining, and sentiment analysis. This paper studies two subtasks of Chinese negation recognition: cue (trigger word) detection and scope recognition. The model combines a bidirectional long short-term memory network with a conditional random field (BiLSTM-CRF). Pre-trained word embeddings serve as the input features for cue detection, and known-cue features are then added for scope recognition. On the Chinese negation and speculation information corpus, cue detection reaches an F1 of 91.03%, and scope recognition reaches its highest F1, 73.91%, on the financial news sub-corpus of that corpus. The experimental results show that this model outperforms both the CRF model and the BiLSTM model on Chinese negation cue detection and scope recognition.
Authors: CHEN Shimei (陈世梅); WU Xing (伍星); TANG Fan (唐凡) (College of Computer Science, Chongqing University, Chongqing 400044, China; Ppdai Group Inc., Shanghai 201210, China)
Source: Journal of Chinese Information Processing (《中文信息学报》), indexed in CSCD and the Peking University Core Journals list, 2018, No. 11, pp. 55-61 (7 pages)
Funding: National Natural Science Foundation of China (51608070)
Keywords: BiLSTM-CRF; negation cue (cue detection); negation scope (scope recognition)
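
The abstract pairs a BiLSTM, which scores each token independently, with a CRF layer, which scores whole tag sequences. The sketch below illustrates why that pairing can help for cue detection. It is not the authors' implementation: the tag set (`O`, `B-CUE`, `I-CUE`), the function name `viterbi_decode`, the toy sentence, and all scores are assumptions made up for illustration. It decodes the best tag sequence from per-token emission scores (the role a BiLSTM would play) plus tag-to-tag transition scores (the role of the CRF).

```python
from typing import Dict, List


def viterbi_decode(
    emissions: List[Dict[str, float]],
    transitions: Dict[str, Dict[str, float]],
    tags: List[str],
) -> List[str]:
    """Return the highest-scoring tag sequence for one sentence.

    emissions[t][tag]  : per-token score (hypothetical stand-in for BiLSTM output)
    transitions[a][b]  : score for moving from tag a to tag b (the CRF part)
    """
    if not emissions:
        return []
    # best[t][tag] = best score of any path that ends in `tag` at position t
    best = [{tag: emissions[0][tag] for tag in tags}]
    back: List[Dict[str, str]] = []
    for t in range(1, len(emissions)):
        scores: Dict[str, float] = {}
        pointers: Dict[str, str] = {}
        for cur in tags:
            prev = max(tags, key=lambda p: best[-1][p] + transitions[p][cur])
            scores[cur] = best[-1][prev] + transitions[prev][cur] + emissions[t][cur]
            pointers[cur] = prev
        best.append(scores)
        back.append(pointers)
    # Follow the back-pointers from the best final tag to recover the path
    path = [max(tags, key=lambda tag: best[-1][tag])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    return path[::-1]


TAGS = ["O", "B-CUE", "I-CUE"]

# Toy sentence "我 不 喜欢 它"; all scores are invented for illustration.
EMISSIONS = [
    {"O": 2.0, "B-CUE": 0.1, "I-CUE": 0.0},  # 我
    {"O": 0.2, "B-CUE": 1.0, "I-CUE": 1.2},  # 不 (locally, I-CUE scores highest)
    {"O": 1.5, "B-CUE": 0.1, "I-CUE": 0.3},  # 喜欢
    {"O": 1.8, "B-CUE": 0.0, "I-CUE": 0.0},  # 它
]

# Assumed transition scores: only the invalid move O -> I-CUE is penalized.
TRANSITIONS = {a: {b: 0.0 for b in TAGS} for a in TAGS}
TRANSITIONS["O"]["I-CUE"] = -10.0

# Per-token argmax ignores transitions and can produce an invalid sequence.
greedy = [max(TAGS, key=lambda tag: e[tag]) for e in EMISSIONS]
decoded = viterbi_decode(EMISSIONS, TRANSITIONS, TAGS)

print(greedy)   # ['O', 'I-CUE', 'O', 'O']
print(decoded)  # ['O', 'B-CUE', 'O', 'O']
```

In this toy case the per-token choice starts a cue with `I-CUE`, which is not a valid tag sequence, while sequence-level decoding picks `B-CUE` for the negation word. In a trained model the transition scores would be learned rather than set by hand.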
