期刊文献+

基于双向LSTM和GBDT的中医文本关系抽取模型 被引量:11

TCM text relationship extraction model based on bidirectional LSTM and GBDT
下载PDF
导出
摘要 为解决采用softmax作为长短期记忆网络分类器导致实体关系识别模型泛化能力不足,不能较好适用中医实体关系抽取等问题,提出一种融合梯度提升树的双向长短期记忆网络的关系识别算法(BILSTM-GBDT)。先采用word2vec对中医文本进行向量化表示,再利用基于注意力机制的双向长短期记忆网络提取高阶特征,最后采用集成分类模型梯度提升树作为特征分类器,提高关系识别效果。在中医等多个关系语料库上的实验结果表明,该模型与传统SVM方法、GBDT方法及其深度学习方法相比,均有更高的精确率、召回率和F值。 In order to solve the problem that the use of softmax as a long-short-term memory network classifier leads to the lack of generalization ability of the entity relationship recognition model,it is not suitable for the extraction of TCM entity relationships. This paper proposed a bidirectional long short-term memory( BILSTM) relational identification algorithm( BILSTM-GBDT) that incorporates a gradient boosting decision tree( GBDT). Firstly,it trained the Chinese medicine text vector by word2 vec,then extracted the high-order features by the bidirectional long short-term memory network based on the attention mechanism. Finally,it used the integrated classification model gradient lifting tree as the feature classifier to improve the relationship recognition effect. Experimental results on multiple relational corpora such as Chinese medicine show that the model has higher accuracy,recall and F value than traditional SVM method,GBDT method and deep learning method.
作者 罗计根 杜建强 聂斌 熊旺平 刘蕾 贺佳 Luo Jigen;Du Jianqiang;Nie Bin;Xiong Wangping;Liu Lei;He Jia(School of Computer,Jiangxi University of Traditional Chinese Medicine,Nanchang 330004,China)
出处 《计算机应用研究》 CSCD 北大核心 2019年第12期3744-3747,共4页 Application Research of Computers
基金 国家自然科学基金资助项目(61363042,61562045,61762051) 江西省科技厅重大研发计划资助项目(20171ACE50021) 江西省科技厅重点研发计划资助项目(20171BBG70108) 江西省研究生创新专项资金资助项目(YC2017-S349)
关键词 关系抽取 长短期记忆网络 梯度提升树 注意力机制 中医文本 relationship extraction LSTM GBDT attention mechanism Chinese medicine text
  • 相关文献

参考文献9

二级参考文献91

共引文献297

同被引文献127

引证文献11

二级引证文献41

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部