
Relation Classification via Sequence Features and Bi-Directional LSTMs (Cited by: 6)

Abstract: Structure features require complicated pre-processing and are probably domain-dependent. To reduce the time cost of pre-processing, we propose a novel neural network architecture, a bi-directional long short-term memory recurrent neural network (Bi-LSTM-RNN) model that uses low-cost sequence features such as words and part-of-speech (POS) tags to classify the relation between two entities. First, the model performs bi-directional recurrent computation along the tokens of a sentence. Then, the sequence is divided into five parts, and standard pooling functions are applied over the token representations of each part. Finally, the pooled representations are concatenated and fed into a softmax layer for relation classification. We evaluate our model on two standard benchmark datasets from different domains, SemEval-2010 Task 8 and BioNLP-ST 2016 Task BB3. On SemEval-2010 Task 8, the model matches the state-of-the-art models, achieving an F1 of 83.0%. On BioNLP-ST 2016 Task BB3, it obtains an F1 of 51.3%, comparable with that of the best system. Moreover, we find that the context between the two target entities plays an important role in relation classification and can serve as a replacement for the shortest dependency path.
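The five-part scheme the abstract outlines splits a sentence by its two entity mentions into [before e1] [e1] [between] [e2] [after e2], pools over each segment, and classifies the concatenation. The following is a minimal, hypothetical PyTorch sketch of that idea, not the authors' implementation: it uses only word embeddings (the paper also uses POS tags), picks max pooling as one of the "standard pooling functions", and all layer sizes and names are illustrative assumptions (19 output classes matches SemEval-2010 Task 8).

import torch
import torch.nn as nn

class BiLSTMRelationClassifier(nn.Module):
    # Illustrative sketch of a Bi-LSTM-RNN with five-part pooling.
    def __init__(self, vocab_size, embed_dim=100, hidden_dim=100, num_relations=19):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True,
                            bidirectional=True)
        # Five pooled segments, each 2 * hidden_dim wide (forward + backward states).
        self.classifier = nn.Linear(5 * 2 * hidden_dim, num_relations)

    def forward(self, token_ids, e1_span, e2_span):
        # token_ids: (seq_len,) word indices; e1_span/e2_span: (start, end)
        # token offsets of the two entities, e1 before e2, end exclusive.
        h, _ = self.lstm(self.embed(token_ids).unsqueeze(0))  # (1, seq_len, 2H)
        h = h.squeeze(0)
        # Segment boundaries: before-e1, e1, between, e2, after-e2.
        bounds = [0, e1_span[0], e1_span[1], e2_span[0], e2_span[1], len(token_ids)]
        pooled = []
        for lo, hi in zip(bounds[:-1], bounds[1:]):
            if hi > lo:
                pooled.append(h[lo:hi].max(dim=0).values)  # max pooling per segment
            else:
                pooled.append(h.new_zeros(h.size(1)))      # empty segment -> zeros
        return self.classifier(torch.cat(pooled))  # logits; softmax during training

For a sentence whose entities span tokens (2, 4) and (7, 8), for example, the five segments are tokens [0:2], [2:4], [4:7], [7:8], and [8:]; the third pooled vector covers the between-entity context that the abstract identifies as a possible replacement for the shortest dependency path.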
Affiliation: School of Computer
Source: Wuhan University Journal of Natural Sciences (CAS, CSCD), 2017, No. 6, pp. 489-497 (9 pages)
Funding: Supported by the China Postdoctoral Science Foundation (2014T70722) and the Humanities and Social Science Foundation of the Ministry of Education of China (16YJCZH004)
Keywords: Bi-LSTM-RNN; relation classification; sequence features; structure features


