期刊文献+

基于组合网络的多特征老挝语实体关系抽取研究

Combined Network Based Multi-feature Lao Language Entity Relationship Extraction
下载PDF
导出
摘要 实体关系抽取旨在提取实体之间存在的语义关系,这可以为知识图谱、自动问答等下游任务提供支持,在自然语言处理领域具有重要作用。由于当前老挝语实体关系抽取的相关研究十分匮乏,可用数据也十分有限,因此在训练时神经网络无法获取足够的语义信息。针对此问题,该文提出了一种基于PCNN和BiGRU的组合模型的多特征老挝语实体关系抽取方法。首先,将位置特征与音素特征融入到词向量中得到包含多种语义的联合向量;然后,分别使用PCNN模型和BiGRU模型对联合向量进行深层语义的提取,其中PCNN模型能够更好地提取文本中的局部信息,BiGRU模型能够更好地考虑文本的全局信息,之后将两个模型的输出进行拼接,便得到了包含多维度语义信息的句子向量;最后,使用softmax进行多分类计算。实验表明,该文提出的方法,在有限的数据下得到了不错的效果,macro-averaged F1达到了82.25%。 Entity relation extraction aims to extract the semantic relations between entities,which can provide support for downstream tasks such as knowledge graphs and automatic question and answer.Due to the lack of research related to entity relation extraction in Lao language with very limited data,this paper proposes a multi-feature Lao entity relation extraction method based on the combined model of PCNN and BiGRU.First,the position feature and phoneme feature are integrated into the word vector to obtain joint vector containing multiple semantics.Then,the PCNN model and the BiGRU model are used to extract the deep semantics of the joint vector,respectively.Among them,the PCNN model can better extract the local information in the text,and the BiGRU model can better consider the global information of the text,and the output of the two models are concatenated to obtain multi-dimensional semantic information.Finally,the softmax is used for multi-class predication.Experiments show that the method proposed in this paper has obtained 82.25%macro-averaged F 1 with limited data.
作者 马霄飞 周兰江 周蕾越 MA Xiaofei;ZHOU Lanjiang;ZHOU Leiyue(Faculty of Information Engineering and Automation,Kunming University of Science and Technology,Kunming,Yunnan 650500,China;Faculty of Electrical and Information Engineering,Oxbridge College of Kunming University of Science and Technology,Kunming,Yunnan 650160,China)
出处 《中文信息学报》 CSCD 北大核心 2024年第6期96-107,共12页 Journal of Chinese Information Processing
基金 国家自然科学基金(61662040)。
关键词 多段卷积神经网络 双向门控循环单元 音素特征 联合向量 层归一化 PCNN BiGRU phoneme feature joint vector layer normalization
  • 相关文献

参考文献5

二级参考文献32

  • 1In: Proceedings of the 6th Message Understanding Conference (MUC - 7) [ C ]. National Institute of Standars and Technology, 1998.
  • 2C. Aone and M. Ramos-Santacruz. Rees: A large-scale relation and event extraction system[A]. In: Proceedings of the 6th Applied Natural Language Processing Conference[C] ,pages 76- 83, 2000.
  • 3S. Miller, M. Crystal, H. Fox, L. Ramshaw, R. Schwartz, R. Stone, R. Weischedel, and the Annotation Group.Algorithms that learn to extract information-BBN: Description of the SIFT system as used for MUC[ A]. In: Proceedings of the Seventh Message Understanding Conference (MUC-7)[C], 1998.
  • 4S. Soderland. Learning information extraction rules for semi-structured and free text[J]. Machine Learning, 1999. 34(1 - 3) :233 - 272.
  • 5N. Cristianini and J. Shawe-Taylor. An Introduction to Support Vector Machines[ M]. Cambridge University Press,Cambirdge University, 2000.
  • 6T. Zhang. Regularized winnow methods[A]. In: Advances in Neural Information Processing Systems 13[C], pages703 - 709, 2001.
  • 7D. Haussler. Convolution kernels on discrete structures[R]. Technical Report UCSC-CRL- 99- 10, 7, 1999.
  • 8H. Lodhi, C. Saunders, J. Shawe-Taylor, N. Cristianini, and C. Watkins. Text classification using string kernels[R]. J. Mach. Learn. Res., 2:419-444, 2002.
  • 9D. Zelenko, C. Aone, andA. Richardella. Kernel methods for relation extraction[R]. J. Mach. Learn. Res., 3:1083- 1106, 2003.
  • 10A. Culotta and J. Sorensen. Dependency tree kernels for relation extraction [ A]. In: Proceedings of ACL[ C ].2004. Barcelona, Spain.

共引文献158

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部