Abstract
Aiming at the problem that most current Named Entity Recognition (NER) models only use character-level information encoding and lack extraction of textual hierarchical information, a Chinese NER (CNER) model incorporating Multi-granularity linguistic knowledge and Hierarchical information (CMH) was proposed. First, the text was encoded by a model pre-trained with multi-granularity linguistic knowledge, so that the model could capture both fine-grained and coarse-grained linguistic information of the text and thus better represent the corpus. Second, hierarchical information was extracted with the ON-LSTM (Ordered Neurons Long Short-Term Memory network) model, exploiting the hierarchical structure of the text itself to strengthen the temporal relationships among the encodings. Finally, at the decoding end of the model, the word segmentation information of the text was incorporated and the entity recognition problem was transformed into a table-filling problem, in order to better handle entity overlap and obtain more accurate entity recognition results. Meanwhile, to address the poor transfer ability of current models across different domains, the concept of universal entity recognition was proposed, and a universal NER dataset, MDNER (Multi-Domain NER dataset), was constructed by selecting universal entity types from multiple domains to enhance the generalization ability of the model across domains. To validate the effectiveness of the proposed model, experiments were conducted on the Resume, Weibo, and MSRA datasets; compared with the MECT (Multi-metadata Embedding based Cross-Transformer) model, the F1 scores were improved by 0.94, 4.95, and 1.58 percentage points, respectively. To verify the proposed model's entity recognition performance in multiple domains, experiments were conducted on MDNER, where the F1 score reached 95.29%. The experimental results show that multi-granularity linguistic knowledge pre-training, extraction of the hierarchical structural information of the text, and the efficient pointer decoder are crucial to the performance improvement of the model.
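The abstract's reformulation of entity recognition as table filling with an efficient pointer decoder can be illustrated with a minimal sketch (not the authors' code; the function name, score values, and thresholding rule are illustrative assumptions). Each entity type gets a score table over character spans (start index, end index); every cell above a threshold is decoded as an entity, which is why nested or overlapping spans can be recovered simultaneously:

```python
# Hedged sketch of table-filling span decoding (illustrative, not the paper's
# implementation). For each entity type t, table[i][j] scores the span from
# character i to character j; any cell above the threshold is emitted as an
# entity, so overlapping and nested entities fall out naturally.

def decode_span_table(score_tables, threshold=0.0):
    """score_tables: {entity_type: n x n list of span scores}.
    Returns a list of (entity_type, start, end) triples."""
    entities = []
    for ent_type, table in score_tables.items():
        n = len(table)
        for i in range(n):
            for j in range(i, n):  # only valid spans with start <= end
                if table[i][j] > threshold:
                    entities.append((ent_type, i, j))
    return entities

# Toy example for a 4-character sentence such as "北京大学":
# the full span [0, 3] scores high as ORG, and the nested span [0, 1] as LOC.
scores = {
    "ORG": [[-1, -1, -1, 2.5],
            [-1, -1, -1, -1],
            [-1, -1, -1, -1],
            [-1, -1, -1, -1]],
    "LOC": [[-1, 1.2, -1, -1],
            [-1, -1, -1, -1],
            [-1, -1, -1, -1],
            [-1, -1, -1, -1]],
}
print(decode_span_table(scores))  # → [('ORG', 0, 3), ('LOC', 0, 1)]
```

In a real efficient-pointer decoder the score tables are produced by the encoder (here, the CMH encoding stack), typically via a bilinear interaction between start and end token representations; the decoding step above stays the same.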
Authors
YU Youren; ZHANG Yangsen; JIANG Yuru; HUANG Gaijuan (Institute of Intelligent Information Processing, Beijing Information Science and Technology University, Beijing 100101, China)
Source
Journal of Computer Applications (《计算机应用》)
Indexed in: CSCD; Peking University Core Journals
2024, No. 6, pp. 1706-1712 (7 pages)
Funding
Supported by the National Natural Science Foundation of China (62176023).
Keywords
Named Entity Recognition(NER)
Natural Language Processing(NLP)
knowledge graph construction
efficient pointer
generic entity