Convolutional Network Entity Missing Detection Method Combined with Gated Mechanism (结合门控机制的卷积网络实体缺失检测方法)

Abstract: Whether entity information is sufficient directly affects applications that depend on textual entity information, yet conventional entity recognition models can only identify entities that are present in the text. This paper defines entity missing detection as a sequence labeling task that aims to find the positions where an entity is missing, and proposes three corresponding methods for constructing training data. Based on the characteristics of the task, it proposes an entity missing detection method that combines a convolutional neural network with a gating mechanism and a pre-trained language model. Experiments show that the model based on the pre-trained language model and the gated convolutional network reaches maximum F1 scores of 80.45% for missing person (PER) entities, 83.02% for missing organization (ORG) entities, and 86.75% for missing location (LOC) entities, significantly higher than LSTM-based named entity recognition models. Character frequency statistics further show that the accuracy of models trained on datasets built with the different annotation methods correlates with the frequency of the annotated characters.
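The abstract does not spell out the three data construction methods, so the following is only a minimal sketch of the general idea: delete a known entity from a sentence and tag a neighbouring character so that a sequence labeling model can learn where something is missing. The function name `make_missing_example`, the `MISS-<TYPE>` label format, and the `mark` options are assumptions made for illustration, not the authors' scheme.

```python
from typing import List, Tuple


def make_missing_example(
    chars: List[str], start: int, end: int, ent_type: str, mark: str = "next"
) -> Tuple[List[str], List[str]]:
    """Delete the entity chars[start:end] and label one neighbouring character.

    Hypothetical labeling scheme (not taken from the paper):
      mark="next" tags the first character after the gap,
      mark="prev" tags the last character before the gap.
    If the chosen neighbour does not exist (entity at the sentence edge),
    every label stays "O".
    """
    if not (0 <= start < end <= len(chars)):
        raise ValueError("invalid entity span")
    if mark not in ("next", "prev"):
        raise ValueError("mark must be 'next' or 'prev'")

    kept = chars[:start] + chars[end:]       # sentence with the entity removed
    labels = ["O"] * len(kept)               # one label per remaining character
    pos = start if mark == "next" else start - 1
    if 0 <= pos < len(kept):
        labels[pos] = f"MISS-{ent_type}"     # marks where the entity used to be
    return kept, labels


# Example: remove the location entity at character positions 3..4.
sentence = list("张三在北京工作")
kept_next, labels_next = make_missing_example(sentence, 3, 5, "LOC", mark="next")
kept_prev, labels_prev = make_missing_example(sentence, 3, 5, "LOC", mark="prev")
```

Tagging the character after the gap versus the one before it gives two different datasets from the same corpus, which is one plausible way that the choice of annotation method could interact with the frequency of the annotated characters.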

Authors: YE Han (叶瀚), LI Xin (李欣), SUN Haichun (孙海春) (School of Information and Cyber Security, People's Public Security University of China, Beijing 102623, China)

Affiliation: School of Information and Cyber Security, People's Public Security University of China

Source: Computer Science (《计算机科学》), CSCD, Peking University Core Journal, 2023, No. 5, pp. 262-269 (8 pages)

Funding: Technology Research Program of the Ministry of Public Security (2020JSYJC22, 2021JSZ09).

Keywords: Gated mechanism; Abnormal detection; Pre-trained language model; Convolutional neural network

CLC number: TP391 [Automation and Computer Technology: Computer Application Technology]
