Design and Implementation of a Pretraining Active Learning Model for Unstructured Event Detection

Chinese title: 面向事件检测的预训练主动学习模型 (cited by: 1)


Abstract: With the rapid growth of online information, finding the key information has become increasingly important. Event detection focuses on extracting event triggers from unstructured natural language text. Deep learning has achieved notable success on event detection tasks, but the models rely heavily on large amounts of labeled data. Because events carry structured information and rich label representations, annotations are costly and hard to obtain at scale. To improve annotation efficiency and reduce the number of labeled samples needed for training, this paper proposes an event detection model that combines active learning with a pre-trained model (EDPAL). To handle the cold-start problem of active learning, a sample selection strategy based on fused uncertainty is designed to estimate each sample's potential contribution to fine-tuning the downstream event detection task. On the one hand, exploiting the rich semantic information that the pre-trained model carries over from its original task avoids redesigning the network structure or training from scratch. On the other hand, using active learning to select informative samples fine-tunes the pre-trained model more effectively and reduces the cost of data labeling. Numerical experiments on the ACE 2005 corpus demonstrate the effectiveness of the proposed EDPAL algorithm.
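The abstract does not say how the "fused uncertainty" score is computed, so the following is only an illustrative sketch of uncertainty-based sample selection for active learning, not the EDPAL strategy itself. It assumes that each unlabeled sample is summarized by a single predicted class distribution and that the fused score is a weighted mix of normalized entropy and least confidence. The function names and the `alpha` weight are hypothetical.

```python
import math
from typing import List, Sequence


def entropy(probs: Sequence[float]) -> float:
    """Shannon entropy of a class distribution, normalized to [0, 1]."""
    h = -sum(p * math.log(p) for p in probs if p > 0.0)
    return h / math.log(len(probs))


def least_confidence(probs: Sequence[float]) -> float:
    """1 - max probability, rescaled so a uniform distribution scores 1."""
    k = len(probs)
    return (1.0 - max(probs)) * k / (k - 1)


def fused_uncertainty(probs: Sequence[float], alpha: float = 0.5) -> float:
    """Assumed fusion rule: convex combination of the two scores above."""
    return alpha * entropy(probs) + (1.0 - alpha) * least_confidence(probs)


def select_batch(pool_probs: List[Sequence[float]],
                 batch_size: int,
                 alpha: float = 0.5) -> List[int]:
    """Return indices of the `batch_size` most uncertain pool samples."""
    scores = [fused_uncertainty(p, alpha) for p in pool_probs]
    ranked = sorted(range(len(pool_probs)),
                    key=lambda i: scores[i], reverse=True)
    return ranked[:batch_size]


if __name__ == "__main__":
    # Predicted distributions for three unlabeled samples (made-up numbers).
    pool = [[0.98, 0.01, 0.01], [0.34, 0.33, 0.33], [0.70, 0.20, 0.10]]
    # The selected samples would be sent for annotation and then used
    # to fine-tune the pre-trained model in the next round.
    print(select_batch(pool, batch_size=2))
```

In a full active learning loop, the selected indices would be annotated, added to the labeled set, and used to fine-tune the pre-trained model before the pool is scored again.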

Authors: Feng Linhui; Qiao Linbo; Kan Zhigang (National Laboratory for Parallel and Distributed Processing, National University of Defense Technology, Changsha 410073, China)

Affiliation: National Laboratory for Parallel and Distributed Processing, National University of Defense Technology

Source: Journal of Nanjing Normal University (Engineering and Technology Edition), CAS, 2022, No. 2, pp. 41-47 (7 pages)

Keywords: active learning; event detection; pre-trained model; sample selection strategy; fine-tuning

Classification codes: O643 [Science: Physical Chemistry]; X703 [Environmental Science and Engineering: Environmental Engineering]


