期刊文献+
共找到17篇文章
< 1 >
每页显示 20 50 100
A comprehensive review of existing corpora and methods for creating annotated corpora for event extraction tasks
1
作者 Mohd Hafizul Afifi Abdullah Norshakirah Aziz +3 位作者 Said Jadid Abdulkadir Kashif Hussain Hitham Alhussian Noureen Talpur 《Journal of Data and Information Science》 CSCD 2024年第4期196-238,共43页
Purpose:The purpose of this study is to serve as a comprehensive review of the existing annotated corpora.This review study aims to provide information on the existing annotated corpora for event extraction,which are ... Purpose:The purpose of this study is to serve as a comprehensive review of the existing annotated corpora.This review study aims to provide information on the existing annotated corpora for event extraction,which are limited but essential for training and improving the existing event extraction algorithms.In addition to the primary goal of this study,it provides guidelines for preparing an annotated corpus and suggests suitable tools for the annotation task.Design/methodology/approach:This study employs an analytical approach to examine available corpus that is suitable for event extraction tasks.It offers an in-depth analysis of existing event extraction corpora and provides systematic guidelines for researchers to develop accurate,high-quality corpora.This ensures the reliability of the created corpus and its suitability for training machine learning algorithms.Findings:Our exploration reveals a scarcity of annotated corpora for event extraction tasks.In particular,the English corpora are mainly focused on the biomedical and general domains.Despite the issue of annotated corpora scarcity,there are several high-quality corpora available and widely used as benchmark datasets.However,access to some of these corpora might be limited owing to closed-access policies or discontinued maintenance after being initially released,rendering them inaccessible owing to broken links.Therefore,this study documents the available corpora for event extraction tasks.Research limitations:Our study focuses only on well-known corpora available in English and Chinese.Nevertheless,this study places a strong emphasis on the English corpora due to its status as a global lingua franca,making it widely understood compared to other languages.Practical implications:We genuinely believe that this study provides valuable knowledge that can serve as a guiding framework for preparing and accurately annotating events from text corpora.It provides comprehensive guidelines for researchers to improve the quality of corpus annotations,especially for event extraction tasks across various domains.Originality/value:This study comprehensively compiled information on the existing annotated corpora for event extraction tasks and provided preparation guidelines. 展开更多
关键词 Information extraction event extraction Text mining Large language model Natural language processing
下载PDF
Biomedical Event Extraction Using a New Error Detection Learning Approach Based on Neural Network 被引量:2
2
作者 Xiaolei Ma Yang Lu +2 位作者 Yinan Lu Zhili Pei Jichao Liu 《Computers, Materials & Continua》 SCIE EI 2020年第5期923-941,共19页
Supervised machine learning approaches are effective in text mining,but their success relies heavily on manually annotated corpora.However,there are limited numbers of annotated biomedical event corpora,and the availa... Supervised machine learning approaches are effective in text mining,but their success relies heavily on manually annotated corpora.However,there are limited numbers of annotated biomedical event corpora,and the available datasets contain insufficient examples for training classifiers;the common cure is to seek large amounts of training samples from unlabeled data,but such data sets often contain many mislabeled samples,which will degrade the performance of classifiers.Therefore,this study proposes a novel error data detection approach suitable for reducing noise in unlabeled biomedical event data.First,we construct the mislabeled dataset through error data analysis with the development dataset.The sample pairs’vector representations are then obtained by the means of sequence patterns and the joint model of convolutional neural network and long short-term memory recurrent neural network.Following this,the sample identification strategy is proposed,using error detection based on pair representation for unlabeled data.With the latter,the selected samples are added to enrich the training dataset and improve the classification performance.In the BioNLP Shared Task GENIA,the experiments results indicate that the proposed approach is competent in extract the biomedical event from biomedical literature.Our approach can effectively filter some noisy examples and build a satisfactory prediction model. 展开更多
关键词 Biomedical event extraction pair representation error data detection sample identification
下载PDF
Joint Event Extraction Based on Global Event-Type Guidance and Attention Enhancement 被引量:1
3
作者 Daojian Zeng Jian Tian +3 位作者 Ruoyao Peng Jianhua Dai Hui Gao Peng Peng 《Computers, Materials & Continua》 SCIE EI 2021年第9期4161-4173,共13页
Event extraction is one of the most challenging tasks in information extraction.It is a common phenomenon where multiple events exist in the same sentence.However,extracting multiple events is more difficult than extr... Event extraction is one of the most challenging tasks in information extraction.It is a common phenomenon where multiple events exist in the same sentence.However,extracting multiple events is more difficult than extracting a single event.Existing event extraction methods based on sequence models ignore the interrelated information between events because the sequence is too long.In addition,the current argument extraction relies on the results of syntactic dependency analysis,which is complicated and prone to error trans-mission.In order to solve the above problems,a joint event extraction method based on global event-type guidance and attention enhancement was proposed in this work.Specifically,for multiple event detection,we propose a global-type guidance method that can detect event types in the candidate sequence in advance to enhance the correlation information between events.For argument extraction,we converted it into a table-flling problem,and proposed a table-flling method of the attention mechanism,that is simple and can enhance the correlation between trigger words and arguments.The experimental results based on the ACE 2005 dataset showed that the proposed method achieved 1.6%improvement in the task of event detection,and obtained state-of-the-art results in the argument extraction task,which proved the effectiveness of the method. 展开更多
关键词 event extraction event-type guidance table flling attention mechanisms
下载PDF
Multi-Modal Military Event Extraction Based on Knowledge Fusion
4
作者 Yuyuan Xiang Yangli Jia +1 位作者 Xiangliang Zhang Zhenling Zhang 《Computers, Materials & Continua》 SCIE EI 2023年第10期97-114,共18页
Event extraction stands as a significant endeavor within the realm of information extraction,aspiring to automatically extract structured event information from vast volumes of unstructured text.Extracting event eleme... Event extraction stands as a significant endeavor within the realm of information extraction,aspiring to automatically extract structured event information from vast volumes of unstructured text.Extracting event elements from multi-modal data remains a challenging task due to the presence of a large number of images and overlapping event elements in the data.Although researchers have proposed various methods to accomplish this task,most existing event extraction models cannot address these challenges because they are only applicable to text scenarios.To solve the above issues,this paper proposes a multi-modal event extraction method based on knowledge fusion.Specifically,for event-type recognition,we use a meticulous pipeline approach that integrates multiple pre-trained models.This approach enables a more comprehensive capture of the multidimensional event semantic features present in military texts,thereby enhancing the interconnectedness of information between trigger words and events.For event element extraction,we propose a method for constructing a priori templates that combine event types with corresponding trigger words.This approach facilitates the acquisition of fine-grained input samples containing event trigger words,thus enabling the model to understand the semantic relationships between elements in greater depth.Furthermore,a fusion method for spatial mapping of textual event elements and image elements is proposed to reduce the category number overload and effectively achieve multi-modal knowledge fusion.The experimental results based on the CCKS 2022 dataset show that our method has achieved competitive results,with a comprehensive evaluation value F1-score of 53.4%for the model.These results validate the effectiveness of our method in extracting event elements from multi-modal data. 展开更多
关键词 event extraction MULTI-MODAL knowledge fusion pre-trained models
下载PDF
KeyEE:Enhancing Low-Resource Generative Event Extraction with Auxiliary Keyword Sub-Prompt
5
作者 Junwen Duan Xincheng Liao +1 位作者 Ying An Jianxin Wang 《Big Data Mining and Analytics》 EI CSCD 2024年第2期547-560,共14页
Event Extraction(EE)is a key task in information extraction,which requires high-quality annotated data that are often costly to obtain.Traditional classification-based methods suffer from low-resource scenarios due to... Event Extraction(EE)is a key task in information extraction,which requires high-quality annotated data that are often costly to obtain.Traditional classification-based methods suffer from low-resource scenarios due to the lack of label semantics and fine-grained annotations.While recent approaches have endeavored to address EE through a more data-efficient generative process,they often overlook event keywords,which are vital for EE.To tackle these challenges,we introduce KeyEE,a multi-prompt learning strategy that improves low-resource event extraction by Event Keywords Extraction(EKE).We suggest employing an auxiliary EKE sub-prompt and concurrently training both EE and EKE with a shared pre-trained language model.With the auxiliary sub-prompt,KeyEE learns event keywords knowledge implicitly,thereby reducing the dependence on annotated data.Furthermore,we investigate and analyze various EKE sub-prompt strategies to encourage further research in this area.Our experiments on benchmark datasets ACE2005 and ERE show that KeyEE achieves significant improvement in low-resource settings and sets new state-of-the-art results. 展开更多
关键词 natural language processing event extraction(EE) Multi-Prompt Learning(MPL) low-resource
原文传递
BUEES: a bottom-up event extraction system 被引量:3
6
作者 Xiao DING Bing QIN Ting LIU 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2015年第7期541-552,共12页
Traditional event extraction systems focus mainly on event type identification and event participant extraction based on pre-specified event type paradigms and manually annotated corpora. However, different domains ha... Traditional event extraction systems focus mainly on event type identification and event participant extraction based on pre-specified event type paradigms and manually annotated corpora. However, different domains have different event type paradigms. When transferring to a new domain, we have to build a new event type paradigm and annotate a new corpus from scratch. This kind of conventional event extraction system requires massive human effort, and hence prevents event extraction from being widely applicable. In this paper, we present BUEES, a bottom-up event extraction system, which extracts events from the web in a completely unsupervised way. The system automatically builds an event type paradigm in the input corpus, and then proceeds to extract a large number of instance patterns of these events. Subsequently, the system extracts event arguments according to these patterns. By conducting a series of experiments, we demonstrate the good performance of BUEES and compare it to a state-of-the-art Chinese event extraction system, i.e., a supervised event extraction system. Experimental results show that BUEES performs comparably (5% higher F-measure in event type identification and 3-0 higher F-measure in event argument extraction), but without any human effort. 展开更多
关键词 event extraction Unsupervised learning BOTTOM-UP
原文传递
Joint Entity and Event Extraction with Generative Adversarial Imitation Learning 被引量:11
7
作者 Tongtao Zhang Heng Ji Avirup Sil 《Data Intelligence》 2019年第2期99-120,共22页
We propose a new framework for entity and event extraction based on generative adversarial imitation learning-an inverse reinforcement learning method using a generative adversarial network(GAN).We assume that instanc... We propose a new framework for entity and event extraction based on generative adversarial imitation learning-an inverse reinforcement learning method using a generative adversarial network(GAN).We assume that instances and labels yield to various extents of difficulty and the gains and penalties(rewards)are expected to be diverse.We utilize discriminators to estimate proper rewards according to the difference between the labels committed by the ground-truth(expert)and the extractor(agent).Our experiments demonstrate that the proposed framework outperforms state-of-the-art methods. 展开更多
关键词 Information extraction event extraction Imitation learning Generative adversarial network
原文传递
Overview of CCKS 2020 Task 3: Named Entity Recognition and Event Extraction in Chinese Electronic Medical Records 被引量:7
8
作者 Xia Li Qinghua Wen +2 位作者 Hu Lin Zengtao Jiao Jiangtao Zhang 《Data Intelligence》 2021年第3期376-388,共13页
The China Conference on Knowledge Graph and Semantic Computing(CCKS)2020 Evaluation Task 3 presented clinical named entity recognition and event extraction for the Chinese electronic medical records.Two annotated data... The China Conference on Knowledge Graph and Semantic Computing(CCKS)2020 Evaluation Task 3 presented clinical named entity recognition and event extraction for the Chinese electronic medical records.Two annotated data sets and some other additional resources for these two subtasks were provided for participators.This evaluation competition attracted 354 teams and 46 of them successfully submitted the valid results.The pre-trained language models are widely applied in this evaluation task.Data argumentation and external resources are also helpful. 展开更多
关键词 Chinese electronic medical records event extraction Named entity recognition Clinical text CCKS
原文传递
A Joint Learning Framework for the CCKS-2020 Financial Event Extraction Task 被引量:6
9
作者 Jiawei Sheng Qian Li +6 位作者 Yiming Hei Shu Guo Bowen Yu Lihong Wang Min He Tingwen Liu Hongbo Xu 《Data Intelligence》 2021年第3期444-459,共16页
This paper presents a winning solution for the CCKS-2020 financial event extraction task, where the goal is to identify event types, triggers and arguments in sentences across multiple event types. In this task, we fo... This paper presents a winning solution for the CCKS-2020 financial event extraction task, where the goal is to identify event types, triggers and arguments in sentences across multiple event types. In this task, we focus on resolving two challenging problems(i.e., low resources and element overlapping) by proposing a joint learning framework, named SaltyFishes. We first formulate the event extraction task as a joint probability model. By sharing parameters in the model across different types, we can learn to adapt to low-resource events based on high-resource events. We further address the element overlapping problems by a mechanism of Conditional Layer Normalization, achieving even better extraction accuracy. The overall approach achieves an F1-score of 87.8% which ranks the first place in the competition. 展开更多
关键词 event detection event extraction Joint learning Financial event
原文传递
An Ensemble Approach for Emotion Cause Detection with Event Extraction and Multi-Kernel SVMs 被引量:7
10
作者 Ruifeng Xu Jiannan Hu +2 位作者 Qin Lu Dongyin Wu Lin Gui 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2017年第6期646-659,共14页
In this paper, we present a new challenging task for emotion analysis, namely emotion cause extraction.In this task, we focus on the detection of emotion cause a.k.a the reason or the stimulant of an emotion, rather t... In this paper, we present a new challenging task for emotion analysis, namely emotion cause extraction.In this task, we focus on the detection of emotion cause a.k.a the reason or the stimulant of an emotion, rather than the regular emotion classification or emotion component extraction. Since there is no open dataset for this task available, we first designed and annotated an emotion cause dataset which follows the scheme of W3 C Emotion Markup Language. We then present an emotion cause detection method by using event extraction framework,where a tree structure-based representation method is used to represent the events. Since the distribution of events is imbalanced in the training data, we propose an under-sampling-based bagging algorithm to solve this problem. Even with a limited training set, the proposed approach may still extract sufficient features for analysis by a bagging of multi-kernel based SVMs method. Evaluations show that our approach achieves an F-measure 7.04%higher than the state-of-the-art methods. 展开更多
关键词 emotion cause detection event extraction multi-kernel SVMs bagging
原文传递
A Prior Information Enhanced Extraction Framework for Document-level Financial Event Extraction 被引量:1
11
作者 Haitao Wang Tong Zhu +2 位作者 Mingtao Wang Guoliang Zhang Wenliang Chen 《Data Intelligence》 2021年第3期460-476,共17页
Document-level financial event extraction(DFEE) is the task of detecting events and extracting the corresponding event arguments in financial documents, which plays an important role in information extraction in the f... Document-level financial event extraction(DFEE) is the task of detecting events and extracting the corresponding event arguments in financial documents, which plays an important role in information extraction in the financial domain. This task is challenging as the financial documents are generally long text and event arguments of one event may be scattered in different sentences. To address this issue, we proposed a novel Prior Information Enhanced Extraction framework(PIEE) for DFEE, leveraging prior information from both event types and pre-trained language models. Specifically, PIEE consists of three components: event detection, event argument extraction, and event table filling. In event detection, we identify the event type. Then, the event type is explicitly used for event argument extraction. Meanwhile, the implicit information within language models also provides considerable cues for event arguments localization. Finally, all the event arguments are filled in an event table by a set of predefined heuristic rules. To demonstrate the effectiveness of our proposed framework, we participated in the share task of CCKS2020 Task 4-2: Documentlevel Event Arguments Extraction. On both Leaderboard A and Leaderboard B, PIEE took the first place and significantly outperformed the other systems. 展开更多
关键词 event extraction Information extraction Financial event event detection event argument extraction
原文传递
A Multi-event Extraction Model for Nursing Records
12
作者 Ruoyu Song Lan Wei Yuhang Guo 《国际计算机前沿大会会议论文集》 2022年第2期146-158,共13页
Nursing records contain information on patients’treatment processes,which reflect the changes in patients’conditions and have legal effects.However,some of the written records of intensive care unit(ICU)nurses are i... Nursing records contain information on patients’treatment processes,which reflect the changes in patients’conditions and have legal effects.However,some of the written records of intensive care unit(ICU)nurses are incomplete according to our observations.This paper proposes an approach extracting structured nursing events from unstructured nursing records for detecting the missing items automatically.According to the PIO(problem,intervention,outcome)principle in the field of medical care,we propose event schemas for nursing records and annotate a Chinese nursing event extraction dataset(CNEED)on ICU nursing records.We find that several events may occur in a nursing record.Therefore,we present a multi-event extraction model for the nursing records.The experimental results demonstrate that our model achieves good results on CNEED and outperforms competitive methods on the multi-event argument attribution problem.By observing the results of automatic event extraction by our model,we detect missing items in the existing nursing records.This proves that our model can be used to help nurses check and improve the method of recording nursing processes. 展开更多
关键词 event extraction Nursing records Multi-event
原文传递
A survey on neural relation extraction 被引量:12
13
作者 LIU Kang 《Science China(Technological Sciences)》 SCIE EI CAS CSCD 2020年第10期1971-1989,共19页
Relation extraction is a key task for knowledge graph construction and natural language processing,which aims to extract meaningful relational information between entities from plain texts.With the development of deep... Relation extraction is a key task for knowledge graph construction and natural language processing,which aims to extract meaningful relational information between entities from plain texts.With the development of deep learning,many neural relation extraction models were proposed recently.This paper introduces a survey on the task of neural relation extraction,including task description,widely used evaluation datasets,metrics,typical methods,challenges and recent research progresses.We mainly focus on four recent research problems:(1)how to learn the semantic representations from the given sentences for the target relation,(2)how to train a neural relation extraction model based on insufficient labeled instances,(3)how to extract relations across sentences or in a document and(4)how to jointly extract relations and corresponding entities?Finally,we give out our conclusion and future research issues. 展开更多
关键词 knowledge graph relation extraction event extraction and information extraction
原文传递
Event Type Recognition Based on Trigger Expansion 被引量:7
14
作者 秦兵 赵妍妍 +2 位作者 丁效 刘挺 翟国富 《Tsinghua Science and Technology》 SCIE EI CAS 2010年第3期251-258,共8页
Event extraction is an important research point in information extraction, which includes two important sub-tasks of event type recognition and event argument recognition. This paper describes a method based on automa... Event extraction is an important research point in information extraction, which includes two important sub-tasks of event type recognition and event argument recognition. This paper describes a method based on automatic expansion of the event triggers for event type recognition. The event triggers are first extended through a thesaurus to enable the extraction of the candidate events and their candidate types. Then, a binary classification method is used to recognize the candidate event types. This method effectively improves the unbalanced data problem in training models and the data sparseness problem with a small corpus. Evaluations on the ACE2005 dataset give a final F-score of 61.24%, which outperforms traditional methods based on pure machine learning. 展开更多
关键词 event extraction event type recognition event trigger
原文传递
Cross-Context News Corpus for Protest Event-Related Knowledge Base Construction
15
作者 Ali Hürriyetoglu Erdem Yörük +4 位作者 Osman Mutlu Fırat Durusan ÇagrıYoltar Deniz Yüret Burak Gürel 《Data Intelligence》 2021年第2期308-335,共28页
We describe a gold standard corpus of protest events that comprise various local and international English language sources from various countries.The corpus contains document-,sentence-,and token-level annotations.Th... We describe a gold standard corpus of protest events that comprise various local and international English language sources from various countries.The corpus contains document-,sentence-,and token-level annotations.This corpus facilitates creating machine learning models that automatically classify news articles and extract protest event-related information,constructing knowledge bases that enable comparative social and political science studies.For each news source,the annotation starts with random samples of news articles and continues with samples drawn using active learning.Each batch of samples is annotated by two social and political scientists,adjudicated by an annotation supervisor,and improved by identifying annotation errors semi-automatically.We found that the corpus possesses the variety and quality that are necessary to develop and benchmark text classification and event extraction systems in a cross-context setting,contributing to the generalizability and robustness of automated text processing systems.This corpus and the reported results will establish a common foundation in automated protest event collection studies,which is currently lacking in the literature. 展开更多
关键词 event extraction Text classification Political science Social science NEWS Contentious politics PROTESTS event coreference resolution
原文传递
The Construction of Case Event Logic Graph for Judgment Documents
16
作者 Congyao Zhang Shiping Tang 《国际计算机前沿大会会议论文集》 2021年第1期209-217,共9页
The construction of a case event logic graph for the judgment documentcan more intuitively retrospect the development of the case. This paperproposes a joint model of event extraction and relationship recognition for ... The construction of a case event logic graph for the judgment documentcan more intuitively retrospect the development of the case. This paperproposes a joint model of event extraction and relationship recognition for judgmentdocuments. By extracting the case information in the judgment document,a case event logic graph was constructed. The development process of the casewas shown, and a reference was provided for the analysis of the context of thecase. The experimental results show that the proposed method can extract eventsand identify the relationship between events, and the F1 value reaches 0.809. Thecase event logic graph reveals the development context of the case accurately andvividly. 展开更多
关键词 event logic graph Judgment document event extraction Relationship recognition
原文传递
Identifying associations between epidemiological entities in news data for animal disease surveillance 被引量:1
17
作者 Sarah Valentin Renaud Lancelot Mathieu Roche 《Artificial Intelligence in Agriculture》 2021年第1期163-174,共12页
Event-based surveillance systems are at the crossroads of human and animal(and plant and ecosystem)health,epidemiology,statistics,and informatics.Thus,their deployment faces many challenges specific to each domain and... Event-based surveillance systems are at the crossroads of human and animal(and plant and ecosystem)health,epidemiology,statistics,and informatics.Thus,their deployment faces many challenges specific to each domain and their intersections,such as relations among automation,artificial intelligence,and expertise.In this context,ourwork pertins to the extraction of epidemiological events in textual data(i.e.news)by unsupervised methods.We define the event extraction task as detecting pairs of epidemiological entities(e.g.a disease name and location).The quality of the ranked lists of pairs was evaluated using specific ranking evaluation metrics.We used a publicly available annotated corpus of 438 documents(i.e.news articles)related to animal disease events.The statistical approach was able to detect event-related pairs of epidemiological features with a good trade-off between precision and recall.Our results showed that using a window of words outperformed document-based and sentence-based approaches,while reducing the probability of detecting false pairs.Our results indicated that Mutual Information was less adapted than the Dice coefficient for ranking pairs of features in the event extraction framework.We believe that Mutual Information would be more relevant for rare pair detection(i.e.weak signals),but requires higher manual curation to avoid false positive extraction pairs.Moreover,generalising the country-level spatial features enabled better discrimination(i.e.ranking)of relevant disease-location pairs for event extraction. 展开更多
关键词 Animal disease surveillance Text mining event extraction
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部