期刊文献+

A Hybrid Method of Coreference Resolution in Information Security 被引量:1

下载PDF
导出
摘要 In the field of information security,a gap exists in the study of coreference resolution of entities.A hybrid method is proposed to solve the problem of coreference resolution in information security.The work consists of two parts:the first extracts all candidates(including noun phrases,pronouns,entities,and nested phrases)from a given document and classifies them;the second is coreference resolution of the selected candidates.In the first part,a method combining rules with a deep learning model(Dictionary BiLSTM-Attention-CRF,or DBAC)is proposed to extract all candidates in the text and classify them.In the DBAC model,the domain dictionary matching mechanism is introduced,and new features of words and their contexts are obtained according to the domain dictionary.In this way,full use can be made of the entities and entity-type information contained in the domain dictionary,which can help solve the recognition problem of both rare and long entities.In the second part,candidates are divided into pronoun candidates and noun phrase candidates according to the part of speech,and the coreference resolution of pronoun candidates is solved by making rules and coreference resolution of noun phrase candidates by machine learning.Finally,a dataset is created with which to evaluate our methods using information security data.The experimental results show that the proposed model exhibits better performance than the other baseline models.
出处 《Computers, Materials & Continua》 SCIE EI 2020年第8期1297-1315,共19页 计算机、材料和连续体(英文)
基金 This work was supported by the National Natural Science Foundation of China(grant no.61602515).
  • 相关文献

参考文献2

二级参考文献13

  • 1陈凯江 刘秉伟 黄萱菁 等.基于隐马尔可夫模型的实体名识别[A]..见:863计划智能计算机主题学术会议论文集[C].北京:清华大学出版社,2001.443~453.
  • 2N A Chinichor. Overview of MUC-7/MET-2. In: Proc of the 7th Message Understanding Cord (MUC-7). San Francisco: Morgan Kaufmann Publishers, 1998.
  • 3C Cardie, K Wagstaff. Noun phrase coreference as clustering. In:Proc of the Joint Cod on Empirical Methods in NLP and Very Large Corpora. Maryland: University of Maryland, USA, 1999.82~ 89.
  • 4W M Soon, H T Ng, C Y Lim. Corpus-based learning for noun phrase oonference resolution. In: Proc of the Joint Conf on Empirical Methods in NLP and Very Large Corpora. Maryland: University of Maryland, USA, 1999. 285~291.
  • 5R Mitkov. Anaphora resolution: The state of the art. Proc of the COLING'98/ACL'98, Wolverhampton, 1999.
  • 6J C Reynar, A Ratnaparkhi. A maximum entropy approach to identifying sentence boundaries. In: The 5th Cord on Applied Natural Language Processing. San Francisco: Morgan Kaufmann Publishers, 1997.
  • 7A Ratnaaparkhi. A maximum entropy part-of-speech tagger. The Empirical Methods in Natural Language Processing Conf, PA,USA, 1996.
  • 8D Lin. Principle-based parsing without overgeneration. ACL -93, Columbus, Ohio, USA, 1993.
  • 9G A Miller.Wordnet: A lexical database for English. Comm ACM, 1995, 38(11): 39~41.
  • 10D Hays. Dependency theory: A formalism and some observations. Language, 1964, 40:511~525.

共引文献45

同被引文献6

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部