期刊文献+

利用Text-CNN改进PubMedBERT在化学诱导性疾病实体关系分类效果的尝试 被引量:1

Improving PubMedBERT for CID-Entity-Relation Classification Using Text-CNN
原文传递
导出
摘要 【目的】改进PubMedBERT在化学诱导性疾病(CID)实体关系分类的效果。【方法】提出一种基于PubMedBERT并结合Text-CNN的实体关系分类方法。该方法以实体对和文本组成句子对进行输入,利用PubMedBERT预训练模型对化学诱导性疾病相关文本进行编码获取全局特征,通过Text-CNN捕捉文本局部重要信息,判断实体对是否具有CID关系。【结果】在BioCreative V CDR数据集中,该方法的精确率、召回率和F1值分别达到78.3%、73.5%和75.8%,较其他方法最少提升了3.1%、1.5%和3.3%。【局限】仅考虑了化学诱导性疾病文本语料,在临床等其他语料上的效果有待检验。【结论】该方法能够捕捉化学诱导性疾病文本特征,提升实体关系分类的效果。 [Objective] This paper tries to improve the performance of PubMedBERT for CID entity relation classification. [Methods] We proposed a classification model based on PubMedBERT, which was also fine-tuned by Text-CNN. Then, we input entity pairs and sentence pairs to the model. Third, we used PubMedBERT to encode CID texts and obtained their global features. Finally, we captured important local information from the global features with Text-CNN to decide whether entity pairs have CID relation. [Results] The precision, recall and F1 value of this method on the BioCreative V CDR dataset reached 78.3%, 73.5% and 75.8% respectively,which were at least 3.1%, 1.5% and 3.3% higher than other methods. [Limitations] This model only examines CID texts, and more research is needed to evaluate its performance on clinical data or corpus of other domains.[Conclusions] This method can capture the features of CID texts and improve their entity relation classification.
作者 董淼 苏中琪 周晓北 兰雪 崔志刚 崔雷 Dong Miao;Su Zhongqi;Zhou Xiaobei;Lan Xue;Cui Zhigang;Cui Lei(Financial Section,China Medical University,Shenyang 110122,China;China Medical University Library,Shenyang 110122,China;Institute of Health Sciences,China Medical University,Shenyang 110122,China;School of Health Management,China Medical University,Shenyang 110122,China;Nursing School,China Medical University,Shenyang 110122,China)
出处 《数据分析与知识发现》 CSSCI CSCD 北大核心 2021年第11期145-152,共8页 Data Analysis and Knowledge Discovery
关键词 CID实体关系分类 PubMedBERT Text-CNN 句子对 CID Entity Relation Classification PubMedBERT Text-CNN Sentence Pair
  • 相关文献

参考文献7

二级参考文献110

  • 1崔建梅,尹大力.药物重新定位策略在新药发现中的应用与进展[J].中国药学杂志,2005,40(20):1524-1526. 被引量:7
  • 2Cohen K B, Hunter L. Getting Started in Text Mining [ J ]. PLoS Computational Biology, 2008,4 (1) :e20.
  • 3Barbosa - Silva A, Soldatos T G, Magalhaes I L F, et al. LAITOR - Literature Assistant for Identification of Terms co - Occurrences and Relationships [ J ]. BMC Bioinformatics, 2010,11 ( 1 ) : 70 - 79.
  • 4Lee S, Lee K H, Song M, et al. Building the Process - drug - side Effect Network to Discover the Relationship Between Biological Processes and Side Effects [ J ]. BMC Bioinformatics,2011,12 : $2. doi : 10.1156/1471 - 2105 - 12 - $2 - $2.
  • 5Saetre R, Yoshida K, Miwa M, et al. Extracting Protein Interactions from Text with the Unified AkaneRE Event Extraction System[ J]. IEEE- ACM Transaction on Computational Biology and Bioinfor- matics, 2010,7(3) : 442 -453.
  • 6Garten Y, Ahman R B. Pharmspresso : A Text Mining Tool for Ex- traction of Pharmacogenomic Concepts and Relationships from Full Text [ J ]. BMC B ioinformatics, 2009,10 : $6. doi : 10. 1186/1471 - 2105 - 10 - $2 - $6.
  • 7Li J, Zhu X, Chen J Y. Building Disease - specific Drug - pro- tein Connectivity Maps from Molecular Interaction Networks and Pubmed Abstracts [ J] . PLoS Computational Biology,2009,5 (7) :e1000450.
  • 8Fundel K, Kuffner R, Zimmer R. RelEx--Relation Extraction Using Dependency Parse Trees[ J]. Bioinformatics, 2007,23 ( 3 ) : 365 - 371.
  • 9Friedman C, Kra P, Yu H, et al. GENIES : A Natural - language Processing System for the Extraction of Molecular Pathways from Journal Articles [ J ]. Bioinformatics, 2001,17 ( S1 ) : $74 - $82.
  • 10McDonald D M, Chen H, Su H, et al. Extracting Gene Pathway Relations Using a Hybrid Grammar: The Arizona Relation Parser [ J ]. Bioinformatics, 2004,20 ( 18 ) :3370 - 3378.

共引文献65

同被引文献6

引证文献1

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部