期刊文献+

基于ALBERT的中文医疗病历命名实体识别 被引量:6

ALBERT-Based Named Entity Recognition of Chinese Medical Records
下载PDF
导出
摘要 医疗病历命名实体识别的主要任务是将临床电子病历中的非结构化文本转化为结构化数据,进而为面向医疗领域任务开展的数据挖掘提供基础支撑.提出一种基于ALBERT模型融合学习的中文医疗病历命名实体识别模型.首先,采用人工标注方式扩展样本数据集,结合ALBERT模型对数据集进行微调;其次,采用双向长短记忆网络(BiLSTM)提取文本的全局特征;最后,基于条件随机场模型(CRF)命名实体的序列标记.在标准数据集上的实验结果表明,该方法进一步提高了医疗文本命名识别精度,减少了时间开销. The main task of named entity recognition on medical record is to convert unstructured text into structured data,and then provide an important fundamental support for data mining for medical field tasks.This paper proposes a named entity recognition method for Chinese medical records based on ALBERT and fusion model.Firstly,we use manual labeling to expand the sample dataset,and fine-tune the dataset in conjunction with the ALBERT.Secondly,the Bi-directional Long Short-Term Memory(BiLSTM)is used to extract the global features of the text.Finally,on the basis of the conditional random field model(CRF),sequence tags for named entities are made.The experimental results on the standard dataset show that the proposed method further improves the accuracy of name entity recognition on medical text and greatly reduces the time overhead.
作者 陈杰 奚雪峰 皮洲 盛胜利 崔志明 Chen Jie;Xi Xuefeng;Pi Zhou;Victor S Sheng;Cui Zhiming(School of Electronic and Computer Engineering,Suzhou University of Science and Technology,Suzhou 215009,China;Suzhou Smart City Research Institute,Suzhou 215009,China;Computer Science Department,Texas Tech University,Texas 79431,USA)
出处 《南京师范大学学报(工程技术版)》 CAS 2021年第1期36-43,共8页 Journal of Nanjing Normal University(Engineering and Technology Edition)
基金 国家自然科学基金项目(61673290、61876217) 江苏省“六大人才高峰”高层次人才项目(XYDXX-086) 苏州市科技发展计划产业前瞻性项目(SYG201817)、2020年江苏省研究生科研创新计划项目(KYCX20_2762).
关键词 ALBERT 命名实体识别 电子医疗病历 双向长短记忆网络 条件随机场 ALBERT named entity recognition clinical electronic medical records BiLSTM CRF
  • 相关文献

参考文献3

二级参考文献21

  • 1Huang Fei, Vogel S, Waibel A. Automatic extraction of named entity translingual equivalence based on multi-feature cost minimization//Proceedings of the 2003 Annual Confer- ence of the ACL, Workshop on Multilingual and Mixed-lan- guage Named Entity Recognition. Sapporo, Japan, 2003: 184-192.
  • 2Al-Onaizan Y, Knight K. Translating named entities using monolingual and bilingual resources//Proceedings of the 40th Annual Meeting of the Association for Computational Lin- guistics (ACL). Philadelphia, PA, USA, 2002:400 -408.
  • 3Feng Donghui, Lv Yajuan, Zhou Ming. A new approach for English Chinese named entity alignment//Proceedings of the Conference on Empirical Methods in Natural Language Pro cessing (EMNLP 2004). Barcelona, 2004 : 372-379.
  • 4Lee Chun-Jen, Chang Jason S, Jang Jyh-Shing R. Alignment of bilingual named entities in parallel corpora using statistical models and multiple knowledge sources. ACM Transactions on Asian Language Information Processing (TAMP), 2006, 5(2) : 121-145.
  • 5Moore R C. Learning translations of named-entity phrases from parallel corpora//Proceedings of lOth Conference of the European Chapter of ACL. Budapest, Hungary, 2003: 456- 464.
  • 6Krishman Vijay, Manning Christopher D. An effective two- stage model for exploiting non-local dependencies in named entity recognition//Proceedings of the 44th Annual Meeting of ACL. Sydney, 2006:1121-1128.
  • 7Ji Heng, Grishman Ralph. Collaborative entity extraction and translation//Proceedings of the International Conference on Recent Advances in Natural Language Processing. Borovets, Bulgaria, 2007:281-238.
  • 8Chen Hsin-His, Yang Changhua, Lin Ying. Learning formu- lation and transformation rules for multilingual named enti- ties//Proceedings of the ACL 2003 Workshop on Multilingual and Mixed-language Named Entity Recognition. Sapporo, Japan, 2003:1-8.
  • 9Berger Adam L, Della Pietra Stephen A, Della Pietra Vin- cent J. A maximum entropy approach to natural language processing. Computational Linguistics, 1996, 22(1) : 39- 72.
  • 10Och Franz loser, Ney Hermann. Discriminative training and maximum entropy models for statistical machine transla- tion//Proceedings of the 40th Annual Meeting of the ACL. Philadelphia, PA, USA, 2002: 295-302.

共引文献187

同被引文献54

引证文献6

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部