A study on the named entity recognition of Chinese electronic medical record based on combination of CRF and rules (cited by: 12)
Abstract Objective: To explore named entity recognition in Chinese electronic medical records based on a combination of conditional random fields (CRF) and rules. Methods: Entities are recognized by combining a CRF with rules. Language, keyword, and dictionary information, among others, serve as recognition features, and the recognition results are then refined using rules. Results: Compared with the CRF-only method, the combined CRF-and-rules method raises precision to 78.98%, and raises recall and the F value to 88.37% and 83.41%, respectively. Conclusion: Recognizing entities with the combined CRF-and-rules method yields precision and recall that meet application requirements and lays a foundation for follow-up research on electronic medical records.
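The three reported figures can be checked against one another. The sketch below assumes that the F value is the balanced F-measure (the harmonic mean of precision and recall) and that the reported 78.98% rate is precision; the abstract states neither explicitly, and the name `f_value` is illustrative rather than taken from the paper.

```python
def f_value(precision: float, recall: float) -> float:
    """Balanced F-measure: harmonic mean of precision and recall.

    Assumption: the abstract's "F value" is this balanced form (beta = 1).
    """
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both inputs are zero
    return 2 * precision * recall / (precision + recall)


# Figures reported in the abstract, in percent.
reported_precision = 78.98
reported_recall = 88.37

print(round(f_value(reported_precision, reported_recall), 2))  # 83.41
```

Under these assumptions the computed value matches the reported F value of 83.41%, which supports reading the 78.98% figure as precision.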
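Affiliation: Bengbu Medical College
Source: Journal of Baotou Medical College (《包头医学院学报》), CAS, 2017, No. 11, pp. 124-125, 130 (3 pages)
Funding: Anhui Provincial University Natural Science General Project (KJ2015B076by); Anhui Provincial Quality Engineering Project (2016mooc256); Anhui University Humanities and Social Sciences Key Project (SK2017A0182); Bengbu Medical College Natural Science Foundation General Project (BYKY1659)
Keywords: Named entity recognition; Conditional random field; Rules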