期刊文献+

基于条件随机场的医药领域症状信息抽取 被引量:11

Information extraction of symptoms in the medical field based on CRF
原文传递
导出
摘要 为了实现中文医药领域症状信息的自动化抽取,提出了一种基于条件随机场的拆分症状文本特征的抽取方法,将症状文本自动识别拆分为症状主体和症状表现形式自动识别两个部分,再将这两种识别结果作为特征加入到症状信息抽取过程中。信息抽取的结果包含完整的症状信息二元组:症状主体和症状表现形式。实验表明,该方法在症状信息抽取的准确率及查全率上有较大提升。 In order to achieve the automated extraction of Chinese language symptom information, we put forward an extraction approach of splitting text features of symptoms based on CRF. The approach splits symptom tests into symptom subject recognition and symptom manifestation recognition, and then uses the results in the extraction of symptoms. The final results include complete two-tuples: symptom subject, symptom manifestation. Our experi- ments show that this approach can achieve a higher recognition rate than other methods.
出处 《北京化工大学学报(自然科学版)》 CAS CSCD 北大核心 2016年第1期98-103,共6页 Journal of Beijing University of Chemical Technology(Natural Science Edition)
基金 国家科技支撑计划(2012BAH88F02)
关键词 命名实体识别 条件随机场 隐马尔科夫模型 症状抽取 named entity recognition conditional random field (CRF) hidden Markov model (HMM) symptomextraction
  • 相关文献

参考文献8

  • 1Lei J]B, Tang B Z, Lu X Q, et al. A comprehensive study of named entity recognition in Chinese clinical text [J]. Journal of the American Medical Informatics Associ- ation, 2014, 21(5) : 808-814.
  • 2刘凯,周雪忠,于剑,张润顺.基于条件随机场的中医临床病历命名实体抽取[J].计算机工程,2014,40(9):312-316. 被引量:27
  • 3Wang Y Q, Liu Y G, Yu Z H, et al. A preliminary work on symptom name recognition from free text clinical re- cords of traditional Chinese medicine using conditional random fields and reasonable features [C ]//Proceedings of the 2012 Workshop on Biomedical Natural Language Processing, Stroudsburg, USA: Association for Computa- tional Linguistics, 2012 : 223-230.
  • 4Wang Y Q, Yu Z H, Chen L, et al. Supervised methods for symptom name recognition in free text clinical records of traditional Chinese medicine: an empirical study [J]. Journal of Biomedical Informatics, 2014, 47: 91-104.
  • 5王世昆,李绍滋,陈彤生.基于条件随机场的中医命名实体识别[J].厦门大学学报(自然科学版),2009,48(3):359-364. 被引量:35
  • 6叶枫,陈莺莺,周根贵,李昊旻,李莹.电子病历中命名实体的智能识别[J].中国生物医学工程学报,2011,30(2):256-262. 被引量:47
  • 7Lafferty J]D, McCallum A, Pereira F C N. Conditional random fields : probabilistic models for segmenting and la- beling sequence data[C~//Proceedings of the Eighteenth International Conference on Machine Learning, San Fran- cisco, USA: Morgan Kaufmann Publishers Inc, 2001: 282-289.
  • 8Rabiner L R. A tutorial on hidden markov models and se- lected applications in speech recognition[M ]. San Fran- cisco, USA: Morgan Kaufmann Publishers Inc, 1990.

二级参考文献36

  • 1俞鸿魁,张华平,刘群,吕学强,施水才.基于层叠隐马尔可夫模型的中文命名实体识别[J].通信学报,2006,27(2):87-94. 被引量:151
  • 2Burr Settles. Biomedical named entity recognition using conditional random fields and rich feature sets[C]//Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and Its Applications. Geneva, Switzerland ; COLING, 2004 : 104 -- 107.
  • 3Hieuxuan. FlexCRFs, flexible conditional random fields [EB/OL]. http,//www, jaist, ae. jp. html.
  • 4中国科学院计算技术研究所.汉语词法分析工具ICT-CLAS[EB/0L].http://www.nlp.org.cn/.
  • 5Zhang Leo Maximum entropy modeling toolkit for python and C+ + [EB/OL]. 2007-07. http:Hhomepages, inf. ed. ac. uk/s0450736/maxent_toolkit, html.
  • 6Chang Chihchung, Lin Chihjen. LIBSVM -- a library for support vector machines[EB/OL], http://www, csie.ntu. edu. tw/-cjlin/libsvm.
  • 7Doan A,Naughton JF,Ramakrishnan R,et al.Information extraction challenges in managing unstructured data[J].ACM SIGMOD Record,2008,37(4):14-20.
  • 8Vlachos A,Gasperin C.Bootstrapping and evaluating named entity recognition in the biomedical domain[C]//Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology.New York:Association for Computational Linguistics Morristown,2006:138-145.
  • 9Bundschus M,Dejori M,Stetter M,et al.Extraction of semantic biomedical relations from text using conditional random fields[J].BMC Bioinformatics,2008,9:207.
  • 10Leaman R,Gonzalez GR.BANNER:An executable survey of advances in biomedical named entity recognition[C]//Proceedings of Pacific Symposium on Biocomputing.Hawaii:World Scientific Publishing Co.Pte.Ltd,2008:652-663.

共引文献91

同被引文献120

引证文献11

二级引证文献71

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部