A Multiple Feature Approach for Disorder Normalization in Clinical Notes

Abstract: In this paper we propose a multiple feature approach for the normalization task, which maps each disorder mention in the text to a unique Unified Medical Language System (UMLS) concept unique identifier (CUI). We develop a two-step method: first, acquire a list of candidate CUIs and their associated preferred names using the UMLS API; second, choose the closest CUI by calculating the similarity between the input disorder mention and each candidate. The similarity calculation step is formulated as a classification problem, and multiple features (string features, ranking features, similarity features, and contextual features) are used to normalize the disorder mentions. The results show that the multiple feature approach improves the accuracy of the normalization task from 32.99% with the MetaMap baseline to 67.08%.
Source: Wuhan University Journal of Natural Sciences (武汉大学学报(自然科学英文版)), 2016, No. 6, pp. 482-490 (9 pages). Indexed in CAS, CSCD.
Funding: Supported by the National Natural Science Foundation of China (61133012, 61202193, 61373108), the Major Projects of the National Social Science Foundation of China (11&ZD189), the Chinese Postdoctoral Science Foundation (2013M540593, 2014T70722), and the Open Foundation of Shandong Key Laboratory of Language Resource Development and Application.
Keywords: natural language processing; disorder normalization; Levenshtein distance; semantic composition; multiple features
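
The second step described in the abstract (scoring each candidate against the mention and keeping the closest one) can be illustrated with a minimal sketch. It is not the paper's classifier: it uses a single string feature, a length-normalized Levenshtein similarity, and simply takes the highest-scoring candidate. The function names, the normalization by the longer string, and the candidate list are assumptions made for illustration; the candidate identifiers are placeholders rather than real CUIs, and no UMLS API call is made.

```python
from typing import List, Tuple


def levenshtein(a: str, b: str) -> int:
    """Edit distance (insert, delete, substitute) between two strings."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def string_similarity(mention: str, name: str) -> float:
    """Similarity in [0, 1]; the normalization is an assumption of this sketch."""
    m, n = mention.lower().strip(), name.lower().strip()
    longest = max(len(m), len(n))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(m, n) / longest


def pick_closest(mention: str,
                 candidates: List[Tuple[str, str]]) -> Tuple[str, str]:
    """Return the (identifier, preferred name) pair most similar to the mention."""
    return max(candidates, key=lambda c: string_similarity(mention, c[1]))


# Hypothetical candidates; in the paper these come from the UMLS API.
candidates = [
    ("PLACEHOLDER-1", "heart attack"),
    ("PLACEHOLDER-2", "heart failure"),
]
best = pick_closest("Heart Attack", candidates)
print(best[0], round(string_similarity("Heart Attack", best[1]), 2))
```

In the approach the abstract describes, a score like this would be only one of several features (string, ranking, similarity, and contextual) passed to a classifier, rather than the sole selection criterion.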