期刊文献+

A CONDITIONAL RANDOM FIELDS APPROACH TO BIOMEDICAL NAMED ENTITY RECOGNITION 被引量:3

A CONDITIONAL RANDOM FIELDS APPROACH TO BIOMEDICAL NAMED ENTITY RECOGNITION
下载PDF
导出
摘要 Named entity recognition is a fundamental task in biomedical data mining. In this letter, a named entity recognition system based on CRFs (Conditional Random Fields) for biomedical texts is presented. The system makes extensive use of a diverse set of features, including local features, full text features and external resource features. All features incorporated in this system are described in detail, and the impacts of different feature sets on the performance of the system are evaluated. In order to improve the performance of system, post-processing modules are exploited to deal with the abbrevia- tion phenomena, cascaded named entity and boundary errors identification. Evaluation on this system proved that the feature selection has important impact on the system performance, and the post-processing explored has an important contribution on system performance to achieve better re- sults. Named entity recognition is a fundamental task in biomedical data mining. In this letter, a named entity recognition system based on CRFs (Conditional Random Fields) for biomedical texts is presented. The system makes extensive use of a diverse set of features, including local features, full text features and external resource features. All features incorporated in this system are described in detail, and the impacts of different feature sets on the performance of the system are evaluated. In order to improve the performance of system, post-processing modules are exploited to deal with the abbreviation phenomena, cascaded named entity and boundary errors identification. Evaluation on this system proved that the feature selection has important impact on the system performance, and the post-processing explored has an important contribution on system performance to achieve better resuits.
出处 《Journal of Electronics(China)》 2007年第6期838-844,共7页 电子科学学刊(英文版)
基金 Supported by The National Natural Science Foundation of China(No.60302021).
关键词 计算机技术 设计方案 网络技术 随机函数 Conditional Random Fields (CRFs) Named entity recognition Feature selection Post-processing
  • 相关文献

参考文献7

  • 1J. Lafferty,A. McCallum,and F. Pereira.Conditional random fields: Probabilistic models for segmenting and labeling sequence data[].Proc of the th Inter- national Conference on Machine Learning.2001
  • 2S. Kulick,A. Bies,and M. Liberman, et al.Integrated annotation for biomedical information extraction[].NAACL/HLT Workshop on Linking Biological Lit- erature Ontologies and Databases: Tools for Users.2004
  • 3A. S. Schwartz,and M. A. Hearst.A simple algorithm for identifying abbreviation definitions in biomedical text[].Proc of the Pacific Symposium on Biocomputing.2003
  • 4Guodong Zhou,and Jian Su.Exploring deep knowl- edge resources in biomedical name recognition[].Proc of the Joint Workshop on Natural Language Proc- essing in Biomedicine and Its Applications.2004
  • 5J. Finkel,S. Dingare,and H. Nguyen, et al.Exploiting context for biomedical entity recognition: from syntax to the Web[].Proc of the Joint Workshop on Natural Language Processing in Biomedicine and Its Appli- cations.2004
  • 6S. Burr.Biomedical named entity recognition using conditional random fields and novel feature sets[].Proc of the Joint Workshop on Natural Language Proc- essing in Biomedicine and Its Applications.2004
  • 7Tsuruoka Y,Tateishi Y,Kim J-D , et al.Developing a robust part-of-speech tagger for biomedical text[].Panhellenic Conference on Informatics.2005

同被引文献6

引证文献3

二级引证文献27

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部