Gene Named Entity Recognition Based on Entity Dictionary and Machine Learning (cited by: 4)
Abstract: This article introduces an entity dictionary into a machine learning model in the form of features and proposes a gene named entity recognition method based on an entity dictionary and machine learning. Experiments are carried out on the GENIA 3.02 corpus. The test results indicate that, once the entity dictionary features are introduced, the method achieves a relatively high entity recognition accuracy while also optimizing the time complexity of the CRFs recognition model and improving the system's recognition efficiency.
Source: Journal of Medical Informatics (《医学信息学杂志》), CAS, 2015, No. 12, pp. 54-60 (7 pages)
Funding: National Science and Technology Support Program project (project No. 2011BAH10B05)
Keywords: Entity dictionary; Machine learning; Gene named entity; Named entity recognition
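
The abstract describes feeding an entity dictionary into the learning model as features but does not give the feature templates. The following is a minimal sketch of one common way to do this, not the authors' implementation: a greedy longest-match lookup marks each token as the beginning of, inside, or outside a dictionary entry, and that mark is added to the per-token feature dictionary that a CRF toolkit would consume. The dictionary contents, the tag names (`B-DICT`, `I-DICT`, `O`), and the function names `dictionary_tags` and `token_features` are all assumptions made for illustration.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy gene dictionary; each entry is a tuple of lowercased tokens.
GENE_DICT: Set[Tuple[str, ...]] = {
    ("il-2",),
    ("nf-kappa", "b"),
    ("tumor", "necrosis", "factor"),
}
MAX_LEN = max(len(entry) for entry in GENE_DICT)


def dictionary_tags(tokens: List[str]) -> List[str]:
    """Greedy longest-match lookup: one B-DICT / I-DICT / O tag per token."""
    lowered = [t.lower() for t in tokens]
    tags = ["O"] * len(tokens)
    i = 0
    while i < len(tokens):
        matched = 0
        # Try the longest candidate span first, then shrink it.
        for n in range(min(MAX_LEN, len(tokens) - i), 0, -1):
            if tuple(lowered[i:i + n]) in GENE_DICT:
                matched = n
                break
        if matched:
            tags[i] = "B-DICT"
            for j in range(i + 1, i + matched):
                tags[j] = "I-DICT"
            i += matched
        else:
            i += 1
    return tags


def token_features(tokens: List[str]) -> List[Dict[str, str]]:
    """Build per-token features, including the dictionary tag and its context."""
    dict_tags = dictionary_tags(tokens)
    feats = []
    for i, tok in enumerate(tokens):
        feats.append({
            "word": tok.lower(),
            "has_digit": str(any(c.isdigit() for c in tok)),
            "dict": dict_tags[i],
            "dict_prev": dict_tags[i - 1] if i > 0 else "BOS",
            "dict_next": dict_tags[i + 1] if i < len(tokens) - 1 else "EOS",
        })
    return feats


sentence = ["Expression", "of", "tumor", "necrosis", "factor", "and", "IL-2"]
features = token_features(sentence)
```

Each element of `features` is a plain mapping from feature name to value, which is the general shape that CRF toolkits accept for a token. Whether the dictionary tag is used alone or combined with neighboring tags, as in `dict_prev` and `dict_next` here, is a design choice the abstract does not specify.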