期刊文献+

生物医学命名实体识别的研究与进展 被引量:25

Research and development on biomedical named entity recognition
下载PDF
导出
摘要 为直接高效地获取文献中的知识,命名实体识别用来识别文本中具有特定意义的实体。这是应用文本挖掘技术自动获取知识的关键的第一步,因此受到日益广泛的关注。主要从评测方法、特征选择、机器学习方法和后期处理等方面介绍了近年来生物医学命名实体识别方面的主要研究方法及成果,并对目前各方面存在的问题进行了分析和讨论,最后对该领域的研究前景进行了展望。 In order to acquire the useful knowledge directly and effectively from documents, named entity recognition is to recognize the meaningful entities in documents. It is the first and important step to acquire relevant knowledge automatically by using text mining technology. This review introduced main approaches and achievements in recognizing biomedical named entities, focusing on the strategy of evaluation, feature selection, methods on machine learning and the post-processing. It also looked into the current problems and displayed the promising solutions. The last section drew the prospection for the research on biomedical named entity recognition.
出处 《计算机应用研究》 CSCD 北大核心 2010年第3期811-815,832,共6页 Application Research of Computers
基金 国家自然科学基金资助项目(60773021 60603054)
关键词 命名实体识别 文本挖掘 特征选择 机器学习 named entity recognition text mining feature selection machine learning
  • 相关文献

参考文献38

  • 1TANABE L, WILBUR W J. A priority model for named entities [ C ]//Proc of Human Language Technology Conference. Morristown : Association for Computational Linguistics, 2006 : 33-40.
  • 2GU Bao-hua. Recognizing nested named entities in GENIA corpus [ C ]//Proc of Human Language Technology Conference. Morristown : Association for Computational Linguistics, 2006 : 112-113.
  • 3SUNDHEIM B M. Overview of results of the M UC-6 evaluation [ C ]// Proc of the 6th Conference on Message Under Standing. Morristown: Association for Computational Linguistics, 1996:423-442.
  • 4KIM J, OHTA T, TSURUOKA Y, et al. Introduction to the bio-entity recognition task at JNLPBA[ C ]//Proc of International Workshop on Natural Language Processing in Biomedicine and It's Applications. 2004 : 70 - 75.
  • 5YEH A, MORGAN A, COLOSIMO M, et al. BioCreAtIvE task 1A: gene mention finding evaluation[ J]. BMC Bioinformatics, 2005,6 (1) : S2.
  • 6LEAMAN R, GONZALEZ G. BANNER: an executable survey of advances in biomedical named entity recognition [ C ]//Proc of Pacific Symposium on Biocomputing. 2008:652-663.
  • 7KIM J D, OHTA T, TATEISI Y, et al. GENIA corpus:a semantically annotated corpus for bio-textmining [ J]. Bioinformatios, 2003, 19(1) : i180-i182.
  • 8TANABE L, XIE N, THOM L H, et al. GENETAG: a tagged corpus for gene/protein named entity recognition [ J]. BMC Bioinformatics, 2005,6( 1 ) : $3.
  • 9COHEN K B, FOX L, OGREN P V, et al. Corpus design for biomedical natural language processing [ C ]//Proc of ACL-ISMB Workshop on Linking Biological Literature, Ontologies and Database. 2005,38-45.
  • 10PROUX D, RECHENMANN F, JULLIARD L, et al. Detecting gene symbols and names in biological texts : a first step toward pertinent information extraction[ C]//Proc of the 8th Workshop on Genome Informatics. 1998: 72-80.

同被引文献274

引证文献25

二级引证文献186

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部