期刊文献+

生物文本中蛋白质名称的识别 被引量:2

Protein Name Recognition from Biological Text
下载PDF
导出
摘要 随着基因和蛋白质序列的发布和分子生物学研究的发展,其相关的数据呈指数级增长,因此如何从海量的相关文献中直接获取生物学家研究领域的相关信息变得迫在眉睫,识别生物文献中的命名实体如蛋白质、基因、脱氧核糖核酸名称等成为生物信息学中信息抽取的最基本任务。介绍了国际同类研究中生物命名实体识别的各种方法,重点介绍了蛋白质名称识别的相关方法、所用资源、实验结果及与国际同类研究的比较结果。 The genome sequence has ushered in a new era of rapid and exponential growth of data related to the biology community. Thus, there is a clear need in this area for automatic methods of extracting specific information directly relating to the interests of biology researchers. Name Entity(NE) such as protein, gene, DNA, etc. recognized from biological literature is a fundamental task in information extraction of bioinformatics. This paper introduces various methods of biological name entity recognition in international research on this area. Then the methods are presented with the relevant corpus and experiment resuits for protein name recognition. The promising results are gotten compared with the other state-of-the-art research.
出处 《计算机应用研究》 CSCD 北大核心 2007年第1期100-102,共3页 Application Research of Computers
基金 国家自然科学基金资助项目(60302021)
关键词 生物信息 命名实体识别 机器学习 特征选择 Bioinformatics Name Entity Recognition Machine Learning Feature Selection
  • 相关文献

参考文献10

  • 1Mika S,B Rost.Protein Names Peeled Precisely off Free Text[J].Bioinformatics,2004,20(Suppl 1):I241-I247.
  • 2Franzen K,Eriksson G,Olsson F,et al.Protein Names and How to Find Them[J].Int J Med Inf,2002,67(1-3):49-61.
  • 3K Fukuda,A Tamura,T Tsunoda,et al.Toward Information Extraction:Identifying Protein Names from Biological Papers[C].Proceedings of Pacific Symposium on Biocomputing,1998.707-718.
  • 4T Ohta,Y Tateishi,H Mima,et al.The GENIA Corpus:An Annotated Research Abstract Corpus in the Molecular Biology Domain[C].Human Language Technologies Conference,2002.73-77.
  • 5Tong Zhang,David E Johnson.A Robust Risk Minimization Based Named Entity Recognition System[C].Proceedings of CoNLL,2003.204-207.
  • 6Tong Zhang,Fred Damerau,David E Johnson.Text Chunking Based on a Generalization of Winnow[J].Journal of Machine Learning Research,2002,(2):615-637.
  • 7Radu Florian,Abe Ittycheriah,Hongyan Jing,et al.Named Entity Recognition Through Classifier Combination[C].Proceedings of CoNLL,2003.168-171.
  • 8Tong Zhang.Large Margin Winnow Methods for Text Categorization[C].KDD Workshop on Text Mining,2000.81-87.
  • 9Schwartz A,Hearst M.A Simple Algorithm for Identifying Abbreviation Definitions in Biomedical Text[J].Pacific Symposium on Biocomputing,2003,(8):451-462.
  • 10Zhou G,Zhang J,Su J,et al.Recognizing Names in Biomedical Texts:A Machine Learning Approach[J].Bioinformatics,2004,20(7):1178-1190.

同被引文献51

  • 1王浩畅,赵铁军,刘延力,于浩.生物医学文本中命名实体识别的智能化方法[J].北京邮电大学学报,2006,29(z2):54-58. 被引量:2
  • 2徐健,张智雄.典型关系抽取系统的技术方法解析[J].数字图书馆论坛,2008(9):13-18. 被引量:3
  • 3王浩畅,赵铁军.基于SVM的生物医学命名实体的识别[J].哈尔滨工程大学学报,2006,27(B07):570-574. 被引量:18
  • 4邹霞.英语复合词的述谓结构与语义格研究[J].邵阳学院学报(社会科学版),2007,6(3):89-91. 被引量:3
  • 5Cohen AM,Hersh WR.A aurvey of current work in biomedical text mining[J].Brief Bioinform(S1467-5463),2005,6(1):57-71.
  • 6Huang W,Nakamori Y,Wang S,et al.Mining scientific literature to predict new relationships[J].Intell Data Anal (S1088-467X),2005,9(2):219-234.
  • 7Cohen KB,Hunter L.Getting started in text mining[J].PLoS Comput Biol(S1553-734X),2008,4(1):1-3.
  • 8Ganiz MC,Pottenger WM,Janneck CD.Recent advances in literature based discovery[J/OL].http://dimacs.rutgers.edu/-billp/pubs/JASISTLBD.pdf.
  • 9Mendonca EA,Cimino JJ.Automated knowledge extraction from MEDLINE citations[J].Proc AMIA Symp(S1531-605X),2000:575-579.
  • 10Skusa A,Ruegg A,Kohler J.Extraction of biological interaction networks from scientific literature[J].Brief Bioinform(S1467-5463),2005,6(3):263-276.

引证文献2

二级引证文献16

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部