The Curation of Genetic Variants: Difficulties and Possible Solutions

Abstract: The curation of genetic variants from biomedical articles is required for various clinical and research purposes. Variant databases that consolidate the available information about variants are becoming increasingly popular. These databases are highly useful, serving as a user-friendly store of variant information for those who seek it. Although manual curation is the gold standard for curating variants, it becomes time-consuming at large scale, which makes automation necessary. Curating variants described in the biomedical literature is often not straightforward, mainly because of nomenclature and expression issues. Although current papers on variants increasingly follow the standard nomenclature, which makes variants easy to retrieve, the literature holds a massive store of variants recorded under non-standard names, and the online search engines that are predominantly used may not be able to find them. Effective curation requires knowledge of the overall curation process, the nature and types of difficulties involved, and ways to tackle those difficulties during the task. Only through effective curation can variants be correctly interpreted. This paper presents the process and difficulties of curating genetic variants, with possible solutions and suggestions drawn from our work experience in the field and supported by the literature. The paper also highlights aspects of the interpretation of genetic variants and the importance of writing papers on variants using standard and retrievable methods.
Source: Genomics, Proteomics & Bioinformatics (English edition), 2012, Issue 6, pp. 317-325 (9 pages). Indexed in SCIE, CAS, CSCD.
Keywords: Difficulties in curation; Automated curation; Manual curation; Interpretation of variants
