
Issues on Construction of a Non-Redundant Exon/Intron Database (EID)
Abstract: The Exon/Intron Database (EID) built from GenBank contains a large amount of redundant data. To address this redundancy, a non-redundant EID was constructed from RefSeq. RefSeq is a reference sequence collection maintained and updated by NCBI staff for medical, functional, and diversity studies; it provides a consistent reference for genome annotation, gene identification and characterization, mutation and polymorphism analysis, expression studies, and comparative analyses. The resulting EID is suited to large-scale computational analysis of exon/intron structure and splicing. It includes internal filters that check sequence quality, consistency of gene descriptions, conformance to standards, and possible errors. A new addition is data on the untranslated regions (UTRs) of gene sequences. This paper describes several issues that arise in constructing the non-redundant EID from RefSeq.
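Source: Journal of South China Normal University (Natural Science Edition), 2009, No. 4, pp. 94-96, 110 (4 pages). Indexed in CAS and the Peking University Core Journals list.
Funding: National Natural Science Foundation of China (Grant No. 30470495)
Keywords: non-redundant EID; RefSeq; splicing; untranslated region (UTR)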
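
The abstract does not give implementation details, so the following is only a minimal sketch of the kind of coordinate extraction an exon/intron database with UTR data depends on. It assumes GenBank-style `join(a..b,c..d)` location strings for an mRNA feature and its CDS feature, plus-strand genes only, and 1-based inclusive coordinates. The function names (`parse_join`, `introns_between`, `utr_segments`) are hypothetical and are not taken from the paper.

```python
import re

def parse_join(location):
    """Parse a location such as 'join(10..50,101..180)' into a sorted list
    of 1-based inclusive (start, end) intervals.
    Assumption: no complement(), no single-base positions."""
    spans = re.findall(r"<?(\d+)\.\.>?(\d+)", location)
    return sorted((int(start), int(end)) for start, end in spans)

def introns_between(exons):
    """Return the gaps between consecutive exons as intron intervals."""
    introns = []
    for (_, prev_end), (next_start, _) in zip(exons, exons[1:]):
        if next_start - prev_end > 1:  # skip exons that directly abut
            introns.append((prev_end + 1, next_start - 1))
    return introns

def utr_segments(mrna_exons, cds_exons):
    """Split the mRNA exon parts that lie outside the CDS into 5' and 3' UTR
    intervals. Assumption: plus strand, CDS contained in the mRNA."""
    cds_start, cds_end = cds_exons[0][0], cds_exons[-1][1]
    five_prime, three_prime = [], []
    for start, end in mrna_exons:
        if start < cds_start:
            five_prime.append((start, min(end, cds_start - 1)))
        if end > cds_end:
            three_prime.append((max(start, cds_end + 1), end))
    return five_prime, three_prime

# Illustrative coordinates only, not real RefSeq data.
mrna = parse_join("join(1..60,101..180,251..320)")
cds = parse_join("join(21..60,101..180,251..290)")
introns = introns_between(mrna)
utr5, utr3 = utr_segments(mrna, cds)
```

Keeping exons, introns, and UTRs as plain interval lists makes simple consistency filters easy to add, for example rejecting records whose intervals overlap or whose CDS extends beyond the mRNA.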


