Analysis of correlation of local GC level in human protein coding genes
Abstract: GC content is an important feature of the base composition of genomic DNA and carries information about gene structure, function, and evolution. In this study, nonredundant DNA sequences of 7,992 human protein-coding genes were retrieved from public databases, and the local GC content of different gene regions and the correlations among them were analyzed. The results show that local GC content is heterogeneous across gene regions: 5′ untranslated regions (5′ UTRs) had the highest GC level, averaging 62.56%, whereas 3′ untranslated regions (3′ UTRs) had the lowest, averaging 43.97%. The GC content of 3′ flanking sequences was a good proxy for the GC level of the long DNA segment in which a gene is located. Although open reading frames (ORFs) had higher GC content than introns, 3′ UTRs, and 3′ flanking sequences, the GC contents of these four regions were all highly correlated with one another. The average GC content at the third codon position (GC3) was 58.09%, significantly higher than at the first and second codon positions, and it was highly correlated with the GC level of ORFs (correlation coefficient 0.91). GC3 was also highly correlated with the GC levels of introns, 3′ UTRs, and 3′ flanking sequences, and the linear regression of GC3 on the GC content of 3′ flanking sequences had a slope of 1.25.
GC3 can therefore serve as a sensitive indicator of changes in the GC level of the region where a gene is located. By contrast, the GC levels of the first and second codon positions, 5′ flanking sequences, and 5′ UTRs were only weakly correlated with those of other gene regions. These results suggest that bases at third codon positions and in introns, 3′ UTRs, and 3′ flanking sequences may have undergone similar evolutionary processes, whereas first and second codon positions, 5′ flanking sequences, and 5′ UTRs have undergone different mutation and selection because of functional requirements.
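The quantities named in the abstract (regional GC content, GC at a given codon position, a correlation coefficient, and a regression slope) can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names are hypothetical, the correlation is assumed to be Pearson's r, and the regression is assumed to be ordinary least squares, neither of which the abstract specifies.

```python
from statistics import mean


def gc_content(seq: str) -> float:
    """Percentage of G and C among the unambiguous bases (A, C, G, T) of seq."""
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    total = gc + seq.count("A") + seq.count("T")
    return 100.0 * gc / total if total else 0.0


def codon_position_gc(orf: str, position: int) -> float:
    """GC percentage at codon position 1, 2 or 3 of an in-frame ORF.

    position=3 gives the GC3 value discussed in the abstract.
    """
    if position not in (1, 2, 3):
        raise ValueError("position must be 1, 2 or 3")
    usable = len(orf) - len(orf) % 3  # ignore a trailing partial codon
    return gc_content(orf[position - 1:usable:3])


def pearson_r(xs, ys) -> float:
    """Pearson correlation coefficient between two equal-length samples."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


def regression_slope(xs, ys) -> float:
    """Ordinary least-squares slope of ys regressed on xs."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    return sxy / sxx
```

With one GC3 value and one 3′ flanking GC value per gene, `pearson_r(flank_gc, gc3)` and `regression_slope(flank_gc, gc3)` would give the kind of correlation and slope the abstract reports. A slope above 1 means GC3 changes by more than one percentage point for each percentage-point change in flanking GC content, which is why the abstract calls GC3 a sensitive indicator.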
Source: Hereditas (Beijing) (《遗传》), 2008, No. 9, pp. 1169-1174 (6 pages). Indexed in CAS, CSCD, and the Peking University Core Journals list.
Funding: Supported by the Sichuan Province Applied Basic Research Program (No. 03JY029-041).
Keywords: local GC level; correlation; human protein coding genes