The Transcript-centric Mutations in Human Genomes

Abstract: Since the human genome is mostly transcribed, genetic variations must exhibit sequence signatures reflecting the relationship between transcription processes and chromosomal structures, as we have observed in unicellular organisms. In this study, a set of 646 ubiquitous expression-invariable genes (EIGs) that are present in germline cells was defined and examined based on RNA-sequencing data from multiple high-throughput transcriptomic datasets. We demonstrated a relationship between gene expression level and transcript-centric mutations in the human genome based on single nucleotide polymorphism (SNP) data. A significant positive correlation was shown between gene expression and mutation, where highly-expressed genes accumulate more mutations than lowly-expressed genes. Furthermore, we found four major types of transcript-centric mutations in human genomes (C→T, A→G, C→ and G→T) and identified a negative gradient of the sequence variations running from the 5' end to the 3' end of the transcription units (TUs). The periodic occurrence of these genetic variations across TUs is associated with nucleosome phasing. We propose that transcript-centric mutations are one of the major driving forces for gene and genome evolution, along with creation of new genes, gene/genome duplication, and horizontal gene transfer.
Source: Genomics, Proteomics & Bioinformatics (基因组蛋白质组与生物信息学报, English edition; CAS, CSCD), 2012, Issue 1, pp. 11-22 (12 pages)
Funding: Supported by grants from the National Basic Research Program (973 Program; 2011CB944100 and 2011CB944101) and the National Natural Science Foundation of China (90919024), awarded to JY, and from the Knowledge Innovation Program of the Chinese Academy of Sciences (KSCX2-EW-R-01-04), awarded to SH.
Keywords: RNA-seq, genetic variations, sequence signatures
