期刊文献+

基因组序列8-mer频次使用规律及与物种进化的关系 被引量:1

Rules of 8-mer usage in genome sequences and its relation to genome evolution
下载PDF
导出
摘要 基因组序列k-mer的非随机使用规律及包含的生物学意义一直是人们关注的问题,目前还没有根本性进展。本文以七个物种的全部基因序列为样本,得到各物种基因组序列的8-mer频谱分布。发现狗和牛的频谱有三个峰,而斑马鱼、青鳉鱼、秀丽线虫和酿酒酵母的频谱只有一个峰,鸡的频谱分布形状介于两者之间。将8-mer集合按照XY二核苷含量分类,结果显示只有CG二核苷分类下0CG、1CG和2CG三类子集的频谱形成各自独立的单峰分布。对照随机序列,发现0CG模体是随机进化的,1CG和2CG模体是定向进化的,它们的使用频次远小于随机频次,且这种独立进化分离规律具有物种普适性。三个CG子集频谱之间的距离是产生单峰或多峰现象的根本原因。将七个物种基因组序列标准化到109bp,比较发现1CG和2CG子集频谱与物种进化显著相关,0CG子集频谱与物种进化无显著关系。可以认为三种CG模体各自执行着不同的生物学功能。基因组序列8-mer的独立分离规律为揭示基因组结构、基因组进化以及模体的生物功能提供了一种新的思维方式。 The rules of k-mer non-random usage in genome sequences and its biological significance are important problems and its mechanism is still not clear. Based on seven genome sequences,the distributions of 8-mer frequency spectra were gotten. Results show that 8-mer spectra of dog and cow are trimodal and of zebra fish,medaka,nematode and yeast are unimodal. For chicken genome,the 8-mer spectrum is a medium between the two models. When the 8-mer set were classified into three subsets according to XY dinucleotide content,results show that only if in CG dinucleotide classification,the 0CG,1CG and 2CG subsets form independent and unimodal distributions respectively. Compared with random sequences,it is found that 0CG motifs are the result of the random evolution,1CG / 2CG motifs are the result of the directed evolution and their frequencies are far low from the random frequencies. The rules of independent separation for the three CG subsets have species universality. Results indicate that the prime reasons about unimdals or multimodals of 8-mer spectra in different species are the distance differences of the three CG spectra. When seven genome sequences are normalized into 109 bp,results show that the spectra of 1CG and 2CG motifs are correlated significantly with genome evolution and of 0CG motifs has not obvious relation to genome evolution. We think that the three CG motifs have different biological functions. The rules of independent separation for the three CG subsets will provide a novel idea to research genome structures and evolutions and provide a method to reveal the functional elements in genome sequences.
出处 《生物信息学》 2016年第4期195-202,共8页 Chinese Journal of Bioinformatics
基金 国家自然科学基金项目(No.31260219) 国家级大学生创新训练计划项目(No.201512149)
关键词 基因组序列 8-mer频谱 CG二核苷分类 独立分离规律 基因组进化 Genome sequence 8-mer spectrum CG dinucleotide classification Independent separation rule Genome evolution
  • 相关文献

参考文献1

二级参考文献25

  • 1Csrs M, No L, Kucherov G. Reconsidering the significance of genomic word frequencies. Trends Genet, 2007, 23(11): 543-546.
  • 2Tuller T, Chor B, Nelson N. Forbidden penta-peptides. Protein Sci, 2007, 16(10): 2251-2259.
  • 3Hao B, Lee HC, Zhang S. Fractals related to long DNA sequences and complete genomes. Chaos, Soliton Fract, 2000, 11(6): 825-836.
  • 4Subirana JA, Messeguer X. The most frequent short sequences in non-coding DNA. Nucleic Acids Res, 2010, 38(4): 1172-1181.
  • 5Hampikian G, Andersen T. Absent sequences: Nullomers and primes. Pac Syrup Biocomput, 2007, 12:355-366.
  • 6Hariharan R, Simon R, PJllai MR, Taylor TD. Comparative analysis of DNA word abundances in four yeast genomes using a novel statistical background mode. PLoS One, 2013, 8(3): e58038.
  • 7Yu HJ. Segmented k-mer and its application on similadty analysis of mitochonddal genome sequences. Gene, 2013, 518:419-424.
  • 8Chae H, Park J, Lee SW, Nephew KP, Kim S. Comparative analysis using k-mer and k-flank patterns provides evidence for CpG island sequence evolution in mammalian genomes. Nucleic Acids Res, 2013, 41 (9): 4783-4791.
  • 9Youngik Y, Kenneth N, Sun K. A novel k-mer mixture logistic regression for methylation susceptibility modeling of CpG dinucleotides in human gene promoters. BMC Bioinforrnatics, 2012, 13(Suppl 3): $15.
  • 10Rayan C, Paul M. Informed and automated k-mer size selection for genome assembly. Bioinformatics, 2013, 30(1): 31 -37.

共引文献3

同被引文献1

引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部