Cereal genes are classified into two distinct classes according to the guanine-cytosine (GC) content at the third codon sites (GC3). Natural selection and mutation bias have been proposed to affect the GC content....Cereal genes are classified into two distinct classes according to the guanine-cytosine (GC) content at the third codon sites (GC3). Natural selection and mutation bias have been proposed to affect the GC content. However, there has been controversy about the cause of GC variation. Here, we characterized the GC content of 1 092 paralogs and other single-copy genes in the duplicated chromosomal regions of the rice genome (ssp. indica) and classified the paralogs into GC3-rich and GC3-poor groups. By referring to out-group sequences from Arabidopsis and maize, we confirmed that the average synonymous substitution rate of the GC3-rich genes is significantly lower than that of the GC3-poor genes. Furthermore, we explored the other possible factors corresponding to the GC variation including the length of coding sequences, the number of exons in each gene, the number of genes in each family, the location of genes on chromosomes and the protein functions. Consequently, we propose that natural selection rather than mutation bias was the primary cause of the GC variation.展开更多
In this study, 414 whole protein-coding sequences (238 004 codons) of alternatively spliced genes of human chromosome 1 have been employed to explore the patterns of codon usage bias among genes. Overall codon usage d...In this study, 414 whole protein-coding sequences (238 004 codons) of alternatively spliced genes of human chromosome 1 have been employed to explore the patterns of codon usage bias among genes. Overall codon usage data analysis indicates that G- and C-ending codons are predominant in the genes. The base usage in all three codon positions suggests a selection-mutation balance. Multivariate statistical analysis reveals that the codon usage variation has a strong positive correlation with the expressivities of the genes (r=0.5790, P<0.0001). All 27 codons identified as optimal are G- and C-ending codons. Correlation analysis shows a strong negative correlation between the gene length and codon adaptation index value (r=0.2252, P<0.0001), and a significantly positive correlation between the gene length and Nc values (r=0.1876, P<0.0001). These results suggest that the comparatively shorter genes in the genes have higher codon usage bias to maximize translational efficiency, and selection may also contribute to the reduction of highly expressed proteins.展开更多
While some pseudogenes have been reported to play important roles in gene regulation, little is known about the possible relationship between pseudogene functions and evolutionary process of pseudogenes, or about the ...While some pseudogenes have been reported to play important roles in gene regulation, little is known about the possible relationship between pseudogene functions and evolutionary process of pseudogenes, or about the forces responsible for the pseudogene evolution. In this study, we characterized human processed pseudogenes in terms of evolutionary dynamics. Our results show that pseudogenes tend to evolve toward: lower GC content, strong dinucleotide bias, reduced abundance of transcription factor binding motifs and short palindromes, and decreased ability to form nucleosomes. We explored possible evolutionary forces that shaped the evolution pattern of pseudogenes, and concluded that mutations in pseudogenes are likely determined, at least partially, by neighbor-dependent mutational bias and recombination-associated selection.展开更多
基金the State Key Basic Research and Development Plan of China(2003CB715900)the National Natural Science Foundation of China(90408015,30121003 and 30430030).
文摘Cereal genes are classified into two distinct classes according to the guanine-cytosine (GC) content at the third codon sites (GC3). Natural selection and mutation bias have been proposed to affect the GC content. However, there has been controversy about the cause of GC variation. Here, we characterized the GC content of 1 092 paralogs and other single-copy genes in the duplicated chromosomal regions of the rice genome (ssp. indica) and classified the paralogs into GC3-rich and GC3-poor groups. By referring to out-group sequences from Arabidopsis and maize, we confirmed that the average synonymous substitution rate of the GC3-rich genes is significantly lower than that of the GC3-poor genes. Furthermore, we explored the other possible factors corresponding to the GC variation including the length of coding sequences, the number of exons in each gene, the number of genes in each family, the location of genes on chromosomes and the protein functions. Consequently, we propose that natural selection rather than mutation bias was the primary cause of the GC variation.
基金the National Natural Science Foundation of China (No. 60171038) and the Science and Technology Ministry of China (No. 2001CCA01400)
文摘In this study, 414 whole protein-coding sequences (238 004 codons) of alternatively spliced genes of human chromosome 1 have been employed to explore the patterns of codon usage bias among genes. Overall codon usage data analysis indicates that G- and C-ending codons are predominant in the genes. The base usage in all three codon positions suggests a selection-mutation balance. Multivariate statistical analysis reveals that the codon usage variation has a strong positive correlation with the expressivities of the genes (r=0.5790, P<0.0001). All 27 codons identified as optimal are G- and C-ending codons. Correlation analysis shows a strong negative correlation between the gene length and codon adaptation index value (r=0.2252, P<0.0001), and a significantly positive correlation between the gene length and Nc values (r=0.1876, P<0.0001). These results suggest that the comparatively shorter genes in the genes have higher codon usage bias to maximize translational efficiency, and selection may also contribute to the reduction of highly expressed proteins.
基金supported by the National Natural Science Foundation(61102162,61271448,31260274,31260219)the Program for Young Talents of Science and Technology in Universities of Inner Mongolia Autonomous Region(NJYT-14-B10)the Inner Mongolia Science&Technology Plan(20140401)
文摘While some pseudogenes have been reported to play important roles in gene regulation, little is known about the possible relationship between pseudogene functions and evolutionary process of pseudogenes, or about the forces responsible for the pseudogene evolution. In this study, we characterized human processed pseudogenes in terms of evolutionary dynamics. Our results show that pseudogenes tend to evolve toward: lower GC content, strong dinucleotide bias, reduced abundance of transcription factor binding motifs and short palindromes, and decreased ability to form nucleosomes. We explored possible evolutionary forces that shaped the evolution pattern of pseudogenes, and concluded that mutations in pseudogenes are likely determined, at least partially, by neighbor-dependent mutational bias and recombination-associated selection.