KaKs_Calculator is a software package that calculates nonsynonymous (Ka) and synonymous (Ks) substitution rates through model selection and model averaging. Since existing methods for this estimation adopt their s...KaKs_Calculator is a software package that calculates nonsynonymous (Ka) and synonymous (Ks) substitution rates through model selection and model averaging. Since existing methods for this estimation adopt their specific mutation (substitution) models that consider different evolutionary features, leading to diverse estimates, KaKs_Calculator implements a set of candidate models in a maximum likelihood framework and adopts the Akaike information criterion to measure fitness between models and data, aiming to include as many features as needed for accurately capturing evolutionary information in protein-coding sequences. In addition, several existing methods for calculating Ka and Ks are also incorporated into this software. KaKs_Calculator, including source codes, compiled executables, and documentation, is freely available for academic use at http://evolution.genomics.org.cn/software.htm.展开更多
Positive correlation between recombination rate and nucleotide diversity has been observed in a wide variety of eukaryotes on megabase scale. On the basis of genome-wide chicken genetic variation map generated by comp...Positive correlation between recombination rate and nucleotide diversity has been observed in a wide variety of eukaryotes on megabase scale. On the basis of genome-wide chicken genetic variation map generated by comparing three domestic breeds with wild ancestor and the positions of markers on the genetic linkage map, we found that SNPs rates were similar for all chromosomes while the recombina-tion rates increased in micro chromosomes. In other words no correlation exists in chromosome size. Nevertheless, when we scanned the genome by calculating the values of each characteristic within non-overlapping windows, instead of single value for each chromosomes, the nucleotide diversity was found to be significantly correlated with the recombination rate (r=0.27, P<0.0005). Furthermore, the significant association not only existed between these two features, but also existed between all 6 pairwise combinations of nucleotide diversity, recombination rate, GC content and average gene length. This co-variation is very meaningful for the studies of sequence evolution.展开更多
A scientific dream proposed some 20 years ago has been realized-the completion of the DNA sequence for the Human Genome Project (HGP) in 2004. As a result, an entirely new field of biological research has arisen: g...A scientific dream proposed some 20 years ago has been realized-the completion of the DNA sequence for the Human Genome Project (HGP) in 2004. As a result, an entirely new field of biological research has arisen: genome biology or genomics is celebrated for its unprecedented scale, intrinsically digital output, and systematic approach to getting all the data. Its sequel, the HapMap Project, will reach fruition later this year. These projects established new precedents for international collaborations and open data access. Chinese scientists contributed to 1% of the展开更多
In the canonical version of evolution by gene duplication, one copy is kept unaltered while the other is free to evolve. This process of evolutionary experimentation can persist for millions of years. Since it is so s...In the canonical version of evolution by gene duplication, one copy is kept unaltered while the other is free to evolve. This process of evolutionary experimentation can persist for millions of years. Since it is so short lived in comparison to the lifetime of the core genes that make up the majority of most genomes, a substantial fraction of the genome and the transcriptome may—in principle—be attributable to what we will refer to as "evolutionary transients", referring here to both the process and the genes that have gone or are undergoing this process. Using the rice gene set as a test case, we argue that this phenomenon goes a long way towards explaining why there are so many more rice genes than Arabidopsis genes, and why most excess rice genes show low similarity to eudicots.展开更多
基金grants from the Ministry of Science and Technology of China (No. 2001AA231061) the National Natural Science Foundation of China (No. 30270748)
文摘KaKs_Calculator is a software package that calculates nonsynonymous (Ka) and synonymous (Ks) substitution rates through model selection and model averaging. Since existing methods for this estimation adopt their specific mutation (substitution) models that consider different evolutionary features, leading to diverse estimates, KaKs_Calculator implements a set of candidate models in a maximum likelihood framework and adopts the Akaike information criterion to measure fitness between models and data, aiming to include as many features as needed for accurately capturing evolutionary information in protein-coding sequences. In addition, several existing methods for calculating Ka and Ks are also incorporated into this software. KaKs_Calculator, including source codes, compiled executables, and documentation, is freely available for academic use at http://evolution.genomics.org.cn/software.htm.
基金Supported by the Chinese Academy of Sciences (Grant Nos. GJHZ0701-6 and KSCX2-YW-N-023)Ministry of Science and Technology under high-tech program 863 (Grant Nos. 2006AA10A121 and 2006AA02Z334)+3 种基金Chinese 973 Program (Grant Nos. 2007CB815703 and 2007CB815705)Ministry of Education (Grant No. XXBKYHT2006001)National Natural Science Foundation of China (Grant Nos. 30725008, 90608010, 90403130 and 90612019)Chinese Municipal Science and Technology Commission (Grant No. D07030200740000)
文摘Positive correlation between recombination rate and nucleotide diversity has been observed in a wide variety of eukaryotes on megabase scale. On the basis of genome-wide chicken genetic variation map generated by comparing three domestic breeds with wild ancestor and the positions of markers on the genetic linkage map, we found that SNPs rates were similar for all chromosomes while the recombina-tion rates increased in micro chromosomes. In other words no correlation exists in chromosome size. Nevertheless, when we scanned the genome by calculating the values of each characteristic within non-overlapping windows, instead of single value for each chromosomes, the nucleotide diversity was found to be significantly correlated with the recombination rate (r=0.27, P<0.0005). Furthermore, the significant association not only existed between these two features, but also existed between all 6 pairwise combinations of nucleotide diversity, recombination rate, GC content and average gene length. This co-variation is very meaningful for the studies of sequence evolution.
文摘A scientific dream proposed some 20 years ago has been realized-the completion of the DNA sequence for the Human Genome Project (HGP) in 2004. As a result, an entirely new field of biological research has arisen: genome biology or genomics is celebrated for its unprecedented scale, intrinsically digital output, and systematic approach to getting all the data. Its sequel, the HapMap Project, will reach fruition later this year. These projects established new precedents for international collaborations and open data access. Chinese scientists contributed to 1% of the
基金supported by Chinese Academy of Sciences (Grants No. GJHZ0701-6 and KSCX2-YWN-023)National Natural Science Foundation of China (Grants No. 30725008, 90403130, 90608010, 30221004, 90612019, and 30392130)+4 种基金the "973" Program (Grants No. 2006CB910400, 2007CB815701, 2007CB815703, and 2007CB815705)the "863" Program (Grants No. 2006AA02Z334, 2006AA10A121, and 2006AA02Z177)Beijing Municipal Science and Technology Commission (Grant No. D07030200740000)Danish Platform for Integrative Biology, Danish Natural Science Research Council, Danish Research Council, the Solexa Project (Grant No. 272-07-0196)National Science Foundation of USA (Grant No. DBI 0217241)
文摘In the canonical version of evolution by gene duplication, one copy is kept unaltered while the other is free to evolve. This process of evolutionary experimentation can persist for millions of years. Since it is so short lived in comparison to the lifetime of the core genes that make up the majority of most genomes, a substantial fraction of the genome and the transcriptome may—in principle—be attributable to what we will refer to as "evolutionary transients", referring here to both the process and the genes that have gone or are undergoing this process. Using the rice gene set as a test case, we argue that this phenomenon goes a long way towards explaining why there are so many more rice genes than Arabidopsis genes, and why most excess rice genes show low similarity to eudicots.