The genome sequence of the Severe Acute Respiratory Syndrome (SARS)-associated virus provides essential information for the identification of pathogen(s), exploration of etiology and evolution, interpretation of trans...The genome sequence of the Severe Acute Respiratory Syndrome (SARS)-associated virus provides essential information for the identification of pathogen(s), exploration of etiology and evolution, interpretation of transmission and pathogenesis, development of diagnostics, prevention by future vaccination, and treatment by developing new drugs. We report the complete genome sequence and comparative analysis of an isolate (BJ01) of the coronavirus that has been recognized as a pathogen for SARS. The genome is 29725 nt in size and has 11 ORFs (Open Reading Frames). It is composed of a stable region encoding an RNA-dependent RNA polymerase (composed of 2 ORFs) and a variable region representing 4 CDSs (coding sequences) for viral structural genes (the S, E, M, N proteins) and 5 PUPs (putative uncharacterized proteins). Its gene order is identical to that of other known coronaviruses. The sequence alignment with all known RNA viruses places this virus as a member in the family of Coronaviridae. Thirty putative substitutions have been identified by comparative analysis of the 5 SARS- associated virus genome sequences in GenBank. Fifteen of them lead to possible amino acid changes (non-synonymous mutations) in the proteins. Three amino acid changes, with predicted alteration of physical and chemical features, have been detected in the S protein that is postulated to beinvolved in the immunoreactions between the virus and its host. Two amino acid changes have been detected in the Mprotein, which could be related to viral envelope formation. Phylogenetic analysis suggests the possibility of non-human origin of the SARS-associated viruses but provides noevidence that they are man-made. Further efforts should focus on identifying the etiology of the SARS-associated virus and ruling out conclusively the existence of otherpossible SARS-related pathogen(s).展开更多
Positive correlation between recombination rate and nucleotide diversity has been observed in a wide variety of eukaryotes on megabase scale. On the basis of genome-wide chicken genetic variation map generated by comp...Positive correlation between recombination rate and nucleotide diversity has been observed in a wide variety of eukaryotes on megabase scale. On the basis of genome-wide chicken genetic variation map generated by comparing three domestic breeds with wild ancestor and the positions of markers on the genetic linkage map, we found that SNPs rates were similar for all chromosomes while the recombina-tion rates increased in micro chromosomes. In other words no correlation exists in chromosome size. Nevertheless, when we scanned the genome by calculating the values of each characteristic within non-overlapping windows, instead of single value for each chromosomes, the nucleotide diversity was found to be significantly correlated with the recombination rate (r=0.27, P<0.0005). Furthermore, the significant association not only existed between these two features, but also existed between all 6 pairwise combinations of nucleotide diversity, recombination rate, GC content and average gene length. This co-variation is very meaningful for the studies of sequence evolution.展开更多
文摘The genome sequence of the Severe Acute Respiratory Syndrome (SARS)-associated virus provides essential information for the identification of pathogen(s), exploration of etiology and evolution, interpretation of transmission and pathogenesis, development of diagnostics, prevention by future vaccination, and treatment by developing new drugs. We report the complete genome sequence and comparative analysis of an isolate (BJ01) of the coronavirus that has been recognized as a pathogen for SARS. The genome is 29725 nt in size and has 11 ORFs (Open Reading Frames). It is composed of a stable region encoding an RNA-dependent RNA polymerase (composed of 2 ORFs) and a variable region representing 4 CDSs (coding sequences) for viral structural genes (the S, E, M, N proteins) and 5 PUPs (putative uncharacterized proteins). Its gene order is identical to that of other known coronaviruses. The sequence alignment with all known RNA viruses places this virus as a member in the family of Coronaviridae. Thirty putative substitutions have been identified by comparative analysis of the 5 SARS- associated virus genome sequences in GenBank. Fifteen of them lead to possible amino acid changes (non-synonymous mutations) in the proteins. Three amino acid changes, with predicted alteration of physical and chemical features, have been detected in the S protein that is postulated to beinvolved in the immunoreactions between the virus and its host. Two amino acid changes have been detected in the Mprotein, which could be related to viral envelope formation. Phylogenetic analysis suggests the possibility of non-human origin of the SARS-associated viruses but provides noevidence that they are man-made. Further efforts should focus on identifying the etiology of the SARS-associated virus and ruling out conclusively the existence of otherpossible SARS-related pathogen(s).
基金Supported by the Chinese Academy of Sciences (Grant Nos. GJHZ0701-6 and KSCX2-YW-N-023)Ministry of Science and Technology under high-tech program 863 (Grant Nos. 2006AA10A121 and 2006AA02Z334)+3 种基金Chinese 973 Program (Grant Nos. 2007CB815703 and 2007CB815705)Ministry of Education (Grant No. XXBKYHT2006001)National Natural Science Foundation of China (Grant Nos. 30725008, 90608010, 90403130 and 90612019)Chinese Municipal Science and Technology Commission (Grant No. D07030200740000)
文摘Positive correlation between recombination rate and nucleotide diversity has been observed in a wide variety of eukaryotes on megabase scale. On the basis of genome-wide chicken genetic variation map generated by comparing three domestic breeds with wild ancestor and the positions of markers on the genetic linkage map, we found that SNPs rates were similar for all chromosomes while the recombina-tion rates increased in micro chromosomes. In other words no correlation exists in chromosome size. Nevertheless, when we scanned the genome by calculating the values of each characteristic within non-overlapping windows, instead of single value for each chromosomes, the nucleotide diversity was found to be significantly correlated with the recombination rate (r=0.27, P<0.0005). Furthermore, the significant association not only existed between these two features, but also existed between all 6 pairwise combinations of nucleotide diversity, recombination rate, GC content and average gene length. This co-variation is very meaningful for the studies of sequence evolution.