The diversity and classification of microbes has been a long-standing issue.Molecular phylogeny of the prokaryotes based on comparison of the 16S rRNA sequences of the small ribosomal subunit has led to a reasonable t...The diversity and classification of microbes has been a long-standing issue.Molecular phylogeny of the prokaryotes based on comparison of the 16S rRNA sequences of the small ribosomal subunit has led to a reasonable tree of life in the late 1970s. How-ever, the availability of more and more complete bacterial genomes has brought about complications instead of refinement of the tree. In particular, it turns out that different choice of genes may tell different history. This might be caused by possible horizontal gene transfer (HGT) among species. There is an urgent need to develop phylogenetic methods that make use of whole genome data. We describe a new approach in molecular phylogeny,namely, tree construction based on K-tuple frequency analysis of the genomic sequences.Putting aside the technicalities, we emphasize the transition from randomness to determin-ism when the string length K increases and try to comment on the challenge mentioned in the title.展开更多
基金This work was partially supported by the Natural Science Foundation of China,the Special Funds for Major State Basic Research Project,the Innovation Project of the Chinese Academy of Sciences,and the Major Innovation Research Project"248"of Beijing Munic
文摘The diversity and classification of microbes has been a long-standing issue.Molecular phylogeny of the prokaryotes based on comparison of the 16S rRNA sequences of the small ribosomal subunit has led to a reasonable tree of life in the late 1970s. How-ever, the availability of more and more complete bacterial genomes has brought about complications instead of refinement of the tree. In particular, it turns out that different choice of genes may tell different history. This might be caused by possible horizontal gene transfer (HGT) among species. There is an urgent need to develop phylogenetic methods that make use of whole genome data. We describe a new approach in molecular phylogeny,namely, tree construction based on K-tuple frequency analysis of the genomic sequences.Putting aside the technicalities, we emphasize the transition from randomness to determin-ism when the string length K increases and try to comment on the challenge mentioned in the title.