Abstract: We perform an exhaustive, taxon-by-taxon comparison of the branchings in the composition vector trees (CVTrees) inferred from 432 prokaryotic genomes available on 31 December 2006 with the bacteriologists' taxonomy, primarily the latest online Outline of Bergey's Manual of Systematic Bacteriology. The CVTree phylogeny agrees very well with the Bergey's taxonomy in the majority of fine branchings and in overall structure. At the same time, most of the differences between the trees and the Manual have been known to biologists to some extent and may hint at taxonomic revisions. Instead of demonstrating the overwhelming agreement, this paper emphasizes the biological implications of the differences.
Funding: This work was partly supported by the Special Funds for Major State Basic Research Projects (Grant No. G2000077308), the National Natural Science Foundation of China (Grant No. 30170232), the Innovation Project of the Chinese Academy of Sciences, and by a grant from Shanghai Municipality via Fudan University.
Abstract: To show that the newly developed K-string composition distance method, which infers phylogenetic relations of prokaryotes by counting oligopeptide frequencies, works equally well without the whole proteome, we used all ribosomal proteins and the set of aminoacyl-tRNA synthetases for each species. The latter group is known to yield inconsistent trees when its members are used individually. Our trees are obtained without any sequence alignment. Altogether 16 Archaea, 105 Bacteria and 2 Eucarya are represented on the tree. Most of the lower branchings agree well with the latest (2003) Outline of the second edition of Bergey's Manual of Systematic Bacteriology, and the trees also suggest some relationships among higher taxa.
Funding: This work was partially supported by the Natural Science Foundation of China, the Special Funds for Major State Basic Research Project, the Innovation Project of the Chinese Academy of Sciences, and the Major Innovation Research Project "248" of Beijing Munic
Abstract: The diversity and classification of microbes has been a long-standing issue. Molecular phylogeny of the prokaryotes, based on comparison of the 16S rRNA sequences of the small ribosomal subunit, led to a reasonable tree of life in the late 1970s. However, the availability of more and more complete bacterial genomes has brought complications rather than refinement of the tree. In particular, it turns out that different choices of genes may tell different histories. This might be caused by horizontal gene transfer (HGT) among species. There is an urgent need to develop phylogenetic methods that make use of whole-genome data. We describe a new approach in molecular phylogeny, namely, tree construction based on K-tuple frequency analysis of the genomic sequences. Putting aside the technicalities, we emphasize the transition from randomness to determinism as the string length K increases and try to comment on the challenge mentioned in the title.
Funding: Supported by the National Basic Research Program of China (973 Project, Grant Nos. 2007CB814800 and 2013CB834100), the Shanghai Leading Academic Discipline Project (Grant No. B111), and the National Key Laboratory of Applied Surface Physics and the Department of Physics, Fudan University.
Abstract: Shigella species and Escherichia coli are closely related organisms. Early phenotyping experiments and several recent molecular studies put Shigella within the species E. coli. However, the whole-genome-based, alignment-free and parameter-free CVTree approach shows convincingly that four established Shigella species, Shigella boydii, Shigella sonnei, Shigella flexneri and Shigella dysenteriae, are distinct from E. coli strains and form sister species to E. coli within the genus Escherichia. In view of the overall success and high resolving power of the CVTree approach, this result should be taken seriously. We hope that the present report may promote further in-depth study of the Shigella-E. coli relationship.
Abstract: The Composition Vector Tree (CVTree) is a parameter-free and alignment-free method for inferring prokaryotic phylogeny from complete genomes. It is distinct from the traditional 16S rRNA analysis in both the input data and the methodology. The prokaryotic phylogenetic trees constructed with the CVTree method agree well with the Bergey's taxonomy in all major groupings and fine branching patterns. Thus, combined use of the CVTree approach and the 16S rRNA analysis may provide an objective and reliable reconstruction of the prokaryotic branch of the Tree of Life.
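The abstracts above describe the K-string approach only in words: count oligopeptide frequencies, then compare organisms without aligning sequences. The following is a minimal sketch of that idea, not the published CVTree procedure. It assumes raw K-string frequencies and a cosine-based distance; any background correction the actual method applies to the counts is omitted here. The function names and the toy sequences are hypothetical and chosen only for illustration.

```python
from collections import Counter
from math import sqrt


def k_string_counts(protein, k):
    """Count every overlapping K-string (oligopeptide of length k) in one sequence."""
    return Counter(protein[i:i + k] for i in range(len(protein) - k + 1))


def composition_vector(proteins, k):
    """Pool K-string counts over a set of proteins and normalize to frequencies."""
    total = Counter()
    for p in proteins:
        total.update(k_string_counts(p, k))
    n = sum(total.values())
    return {s: c / n for s, c in total.items()} if n else {}


def cosine_distance(u, v):
    """Return (1 - cos(u, v)) / 2 for two sparse vectors stored as dicts."""
    dot = sum(x * v.get(s, 0.0) for s, x in u.items())
    norm_u = sqrt(sum(x * x for x in u.values()))
    norm_v = sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.5  # assumption: an empty vector is treated as uncorrelated
    return (1.0 - dot / (norm_u * norm_v)) / 2.0


# Toy, made-up "proteomes" (hypothetical data, far shorter than real proteins).
species_a = ["MKVLAAGIVMKVLA"]
species_b = ["MKVLAAGIVMKVLG"]
species_c = ["WWPPHHQQWWPPHH"]

K = 3
vec_a = composition_vector(species_a, K)
vec_b = composition_vector(species_b, K)
vec_c = composition_vector(species_c, K)

print("d(a, b) =", cosine_distance(vec_a, vec_b))
print("d(a, c) =", cosine_distance(vec_a, vec_c))
```

A pairwise distance matrix built this way could then be passed to any distance-based tree-building routine. No alignment step is involved, since the comparison uses only the frequency vectors.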