1
|
Rudra B, Duncan L, Shah AJ, Shah HN, Gupta RS. Phylogenomic and comparative genomic studies robustly demarcate two distinct clades of Pseudomonas aeruginosa strains: proposal to transfer the strains from an outlier clade to a novel species Pseudomonas paraeruginosa sp. nov. Int J Syst Evol Microbiol 2022; 72. [PMID: 36355412 DOI: 10.1099/ijsem.0.005542] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
The strains of
Pseudomonas aeruginosa
exhibit considerable differences in their genotypic and pathogenic properties. To clarify their evolutionary/taxonomic relationships, comprehensive phylogenomic and comparative genomic studies were conducted on the genome sequences of 212
P
.
aeruginosa
strains covering their genetic diversity. In a phylogenomic tree based on 118 conserved proteins, the analysed strains formed two distinct clades. One of these clades, Clade-1, encompassing >70 % of the strains including the type strain DSM 50071T, represents the species P. aeruginosa sensu stricto. Clade-2, referred to in earlier work as the outlier group, with NCTC 13628T as its type strain, constitutes a novel species level lineage. The average nucleotide identity, average amino acid identity and digital DNA–DNA hybridization values between the strains from Clade-1 and Clade-2 are in the range of 93.4–93.7, 95.1–95.3 and 52–53 %, respectively. The 16S rRNA gene of
P. aeruginosa
DSM 50071T also shows 98.3 % similarity to that of NCTC 13628T. These values are lower than the suggested cut-off values for species distinction, indicating that the Clade-2 strains (NCTC 13628T) constitute a new species. We also report the identification of 12 conserved signature indels in different proteins and 24 conserved signature proteins that are exclusively found in either Clade-1 or Clade-2, providing a reliable means for distinguishing these clades. Additionally, in contrast to swimming motility, twitching motility is only present in Clade-1 strains. Based on earlier work, the strains from these two clades also differ in their pathogenic mechanisms (presence/absence of Type III secretion system), production of biosurfactants, phenazines and siderophores, and several other genomic characteristics. Based on the evidence from different studies, we propose that the Clade-2 strains constitute a novel species for which the name Pseudomonas paraeruginosa is proposed. The type strain is NCTC 13628T (=PA7T=ATCC 9027T). The description of
Pseudomonas aeruginosa
is also emended to include information for different molecular markers specific for this species.
Collapse
Affiliation(s)
- Bashudev Rudra
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton L8N 3Z5, Canada
| | - Louise Duncan
- School of Cancer and Pharmaceutical Sciences, King's College London, London SE1 9NH, UK
| | - Ajit J Shah
- Department of Natural Sciences, Middlesex University, London NW4 4BT, UK
| | - Haroun N Shah
- Department of Natural Sciences, Middlesex University, London NW4 4BT, UK
| | - Radhey S Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton L8N 3Z5, Canada
| |
Collapse
|
2
|
Feng Y, Neri U, Gosselin S, Louyakis AS, Papke RT, Gophna U, Gogarten JP. The Evolutionary Origins of Extreme Halophilic Archaeal Lineages. Genome Biol Evol 2021; 13:6320066. [PMID: 34255041 PMCID: PMC8350355 DOI: 10.1093/gbe/evab166] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/10/2021] [Indexed: 12/12/2022] Open
Abstract
Interest and controversy surrounding the evolutionary origins of extremely halophilic Archaea has increased in recent years, due to the discovery and characterization of the Nanohaloarchaea and the Methanonatronarchaeia. Initial attempts in explaining the evolutionary placement of the two new lineages in relation to the classical Halobacteria (also referred to as Haloarchaea) resulted in hypotheses that imply the new groups share a common ancestor with the Haloarchaea. However, more recent analyses have led to a shift: the Nanohaloarchaea have been largely accepted as being a member of the DPANN superphylum, outside of the euryarchaeota; whereas the Methanonatronarchaeia have been placed near the base of the Methanotecta (composed of the class II methanogens, the Halobacteriales, and Archaeoglobales). These opposing hypotheses have far-reaching implications on the concepts of convergent evolution (distantly related groups evolve similar strategies for survival), genome reduction, and gene transfer. In this work, we attempt to resolve these conflicts with phylogenetic and phylogenomic data. We provide a robust taxonomic sampling of Archaeal genomes that spans the Asgardarchaea, TACK Group, euryarchaeota, and the DPANN superphylum. In addition, we assembled draft genomes from seven new representatives of the Nanohaloarchaea from distinct geographic locations. Phylogenies derived from these data imply that the highly conserved ATP synthase catalytic/noncatalytic subunits of Nanohaloarchaea share a sisterhood relationship with the Haloarchaea. We also employ a novel gene family distance clustering strategy which shows this sisterhood relationship is not likely the result of a recent gene transfer. In addition, we present and evaluate data that argue for and against the monophyly of the DPANN superphylum, in particular, the inclusion of the Nanohaloarchaea in DPANN.
Collapse
Affiliation(s)
- Yutian Feng
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Uri Neri
- Shmunis School of Biomedicine and Cancer Research, Tel Aviv University, Israel
| | - Sean Gosselin
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Artemis S Louyakis
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - R Thane Papke
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Uri Gophna
- Shmunis School of Biomedicine and Cancer Research, Tel Aviv University, Israel
| | - Johann Peter Gogarten
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA.,Institute for Systems Genomics, University of Connecticut, Storrs, Connecticut, USA
| |
Collapse
|
3
|
Hellmuth M, Seemann CR. Alternative characterizations of Fitch's xenology relation. J Math Biol 2019; 79:969-986. [PMID: 31111195 DOI: 10.1007/s00285-019-01384-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2018] [Revised: 05/08/2019] [Indexed: 11/25/2022]
Abstract
Horizontal gene transfer (HGT) is an important factor for the evolution of prokaryotes as well as eukaryotes. According to Walter M. Fitch, two genes are xenologs if they are separated by at least one HGT. This concept is formalized through Fitch relations, which are defined as binary relations that comprise all pairs (x, y) of genes x and y for which y has been horizontally transferred at least once since it diverged from the last common ancestor of x and y. This definition, in particular, preserves the directional character of the transfer. Fitch relations are characterized by a small set of forbidden induced subgraphs on three vertices and can be recognized in linear time. The mathematical characterization of Fitch relations is crucial to understand whether putative xenology relations are at least to some extent "biologically feasible". In this contribution, we provide two novel characterizations of Fitch relations. In particular, these results allow us directly to reconstruct gene trees (together with the location of the horizontal transfer events) that explain the underlying Fitch relation. As a biological side result, we can conclude that the phylogenetic signal to infer these gene trees is entirely contained in those pairs of genes x and y for which no directional transfer has been taken place in the common history of y and the last common ancestor of x and y. In other words, non-HGT events provide the essential information about the gene trees. In addition, we utilize the new characterizations to present an alternative, short and elegant proof of the characterization theorem established by Geiß et al. (J Math Bio 77(5), 2018).
Collapse
Affiliation(s)
- Marc Hellmuth
- Institute of Mathematics and Computer Science, University of Greifswald, Walther-Rathenau-Straße 47, 17487, Greifswald, Germany. .,Center for Bioinformatics, Saarland University, Building E 2.1, P.O. Box 151150, 66041, Saarbrücken, Germany.
| | - Carsten R Seemann
- Bioinformatics Group, Department of Computer Science; and Interdisciplinary Center of Bioinformatics, Leipzig University, Härtelstraße 16-18, 04107, Leipzig, Germany.,Max Planck Institute for Mathematics in the Sciences, Inselstraße 22, 04103, Leipzig, Germany
| |
Collapse
|
4
|
Prabha R, Singh DP. Cyanobacterial phylogenetic analysis based on phylogenomics approaches render evolutionary diversification and adaptation: an overview of representative orders. 3 Biotech 2019; 9:87. [PMID: 30800598 DOI: 10.1007/s13205-019-1635-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2018] [Accepted: 02/11/2019] [Indexed: 12/12/2022] Open
Abstract
Phylogenetic studies based on a definite set of marker genes usually reconstruct evolutionary relationships among the prokaryotic species. Based on specific target sequences, such studies represent variations and allow identification of similarities or dissimilarities in organisms. With the advent of completely sequenced genomes and accumulation of information on whole prokaryotic genomes, phylogenetic reconstructions should be considered more reliable if they are ideally based on entire genomes to resolve phylogenetic interest. We applied phylogenomics approaches taking into account completely sequenced cyanobacterial genomes to reconstruct underlying species that represented major taxonomic classes and belonged to distinctly different habitats (freshwater, marine, soils, and rocks). We did not rely on describing phylogeny of all representative class of cyanobacterial species on the basis of only ribosomal gene, 16S rDNA gene. In contrast, we analyzed combined molecular marker and phylogenomics approaches (genome alignment, gene content and gene order, composition vector and protein domain content) for accurately inferring phylogenetic relationship of species. We have shown that this approach reflects the impact of evolution on the organisms and considers connects with the ecological adaptation in cyanobacteria in different habitats. Analysis revealed that the members from marine habitat occupy different profile than those from freshwater. Impact of GC content and genomic repetitiveness over the diversification of cyanobacterial species and their possible role in adaptation was also reflected. Members occupying similar habitats cover more evolutionary distance together and also evolve various strategies for adaptation and survival either through genomic repetitiveness or preferences for genes of particular functions or modified GC content. Genomes undergo different changes for their adaptation in diverse habitats.
Collapse
Affiliation(s)
- Ratna Prabha
- 1ICAR-National Bureau of Agriculturally Important Microorganisms, Kushmaur, Maunath Bhanjan, 275101 India
- 2Department of Biotechnology, Mewar University, Gangrar, Chittorgarh, Rajasthan India
| | - Dhananjaya P Singh
- 1ICAR-National Bureau of Agriculturally Important Microorganisms, Kushmaur, Maunath Bhanjan, 275101 India
| |
Collapse
|
5
|
A phylogenomic and molecular markers based taxonomic framework for members of the order Entomoplasmatales: proposal for an emended order Mycoplasmatales containing the family Spiroplasmataceae and emended family Mycoplasmataceae comprised of six genera. Antonie van Leeuwenhoek 2018; 112:561-588. [PMID: 30392177 DOI: 10.1007/s10482-018-1188-4] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Accepted: 10/14/2018] [Indexed: 01/01/2023]
Abstract
The "Spiroplasma cluster" is a taxonomically heterogeneous assemblage within the phylum Tenericutes encompassing different Entomoplasmatales species as well as the genus Mycoplasma, type genus of the order Mycoplasmatales. Within this cluster, the family Entomoplasmataceae contains two non-cohesive genera Entomoplasma and Mesoplasma with their members exhibiting extensive polyphyletic branching; additionally, the genus Mycoplasma is also embedded within this family. Genome sequences are now available for all 19 Entomoplasmataceae species with validly published names, as well as 6 of the 7 species from the genus Mycoplasma. With the aim of developing a reliable phylogenetic and taxonomic framework for the family Entomoplasmataceae, exhaustive phylogenetic and comparative genomic studies were carried out on these genome sequences. Phylogenetic trees were constructed based on concatenated sequences of 121 core proteins for this cluster, 67 conserved proteins shared with the phylum Firmicutes, 40 ribosomal proteins, three major subunits of RNA polymerase (RpoA, B and C) by different means and also for the 16S rRNA gene sequences. The interspecies relationships as well as different species groups observed in these trees were identical and robustly resolved. In all of these trees, members of the genera Mesoplasma and Entomoplasma formed three and two distinct clades, respectively, which were interspersed among the members of the other genus. The observed species groupings in the phylogenetic trees are independently strongly supported by our identification of 103 novel molecular markers or synapomorphies in the forms of conserved signature indels and conserved signature proteins, which are uniquely shared by the members of different observed species clades. To account for the different observed species clades, we are proposing a division of the genus Mesoplasma into an emended genus Mesoplasma and two new genera Tullyiplasma gen. nov. and Edwardiiplasma gen. nov. Likewise, to recognize the distinct species groupings of Entomoplasma, we are proposing its division into an emended genus Entomoplasma and a new genus Williamsoniiplasma gen. nov. Lastly, to rectify the long-existing taxonomic anomaly caused by the presence of genus Mycoplasma (order Mycoplasmatales) within the Entomoplasmatales, we are proposing an emendation of the family Mycoplasmataceae to include both Entomoplasmataceae plus Mycoplasma species and an emendation of the order Mycoplasmatales, which now comprises of the emended family Mycoplasmataceae and the family Spiroplasmataceae. The taxonomic reclassifications proposed here accurately reflect the species relationships within this group of Tenericutes and they should lead to a better understanding of their biological and pathogenic characteristics.
Collapse
|
6
|
Segev E, Pasternak Z, Ben Sasson T, Jurkevitch E, Gonen M. Automatic identification of optimal marker genes for phenotypic and taxonomic groups of microorganisms. PLoS One 2018; 13:e0195537. [PMID: 29718935 PMCID: PMC5931505 DOI: 10.1371/journal.pone.0195537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2017] [Accepted: 03/23/2018] [Indexed: 11/18/2022] Open
Abstract
Finding optimal markers for microorganisms important in the medical, agricultural, environmental or ecological fields is of great importance. Thousands of complete microbial genomes now available allow us, for the first time, to exhaustively identify marker proteins for groups of microbial organisms. In this work, we model the biological task as the well-known mathematical “hitting set” problem, solving it based on both greedy and randomized approximation algorithms. We identify unique markers for 17 phenotypic and taxonomic microbial groups, including proteins related to the nitrite reductase enzyme as markers for the non-anammox nitrifying bacteria group, and two transcription regulation proteins, nusG and yhiF, as markers for the Archaea and Escherichia/Shigella taxonomic groups, respectively. Additionally, we identify marker proteins for three subtypes of pathogenic E. coli, which previously had no known optimal markers. Practically, depending on the completeness of the database this algorithm can be used for identification of marker genes for any microbial group, these marker genes may be prime candidates for the understanding of the genetic basis of the group's phenotype or to help discover novel functions which are uniquely shared among a group of microbes. We show that our method is both theoretically and practically efficient, while establishing an upper bound on its time complexity and approximation ratio; thus, it promises to remain efficient and permit the identification of marker proteins that are specific to phenotypic or taxonomic groups, even as more and more bacterial genomes are being sequenced.
Collapse
Affiliation(s)
- Elad Segev
- Department of Mathematics, Holon Institute of Technology, Holon, Israel
- * E-mail:
| | - Zohar Pasternak
- Department of Plant Pathology and Microbiology, Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Tom Ben Sasson
- Department of Mathematics and Computer Science, The Open University of Israel, Raanana, Israel
| | - Edouard Jurkevitch
- Department of Plant Pathology and Microbiology, Faculty of Agriculture, Food and Environment, The Hebrew University of Jerusalem, Rehovot, Israel
| | - Mira Gonen
- Department of Computer Science, Ariel University, Ariel, Israel
| |
Collapse
|
7
|
Gupta RS, Sawnani S, Adeolu M, Alnajar S, Oren A. Phylogenetic framework for the phylum Tenericutes based on genome sequence data: proposal for the creation of a new order Mycoplasmoidales ord. nov., containing two new families Mycoplasmoidaceae fam. nov. and Metamycoplasmataceae fam. nov. harbouring Eperythrozoon, Ureaplasma and five novel genera. Antonie van Leeuwenhoek 2018; 111:1583-1630. [PMID: 29556819 DOI: 10.1007/s10482-018-1047-3] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2017] [Accepted: 02/08/2018] [Indexed: 12/19/2022]
Abstract
The genus Mycoplasma, including species earlier classified in the genera Eperythrozoon and Haemobartonella, contains ~ 120 species and constitutes an extensively polyphyletic assemblage of bacteria within the phylum Tenericutes. Due to their small genome sizes and lack of unique characteristics, the relationships among the mycoplasmas/Tenericutes are not reliably discerned. Using genome sequences for 140 Tenericutes, their evolutionary relationships were examined using multiple independent approaches. Phylogenomic trees were constructed for 63 conserved proteins, 45 ribosomal proteins, three main subunits of RNA polymerase and 16S rRNA gene sequences. In all of these trees, Tenericutes species reliably grouped into four main clades designated as the "Acholeplasma", "Spiroplasma", "Pneumoniae" and "Hominis" clusters. These clades are also distinguished based on a similarity matrix constructed based on 16S rRNA gene sequences. Mycoplasma species were dispersed across 3 of these 4 clades highlighting their extensive polyphyly. In parallel, our comparative genomic analyses have identified > 100 conserved signature indels (CSIs) and 14 conserved signature proteins (CSPs), which are uniquely shared by the members of four identified clades, strongly supporting their monophyly and identifying them in molecular terms. Mycoplasma mycoides, the type species of the genus Mycoplasma, and a small number of other Mycoplasma species, formed a strongly supported clade within the "Spiroplasma" cluster. Nine CSIs and 14 CSPs reliably distinguish this clade from all other Mycoplasmatales species. The remainder of the Mycoplasmatales species are part of the "Pneumoniae" and "Hominis" clusters, which group together in phylogenetic trees. Here we are proposing that the order Mycoplasmatales should be emended to encompass only the Mycoplasma species within the "Spiroplasma" cluster and that a new order, Mycoplasmoidales ord. nov., should be created to encompass the other Mycoplasma species. The "Pneumoniae" and the "Hominis" clusters are proposed as two new families, Mycoplasmoidaceae fam. nov., which includes the genera Eperythrozoon, Ureaplasma, and the newly proposed genera Malacoplasma and Mycoplasmoides, and Metamycoplasmataceae fam. nov. to contain the newly proposed genera Metamycoplasma, Mycoplasmopsis, and Mesomycoplasma. The results presented here allow reliable discernment, both in phylogenetic and molecular terms, of the members of the two proposed families as well as different described genera within these families including members of the genus Eperythrozoon, which is comprised of uncultivable organisms. The taxonomic reclassifications proposed here, which more accurately portray the genetic diversity among the Tenericutes/Mycoplasma species, provide a new framework for understanding the biological and clinical aspects of these important microbes.
Collapse
Affiliation(s)
- Radhey S Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON, L8N 3Z5, Canada.
| | - Sahil Sawnani
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON, L8N 3Z5, Canada
| | - Mobolaji Adeolu
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON, L8N 3Z5, Canada
| | - Seema Alnajar
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON, L8N 3Z5, Canada
| | - Aharon Oren
- Department of Plant and Environmental Sciences, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, The Edmond J. Safra Campus, 9190401, Jerusalem, Israel
| |
Collapse
|
8
|
Gupta RS, Lo B, Son J. Phylogenomics and Comparative Genomic Studies Robustly Support Division of the Genus Mycobacterium into an Emended Genus Mycobacterium and Four Novel Genera. Front Microbiol 2018; 9:67. [PMID: 29497402 PMCID: PMC5819568 DOI: 10.3389/fmicb.2018.00067] [Citation(s) in RCA: 301] [Impact Index Per Article: 50.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2017] [Accepted: 01/11/2018] [Indexed: 01/08/2023] Open
Abstract
The genus Mycobacterium contains 188 species including several major human pathogens as well as numerous other environmental species. We report here comprehensive phylogenomics and comparative genomic analyses on 150 genomes of Mycobacterium species to understand their interrelationships. Phylogenetic trees were constructed for the 150 species based on 1941 core proteins for the genus Mycobacterium, 136 core proteins for the phylum Actinobacteria and 8 other conserved proteins. Additionally, the overall genome similarity amongst the Mycobacterium species was determined based on average amino acid identity of the conserved protein families. The results from these analyses consistently support the existence of five distinct monophyletic groups within the genus Mycobacterium at the highest level, which are designated as the "Tuberculosis-Simiae," "Terrae," "Triviale," "Fortuitum-Vaccae," and "Abscessus-Chelonae" clades. Some of these clades have also been observed in earlier phylogenetic studies. Of these clades, the "Abscessus-Chelonae" clade forms the deepest branching lineage and does not form a monophyletic grouping with the "Fortuitum-Vaccae" clade of fast-growing species. In parallel, our comparative analyses of proteins from mycobacterial genomes have identified 172 molecular signatures in the form of conserved signature indels and conserved signature proteins, which are uniquely shared by either all Mycobacterium species or by members of the five identified clades. The identified molecular signatures (or synapomorphies) provide strong independent evidence for the monophyly of the genus Mycobacterium and the five described clades and they provide reliable means for the demarcation of these clades and for their diagnostics. Based on the results of our comprehensive phylogenomic analyses and numerous identified molecular signatures, which consistently and strongly support the division of known mycobacterial species into the five described clades, we propose here division of the genus Mycobacterium into an emended genus Mycobacterium encompassing the "Tuberculosis-Simiae" clade, which includes all of the major human pathogens, and four novel genera viz. Mycolicibacterium gen. nov., Mycolicibacter gen. nov., Mycolicibacillus gen. nov. and Mycobacteroides gen. nov. corresponding to the "Fortuitum-Vaccae," "Terrae," "Triviale," and "Abscessus-Chelonae" clades, respectively. With the division of mycobacterial species into these five distinct groups, attention can now be focused on unique genetic and molecular characteristics that differentiate members of these groups.
Collapse
Affiliation(s)
- Radhey S. Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, CA, Canada
| | | | | |
Collapse
|
9
|
Hellmuth M, Hernandez-Rosales M, Long Y, Stadler PF. Inferring phylogenetic trees from the knowledge of rare evolutionary events. J Math Biol 2017; 76:1623-1653. [PMID: 29218395 DOI: 10.1007/s00285-017-1194-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2016] [Revised: 11/16/2017] [Indexed: 10/18/2022]
Abstract
Rare events have played an increasing role in molecular phylogenetics as potentially homoplasy-poor characters. In this contribution we analyze the phylogenetic information content from a combinatorial point of view by considering the binary relation on the set of taxa defined by the existence of a single event separating two taxa. We show that the graph-representation of this relation must be a tree. Moreover, we characterize completely the relationship between the tree of such relations and the underlying phylogenetic tree. With directed operations such as tandem-duplication-random-loss events in mind we demonstrate how non-symmetric information constrains the position of the root in the partially reconstructed phylogeny.
Collapse
Affiliation(s)
- Marc Hellmuth
- Department of Mathematics and Computer Science, University of Greifswald, Walther-Rathenau-Straße 47, 17487, Greifswald, Germany.,Center for Bioinformatics, Saarland University, Building E 2.1, P.O. Box 151150, 66041, Saarbrücken, Germany
| | - Maribel Hernandez-Rosales
- CONACYT-Instituto de Matemáticas, UNAM Juriquilla, Blvd. Juriquilla 3001, 76230, Juriquilla, Querétaro, QRO, Mexico
| | - Yangjing Long
- Department of Mathematics and Computer Science, University of Greifswald, Walther-Rathenau-Straße 47, 17487, Greifswald, Germany. .,School of Mathematical Sciences, Central China Normal University, Luoyu Road 152, Wuhan, 430079, Hubei, China.
| | - Peter F Stadler
- Bioinformatics Group, Department of Computer Science, Interdisciplinary Center of Bioinformatics, German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Competence Center for Scalable Data Services and Solutions, and Leipzig Research Center for Civilization Diseases, Leipzig University, Härtelstraße 16-18, 04107, Leipzig, Germany.,Max-Planck-Institute for Mathematics in the Sciences, Inselstraße 22, 04103, Leipzig, Germany.,Inst. f. Theoretical Chemistry, University of Vienna, Währingerstraße 17, 1090, Vienna, Austria.,Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe, NM, 87501, USA
| |
Collapse
|
10
|
Palmer M, Steenkamp ET, Coetzee MPA, Chan WY, van Zyl E, De Maayer P, Coutinho TA, Blom J, Smits THM, Duffy B, Venter SN. Phylogenomic resolution of the bacterial genus Pantoea and its relationship with Erwinia and Tatumella. Antonie van Leeuwenhoek 2017; 110:1287-1309. [PMID: 28255640 DOI: 10.1007/s10482-017-0852-4] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/25/2016] [Accepted: 02/23/2017] [Indexed: 11/29/2022]
Abstract
Investigation of the evolutionary relationships between related bacterial species and genera with a variety of lifestyles have gained popularity in recent years. For analysing the evolution of specific traits, however, a robust phylogeny is essential. In this study we examined the evolutionary relationships among the closely related genera Erwinia, Tatumella and Pantoea, and also attempted to resolve the species relationships within Pantoea. To accomplish this, we used the whole genome sequence data for 35 different strains belonging to these three genera, as well as nine outgroup taxa. Multigene datasets consisting of the 1039 genes shared by these 44 strains were then generated and subjected to maximum likelihood phylogenetic analyses, after which the results were compared to those using conventional multi-locus sequence analysis (MLSA) and ribosomal MLSA (rMLSA) approaches. The robustness of the respective phylogenies was then explored by considering the factors typically responsible for destabilizing phylogenetic trees. We found that the nucleotide datasets employed in the MLSA, rMLSA and 1039-gene datasets contained significant levels of homoplasy, substitution saturation and differential codon usage, all of which likely gave rise to the observed lineage specific rate heterogeneity. The effects of these factors were much less pronounced in the amino acid dataset for the 1039 genes, which allowed reconstruction of a fully supported and resolved phylogeny. The robustness of this amino acid tree was also supported by different subsets of the 1039 genes. In contrast to the smaller datasets (MLSA and rMLSA), the 1039 amino acid tree was also not as sensitive to long-branch attraction. The robust and well-supported evolutionary hypothesis for the three genera, which confidently resolved their various inter- and intrageneric relationships, represents a valuable resource for future studies. It will form the basis for studies aiming to understand the forces driving the divergence and maintenance of lineages, species and biological traits in this important group of bacteria.
Collapse
Affiliation(s)
- Marike Palmer
- Department of Microbiology and Plant Pathology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Emma T Steenkamp
- Department of Microbiology and Plant Pathology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Martin P A Coetzee
- Department of Genetics, Forestry and Agricultural Biotechnology Institute (FABI),, University of Pretoria, Pretoria, South Africa
| | - Wai-Yin Chan
- Department of Microbiology and Plant Pathology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Elritha van Zyl
- Department of Microbiology and Plant Pathology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Pieter De Maayer
- School of Molecular and Cell Biology, University of the Witwatersrand, Johannesburg, South Africa
| | - Teresa A Coutinho
- Department of Microbiology and Plant Pathology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa
| | - Jochen Blom
- Computational Genomics, Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, Germany
| | - Theo H M Smits
- Environmental Genomics and Systems Biology Research Group, Institute of Natural Resource Sciences, Zürich University of Applied Sciences (ZHAW), Wädenswil, Switzerland
| | - Brion Duffy
- Environmental Genomics and Systems Biology Research Group, Institute of Natural Resource Sciences, Zürich University of Applied Sciences (ZHAW), Wädenswil, Switzerland
| | - Stephanus N Venter
- Department of Microbiology and Plant Pathology, Forestry and Agricultural Biotechnology Institute (FABI), University of Pretoria, Pretoria, South Africa.
| |
Collapse
|
11
|
Gupta RS. Impact of genomics on the understanding of microbial evolution and classification: the importance of Darwin's views on classification. FEMS Microbiol Rev 2016; 40:520-53. [PMID: 27279642 DOI: 10.1093/femsre/fuw011] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/14/2016] [Indexed: 12/24/2022] Open
Abstract
Analyses of genome sequences, by some approaches, suggest that the widespread occurrence of horizontal gene transfers (HGTs) in prokaryotes disguises their evolutionary relationships and have led to questioning of the Darwinian model of evolution for prokaryotes. These inferences are critically examined in the light of comparative genome analysis, characteristic synapomorphies, phylogenetic trees and Darwin's views on examining evolutionary relationships. Genome sequences are enabling discovery of numerous molecular markers (synapomorphies) such as conserved signature indels (CSIs) and conserved signature proteins (CSPs), which are distinctive characteristics of different prokaryotic taxa. Based on these molecular markers, exhibiting high degree of specificity and predictive ability, numerous prokaryotic taxa of different ranks, currently identified based on the 16S rRNA gene trees, can now be reliably demarcated in molecular terms. Within all studied groups, multiple CSIs and CSPs have been identified for successive nested clades providing reliable information regarding their hierarchical relationships and these inferences are not affected by HGTs. These results strongly support Darwin's views on evolution and classification and supplement the current phylogenetic framework based on 16S rRNA in important respects. The identified molecular markers provide important means for developing novel diagnostics, therapeutics and for functional studies providing important insights regarding prokaryotic taxa.
Collapse
Affiliation(s)
- Radhey S Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON, Canada
| |
Collapse
|
12
|
Origin of Bacteriochlorophyll a and the Early Diversification of Photosynthesis. PLoS One 2016; 11:e0151250. [PMID: 26953697 PMCID: PMC4783071 DOI: 10.1371/journal.pone.0151250] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2016] [Accepted: 02/25/2016] [Indexed: 11/20/2022] Open
Abstract
Photosynthesis originated in the domain Bacteria billions of years ago; however, the identity of the last common ancestor to all phototrophic bacteria remains undetermined and speculative. Here I present the evolution of BchF or 3-vinyl-bacteriochlorophyll hydratase, an enzyme exclusively found in bacteria capable of synthetizing bacteriochlorophyll a. I show that BchF exists in two forms originating from an early divergence, one found in the phylum Chlorobi, including its paralogue BchV, and a second form that was ancestral to the enzyme found in the remaining anoxygenic phototrophic bacteria. The phylogeny of BchF is consistent with bacteriochlorophyll a evolving in an ancestral phototrophic bacterium that lived before the radiation event that gave rise to the phylum Chloroflexi, Chlorobi, Acidobacteria, Proteobacteria, and Gemmatimonadetes, but only after the divergence of Type I and Type II reaction centers. Consequently, it is suggested that the lack of phototrophy in many groups of extant bacteria is a derived trait.
Collapse
|
13
|
A New Comparative-Genomics Approach for Defining Phenotype-Specific Indicators Reveals Specific Genetic Markers in Predatory Bacteria. PLoS One 2015; 10:e0142933. [PMID: 26569499 PMCID: PMC4646340 DOI: 10.1371/journal.pone.0142933] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2015] [Accepted: 10/28/2015] [Indexed: 01/23/2023] Open
Abstract
Predatory bacteria seek and consume other live bacteria. Although belonging to taxonomically diverse groups, relatively few bacterial predator species are known. Consequently, it is difficult to assess the impact of predation within the bacterial realm. As no genetic signatures distinguishing them from non-predatory bacteria are known, genomic resources cannot be exploited to uncover novel predators. In order to identify genes specific to predatory bacteria, we developed a bioinformatic tool called DiffGene. This tool automatically identifies marker genes that are specific to phenotypic or taxonomic groups, by mapping the complete gene content of all available fully-sequenced genomes for the presence/absence of each gene in each genome. A putative ‘predator region’ of ~60 amino acids in the tryptophan 2,3-dioxygenase (TDO) protein was found to probably be a predator-specific marker. This region is found in all known obligate predator and a few facultative predator genomes, and is absent from most facultative predators and all non-predatory bacteria. We designed PCR primers that uniquely amplify a ~180bp-long sequence within the predators’ TDO gene, and validated them in monocultures as well as in metagenetic analysis of environmental wastewater samples. This marker, in addition to its usage in predator identification and phylogenetics, may finally permit reliable enumeration and cataloguing of predatory bacteria from environmental samples, as well as uncovering novel predators.
Collapse
|
14
|
Simm S, Keller M, Selymesi M, Schleiff E. The composition of the global and feature specific cyanobacterial core-genomes. Front Microbiol 2015; 6:219. [PMID: 25852675 PMCID: PMC4365693 DOI: 10.3389/fmicb.2015.00219] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2014] [Accepted: 03/04/2015] [Indexed: 12/21/2022] Open
Abstract
Cyanobacteria are photosynthetic prokaryotes important for many ecosystems with a high potential for biotechnological usage e.g., in the production of bioactive molecules. Either asks for a deep understanding of the functionality of cyanobacteria and their interaction with the environment. This in part can be inferred from the analysis of their genomes or proteomes. Today, many cyanobacterial genomes have been sequenced and annotated. This information can be used to identify biological pathways present in all cyanobacteria as proteins involved in such processes are encoded by a so called core-genome. However, beside identification of fundamental processes, genes specific for certain cyanobacterial features can be identified by a holistic genome analysis as well. We identified 559 genes that define the core-genome of 58 analyzed cyanobacteria, as well as three genes likely to be signature genes for thermophilic and 57 genes likely to be signature genes for heterocyst-forming cyanobacteria. To get insights into cyanobacterial systems for the interaction with the environment we also inspected the diversity of the outer membrane proteome with focus on β-barrel proteins. We observed that most of the transporting outer membrane β-barrel proteins are not globally conserved in the cyanobacterial phylum. In turn, the occurrence of β-barrel proteins shows high strain specificity. The core set of outer membrane proteins globally conserved in cyanobacteria comprises three proteins only, namely the outer membrane β-barrel assembly protein Omp85, the lipid A transfer protein LptD, and an OprB-type porin. Thus, we conclude that cyanobacteria have developed individual strategies for the interaction with the environment, while other intracellular processes like the regulation of the protein homeostasis are globally conserved.
Collapse
Affiliation(s)
- Stefan Simm
- Department of Biosciences, Molecular Cell Biology of Plants, Goethe University Frankfurt am Main, Germany
| | - Mario Keller
- Department of Biosciences, Molecular Cell Biology of Plants, Goethe University Frankfurt am Main, Germany
| | - Mario Selymesi
- Department of Biosciences, Molecular Cell Biology of Plants, Goethe University Frankfurt am Main, Germany
| | - Enrico Schleiff
- Department of Biosciences, Molecular Cell Biology of Plants, Goethe University Frankfurt am Main, Germany ; Cluster of Excellence Frankfurt, Goethe University Frankfurt am Main, Germany ; Center of Membrane Proteomics, Goethe University Frankfurt am Main, Germany ; Buchmann Institute of Molecular Life Sciences, Goethe University Frankfurt am Main, Germany
| |
Collapse
|
15
|
Gupta RS, Naushad S, Baker S. Phylogenomic analyses and molecular signatures for the class Halobacteria and its two major clades: a proposal for division of the class Halobacteria into an emended order Halobacteriales and two new orders, Haloferacales ord. nov. and Natrialbales ord. nov., containing the novel families Haloferacaceae fam. nov. and Natrialbaceae fam. nov. Int J Syst Evol Microbiol 2014; 65:1050-1069. [PMID: 25428416 DOI: 10.1099/ijs.0.070136-0] [Citation(s) in RCA: 122] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The Halobacteria constitute one of the largest groups within the Archaea. The hierarchical relationship among members of this large class, which comprises a single order and a single family, has proven difficult to determine based upon 16S rRNA gene trees and morphological and physiological characteristics. This work reports detailed phylogenetic and comparative genomic studies on >100 halobacterial (haloarchaeal) genomes containing representatives from 30 genera to investigate their evolutionary relationships. In phylogenetic trees reconstructed on the basis of 32 conserved proteins, using both neighbour-joining and maximum-likelihood methods, two major clades (clades A and B) encompassing nearly two-thirds of the sequenced haloarchaeal species were strongly supported. Clades grouping the same species/genera were also supported by the 16S rRNA gene trees and trees for several individual highly conserved proteins (RpoC, EF-Tu, UvrD, GyrA, EF-2/EF-G). In parallel, our comparative analyses of protein sequences from haloarchaeal genomes have identified numerous discrete molecular markers in the form of conserved signature indels (CSI) in protein sequences and conserved signature proteins (CSPs) that are found uniquely in specific groups of haloarchaea. Thirteen CSIs in proteins involved in diverse functions and 68 CSPs that are uniquely present in all or most genome-sequenced haloarchaea provide novel molecular means for distinguishing members of the class Halobacteria from all other prokaryotes. The members of clade A are distinguished from all other haloarchaea by the unique shared presence of two CSIs in the ribose operon protein and small GTP-binding protein and eight CSPs that are found specifically in members of this clade. Likewise, four CSIs in different proteins and five other CSPs are present uniquely in members of clade B and distinguish them from all other haloarchaea. Based upon their specific clustering in phylogenetic trees for different gene/protein sequences and the unique shared presence of large numbers of molecular signatures, members of clades A and B are indicated to be distinct from all other haloarchaea because of their uniquely shared evolutionary histories. Based upon these results, it is proposed that clades A and B be recognized as two new orders, Natrialbales ord. nov. and Haloferacales ord. nov., within the class Halobacteria, containing the novel families Natrialbaceae fam. nov. and Haloferacaceae fam. nov. Other members of the class Halobacteria that are not members of these two orders will remain part of the emended order Halobacteriales in an emended family Halobacteriaceae.
Collapse
Affiliation(s)
- Radhey S Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario, Canada L8N 3Z5
| | - Sohail Naushad
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario, Canada L8N 3Z5
| | - Sheridan Baker
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario, Canada L8N 3Z5
| |
Collapse
|
16
|
Ravinesan DA, Gupta RS. Molecular signatures for members of the genus Dehalococcoides and the class Dehalococcoidia. Int J Syst Evol Microbiol 2014; 64:2176-2181. [PMID: 24676731 DOI: 10.1099/ijs.0.057919-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The bacteria belonging to the class Dehalococcoidia, due to their ability to dehalogenate chlorinated compounds, are of much interest for bioremediation of contaminated sites. We report here comparative analyses on different genes/proteins from the genomes of members of the class Dehalococcoidia. These studies have identified numerous novel molecular markers in the forms of conserved signature indels (CSIs) in broadly distributed proteins and conserved signature genes/proteins (CSPs), which are uniquely found in members of the class Dehalococcoidia, but except for an isolated exception, they are not found in other sequenced bacterial genomes. Of these molecular markers, nine CSIs in divergent proteins and 19 CSPs are specific for members of the genera Dehalococcoides and Dehalogenimonas, providing potential molecular markers for the bacterial class Dehalococcoidia. Additionally, four CSIs in divergent proteins and 28 CSPs are only found in all members of the genus Dehalococcoides for which genome sequences are available, but they are absent in Dehalogenimonas lykanthroporepellens and in other bacteria. The gene sequences of several of these CSPs exhibiting specificity for the genus Dehalococcoides or the class Dehalococcoidia are highly conserved and PCR primers based upon them provide a novel means for identification of other related bacteria. Two other CSIs identified in this study in the SecD and aspartate carbomyltransferase proteins weakly support an affiliation of the class Dehalococcoidia with the other members of the phylum Chloroflexi.
Collapse
Affiliation(s)
- Dasha A Ravinesan
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON L8N 3Z5, Canada
| | - Radhey S Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON L8N 3Z5, Canada
| |
Collapse
|
17
|
Naushad HS, Lee B, Gupta RS. Conserved signature indels and signature proteins as novel tools for understanding microbial phylogeny and systematics: identification of molecular signatures that are specific for the phytopathogenic genera Dickeya, Pectobacterium and Brenneria. Int J Syst Evol Microbiol 2014; 64:366-383. [DOI: 10.1099/ijs.0.054213-0] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Genome sequences are enabling applications of different approaches to more clearly understand microbial phylogeny and systematics. Two of these approaches involve identification of conserved signature indels (CSIs) and conserved signature proteins (CSPs) that are specific for different lineages. These molecular markers provide novel and more definitive means for demarcation of prokaryotic taxa and for identification of species from these groups. Genome sequences are also enabling determination of phylogenetic relationships among species based upon sequences for multiple proteins. In this work, we have used all of these approaches for studying the phytopathogenic bacteria belonging to the genera
Dickeya
,
Pectobacterium
and
Brenneria
. Members of these genera, which cause numerous diseases in important food crops and ornamental plants, are presently distinguished mainly on the basis of their branching in phylogenetic trees. No biochemical or molecular characteristic is known that is uniquely shared by species from these genera. Hence, detailed studies using the above approaches were carried out on proteins from the genomes of these bacteria to identify molecular markers that are specific for them. In phylogenetic trees based upon concatenated sequences for 23 conserved proteins, members of the genera
Dickeya
,
Pectobacterium
and
Brenneria
formed a strongly supported clade within the other
Enterobacteriales
. Comparative analysis of protein sequences from the
Dickeya
,
Pectobacterium
and
Brenneria
genomes has identified 10 CSIs and five CSPs that are either uniquely or largely found in all genome-sequenced species from these genera, but not present in any other bacteria in the database. In addition, our analyses have identified 10 CSIs and 17 CSPs that are specifically present in either all or most sequenced
Dickeya
species/strains, and six CSIs and 19 CSPs that are uniquely found in the sequenced
Pectobacterium
genomes. Finally, our analysis also identified three CSIs and one CSP that are specifically shared by members of the genera
Pectobacterium
and
Brenneria
, but absent in species of the genus
Dickeya
, indicating that the former two genera shared a common ancestor exclusive of
Dickeya
. The identified CSIs and CSPs provide novel tools for identification of members of the genera
Dickeya
and
Pectobacterium
and for delimiting these taxa in molecular terms. Descriptions of the genera
Dickeya
and
Pectobacterium
have been revised to provide information for these molecular markers. Biochemical studies on these CSIs and CSPs, which are specific for these genera, may lead to discovery of novel properties that are unique to these bacteria and which could be targeted to develop antibacterial agents that are specific for these plant-pathogenic bacteria.
Collapse
Affiliation(s)
- Hafiz Sohail Naushad
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario, Canada L8N 3Z5
| | - Brian Lee
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario, Canada L8N 3Z5
| | - Radhey S. Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario, Canada L8N 3Z5
| |
Collapse
|
18
|
Seetharam A, Stuart GW. Whole genome phylogenies for multiple Drosophila species. BMC Res Notes 2012; 5:670. [PMID: 23210901 PMCID: PMC3531268 DOI: 10.1186/1756-0500-5-670] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2012] [Accepted: 11/27/2012] [Indexed: 11/23/2022] Open
Abstract
Background Reconstructing the evolutionary history of organisms using traditional phylogenetic methods may suffer from inaccurate sequence alignment. An alternative approach, particularly effective when whole genome sequences are available, is to employ methods that don’t use explicit sequence alignments. We extend a novel phylogenetic method based on Singular Value Decomposition (SVD) to reconstruct the phylogeny of 12 sequenced Drosophila species. SVD analysis provides accurate comparisons for a high fraction of sequences within whole genomes without the prior identification of orthologs or homologous sites. With this method all protein sequences are converted to peptide frequency vectors within a matrix that is decomposed to provide simplified vector representations for each protein of the genome in a reduced dimensional space. These vectors are summed together to provide a vector representation for each species, and the angle between these vectors provides distance measures that are used to construct species trees. Results An unfiltered whole genome analysis (193,622 predicted proteins) strongly supports the currently accepted phylogeny for 12 Drosophila species at higher dimensions except for the generally accepted but difficult to discern sister relationship between D. erecta and D. yakuba. Also, in accordance with previous studies, many sequences appear to support alternative phylogenies. In this case, we observed grouping of D. erecta with D. sechellia when approximately 55% to 95% of the proteins were removed using a filter based on projection values or by reducing resolution by using fewer dimensions. Similar results were obtained when just the melanogaster subgroup was analyzed. Conclusions These results indicate that using our novel phylogenetic method, it is possible to consult and interpret all predicted protein sequences within multiple whole genomes to produce accurate phylogenetic estimations of relatedness between Drosophila species. Furthermore, protein filtering can be effectively applied to reduce incongruence in the dataset as well as to generate alternative phylogenies.
Collapse
Affiliation(s)
- Arun Seetharam
- Department of Biology, Indiana State University, Terre Haute, Indiana 47809, USA
| | | |
Collapse
|
19
|
Gupta RS, Bhandari V, Naushad HS. Molecular Signatures for the PVC Clade (Planctomycetes, Verrucomicrobia, Chlamydiae, and Lentisphaerae) of Bacteria Provide Insights into Their Evolutionary Relationships. Front Microbiol 2012; 3:327. [PMID: 23060863 PMCID: PMC3444138 DOI: 10.3389/fmicb.2012.00327] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2012] [Accepted: 08/23/2012] [Indexed: 11/13/2022] Open
Abstract
The PVC superphylum is an amalgamation of species from the phyla Planctomycetes, Verrucomicrobia, and Chlamydiae, along with the Lentisphaerae, Poribacteria, and two other candidate divisions. The diverse species of this superphylum lack any significant marker that differentiates them from other bacteria. Recently, genome sequences for 37 species covering all of the main PVC groups of bacteria have become available. We have used these sequences to construct a phylogenetic tree based upon concatenated sequences for 16 proteins and identify molecular signatures in protein sequences that are specific for the species from these phyla or those providing molecular links among them. Of the useful molecular markers identified in the present work, six conserved signature indels (CSIs) in the proteins Cyt c oxidase, UvrD helicase, urease, and a helicase-domain containing protein are specific for the species from the Verrucomicrobia phylum; three other CSIs in an ABC transporter protein, cobyrinic acid ac-diamide synthase, and SpoVG protein are specific for the Planctomycetes species. Additionally, a 3 aa insert in the RpoB protein is uniquely present in all sequenced Chlamydiae, Verrucomicrobia, and Lentisphaerae species, providing evidence for the shared ancestry of the species from these three phyla. Lastly, we have also identified a conserved protein of unknown function that is exclusively found in all sequenced species from the phyla Chlamydiae, Verrucomicrobia, Lentisphaerae, and Planctomycetes suggesting a specific linkage among them. The absence of this protein in Poribacteria, which branches separately from other members of the PVC clade, indicates that it is not specifically related to the PVC clade of bacteria. The molecular markers described here in addition to clarifying the evolutionary relationships among the PVC clade of bacteria also provide novel tools for their identification and for genetic and biochemical studies on these organisms.
Collapse
Affiliation(s)
- Radhey S. Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster UniversityHamilton, ON, Canada
| | - Vaibhav Bhandari
- Department of Biochemistry and Biomedical Sciences, McMaster UniversityHamilton, ON, Canada
| | - Hafiz Sohail Naushad
- Department of Biochemistry and Biomedical Sciences, McMaster UniversityHamilton, ON, Canada
| |
Collapse
|
20
|
Bhandari V, Naushad HS, Gupta RS. Protein based molecular markers provide reliable means to understand prokaryotic phylogeny and support Darwinian mode of evolution. Front Cell Infect Microbiol 2012; 2:98. [PMID: 22919687 PMCID: PMC3417386 DOI: 10.3389/fcimb.2012.00098] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2012] [Accepted: 06/27/2012] [Indexed: 11/20/2022] Open
Abstract
The analyses of genome sequences have led to the proposal that lateral gene transfers (LGTs) among prokaryotes are so widespread that they disguise the interrelationships among these organisms. This has led to questioning of whether the Darwinian model of evolution is applicable to prokaryotic organisms. In this review, we discuss the usefulness of taxon-specific molecular markers such as conserved signature indels (CSIs) and conserved signature proteins (CSPs) for understanding the evolutionary relationships among prokaryotes and to assess the influence of LGTs on prokaryotic evolution. The analyses of genomic sequences have identified large numbers of CSIs and CSPs that are unique properties of different groups of prokaryotes ranging from phylum to genus levels. The species distribution patterns of these molecular signatures strongly support a tree-like vertical inheritance of the genes containing these molecular signatures that is consistent with phylogenetic trees. Recent detailed studies in this regard on the Thermotogae and Archaea, which are reviewed here, have identified large numbers of CSIs and CSPs that are specific for the species from these two taxa and a number of their major clades. The genetic changes responsible for these CSIs (and CSPs) initially likely occurred in the common ancestors of these taxa and then vertically transferred to various descendants. Although some CSIs and CSPs in unrelated groups of prokaryotes were identified, their small numbers and random occurrence has no apparent influence on the consistent tree-like branching pattern emerging from other markers. These results provide evidence that although LGT is an important evolutionary force, it does not mask the tree-like branching pattern of prokaryotes or understanding of their evolutionary relationships. The identified CSIs and CSPs also provide novel and highly specific means for identification of different groups of microbes and for taxonomical and biochemical studies.
Collapse
Affiliation(s)
- Vaibhav Bhandari
- Department of Biochemistry and Biomedical Sciences, McMaster University Hamilton, ON, Canada
| | | | | |
Collapse
|
21
|
Kahlke T, Goesmann A, Hjerde E, Willassen NP, Haugen P. Unique core genomes of the bacterial family vibrionaceae: insights into niche adaptation and speciation. BMC Genomics 2012; 13:179. [PMID: 22574681 PMCID: PMC3464603 DOI: 10.1186/1471-2164-13-179] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2011] [Accepted: 03/12/2012] [Indexed: 01/05/2023] Open
Abstract
Background The criteria for defining bacterial species and even the concept of bacterial species itself are under debate, and the discussion is apparently intensifying as more genome sequence data is becoming available. However, it is still unclear how the new advances in genomics should be used most efficiently to address this question. In this study we identify genes that are common to any group of genomes in our dataset, to determine whether genes specific to a particular taxon exist and to investigate their potential role in adaptation of bacteria to their specific niche. These genes were named unique core genes. Additionally, we investigate the existence and importance of unique core genes that are found in isolates of phylogenetically non-coherent groups. These groups of isolates, that share a genetic feature without sharing a closest common ancestor, are termed genophyletic groups. Results The bacterial family Vibrionaceae was used as the model, and we compiled and compared genome sequences of 64 different isolates. Using the software orthoMCL we determined clusters of homologous genes among the investigated genome sequences. We used multilocus sequence analysis to build a host phylogeny and mapped the numbers of unique core genes of all distinct groups of isolates onto the tree. The results show that unique core genes are more likely to be found in monophyletic groups of isolates. Genophyletic groups of isolates, in contrast, are less common especially for large groups of isolate. The subsequent annotation of unique core genes that are present in genophyletic groups indicate a high degree of horizontally transferred genes. Finally, the annotation of the unique core genes of Vibrio cholerae revealed genes involved in aerotaxis and biosynthesis of the iron-chelator vibriobactin. Conclusion The presented work indicates that genes specific for any taxon inside the bacterial family Vibrionaceae exist. These unique core genes encode conserved metabolic functions that can shed light on the adaptation of a species to its ecological niche. Additionally, our study suggests that unique core genes can be used to aid classification of bacteria and contribute to a bacterial species definition on a genomic level. Furthermore, these genes may be of importance in clinical diagnostics and drug development.
Collapse
Affiliation(s)
- Tim Kahlke
- Department of Chemistry, Faculty of Science and Technology, The Norwegian Structural Biology Centre, University of Tromsø, Tromsø, Norway.
| | | | | | | | | |
Collapse
|
22
|
Gao B, Gupta RS. Phylogenetic framework and molecular signatures for the main clades of the phylum Actinobacteria. Microbiol Mol Biol Rev 2012; 76:66-112. [PMID: 22390973 PMCID: PMC3294427 DOI: 10.1128/mmbr.05011-11] [Citation(s) in RCA: 167] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
The phylum Actinobacteria harbors many important human pathogens and also provides one of the richest sources of natural products, including numerous antibiotics and other compounds of biotechnological interest. Thus, a reliable phylogeny of this large phylum and the means to accurately identify its different constituent groups are of much interest. Detailed phylogenetic and comparative analyses of >150 actinobacterial genomes reported here form the basis for achieving these objectives. In phylogenetic trees based upon 35 conserved proteins, most of the main groups of Actinobacteria as well as a number of their superageneric clades are resolved. We also describe large numbers of molecular markers consisting of conserved signature indels in protein sequences and whole proteins that are specific for either all Actinobacteria or their different clades (viz., orders, families, genera, and subgenera) at various taxonomic levels. These signatures independently support the existence of different phylogenetic clades, and based upon them, it is now possible to delimit the phylum Actinobacteria (excluding Coriobacteriia) and most of its major groups in clear molecular terms. The species distribution patterns of these markers also provide important information regarding the interrelationships among different main orders of Actinobacteria. The identified molecular markers, in addition to enabling the development of a stable and reliable phylogenetic framework for this phylum, also provide novel and powerful means for the identification of different groups of Actinobacteria in diverse environments. Genetic and biochemical studies on these Actinobacteria-specific markers should lead to the discovery of novel biochemical and/or other properties that are unique to different groups of Actinobacteria.
Collapse
Affiliation(s)
- Beile Gao
- Department of Biochemistry and Biomedical Science, McMaster University, Hamilton, Ontario, Canada
| | | |
Collapse
|
23
|
Microbial systematics in the post-genomics era. Antonie van Leeuwenhoek 2011; 101:45-54. [DOI: 10.1007/s10482-011-9663-1] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/01/2011] [Accepted: 10/15/2011] [Indexed: 10/16/2022]
|
24
|
O'Malley MA, Koonin EV. How stands the Tree of Life a century and a half after The Origin? Biol Direct 2011; 6:32. [PMID: 21714936 PMCID: PMC3158114 DOI: 10.1186/1745-6150-6-32] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2011] [Accepted: 06/30/2011] [Indexed: 12/21/2022] Open
Abstract
We examine the Tree of Life (TOL) as an evolutionary hypothesis and a heuristic. The original TOL hypothesis has failed but a new "statistical TOL hypothesis" is promising. The TOL heuristic usefully organizes data without positing fundamental evolutionary truth.
Collapse
Affiliation(s)
- Maureen A O'Malley
- Department of Philosophy, Quadrangle A14, University of Sydney, NSW 2006, Australia
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD20894, USA
| |
Collapse
|
25
|
Phylogeny and evolution of the Archaea: one hundred genomes later. Curr Opin Microbiol 2011; 14:274-81. [PMID: 21632276 DOI: 10.1016/j.mib.2011.04.015] [Citation(s) in RCA: 153] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2011] [Revised: 04/13/2011] [Accepted: 04/18/2011] [Indexed: 02/07/2023]
|
26
|
Abstract
The species is a fundamental unit of biological organization, but its relevance for Bacteria and Archaea is still hotly debated. Even more controversial is whether the deeper branches of the ribosomal RNA-derived phylogenetic tree, such as the phyla, have ecological importance. Here, we discuss the ecological coherence of high bacterial taxa in the light of genome analyses and present examples of niche differentiation between deeply diverging groups in terrestrial and aquatic systems. The ecological relevance of high bacterial taxa has implications for bacterial taxonomy, evolution and ecology.
Collapse
|
27
|
Reyes-Prieto F, García-Chéquer AJ, Jaimes-Díaz H, Casique-Almazán J, Espinosa-Lara JM, Palma-Orozco R, Méndez-Tenorio A, Maldonado-Rodríguez R, Beattie KL. LifePrint: a novel k-tuple distance method for construction of phylogenetic trees. Adv Appl Bioinform Chem 2011; 4:13-27. [PMID: 21918634 PMCID: PMC3169951 DOI: 10.2147/aabc.s15021] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
PURPOSE Here we describe LifePrint, a sequence alignment-independent k-tuple distance method to estimate relatedness between complete genomes. METHODS We designed a representative sample of all possible DNA tuples of length 9 (9-tuples). The final sample comprises 1878 tuples (called the LifePrint set of 9-tuples; LPS9) that are distinct from each other by at least two internal and noncontiguous nucleotide differences. For validation of our k-tuple distance method, we analyzed several real and simulated viroid genomes. Using different distance metrics, we scrutinized diverse viroid genomes to estimate the k-tuple distances between these genomic sequences. Then we used the estimated genomic k-tuple distances to construct phylogenetic trees using the neighbor-joining algorithm. A comparison of the accuracy of LPS9 and the previously reported 5-tuple method was made using symmetric differences between the trees estimated from each method and a simulated "true" phylogenetic tree. RESULTS The identified optimal search scheme for LPS9 allows only up to two nucleotide differences between each 9-tuple and the scrutinized genome. Similarity search results of simulated viroid genomes indicate that, in most cases, LPS9 is able to detect single-base substitutions between genomes efficiently. Analysis of simulated genomic variants with a high proportion of base substitutions indicates that LPS9 is able to discern relationships between genomic variants with up to 40% of nucleotide substitution. CONCLUSION Our LPS9 method generates more accurate phylogenetic reconstructions than the previously proposed 5-tuples strategy. LPS9-reconstructed trees show higher bootstrap proportion values than distance trees derived from the 5-tuple method.
Collapse
Affiliation(s)
- Fabián Reyes-Prieto
- Laboratory of Biotechnology and Genomic Bioinformatics, Department of Biochemistry, National School of Biological Sciences, Mexico City, Mexico
| | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Molecular signatures for the Crenarchaeota and the Thaumarchaeota. Antonie van Leeuwenhoek 2010; 99:133-57. [PMID: 20711675 DOI: 10.1007/s10482-010-9488-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2010] [Accepted: 07/26/2010] [Indexed: 10/19/2022]
Abstract
Crenarchaeotes found in mesophilic marine environments were recently placed into a new phylum of Archaea called the Thaumarchaeota. However, very few molecular characteristics of this new phylum are currently known which can be used to distinguish them from the Crenarchaeota. In addition, their relationships to deep-branching archaeal lineages are unclear. We report here detailed analyses of protein sequences from Crenarchaeota and Thaumarchaeota that have identified many conserved signature indels (CSIs) and signature proteins (SPs) (i.e., proteins for which all significant blast hits are from these groups) that are specific for these archaeal groups. Of the identified signatures 6 CSIs and 13 SPs are specific for the Crenarchaeota phylum; 6 CSIs and >250 SPs are uniquely found in various Thaumarchaeota (viz. Cenarchaeum symbiosum, Nitrosopumilus maritimus and a number of uncultured marine crenarchaeotes) and 3 CSIs and ~10 SPs are found in both Thaumarchaeota and Crenarchaeota species. Some of the molecular signatures are also present in Korarchaeum cryptofilum, which forms the independent phylum Korarchaeota. Although some of these molecular signatures suggest a distant shared ancestry between Thaumarchaeota and Crenarchaeota, our identification of large numbers of Thaumarchaeota-specific proteins and their deep branching between the Crenarchaeota and Euryarchaeota phyla in phylogenetic trees shows that they are distinct from both Crenarchaeota and Euryarchaeota in both genetic and phylogenetic terms. These observations support the placement of marine mesophilic archaea into the separate phylum Thaumarchaeota. Additionally, many CSIs and SPs have been found that are specific for different orders within Crenarchaeota (viz. Sulfolobales-3 CSIs and 169 SPs, Thermoproteales-5 CSIs and 25 SPs, Desulfurococcales-4 SPs, and Sulfolobales and Desulfurococcales-2 CSIs and 18 SPs). The signatures described here provide novel means for distinguishing the Crenarchaeota and the Thaumarchaeota and for the classification of related and novel species in different environments. Functional studies on these signature proteins could lead to discovery of novel biochemical properties that are unique to these groups of archaea.
Collapse
|
29
|
Novel glycoside hydrolases identified by screening a Chinese Holstein dairy cow rumen-derived metagenome library. Appl Environ Microbiol 2010; 76:6701-5. [PMID: 20709844 DOI: 10.1128/aem.00361-10] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
One clone encoding glycoside hydrolases was identified through functional screening of a rumen bacterial artificial chromosome (BAC) library. Of the 68 open reading frames (ORFs) predicted, one ORF encodes a novel endo-β-1,4-xylanase with two catalytic domains of family GH43 and two cellulose-binding modules (CBMs) of family IV. Partial characterization showed that this endo-xylanase has a greater specific activity than a number of other xylanases over a wide temperature range at neutral pH and could be useful in some industrial applications.
Collapse
|
30
|
Stark M, Berger SA, Stamatakis A, von Mering C. MLTreeMap--accurate Maximum Likelihood placement of environmental DNA sequences into taxonomic and functional reference phylogenies. BMC Genomics 2010; 11:461. [PMID: 20687950 PMCID: PMC3091657 DOI: 10.1186/1471-2164-11-461] [Citation(s) in RCA: 96] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2010] [Accepted: 08/05/2010] [Indexed: 02/05/2023] Open
Abstract
Background Shotgun sequencing of environmental DNA is an essential technique for characterizing uncultivated microbes in situ. However, the taxonomic and functional assignment of the obtained sequence fragments remains a pressing problem. Results Existing algorithms are largely optimized for speed and coverage; in contrast, we present here a software framework that focuses on a restricted set of informative gene families, using Maximum Likelihood to assign these with the best possible accuracy. This framework ('MLTreeMap'; http://mltreemap.org/) uses raw nucleotide sequences as input, and includes hand-curated, extensible reference information. Conclusions We discuss how we validated our pipeline using complete genomes as well as simulated and actual environmental sequences.
Collapse
Affiliation(s)
- Manuel Stark
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Switzerland
| | | | | | | |
Collapse
|
31
|
Gupta RS. Molecular signatures for the main phyla of photosynthetic bacteria and their subgroups. PHOTOSYNTHESIS RESEARCH 2010; 104:357-372. [PMID: 20414806 DOI: 10.1007/s11120-010-9553-9] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/28/2009] [Accepted: 04/08/2010] [Indexed: 05/29/2023]
Abstract
The bacterial groups corresponding to different photosynthetic prokaryotes are presently identified mainly on the basis of their branching in phylogenetic trees. The availability of genome sequences is enabling identification of many molecular signatures that are specific for different groups of photosynthetic bacteria. Our recent work has identified large numbers of signatures consisting of conserved inserts or deletions (indels) in widely distributed proteins, as well as whole proteins that are specific for various sequenced species/strains from Cyanobacteria, Chlorobi, and Proteobacteria phyla. Based upon these signatures, it is now possible to identify/distinguish bacteria from these phyla of photosynthetic bacteria as well as their major subclades in clear molecular terms. The use of these signatures in conjunction with phylogenomic analyses, summarized here, is leading to a holistic picture concerning the branching order and evolutionary relationships among the above groups of photosynthetic bacteria. Although detailed studies in this regard have not yet been carried on Chloroflexi and Heliobacteriaceae, we have identified some conserved indels that are specific for these groups. Some of the conserved indels for the photosynthetic bacteria are present in photosynthesis-related proteins. These include a 4 aa insert in the pyruvate flavodoxin/ferridoxin oxidoreductase that is specific for the genus Chloroflexus, a 2 aa insert in magnesium chelatase that is uniquely shared by all Cyanobacteria except the deepest branching Clade A (Gloebacterales), a 6 aa insert in an A-type flavoprotein that is specific for various marine unicellular Cyanobacteria, a 2 aa insert in heme oxygenase that is specific for various Prochlorococcus strains/isolates, and 1 aa deletion in the protein protochlorophyllide oxidoreductase that is commonly shared by various Prochlorococcus strains except the deepest branching isolates MIT 9303 and MIT 9313. The identified CSIs are located in the structures of these proteins in surface loops indicating that they may be important in mediating protein-protein interactions. The cellular functions of these conserved indels, or most of the signature proteins are presently unknown, but they provide valuable means for discovering novel properties that are unique to different groups of photosynthetic bacteria.
Collapse
Affiliation(s)
- Radhey S Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario, Canada.
| |
Collapse
|
32
|
Gupta RS, Mathews DW. Signature proteins for the major clades of Cyanobacteria. BMC Evol Biol 2010; 10:24. [PMID: 20100331 PMCID: PMC2823733 DOI: 10.1186/1471-2148-10-24] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2009] [Accepted: 01/25/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The phylogeny and taxonomy of cyanobacteria is currently poorly understood due to paucity of reliable markers for identification and circumscription of its major clades. RESULTS A combination of phylogenomic and protein signature based approaches was used to characterize the major clades of cyanobacteria. Phylogenetic trees were constructed for 44 cyanobacteria based on 44 conserved proteins. In parallel, Blastp searches were carried out on each ORF in the genomes of Synechococcus WH8102, Synechocystis PCC6803, Nostoc PCC7120, Synechococcus JA-3-3Ab, Prochlorococcus MIT9215 and Prochlor. marinus subsp. marinus CCMP1375 to identify proteins that are specific for various main clades of cyanobacteria. These studies have identified 39 proteins that are specific for all (or most) cyanobacteria and large numbers of proteins for other cyanobacterial clades. The identified signature proteins include: (i) 14 proteins for a deep branching clade (Clade A) of Gloebacter violaceus and two diazotrophic Synechococcus strains (JA-3-3Ab and JA2-3-B'a); (ii) 5 proteins that are present in all other cyanobacteria except those from Clade A; (iii) 60 proteins that are specific for a clade (Clade C) consisting of various marine unicellular cyanobacteria (viz. Synechococcus and Prochlorococcus); (iv) 14 and 19 signature proteins that are specific for the Clade C Synechococcus and Prochlorococcus strains, respectively; (v) 67 proteins that are specific for the Low B/A ecotype Prochlorococcus strains, containing lower ratio of chl b/a2 and adapted to growth at high light intensities; (vi) 65 and 8 proteins that are specific for the Nostocales and Chroococcales orders, respectively; and (vii) 22 and 9 proteins that are uniquely shared by various Nostocales and Oscillatoriales orders, or by these two orders and the Chroococcales, respectively. We also describe 3 conserved indels in flavoprotein, heme oxygenase and protochlorophyllide oxidoreductase proteins that are specific for either Clade C cyanobacteria or for various subclades of Prochlorococcus. Many other conserved indels for cyanobacterial clades have been described recently. CONCLUSIONS These signature proteins and indels provide novel means for circumscription of various cyanobacterial clades in clear molecular terms. Their functional studies should lead to discovery of novel properties that are unique to these groups of cyanobacteria.
Collapse
Affiliation(s)
- Radhey S Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario, Canada.
| | | |
Collapse
|
33
|
Fournier PE, Raoult D. Bacterial genomes. Infect Dis (Lond) 2010. [DOI: 10.1016/b978-0-323-04579-7.00007-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
|
34
|
Ettema TJG, Andersson SGE. The alpha-proteobacteria: the Darwin finches of the bacterial world. Biol Lett 2009; 5:429-32. [PMID: 19324639 DOI: 10.1098/rsbl.2008.0793] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The alpha-proteobacteria represent one of the most diverse bacterial subdivisions, displaying extreme variations in lifestyle, geographical distribution and genome size. Species for which genome data are available have been classified into a species tree based on a conserved set of vertically inherited core genes. By mapping the variation in gene content onto the species tree, genomic changes can be associated with adaptations to specific growth niches. Genes for adaptive traits are mostly located in 'plasticity zones' in the bacterial genome, which also contain mobile elements and are highly variable across strains. By physically separating genes for information processing from genes involved in interactions with the surrounding environment, the rate of evolutionary change can be substantially enhanced for genes underlying adaptation to new growth habitats, possibly explaining the ecological success of the alpha-proteo-bacterial subdivision.
Collapse
Affiliation(s)
- Thijs J G Ettema
- Department of Molecular Evolution, Evolutionary Biology Center, 752 36 Uppsala, Sweden
| | | |
Collapse
|
35
|
Forterre P, Gribaldo S, Brochier-Armanet C. Happy together: genomic insights into the unique Nanoarchaeum/Ignicoccus association. J Biol 2009; 8:7. [PMID: 19216728 PMCID: PMC2656216 DOI: 10.1186/jbiol110] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
The complete genome sequence of the crenarchaeon Ignicoccus hospitalis published recently in Genome Biology provides a great leap forward in the dissection of its unique association with another hyperthermophilic archaeon, Nanoarchaeum equitans.
Collapse
|
36
|
Dutilh BE, He Y, Hekkelman ML, Huynen MA. Signature, a web server for taxonomic characterization of sequence samples using signature genes. Nucleic Acids Res 2008; 36:W470-4. [PMID: 18487625 PMCID: PMC2447722 DOI: 10.1093/nar/gkn277] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Signature genes are genes that are unique to a taxonomic clade and are common within it. They contain a wealth of information about clade-specific processes and hold a strong evolutionary signal that can be used to phylogenetically characterize a set of sequences, such as a metagenomics sample. As signature genes are based on gene content, they provide a means to assess the taxonomic origin of a sequence sample that is complementary to sequence-based analyses. Here, we introduce Signature (http://www.cmbi.ru.nl/signature), a web server that identifies the signature genes in a set of query sequences, and therewith phylogenetically characterizes it. The server produces a list of taxonomic clades that share signature genes with the set of query sequences, along with an insightful image of the tree of life, in which the clades are color coded based on the number of signature genes present. This allows the user to quickly see from which part(s) of the taxonomy the query sequences likely originate.
Collapse
Affiliation(s)
- Bas E Dutilh
- Center for Molecular and Biomolecular Informatics/Nijmegen Center for Molecular Life Sciences, Radboud University Nijmegen Medical Centre, Geert Grooteplein 28, 6525 GA, Nijmegen, The Netherlands.
| | | | | | | |
Collapse
|