1
|
Feng Y, Arsenault D, Louyakis AS, Altman-Price N, Gophna U, Papke RT, Gogarten JP. Using the pan-genomic framework for the discovery of genomic islands in the haloarchaeon Halorubrum ezzemoulense. mBio 2024; 15:e0040824. [PMID: 38619241 PMCID: PMC11078007 DOI: 10.1128/mbio.00408-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 03/19/2024] [Indexed: 04/16/2024] Open
Abstract
In this study, we use pan-genomics to characterize the genomic variability of the widely dispersed halophilic archaeal species Halorubrum ezzemoulense (Hez). We include a multi-regional sampling of newly sequenced, high-quality draft genomes. The pan-genome graph of the species reveals 50 genomic islands that represent rare accessory genetic capabilities available to members. Most notably, we observe rearrangements that have led to the insertion/recombination/replacement of mutually exclusive genomic islands in equivalent genome positions ("homeocassettes"). These conflicting islands encode for similar functions, but homologs from islands located between the same core genes exhibit high divergence on the amino acid level, while the neighboring core genes are nearly identical. Both islands of a homeocassette often coexist in the same geographic location, suggesting that either island may be beyond the reach of selective sweeps and that these loci of divergence between Hez members are maintained and persist long term. This implies that subsections of the population have different niche preferences and rare metabolic capabilities. After an evaluation of the gene content in the homeocassettes, we speculate that these islands may play a role in the speciation, niche adaptability, and group selection dynamics in Hez. Though homeocassettes are first described in this study, similar replacements and divergence of genes on genomic islands have been previously reported in other Haloarchaea and distantly related Archaea, suggesting that homeocassettes may be a feature in a wide range of organisms outside of Hez.IMPORTANCEThis study catalogs the rare genes discovered in strains of the species Halorubrum ezzemoulense (Hez), an obligate halophilic archaeon, through the perspective of its pan-genome. These rare genes are often found to be arranged on islands that confer metabolic and transport functions and contain genes that have eluded previous studies. The discovery of divergent, but homologous islands occupying equivalent genome positions ("homeocassettes") in different genomes, reveals significant new information on genome evolution in Hez. Homeocassette pairs encode for similar functions, but their dissimilarity and distribution imply high rates of recombination, different specializations, and niche preferences in Hez. The coexistence of both islands of a homeocassette pair in multiple environments demonstrates that both islands are beyond the reach of selective sweeps and that these genome content differences between strains persist long term. The switch between islands through recombination under different environmental conditions may lead to a greater range of niche adaptability in Hez.
Collapse
Affiliation(s)
- Yutian Feng
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Danielle Arsenault
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Artemis S. Louyakis
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Neta Altman-Price
- The Shmunis School of Biomedicine and Cancer Research, Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
- Avinoam Adam Department of Natural Sciences, The Open University of Israel, Raanana, Israel
| | - Uri Gophna
- The Shmunis School of Biomedicine and Cancer Research, Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - R. Thane Papke
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
| | - Johann Peter Gogarten
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, Connecticut, USA
| |
Collapse
|
2
|
Abstract
EMBL-EBI The European Bioinformatics Institute; E. coli Escherichia coli; E. faecalis Enterobacter faecalis; B. fragilis Bacteroides fragilis; B. vulgatus Bacteroides vulgatus; SaPIs Staphylococcus aureus pathogenicity islands; ARGs Antibiotic resistance genes; STEC Shiga toxigenic E. coli; Stx Shiga toxin; BLAST Basic Local Alignment Search Tool; TSST-1 Toxic shock toxin 1; RBPs Receptor-binding proteins; LPS lipopolysaccharide; OMVs Outer membrane vesicles; PT Phosphorothioate; BREX Bacteriophage exclusion; OCR Overcome classical restriction; Pgl Phage growth limitation; DISARM Defense island system associated with restrictionmodification; R-M system Restriction-modification system; BREX system Bacteriophage exclusion system; CRISPR Clustered regularly interspaced short palindromic repeats; Cas CRISPR-associated; PAMs Prospacer adjacent motifs; crRNA CRISPR RNA; SIE; OMPs; Superinfection exclusion; Outer membrane proteins; Abi Abortive infection; TA Toxin-antitoxin; TLR Toll-like receptor; APCs Antigen-presenting cells; DSS Dextran sulfate sodium; IELs Intraepithelial lymphocytes; FMT Fecal microbiota transfer; IFN-γ Interferon-gamma; IBD Inflammatory bowel disease; AgNPs Silver nanoparticles; MDSC Myeloid-derived suppressor cell; CRC Colorectal cancer; VLPs Virus-like particles; TMP Tape measure protein; PSMB4 Proteasome subunit beta type-4; ALD Alcohol-related liver disease; GVHD Graft-versus-host disease; ROS Reactive oxygen species; RA Rheumatoid arthritis; CCP Cyclic citrullinated protein; AMGs Accessory metabolic genes; T1DM Type 1 diabetes mellitus; T2DM Type 2 diabetes mellitus; SCFAs Short-chain fatty acids; GLP-1 Glucagon-like peptide-1; A. baumannii Acinetobacter baumannii; CpG Deoxycytidylinate-phosphodeoxyguanosine; PEG Polyethylene glycol; MetS Metabolic syndrome; OprM Outer membrane porin M.
Collapse
Affiliation(s)
- Han Shuwen
- Department of Colorectal Surgery and Oncology, Key Laboratory of Cancer Prevention and Intervention, Ministry of Education, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China,Department of Medical Oncology, Huzhou Central Hospital, Huzhou, China
| | - Ding Kefeng
- Department of Colorectal Surgery and Oncology, Key Laboratory of Cancer Prevention and Intervention, Ministry of Education, The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China,Department of Colorectal Surgery and Oncology, Cancer Center Zhejiang University, Hangzhou, China,CONTACT Ding Kefeng Department of Colorectal Surgery and Oncology, Key Laboratory of Cancer Prevention and Intervention, Ministry of Education, The Second Affiliated Hospital, Zhejiang University School of Medicine, 88 Jiefang Road, Building 6 room 2018, Hangzhou, Zhejiang310009, China
| |
Collapse
|
3
|
Rangel LT, Soucy SM, Setubal JC, Gogarten JP, Fournier GP. An efficient, non-phylogenetic method for detecting genes sharing evolutionary signals in phylogenomic datasets. Genome Biol Evol 2021; 13:6352501. [PMID: 34390574 PMCID: PMC8483891 DOI: 10.1093/gbe/evab187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/11/2021] [Indexed: 11/25/2022] Open
Abstract
Assessing the compatibility between gene family phylogenies is a crucial and often computationally demanding step in many phylogenomic analyses. Here, we describe the Evolutionary Similarity Index (IES), a means to assess shared evolution between gene families using a weighted orthogonal distance regression model applied to sequence distances. The utilization of pairwise distance matrices circumvents comparisons between gene tree topologies, which are inherently uncertain and sensitive to evolutionary model choice, phylogenetic reconstruction artifacts, and other sources of error. Furthermore, IES enables the many-to-many pairing of multiple copies between similarly evolving gene families. This is done by selecting non-overlapping pairs of copies, one from each assessed family, and yielding the least sum of squared residuals. Analyses of simulated gene family data sets show that IES’s accuracy is on par with popular tree-based methods while also less susceptible to noise introduced by sequence alignment and evolutionary model fitting. Applying IES to an empirical data set of 1,322 genes from 42 archaeal genomes identified eight major clusters of gene families with compatible evolutionary trends. The most cohesive cluster consisted of 62 genes with compatible evolutionary signal, which occur as both single-copy and multiple homologs per genome; phylogenetic analysis of concatenated alignments from this cluster produced a tree closely matching previously published species trees for Archaea. Four other clusters are mainly composed of accessory genes with limited distribution among Archaea and enriched toward specific metabolic functions. Pairwise evolutionary distances obtained from these accessory gene clusters suggest patterns of interphyla horizontal gene transfer. An IES implementation is available at https://github.com/lthiberiol/evolSimIndex.
Collapse
Affiliation(s)
- Luiz Thibério Rangel
- Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
- Corresponding author: E-mail:
| | - Shannon M Soucy
- Department of Biomedical Data Science, Geisel School of Medicine, Dartmouth College, Hanover, New Hampshire, USA
| | - João C Setubal
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, Brasil
| | - Johann Peter Gogarten
- Department of Molecular and Cell Biology, University of Connecticut, USA
- Institute for Systems Genomics, University of Connecticut, USA
| | - Gregory P Fournier
- Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| |
Collapse
|
4
|
Avni E, Snir S. A New Phylogenomic Approach For Quantifying Horizontal Gene Transfer Trends in Prokaryotes. Sci Rep 2020; 10:12425. [PMID: 32709941 PMCID: PMC7381616 DOI: 10.1038/s41598-020-62446-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2018] [Accepted: 01/27/2020] [Indexed: 11/09/2022] Open
Abstract
It is well established nowadays that among prokaryotes, various families of orthologous genes exhibit conflicting evolutionary history. A prime factor for this conflict is horizontal gene transfer (HGT) - the transfer of genetic material not via vertical descent. Thus, the prevalence of HGT is challenging the meaningfulness of the classical Tree of Life concept. Here we present a comprehensive study of HGT representing the entire prokaryotic world. We mainly rely on a novel analytic approach for analyzing an aggregate of gene histories, by means of the quartet plurality distribution (QPD) that we develop. Through the analysis of real and simulated data, QPD is used to reveal evidence of a barrier against HGT, separating the archaea from the bacteria and making HGT between the two domains, in general, quite rare. In contrast, bacteria's confined HGT is substantially more frequent than archaea's. Our approach also reveals that despite intensive HGT, a strong tree-like signal can be extracted, corroborating several previous works. Thus, QPD, which enables one to analytically combine information from an aggregate of gene trees, can be used for understanding patterns and rates of HGT in prokaryotes, as well as for validating or refuting models of horizontal genetic transfers and evolution in general.
Collapse
Affiliation(s)
- Eliran Avni
- Department of Evolutionary Biology, University of Haifa, Haifa, 31905, Israel.
| | - Sagi Snir
- Department of Evolutionary Biology, University of Haifa, Haifa, 31905, Israel.
| |
Collapse
|
5
|
Rangel LT, Marden J, Colston S, Setubal JC, Graf J, Gogarten JP. Identification and characterization of putative Aeromonas spp. T3SS effectors. PLoS One 2019; 14:e0214035. [PMID: 31163020 PMCID: PMC6548356 DOI: 10.1371/journal.pone.0214035] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2019] [Accepted: 05/21/2019] [Indexed: 11/23/2022] Open
Abstract
The genetic determinants of bacterial pathogenicity are highly variable between species and strains. However, a factor that is commonly associated with virulent Gram-negative bacteria, including many Aeromonas spp., is the type 3 secretion system (T3SS), which is used to inject effector proteins into target eukaryotic cells. In this study, we developed a bioinformatics pipeline to identify T3SS effector proteins, applied this approach to the genomes of 105 Aeromonas strains isolated from environmental, mutualistic, or pathogenic contexts and evaluated the cytotoxicity of the identified effectors through their heterologous expression in yeast. The developed pipeline uses a two-step approach, where candidate Aeromonas gene families are initially selected using Hidden Markov Model (HMM) profile searches against the Virulence Factors DataBase (VFDB), followed by strict comparisons against positive and negative control datasets, greatly reducing the number of false positives. This approach identified 21 Aeromonas T3SS likely effector families, of which 8 represent known or characterized effectors, while the remaining 13 have not previously been described in Aeromonas. We experimentally validated our in silico findings by assessing the cytotoxicity of representative effectors in Saccharomyces cerevisiae BY4741, with 15 out of 21 assayed proteins eliciting a cytotoxic effect in yeast. The results of this study demonstrate the utility of our approach, combining a novel in silico search method with in vivo experimental validation, and will be useful in future research aimed at identifying and authenticating bacterial effector proteins from other genera.
Collapse
Affiliation(s)
- Luiz Thiberio Rangel
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America
- Interunidades em Bioinformática, Universidade de São Paulo, São Paulo, Brasil
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, São Paulo, Brasil
| | - Jeremiah Marden
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America
| | - Sophie Colston
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America
| | - João Carlos Setubal
- Interunidades em Bioinformática, Universidade de São Paulo, São Paulo, Brasil
- Departamento de Bioquímica, Instituto de Química, Universidade de São Paulo, São Paulo, Brasil
| | - Joerg Graf
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America
- Institute for Systems Genomics, University of Connecticut, Storrs, Connecticut, United States of America
| | - Johann Peter Gogarten
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, United States of America
- Institute for Systems Genomics, University of Connecticut, Storrs, Connecticut, United States of America
| |
Collapse
|
6
|
Berube PM, Rasmussen A, Braakman R, Stepanauskas R, Chisholm SW. Emergence of trait variability through the lens of nitrogen assimilation in Prochlorococcus. eLife 2019; 8:41043. [PMID: 30706847 PMCID: PMC6370341 DOI: 10.7554/elife.41043] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2018] [Accepted: 01/31/2019] [Indexed: 12/12/2022] Open
Abstract
Intraspecific trait variability has important consequences for the function and stability of marine ecosystems. Here we examine variation in the ability to use nitrate across hundreds of Prochlorococcus genomes to better understand the modes of evolution influencing intraspecific allocation of ecologically important functions. Nitrate assimilation genes are absent in basal lineages but occur at an intermediate frequency that is randomly distributed within recently emerged clades. The distribution of nitrate assimilation genes within clades appears largely governed by vertical inheritance, gene loss, and homologous recombination. By mapping this process onto a model of Prochlorococcus’ macroevolution, we propose that niche-constructing adaptive radiations and subsequent niche partitioning set the stage for loss of nitrate assimilation genes from basal lineages as they specialized to lower light levels. Retention of these genes in recently emerged lineages has likely been facilitated by selection as they sequentially partitioned into niches where nitrate assimilation conferred a fitness benefit.
Collapse
Affiliation(s)
- Paul M Berube
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, United States
| | - Anna Rasmussen
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, United States
| | - Rogier Braakman
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, United States
| | | | - Sallie W Chisholm
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, United States.,Department of Biology, Massachusetts Institute of Technology, Cambridge, United States
| |
Collapse
|
7
|
Shalev Y, Soucy SM, Papke RT, Gogarten JP, Eichler J, Gophna U. Comparative Analysis of Surface Layer Glycoproteins and Genes Involved in Protein Glycosylation in the Genus Haloferax. Genes (Basel) 2018; 9:genes9030172. [PMID: 29558455 PMCID: PMC5867893 DOI: 10.3390/genes9030172] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2018] [Revised: 03/01/2018] [Accepted: 03/09/2018] [Indexed: 11/16/2022] Open
Abstract
Within the Haloferax genus, both the surface (S)-layer protein, and the glycans that can decorate it, vary between species, which can potentially result in many different surface types, analogous to bacterial serotypes. This variation may mediate phenotypes, such as sensitivity to different viruses and mating preferences. Here, we describe S-layer glycoproteins found in multiple Haloferax strains and perform comparative genomics analyses of major and alternative glycosylation clusters of isolates from two coastal sites. We analyze the phylogeny of individual glycosylation genes and demonstrate that while the major glycosylation cluster tends to be conserved among closely related strains, the alternative cluster is highly variable. Thus, geographically- and genetically-related strains may exhibit diverse surface structures to such an extent that no two isolates present an identical surface profile.
Collapse
Affiliation(s)
- Yarden Shalev
- School of Molecular and Cell Biology and Biotechnology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel.
| | - Shannon M Soucy
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269, USA.
| | - R Thane Papke
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269, USA.
| | - J Peter Gogarten
- Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269, USA.
| | - Jerry Eichler
- Department of Life Sciences, Ben Gurion University of the Negev, Beersheva 8410501, Israel.
| | - Uri Gophna
- School of Molecular and Cell Biology and Biotechnology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 69978, Israel.
| |
Collapse
|
8
|
Shakya M, Soucy SM, Zhaxybayeva O. Insights into origin and evolution of α-proteobacterial gene transfer agents. Virus Evol 2017; 3:vex036. [PMID: 29250433 PMCID: PMC5721377 DOI: 10.1093/ve/vex036] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Several bacterial and archaeal lineages produce nanostructures that morphologically resemble small tailed viruses, but, unlike most viruses, contain apparently random pieces of the host genome. Since these elements can deliver the packaged DNA to other cells, they were dubbed gene transfer agents (GTAs). Because many genes involved in GTA production have viral homologs, it has been hypothesized that the GTA ancestor was a virus. Whether GTAs represent an atypical virus, a defective virus, or a virus co-opted by the prokaryotes for some function, remains to be elucidated. To evaluate these possibilities, we examined the distribution and evolutionary histories of genes that encode a GTA in the α-proteobacterium Rhodobacter capsulatus (RcGTA). We report that although homologs of many individual RcGTA genes are abundant across bacteria and their viruses, RcGTA-like genomes are mainly found in one subclade of α-proteobacteria. When compared with the viral homologs, genes of the RcGTA-like genomes evolve significantly slower, and do not have higher %A+T nucleotides than their host chromosomes. Moreover, they appear to reside in stable regions of the bacterial chromosomes that are generally conserved across taxonomic orders. These findings argue against RcGTA being an atypical or a defective virus. Our phylogenetic analyses suggest that RcGTA ancestor likely originated in the lineage that gave rise to contemporary α-proteobacterial orders Rhizobiales, Rhodobacterales, Caulobacterales, Parvularculales, and Sphingomonadales, and since that time the RcGTA-like element has co-evolved with its host chromosomes. Such evolutionary history is compatible with maintenance of these elements by bacteria due to some selective advantage. As for many other prokaryotic traits, horizontal gene transfer played a substantial role in the evolution of RcGTA-like elements, not only in shaping its genome components within the orders, but also in occasional dissemination of RcGTA-like regions across the orders and even to different bacterial phyla.
Collapse
Affiliation(s)
- Migun Shakya
- Department of Biological Sciences, Dartmouth College, 78 College Street, Hanover, NH 03755, USA
| | - Shannon M Soucy
- Department of Biological Sciences, Dartmouth College, 78 College Street, Hanover, NH 03755, USA
| | - Olga Zhaxybayeva
- Department of Biological Sciences, Dartmouth College, 78 College Street, Hanover, NH 03755, USA.,Department of Computer Science, Dartmouth College, 6211 Sudikoff Lab, Hanover, NH 03755, USA
| |
Collapse
|
9
|
Koonin EV. Horizontal gene transfer: essentiality and evolvability in prokaryotes, and roles in evolutionary transitions. F1000Res 2016; 5. [PMID: 27508073 PMCID: PMC4962295 DOI: 10.12688/f1000research.8737.1] [Citation(s) in RCA: 93] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/18/2016] [Indexed: 01/01/2023] Open
Abstract
The wide spread of gene exchange and loss in the prokaryotic world has prompted the concept of ‘lateral genomics’ to the point of an outright denial of the relevance of phylogenetic trees for evolution. However, the pronounced coherence congruence of the topologies of numerous gene trees, particularly those for (nearly) universal genes, translates into the notion of a statistical tree of life (STOL), which reflects a central trend of vertical evolution. The STOL can be employed as a framework for reconstruction of the evolutionary processes in the prokaryotic world. Quantitatively, however, horizontal gene transfer (HGT) dominates microbial evolution, with the rate of gene gain and loss being comparable to the rate of point mutations and much greater than the duplication rate. Theoretical models of evolution suggest that HGT is essential for the survival of microbial populations that otherwise deteriorate due to the Muller’s ratchet effect. Apparently, at least some bacteria and archaea evolved dedicated vehicles for gene transfer that evolved from selfish elements such as plasmids and viruses. Recent phylogenomic analyses suggest that episodes of massive HGT were pivotal for the emergence of major groups of organisms such as multiple archaeal phyla as well as eukaryotes. Similar analyses appear to indicate that, in addition to donating hundreds of genes to the emerging eukaryotic lineage, mitochondrial endosymbiosis severely curtailed HGT. These results shed new light on the routes of evolutionary transitions, but caution is due given the inherent uncertainty of deep phylogenies.
Collapse
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
10
|
Boto L. Evolutionary change and phylogenetic relationships in light of horizontal gene transfer. J Biosci 2016; 40:465-72. [PMID: 25963270 DOI: 10.1007/s12038-015-9514-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Horizontal gene transfer has, over the past 25 years, become a part of evolutionary thinking. In the present paper I discuss horizontal gene transfer (HGT) in relation to contingency, natural selection, evolutionary change speed and the Tree-of-Life endeavour, with the aim of contributing to the understanding of the role of HGT in evolutionary processes. In addition, the challenges that HGT imposes on the current view of evolution are emphasized.
Collapse
Affiliation(s)
- Luis Boto
- Departamento de Biodiversidad y Biologia Evolutiva, Museo Nacional Ciencias Naturales, CSIC, C/ Jose Gutierrez Abascal 2, 28006, Madrid, Spain,
| |
Collapse
|
11
|
Warren JM, Simmons MP, Wu Z, Sloan DB. Linear Plasmids and the Rate of Sequence Evolution in Plant Mitochondrial Genomes. Genome Biol Evol 2016; 8:364-74. [PMID: 26759362 PMCID: PMC4779610 DOI: 10.1093/gbe/evw003] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/04/2016] [Indexed: 12/20/2022] Open
Abstract
The mitochondrial genomes of flowering plants experience frequent insertions of foreign sequences, including linear plasmids that also exist in standalone forms within mitochondria, but the history and phylogenetic distribution of plasmid insertions is not well known. Taking advantage of the increased availability of plant mitochondrial genome sequences, we performed phylogenetic analyses to reconstruct the evolutionary history of these plasmids and plasmid-derived insertions. Mitochondrial genomes from multiple land plant lineages (including liverworts, lycophytes, ferns, and gymnosperms) include fragmented remnants from ancient plasmid insertions. Such insertions are much more recent and widespread in angiosperms, in which approximately 75% of sequenced mitochondrial genomes contain identifiable plasmid insertions. Although conflicts between plasmid and angiosperm phylogenies provide clear evidence of repeated horizontal transfers, we were still able to detect significant phylogenetic concordance, indicating that mitochondrial plasmids have also experienced sustained periods of (effectively) vertical transmission in angiosperms. The observed levels of sequence divergence in plasmid-derived genes suggest that nucleotide substitution rates in these plasmids, which often encode their own viral-like DNA polymerases, are orders of magnitude higher than in mitochondrial chromosomes. Based on these results, we hypothesize that the periodic incorporation of mitochondrial genes into plasmids contributes to the remarkable heterogeneity in substitution rates among genes that has recently been discovered in some angiosperm mitochondrial genomes. In support of this hypothesis, we show that the recently acquired ψtrnP-trnW gene region in a maize linear plasmid is evolving significantly faster than homologous sequences that have been retained in the mitochondrial chromosome in closely related grasses.
Collapse
Affiliation(s)
| | - Mark P Simmons
- Department of Biology, Colorado State University, Fort Collins
| | - Zhiqiang Wu
- Department of Biology, Colorado State University, Fort Collins
| | - Daniel B Sloan
- Department of Biology, Colorado State University, Fort Collins
| |
Collapse
|
12
|
Fournier GP, Andam CP, Gogarten JP. Ancient horizontal gene transfer and the last common ancestors. BMC Evol Biol 2015; 15:70. [PMID: 25897759 PMCID: PMC4427996 DOI: 10.1186/s12862-015-0350-0] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2014] [Accepted: 04/14/2015] [Indexed: 01/22/2023] Open
Abstract
Background The genomic history of prokaryotic organismal lineages is marked by extensive horizontal gene transfer (HGT) between groups of organisms at all taxonomic levels. These HGT events have played an essential role in the origin and distribution of biological innovations. Analyses of ancient gene families show that HGT existed in the distant past, even at the time of the organismal last universal common ancestor (LUCA). Most gene transfers originated in lineages that have since gone extinct. Therefore, one cannot assume that the last common ancestors of each gene were all present in the same cell representing the cellular ancestor of all extant life. Results Organisms existing as part of a diverse ecosystem at the time of LUCA likely shared genetic material between lineages. If these other lineages persisted for some time, HGT with the descendants of LUCA could have continued into the bacterial and archaeal lineages. Phylogenetic analyses of aminoacyl-tRNA synthetase protein families support the hypothesis that the molecular common ancestors of the most ancient gene families did not all coincide in space and time. This is most apparent in the evolutionary histories of seryl-tRNA synthetase and threonyl-tRNA synthetase protein families, each containing highly divergent “rare” forms, as well as the sparse phylogenetic distributions of pyrrolysyl-tRNA synthetase, and the bacterial heterodimeric form of glycyl-tRNA synthetase. These topologies and phyletic distributions are consistent with horizontal transfers from ancient, likely extinct branches of the tree of life. Conclusions Of all the organisms that may have existed at the time of LUCA, by definition only one lineage is survived by known progeny; however, this lineage retains a genomic record of heterogeneous genetic origins. The evolutionary histories of aminoacyl-tRNA synthetases (aaRS) are especially informative in detecting this signal, as they perform primordial biological functions, have undergone several ancient HGT events, and contain many sites with low substitution rates allowing deep phylogenetic reconstruction. We conclude that some aaRS families contain groups that diverge before LUCA. We propose that these ancient gene variants be described by the term “hypnologs”, reflecting their ancient, reticulate origin from a time in life history that has been all but erased”. Electronic supplementary material The online version of this article (doi:10.1186/s12862-015-0350-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Gregory P Fournier
- Department of Earth, Atmospheric and Planetary Sciences, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA.
| | - Cheryl P Andam
- Department of Epidemiology, School of Public Health, Harvard University, Boston, MA, 02115, USA.
| | - Johann Peter Gogarten
- Department of Molecular and Cell Biology, and the Institute for Systems Genomics, University of Connecticut, 91 North Eagleville Road, Storrs, CT, 06269-3125, USA.
| |
Collapse
|
13
|
Ancestral Reconstruction of a Pre-LUCA Aminoacyl-tRNA Synthetase Ancestor Supports the Late Addition of Trp to the Genetic Code. J Mol Evol 2015; 80:171-85. [PMID: 25791872 DOI: 10.1007/s00239-015-9672-1] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2015] [Accepted: 03/09/2015] [Indexed: 01/14/2023]
Abstract
The genetic code was likely complete in its current form by the time of the last universal common ancestor (LUCA). Several scenarios have been proposed for explaining the code's pre-LUCA emergence and expansion, and the relative order of the appearance of amino acids used in translation. One co-evolutionary model of genetic code expansion proposes that at least some amino acids were added to the code by the ancient divergence of aminoacyl-tRNA synthetase (aaRS) families. Of all the amino acids used within the genetic code, Trp is most frequently claimed as a relatively recent addition. We observe that, since TrpRS and TyrRS are paralogous protein families retaining significant sequence similarity, the inferred sequence composition of their ancestor can be used to evaluate this co-evolutionary model of genetic code expansion. We show that ancestral sequence reconstructions of the pre-LUCA paralog ancestor of TyrRS and TrpRS have several sites containing Tyr, yet a complete absence of sites containing Trp. This is consistent with the paralog ancestor being specific for the utilization of Tyr, with Trp being a subsequent addition to the genetic code facilitated by a process of aaRS divergence and neofunctionalization. Only after this divergence could Trp be specifically encoded and incorporated into proteins, including the TyrRS and TrpRS descendant lineages themselves. This early absence of Trp is observed under both homogeneous and non-homogeneous models of ancestral sequence reconstruction. Simulations support that this observed absence of Trp is unlikely to be due to chance or model bias. These results support that the final stages of genetic code evolution occurred well within the "protein world," and that the presence-absence of Trp within conserved sites of ancient protein domains is a likely measure of their relative antiquity, permitting the relative timing of extremely early events within protein evolution before LUCA.
Collapse
|
14
|
Bioinformatic genome comparisons for taxonomic and phylogenetic assignments using Aeromonas as a test case. mBio 2014; 5:e02136. [PMID: 25406383 PMCID: PMC4251997 DOI: 10.1128/mbio.02136-14] [Citation(s) in RCA: 167] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Prokaryotic taxonomy is the underpinning of microbiology, as it provides a framework for the proper identification and naming of organisms. The “gold standard” of bacterial species delineation is the overall genome similarity determined by DNA-DNA hybridization (DDH), a technically rigorous yet sometimes variable method that may produce inconsistent results. Improvements in next-generation sequencing have resulted in an upsurge of bacterial genome sequences and bioinformatic tools that compare genomic data, such as average nucleotide identity (ANI), correlation of tetranucleotide frequencies, and the genome-to-genome distance calculator, or in silico DDH (isDDH). Here, we evaluate ANI and isDDH in combination with phylogenetic studies using Aeromonas, a taxonomically challenging genus with many described species and several strains that were reassigned to different species as a test case. We generated improved, high-quality draft genome sequences for 33 Aeromonas strains and combined them with 23 publicly available genomes. ANI and isDDH distances were determined and compared to phylogenies from multilocus sequence analysis of housekeeping genes, ribosomal proteins, and expanded core genes. The expanded core phylogenetic analysis suggested relationships between distant Aeromonas clades that were inconsistent with studies using fewer genes. ANI values of ≥96% and isDDH values of ≥70% consistently grouped genomes originating from strains of the same species together. Our study confirmed known misidentifications, validated the recent revisions in the nomenclature, and revealed that a number of genomes deposited in GenBank are misnamed. In addition, two strains were identified that may represent novel Aeromonas species. Improvements in DNA sequencing technologies have resulted in the ability to generate large numbers of high-quality draft genomes and led to a dramatic increase in the number of publically available genomes. This has allowed researchers to characterize microorganisms using genome data. Advantages of genome sequence-based classification include data and computing programs that can be readily shared, facilitating the standardization of taxonomic methodology and resolving conflicting identifications by providing greater uniformity in an overall analysis. Using Aeromonas as a test case, we compared and validated different approaches. Based on our analyses, we recommend cutoff values for distance measures for identifying species. Accurate species classification is critical not only to obviate the perpetuation of errors in public databases but also to ensure the validity of inferences made on the relationships among species within a genus and proper identification in clinical and veterinary diagnostic laboratories.
Collapse
|
15
|
Soucy SM, Fullmer MS, Papke RT, Gogarten JP. Inteins as indicators of gene flow in the halobacteria. Front Microbiol 2014; 5:299. [PMID: 25018750 PMCID: PMC4071816 DOI: 10.3389/fmicb.2014.00299] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2014] [Accepted: 05/30/2014] [Indexed: 12/21/2022] Open
Abstract
This research uses inteins, a type of mobile genetic element, to infer patterns of gene transfer within the Halobacteria. We surveyed 118 genomes representing 26 genera of Halobacteria for intein sequences. We then used the presence-absence profile, sequence similarity and phylogenies from the inteins recovered to explore how intein distribution can provide insight on the dynamics of gene flow between closely related and divergent organisms. We identified 24 proteins in the Halobacteria that have been invaded by inteins at some point in their evolutionary history, including two proteins not previously reported to contain an intein. Furthermore, the size of an intein is used as a heuristic for the phase of the intein's life cycle. Larger size inteins are assumed to be the canonical two domain inteins, consisting of self-splicing and homing endonuclease domains (HEN); smaller sizes are assumed to have lost the HEN domain. For many halobacterial groups the consensus phylogenetic signal derived from intein sequences is compatible with vertical inheritance or with a strong gene transfer bias creating these clusters. Regardless, the coexistence of intein-free and intein-containing alleles reveal ongoing transfer and loss of inteins within these groups. Inteins were frequently shared with other Euryarchaeota and among the Bacteria, with members of the Cyanobacteria (Cyanothece, Anabaena), Bacteriodetes (Salinibacter), Betaproteobacteria (Delftia, Acidovorax), Firmicutes (Halanaerobium), Actinobacteria (Longispora), and Deinococcus-Thermus-group.
Collapse
Affiliation(s)
- Shannon M Soucy
- Department of Molecular and Cell Biology, University of Connecticut Storrs, CT, USA
| | - Matthew S Fullmer
- Department of Molecular and Cell Biology, University of Connecticut Storrs, CT, USA
| | - R Thane Papke
- Department of Molecular and Cell Biology, University of Connecticut Storrs, CT, USA
| | | |
Collapse
|
16
|
Abstract
Carl Woese is known to the scientific community primarily through his landmark contributions to microbiology, in particular, his discovery of the third Domain of Life, which came to be known as the Archaea. While it is well known how he made this discovery, through the techniques he developed based on his studies of rRNA, the reasons why he was driven in this scientific direction, and what he saw as the principle outcome of his discovery--it was not the Archaea!--are not so widely appreciated. In this essay, I discuss his vision of evolution, one which transcends population genetics, and which has ramifications not only for our understanding of the origin of life on Earth and elsewhere, but also for our understanding of biology as a novel class of complex dynamical systems.
Collapse
Affiliation(s)
- Nigel Goldenfeld
- Institute for Universal Biology; Institute for Genomic Biology, and Department of Physics; University of Illinois at Urbana-Champaign; Urbana, IL USA
| |
Collapse
|
17
|
Guimaraes AMS, Santos AP, do Nascimento NC, Timenetsky J, Messick JB. Comparative genomics and phylogenomics of hemotrophic mycoplasmas. PLoS One 2014; 9:e91445. [PMID: 24642917 PMCID: PMC3958358 DOI: 10.1371/journal.pone.0091445] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2013] [Accepted: 02/10/2014] [Indexed: 11/18/2022] Open
Abstract
Hemotrophic mycoplasmas (hemoplasmas) are a group of animal pathogens of the Mollicutes class. Recently, the genomes of 8 hemoplasmas have been completely sequenced. The aim of this study was to gain a better understanding of their genomic features and relationship to other Mycoplasma species. The genome structure and dynamics of hemoplasmas were analyzed by evaluating gene synteny, adaptive evolution of paralogous gene families (PGF) and horizontal gene transfer (HGT). The Mollicutes class was then phylogenetically analyzed by constructing a distance matrix of the 16S rRNA genes and a phylogenetic tree with 32 conserved, concatenated proteins. Our results suggest that the hemoplasmas have dynamic genomes. The genome size variation (from 547 to 1,545 genes) indicates substantial gene gain/loss throughout evolution. Poorly conserved gene syntenies among hemoplasmas, positional shuffling of paralogous genes between strains, HGT, and codons under positive selection in PGFs were also observed. When compared to other Mollicutes species, the hemoplasmas experienced further metabolic reduction, and the 16S rRNA gene distance matrix of the available mollicutes suggests that these organisms presently constitute the most divergent clade within its class. Our phylogenetic tree of concatenated proteins showed some differences when compared to the 16S rRNA gene tree, but non-mycoplasma organisms, such as Ureaplasma spp. and Mesoplasma spp., continue to branch within Mycoplasma clades. In conclusion, while the hemoplasmas experienced further metabolic shrinkage through gene loss, PGFs with positively selected codons are likely beneficial to these species. Phylogeny of the mollicutes based on 16S rRNA genes or concatenated proteins do not obey the current taxonomy. The metabolism and genetic diversity of the mollicutes, the presence of HGT, and lack of standard for genus circumscription are likely to hinder attempts to classify these organisms based on phylogenetic analyses.
Collapse
Affiliation(s)
- Ana M. S. Guimaraes
- Department of Comparative Pathobiology, School of Veterinary Medicine, Purdue University, West Lafayette, Indiana, United States of America
- Coordenação de Aperfeiçoamento de Pessoal de Ensino Superior (CAPES)-Fulbright Program, Ministério da Educação, Brasília, Distrito Federal, Brazil
| | - Andrea P. Santos
- Department of Comparative Pathobiology, School of Veterinary Medicine, Purdue University, West Lafayette, Indiana, United States of America
| | - Naíla C. do Nascimento
- Department of Comparative Pathobiology, School of Veterinary Medicine, Purdue University, West Lafayette, Indiana, United States of America
| | - Jorge Timenetsky
- Department of Microbiology, Instituto de Ciências Biomédicas, Universidade de São Paulo, São Paulo, São Paulo, Brazil
| | - Joanne B. Messick
- Department of Comparative Pathobiology, School of Veterinary Medicine, Purdue University, West Lafayette, Indiana, United States of America
- * E-mail:
| |
Collapse
|
18
|
O'Malley MA. When integration fails: Prokaryote phylogeny and the tree of life. STUDIES IN HISTORY AND PHILOSOPHY OF BIOLOGICAL AND BIOMEDICAL SCIENCES 2013; 44:551-62. [PMID: 23137776 DOI: 10.1016/j.shpsc.2012.10.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
Much is being written these days about integration, its desirability and even its necessity when complex research problems are to be addressed. Seldom, however, do we hear much about the failure of such efforts. Because integration is an ongoing activity rather than a final achievement, and because today's literature about integration consists mostly of manifesto statements rather than precise descriptions, an examination of unsuccessful integration could be illuminating to understand better how it works. This paper will examine the case of prokaryote phylogeny and its apparent failure to achieve integration within broader tree-of-life accounts of evolutionary history (often called 'universal phylogeny'). Despite the fact that integrated databases exist of molecules pertinent to the phylogenetic reconstruction of all lineages of life, and even though the same methods can be used to construct phylogenies wherever the organisms fall on the tree of life, prokaryote phylogeny remains at best only partly integrated within tree-of-life efforts. I will examine why integration does not occur, compare it with integrative practices in animal and other eukaryote phylogeny, and reflect on whether there might be different expectations of what integration should achieve. Finally, I will draw some general conclusions about integration and its function as a 'meta-heuristic' in the normative commitments guiding scientific practice.
Collapse
Affiliation(s)
- Maureen A O'Malley
- Department of Philosophy, University of Sydney, Quadrangle A14, NSW 2006, Australia.
| |
Collapse
|
19
|
Skippington E, Ragan MA. Phylogeny rather than ecology or lifestyle biases the construction of Escherichia coli-Shigella genetic exchange communities. Open Biol 2013; 2:120112. [PMID: 23091700 PMCID: PMC3472396 DOI: 10.1098/rsob.120112] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2012] [Accepted: 08/20/2012] [Indexed: 11/12/2022] Open
Abstract
Genetic material can be transmitted not only vertically from parent to offspring, but also laterally (horizontally) from one bacterial lineage to another. Lateral genetic transfer is non-uniform; biases in its nature or frequency construct communities of genetic exchange. These biases have been proposed to arise from phylogenetic relatedness, shared ecology and/or common lifestyle. Here, we test these hypotheses using a graph-based abstraction of inferred genetic-exchange relationships among 27 Escherichia coli and Shigella genomes. We show that although barriers to inter-phylogenetic group lateral transfer are low, E. coli and Shigella are more likely to have exchanged genetic material with close relatives. We find little evidence of bias arising from shared environment or lifestyle. More than one-third of donor-recipient pairs in our analysis show some level of fragmentary gene transfer. Thus, within the E. coli-Shigella clade, intact genes and gene fragments have been disseminated non-uniformly and at appreciable frequency, constructing communities that transgress environmental and lifestyle boundaries.
Collapse
Affiliation(s)
- Elizabeth Skippington
- Institute for Molecular Bioscience and Australian Research Council Centre of Excellence in Bioinformatics, The University of Queensland, Brisbane, Queensland 4072, Australia
| | | |
Collapse
|
20
|
Hao W. Unrecognized fine-scale recombination can mimic the effects of adaptive radiation. Gene 2013; 518:483-8. [PMID: 23337592 DOI: 10.1016/j.gene.2012.12.107] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2012] [Revised: 12/19/2012] [Accepted: 12/25/2012] [Indexed: 11/17/2022]
Abstract
Gene sequences can undergo accelerated nucleotide changes and rapid diversification. The rapid sequence changes can then potentially lead to phylogenetic incongruence. Recently, Bodilis et al. (2011) observed artificial phylogenetic incongruence using the Pseudomonas surface protein gene oprF, and hypothesized that it was the result of a long-branch attraction artifact ultimately caused by adaptive radiation. In this study, an alternative hypothesis, namely fine-scale recombination, was tested on the same dataset. The results reveal that regions in oprF are of different evolutionary origins, and the mosaic gene structure resulted in confounding phylogenetic signals. These findings demonstrate that unrecognized fine-scale recombination can confound the phylogenetic interpretation and emphasize the limitation of using whole genes as the unit of phylogenetic analysis.
Collapse
Affiliation(s)
- Weilong Hao
- Department of Biological Sciences, Wayne State University, Detroit, MI 48202, USA.
| |
Collapse
|
21
|
Abstract
The aminoacyl-tRNA synthetases (aaRSs) are essential components of the protein synthesis machinery responsible for defining the genetic code by pairing the correct amino acids to their cognate tRNAs. The aaRSs are an ancient enzyme family believed to have origins that may predate the last common ancestor and as such they provide insights into the evolution and development of the extant genetic code. Although the aaRSs have long been viewed as a highly conserved group of enzymes, findings within the last couple of decades have started to demonstrate how diverse and versatile these enzymes really are. Beyond their central role in translation, aaRSs and their numerous homologs have evolved a wide array of alternative functions both inside and outside translation. Current understanding of the emergence of the aaRSs, and their subsequent evolution into a functionally diverse enzyme family, are discussed in this chapter.
Collapse
|
22
|
Eveleigh RJ, Meehan CJ, Archibald JM, Beiko RG. Being Aquifex aeolicus: Untangling a hyperthermophile's checkered past. Genome Biol Evol 2013; 5:2478-97. [PMID: 24281050 PMCID: PMC3879981 DOI: 10.1093/gbe/evt195] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/22/2013] [Indexed: 12/20/2022] Open
Abstract
Lateral gene transfer (LGT) is an important factor contributing to the evolution of prokaryotic genomes. The Aquificae are a hyperthermophilic bacterial group whose genes show affiliations to many other lineages, including the hyperthermophilic Thermotogae, the Proteobacteria, and the Archaea. Previous phylogenomic analyses focused on Aquifex aeolicus identified Thermotogae and Aquificae either as successive early branches or sisters in a rooted bacterial phylogeny, but many phylogenies and cellular traits have suggested a stronger affiliation with the Epsilonproteobacteria. Different scenarios for the evolution of the Aquificae yield different phylogenetic predictions. Here, we outline these scenarios and consider the fit of the available data, including three sequenced Aquificae genomes, to different sets of predictions. Evidence from phylogenetic profiles and trees suggests that the Epsilonproteobacteria have the strongest affinities with the three Aquificae analyzed. However, this pattern is shown by only a minority of encoded proteins, and the Archaea, many lineages of thermophilic bacteria, and members of genus Clostridium and class Deltaproteobacteria also show strong connections to the Aquificae. The phylogenetic affiliations of different functional subsystems showed strong biases: Most but not all genes implicated in the core translational apparatus tended to group Aquificae with Thermotogae, whereas a wide range of metabolic and cellular processes strongly supported the link between Aquificae and Epsilonproteobacteria. Depending on which sets of genes are privileged, either Thermotogae or Epsilonproteobacteria is the most plausible adjacent lineage to the Aquificae. Both scenarios require massive sharing of genes to explain the history of this enigmatic group, whose history is further complicated by specific affinities of different members of Aquificae to different partner lineages.
Collapse
Affiliation(s)
- Robert J.M. Eveleigh
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
- Faculty of Computer Science, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Conor J. Meehan
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
- Faculty of Computer Science, Dalhousie University, Halifax, Nova Scotia, Canada
| | - John M. Archibald
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
| | - Robert G. Beiko
- Faculty of Computer Science, Dalhousie University, Halifax, Nova Scotia, Canada
| |
Collapse
|
23
|
Ancient origin of the divergent forms of leucyl-tRNA synthetases in the Halobacteriales. BMC Evol Biol 2012; 12:85. [PMID: 22694720 PMCID: PMC3436685 DOI: 10.1186/1471-2148-12-85] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2011] [Accepted: 04/27/2012] [Indexed: 02/01/2023] Open
Abstract
Background Horizontal gene transfer (HGT) has greatly impacted the genealogical history of many lineages, particularly for prokaryotes, with genes frequently moving in and out of a line of descent. Many genes that were acquired by a lineage in the past likely originated from ancestral relatives that have since gone extinct. During the course of evolution, HGT has played an essential role in the origin and dissemination of genetic and metabolic novelty. Results Three divergent forms of leucyl-tRNA synthetase (LeuRS) exist in the archaeal order Halobacteriales, commonly known as haloarchaea. Few haloarchaeal genomes have the typical archaeal form of this enzyme and phylogenetic analysis indicates it clusters within the Euryarchaeota as expected. The majority of sequenced halobacterial genomes possess a bacterial form of LeuRS. Phylogenetic reconstruction puts this larger group of haloarchaea at the base of the bacterial domain. The most parsimonious explanation is that an ancient transfer of LeuRS took place from an organism related to the ancestor of the bacterial domain to the haloarchaea. The bacterial form of LeuRS further underwent gene duplications and/or gene transfers within the haloarchaea, with some genomes possessing two distinct types of bacterial LeuRS. The cognate tRNALeu also reveals two distinct clusters for the haloarchaea; however, these tRNALeu clusters do not coincide with the groupings found in the LeuRS tree, revealing that LeuRS evolved independently of its cognate tRNA. Conclusions The study of leucyl-tRNA synthetase in haloarchaea illustrates the importance of gene transfer originating in lineages that went extinct since the transfer occurred. The haloarchaeal LeuRS and tRNALeu did not co-evolve.
Collapse
|
24
|
Lang AS, Zhaxybayeva O, Beatty JT. Gene transfer agents: phage-like elements of genetic exchange. Nat Rev Microbiol 2012; 10:472-82. [PMID: 22683880 DOI: 10.1038/nrmicro2802] [Citation(s) in RCA: 254] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Horizontal gene transfer is important in the evolution of bacterial and archaeal genomes. An interesting genetic exchange process is carried out by diverse phage-like gene transfer agents (GTAs) that are found in a wide range of prokaryotes. Although GTAs resemble phages, they lack the hallmark capabilities that define typical phages, and they package random pieces of the producing cell's genome. In this Review, we discuss the defining characteristics of the GTAs that have been identified to date, along with potential functions for these agents and the possible evolutionary forces that act on the genes involved in their production.
Collapse
Affiliation(s)
- Andrew S Lang
- Department of Biology, Memorial University, St. John's, Newfoundland and Labrador A1B 3X9, Canada.
| | | | | |
Collapse
|
25
|
Abstract
Lateral gene transfer (LGT), the acquisition of genes from other species, is a major evolutionary force. However, its success as an adaptive process makes the reconstruction of the history of life an intricate puzzle: If no gene has remained unaffected during the course of life's evolution, how can one rely on molecular markers to reconstruct the relationships among species? Here, we take a completely different look at LGT and its impact for the reconstruction of the history of life. Rather than trying to remove the effect of LGT in phylogenies, and ignoring as a result most of the information of gene histories, we use an explicit phylogenetic model of gene transfer to reconcile gene histories with the tree of species. We studied 16 bacterial and archaeal phyla, representing a dataset of 12,000 gene families distributed in 336 genomes. Our results show that, in most phyla, LGT provides an abundant phylogenetic signal on the pattern of species diversification and that this signal is robust to the choice of gene families under study. We also find that LGT brings an abundant signal on the location of the root of species trees, which has been previously overlooked. Our results quantify the great variety of gene transfer rates among lineages of the tree of life and provide strong support for the "complexity hypothesis," which states that genes whose products participate to macromolecular protein complexes are relatively resistant to transfer.
Collapse
|
26
|
Fournier GP, Andam CP, Alm EJ, Gogarten JP. Molecular evolution of aminoacyl tRNA synthetase proteins in the early history of life. ORIGINS LIFE EVOL B 2011; 41:621-32. [PMID: 22200905 DOI: 10.1007/s11084-011-9261-2] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2011] [Accepted: 12/06/2011] [Indexed: 01/02/2023]
Abstract
Aminoacyl-tRNA synthetases (aaRS) consist of several families of functionally conserved proteins essential for translation and protein synthesis. Like nearly all components of the translation machinery, most aaRS families are universally distributed across cellular life, being inherited from the time of the Last Universal Common Ancestor (LUCA). However, unlike the rest of the translation machinery, aaRS have undergone numerous ancient horizontal gene transfers, with several independent events detected between domains, and some possibly involving lineages diverging before the time of LUCA. These transfers reveal the complexity of molecular evolution at this early time, and the chimeric nature of genomes within cells that gave rise to the major domains. Additionally, given the role of these protein families in defining the amino acids used for protein synthesis, sequence reconstruction of their pre-LUCA ancestors can reveal the evolutionary processes at work in the origin of the genetic code. In particular, sequence reconstructions of the paralog ancestors of isoleucyl- and valyl- RS provide strong empirical evidence that at least for this divergence, the genetic code did not co-evolve with the aaRSs; rather, both amino acids were already part of the genetic code before their cognate aaRSs diverged from their common ancestor. The implications of this observation for the early evolution of RNA-directed protein biosynthesis are discussed.
Collapse
Affiliation(s)
- Gregory P Fournier
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| | | | | | | |
Collapse
|
27
|
Zhi XY, Zhao W, Li WJ, Zhao GP. Prokaryotic systematics in the genomics era. Antonie van Leeuwenhoek 2011; 101:21-34. [PMID: 22116211 DOI: 10.1007/s10482-011-9667-x] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2011] [Accepted: 10/24/2011] [Indexed: 11/29/2022]
Abstract
As an essential and basic biological discipline, prokaryotic systematics is entering the era of genomics. This paradigmatic shift is significant not only for understanding molecular phylogeny at the whole genome level but also in revealing the genetic or epigenetic basis that accounts for the phenotypic criteria used to classify and identify species. These developments provide an opportunity and a challenge for systematists to reanalyze the molecular mechanisms underlying the taxonomic characteristics of prokaryotes by drawing the knowledge from studies of genomics and/or functional genomics employing platform technologies and related bioinformatics tools. It is expected that taxonomic books, such as Bergey's Manual of Systematic Bacteriology may evolve into a systematics library indexed by phylogenomic information with an comprehensive understanding of prokaryotic speciation and associated increasing knowledge of biological phenomena.
Collapse
Affiliation(s)
- Xiao-Yang Zhi
- Key Laboratory of Microbial Diversity in Southwest China, Ministry of Education and the Laboratory for Conservation and Utilization of Bio-Resources, Yunnan Institute of Microbiology, Yunnan University, Kunming, China
| | | | | | | |
Collapse
|
28
|
Zaneveld JRR, Parfrey LW, Van Treuren W, Lozupone C, Clemente JC, Knights D, Stombaugh J, Kuczynski J, Knight R. Combined phylogenetic and genomic approaches for the high-throughput study of microbial habitat adaptation. Trends Microbiol 2011; 19:472-82. [PMID: 21872475 PMCID: PMC3184378 DOI: 10.1016/j.tim.2011.07.006] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2011] [Revised: 07/22/2011] [Accepted: 07/25/2011] [Indexed: 01/21/2023]
Abstract
High-throughput sequencing technologies provide new opportunities to address longstanding questions about habitat adaptation in microbial organisms. How have microbes managed to adapt to such a wide range of environments, and what genomic features allow for such adaptation? We review recent large-scale studies of habitat adaptation, with emphasis on those that utilize phylogenetic techniques. On the basis of current trends, we summarize methodological challenges faced by investigators, and the tools, techniques and analytical approaches available to overcome them. Phylogenetic approaches and detailed information about each environmental sample will be crucial as the ability to collect genome sequences continues to expand.
Collapse
Affiliation(s)
- Jesse R R Zaneveld
- Department of Molecular, Cellular and Developmental Biology, University of Colorado, Boulder, CO 80309, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
29
|
Andam CP, Gogarten JP. Biased gene transfer and its implications for the concept of lineage. Biol Direct 2011; 6:47. [PMID: 21943000 PMCID: PMC3191353 DOI: 10.1186/1745-6150-6-47] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2011] [Accepted: 09/23/2011] [Indexed: 11/10/2022] Open
Abstract
Background In the presence of horizontal gene transfer (HGT), the concepts of lineage and genealogy in the microbial world become more ambiguous because chimeric genomes trace their ancestry from a myriad of sources, both living and extinct. Results We present the evolutionary histories of three aminoacyl-tRNA synthetases (aaRS) to illustrate that the concept of organismal lineage in the prokaryotic world is defined by both vertical inheritance and reticulations due to HGT. The acquisition of a novel gene from a distantly related taxon can be considered as a shared derived character that demarcates a group of organisms, as in the case of the spirochaete Phenylalanyl-tRNA synthetase (PheRS). On the other hand, when organisms transfer genetic material with their close kin, the similarity and therefore relatedness observed among them is essentially shaped by gene transfer. Studying the distribution patterns of divergent genes with identical functions, referred to as homeoalleles, can reveal preferences for transfer partners. We describe the very ancient origin and the distribution of the archaeal homeoalleles for Threonyl-tRNA synthetases (ThrRS) and Seryl-tRNA synthetases (SerRS). Conclusions Patterns created through biased HGT can be undistinguishable from those created through shared organismal ancestry. A re-evaluation of the definition of lineage is necessary to reflect genetic relatedness due to both HGT and vertical inheritance. In most instances, HGT bias will maintain and strengthen similarity within groups. Only in cases where HGT bias is due to other factors, such as shared ecological niche, do patterns emerge from gene phylogenies that are in conflict with those reflecting shared organismal ancestry. Reviewers This article was reviewed by W. Ford Doolittle, François-Joseph Lapointe, and Frederic Bouchard.
Collapse
Affiliation(s)
- Cheryl P Andam
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269-3125, USA.
| | | |
Collapse
|
30
|
Williams D, Fournier GP, Lapierre P, Swithers KS, Green AG, Andam CP, Gogarten JP. A rooted net of life. Biol Direct 2011; 6:45. [PMID: 21936906 PMCID: PMC3189188 DOI: 10.1186/1745-6150-6-45] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2011] [Accepted: 09/21/2011] [Indexed: 01/29/2023] Open
Abstract
Abstract Phylogenetic reconstruction using DNA and protein sequences has allowed the reconstruction of evolutionary histories encompassing all life. We present and discuss a means to incorporate much of this rich narrative into a single model that acknowledges the discrete evolutionary units that constitute the organism. Briefly, this Rooted Net of Life genome phylogeny is constructed around an initial, well resolved and rooted tree scaffold inferred from a supermatrix of combined ribosomal genes. Extant sampled ribosomes form the leaves of the tree scaffold. These leaves, but not necessarily the deeper parts of the scaffold, can be considered to represent a genome or pan-genome, and to be associated with members of other gene families within that sequenced (pan)genome. Unrooted phylogenies of gene families containing four or more members are reconstructed and superimposed over the scaffold. Initially, reticulations are formed where incongruities between topologies exist. Given sufficient evidence, edges may then be differentiated as those representing vertical lines of inheritance within lineages and those representing horizontal genetic transfers or endosymbioses between lineages. Reviewers W. Ford Doolittle, Eric Bapteste and Robert Beiko.
Collapse
Affiliation(s)
- David Williams
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269-3125, USA.
| | | | | | | | | | | | | |
Collapse
|
31
|
Andam CP, Fournier GP, Gogarten JP. Multilevel populations and the evolution of antibiotic resistance through horizontal gene transfer. FEMS Microbiol Rev 2011; 35:756-67. [DOI: 10.1111/j.1574-6976.2011.00274.x] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
|
32
|
Popa O, Dagan T. Trends and barriers to lateral gene transfer in prokaryotes. Curr Opin Microbiol 2011; 14:615-23. [PMID: 21856213 DOI: 10.1016/j.mib.2011.07.027] [Citation(s) in RCA: 153] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2011] [Revised: 07/19/2011] [Accepted: 07/25/2011] [Indexed: 11/19/2022]
Abstract
Gene acquisition by lateral gene transfer (LGT) is an important mechanism for natural variation among prokaryotes. Laboratory experiments show that protein-coding genes can be laterally transferred extremely fast among microbial cells, inherited to most of their descendants, and adapt to a new regulatory regime within a short time. Recent advance in the phylogenetic analysis of microbial genomes using networks approach reveals a substantial impact of LGT during microbial genome evolution. Phylogenomic networks of LGT among prokaryotes reconstructed from completely sequenced genomes uncover barriers to LGT in multiple levels. Here we discuss the kinds of barriers to gene acquisition in nature including physical barriers for gene transfer between cells, genomic barriers for the integration of acquired DNA, and functional barriers for the acquisition of new genes.
Collapse
Affiliation(s)
- Ovidiu Popa
- Institute of Molecular Evolution, Heinrich-Heine University of Düsseldorf, Universitätstr. 1 40225, Düsseldorf, Germany
| | | |
Collapse
|
33
|
O'Malley MA, Koonin EV. How stands the Tree of Life a century and a half after The Origin? Biol Direct 2011; 6:32. [PMID: 21714936 PMCID: PMC3158114 DOI: 10.1186/1745-6150-6-32] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2011] [Accepted: 06/30/2011] [Indexed: 12/21/2022] Open
Abstract
We examine the Tree of Life (TOL) as an evolutionary hypothesis and a heuristic. The original TOL hypothesis has failed but a new "statistical TOL hypothesis" is promising. The TOL heuristic usefully organizes data without positing fundamental evolutionary truth.
Collapse
Affiliation(s)
- Maureen A O'Malley
- Department of Philosophy, Quadrangle A14, University of Sydney, NSW 2006, Australia
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda MD20894, USA
| |
Collapse
|
34
|
|
35
|
Vogan AA, Higgs PG. The advantages and disadvantages of horizontal gene transfer and the emergence of the first species. Biol Direct 2011; 6:1. [PMID: 21199581 PMCID: PMC3043529 DOI: 10.1186/1745-6150-6-1] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2010] [Accepted: 01/03/2011] [Indexed: 01/05/2023] Open
Abstract
Background Horizontal Gene Transfer (HGT) is beneficial to a cell if the acquired gene confers a useful function, but is detrimental if the gene has no function, if it is incompatible with existing genes, or if it is a selfishly replicating mobile element. If the balance of these effects is beneficial on average, we would expect cells to evolve high rates of acceptance of horizontally transferred genes, whereas if it is detrimental, cells should reduce the rate of HGT as far as possible. It has been proposed that the rate of HGT was very high in the early stages of prokaryotic evolution, and hence there were no separate lineages of organisms. Only when the HGT rate began to fall, would lineages begin to emerge with their own distinct sets of genes. Evolution would then become more tree-like. This phenomenon has been called the Darwinian Threshold. Results We study a model for genome evolution that incorporates both beneficial and detrimental effects of HGT. We show that if rate of gene loss during genome replication is high, as was probably the case in the earliest genomes before the time of the last universal common ancestor, then a high rate of HGT is favourable. HGT leads to the rapid spread of new genes and allows the build-up of larger, fitter genomes than could be achieved by purely vertical inheritance. In contrast, if the gene loss rate is lower, as in modern prokaryotes, then HGT is, on average, unfavourable. Conclusions Modern cells should therefore evolve to reduce HGT if they can, although the prevalence of independently replicating mobile elements and viruses may mean that cells cannot avoid HGT in practice. In the model, natural selection leads to gradual improvement of the replication accuracy and gradual decrease in the optimal rate of HGT. By clustering genomes based on gene content, we show that there are no separate lineages of organisms when the rate of HGT is high; however, as the rate of HGT decreases, a tree-like structure emerges with well-defined lineages. The model therefore passes through a Darwinian Threshold. Reviewers This article was reviewed by Eugene V. Koonin, Anthony Poole and J. Peter Gogarten.
Collapse
Affiliation(s)
- Aaron A Vogan
- Origins Institute, McMaster University, Hamilton, Ontario, Canada
| | | |
Collapse
|
36
|
Lateral acquisition of genes is affected by the friendliness of their products. Proc Natl Acad Sci U S A 2010; 108:343-8. [PMID: 21149709 DOI: 10.1073/pnas.1009775108] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
A major factor in the evolution of microbial genomes is the lateral acquisition of genes that evolved under the functional constraints of other species. Integration of foreign genes into a genome that has different components and circuits poses an evolutionary challenge. Moreover, genes belonging to complex modules in the pretransfer species are unlikely to maintain their functionality when transferred alone to new species. Thus, it is widely accepted that lateral gene transfer favors proteins with only a few protein-protein interactions. The propensity of proteins to participate in protein-protein interactions can be assessed using computational methods that identify putative interaction sites on the protein. Here we report that laterally acquired proteins contain significantly more putative interaction sites than native proteins. Thus, genes encoding proteins with multiple protein-protein interactions may in fact be more prone to transfer than genes with fewer interactions. We suggest that these proteins have a greater chance of forming new interactions in new species, thus integrating into existing modules. These results reveal basic principles for the incorporation of novel genes into existing systems.
Collapse
|
37
|
Kloesges T, Popa O, Martin W, Dagan T. Networks of gene sharing among 329 proteobacterial genomes reveal differences in lateral gene transfer frequency at different phylogenetic depths. Mol Biol Evol 2010; 28:1057-74. [PMID: 21059789 PMCID: PMC3021791 DOI: 10.1093/molbev/msq297] [Citation(s) in RCA: 111] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Lateral gene transfer (LGT) is an important mechanism of natural variation among prokaryotes. Over the full course of evolution, most or all of the genes resident in a given prokaryotic genome have been affected by LGT, yet the frequency of LGT can vary greatly across genes and across prokaryotic groups. The proteobacteria are among the most diverse of prokaryotic taxa. The prevalence of LGT in their genome evolution calls for the application of network-based methods instead of tree-based methods to investigate the relationships among these species. Here, we report networks that capture both vertical and horizontal components of evolutionary history among 1,207,272 proteins distributed across 329 sequenced proteobacterial genomes. The network of shared proteins reveals modularity structure that does not correspond to current classification schemes. On the basis of shared protein-coding genes, the five classes of proteobacteria fall into two main modules, one including the alpha-, delta-, and epsilonproteobacteria and the other including beta- and gammaproteobacteria. The first module is stable over different protein identity thresholds. The second shows more plasticity with regard to the sequence conservation of proteins sampled, with the gammaproteobacteria showing the most chameleon-like evolutionary characteristics within the present sample. Using a minimal lateral network approach, we compared LGT rates at different phylogenetic depths. In general, gene evolution by LGT within proteobacteria is very common. At least one LGT event was inferred to have occurred in at least 75% of the protein families. The average LGT rate at the species and class depth is about one LGT event per protein family, the rate doubling at the phylum level to an average of two LGT events per protein family. Hence, our results indicate that the rate of gene acquisition per protein family is similar at the level of species (by recombination) and at the level of classes (by LGT). The frequency of LGT per genome strongly depends on the species lifestyle, with endosymbionts showing far lower LGT frequencies than free-living species. Moreover, the nature of the transferred genes suggests that gene transfer in proteobacteria is frequently mediated by conjugation.
Collapse
Affiliation(s)
- Thorsten Kloesges
- Institute of Botany III, Heinrich-Heine University Düsseldorf, Düsseldorf, Germany
| | | | | | | |
Collapse
|
38
|
Abstract
Phylogenetic trees of individual genes of prokaryotes (archaea and bacteria) generally have different topologies, largely owing to extensive horizontal gene transfer (HGT), suggesting that the Tree of Life (TOL) should be replaced by a "net of life" as the paradigm of prokaryote evolution. However, trees remain the natural representation of the histories of individual genes given the fundamentally bifurcating process of gene replication. Therefore, although no single tree can fully represent the evolution of prokaryote genomes, the complete picture of evolution will necessarily combine trees and nets. A quantitative measure of the signals of tree and net evolution is derived from an analysis of all quartets of species in all trees of the "Forest of Life" (FOL), which consists of approximately 7,000 phylogenetic trees for prokaryote genes including approximately 100 nearly universal trees (NUTs). Although diverse routes of net-like evolution collectively dominate the FOL, the pattern of tree-like evolution that reflects the consistent topologies of the NUTs is the most prominent coherent trend. We show that the contributions of tree-like and net-like evolutionary processes substantially differ across bacterial and archaeal lineages and between functional classes of genes. Evolutionary simulations indicate that the central tree-like signal cannot be realistically explained by a self-reinforcing pattern of biased HGT.
Collapse
Affiliation(s)
- Pere Puigbò
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland, USA
| | | | | |
Collapse
|