1
Farrell AA, Nesbø CL, Zhaxybayeva O. Early Divergence and Gene Exchange Highways in the Evolutionary History of Mesoaciditogales. Genome Biol Evol 2023; 15:evad156. [PMID: 37616556 PMCID: PMC10476701 DOI: 10.1093/gbe/evad156] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/23/2023] [Revised: 08/15/2023] [Accepted: 08/16/2023] [Indexed: 08/26/2023] Open
Abstract
The placement of a nonhyperthermophilic order Mesoaciditogales as the earliest branching clade within the Thermotogota phylum challenges the prevailing hypothesis that the last common ancestor of Thermotogota was a hyperthermophile. Yet, given the long branch leading to the only two Mesoaciditogales described to date, the phylogenetic position of the order may be due to the long branch attraction artifact. By testing various models and applying data recoding in phylogenetic reconstructions, we observed that early branching of Mesoaciditogales within Thermotogota is strongly supported by the conserved marker genes assumed to be vertically inherited. However, based on the taxonomic content of 1,181 gene families and a phylogenetic analysis of 721 gene family trees, we also found that a substantial number of Mesoaciditogales genes are more closely related to species from the order Petrotogales. These genes contribute to coenzyme transport and metabolism, fatty acid biosynthesis, genes known to respond to heat and cold stressors, and include many genes of unknown functions. The Petrotogales comprise moderately thermophilic and mesophilic species with similar temperature tolerances to that of Mesoaciditogales. Our findings hint at extensive horizontal gene transfer (HGT) between, or parallel independent gene gains by, the two ecologically similar lineages and suggest that the exchanged genes may be important for adaptation to comparable temperature niches.
Affiliation(s)
- Anne A Farrell
- Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire, USA
- Camilla L Nesbø
- Department of Biological Sciences, University of Alberta, Edmonton, Alberta, Canada
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, Ontario, Canada
- Olga Zhaxybayeva
- Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire, USA
- Department of Computer Science, Dartmouth College, Hanover, New Hampshire, USA
2
Hyun JC, Palsson BO. Reconstruction of the last bacterial common ancestor from 183 pangenomes reveals a versatile ancient core genome. Genome Biol 2023; 24:183. [PMID: 37553643 PMCID: PMC10411014 DOI: 10.1186/s13059-023-03028-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 03/23/2023] [Accepted: 07/28/2023] [Indexed: 08/10/2023] Open
Abstract
BACKGROUND Cumulative sequencing efforts have yielded enough genomes to construct pangenomes for dozens of bacterial species and elucidate intraspecies gene conservation. Given the diversity of organisms for which this is achievable, similar analyses for ancestral species are feasible through the integration of pangenomics and phylogenetics, promising deeper insights into the nature of ancient life. RESULTS We construct pangenomes for 183 bacterial species from 54,085 genomes and identify their core genomes using a novel statistical model to estimate genome-specific error rates and underlying gene frequencies. The core genomes are then integrated into a phylogenetic tree to reconstruct the core genome of the last bacterial common ancestor (LBCA), yielding three main results: First, the gene content of modern and ancestral core genomes are diverse at the level of individual genes but are similarly distributed by functional category and share several poorly characterized genes. Second, the LBCA core genome is distinct from any individual modern core genome but has many fundamental biological systems intact, especially those involving translation machinery and biosynthetic pathways to all major nucleotides and amino acids. Third, despite this metabolic versatility, the LBCA core genome likely requires additional non-core genes for viability, based on comparisons with the minimal organism, JCVI-Syn3A. CONCLUSIONS These results suggest that many cellular systems commonly conserved in modern bacteria were not just present in ancient bacteria but were nearly immutable with respect to short-term intraspecies variation. Extending this analysis to other domains of life will likely provide similar insights into more distant ancestral species.
Affiliation(s)
- Jason C Hyun
- Bioinformatics and Systems Biology Program, University of California, La Jolla, San Diego, CA, USA
- Bernhard O Palsson
- Bioinformatics and Systems Biology Program, University of California, La Jolla, San Diego, CA, USA.
- Department of Bioengineering, University of California, La Jolla, San Diego, CA, USA.
3
Goldman AD, Kaçar B. Very early evolution from the perspective of microbial ecology. Environ Microbiol 2023; 25:5-10. [PMID: 35944516 DOI: 10.1111/1462-2920.16144] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 07/18/2022] [Accepted: 07/19/2022] [Indexed: 01/21/2023]
Abstract
The universal ancestor at the root of the species tree of life depicts a population of organisms with a surprising degree of complexity, possessing genomes and translation systems much like those of microbial life today. As the first life forms were most likely to have been simple replicators, considerable evolutionary change must have taken place prior to the last universal common ancestor. It is often assumed that the lack of earlier branches on the tree of life is due to a prevalence of random horizontal gene transfer that obscured the delineations between lineages and hindered their divergence. Therefore, principles of microbial evolution and ecology may give us some insight into these early stages in the history of life. Here, we synthesize the current understanding of organismal and genome evolution from the perspective of microbial ecology and apply these evolutionary principles to the earliest stages of life on Earth. We focus especially on broad evolutionary modes pertaining to horizontal gene transfer, pangenome structure, and microbial mat communities.
Affiliation(s)
- Aaron D Goldman
- Department of Biology, Oberlin College and Conservatory, Oberlin, Ohio, USA
- Betül Kaçar
- Department of Bacteriology, University of Wisconsin-Madison, Madison, Wisconsin, USA
4
Cote-L’Heureux A, Maurer-Alcalá XX, Katz LA. Old genes in new places: A taxon-rich analysis of interdomain lateral gene transfer events. PLoS Genet 2022; 18:e1010239. [PMID: 35731825 PMCID: PMC9255765 DOI: 10.1371/journal.pgen.1010239] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 11/30/2021] [Revised: 07/05/2022] [Accepted: 05/06/2022] [Indexed: 11/26/2022] Open
Abstract
Vertical inheritance is foundational to Darwinian evolution, but fails to explain major innovations such as the rapid spread of antibiotic resistance among bacteria and the origin of photosynthesis in eukaryotes. While lateral gene transfer (LGT) is recognized as an evolutionary force in prokaryotes, the role of LGT in eukaryotic evolution is less clear. With the exception of the transfer of genes from organelles to the nucleus, a process termed endosymbiotic gene transfer (EGT), the extent of interdomain transfer from prokaryotes to eukaryotes is highly debated. A common critique of studies of interdomain LGT is the reliance on the topology of single-gene trees that attempt to estimate more than one billion years of evolution. We take a more conservative approach by identifying cases in which a single clade of eukaryotes is found in an otherwise prokaryotic gene tree (i.e. exclusive presence). Starting with a taxon-rich dataset of over 13,600 gene families and passing data through several rounds of curation, we identify and categorize the function of 306 interdomain LGT events into diverse eukaryotes, including 189 putative EGTs, 52 LGTs into Opisthokonta (i.e. animals, fungi and their microbial relatives), and 42 LGTs nearly exclusive to anaerobic eukaryotes. To assess differential gene loss as an explanation for exclusive presence, we compare branch lengths within each LGT tree to a set of vertically-inherited genes subsampled to mimic gene loss (i.e. with the same taxonomic sampling) and consistently find shorter relative distance between eukaryotes and prokaryotes in LGT trees, a pattern inconsistent with gene loss. Our methods provide a framework for future studies of interdomain LGT and move the field closer to an understanding of how best to model the evolutionary history of eukaryotes.
Affiliation(s)
- Auden Cote-L’Heureux
- Department of Biological Sciences, Smith College, Northampton, Massachusetts, United States of America
- Laura A. Katz
- Department of Biological Sciences, Smith College, Northampton, Massachusetts, United States of America
- Program in Organismic Biology and Evolution, University of Massachusetts Amherst, Amherst, Massachusetts, United States of America
5
Estrada A, Suárez-Díaz E, Becerra A. Reconstructing the Last Common Ancestor: Epistemological and Empirical Challenges. Acta Biotheor 2022; 70:15. [PMID: 35575816 DOI: 10.1007/s10441-022-09439-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/13/2021] [Accepted: 04/25/2022] [Indexed: 11/24/2022]
Abstract
Reconstructing the genetic traits of the Last Common Ancestor (LCA) and the Tree of Life (TOL) are two examples of the reaches of contemporary molecular phylogenetics. Nevertheless, the whole enterprise has led to paradoxical results. The presence of Lateral Gene Transfer (LGT) poses epistemic and empirical challenges to meeting these goals; the discussion around this subject has been enriched by arguments from philosophers and historians of science. At the same time, a few influential research groups have aimed to reconstruct the LCA with richly detailed hypotheses and high-resolution catalogs of genes and metabolic traits. We argue that LGT poses insurmountable challenges for richly detailed reconstructions and propose, instead, a middle-ground position: the reconstruction of a slim LCA based on traits under strong pressures of Negative Natural Selection. We also argue for the need for consilience with evidence from organismal biology and geochemistry. We defend a cautionary perspective that goes beyond the statistical analysis of gene similarities and assumes the broader consequences of evolving empirical data and epistemic pluralism in the reconstruction of early life.
Affiliation(s)
- Amadeo Estrada
- Posgrado en Ciencias Biológicas, Universidad Nacional Autónoma de México, Coyoacán, Mexico
- Edna Suárez-Díaz
- Facultad de Ciencias, Universidad Nacional Autónoma de México, Circuito Exterior Ciudad Universitaria, 04510, Coyoacán, DF, Mexico
- Arturo Becerra
- Facultad de Ciencias, Universidad Nacional Autónoma de México, Circuito Exterior Ciudad Universitaria, 04510, Coyoacán, DF, Mexico.
6
Chin AF, Wrabl JO, Hilser VJ. A thermodynamic atlas of proteomes reveals energetic innovation across the tree of life. Mol Biol Evol 2022; 39:6509521. [PMID: 35038744 PMCID: PMC8896757 DOI: 10.1093/molbev/msac010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/12/2022] Open
Abstract
Protein stability is a fundamental molecular property enabling organisms to adapt to their biological niches. How this is facilitated and whether there are kingdom-specific or more general universal strategies is not known. A principal obstacle to addressing this issue is that the vast majority of proteins lack annotation, specifically thermodynamic annotation, beyond the amino acid and chromosome information derived from genome sequencing. To address this gap and facilitate future investigation into large-scale patterns of protein stability and dynamics within and between organisms, we applied a unique ensemble-based thermodynamic characterization of protein folds to a substantial portion of extant sequenced genomes. Using this approach, we compiled a database resource focused on the position-specific variation in protein stability. Interrogation of the database reveals that 1) domains of life exhibit distinguishing thermodynamic features, with eukaryotes particularly different from both archaea and bacteria, 2) the optimal growth temperature of an organism is proportional to the average apolar enthalpy of its proteome, 3) intrinsic disorder content is also proportional to the apolar enthalpy (but unexpectedly not the predicted stability at 25 °C), and 4) secondary structure and global stability information of individual proteins is extractable. We hypothesize that wider access to residue-specific thermodynamic information of proteomes will result in deeper understanding of mechanisms driving functional adaptation and protein evolution. Our database is free for download at https://afc-science.github.io/thermo-env-atlas/.
Affiliation(s)
- Alexander F Chin
- Department of Biology, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
- James O Wrabl
- Department of Biology, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
- Vincent J Hilser
- Department of Biology, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA; T.C. Jenkins Department of Biophysics, Johns Hopkins University, 3400 North Charles Street, Baltimore, MD, 21218, USA
7
Watkins A. Multi-model approaches to phylogenetics: Implications for idealization. Studies in History and Philosophy of Science 2021; 90:285-297. [PMID: 34768089 DOI: 10.1016/j.shpsa.2021.10.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/05/2021] [Revised: 09/17/2021] [Accepted: 10/08/2021] [Indexed: 06/13/2023]
Abstract
Phylogenetic models traditionally represent the history of life as having a strictly-branching tree structure. However, it is becoming increasingly clear that the history of life is often not strictly-branching; lateral gene transfer, endosymbiosis, and hybridization, for example, can all produce lateral branching events. There is thus motivation to allow phylogenetic models to have a reticulate structure. One proposal involves the reconciliation of genealogical discordance. Briefly, this method uses patterns of disagreement - discordance - between trees of different genes to add lateral branching events to phylogenetic trees of taxa, and to estimate the most likely cause of these events. I use this practice to argue for: (1) a need for expanded accounts of multiple-models idealization, (2) a distinction between automatic and manual de-idealization, and (3) recognition that idealization may serve the meso-level aims of science in a different way than hitherto acknowledged.
Affiliation(s)
- Aja Watkins
- Boston University Department of Philosophy, 745 Commonwealth Ave, Boston 02215, Massachusetts, USA. http://www.ajawatkins.org
8
Berkemer SJ, McGlynn SE. A New Analysis of Archaea-Bacteria Domain Separation: Variable Phylogenetic Distance and the Tempo of Early Evolution. Mol Biol Evol 2021; 37:2332-2340. [PMID: 32316034 PMCID: PMC7403611 DOI: 10.1093/molbev/msaa089] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Indexed: 12/29/2022] Open
Abstract
Comparative genomics and molecular phylogenetics are foundational for understanding biological evolution. Although many studies have been made with the aim of understanding the genomic contents of early life, uncertainty remains. A study by Weiss et al. (Weiss MC, Sousa FL, Mrnjavac N, Neukirchen S, Roettger M, Nelson-Sathi S, Martin WF. 2016. The physiology and habitat of the last universal common ancestor. Nat Microbiol. 1(9):16116.) identified a number of protein families in the last universal common ancestor of archaea and bacteria (LUCA) which were not found in previous works. Here, we report new research that suggests the clustering approaches used in this previous study undersampled protein families, resulting in incomplete phylogenetic trees which do not reflect protein family evolution. Phylogenetic analysis of protein families which include more sequence homologs rejects a simple LUCA hypothesis based on phylogenetic separation of the bacterial and archaeal domains for a majority of the previously identified LUCA proteins (∼82%). To supplement limitations of phylogenetic inference derived from incompletely populated orthologous groups and to test the hypothesis of a period of rapid evolution preceding the separation of the domains, we compared phylogenetic distances both within and between domains, for thousands of orthologous groups. We find a substantial diversity of interdomain versus intradomain branch lengths, even among protein families which exhibit a single domain separating branch and are thought to be associated with the LUCA. Additionally, phylogenetic trees with long interdomain branches relative to intradomain branches are enriched in information categories of protein families in comparison to those associated with metabolic functions. These results provide a new view of protein family evolution and temper claims about the phenotype and habitat of the LUCA.
Affiliation(s)
- Sarah J Berkemer
- Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany; Bioinformatics Group, Department of Computer Science, University Leipzig, Leipzig, Germany; Competence Center for Scalable Data Services and Solutions, Dresden/Leipzig, Germany
- Shawn E McGlynn
- Earth-Life Science Institute, Tokyo Institute of Technology, Meguro, Tokyo, Japan; Blue Marble Space Institute of Science, Seattle, WA; RIKEN Center for Sustainable Resource Science (CSRS), Saitama, Japan
9
Abstract
The advent of comparative genomics in the late 1990s led to the discovery of extensive lateral gene transfer in prokaryotes. The resulting debate over whether life as a whole is best represented as a tree or a network has since given way to a general consensus in which trees and networks co-exist rather than stand in opposition. Embracing this consensus allows us to move beyond the question of which is true or false. The future of the tree of life debate lies in asking what trees and networks can, and should, do for science.
Affiliation(s)
- Cédric Blais
- Centre for Comparative Genomics and Evolutionary Bioinformatics, Dalhousie University, Halifax, NS, Canada; Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, NS, Canada.
- John M Archibald
- Centre for Comparative Genomics and Evolutionary Bioinformatics, Dalhousie University, Halifax, NS, Canada; Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, NS, Canada.
10
Harris HMB, Hill C. A Place for Viruses on the Tree of Life. Front Microbiol 2021; 11:604048. [PMID: 33519747 PMCID: PMC7840587 DOI: 10.3389/fmicb.2020.604048] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Received: 09/10/2020] [Accepted: 12/14/2020] [Indexed: 12/15/2022] Open
Abstract
Viruses are ubiquitous. They infect almost every species and are probably the most abundant biological entities on the planet, yet they are excluded from the Tree of Life (ToL). However, there can be no doubt that viruses play a significant role in evolution, the force that facilitates all life on Earth. Conceptually, viruses are regarded by many as non-living entities that hijack living cells in order to propagate. A strict separation between living and non-living entities places viruses far from the ToL, but this may be theoretically unsound. Advances in sequencing technology and comparative genomics have expanded our understanding of the evolutionary relationships between viruses and cellular organisms. Genomic and metagenomic data have revealed that co-evolution between viral and cellular genomes involves frequent horizontal gene transfer and the occasional co-option of novel functions over evolutionary time. From the giant, ameba-infecting marine viruses to the tiny Porcine circovirus harboring only two genes, viruses and their cellular hosts are ecologically and evolutionarily intertwined. When deciding how, if, and where viruses should be placed on the ToL, we should remember that the Tree functions best as a model of biological evolution on Earth, and it is important that models themselves evolve with our increasing understanding of biological systems.
Affiliation(s)
- Hugh M B Harris
- APC Microbiome Ireland, College of Medicine and Health, University College Cork, Cork, Ireland
- Colin Hill
- APC Microbiome Ireland, College of Medicine and Health, University College Cork, Cork, Ireland; School of Microbiology, University College Cork, Cork, Ireland
11
DeSalle R, Riley M. Should Networks Supplant Tree Building? Microorganisms 2020; 8:E1179. [PMID: 32756444 PMCID: PMC7466111 DOI: 10.3390/microorganisms8081179] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 06/26/2020] [Revised: 07/21/2020] [Accepted: 07/29/2020] [Indexed: 12/15/2022] Open
Abstract
Recent studies suggested that network methods should supplant tree building as the basis of genealogical analysis. This proposition is based upon two arguments. First is the observation that bacterial and archaeal lineages experience processes oppositional to bifurcation and hence the representation of the evolutionary process in a tree-like structure is illogical. Second is the argument that tree-building approaches are circular: you ask for a tree and you get one, which pins a verificationist label on tree building that, if correct, should be the end of phylogenetic analysis as we currently know it. In this review, we examine these questions and suggest that rumors of the death of the bacterial tree of life are exaggerated at best.
Affiliation(s)
- Rob DeSalle
- Sackler Institute for Comparative Genomics, American Museum of Natural History, Central Park West at 79th Street, New York, NY 10024, USA
- Margaret Riley
- Department of Biology, University of Massachusetts Amherst, 116 North Pleasant Street, Amherst, MA 01003, USA
12
Olson ME, Arroyo-Santos A, Vergara-Silva F. A User’s Guide to Metaphors In Ecology and Evolution. Trends Ecol Evol 2019; 34:605-615. [DOI: 10.1016/j.tree.2019.03.001] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Received: 01/22/2019] [Revised: 03/16/2019] [Accepted: 03/18/2019] [Indexed: 11/25/2022]
13
Rossoni AW, Price DC, Seger M, Lyska D, Lammers P, Bhattacharya D, Weber APM. The genomes of polyextremophilic cyanidiales contain 1% horizontally transferred genes with diverse adaptive functions. eLife 2019; 8:e45017. [PMID: 31149898 PMCID: PMC6629376 DOI: 10.7554/elife.45017] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Received: 01/10/2019] [Accepted: 05/30/2019] [Indexed: 01/08/2023] Open
Abstract
The role and extent of horizontal gene transfer (HGT) in eukaryotes are hotly disputed topics that impact our understanding of the origin of metabolic processes and the role of organelles in cellular evolution. We addressed this issue by analyzing 10 novel Cyanidiales genomes and determined that 1% of their gene inventory is HGT-derived. Numerous HGT candidates share a close phylogenetic relationship with prokaryotes that live in similar habitats as the Cyanidiales and encode functions related to polyextremophily. HGT candidates differ from native genes in GC-content, number of splice sites, and gene expression. HGT candidates are more prone to loss, which may explain the absence of a eukaryotic pan-genome. Therefore, the lack of a pan-genome and cumulative effects fail to provide substantive arguments against our hypothesis of recurring HGT followed by differential loss in eukaryotes. The maintenance of 1% HGTs, even under selection for genome reduction, underlines the importance of non-endosymbiosis related foreign gene acquisition.
Affiliation(s)
- Alessandro W Rossoni
- Institute of Plant Biochemistry, Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich Heine University, Düsseldorf, Germany
- Dana C Price
- Department of Plant Biology, Rutgers University, New Brunswick, United States
- Mark Seger
- Arizona Center for Algae Technology and Innovation, Arizona State University, Mesa, United States
- Dagmar Lyska
- Institute of Plant Biochemistry, Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich Heine University, Düsseldorf, Germany
- Peter Lammers
- Arizona Center for Algae Technology and Innovation, Arizona State University, Mesa, United States
- Andreas PM Weber
- Institute of Plant Biochemistry, Cluster of Excellence on Plant Sciences (CEPLAS), Heinrich Heine University, Düsseldorf, Germany
14
Affiliation(s)
- Aaron Novick
- Department of Philosophy, Dalhousie University, Halifax, Nova Scotia, Canada
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
- W. Ford Doolittle
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada
15
16
Sleep NH. Geological and Geochemical Constraints on the Origin and Evolution of Life. Astrobiology 2018; 18:1199-1219. [PMID: 30124324 DOI: 10.1089/ast.2017.1778] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Indexed: 06/08/2023]
Abstract
The traditional tree of life from molecular biology with last universal common ancestor (LUCA) branching into bacteria and archaea (though fuzzy) is likely formally valid enough to be a basis for discussion of geological processes on the early Earth. Biologists infer likely properties of nodal organisms within the tree and, hence, the environment they inhabited. Geologists both vet tenuous trees and putative origin of life scenarios for geological and ecological reasonability and conversely infer geological information from trees. The latter approach is valuable as geologists have only weakly constrained the time when the Earth became habitable and the later time when life actually existed to the long interval between ∼4.5 and ∼3.85 Ga where no intact surface rocks are known. With regard to vetting, origin and early evolution hypotheses from molecular biology have recently centered on serpentinite settings in marine and alternatively land settings that are exposed to ultraviolet sunlight. The existence of these niches on the Hadean Earth is virtually certain. With regard to inferring geological environment from genomics, nodes on the tree of life can arise from true bottlenecks implied by the marine serpentinite origin scenario and by asteroid impact. Innovation of a very useful trait through a threshold allows the successful organism to quickly become very abundant and later root a large clade. The origin of life itself, that is, the initial Darwinian ancestor, the bacterial and archaeal roots as free-living cellular organisms that independently escaped hydrothermal chimneys above marine serpentinite or alternatively from shallow pore-water environments on land, the Selabacteria root with anoxygenic photosynthesis, and the Terrabacteria root colonizing land are attractive examples that predate the geological record. Conversely, geological reasoning presents likely events for appraisal by biologists. 
Asteroid impacts may have produced bottlenecks by decimating life. Thermophile roots of bacteria and archaea as well as a thermophile LUCA are attractive.
Affiliation(s)
- Norman H Sleep
- Department of Geophysics, Stanford University, Stanford, California
17
Abstract
BACKGROUND Deciphering the history of life on Earth has long been regarded as one of the most central tasks in biology. In past years, widespread discordance between the evolutionary histories of different groups of orthologous genes of prokaryotes has been revealed, primarily due to horizontal gene transfers (HGTs). Nonetheless, evidence that supports a strong tree-like signal of evolution has been uncovered, despite the presence of HGT events. Therefore, a challenging task is to distill this tree-like signal from the noise induced by all sources of non-tree-like events. RESULTS In this work we tackle this question, using real and simulated data. We first tighten a recent related theoretical result in this field. In a simulation study, we infer individual quartet topologies, and then use the inferred quartets to reconstruct simulated species trees. We demonstrate that accurate tree reconstruction is feasible despite surprisingly high rates of HGT. In a real data study, we construct phylogenies of two sets of prokaryotes, and show that our tree reconstruction scheme is comparable with (and complementary better than) other commonly used methods. CONCLUSIONS Using a blend of theoretical and empirical investigations, our study proves the feasibility of accurate quartet-based phylogenetic reconstruction, the vast impact of HGT events notwithstanding.
Affiliation(s)
- Eliran Avni
- Department of Evolutionary Biology, University of Haifa, 199 Aba Khoushy Ave. Mount Carmel, Haifa, 3498838, Israel
- Sagi Snir
- Department of Evolutionary Biology, University of Haifa, 199 Aba Khoushy Ave. Mount Carmel, Haifa, 3498838, Israel.
18
Danchin A, Ouzounis C, Tokuyasu T, Zucker JD. No wisdom in the crowd: genome annotation in the era of big data - current status and future prospects. Microb Biotechnol 2018; 11:588-605. [PMID: 29806194 PMCID: PMC6011933 DOI: 10.1111/1751-7915.13284] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Indexed: 12/15/2022] Open
Abstract
Science and engineering rely on the accumulation and dissemination of knowledge to make discoveries and create new designs. Discovery-driven genome research rests on knowledge passed on via gene annotations. In response to the deluge of sequencing big data, standard annotation practice employs automated procedures that rely on majority rules. We argue this hinders progress through the generation and propagation of errors, leading investigators into blind alleys. More subtly, this inductive process discourages the discovery of novelty, which remains essential in biological research and reflects the nature of biology itself. Annotation systems, rather than being repositories of facts, should be tools that support multiple modes of inference. By combining deduction, induction and abduction, investigators can generate hypotheses when accurate knowledge is extracted from model databases. A key stance is to depart from 'the sequence tells the structure tells the function' fallacy, placing function first. We illustrate our approach with examples of critical or unexpected pathways, using MicroScope to demonstrate how tools can be implemented following the principles we advocate. We end with a challenge to the reader.
Affiliation(s)
- Antoine Danchin
- Integromics, Institute of Cardiometabolism and Nutrition, Hôpital de la Pitié-Salpêtrière, 47 Boulevard de l'Hôpital, 75013, Paris, France
- School of Biomedical Sciences, Li KaShing Faculty of Medicine, Hong Kong University, 21 Sassoon Road, Pokfulam, Hong Kong
| | - Christos Ouzounis
- Biological Computation and Process Laboratory, Centre for Research and Technology Hellas, Chemical Process and Energy Resources Institute, Thessalonica, 57001, Greece
| | - Taku Tokuyasu
- Shenzhen Institutes of Advanced Technology, Institute of Synthetic Biology, Shenzhen University Town, 1068 Xueyuan Avenue, Shenzhen, China
| | - Jean-Daniel Zucker
- Integromics, Institute of Cardiometabolism and Nutrition, Hôpital de la Pitié-Salpêtrière, 47 Boulevard de l'Hôpital, 75013, Paris, France
| |
|
19
|
Anselmetti Y, Duchemin W, Tannier E, Chauve C, Bérard S. Phylogenetic signal from rearrangements in 18 Anopheles species by joint scaffolding extant and ancestral genomes. BMC Genomics 2018; 19:96. [PMID: 29764366 PMCID: PMC5954271 DOI: 10.1186/s12864-018-4466-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
Background: Genome rearrangements carry valuable information for phylogenetic inference or the elucidation of molecular mechanisms of adaptation. However, the detection of genome rearrangements is often hampered by current deficiencies in data and methods: genomes obtained from short sequence reads generally have very fragmented assemblies, and comparing multiple gene orders generally leads to computationally intractable algorithmic questions. Results: We present a computational method, ADseq, which, by combining ancestral gene order reconstruction, comparative scaffolding and de novo scaffolding methods, overcomes these two caveats. ADseq simultaneously provides improved assemblies and ancestral genomes, with statistical supports on all local features. Compared to previous comparative methods, it runs in polynomial time, it samples solutions in a probabilistic space, and it can handle a significantly larger gene complement from the considered extant genomes, with complex histories including gene duplications and losses. We use ADseq to provide improved assemblies and a genome history, made of duplications, losses, gene translocations and rearrangements, of 18 complete Anopheles genomes, including several important malaria vectors. We also provide additional support for a differentiated mode of evolution of the sex chromosome and of the autosomes in these mosquito genomes. Conclusions: We demonstrate the method’s ability to improve extant assemblies accurately through a procedure simulating realistic assembly fragmentation. We study a debated issue regarding the phylogeny of the Gambiae complex group of Anopheles genomes in the light of the evolution of chromosomal rearrangements, suggesting that the phylogenetic signal they carry can differ from the phylogenetic signal carried by gene sequences, which are more prone to introgression.
Electronic supplementary material: The online version of this article (10.1186/s12864-018-4466-7) contains supplementary material, which is available to authorized users.
Affiliation(s)
- Yoann Anselmetti
- ISEM, Université de Montpellier, CNRS, IRD, EPHE, Montpellier, France; Univ Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR5558, 43 Boulevard du 11 novembre 1918, Villeurbanne cedex, 69622, France
| | - Wandrille Duchemin
- Univ Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR5558, 43 Boulevard du 11 novembre 1918, Villeurbanne cedex, 69622, France; INRIA Grenoble - Rhône-Alpes, 655 Avenue de l'Europe, Montbonnot-Saint-Martin, 38330, France
| | - Eric Tannier
- Univ Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR5558, 43 Boulevard du 11 novembre 1918, Villeurbanne cedex, 69622, France; INRIA Grenoble - Rhône-Alpes, 655 Avenue de l'Europe, Montbonnot-Saint-Martin, 38330, France
| | - Cedric Chauve
- Department of Mathematics, Simon Fraser University, 8888 University Drive, Burnaby, V5A1S6, BC, Canada
| | - Sèverine Bérard
- ISEM, Université de Montpellier, CNRS, IRD, EPHE, Montpellier, France
| |
|
20
|
|
21
|
McTavish EJ, Drew BT, Redelings B, Cranston KA. How and Why to Build a Unified Tree of Life. Bioessays 2017; 39. [PMID: 28980328 DOI: 10.1002/bies.201700114] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Revised: 08/27/2017] [Indexed: 01/20/2023]
Abstract
Phylogenetic trees are a crucial backbone for a wide breadth of biological research spanning systematics, organismal biology, ecology, and medicine. In 2015, the Open Tree of Life project published a first draft of a comprehensive tree of life, summarizing digitally available taxonomic and phylogenetic knowledge. This paper reviews, investigates, and addresses the following questions as a follow-up to that paper, from the perspective of researchers involved in building this summary of the tree of life: Is there a tree of life and should we reconstruct it? Is available data sufficient to reconstruct the tree of life? Do we have access to phylogenetic inferences in usable form? Can we combine different phylogenetic estimates across the tree of life? And finally, what is the future of understanding the tree of life?
Affiliation(s)
| | - Bryan T Drew
- University of Nebraska at Kearney, Kearney, NE, 68849, USA
| | - Ben Redelings
- University of Kansas, Lawrence, KS, 66045, USA; Duke University, Durham, NC, 27705, USA; Ronin Institute, Durham, NC, 27705, USA
| | | |
|
22
|
Lu B, Zhang L, Leong HW. A program to compute the soft Robinson-Foulds distance between phylogenetic networks. BMC Genomics 2017; 18:111. [PMID: 28361712 PMCID: PMC5374702 DOI: 10.1186/s12864-017-3500-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Background: Over the past two decades, phylogenetic networks have been studied to model reticulate evolutionary events. The relationships among phylogenetic networks, phylogenetic trees and clusters serve as the basis for reconstruction and comparison of phylogenetic networks. To understand these relationships, two problems are raised: the tree containment problem, which asks whether a phylogenetic tree is displayed in a phylogenetic network, and the cluster containment problem, which asks whether a cluster is represented at a node in a phylogenetic network. Both problems are NP-complete. Results: A fast exponential-time algorithm for the cluster containment problem on arbitrary networks is developed and implemented in C. The resulting program is further extended into a computer program for fast computation of the Soft Robinson–Foulds distance between phylogenetic networks. Conclusions: Two computer programs are developed for facilitating reconstruction and validation of phylogenetic network models in evolutionary and comparative genomics. Our simulation tests indicated that they are fast enough for use in practice. Additionally, our simulation data indicate that the distribution of the Soft Robinson–Foulds distance between phylogenetic networks is unlikely to be normal. Electronic supplementary material: The online version of this article (doi:10.1186/s12864-017-3500-5) contains supplementary material, which is available to authorized users.
Affiliation(s)
- Bingxin Lu
- Department of Computer Science, National University of Singapore, 13 Computing Drive, Singapore, 117417, Singapore
| | - Louxin Zhang
- Department of Mathematics, National University of Singapore, 10 Lower Kent Ridge, Singapore, 119076, Singapore
| | - Hon Wai Leong
- Department of Computer Science, National University of Singapore, 13 Computing Drive, Singapore, 117417, Singapore
| |
|
23
|
Danchin A, Fang G. Unknown unknowns: essential genes in quest for function. Microb Biotechnol 2016; 9:530-40. [PMID: 27435445 PMCID: PMC4993169 DOI: 10.1111/1751-7915.12384] [Citation(s) in RCA: 62] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2016] [Accepted: 06/24/2016] [Indexed: 01/18/2023] Open
Abstract
The experimental design of a minimal synthetic genome revealed the presence of a large number of genes without ascribed function, in part because the abstract laws of life must be implemented within ad hoc material contraptions. Creating a function needs recruitment of some pre‐existing structure and this reveals kludges in their set‐up and history. Here, we show that looking for functions as an engineer would help in discovery of a significant number of those, proposed together with conceptual handles allowing investigators to pursue this endeavour in other contexts.
Affiliation(s)
- Antoine Danchin
- Institute of Cardiometabolism and Nutrition, CHU Pitié-Salpêtrière, 47 boulevard de l'Hôpital, 75013, Paris, France
| | - Gang Fang
- Department of Biology, New York University Shanghai Campus, 1555 Century Avenue, Pudong New Area, Shanghai, 200122, China
| |
|
24
|
Gupta RS. Impact of genomics on the understanding of microbial evolution and classification: the importance of Darwin's views on classification. FEMS Microbiol Rev 2016; 40:520-53. [PMID: 27279642 DOI: 10.1093/femsre/fuw011] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/14/2016] [Indexed: 12/24/2022] Open
Abstract
Analyses of genome sequences, by some approaches, suggest that the widespread occurrence of horizontal gene transfers (HGTs) in prokaryotes disguises their evolutionary relationships and have led to questioning of the Darwinian model of evolution for prokaryotes. These inferences are critically examined in the light of comparative genome analysis, characteristic synapomorphies, phylogenetic trees and Darwin's views on examining evolutionary relationships. Genome sequences are enabling discovery of numerous molecular markers (synapomorphies) such as conserved signature indels (CSIs) and conserved signature proteins (CSPs), which are distinctive characteristics of different prokaryotic taxa. Based on these molecular markers, exhibiting a high degree of specificity and predictive ability, numerous prokaryotic taxa of different ranks, currently identified based on the 16S rRNA gene trees, can now be reliably demarcated in molecular terms. Within all studied groups, multiple CSIs and CSPs have been identified for successive nested clades, providing reliable information regarding their hierarchical relationships, and these inferences are not affected by HGTs. These results strongly support Darwin's views on evolution and classification and supplement the current phylogenetic framework based on 16S rRNA in important respects. The identified molecular markers provide important means for developing novel diagnostics and therapeutics and for functional studies providing important insights regarding prokaryotic taxa.
Affiliation(s)
- Radhey S Gupta
- Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, ON, Canada
| |
|