1
|
Di Giulio M. The polyphyletic origins of glycyl-tRNA synthetase and lysyl-tRNA synthetase and their implications. Biosystems 2024; 244:105287. [PMID: 39127441 DOI: 10.1016/j.biosystems.2024.105287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2024] [Revised: 08/07/2024] [Accepted: 08/07/2024] [Indexed: 08/12/2024]
Abstract
I analyzed the polyphyletic origin of glycyl-tRNA synthetase (GlyRS) and lysyl-tRNA synthetase (LysRS), making plausible the following implications. The fact that the genetic code needed to evolve aminoacyl-tRNA synthetases (ARSs) only very late would be in perfect agreement with a late origin, in the main phyletic lineages, of both GlyRS and LysRS. Indeed, as suggested by the coevolution theory, since the genetic code was structured by biosynthetic relationships between amino acids and as these occurred on tRNA-like molecules which were evidently already loaded with amino acids during its structuring, this made possible a late origin of ARSs. All this corroborates the coevolution theory of the origin of the genetic code to the detriment of theories which would instead predict an early intervention of the action of ARSs in organizing the genetic code. Furthermore, the assembly of the GlyRS and LysRS protein domains in main phyletic lineages is itself at least evidence of the possibility that ancestral genes were assembled using pieces of genetic material that coded these protein domains. This is in accordance with the exon theory of genes which postulates that ancestral exons coded for protein domains or modules that were assembled to form the first genes. This theory is exemplified precisely in the evolution of both GlyRS and LysRS which occurred through the assembly of protein domains in the main phyletic lineages, as analyzed here. Furthermore, this late assembly of protein domains of these proteins into the two main phyletic lineages, i.e. a polyphyletic origin of both GlyRS and LysRS, appears to corroborate the progenote evolutionary stage for both LUCA and at least the first part of the evolutionary stages of the ancestor of bacteria and that of archaea. Indeed, this polyphyletic origin would imply that the genetic code was still evolving because at least two ARSs, i.e. proteins that make the genetic code possible today, were still evolving. This would imply that the evolutionary stages involved were characterized not by cells but by protocells, that is, by progenotes because this is precisely the definition of a progenote. This conclusion would be strengthened by the observation that both GlyRS and LysRS originating in the phyletic lineages leading to bacteria and archaea, would demonstrate that, more generally, proteins were most likely still in rapid and progressive evolution. Namely, a polyphyletic origin of proteins which would qualify at least the initial phase of the evolutionary stage of the ancestor of bacteria and that of archaea as stages belonging to the progenote.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Early Evolution of Life Department, Genetic Code and TRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy.
| |
Collapse
|
2
|
Caetano-Anollés K, Aziz MF, Mughal F, Caetano-Anollés G. On Protein Loops, Prior Molecular States and Common Ancestors of Life. J Mol Evol 2024:10.1007/s00239-024-10167-y. [PMID: 38652291 DOI: 10.1007/s00239-024-10167-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Accepted: 03/22/2024] [Indexed: 04/25/2024]
Abstract
The principle of continuity demands the existence of prior molecular states and common ancestors responsible for extant macromolecular structure. Here, we focus on the emergence and evolution of loop prototypes - the elemental architects of protein domain structure. Phylogenomic reconstruction spanning superkingdoms and viruses generated an evolutionary chronology of prototypes with six distinct evolutionary phases defining a most parsimonious evolutionary progression of cellular life. Each phase was marked by strategic prototype accumulation shaping the structures and functions of common ancestors. The last universal common ancestor (LUCA) of cells and viruses and the last universal cellular ancestor (LUCellA) defined stem lines that were structurally and functionally complex. The evolutionary saga highlighted transformative forces. LUCA lacked biosynthetic ribosomal machinery, while the pivotal LUCellA lacked essential DNA biosynthesis and modern transcription. Early proteins therefore relied on RNA for genetic information storage but appeared initially decoupled from it, hinting at transformative shifts of genetic processing. Urancestral loop types suggest advanced folding designs were present at an early evolutionary stage. An exploration of loop geometric properties revealed gradual replacement of prototypes with α-helix and β-strand bracing structures over time, paving the way for the dominance of other loop types. AlphFold2-generated atomic models of prototype accretion described patterns of fold emergence. Our findings favor a ‛processual' model of evolving stem lines aligned with Woese's vision of a communal world. This model prompts discussing the 'problem of ancestors' and the challenges that lie ahead for research in taxonomy, evolution and complexity.
Collapse
Affiliation(s)
- Kelsey Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences and Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
- Callout Biotech, Albuquerque, NM, 87112, USA
| | - M Fayez Aziz
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences and Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| | - Fizza Mughal
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences and Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| | - Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences and Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA.
| |
Collapse
|
3
|
Di Giulio M. The absence of the evolutionary state of the Prokaryote would imply a polyphyletic origin of proteins and that LUCA, the ancestor of bacteria and that of archaea were progenotes. Biosystems 2023; 233:105014. [PMID: 37652180 DOI: 10.1016/j.biosystems.2023.105014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Revised: 08/25/2023] [Accepted: 08/26/2023] [Indexed: 09/02/2023]
Abstract
I analysed the similarity gradient observed in protein families - of phylogenetically deep fundamental traits - of bacteria and archaea, ranging from cases such as the core of the DNA replication apparatus where there is no sequence similarity between the proteins involved, to cases in which, as in the translation initiation factors, only some proteins involved would be homologs, to cases such as for aminoacyl-tRNA synthetases in which most of the proteins involved would be homologs. This pattern of similarity between bacteria and archaea would seem to be a very clear indication of a transitional evolutionary stage that preceded both the Last Bacterial Common Ancestor and the Last Archaeal Common Ancestor, i.e. progenotic stages. Indeed, this similarity pattern would seem to exemplify an ongoing transition as all the evolutionary phases would be represented in it. Instead, in the cellular stage it is expected that these evolutionary phases should have already been overcome, i.e. completed, and therefore no longer detectable. In fact, if we had really been in the presence of the prokaryotic stage then we should not have observed this similarity pattern in proteins involved in defining the ancestral characters of bacteria and archaea, as the completion of the different cellular structures should have required a very low number of proteins to be late evolved in lineages leading to bacteria and archaea. Indeed, the already reached state of the Prokaryote would have determined complete cellular structures therefore a total absence of proteins to evolve independently in the two main phyletic lineages and able to complete the evolution of a particular character already evidently in a definitive state, which, on the other hand, does not appear to have been the case. All this would have prevented the formation of this pattern of similarity which instead would appear to be real. In conclusion, the existence of this pattern of similarity observed in the families of homologous proteins of bacteria and archaea would imply the absence of the evolutionary stage of the Prokaryote and consequently a progenotic status to be assigned to the LUCA. Indeed, the LUCA stage would have been a stage of evolutionary transition because it is belatedly marked by the presence of all the different evolutionary phases, evidently more easily interpretable within the definition of progenote than that of genote precisely because they are inherent in an evolutionary transition and not to an evolution that has already been achieved. Finally, I discuss the importance of these arguments for the polyphyletic origin of proteins.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Early Evolution of Life Department, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy.
| |
Collapse
|
4
|
The origins of the cell membrane, the progenote, and the universal ancestor (LUCA). Biosystems 2022; 222:104799. [DOI: 10.1016/j.biosystems.2022.104799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Revised: 10/21/2022] [Accepted: 10/22/2022] [Indexed: 11/18/2022]
|
5
|
Abdelzaher H, Tawfik SM, Nour A, Abdelkader S, Elbalkiny ST, Abdelkader M, Abbas WA, Abdelnaser A. Climate change, human health, and the exposome: Utilizing OMIC technologies to navigate an era of uncertainty. Front Public Health 2022; 10:973000. [PMID: 36211706 PMCID: PMC9533016 DOI: 10.3389/fpubh.2022.973000] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Accepted: 08/17/2022] [Indexed: 01/25/2023] Open
Abstract
Climate change is an anthropogenic phenomenon that is alarming scientists and non-scientists alike. The emission of greenhouse gases is causing the temperature of the earth to rise and this increase is accompanied by a multitude of climate change-induced environmental exposures with potential health impacts. Tracking human exposure has been a major research interest of scientists worldwide. This has led to the development of exposome studies that examine internal and external individual exposures over their lifetime and correlate them to health. The monitoring of health has also benefited from significant technological advances in the field of "omics" technologies that analyze physiological changes on the nucleic acid, protein, and metabolism levels, among others. In this review, we discuss various climate change-induced environmental exposures and their potential health implications. We also highlight the potential integration of the technological advancements in the fields of exposome tracking, climate monitoring, and omics technologies shedding light on important questions that need to be answered.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Anwar Abdelnaser
- Institute of Global Health and Human Ecology, The American University in Cairo, New Cairo, Egypt
| |
Collapse
|
6
|
Romei M, Sapriel G, Imbert P, Jamay T, Chomilier J, Lecointre G, Carpentier M. Protein folds as synapomorphies of the tree of life. Evolution 2022; 76:1706-1719. [PMID: 35765784 PMCID: PMC9541633 DOI: 10.1111/evo.14550] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Revised: 05/17/2022] [Accepted: 05/31/2022] [Indexed: 01/22/2023]
Abstract
Several studies showed that folds (topology of protein secondary structures) distribution in proteomes may be a global proxy to build phylogeny. Then, some folds should be synapomorphies (derived characters exclusively shared among taxa). However, previous studies used methods that did not allow synapomorphy identification, which requires congruence analysis of folds as individual characters. Here, we map SCOP folds onto a sample of 210 species across the tree of life (TOL). Congruence is assessed using retention index of each fold for the TOL, and principal component analysis for deeper branches. Using a bicluster mapping approach, we define synapomorphic blocks of folds (SBF) sharing similar presence/absence patterns. Among the 1232 folds, 20% are universally present in our TOL, whereas 54% are reliable synapomorphies. These results are similar with CATH and ECOD databases. Eukaryotes are characterized by a large number of them, and several SBFs clearly support nested eukaryotic clades (divergence times from 1100 to 380 mya). Although clearly separated, the three superkingdoms reveal a strong mosaic pattern. This pattern is consistent with the dual origin of eukaryotes and witness secondary endosymbiosis in their phothosynthetic clades. Our study unveils direct analysis of folds synapomorphies as key characters to unravel evolutionary history of species.
Collapse
Affiliation(s)
- Martin Romei
- Institut Systématique Evolution Biodiversité (ISYEB UMR 7205)Sorbonne Université, MNHN, CNRS, EPHE, UAParisFrance,IMPMC (UMR 7590), BiBiP, Sorbonne Université, CNRS, MNHNParisFrance
| | - Guillaume Sapriel
- Institut Systématique Evolution Biodiversité (ISYEB UMR 7205)Sorbonne Université, MNHN, CNRS, EPHE, UAParisFrance,UFR des sciences de la santéUniversité Versailles‐St‐QuentinVersaillesFrance
| | - Pierre Imbert
- Institut Systématique Evolution Biodiversité (ISYEB UMR 7205)Sorbonne Université, MNHN, CNRS, EPHE, UAParisFrance
| | - Théo Jamay
- Institut Systématique Evolution Biodiversité (ISYEB UMR 7205)Sorbonne Université, MNHN, CNRS, EPHE, UAParisFrance
| | | | - Guillaume Lecointre
- Institut Systématique Evolution Biodiversité (ISYEB UMR 7205)Sorbonne Université, MNHN, CNRS, EPHE, UAParisFrance
| | - Mathilde Carpentier
- Institut Systématique Evolution Biodiversité (ISYEB UMR 7205)Sorbonne Université, MNHN, CNRS, EPHE, UAParisFrance
| |
Collapse
|
7
|
Estrada A, Suárez-Díaz E, Becerra A. Reconstructing the Last Common Ancestor: Epistemological and Empirical Challenges. Acta Biotheor 2022; 70:15. [PMID: 35575816 DOI: 10.1007/s10441-022-09439-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 04/25/2022] [Indexed: 11/24/2022]
Abstract
Reconstructing the genetic traits of the Last Common Ancestor (LCA) and the Tree of Life (TOL) are two examples of the reaches of contemporary molecular phylogenetics. Nevertheless, the whole enterprise has led to paradoxical results. The presence of Lateral Gene Transfer poses epistemic and empirical challenges to meet these goals; the discussion around this subject has been enriched by arguments from philosophers and historians of science. At the same time, a few but influential research groups have aimed to reconstruct the LCA with rich-in-detail hypotheses and high-resolution gene catalogs and metabolic traits. We argue that LGT poses insurmountable challenges for detailed and rich in details reconstructions and propose, instead, a middle-ground position with the reconstruction of a slim LCA based on traits under strong pressures of Negative Natural Selection, and for the need of consilience with evidence from organismal biology and geochemistry. We defend a cautionary perspective that goes beyond the statistical analysis of gene similarities and assumes the broader consequences of evolving empirical data and epistemic pluralism in the reconstruction of early life.
Collapse
Affiliation(s)
- Amadeo Estrada
- Posgrado en Ciencias Biológicas, Universidad Nacional Autónoma de México, Coyoacán, Mexico
| | - Edna Suárez-Díaz
- Facultad de Ciencias, Universidad Nacional Autónoma de México, Circuito Exterior Ciudad Universitaria, 04510, Coyoacán, DF, Mexico
| | - Arturo Becerra
- Facultad de Ciencias, Universidad Nacional Autónoma de México, Circuito Exterior Ciudad Universitaria, 04510, Coyoacán, DF, Mexico.
| |
Collapse
|
8
|
Fernandes GDC, Turchetto‐Zolet AC, Passaglia LMP. Glutamine synthetase evolutionary history revisited: tracing back beyond the Last Universal Common Ancestor. Evolution 2022; 76:605-622. [DOI: 10.1111/evo.14434] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2021] [Revised: 12/14/2021] [Accepted: 12/22/2021] [Indexed: 11/29/2022]
Affiliation(s)
- Gabriela de Carvalho Fernandes
- Departamento de Genética and Programa de Pós‐graduação em Genética e Biologia Molecular Universidade Federal do Rio Grande do Sul (UFRGS) Av. Bento Gonçalves, 9500, Prédio 43312, Mailbox 15053 Porto Alegre RS 91‐501‐970 Brazil
| | - Andreia Carina Turchetto‐Zolet
- Departamento de Genética and Programa de Pós‐graduação em Genética e Biologia Molecular Universidade Federal do Rio Grande do Sul (UFRGS) Av. Bento Gonçalves, 9500, Prédio 43312, Mailbox 15053 Porto Alegre RS 91‐501‐970 Brazil
| | - Luciane Maria Pereira Passaglia
- Departamento de Genética and Programa de Pós‐graduação em Genética e Biologia Molecular Universidade Federal do Rio Grande do Sul (UFRGS) Av. Bento Gonçalves, 9500, Prédio 43312, Mailbox 15053 Porto Alegre RS 91‐501‐970 Brazil
| |
Collapse
|
9
|
Di Giulio M. The RNase P, LUCA, the ancestors of the life domains, the progenote, and the tree of life. Biosystems 2021; 212:104604. [PMID: 34979158 DOI: 10.1016/j.biosystems.2021.104604] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Revised: 12/24/2021] [Accepted: 12/29/2021] [Indexed: 11/15/2022]
Abstract
I have tried to interpret the phylogenetic distribution of the RNase P with the aim of helping to clarify the stage reached by the evolution of cellularity in the Last Universal Common Ancestor (LUCA); that is to say, if the evolutionary stage of the LUCA was represented by a protocell (progenote) or by a complete cell (genote). Since there are several arguments that lead one to believe that only the RNA moiety of the RNase P was present in the LUCA, this might imply that this evolutionary stage was actually the RNA world. If true this would imply that the LUCA was a progenote because the RNA world being a world subject to multiple evolutionary transitions that would involve a high noise at many its levels, which would fall within the definition of the progenote. Furthermore, since RNA-mediated catalysis is much less efficient than protein-mediated catalysis, then the only RNA moiety that was present in the LUCA could imply - by per se, without invoking the existence of the RNA world - that the LUCA was a progenote because an inefficient catalysis might have characterized this evolutionary stage. This evolutionary stage would still fall under the definition of the progenote. In addition, the observation that the protein moieties of the RNase P of bacteria and archaea are not-homologs would imply that these originated independently in the two main phyletic lineages. In turn, this would imply the progenotic nature of the ancestors of both archaea and bacteria. Indeed, it is admissible that such a late origin - in the main phyletic lineages - of the protein moieties of the RNase P is witness to an evolutionary transition towards a more efficient catalysis, evidently made clear precisely by the evolution of the protein moieties of the RNase P which would have helped the RNA of the RNase P to a more efficient catalysis. Hence, this would date that evolutionary moment as a transition to a much more efficient catalysis and consequently would imply which in that evolutionary stage there was the actual transition from the progenotic to genotic status. Finally, this late origin of the RNase P protein moieties in the bacterial and archaeal domains per se could imply the presence of a progenotic stage for their ancestors, or at least that a cell stage would have been much less likely. In fact, it is true that genes can originate both in a cellular and in a progenotic stage, but they mainly typify the latter because they are, by definition, in formation. Then it is expected that in the evolutionary stage of the formation of the main phyletic lineages - that is to say, in an evolutionary time in which the formation of genes might be expected - that the origin of proteins is to be related to a rapid and progressive evolution typical of the progenote precisely because in such an evolutionary stage the origin of genes is more easily and simply explained as reflecting a progenotic rather than a genotic stage. Indeed, if instead the evolutionary stage of the ancestors of bacteria and archaea had been the cellular one, then observing the origin of the protein moieties of the RNase P would have been, to some extent, anomalous because this completion should have already occurred, simply because the transformation of a ribozyme into an enzyme should have already taken place precisely because it falls within the very definition of the cellular status. The conclusion is that both the LUCA and the ancestor of archaea and that of bacteria may have been progenotes. If these arguments were true then either the tree of life as commonly understood would not exist and therefore the main phyletic lineages would have originated directly from the LUCA, or there would have been at least two different populations of progenotes that would have finally defined the domain of bacteria and that of archaea.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena (L'Aquila), Italy.
| |
Collapse
|
10
|
The Indeterminacy Bottleneck: Implications for Habitable Worlds. Acta Biotheor 2021; 70:1. [PMID: 34862917 DOI: 10.1007/s10441-021-09432-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2021] [Accepted: 11/01/2021] [Indexed: 10/19/2022]
Abstract
It is often assumed that the transition between chemical evolution and biological evolution undergoes a smooth process; that once life has arisen, it will automatically 'flood' a solar system body. However, there is no a priori reason to assume that a link between them is a given. The fact that both chemical evolution and biological evolution meet in a single point can be critical. Thus, one may ask: can a world's environment be favourable for chemical evolution but not for biological evolution, or vice versa? This is an important question worth exploration because certain worlds in the solar system in the past seemed to possess the possibility of chemical evolution, while several worlds in the present seem to exhibit such a possibility. Have such solar system bodies thus been, or are, 'flooded' by life? Did they possess the opportunity for biological evolution? The answer depends on the very nature of certain conditions under which evolution occurs, which may indicate that a link between chemical evolution and biological evolution is not automatically realised on a habitable solar system body. Thus, these conditions imply that in the emergence and distribution of cellular life, there exists an indeterminacy bottleneck at which chemical evolution and biological evolution meet through a single cell, whose descendants goes 'information explosive', 'entropy implosive' and 'habitat expansive', which determine whether life moves on to new environments. The consequence is that a world's environment can indeed be favourable for biological evolution, but not for chemical evolution. Thus, even if chemical evolution leads to the emergence of a microbial organism in a world, then it is not a given that such a first life form will be subjected to distribution to other environments; and not a given that its existence will continue in the environment it originated in. Thus, the bottleneck may be one of the decisive factors in the differences between habitable and inhabited worlds.
Collapse
|
11
|
Di Giulio M. The phylogenetic distribution of the cell division system would not imply a cellular LUCA but a progenotic LUCA. Biosystems 2021; 210:104563. [PMID: 34653531 DOI: 10.1016/j.biosystems.2021.104563] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 10/08/2021] [Accepted: 10/08/2021] [Indexed: 12/28/2022]
Abstract
The stage reached by the evolution of cellularity in the Last Universal Common Ancestor (LUCA) has not yet been identified. In actual fact, it has not been clarified whether the LUCA was a cell (genote) or a protocell (progenote). Recently, Pende et al. (2021) analysed the phylogenetic distribution of the cell division system present in bacteria and archaea reaching the conclusion that LUCA was a cell and not a progenote. I find this conclusion unreasonable with respect to the observations they presented. One of the points is that the presence in the domains of life of many genes - some paralogs - which would define the membrane-remodeling superfamily would seem to imply a tempo and a mode of evolution for the LUCA more typical of the progenote than the genote. Indeed, the simultaneous presence of different genes - in a given evolutionary stage and with functions that are also partially correlated - would seem to define a heterogeneity that would appear to be the expression of a rapid and progressive evolution precisely because this evolution would have taken place in the diversification of all these genes. Furthermore, the presence of different genes coding for the function of cell division and related functions could reflect a progenotic status in LUCA, precisely because these functions might have originated from a single ancestral gene instead coding for a protein (or proteins) with multiple functions, and therefore an expression of a rapid and progressive evolution typical of the progenote. I also criticize other aspects of considerations made by Pende at al. (2021). The arguments presented here together with those existing in the literature make the hypothesis of a cellular LUCA favoured by Pende et al. (2021) unlikely.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena (L'Aquila), Italy.
| |
Collapse
|
12
|
Caetano-Anollés G. The Compressed Vocabulary of Microbial Life. Front Microbiol 2021; 12:655990. [PMID: 34305827 PMCID: PMC8292947 DOI: 10.3389/fmicb.2021.655990] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Accepted: 04/27/2021] [Indexed: 12/22/2022] Open
Abstract
Communication is an undisputed central activity of life that requires an evolving molecular language. It conveys meaning through messages and vocabularies. Here, I explore the existence of a growing vocabulary in the molecules and molecular functions of the microbial world. There are clear correspondences between the lexicon, syntax, semantics, and pragmatics of language organization and the module, structure, function, and fitness paradigms of molecular biology. These correspondences are constrained by universal laws and engineering principles. Macromolecular structure, for example, follows quantitative linguistic patterns arising from statistical laws that are likely universal, including the Zipf's law, a special case of the scale-free distribution, the Heaps' law describing sublinear growth typical of economies of scales, and the Menzerath-Altmann's law, which imposes size-dependent patterns of decreasing returns. Trade-off solutions between principles of economy, flexibility, and robustness define a "triangle of persistence" describing the impact of the environment on a biological system. The pragmatic landscape of the triangle interfaces with the syntax and semantics of molecular languages, which together with comparative and evolutionary genomic data can explain global patterns of diversification of cellular life. The vocabularies of proteins (proteomes) and functions (functionomes) revealed a significant universal lexical core supporting a universal common ancestor, an ancestral evolutionary link between Bacteria and Eukarya, and distinct reductive evolutionary strategies of language compression in Archaea and Bacteria. A "causal" word cloud strategy inspired by the dependency grammar paradigm used in catenae unfolded the evolution of lexical units associated with Gene Ontology terms at different levels of ontological abstraction. While Archaea holds the smallest, oldest, and most homogeneous vocabulary of all superkingdoms, Bacteria heterogeneously apportions a more complex vocabulary, and Eukarya pushes functional innovation through mechanisms of flexibility and robustness.
Collapse
Affiliation(s)
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, and C. R. Woese Institute for Genomic Biology, University of Illinois, Urbana, IL, United States
| |
Collapse
|
13
|
Di Giulio M. Errors of the ancestral translation, LUCA, and nature of its direct descendants. Biosystems 2021; 206:104433. [PMID: 33915233 DOI: 10.1016/j.biosystems.2021.104433] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2021] [Revised: 04/20/2021] [Accepted: 04/20/2021] [Indexed: 10/21/2022]
Abstract
I analyzed the implications of the observation that the methyltransferases, Trm5 and TrmD, which perform the methylation of the 37th base (m1G37) in tRNAs of bacteria and archaea respectively, are not homologous proteins. The first implication is that these methyltransferases originated very late only when the fundamental lineages leading to bacteria and archaea had separated, otherwise the two methyltransferases would have been homologous enzymes, which they are not. The conclusion that Trm5 and TrmD originated only when the main lineages were defined would imply that at least some aspects of the translation, such as +1 frameshifting, were still in rapid and progressive evolution, that is, they were still originating. This would in itself imply a high rate of translation errors because the absence of m1G37 from tRNAs could have determined a high rate of +1 translational frameshifting in the reading of mRNAs, identifying this stage as that of a phase of the origin of the genetic code. Furthermore, the observation that the frameshifting mechanism was still in rapid and progressive evolution in such an advanced evolutionary stage would imply that other mechanisms concerning translation were still rapidly evolving simply because it would be very unique if only the frameshifting mechanism were the only one still originating. Importantly, the observation that in archaea m1G37 also acts as a determinant of the identity of the tRNACysGCA would imply in itself that some aspects of the origin of the genetic code were still originating, greatly strengthening the hypothesis that other aspects of the translation apparatus were still in rapid and progressive evolution. Then, all this would imply a status of progenote for LUCA and ancestors of archaea and bacteria because a high rate of translation errors would fall within the definition of progenote.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy; Institute of Biosciences and Bioresources, National Research Council, Via P. Castellino, 111, 80131, Naples, Italy.
| |
Collapse
|
14
|
Nasir A, Mughal F, Caetano-Anollés G. The tree of life describes a tripartite cellular world. Bioessays 2021; 43:e2000343. [PMID: 33837594 DOI: 10.1002/bies.202000343] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2020] [Revised: 03/11/2021] [Accepted: 03/15/2021] [Indexed: 12/28/2022]
Abstract
The canonical view of a 3-domain (3D) tree of life was recently challenged by the discovery of Asgardarchaeota encoding eukaryote signature proteins (ESPs), which were treated as missing links of a 2-domain (2D) tree. Here we revisit the debate. We discuss methodological limitations of building trees with alignment-dependent approaches, which often fail to satisfactorily address the problem of ''gaps.'' In addition, most phylogenies are reconstructed unrooted, neglecting the power of direct rooting methods. Alignment-free methodologies lift most difficulties but require employing realistic evolutionary models. We argue that the discoveries of Asgards and ESPs, by themselves, do not rule out the 3D tree, which is strongly supported by comparative and evolutionary genomic analyses and vast genomic and biochemical superkingdom distinctions. Given uncertainties of retrodiction and interpretation difficulties, we conclude that the 3D view has not been falsified but instead has been strengthened by genomic analyses. In turn, the objections to the 2D model have not been lifted. The debate remains open. Also see the video abstract here: https://youtu.be/-6TBN0bubI8.
Collapse
Affiliation(s)
- Arshan Nasir
- Theoretical Biology and Biophysics (T-6), Los Alamos National Laboratory, Los Alamos, New Mexico, USA
| | - Fizza Mughal
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| | - Gustavo Caetano-Anollés
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, USA
| |
Collapse
|
15
|
Di Giulio M. The late appearance of DNA, the nature of the LUCA and ancestors of the domains of life. Biosystems 2020; 202:104330. [PMID: 33352234 DOI: 10.1016/j.biosystems.2020.104330] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Revised: 12/15/2020] [Accepted: 12/15/2020] [Indexed: 01/27/2023]
Abstract
It has been firmly observed that replicative DNA polymerases of bacteria, archaea and eukaryotes are not homologous proteins. This lack of homology in the replication apparatus among the domains of life is not only compatible with but would seem to imply the view that the emergence of DNA occurred in the fundamental cellular lineages. In consequence, this diversity of DNA polymerase would go back to the level of ancestors of the domains of life and to the evolutionary time in which the DNA emerged. Therefore, the presumed evolutionary stage linked to the RNA- > DNA transition would have occurred only at the level of ancestors of the main lineages of the tree of life. Thus, the high noise associated with this major evolutionary transition and the impossibility for a cellular stage to generate different fundamental genetically profound traits - such as the different replication apparatuses of bacteria, archaea and eukaryotes - would imply not only that the last universal common ancestor (LUCA) was a progenote but that the ancestors of the domains of life were also at this evolutionary stage. So, I criticize the hypotheses which want, instead, that completely different cells - such as, bacteria and archaea - could have originated from a cellular LUCA.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena (L'Aquila), Italy; Institute of Biosciences and Bioresources, National Research Council, Via P. Castellino, 111, 80131, Naples, Italy.
| |
Collapse
|
16
|
Di Giulio M. LUCA as well as the ancestors of archaea, bacteria and eukaryotes were progenotes: Inference from the distribution and diversity of the reading mechanism of the AUA and AUG codons in the domains of life. Biosystems 2020; 198:104239. [PMID: 32919036 DOI: 10.1016/j.biosystems.2020.104239] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2020] [Revised: 09/01/2020] [Accepted: 09/01/2020] [Indexed: 11/25/2022]
Abstract
Here I use the rationale assuming that if of a certain trait that exerts its function in some aspect of the genetic code or, more generally, in protein synthesis, it is possible to identify the evolutionary stage of its origin then it would imply that this evolutionary moment would be characterized by a high translational noise because this trait would originate for the first time during that evolutionary stage. That is to say, if this trait had a non-marginal role in the realization of the genetic code, or in protein synthesis, then the origin of this trait would imply that, more generally, it was the genetic code itself that was still originating. But if the genetic code were still originating - at that precise evolutionary stage - then this would imply that there was a high translational noise which in turn would imply that it was in the presence of a protocell, i.e. a progenote that was by definition characterized by high translational noise. I apply this rationale to the mechanism of modification of the base 34 of the anticodon of an isoleucine tRNA that leads to the reading of AUA and AUG codons in archaea, bacteria and eukaryotes. The phylogenetic distribution of this mechanism in these phyletic lineages indicates that this mechanism originated only after the evolutionary stage of the last universal common ancestor (LUCA), namely, during the formation of cellular domains, i.e., at the stage of ancestors of these main phyletic lineages. Furthermore, given that this mechanism of modification of the base 34 of the anticodon of the isoleucine tRNA would result to emerge at a stage of the origin of the genetic code - despite in its terminal phases - then all this would imply that the ancestors of bacteria, archaea and eukaryotes were progenotes. If so, all the more so, the LUCA would also be a progenote since it preceded these ancestors temporally. A consequence of all this reasoning might be that since these three ancestors were of the progenotes that were different from each other, if at least one of them had evolved into at least two real and different cells - basically different from each other - then the number of cellular domains would not be three but it would be greater than three.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena (L'Aquila), Italy; Institute of Biosciences and Bioresources, National Research Council, Via P. Castellino, 111, 80131, Naples, Italy.
| |
Collapse
|
17
|
Danchin A, Sekowska A, You C. One-carbon metabolism, folate, zinc and translation. Microb Biotechnol 2020; 13:899-925. [PMID: 32153134 PMCID: PMC7264889 DOI: 10.1111/1751-7915.13550] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2020] [Accepted: 02/17/2020] [Indexed: 12/16/2022] Open
Abstract
The translation process, central to life, is tightly connected to the one-carbon (1-C) metabolism via a plethora of macromolecule modifications and specific effectors. Using manual genome annotations and putting together a variety of experimental studies, we explore here the possible reasons of this critical interaction, likely to have originated during the earliest steps of the birth of the first cells. Methionine, S-adenosylmethionine and tetrahydrofolate dominate this interaction. Yet, 1-C metabolism is unlikely to be a simple frozen accident of primaeval conditions. Reactive 1-C species (ROCS) are buffered by the translation machinery in a way tightly associated with the metabolism of iron-sulfur clusters, zinc and potassium availability, possibly coupling carbon metabolism to nitrogen metabolism. In this process, the highly modified position 34 of tRNA molecules plays a critical role. Overall, this metabolic integration may serve both as a protection against the deleterious formation of excess carbon under various growth transitions or environmental unbalanced conditions and as a regulator of zinc homeostasis, while regulating input of prosthetic groups into nascent proteins. This knowledge should be taken into account in metabolic engineering.
Collapse
Affiliation(s)
- Antoine Danchin
- AMAbiotics SASInstitut Cochin24 rue du Faubourg Saint‐Jacques75014ParisFrance
- School of Biomedical SciencesLi Ka Shing Faculty of MedicineThe University of Hong KongS.A.R. Hong KongChina
| | - Agnieszka Sekowska
- AMAbiotics SASInstitut Cochin24 rue du Faubourg Saint‐Jacques75014ParisFrance
| | - Conghui You
- Shenzhen Key Laboratory of Microbial Genetic EngineeringCollege of Life Sciences and OceanologyShenzhen University1066 Xueyuan Rd518055ShenzhenChina
| |
Collapse
|
18
|
The phylogenetic distribution of the glutaminyl-tRNA synthetase and Glu-tRNA Gln amidotransferase in the fundamental lineages would imply that the ancestor of archaea, that of eukaryotes and LUCA were progenotes. Biosystems 2020; 196:104174. [PMID: 32535177 DOI: 10.1016/j.biosystems.2020.104174] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Revised: 05/25/2020] [Accepted: 05/25/2020] [Indexed: 12/21/2022]
Abstract
The function of the glutaminyl-tRNA synthetase and Glu-tRNAGln amidotransferase might be related to the origin of the genetic code because, for example, glutaminyl-tRNA synthetase catalyses the fundamental reaction that makes the genetic code. If the evolutionary stage of the origin of these two enzymes could be unambiguously identified, then the genetic code should still have been originating at that particular evolutionary stage because the fundamental reaction that makes the code itself was still evidently evolving. This would result in that particular evolutionary moment being attributed to the evolutionary stage of the progenote because it would have a relationship between the genotype and the phenotype not yet fully realized because the genetic code was precisely still originating. I then analyzed the distribution of the glutaminyl-tRNA synthetase and Glu-tRNAGln aminodotrasferase in the main phyletic lineages. Since in some cases the origin of these two enzymes can be related to the evolutionary stages of ancestors of archaea and eukaryotes, this would indicate these ancestors as progenotes because at that evolutionary moment the genetic code was evidently still evolving, thus realizing the definition of progenote. The conclusion that the ancestor of archaea and that of eukaryotes were progenotes would imply that even the last universal common ancestor (LUCA) was a progenote because it appeared, on the tree of life, temporally before these ancestors.
Collapse
|
19
|
Bokhari RH, Amirjan N, Jeong H, Kim KM, Caetano-Anollés G, Nasir A. Bacterial Origin and Reductive Evolution of the CPR Group. Genome Biol Evol 2020; 12:103-121. [PMID: 32031619 PMCID: PMC7093835 DOI: 10.1093/gbe/evaa024] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/31/2020] [Indexed: 12/24/2022] Open
Abstract
The candidate phyla radiation (CPR) is a proposed subdivision within the bacterial domain comprising several candidate phyla. CPR organisms are united by small genome and physical sizes, lack several metabolic enzymes, and populate deep branches within the bacterial subtree of life. These features raise intriguing questions regarding their origin and mode of evolution. In this study, we performed a comparative and phylogenomic analysis to investigate CPR origin and evolution. Unlike previous gene/protein sequence-based reports of CPR evolution, we used protein domain superfamilies classified by protein structure databases to resolve the evolutionary relationships of CPR with non-CPR bacteria, Archaea, Eukarya, and viruses. Across all supergroups, CPR shared maximum superfamilies with non-CPR bacteria and were placed as deep branching bacteria in most phylogenomic trees. CPR contributed 1.22% of new superfamilies to bacteria including the ribosomal protein L19e and encoded four core superfamilies that are likely involved in cell-to-cell interaction and establishing episymbiotic lifestyles. Although CPR and non-CPR bacterial proteomes gained common superfamilies over the course of evolution, CPR and Archaea had more common losses. These losses mostly involved metabolic superfamilies. In fact, phylogenies built from only metabolic protein superfamilies separated CPR and non-CPR bacteria. These findings indicate that CPR are bacterial organisms that have probably evolved in an Archaea-like manner via the early loss of metabolic functions. We also discovered that phylogenies built from metabolic and informational superfamilies gave contrasting views of the groupings among Archaea, Bacteria, and Eukarya, which add to the current debate on the evolutionary relationships among superkingdoms.
Collapse
Affiliation(s)
| | - Nooreen Amirjan
- Department of Biosciences, COMSATS University Islamabad, Pakistan
| | - Hyeonsoo Jeong
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA
| | - Kyung Mo Kim
- Division of Polar Life Sciences, Korea Polar Research Institute, Incheon, Republic of Korea
| | - Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana
| | - Arshan Nasir
- Department of Biosciences, COMSATS University Islamabad, Pakistan
- Theoretical Biology & Biophysics Group, Los Alamos National Laboratory, Los Alamos, New Mexico
| |
Collapse
|
20
|
Preiner M, Asche S, Becker S, Betts HC, Boniface A, Camprubi E, Chandru K, Erastova V, Garg SG, Khawaja N, Kostyrka G, Machné R, Moggioli G, Muchowska KB, Neukirchen S, Peter B, Pichlhöfer E, Radványi Á, Rossetto D, Salditt A, Schmelling NM, Sousa FL, Tria FDK, Vörös D, Xavier JC. The Future of Origin of Life Research: Bridging Decades-Old Divisions. Life (Basel) 2020; 10:E20. [PMID: 32110893 PMCID: PMC7151616 DOI: 10.3390/life10030020] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Revised: 02/19/2020] [Accepted: 02/21/2020] [Indexed: 12/12/2022] Open
Abstract
Research on the origin of life is highly heterogeneous. After a peculiar historical development, it still includes strongly opposed views which potentially hinder progress. In the 1st Interdisciplinary Origin of Life Meeting, early-career researchers gathered to explore the commonalities between theories and approaches, critical divergence points, and expectations for the future. We find that even though classical approaches and theories-e.g. bottom-up and top-down, RNA world vs. metabolism-first-have been prevalent in origin of life research, they are ceasing to be mutually exclusive and they can and should feed integrating approaches. Here we focus on pressing questions and recent developments that bridge the classical disciplines and approaches, and highlight expectations for future endeavours in origin of life research.
Collapse
Affiliation(s)
- Martina Preiner
- Institute of Molecular Evolution, University of Düsseldorf, 40225 Düsseldorf, Germany; (S.G.G.); (F.D.K.T.)
| | - Silke Asche
- School of Chemistry, University of Glasgow, Glasgow G128QQ, UK;
| | - Sidney Becker
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK;
| | - Holly C. Betts
- School of Earth Sciences, University of Bristol, Bristol BS8 1RL, UK;
| | - Adrien Boniface
- Environmental Microbial Genomics, Laboratoire Ampère, Ecole Centrale de Lyon, Université de Lyon, 69130 Ecully, France;
| | - Eloi Camprubi
- Origins Center, Department of Earth Sciences, Utrecht University, 3584 CB Utrecht, The Netherlands;
| | - Kuhan Chandru
- Space Science Center (ANGKASA), Institute of Climate Change, Level 3, Research Complex, National University of Malaysia, UKM Bangi 43600, Selangor, Malaysia;
- Department of Physical Chemistry, University of Chemistry and Technology, Prague, Technicka 5, 16628 Prague 6–Dejvice, Czech Republic
| | - Valentina Erastova
- UK Centre for Astrobiology, School of Chemistry, University of Edinburgh, Edinburgh EH9 3FJ, UK;
| | - Sriram G. Garg
- Institute of Molecular Evolution, University of Düsseldorf, 40225 Düsseldorf, Germany; (S.G.G.); (F.D.K.T.)
| | - Nozair Khawaja
- Institut für Geologische Wissenschaften, Freie Universität Berlin, 12249 Berlin, Germany;
| | | | - Rainer Machné
- Institute of Synthetic Microbiology, University of Düsseldorf, 40225 Düsseldorf, Germany; (R.M.); (N.M.S.)
- Quantitative and Theoretical Biology, University of Düsseldorf, 40225 Düsseldorf, Germany
| | - Giacomo Moggioli
- School of Biological and Chemical Sciences, Queen Mary University of London, London E1 4DQ, UK;
| | - Kamila B. Muchowska
- Université de Strasbourg, CNRS, ISIS, 8 allée Gaspard Monge, 67000 Strasbourg, France;
| | - Sinje Neukirchen
- Archaea Biology and Ecogenomics Division, University of Vienna, 1090 Vienna, Austria; (S.N.); (E.P.); (F.L.S.)
| | - Benedikt Peter
- Cellular and Molecular Biophysics, Max Planck Institute of Biochemistry, 82152 Martinsried, Germany;
| | - Edith Pichlhöfer
- Archaea Biology and Ecogenomics Division, University of Vienna, 1090 Vienna, Austria; (S.N.); (E.P.); (F.L.S.)
| | - Ádám Radványi
- Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös Loránd University, Pázmány Péter sétány 1/C, 1117 Budapest, Hungary (D.V.)
- Institute of Evolution, MTA Centre for Ecological Research, Klebelsberg Kuno u. 3., H-8237 Tihany, Hungary
| | - Daniele Rossetto
- Department of Cellular, Computational and Integrative Biology (CIBIO), University of Trento, 38123 Trento, Italy;
| | - Annalena Salditt
- Systems Biophysics, Physics Department, Ludwig-Maximilians-Universität München, 80799 Munich, Germany;
| | - Nicolas M. Schmelling
- Institute of Synthetic Microbiology, University of Düsseldorf, 40225 Düsseldorf, Germany; (R.M.); (N.M.S.)
- Cluster of Excellence on Plant Sciences (CEPLAS), University of Cologne, 50674 Cologne, Germany
| | - Filipa L. Sousa
- Archaea Biology and Ecogenomics Division, University of Vienna, 1090 Vienna, Austria; (S.N.); (E.P.); (F.L.S.)
| | - Fernando D. K. Tria
- Institute of Molecular Evolution, University of Düsseldorf, 40225 Düsseldorf, Germany; (S.G.G.); (F.D.K.T.)
| | - Dániel Vörös
- Department of Plant Systematics, Ecology and Theoretical Biology, Eötvös Loránd University, Pázmány Péter sétány 1/C, 1117 Budapest, Hungary (D.V.)
- Institute of Evolution, MTA Centre for Ecological Research, Klebelsberg Kuno u. 3., H-8237 Tihany, Hungary
| | - Joana C. Xavier
- Institute of Molecular Evolution, University of Düsseldorf, 40225 Düsseldorf, Germany; (S.G.G.); (F.D.K.T.)
| |
Collapse
|
21
|
Silva SR, Moraes AP, Penha HA, Julião MHM, Domingues DS, Michael TP, Miranda VFO, Varani AM. The Terrestrial Carnivorous Plant Utricularia reniformis Sheds Light on Environmental and Life-Form Genome Plasticity. Int J Mol Sci 2019; 21:E3. [PMID: 31861318 PMCID: PMC6982007 DOI: 10.3390/ijms21010003] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2019] [Revised: 12/13/2019] [Accepted: 12/15/2019] [Indexed: 12/22/2022] Open
Abstract
Utricularia belongs to Lentibulariaceae, a widespread family of carnivorous plants that possess ultra-small and highly dynamic nuclear genomes. It has been shown that the Lentibulariaceae genomes have been shaped by transposable elements expansion and loss, and multiple rounds of whole-genome duplications (WGD), making the family a platform for evolutionary and comparative genomics studies. To explore the evolution of Utricularia, we estimated the chromosome number and genome size, as well as sequenced the terrestrial bladderwort Utricularia reniformis (2n = 40, 1C = 317.1-Mpb). Here, we report a high quality 304 Mb draft genome, with a scaffold NG50 of 466-Kb, a BUSCO completeness of 87.8%, and 42,582 predicted genes. Compared to the smaller and aquatic U. gibba genome (101 Mb) that has a 32% repetitive sequence, the U. reniformis genome is highly repetitive (56%). The structural differences between the two genomes are the result of distinct fractionation and rearrangements after WGD, and massive proliferation of LTR-retrotransposons. Moreover, GO enrichment analyses suggest an ongoing gene birth-death-innovation process occurring among the tandem duplicated genes, shaping the evolution of carnivory-associated functions. We also identified unique patterns of developmentally related genes that support the terrestrial life-form and body plan of U. reniformis. Collectively, our results provided additional insights into the evolution of the plastic and specialized Lentibulariaceae genomes.
Collapse
Affiliation(s)
- Saura R. Silva
- Departamento de Tecnologia, Faculdade de Ciências Agrárias e Veterinárias, UNESP—Universidade Estadual Paulista, Jaboticabal 14884-900, Brazil; (S.R.S.); (H.A.P.); (M.H.M.J.)
| | - Ana Paula Moraes
- Centro de Ciências Naturais e Humanas, Universidade Federal do ABC, São Bernardo do Campo 09606-070, Brazil;
| | - Helen A. Penha
- Departamento de Tecnologia, Faculdade de Ciências Agrárias e Veterinárias, UNESP—Universidade Estadual Paulista, Jaboticabal 14884-900, Brazil; (S.R.S.); (H.A.P.); (M.H.M.J.)
| | - Maria H. M. Julião
- Departamento de Tecnologia, Faculdade de Ciências Agrárias e Veterinárias, UNESP—Universidade Estadual Paulista, Jaboticabal 14884-900, Brazil; (S.R.S.); (H.A.P.); (M.H.M.J.)
| | - Douglas S. Domingues
- Departamento de Botânica, Instituto de Biociências, UNESP—Universidade Estadual Paulista, Rio Claro 13506-900, Brazil;
| | | | - Vitor F. O. Miranda
- Departamento de Biologia Aplicada à Agropecuária, Faculdade de Ciências Agrárias e Veterinárias, UNESP—Universidade Estadual Paulista, Jaboticabal 14884-900, Brazil
| | - Alessandro M. Varani
- Departamento de Tecnologia, Faculdade de Ciências Agrárias e Veterinárias, UNESP—Universidade Estadual Paulista, Jaboticabal 14884-900, Brazil; (S.R.S.); (H.A.P.); (M.H.M.J.)
| |
Collapse
|
22
|
Pollack JD, Gerard D, Makhatadze GI, Pearl DK. Evolutionary conservation and structural localizations suggest a physical trace of metabolism’s progressive geochronological emergence. J Biomol Struct Dyn 2019; 38:3700-3719. [DOI: 10.1080/07391102.2019.1679666] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- J. Dennis Pollack
- Department of Molecular Virology, Immunology and Medical Genetics, College of Medicine, The Ohio State University, Columbus, Ohio, USA
| | - David Gerard
- Department of Mathematics and Statistics, American University, Washington, DC, USA
| | - George I. Makhatadze
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, New York, USA
| | - Dennis K. Pearl
- Department of Statistics, Penn State University, University Park, Pennsylvania, USA
| |
Collapse
|
23
|
Caetano-Anollés D, Nasir A, Kim KM, Caetano-Anollés G. Testing Empirical Support for Evolutionary Models that Root the Tree of Life. J Mol Evol 2019; 87:131-142. [PMID: 30887086 PMCID: PMC6443624 DOI: 10.1007/s00239-019-09891-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Accepted: 03/06/2019] [Indexed: 12/12/2022]
Abstract
Trees of life (ToLs) can only be rooted with direct methods that seek optimization of character state information in ingroup taxa. This involves optimizing phylogenetic tree, model and data in an exercise of reciprocal illumination. Rooted ToLs have been built from a census of protein structural domains in proteomes using two kinds of models. Fully-reversible models use standard-ordered (additive) characters and Wagner parsimony to generate unrooted trees of proteomes that are then rooted with Weston's generality criterion. Non-reversible models directly build rooted trees with unordered characters and asymmetric stepmatrices of transformation costs that penalize gain over loss of domains. Here, we test the empirical support for the evolutionary models with character state reconstruction methods using two published proteomic datasets. We show that the reversible models match reconstructed frequencies of character change and are faithful to the distribution of serial homologies in trees. In contrast, the non-reversible models go counter to trends in the data they must explain, attracting organisms with large proteomes to the base of the rooted trees while violating the triangle inequality of distances. This can lead to serious reconstruction inconsistencies that show model inadequacy. Our study highlights the aprioristic perils of disposing of countering evidence in natural history reconstruction.
Collapse
Affiliation(s)
- Derek Caetano-Anollés
- Department of Evolutionary Genetics, Max-Planck-Institut für Evolutionsbiologie, Plön, Germany.
| | - Arshan Nasir
- Department of Biosciences, COMSATS University, Islamabad, 45550, Pakistan
| | - Kyung Mo Kim
- Division of Polar Life Sciences, Korea Polar Research Institute, Incheon, Republic of Korea
| | - Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, and Illinois Informatics Institute, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| |
Collapse
|
24
|
A qualitative criterion for identifying the root of the tree of life. J Theor Biol 2019; 464:126-131. [DOI: 10.1016/j.jtbi.2018.12.039] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Revised: 12/19/2018] [Accepted: 12/29/2018] [Indexed: 11/18/2022]
|
25
|
Metabolic models and gene essentiality data reveal essential and conserved metabolism in prokaryotes. PLoS Comput Biol 2018; 14:e1006556. [PMID: 30444863 PMCID: PMC6283598 DOI: 10.1371/journal.pcbi.1006556] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2018] [Revised: 12/06/2018] [Accepted: 10/09/2018] [Indexed: 01/13/2023] Open
Abstract
Essential metabolic reactions are shaping constituents of metabolic networks, enabling viable and distinct phenotypes across diverse life forms. Here we analyse and compare modelling predictions of essential metabolic functions with experimental data and thereby identify core metabolic pathways in prokaryotes. Simulations of 15 manually curated genome-scale metabolic models were integrated with 36 large-scale gene essentiality datasets encompassing a wide variety of species of bacteria and archaea. Conservation of metabolic genes was estimated by analysing 79 representative genomes from all the branches of the prokaryotic tree of life. We find that essentiality patterns reflect phylogenetic relations both for modelling and experimental data, which correlate highly at the pathway level. Genes that are essential for several species tend to be highly conserved as opposed to non-essential genes which may be conserved or not. The tRNA-charging module is highlighted as ancestral and with high centrality in the networks, followed closely by cofactor metabolism, pointing to an early information processing system supplied by organic cofactors. The results, which point to model improvements and also indicate faults in the experimental data, should be relevant to the study of centrality in metabolic networks and ancient metabolism but also to metabolic engineering with prokaryotes. If we tried to list every known chemical reaction within an organism–human, plant or even bacteria–we would get quite a long and confusing read. But when this information is represented in so-called genome-scale metabolic networks, we have the means to access computationally each of those reactions and their interconnections. Some parts of the network have alternatives, while others are unique and therefore can be essential for growth. Here, we simulate growth and compare essential reactions and genes for the simplest type of unicellular species–prokaryotes–to understand which parts of their metabolism are universally essential and potentially ancestral. We show that similar patterns of essential reactions echo phylogenetic relationships (this makes sense, as the genome provides the building plan for the enzymes that perform those reactions). Our computational predictions correlate strongly with experimental essentiality data. Finally, we show that a crucial step of protein synthesis (tRNA charging) and the synthesis and transformation of small molecules that enzymes require (cofactors) are the most essential and conserved parts of metabolism in prokaryotes. Our results are a step further in understanding the biology and evolution of prokaryotes but can also be relevant in applied studies including metabolic engineering and antibiotic design.
Collapse
|
26
|
Chouhan BPS, Maimaiti S, Gade M, Laurino P. Rossmann-Fold Methyltransferases: Taking a "β-Turn" around Their Cofactor, S-Adenosylmethionine. Biochemistry 2018; 58:166-170. [PMID: 30406995 DOI: 10.1021/acs.biochem.8b00994] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Methyltransferases (MTases) are superfamilies of enzymes that catalyze the transfer of a methyl group from S-adenosylmethionine (SAM), a nucleoside-based cofactor, to a wide variety of substrates such as DNA, RNA, proteins, small molecules, and lipids. Depending upon their structural features, the MTases can be further classified into different classes; we consider exclusively the largest class of MTases, the Rossmann-fold MTases. It has been shown that the nucleoside cofactor-binding Rossmann enzymes, particularly the nicotinamide adenine dinucleotide (NAD)-, flavin adenine dinucleotide (FAD)-, and SAM-binding MTases enzymes, share common binding motifs that include a Gly-rich loop region that interacts with the cofactor and a highly conserved acidic residue (Asp/Glu) that interacts with the ribose moiety of the cofactor. Here, we observe that the Gly-rich loop region of the Rossmann MTases adapts a specific type II' β-turn in the proximity of the cofactor (<4 Å), and it appears to be a key feature of these superfamilies. Additionally, we demonstrate that the conservation of this β-turn could play a critical role in the enzyme-cofactor interaction, thereby shedding new light on the structural conformation of the Gly-rich loop region from Rossmann MTases.
Collapse
Affiliation(s)
- Bhanu Pratap Singh Chouhan
- Okinawa Institute of Science and Technology Graduate University , 1919-1 Tancha, Onna-son , Okinawa 904-0412 , Japan
| | - Shayida Maimaiti
- Okinawa Institute of Science and Technology Graduate University , 1919-1 Tancha, Onna-son , Okinawa 904-0412 , Japan
| | - Madhuri Gade
- Okinawa Institute of Science and Technology Graduate University , 1919-1 Tancha, Onna-son , Okinawa 904-0412 , Japan
| | - Paola Laurino
- Okinawa Institute of Science and Technology Graduate University , 1919-1 Tancha, Onna-son , Okinawa 904-0412 , Japan
| |
Collapse
|
27
|
Caetano-Anollés G, Nasir A, Kim KM, Caetano-Anollés D. Rooting Phylogenies and the Tree of Life While Minimizing Ad Hoc and Auxiliary Assumptions. Evol Bioinform Online 2018; 14:1176934318805101. [PMID: 30364468 PMCID: PMC6196624 DOI: 10.1177/1176934318805101] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2018] [Accepted: 09/05/2018] [Indexed: 12/25/2022] Open
Abstract
Phylogenetic methods unearth evolutionary history when supported by three starting points of reason: (1) the continuity axiom begs the existence of a "model" of evolutionary change, (2) the singularity axiom defines the historical ground plan (phylogeny) in which biological entities (taxa) evolve, and (3) the memory axiom demands identification of biological attributes (characters) with historical information. Axiom consequences are interlinked, making the retrodiction enterprise an endeavor of reciprocal fulfillment. In particular, establishing direction of evolutionary change (character polarization) roots phylogenies and enables testing the existence of historical memory (homology). Unfortunately, rooting phylogenies, especially the "tree of life," generally follow narratives instead of integrating empirical and theoretical knowledge of retrodictive exploration. This stems mostly from a focus on molecular sequence analysis and uncertainties about rooting methods. Here, we review available rooting criteria, highlighting the need to minimize both ad hoc and auxiliary assumptions, especially argumentative ad hocness. We show that while the outgroup comparison method has been widely adopted, the generality criterion of nesting and additive phylogenetic change embodied in Weston rule offers the most powerful rooting approach. We also propose a change of focus, from phylogenies that describe the evolution of biological systems to those that describe the evolution of parts of those systems. This weakens violation of character independence, helps formalize the generality criterion of rooting, and provides new ways to study the problem of evolution.
Collapse
Affiliation(s)
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Arshan Nasir
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Department of Biosciences, COMSATS University Islamabad, Islamabad, Pakistan
| | - Kyung Mo Kim
- Division of Polar Life Sciences, Korea Polar Research Institute, Incheon, Republic of Korea
| | - Derek Caetano-Anollés
- Department of Evolutionary Genetics, Max-Planck-Institut für Evolutionsbiologie, Plön, Germany
| |
Collapse
|
28
|
Harish A. What is an archaeon and are the Archaea really unique? PeerJ 2018; 6:e5770. [PMID: 30357005 PMCID: PMC6196074 DOI: 10.7717/peerj.5770] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Accepted: 09/05/2018] [Indexed: 12/05/2022] Open
Abstract
The recognition of the group Archaea as a major branch of the tree of life (ToL) prompted a new view of the evolution of biodiversity. The genomic representation of archaeal biodiversity has since significantly increased. In addition, advances in phylogenetic modeling of multi-locus datasets have resolved many recalcitrant branches of the ToL. Despite the technical advances and an expanded taxonomic representation, two important aspects of the origins and evolution of the Archaea remain controversial, even as we celebrate the 40th anniversary of the monumental discovery. These issues concern (i) the uniqueness (monophyly) of the Archaea, and (ii) the evolutionary relationships of the Archaea to the Bacteria and the Eukarya; both of these are relevant to the deep structure of the ToL. To explore the causes for this persistent ambiguity, I examine multiple datasets and different phylogenetic approaches that support contradicting conclusions. I find that the uncertainty is primarily due to a scarcity of information in standard datasets-universal core-genes datasets-to reliably resolve the conflicts. These conflicts can be resolved efficiently by comparing patterns of variation in the distribution of functional genomic signatures, which are less diffused unlike patterns of primary sequence variation. Relatively lower heterogeneity in distribution patterns minimizes uncertainties and supports statistically robust phylogenetic inferences, especially of the earliest divergences of life. This case study further highlights the limitations of primary sequence data in resolving difficult phylogenetic problems, and raises questions about evolutionary inferences drawn from the analyses of sequence alignments of a small set of core genes. In particular, the findings of this study corroborate the growing consensus that reversible substitution mutations may not be optimal phylogenetic markers for resolving early divergences in the ToL, nor for determining the polarity of evolutionary transitions across the ToL.
Collapse
Affiliation(s)
- Ajith Harish
- Department of Cell and Molecular Biology, Program in Molecular Biology, Uppsala University, Uppsala, Sweden
| |
Collapse
|
29
|
Staley JT, Caetano-Anollés G. Archaea-First and the Co-Evolutionary Diversification of Domains of Life. Bioessays 2018; 40:e1800036. [PMID: 29944192 DOI: 10.1002/bies.201800036] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2018] [Revised: 05/12/2018] [Indexed: 12/13/2022]
Abstract
The origins and evolution of the Archaea, Bacteria, and Eukarya remain controversial. Phylogenomic-wide studies of molecular features that are evolutionarily conserved, such as protein structural domains, suggest Archaea is the first domain of life to diversify from a stem line of descent. This line embodies the last universal common ancestor of cellular life. Here, we propose that ancestors of Euryarchaeota co-evolved with those of Bacteria prior to the diversification of Eukarya. This co-evolutionary scenario is supported by comparative genomic and phylogenomic analyses of the distributions of fold families of domains in the proteomes of free-living organisms, which show horizontal gene recruitments and informational process homologies. It also benefits from the molecular study of cell physiologies responsible for membrane phospholipids, methanogenesis, methane oxidation, cell division, gas vesicles, and the cell wall. Our theory however challenges popular cell fusion and two-domain of life scenarios derived from sequence analysis, demanding phylogenetic reconciliation. Also see the video abstract here: https://youtu.be/9yVWn_Q9faY.
Collapse
Affiliation(s)
- James T Staley
- Department of Microbiology and Astrobiology Program, University of Washington, Seattle, WA, 98195, USA
| | - Gustavo Caetano-Anollés
- Department of Crop Sciences, C. R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| |
Collapse
|
30
|
Palacios-Pérez M, Andrade-Díaz F, José MV. A Proposal of the Ur-proteome. ORIGINS LIFE EVOL B 2018; 48:245-258. [PMID: 29127550 DOI: 10.1007/s11084-017-9553-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2017] [Accepted: 10/24/2017] [Indexed: 11/25/2022]
Abstract
Herein we outline a plausible proteome, encoded by assuming a primeval RNY genetic code. We unveil the primeval phenotype by using only the RNA genotype; it means that we recovered the most ancestral proteome, mostly made of the 8 amino acids encoded by RNY triplets. By looking at those fragments, it is noticeable that they are positioned, not at catalytic sites, but in the cofactor binding sites. It implies that the stabilization of a molecule appeared long before its catalytic activity, and therefore the Ur-proteome comprised a set of proteins modules that corresponded to Cofactor Stabilizing Binding Sites (CSBSs), which we call the primitive bindome. With our method, we reconstructed the structures of the "first protein modules" that Sobolevsky and Trifonov (2006) found by using only RMSD. We also examine the probable cofactors that bound to them. We discuss the notion of CSBSs as the first proteins modules in progenotes in the context of several proposals about the primitive forms of life.
Collapse
Affiliation(s)
- Miryam Palacios-Pérez
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, C.P. 04510, Ciudad de México CDMX, Mexico
| | - Fernando Andrade-Díaz
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, C.P. 04510, Ciudad de México CDMX, Mexico
| | - Marco V José
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, C.P. 04510, Ciudad de México CDMX, Mexico.
| |
Collapse
|
31
|
Caetano-Anollés K, Caetano-Anollés D, Nasir A, Kim KM, Caetano-Anollés G. Order and polarity in character state transformation models that root the tree of life. Biochimie 2018; 149:135-136. [DOI: 10.1016/j.biochi.2018.04.001] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2017] [Accepted: 04/02/2018] [Indexed: 10/17/2022]
|
32
|
Kauko A, Lehto K. Eukaryote specific folds: Part of the whole. Proteins 2018; 86:868-881. [PMID: 29675831 DOI: 10.1002/prot.25517] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2017] [Revised: 04/17/2018] [Accepted: 04/18/2018] [Indexed: 01/07/2023]
Abstract
The origin of eukaryotes is one of the central transitions in the history of life; without eukaryotes there would be no complex multicellular life. The most accepted scenarios suggest the endosymbiosis of a mitochondrial ancestor with a complex archaeon, even though the details regarding the host and the triggering factors are still being discussed. Accordingly, phylogenetic analyses have demonstrated archaeal affiliations with key informational systems, while metabolic genes are often related to bacteria, mostly to the mitochondrial ancestor. Despite of this, there exists a large number of protein families and folds found only in eukaryotes. In this study, we have analyzed structural superfamilies and folds that probably appeared during eukaryogenesis. These folds typically represent relatively small binding domains of larger multidomain proteins. They are commonly involved in biological processes that are particularly complex in eukaryotes, such as signaling, trafficking/cytoskeleton, ubiquitination, transcription and RNA processing, but according to recent studies, these processes also have prokaryotic roots. Thus the folds originating from an eukaryotic stem seem to represent accessory parts that have contributed in the expansion of several prokaryotic processes to a new level of complexity. This might have taken place as a co-evolutionary process where increasing complexity and fold innovations have supported each other.
Collapse
Affiliation(s)
- Anni Kauko
- Department of Biochemistry, University of Turku, Turku, Finland
| | - Kirsi Lehto
- Department of Biochemistry, University of Turku, Turku, Finland
| |
Collapse
|
33
|
Harish A, Kurland C. Reply to Caetano-Anollés et al. comment on "Empirical genome evolution models root the tree of life". Biochimie 2018; 149:137-138. [PMID: 29631014 DOI: 10.1016/j.biochi.2018.04.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2018] [Accepted: 04/02/2018] [Indexed: 10/17/2022]
Abstract
We recently analyzed the robustness of competing evolution models developed to identify the root of the Tree of Life: 1) An empirical Sankoff parsimony (ESP) model (Harish and Kurland, 2017), which is a nonstationary and directional evolution model; and 2) An a priori ancestor (APA) model (Kim and Caetano-Anollés, 2011) that is a stationary and reversible evolution model. Both Bayesian model selection tests as well as maximum parsimony analyses demonstrate that the ESP model is, overwhelmingly, the better model. Moreover, we showed that the APA model is not only sensitive to artifacts, but also that the underlying assumptions are neither empirically grounded nor biologically realistic.
Collapse
Affiliation(s)
- Ajith Harish
- Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden.
| | | |
Collapse
|
34
|
Di Giulio M. On Earth, there would be a number of fundamental kinds of primary cells – cellular domains – greater than or equal to four. J Theor Biol 2018; 443:10-17. [DOI: 10.1016/j.jtbi.2018.01.025] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2017] [Revised: 01/10/2018] [Accepted: 01/19/2018] [Indexed: 11/15/2022]
|
35
|
Rivas M, Becerra A, Lazcano A. On the Early Evolution of Catabolic Pathways: A Comparative Genomics Approach. I. The Cases of Glucose, Ribose, and the Nucleobases Catabolic Routes. J Mol Evol 2017; 86:27-46. [PMID: 29189888 DOI: 10.1007/s00239-017-9822-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Accepted: 11/26/2017] [Indexed: 11/29/2022]
Abstract
Compared with the large corpus of published work devoted to the study of the origin and early development of anabolism, little attention has been given to the discussion of the early evolution of catabolism in spite of its significance. In the present study, we have used comparative genomics to explore the evolution and phylogenetic distribution of the enzymes that catalyze the extant catabolic pathways of the monosaccharides glucose and ribose, as well as those of the nucleobases adenine, guanine, cytosine, uracil, and thymine. Based on the oxygen dependence of the enzymes, their conservation, and evolution, we speculate on the relative antiquity of the pathways. Our results allow us to suggest which catabolic pathways and enzymes may have already been present in the last common ancestor. We conclude that the enzymatic degradations of ribose, as well as those of purines adenine and guanine, are among the most ancient catabolic pathways which can be traced by protein-based methodologies.
Collapse
Affiliation(s)
- Mario Rivas
- Facultad de Ciencias, Universidad Nacional Autónoma de México, Apdo. Postal 70-407, Cd. Universitaria, 04510, Mexico City, Mexico
| | - Arturo Becerra
- Facultad de Ciencias, Universidad Nacional Autónoma de México, Apdo. Postal 70-407, Cd. Universitaria, 04510, Mexico City, Mexico
| | - Antonio Lazcano
- Facultad de Ciencias, Universidad Nacional Autónoma de México, Apdo. Postal 70-407, Cd. Universitaria, 04510, Mexico City, Mexico. .,Miembro de El Colegio Nacional, Mexico City, Mexico.
| |
Collapse
|
36
|
Mitchell JB. Enzyme function and its evolution. Curr Opin Struct Biol 2017; 47:151-156. [PMID: 29107208 DOI: 10.1016/j.sbi.2017.10.004] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2017] [Revised: 08/29/2017] [Accepted: 10/02/2017] [Indexed: 01/10/2023]
Abstract
With rapid increases over recent years in the determination of protein sequence and structure, alongside knowledge of thousands of enzyme functions and hundreds of chemical mechanisms, it is now possible to combine breadth and depth in our understanding of enzyme evolution. Phylogenetics continues to move forward, though determining correct evolutionary family trees is not trivial. Protein function prediction has spawned a variety of promising methods that offer the prospect of identifying enzymes across the whole range of chemical functions and over numerous species. This knowledge is essential to understand antibiotic resistance, as well as in protein re-engineering and de novo enzyme design.
Collapse
Affiliation(s)
- John Bo Mitchell
- EaStCHEM School of Chemistry and Biomedical Sciences Research Complex, University of St Andrews, North Haugh, St Andrews, Scotland KY16 9ST, United Kingdom
| |
Collapse
|
37
|
Harish A, Kurland CG. Empirical genome evolution models root the tree of life. Biochimie 2017; 138:137-155. [DOI: 10.1016/j.biochi.2017.04.014] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2016] [Accepted: 04/25/2017] [Indexed: 01/05/2023]
|
38
|
Nasir A, Kim KM, Caetano-Anollés G. Phylogenetic Tracings of Proteome Size Support the Gradual Accretion of Protein Structural Domains and the Early Origin of Viruses from Primordial Cells. Front Microbiol 2017; 8:1178. [PMID: 28690608 PMCID: PMC5481351 DOI: 10.3389/fmicb.2017.01178] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Accepted: 06/09/2017] [Indexed: 01/05/2023] Open
Abstract
Untangling the origin and evolution of viruses remains a challenging proposition. We recently studied the global distribution of protein domain structures in thousands of completely sequenced viral and cellular proteomes with comparative genomics, phylogenomics, and multidimensional scaling methods. A tree of life describing the evolution of proteomes revealed viruses emerging from the base of the tree as a fourth supergroup of life. A tree of domains indicated an early origin of modern viral lineages from ancient cells that co-existed with the cellular ancestors. However, it was recently argued that the rooting of our trees and the basal placement of viruses was artifactually induced by small genome (proteome) size. Here we show that these claims arise from misunderstanding and misinterpretations of cladistic methodology. Trees are reconstructed unrooted, and thus, their topologies cannot be distorted a posteriori by the rooting methodology. Tracing proteome size in trees and multidimensional views of evolutionary relationships as well as tests of leaf stability and exclusion/inclusion of taxa demonstrated that the smallest proteomes were neither attracted toward the root nor caused any topological distortions of the trees. Simulations confirmed that taxa clustering patterns were independent of proteome size and were determined by the presence of known evolutionary relatives in data matrices, highlighting the need for broader taxon sampling in phylogeny reconstruction. Instead, phylogenetic tracings of proteome size revealed a slowdown in innovation of the structural domain vocabulary and four regimes of allometric scaling that reflected a Heaps law. These regimes explained increasing economies of scale in the evolutionary growth and accretion of kernel proteome repertoires of viruses and cellular organisms that resemble growth of human languages with limited vocabulary sizes. Results reconcile dynamic and static views of domain frequency distributions that are consistent with the axiom of spatiotemporal continuity that is tenet of evolutionary thinking.
Collapse
Affiliation(s)
- Arshan Nasir
- Department of Biosciences, COMSATS Institute of Information TechnologyIslamabad, Pakistan
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-ChampaignUrbana, IL, United States
| | - Kyung Mo Kim
- Division of Polar Life Sciences, Korea Polar Research InstituteIncheon, South Korea
| | - Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-ChampaignUrbana, IL, United States
| |
Collapse
|
39
|
Catharina L, Lima CR, Franca A, Guimarães ACR, Alves-Ferreira M, Tuffery P, Derreumaux P, Carels N. A Computational Methodology to Overcome the Challenges Associated With the Search for Specific Enzyme Targets to Develop Drugs Against Leishmania major. Bioinform Biol Insights 2017. [PMID: 28638238 PMCID: PMC5470852 DOI: 10.1177/1177932217712471] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
We present an approach for detecting enzymes that are specific of Leishmania major compared with Homo sapiens and provide targets that may assist research in drug development. This approach is based on traditional techniques of sequence homology comparison by similarity search and Markov modeling; it integrates the characterization of enzymatic functionality, secondary and tertiary protein structures, protein domain architecture, and metabolic environment. From 67 enzymes represented by 42 enzymatic activities classified by AnEnPi (Analogous Enzymes Pipeline) as specific for L major compared with H sapiens, only 40 (23 Enzyme Commission [EC] numbers) could actually be considered as strictly specific of L major and 27 enzymes (19 EC numbers) were disregarded for having ambiguous homologies or analogies with H sapiens. Among the 40 strictly specific enzymes, we identified sterol 24-C-methyltransferase, pyruvate phosphate dikinase, trypanothione synthetase, and RNA-editing ligase as 4 essential enzymes for L major that may serve as targets for drug development.
Collapse
Affiliation(s)
- Larissa Catharina
- Laboratório de Modelagem de Sistemas Biológicos, Instituto Nacional de Ciência e Tecnologia de Inovação em Doenças de Populações Negligenciadas (INCT-IDPN), Centro de Desenvolvimento Tecnológico em Saúde (CDTS), Fundação Oswaldo Cruz (Fiocruz), Rio de Janeiro, Brazil
| | - Carlyle Ribeiro Lima
- Laboratoire de Biochimie Théorique, Institut de Biologie Physico-Chimique (UPR 9080), Centre National de la Recherche Scientifique (CNRS), Université Paris 7, Paris, France.,Molécules Thérapeutiques in silico (UMR-S 973), Institut National de la Santé et de la Recherche Médicale (INSERM), Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Alexander Franca
- Laboratório de Genômica Funcional e Bioinformática, Instituto Oswaldo Cruz (IOC), Fundação Oswaldo Cruz (Fiocruz), Rio de Janeiro, Brazil
| | - Ana Carolina Ramos Guimarães
- Laboratório de Genômica Funcional e Bioinformática, Instituto Oswaldo Cruz (IOC), Fundação Oswaldo Cruz (Fiocruz), Rio de Janeiro, Brazil
| | - Marcelo Alves-Ferreira
- Laboratório de Modelagem de Sistemas Biológicos, Instituto Nacional de Ciência e Tecnologia de Inovação em Doenças de Populações Negligenciadas (INCT-IDPN), Centro de Desenvolvimento Tecnológico em Saúde (CDTS), Fundação Oswaldo Cruz (Fiocruz), Rio de Janeiro, Brazil
| | - Pierre Tuffery
- Molécules Thérapeutiques in silico (UMR-S 973), Institut National de la Santé et de la Recherche Médicale (INSERM), Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Philippe Derreumaux
- Laboratoire de Biochimie Théorique, Institut de Biologie Physico-Chimique (UPR 9080), Centre National de la Recherche Scientifique (CNRS), Université Paris 7, Paris, France
| | - Nicolas Carels
- Laboratório de Modelagem de Sistemas Biológicos, Instituto Nacional de Ciência e Tecnologia de Inovação em Doenças de Populações Negligenciadas (INCT-IDPN), Centro de Desenvolvimento Tecnológico em Saúde (CDTS), Fundação Oswaldo Cruz (Fiocruz), Rio de Janeiro, Brazil
| |
Collapse
|
40
|
Staley JT. Domain Cell Theory supports the independent evolution of the Eukarya, Bacteria and Archaea and the Nuclear Compartment Commonality hypothesis. Open Biol 2017; 7:170041. [PMID: 28659382 PMCID: PMC5493775 DOI: 10.1098/rsob.170041] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2017] [Accepted: 05/26/2017] [Indexed: 01/15/2023] Open
Abstract
In 2015, the Royal Society of London held a meeting to discuss the various hypotheses regarding the origin of the Eukarya. Although not all participants supported a hypothesis, the proposals that did fit into two broad categories: one group favoured 'Prokaryotes First' hypotheses and another addressed 'Eukaryotes First' hypotheses. Those who proposed Prokaryotes First hypotheses advocated either a fusion event between a bacterium and an archaeon that produced the first eukaryote or the direct evolution of the Eukarya from the Archaea. The Eukaryotes First proponents posit that the eukaryotes evolved initially and then, by reductive evolution, produced the Bacteria and Archaea. No mention was made of another previously published hypothesis termed the Nuclear Compartment Commonality (NuCom) hypothesis, which proposed the evolution of the Eukarya and Bacteria from nucleated ancestors (Staley 2013 Astrobiol Outreach1, 105 (doi:10.4172/2332-2519.1000105)). Evidence from two studies indicates that the nucleated Planctomycetes-Verrucomicrobia-Chlamydia superphylum members are the most ancient Bacteria known (Brochier & Philippe 2002 Nature417, 244 (doi:10.1038/417244a); Jun et al. 2010 Proc. Natl Acad. Sci. USA107, 133-138 (doi:10.1073/pnas.0913033107)). This review summarizes the evidence for the NuCom hypothesis and discusses how simple the NuCom hypothesis is in explaining eukaryote evolution relative to the other hypotheses. The philosophical importance of simplicity and its relationship to truth in hypotheses such as NuCom and Domain Cell Theory is presented. Domain Cell Theory is also proposed herein, which contends that each of the three cellular lineages of life, the Archaea, Bacteria and Eukarya domains, evolved independently, in support of the NuCom hypothesis. All other proposed hypotheses violate Domain Cell Theory because they posit the evolution of different cellular descendants from ancestral cellular types.
Collapse
Affiliation(s)
- James T Staley
- Department of Microbiology and Astrobiology Program, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
41
|
Koç I, Caetano-Anollés G. The natural history of molecular functions inferred from an extensive phylogenomic analysis of gene ontology data. PLoS One 2017; 12:e0176129. [PMID: 28467492 PMCID: PMC5414959 DOI: 10.1371/journal.pone.0176129] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2016] [Accepted: 04/05/2017] [Indexed: 11/18/2022] Open
Abstract
The origin and natural history of molecular functions hold the key to the emergence of cellular organization and modern biochemistry. Here we use a genomic census of Gene Ontology (GO) terms to reconstruct phylogenies at the three highest (1, 2 and 3) and the lowest (terminal) levels of the hierarchy of molecular functions, which reflect the broadest and the most specific GO definitions, respectively. These phylogenies define evolutionary timelines of functional innovation. We analyzed 249 free-living organisms comprising the three superkingdoms of life, Archaea, Bacteria, and Eukarya. Phylogenies indicate catalytic, binding and transport functions were the oldest, suggesting a 'metabolism-first' origin scenario for biochemistry. Metabolism made use of increasingly complicated organic chemistry. Primordial features of ancient molecular functions and functional recruitments were further distilled by studying the oldest child terms of the oldest level 1 GO definitions. Network analyses showed the existence of an hourglass pattern of enzyme recruitment in the molecular functions of the directed acyclic graph of molecular functions. Older high-level molecular functions were thoroughly recruited at younger lower levels, while very young high-level functions were used throughout the timeline. This pattern repeated in every one of the three mappings, which gave a criss-cross pattern. The timelines and their mappings were remarkable. They revealed the progressive evolutionary development of functional toolkits, starting with the early rise of metabolic activities, followed chronologically by the rise of macromolecular biosynthesis, the establishment of controlled interactions with the environment and self, adaptation to oxygen, and enzyme coordinated regulation, and ending with the rise of structural and cellular complexity. This historical account holds important clues for dissection of the emergence of biomcomplexity and life.
Collapse
Affiliation(s)
- Ibrahim Koç
- Molecular Biology and Genetics, Gebze Technical University, Kocaeli, Turkey
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, United States of America
| | - Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, IL, United States of America
| |
Collapse
|
42
|
Harish A, Kurland CG. Akaryotes and Eukaryotes are independent descendants of a universal common ancestor. Biochimie 2017; 138:168-183. [PMID: 28461155 DOI: 10.1016/j.biochi.2017.04.013] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2017] [Accepted: 04/25/2017] [Indexed: 11/29/2022]
Abstract
We reconstructed a global tree of life (ToL) with non-reversible and non-stationary models of genome evolution that root trees intrinsically. We implemented Bayesian model selection tests and compared the statistical support for four conflicting ToL hypotheses. We show that reconstructions obtained with a Bayesian implementation (Klopfstein et al., 2015) are consistent with reconstructions obtained with an empirical Sankoff parsimony (ESP) implementation (Harish et al., 2013). Both are based on the genome contents of coding sequences for protein domains (superfamilies) from hundreds of genomes. Thus, we conclude that the independent descent of Eukaryotes and Akaryotes (archaea and bacteria) from the universal common ancestor (UCA) is the most probable as well as the most parsimonious hypothesis for the evolutionary origins of extant genomes. Reconstructions of ancestral proteomes by both Bayesian and ESP methods suggest that at least 70% of unique domain-superfamilies known in extant species were present in the UCA. In addition, identification of a vast majority (96%) of the mitochondrial superfamilies in the UCA proteome precludes a symbiotic hypothesis for the origin of eukaryotes. Accordingly, neither the archaeal origin of eukaryotes nor the bacterial origin of mitochondria is supported by the data. The proteomic complexity of the UCA suggests that the evolution of cellular phenotypes in the two primordial lineages, Akaryotes and Eukaryotes, was driven largely by duplication of common superfamilies as well as by loss of unique superfamilies. Finally, innovation of novel superfamilies has played a surprisingly small role in the evolution of Akaryotes and only a marginal role in the evolution of Eukaryotes.
Collapse
Affiliation(s)
- Ajith Harish
- Department of Cell and Molecular Biology, Structural and Molecular Biology Program, Uppsala University, Uppsala, Sweden.
| | - Charles G Kurland
- Department of Biology, Microbial Ecology Program, Lund University, Lund, Sweden.
| |
Collapse
|
43
|
Staley JT, Fuerst JA. Ancient, highly conserved proteins from a LUCA with complex cell biology provide evidence in support of the nuclear compartment commonality (NuCom) hypothesis. Res Microbiol 2017; 168:395-412. [PMID: 28111289 DOI: 10.1016/j.resmic.2017.01.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2016] [Revised: 01/08/2017] [Accepted: 01/09/2017] [Indexed: 12/23/2022]
Abstract
The nuclear compartment commonality (NuCom) hypothesis posits a complex last common ancestor (LUCA) with membranous compartments including a nuclear membrane. Such a LUCA then evolved to produce two nucleated lineages of the tree of life: the Planctomycetes-Verrucomicrobia-Chlamydia superphylum (PVC) within the Bacteria, and the Eukarya. We propose that a group of ancient essential protokaryotic signature proteins (PSPs) originating in LUCA were incorporated into ancestors of PVC Bacteria and Eukarya. Tubulins, ubiquitin system enzymes and sterol-synthesizing enzymes are consistent with early origins of these features shared between the PVC superphylum and Eukarya.
Collapse
Affiliation(s)
- James T Staley
- Department of Microbiology and Astrobiology Program, University of Washington, Seattle 98195, USA
| | - John A Fuerst
- School of Chemistry and Molecular Biosciences, University of Queensland, St. Lucia, Queensland 4072, Australia.
| |
Collapse
|
44
|
Arguments Reinforcing the Three-Domain View of Diversified Cellular Life. ARCHAEA-AN INTERNATIONAL MICROBIOLOGICAL JOURNAL 2016; 2016:1851865. [PMID: 28050162 PMCID: PMC5165138 DOI: 10.1155/2016/1851865] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/16/2016] [Revised: 10/18/2016] [Accepted: 11/03/2016] [Indexed: 11/18/2022]
Abstract
The archaeal ancestor scenario (AAS) for the origin of eukaryotes implies the emergence of a new kind of organism from the fusion of ancestral archaeal and bacterial cells. Equipped with this “chimeric” molecular arsenal, the resulting cell would gradually accumulate unique genes and develop the complex molecular machineries and cellular compartments that are hallmarks of modern eukaryotes. In this regard, proteins related to phagocytosis and cell movement should be present in the archaeal ancestor, thus identifying the recently described candidate archaeal phylum “Lokiarchaeota” as resembling a possible candidate ancestor of eukaryotes. Despite its appeal, AAS seems incompatible with the genomic, molecular, and biochemical differences that exist between Archaea and Eukarya. In particular, the distribution of conserved protein domain structures in the proteomes of cellular organisms and viruses appears hard to reconcile with the AAS. In addition, concerns related to taxon and character sampling, presupposing bacterial outgroups in phylogenies, and nonuniform effects of protein domain structure rearrangement and gain/loss in concatenated alignments of protein sequences cast further doubt on AAS-supporting phylogenies. Here, we evaluate AAS against the traditional “three-domain” world of cellular organisms and propose that the discovery of Lokiarchaeota could be better reconciled under the latter view, especially in light of several additional biological and technical considerations.
Collapse
|
45
|
Harish A, Abroi A, Gough J, Kurland C. Did Viruses Evolve As a Distinct Supergroup from Common Ancestors of Cells? Genome Biol Evol 2016; 8:2474-81. [PMID: 27497315 PMCID: PMC5010908 DOI: 10.1093/gbe/evw175] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
The evolutionary origins of viruses according to marker gene phylogenies, as well as their relationships to the ancestors of host cells remains unclear. In a recent article Nasir and Caetano-Anollés reported that their genome-scale phylogenetic analyses based on genomic composition of protein structural-domains identify an ancient origin of the “viral supergroup” (Nasir et al. 2015. A phylogenomic data-driven exploration of viral origins and evolution. Sci Adv. 1(8):e1500527.). It suggests that viruses and host cells evolved independently from a universal common ancestor. Examination of their data and phylogenetic methods indicates that systematic errors likely affected the results. Reanalysis of the data with additional tests shows that small-genome attraction artifacts distort their phylogenomic analyses, particularly the location of the root of the phylogenetic tree of life that is central to their conclusions. These new results indicate that their suggestion of a distinct ancestry of the viral supergroup is not well supported by the evidence.
Collapse
Affiliation(s)
- Ajith Harish
- Structural and Molecular Biology Group, Department of Cell and Molecular Biology, Biomedical Center, Uppsala University, Sweden
| | - Aare Abroi
- Estonian Biocentre, Riia 23, Tartu 51010, Estonia
| | - Julian Gough
- Computational Genomics Group, Department of Computer Science, University of Bristol, The Merchant Venturers Building, UK
| | - Charles Kurland
- Microbial Ecology, Department of Biology, Lund University, Sweden
| |
Collapse
|
46
|
Laurino P, Tóth-Petróczy Á, Meana-Pañeda R, Lin W, Truhlar DG, Tawfik DS. An Ancient Fingerprint Indicates the Common Ancestry of Rossmann-Fold Enzymes Utilizing Different Ribose-Based Cofactors. PLoS Biol 2016; 14:e1002396. [PMID: 26938925 PMCID: PMC4777477 DOI: 10.1371/journal.pbio.1002396] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2015] [Accepted: 01/29/2016] [Indexed: 01/30/2023] Open
Abstract
Nucleoside-based cofactors are presumed to have preceded proteins. The Rossmann fold is one of the most ancient and functionally diverse protein folds, and most Rossmann enzymes utilize nucleoside-based cofactors. We analyzed an omnipresent Rossmann ribose-binding interaction: a carboxylate side chain at the tip of the second β-strand (β2-Asp/Glu). We identified a canonical motif, defined by the β2-topology and unique geometry. The latter relates to the interaction being bidentate (both ribose hydroxyls interacting with the carboxylate oxygens), to the angle between the carboxylate and the ribose, and to the ribose's ring configuration. We found that this canonical motif exhibits hallmarks of divergence rather than convergence. It is uniquely found in Rossmann enzymes that use different cofactors, primarily SAM (S-adenosyl methionine), NAD (nicotinamide adenine dinucleotide), and FAD (flavin adenine dinucleotide). Ribose-carboxylate bidentate interactions in other folds are not only rare but also have a different topology and geometry. We further show that the canonical geometry is not dictated by a physical constraint--geometries found in noncanonical interactions have similar calculated bond energies. Overall, these data indicate the divergence of several major Rossmann-fold enzyme classes, with different cofactors and catalytic chemistries, from a common pre-LUCA (last universal common ancestor) ancestor that possessed the β2-Asp/Glu motif.
Collapse
Affiliation(s)
- Paola Laurino
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Ágnes Tóth-Petróczy
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Rubén Meana-Pañeda
- Department of Chemistry, Chemical Theory Center, and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Wei Lin
- Department of Chemistry, Chemical Theory Center, and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Donald G. Truhlar
- Department of Chemistry, Chemical Theory Center, and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Dan S. Tawfik
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
- * E-mail:
| |
Collapse
|
47
|
Poole AM, Lundin D, Rytkönen KT. The evolution of early cellular systems viewed through the lens of biological interactions. Front Microbiol 2015; 6:1144. [PMID: 26539175 PMCID: PMC4609892 DOI: 10.3389/fmicb.2015.01144] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2015] [Accepted: 10/05/2015] [Indexed: 11/27/2022] Open
Abstract
The minimal cell concept represents a pragmatic approach to the question of how few genes are required to run a cell. This is a helpful way to build a parts-list, and has been more successful than attempts to deduce a minimal gene set for life by inferring the gene repertoire of the last universal common ancestor, as few genes trace back to this hypothetical ancestral state. However, the study of minimal cellular systems is the study of biological outliers where, by practical necessity, coevolutionary interactions are minimized or ignored. In this paper, we consider the biological context from which minimal genomes have been removed. For instance, some of the most reduced genomes are from endosymbionts and are the result of coevolutionary interactions with a host; few such organisms are "free-living." As few, if any, biological systems exist in complete isolation, we expect that, as with modern life, early biological systems were part of an ecosystem, replete with organismal interactions. We favor refocusing discussions of the evolution of cellular systems on processes rather than gene counts. We therefore draw a distinction between a pragmatic minimal cell (an interesting engineering problem), a distributed genome (a system resulting from an evolutionary transition involving more than one cell) and the looser coevolutionary interactions that are ubiquitous in ecosystems. Finally, we consider the distributed genome and coevolutionary interactions between genomic entities in the context of early evolution.
Collapse
Affiliation(s)
- Anthony M. Poole
- Allan Wilson Centre for Molecular Ecology and Evolution, University of Canterbury, Christchurch, New Zealand
- Biomolecular Interaction Centre, University of Canterbury, Christchurch, New Zealand
- School of Biological Sciences, University of Canterbury, Christchurch, New Zealand
| | - Daniel Lundin
- Department of Biochemistry and Biophysics, Arrhenius Laboratories, Stockholm University, Stockholm, Sweden
- Centre for Ecology and Evolution in Microbial Model Systems, Linnaeus University, Kalmar, Sweden
| | - Kalle T. Rytkönen
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
- Yale Systems Biology Institute, Yale University, New Haven, CT, USA
| |
Collapse
|
48
|
Nasir A, Caetano-Anollés G. A phylogenomic data-driven exploration of viral origins and evolution. SCIENCE ADVANCES 2015; 1:e1500527. [PMID: 26601271 PMCID: PMC4643759 DOI: 10.1126/sciadv.1500527] [Citation(s) in RCA: 116] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2015] [Accepted: 06/30/2015] [Indexed: 05/05/2023]
Abstract
The origin of viruses remains mysterious because of their diverse and patchy molecular and functional makeup. Although numerous hypotheses have attempted to explain viral origins, none is backed by substantive data. We take full advantage of the wealth of available protein structural and functional data to explore the evolution of the proteomic makeup of thousands of cells and viruses. Despite the extremely reduced nature of viral proteomes, we established an ancient origin of the "viral supergroup" and the existence of widespread episodes of horizontal transfer of genetic information. Viruses harboring different replicon types and infecting distantly related hosts shared many metabolic and informational protein structural domains of ancient origin that were also widespread in cellular proteomes. Phylogenomic analysis uncovered a universal tree of life and revealed that modern viruses reduced from multiple ancient cells that harbored segmented RNA genomes and coexisted with the ancestors of modern cells. The model for the origin and evolution of viruses and cells is backed by strong genomic and structural evidence and can be reconciled with existing models of viral evolution if one considers viruses to have originated from ancient cells and not from modern counterparts.
Collapse
|
49
|
Abstract
The concept of the minimal cell has fascinated scientists for a long time, from both fundamental and applied points of view. This broad concept encompasses extreme reductions of genomes, the last universal common ancestor (LUCA), the creation of semiartificial cells, and the design of protocells and chassis cells. Here we review these different areas of research and identify common and complementary aspects of each one. We focus on systems biology, a discipline that is greatly facilitating the classical top-down and bottom-up approaches toward minimal cells. In addition, we also review the so-called middle-out approach and its contributions to the field with mathematical and computational models. Owing to the advances in genomics technologies, much of the work in this area has been centered on minimal genomes, or rather minimal gene sets, required to sustain life. Nevertheless, a fundamental expansion has been taking place in the last few years wherein the minimal gene set is viewed as a backbone of a more complex system. Complementing genomics, progress is being made in understanding the system-wide properties at the levels of the transcriptome, proteome, and metabolome. Network modeling approaches are enabling the integration of these different omics data sets toward an understanding of the complex molecular pathways connecting genotype to phenotype. We review key concepts central to the mapping and modeling of this complexity, which is at the heart of research on minimal cells. Finally, we discuss the distinction between minimizing the number of cellular components and minimizing cellular complexity, toward an improved understanding and utilization of minimal and simpler cells.
Collapse
|
50
|
Di Giulio M. The Non-Biological Meaning of the Term “Prokaryote” and Its Implications. J Mol Evol 2014; 80:98-101. [DOI: 10.1007/s00239-014-9662-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2014] [Accepted: 12/01/2014] [Indexed: 12/01/2022]
|