1
|
Di Giulio M. The polyphyletic origins of glycyl-tRNA synthetase and lysyl-tRNA synthetase and their implications. Biosystems 2024; 244:105287. [PMID: 39127441 DOI: 10.1016/j.biosystems.2024.105287] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2024] [Revised: 08/07/2024] [Accepted: 08/07/2024] [Indexed: 08/12/2024]
Abstract
I analyzed the polyphyletic origin of glycyl-tRNA synthetase (GlyRS) and lysyl-tRNA synthetase (LysRS), making plausible the following implications. The fact that the genetic code needed to evolve aminoacyl-tRNA synthetases (ARSs) only very late would be in perfect agreement with a late origin, in the main phyletic lineages, of both GlyRS and LysRS. Indeed, as suggested by the coevolution theory, since the genetic code was structured by biosynthetic relationships between amino acids and as these occurred on tRNA-like molecules which were evidently already loaded with amino acids during its structuring, this made possible a late origin of ARSs. All this corroborates the coevolution theory of the origin of the genetic code to the detriment of theories which would instead predict an early intervention of the action of ARSs in organizing the genetic code. Furthermore, the assembly of the GlyRS and LysRS protein domains in main phyletic lineages is itself at least evidence of the possibility that ancestral genes were assembled using pieces of genetic material that coded these protein domains. This is in accordance with the exon theory of genes which postulates that ancestral exons coded for protein domains or modules that were assembled to form the first genes. This theory is exemplified precisely in the evolution of both GlyRS and LysRS which occurred through the assembly of protein domains in the main phyletic lineages, as analyzed here. Furthermore, this late assembly of protein domains of these proteins into the two main phyletic lineages, i.e. a polyphyletic origin of both GlyRS and LysRS, appears to corroborate the progenote evolutionary stage for both LUCA and at least the first part of the evolutionary stages of the ancestor of bacteria and that of archaea. Indeed, this polyphyletic origin would imply that the genetic code was still evolving because at least two ARSs, i.e. proteins that make the genetic code possible today, were still evolving. This would imply that the evolutionary stages involved were characterized not by cells but by protocells, that is, by progenotes because this is precisely the definition of a progenote. This conclusion would be strengthened by the observation that both GlyRS and LysRS originating in the phyletic lineages leading to bacteria and archaea, would demonstrate that, more generally, proteins were most likely still in rapid and progressive evolution. Namely, a polyphyletic origin of proteins which would qualify at least the initial phase of the evolutionary stage of the ancestor of bacteria and that of archaea as stages belonging to the progenote.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Early Evolution of Life Department, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy.
| |
Collapse
|
2
|
Prosdocimi F, de Farias ST. Major evolutionary transitions before cells: A journey from molecules to organisms. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2024; 191:11-24. [PMID: 38971326 DOI: 10.1016/j.pbiomolbio.2024.07.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 05/25/2024] [Accepted: 07/03/2024] [Indexed: 07/08/2024]
Abstract
Basing on logical assumptions and necessary steps of complexification along biological evolution, we propose here an evolutionary path from molecules to cells presenting four ages and three major transitions. At the first age, the basic biomolecules were formed and become abundant. The first transition happened with the event of a chemical symbiosis between nucleic acids and peptides worlds, which marked the emergence of both life and the process of organic encoding. FUCA, the first living process, was composed of self-replicating RNAs linked to amino acids and capable to catalyze their binding. The second transition, from the age of FUCA to the age of progenotes, involved the duplication and recombination of proto-genomes, leading to specialization in protein production and the exploration of protein to metabolite interactions in the prebiotic soup. Enzymes and metabolic pathways were incorporated into biology from protobiotic reactions that occurred without chemical catalysts, step by step. Then, the fourth age brought origin of organisms and lineages, occurring when specific proteins capable to stackle together facilitated the formation of peptidic capsids. LUCA was constituted as a progenote capable to operate the basic metabolic functions of a cell, but still unable to interact with lipid molecules. We present evidence that the evolution of lipid interaction pathways occurred at least twice, with the development of bacterial-like and archaeal-like membranes. Also, data in literature suggest at least two paths for the emergence of DNA biosynthesis, allowing the stabilization of early life strategies in viruses, archaeas and bacterias. Two billion years later, the eukaryotes arouse, and after 1,5 billion years of evolution, they finally learn how to evolve multicellularity via tissue specialization.
Collapse
Affiliation(s)
- Francisco Prosdocimi
- Laboratório de Biologia Teórica e de Sistemas, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil.
| | - Sávio Torres de Farias
- Laboratório de Genética Evolutiva Paulo Leminski, Centro de Ciências Exatas e da Natureza, Universidade Federal da Paraíba, João Pessoa, Paraíba, Brazil; Network of Researchers on the Chemical Evolution of Life (NoRCEL), Leeds, LS7 3RB, UK
| |
Collapse
|
3
|
Di Giulio M. The absence of the evolutionary state of the Prokaryote would imply a polyphyletic origin of proteins and that LUCA, the ancestor of bacteria and that of archaea were progenotes. Biosystems 2023; 233:105014. [PMID: 37652180 DOI: 10.1016/j.biosystems.2023.105014] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2023] [Revised: 08/25/2023] [Accepted: 08/26/2023] [Indexed: 09/02/2023]
Abstract
I analysed the similarity gradient observed in protein families - of phylogenetically deep fundamental traits - of bacteria and archaea, ranging from cases such as the core of the DNA replication apparatus where there is no sequence similarity between the proteins involved, to cases in which, as in the translation initiation factors, only some proteins involved would be homologs, to cases such as for aminoacyl-tRNA synthetases in which most of the proteins involved would be homologs. This pattern of similarity between bacteria and archaea would seem to be a very clear indication of a transitional evolutionary stage that preceded both the Last Bacterial Common Ancestor and the Last Archaeal Common Ancestor, i.e. progenotic stages. Indeed, this similarity pattern would seem to exemplify an ongoing transition as all the evolutionary phases would be represented in it. Instead, in the cellular stage it is expected that these evolutionary phases should have already been overcome, i.e. completed, and therefore no longer detectable. In fact, if we had really been in the presence of the prokaryotic stage then we should not have observed this similarity pattern in proteins involved in defining the ancestral characters of bacteria and archaea, as the completion of the different cellular structures should have required a very low number of proteins to be late evolved in lineages leading to bacteria and archaea. Indeed, the already reached state of the Prokaryote would have determined complete cellular structures therefore a total absence of proteins to evolve independently in the two main phyletic lineages and able to complete the evolution of a particular character already evidently in a definitive state, which, on the other hand, does not appear to have been the case. All this would have prevented the formation of this pattern of similarity which instead would appear to be real. In conclusion, the existence of this pattern of similarity observed in the families of homologous proteins of bacteria and archaea would imply the absence of the evolutionary stage of the Prokaryote and consequently a progenotic status to be assigned to the LUCA. Indeed, the LUCA stage would have been a stage of evolutionary transition because it is belatedly marked by the presence of all the different evolutionary phases, evidently more easily interpretable within the definition of progenote than that of genote precisely because they are inherent in an evolutionary transition and not to an evolution that has already been achieved. Finally, I discuss the importance of these arguments for the polyphyletic origin of proteins.
Collapse
Affiliation(s)
- Massimo Di Giulio
- The Ionian School, Early Evolution of Life Department, Genetic Code and tRNA Origin Laboratory, Via Roma 19, 67030, Alfedena, L'Aquila, Italy.
| |
Collapse
|
4
|
Prosdocimi F, Cortines JR, José MV, Farias ST. Decoding viruses: An alternative perspective on their history, origins and role in nature. Biosystems 2023; 231:104960. [PMID: 37437771 DOI: 10.1016/j.biosystems.2023.104960] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2023] [Revised: 06/16/2023] [Accepted: 06/17/2023] [Indexed: 07/14/2023]
Abstract
This article provides an alternative perspective on viruses, exploring their origins, ecology, and evolution. Viruses are recognized as the most prevalent biological entities on Earth, permeating nearly all environments and forming the virosphere-a significant biological layer. They play a crucial role in regulating bacterial populations within ecosystems and holobionts, influencing microbial communities and nutrient recycling. Viruses are also key drivers of molecular evolution, actively participating in the maintenance and regulation of ecosystems and cellular organisms. Many eukaryotic genomes contain genomic elements with viral origins, which contribute to organismal equilibrium and fitness. Viruses are involved in the generation of species-specific orphan genes, facilitating adaptation and the development of unique traits in biological lineages. They have been implicated in the formation of vital structures like the eukaryotic nucleus and the mammalian placenta. The presence of virus-specific genes absent in cellular organisms suggests that viruses may pre-date cellular life. Like progenotes, viruses are ribonucleoprotein entities with simpler capsid architectures compared to proteolipidic membranes. This article presents a comprehensive scenario describing major transitions in prebiotic evolution and proposes that viruses emerged prior to the Last Universal Common Ancestor (LUCA) during the progenote era. However, it is important to note that viruses do not form a monophyletic clade, and many viral taxonomic groups originated more recently as reductions of cellular structures. Thus, viral architecture should be seen as an ancient and evolutionarily stable strategy adopted by biological systems. The goal of this article is to reshape perceptions of viruses, highlighting their multifaceted significance in the complex tapestry of life and fostering a deeper understanding of their origins, ecological impact, and evolutionary dynamics.
Collapse
Affiliation(s)
- Francisco Prosdocimi
- Laboratório de Biologia Teórica e de Sistemas, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil.
| | - Juliana Reis Cortines
- Departamento de Virologia, Instituto de Microbiologia Paulo de Góes, Universidade Federal do Rio de Janeiro, Brazil
| | - Marco V José
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, Ciudad Universitaria, 04510, CDMX, Mexico
| | - Sávio Torres Farias
- Laboratório de Genética Evolutiva Paulo Leminsk, Departamento de Biologia Molecular, Universidade Federal da Paraíba, João Pessoa, Paraíba, Brazil; Network of Researchers on the Chemical Evolution of Life (NoRCEL), Leeds, LS7 3RB, UK
| |
Collapse
|
5
|
Sawers RG. Perspective elucidating the physiology of a microbial cell: Neidhardt's Holy Grail. Mol Microbiol 2023; 120:54-59. [PMID: 36855806 DOI: 10.1111/mmi.15051] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2023] [Revised: 02/24/2023] [Accepted: 02/26/2023] [Indexed: 03/02/2023]
Abstract
A living microbial cell represents a system of high complexity, integration, and extreme order. All processes within that cell interconvert free energy through a multitude of interconnected metabolic reactions that help to maintain the cell in a state of low entropy, which is a characteristic of all living systems. The study of macromolecular interactions outside this cellular environment yields valuable information about the molecular function of macromolecules but represents a system in comparative disorder. Consequently, care must always be taken in interpreting the information gleaned from such studies and must be compared with how the same macromolecules function in vivo, otherwise, discrepancies can arise. The importance of combining reductionist approaches with the study of whole-cell microbial physiology is discussed regarding the long-term aim of understanding how a cell functions in its entirety. This can only be achieved by the continued development of high-resolution structural and multi-omic technologies. It is only by studying the whole cell that we can ever hope to understand how living systems function.
Collapse
Affiliation(s)
- R Gary Sawers
- Institute of Microbiology, Martin-Luther University Halle-Wittenberg, Halle (Saale), Germany
| |
Collapse
|
6
|
de Farias ST, Furtado ANM, dos Santos Junior AP, José MV. Natural History of DNA-Dependent DNA Polymerases: Multiple Pathways to the Origins of DNA. Viruses 2023; 15:v15030749. [PMID: 36992459 PMCID: PMC10052633 DOI: 10.3390/v15030749] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 03/09/2023] [Accepted: 03/12/2023] [Indexed: 03/17/2023] Open
Abstract
One of the major evolutionary transitions that led to DNA replacing RNA as the primary informational molecule in biological systems is still the subject of an intense debate in the scientific community. DNA polymerases are currently split into various families. Families A, B, and C are the most significant. In bacteria and some types of viruses, enzymes from families A and C predominate, whereas family B enzymes are more common in Archaea, Eukarya, and some types of viruses. A phylogenetic analysis of these three families of DNA polymerase was carried out. We assumed that reverse transcriptase was the ancestor of DNA polymerases. Our findings suggest that families A and C emerged and organized themselves when the earliest bacterial lineages had diverged, and that these earliest lineages had RNA genomes that were in transition—that is, the information was temporally stored in DNA molecules that were continuously being produced by reverse transcription. The origin of DNA and the apparatus for its replication in the mitochondrial ancestors may have occurred independently of DNA and the replication machinery of other bacterial lineages, according to these two alternate modes of genetic material replication. The family C enzymes emerged in a particular bacterial lineage before being passed to viral lineages, which must have functioned by disseminating this machinery to the other lineages of bacteria. Bacterial DNA viruses must have evolved at least twice independently, in addition to the requirement that DNA have arisen twice in bacterial lineages. We offer two possible scenarios based on what we know about bacterial DNA polymerases. One hypothesis contends that family A was initially produced and spread to the other lineages through viral lineages before being supplanted by the emergence of family C and acquisition at that position of the principal replicative polymerase. The evidence points to the independence of these events and suggests that the viral lineage’s acquisition of cellular replicative machinery was crucial for the establishment of a DNA genome in the other bacterial lineages, since these viral lineages may have served as a conduit for the machinery’s delivery to other bacterial lineages that diverged with the RNA genome. Our data suggest that family B initially established itself in viral lineages and was transferred to ancestral Archaea lineages before the group diversified; thus, the DNA genome must have emerged first in this cellular lineage. Our data point to multiple evolutionary steps in the origins of DNA polymerase, having started off at least twice in the bacterial lineage and once in the archaeal lineage. Given that viral lineages are implicated in a significant portion of the distribution of DNA replication equipment in both bacterial (families A and C) and Archaeal lineages (family A), our data point to a complex scenario.
Collapse
Affiliation(s)
- Sávio Torres de Farias
- Departamento de Biologia Molecular, Universidade Federal da Paraíba, João Pessoa 58051-900, Brazil
- Network of Researchers on the Chemical Evolution of Life (NoRCEL), Leeds LS7 3RB, UK
- Correspondence:
| | | | | | - Marco V. José
- Network of Researchers on the Chemical Evolution of Life (NoRCEL), Leeds LS7 3RB, UK
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, Ciudad de México C.P. 04510, Mexico
| |
Collapse
|
7
|
Abstract
Developing mathematical representations of biological systems that can allow predictions is a challenging and important research goal. It is demonstrated here how the ribosome, the nano-machine responsible for synthesizing all proteins necessary for cellular life, can be represented as a bipartite network. Ten ribosomal structures from Bacteria and six from Eukarya are explored. Ribosomal networks are found to exhibit unique properties despite variations in the nodes and edges of the different graphs. The ribosome is shown to exhibit very large topological redundancies, demonstrating mathematical resiliency. These results can potentially explain how it can function consistently despite changes in composition and connectivity. Furthermore, this representation can be used to analyze ribosome function within the large machinery of network theory, where the degrees of freedom are the possible interactions, and can be used to provide new insights for translation regulation and therapeutics.
Collapse
|
8
|
Prosdocimi F, de Farias ST. Entering the labyrinth: A hypothesis about the emergence of metabolism from protobiotic routes. Biosystems 2022; 220:104751. [DOI: 10.1016/j.biosystems.2022.104751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 07/26/2022] [Accepted: 07/31/2022] [Indexed: 11/26/2022]
|
9
|
Konjevoda P, Štambuk N. Relational model of the standard genetic code. Biosystems 2021; 210:104529. [PMID: 34464669 DOI: 10.1016/j.biosystems.2021.104529] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Revised: 08/26/2021] [Accepted: 08/27/2021] [Indexed: 11/28/2022]
Abstract
The genetic code is a set of rules that establishes mapping between triplets in messenger RNA and amino acids in proteins. The most common way to display these rules is the Standard Genetic Code (SGC) table. This paper takes an alternative approach, based on the relational data model by Edgar F. Codd (Commun. ACM, 13:377-387, 1970). The relational model (RM) proposes a distributed storage of data into a collection of tables (called relations), that can be connected by shared communality. Basic elements of the table are rows (called records or tuples), and columns (called fields or attributes). The SGC table, according to the relational data model, represents the so called unnormalized form of a table. Using normalization rules it is possible to subdivide the SGC table into four tables. The rows and columns of single tables are defined by the first and second base and individual tables by the third codon base. The result of this model is an approach to managing genetic code data, represented in terms of tuples and grouped into relations, with table structure and language consistent with first-order (predicate) logic. The RM explains that the final step in the development of the SGC was the adoption of coding function by the third base, which makes an informational/functional unit with the first base, despite the different physical location in a triplet. This enabled the synthesis of specific proteins without ambiguity, in accordance with the concept of ambiguity reduction and five phases of the general model on the origin of biological codes by Marcello Barbieri (BioSystems 181:11-19, 2019).
Collapse
Affiliation(s)
- Paško Konjevoda
- Laboratory for Epigenomics, Division of Molecular Medicine, Ruđer Bošković Institute, Bijenička cesta 54, HR-10000 Zagreb, Croatia.
| | - Nikola Štambuk
- Center for Nuclear Magnetic Resonance, Ruđer Bošković Institute, Bijenička cesta 54, HR-10000 Zagreb, Croatia.
| |
Collapse
|
10
|
Prosdocimi F, de Farias ST. Life and living beings under the perspective of organic macrocodes. Biosystems 2021; 206:104445. [PMID: 34033908 DOI: 10.1016/j.biosystems.2021.104445] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 05/17/2021] [Accepted: 05/18/2021] [Indexed: 11/16/2022]
Abstract
A powerful and concise concept of life is crucial for studies aiming to understand the characteristics that emerged from an inorganic world. Among biologists, the most accepted argument define life under a top-down strategy by looking into the shared characteristics observed in all cellular organisms. This is often made highlighting (i) autonomy and (ii) evolutionary capacity as fundamental characteristics observed in all cellular organisms. Along the present work, we assume the framework of code biology considering that biology started with the emergence of the first organic code by self-organization. We reinforces that the conceptual structure of life should be reallocated from the ontology class of Matter to its sister class of Process. Along the emergence and early evolution of biological systems, biological codes changed from open systems of "naked" molecules (at the progenote era), to close, encapsulated systems (at the organismic era). Living beings appeared at the very moment when nucleic acids with coding properties became encapsulated. This led to the origin of viruses and, then, to the origin of cells. In this context, we propose that the single character that makes a clear distinction between the abiotic and the biotic world is the capacity to process organic codes. Thus, life appears with the self-assembly of a genetic code and evolves by the emergence of other overlapping codes. Once life has been clearly conceptualized, we go further to conceptualize organisms, parents, lineages, and species in terms of code biology.
Collapse
Affiliation(s)
- Francisco Prosdocimi
- Laboratório de Biologia Teórica e de Sistemas, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil.
| | - Sávio Torres de Farias
- Laboratório de Genética Evolutiva Paulo Leminski, Centro de Ciências Exatas e da Natureza, Universidade Federal da Paraíba, João Pessoa, Paraíba, Brazil; Network of Researchers on the Chemical Evolution of Life (NoRCEL), Leeds, LS7 3RB, UK.
| |
Collapse
|
11
|
Pereira Dos Santos Junior A, José MV, Torres de Farias S. From RNA to DNA: Insights about the transition of informational molecule in the biological systems based on the structural proximity between the polymerases. Biosystems 2021; 206:104442. [PMID: 33984392 DOI: 10.1016/j.biosystems.2021.104442] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Revised: 05/05/2021] [Accepted: 05/05/2021] [Indexed: 10/21/2022]
Abstract
Structural relations in an evolutionary context of polymerases is crucial to gain insights into the transition from an RNA world to a Ribonucleoprotein world. Herein, we present a structural proximity tree for the polymerases, from which we observe that the enzymes that have RNA as substrate are more homogeneous than the group with DNA as substrate. The homogeneity observed in enzymes with RNA as a substrate, may be because they performed all steps in information processing. In this sense, the emergence of the DNA molecule posed new challenges to the biological systems, where several parts of the informational flow were individualized by the emergence of enzymes for each step. From the data presented, we propose a polymerase diversification model, in which we have RNA-dependent RNA polymerases as an ancestor and all other polymerases diverged directly from this group by a radiation process.
Collapse
Affiliation(s)
| | - Marco V José
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, Ciudad de México, C.P. 04510, Mexico.
| | - Sávio Torres de Farias
- Departamento de Biologia Molecular, Universidade Federal da Paraíba, João Pessoa, 58051-900, Brazil; Network of Researchers on the Chemical Evolution of Life (NoRCEL), Leeds, LS7 3RB, UK.
| |
Collapse
|