1
|
Caetano-Anollés K, Aziz MF, Mughal F, Caetano-Anollés G. On Protein Loops, Prior Molecular States and Common Ancestors of Life. J Mol Evol 2024:10.1007/s00239-024-10167-y. [PMID: 38652291 DOI: 10.1007/s00239-024-10167-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Accepted: 03/22/2024] [Indexed: 04/25/2024]
Abstract
The principle of continuity demands the existence of prior molecular states and common ancestors responsible for extant macromolecular structure. Here, we focus on the emergence and evolution of loop prototypes - the elemental architects of protein domain structure. Phylogenomic reconstruction spanning superkingdoms and viruses generated an evolutionary chronology of prototypes with six distinct evolutionary phases defining a most parsimonious evolutionary progression of cellular life. Each phase was marked by strategic prototype accumulation shaping the structures and functions of common ancestors. The last universal common ancestor (LUCA) of cells and viruses and the last universal cellular ancestor (LUCellA) defined stem lines that were structurally and functionally complex. The evolutionary saga highlighted transformative forces. LUCA lacked biosynthetic ribosomal machinery, while the pivotal LUCellA lacked essential DNA biosynthesis and modern transcription. Early proteins therefore relied on RNA for genetic information storage but appeared initially decoupled from it, hinting at transformative shifts of genetic processing. Urancestral loop types suggest advanced folding designs were present at an early evolutionary stage. An exploration of loop geometric properties revealed gradual replacement of prototypes with α-helix and β-strand bracing structures over time, paving the way for the dominance of other loop types. AlphFold2-generated atomic models of prototype accretion described patterns of fold emergence. Our findings favor a ‛processual' model of evolving stem lines aligned with Woese's vision of a communal world. This model prompts discussing the 'problem of ancestors' and the challenges that lie ahead for research in taxonomy, evolution and complexity.
Collapse
Affiliation(s)
- Kelsey Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences and Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
- Callout Biotech, Albuquerque, NM, 87112, USA
| | - M Fayez Aziz
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences and Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| | - Fizza Mughal
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences and Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA
| | - Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences and Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA.
| |
Collapse
|
2
|
Gardes J, Maldivi C, Boisset D, Aubourg T, Demongeot J. An Unsupervised Classifier for Whole-Genome Phylogenies, the Maxwell© Tool. Int J Mol Sci 2023; 24:16278. [PMID: 38003468 PMCID: PMC10671764 DOI: 10.3390/ijms242216278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2023] [Revised: 10/20/2023] [Accepted: 11/02/2023] [Indexed: 11/26/2023] Open
Abstract
The development of phylogenetic trees based on RNA or DNA sequences generally requires a precise and limited choice of important RNAs, e.g., messenger RNAs of essential proteins or ribosomal RNAs (like 16S), but rarely complete genomes, making it possible to explain evolution and speciation. In this article, we propose revisiting a classic phylogeny of archaea from only the information on the succession of nucleotides of their entire genome. For this purpose, we use a new tool, the unsupervised classifier Maxwell, whose principle lies in the Burrows-Wheeler compression transform, and we show its efficiency in clustering whole archaeal genomes.
Collapse
Affiliation(s)
- Joël Gardes
- Orange Labs, 38229 Meylan, France; (J.G.); (C.M.); (D.B.)
| | | | - Denis Boisset
- Orange Labs, 38229 Meylan, France; (J.G.); (C.M.); (D.B.)
| | - Timothée Aubourg
- Faculty of Medicine, Université Grenoble Alpes, AGEIS EA 7407 Tools for e-Gnosis Medical, 38700 La Tronche, France;
| | - Jacques Demongeot
- Faculty of Medicine, Université Grenoble Alpes, AGEIS EA 7407 Tools for e-Gnosis Medical, 38700 La Tronche, France;
| |
Collapse
|
3
|
Aziz MF, Mughal F, Caetano-Anollés G. Tracing the birth of structural domains from loops during protein evolution. Sci Rep 2023; 13:14688. [PMID: 37673948 PMCID: PMC10482863 DOI: 10.1038/s41598-023-41556-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2022] [Accepted: 08/28/2023] [Indexed: 09/08/2023] Open
Abstract
The structures and functions of proteins are embedded into the loop scaffolds of structural domains. Their origin and evolution remain mysterious. Here, we use a novel graph-theoretical approach to describe how modular and non-modular loop prototypes combine to form folded structures in protein domain evolution. Phylogenomic data-driven chronologies reoriented a bipartite network of loops and domains (and its projections) into 'waterfalls' depicting an evolving 'elementary functionome' (EF). Two primordial waves of functional innovation involving founder 'p-loop' and 'winged-helix' domains were accompanied by an ongoing emergence and reuse of structural and functional novelty. Metabolic pathways expanded before translation functionalities. A dual hourglass recruitment pattern transferred scale-free properties from loop to domain components of the EF network in generative cycles of hierarchical modularity. Modeling the evolutionary emergence of the oldest P-loop and winged-helix domains with AlphFold2 uncovered rapid convergence towards folded structure, suggesting that a folding vocabulary exists in loops for protein fold repurposing and design.
Collapse
Affiliation(s)
- M Fayez Aziz
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL, 61801, USA
| | - Fizza Mughal
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL, 61801, USA
| | - Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL, 61801, USA.
- C.R. Woese Institute for Genomic Biology, University of Illinois, Urbana, IL, 61801, USA.
| |
Collapse
|
4
|
Caetano-Anollés G. Agency in evolution of biomolecular communication. Ann N Y Acad Sci 2023; 1525:88-103. [PMID: 37219369 DOI: 10.1111/nyas.15005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Biomolecular communication demands that interactions between parts of a molecular system act as scaffolds for message transmission. It also requires an organized system of signs-a communicative agency-for creating and transmitting meaning. The emergence of agency, the capacity to act in a given context and generate end-directed behaviors, has baffled evolutionary biologists for centuries. Here, I explore its emergence with knowledge grounded in over two decades of evolutionary genomic and bioinformatic exploration. Biphasic processes of growth and diversification exist that generate hierarchy and modularity in biological systems at widely ranging time scales. Similarly, a biphasic process exists in communication that constructs a message before it can be transmitted for interpretation. Transmission dissipates matter-energy and information and involves computation. Agency emerges when molecular machinery generates hierarchical layers of vocabularies in an entangled communication network clustered around the universal Turing machine of the ribosome. Computations canalize biological systems to perform biological functions in a dissipative quest to structure long-lived occurrents. This occurs within the confines of a "triangle of persistence" that maximizes invariance with trade-offs between economy, flexibility, and robustness. Thus, learning from previous historical and circumstantial experiences unifies modules in a hierarchy that expands the agency of systems.
Collapse
Affiliation(s)
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences and C. R. Woese Institute for Genomic Biology, University of Illinois, Urbana, Illinois, USA
| |
Collapse
|
5
|
Villarreal L, Witzany G. Self-empowerment of life through RNA networks, cells and viruses. F1000Res 2023; 12:138. [PMID: 36785664 PMCID: PMC9918806 DOI: 10.12688/f1000research.130300.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/20/2023] [Indexed: 01/05/2024] Open
Abstract
Our understanding of the key players in evolution and of the development of all organisms in all domains of life has been aided by current knowledge about RNA stem-loop groups, their proposed interaction motifs in an early RNA world and their regulative roles in all steps and substeps of nearly all cellular processes, such as replication, transcription, translation, repair, immunity and epigenetic marking. Cooperative evolution was enabled by promiscuous interactions between single-stranded regions in the loops of naturally forming stem-loop structures in RNAs. It was also shown that cooperative RNA stem-loops outcompete selfish ones and provide foundational self-constructive groups (ribosome, editosome, spliceosome, etc.). Self-empowerment from abiotic matter to biological behavior does not just occur at the beginning of biological evolution; it is also essential for all levels of socially interacting RNAs, cells and viruses.
Collapse
Affiliation(s)
- Luis Villarreal
- Center for Virus Research, University of California, Irvine, California, USA
| | - Guenther Witzany
- Telos - Philosophische Praxis, Buermoos, Salzburg, 5111, Austria
| |
Collapse
|
6
|
Abstract
Our understanding of the key players in evolution and of the development of all organisms in all domains of life has been aided by current knowledge about RNA stem-loop groups, their proposed interaction motifs in an early RNA world and their regulative roles in all steps and substeps of nearly all cellular processes, such as replication, transcription, translation, repair, immunity and epigenetic marking. Cooperative evolution was enabled by promiscuous interactions between single-stranded regions in the loops of naturally forming stem-loop structures in RNAs. It was also shown that cooperative RNA stem-loops outcompete selfish ones and provide foundational self-constructive groups (ribosome, editosome, spliceosome, etc.). Self-empowerment from abiotic matter to biological behavior does not just occur at the beginning of biological evolution; it is also essential for all levels of socially interacting RNAs, cells and viruses.
Collapse
Affiliation(s)
- Luis Villarreal
- Center for Virus Research, University of California, Irvine, California, USA
| | - Guenther Witzany
- Telos - Philosophische Praxis, Buermoos, Salzburg, 5111, Austria
| |
Collapse
|
7
|
Demongeot J, Thellier M. Primitive Oligomeric RNAs at the Origins of Life on Earth. Int J Mol Sci 2023; 24:ijms24032274. [PMID: 36768599 PMCID: PMC9916791 DOI: 10.3390/ijms24032274] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2022] [Revised: 01/18/2023] [Accepted: 01/21/2023] [Indexed: 01/26/2023] Open
Abstract
There are several theories on the origin of life, which differ by choosing the preponderant factor of emergence: main function (autocatalysis versus replication), initial location (black smokers versus ponds) or first molecule (RNA versus DNA). Among the two last ones, the first assumes that an RNA world involving a collaboration of small RNAs with amino-acids pre-existed and the second that DNA-enzyme-lipid complexes existed first. The debate between these classic theories is not closed and the arguments for one or the other of these theories have recently fueled a debate in which the two have a high degree of likelihood. It therefore seems interesting to propose a third intermediate way, based on the existence of an RNA that may have existed before the latter stages postulated by these theories, and therefore may be the missing link towards a common origin of them. To search for a possible ancestral structure, we propose as candidate a small RNA existing in ring or hairpin form in the early stages of life, which could have acted as a "proto-ribosome" by favoring the synthesis of the first peptides. Remnants of this putative candidate RNA exist in molecules nowadays involved in the ribosomal factory, the concentrations of these relics depending on the seniority of these molecules within the translation process.
Collapse
Affiliation(s)
- Jacques Demongeot
- Faculty of Medicine, Université Grenoble Alpes, Laboratory AGEIS EA 7407 Tools for e-Gnosis Medical, 38700 Grenoble, France
- Correspondence:
| | - Michel Thellier
- Académie des Sciences, Section Biologie Integrative, 75006 Paris, France
| |
Collapse
|
8
|
Modeling the ribosome as a bipartite graph. PLoS One 2022; 17:e0279455. [PMID: 36584020 PMCID: PMC9803165 DOI: 10.1371/journal.pone.0279455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Accepted: 12/08/2022] [Indexed: 12/31/2022] Open
Abstract
Developing mathematical representations of biological systems that can allow predictions is a challenging and important research goal. It is demonstrated here how the ribosome, the nano-machine responsible for synthesizing all proteins necessary for cellular life, can be represented as a bipartite network. Ten ribosomal structures from Bacteria and six from Eukarya are explored. Ribosomal networks are found to exhibit unique properties despite variations in the nodes and edges of the different graphs. The ribosome is shown to exhibit very large topological redundancies, demonstrating mathematical resiliency. These results can potentially explain how it can function consistently despite changes in composition and connectivity. Furthermore, this representation can be used to analyze ribosome function within the large machinery of network theory, where the degrees of freedom are the possible interactions, and can be used to provide new insights for translation regulation and therapeutics.
Collapse
|
9
|
A Short Tale of the Origin of Proteins and Ribosome Evolution. Microorganisms 2022; 10:microorganisms10112115. [DOI: 10.3390/microorganisms10112115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2022] [Revised: 09/30/2022] [Accepted: 10/19/2022] [Indexed: 11/16/2022] Open
Abstract
Proteins are the workhorses of the cell and have been key players throughout the evolution of all organisms, from the origin of life to the present era. How might life have originated from the prebiotic chemistry of early Earth? This is one of the most intriguing unsolved questions in biology. Currently, however, it is generally accepted that amino acids, the building blocks of proteins, were abiotically available on primitive Earth, which would have made the formation of early peptides in a similar fashion possible. Peptides are likely to have coevolved with ancestral forms of RNA. The ribosome is the most evident product of this coevolution process, a sophisticated nanomachine that performs the synthesis of proteins codified in genomes. In this general review, we explore the evolution of proteins from their peptide origins to their folding and regulation based on the example of superoxide dismutase (SOD1), a key enzyme in oxygen metabolism on modern Earth.
Collapse
|
10
|
Demongeot J, Seligmann H. Evolution of small and large ribosomal RNAs from accretion of tRNA subelements. Biosystems 2022; 222:104796. [DOI: 10.1016/j.biosystems.2022.104796] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Accepted: 10/19/2022] [Indexed: 11/02/2022]
|
11
|
Hassler HB, Probert B, Moore C, Lawson E, Jackson RW, Russell BT, Richards VP. Phylogenies of the 16S rRNA gene and its hypervariable regions lack concordance with core genome phylogenies. MICROBIOME 2022; 10:104. [PMID: 35799218 PMCID: PMC9264627 DOI: 10.1186/s40168-022-01295-y] [Citation(s) in RCA: 44] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Accepted: 05/23/2022] [Indexed: 05/02/2023]
Abstract
BACKGROUND The 16S rRNA gene is used extensively in bacterial phylogenetics, in species delineation, and now widely in microbiome studies. However, the gene suffers from intragenomic heterogeneity, and reports of recombination and an unreliable phylogenetic signal are accumulating. Here, we compare core gene phylogenies to phylogenies constructed using core gene concatenations to estimate the strength of signal for the 16S rRNA gene, its hypervariable regions, and all core genes at the intra- and inter-genus levels. Specifically, we perform four intra-genus analyses (Clostridium, n = 65; Legionella, n = 47; Staphylococcus, n = 36; and Campylobacter, n = 17) and one inter-genus analysis [41 core genera of the human gut microbiome (31 families, 17 orders, and 12 classes), n = 82]. RESULTS At both taxonomic levels, the 16S rRNA gene was recombinant and subject to horizontal gene transfer. At the intra-genus level, the gene showed one of the lowest levels of concordance with the core genome phylogeny (50.7% average). Concordance for hypervariable regions was lower still, with entropy masking providing little to no benefit. A major factor influencing concordance was SNP count, which showed a positive logarithmic association. Using this relationship, we determined that 690 ± 110 SNPs were required for 80% concordance (average 16S rRNA gene SNP count was 254). We also found a wide range in 16S-23S-5S rRNA operon copy number among genomes (1-27). At the inter-genus level, concordance for the whole 16S rRNA gene was markedly higher (73.8% - 10th out of 49 loci); however, the most concordant hypervariable regions (V4, V3-V4, and V1-V2) ranked in the third quartile (62.5 to 60.0%). CONCLUSIONS Ramifications of a poor phylogenetic performance for the 16S rRNA gene are far reaching. For example, in addition to incorrect species/strain delineation and phylogenetic inference, it has the potential to confound community diversity metrics if phylogenetic information is incorporated - for example, with popular approaches such as Faith's phylogenetic diversity and UniFrac. Our results highlight the problematic nature of these approaches and their use (along with entropy masking) is discouraged. Lastly, the wide range in 16S rRNA gene copy number among genomes also has a strong potential to confound diversity metrics. Video Abstract.
Collapse
Affiliation(s)
- Hayley B. Hassler
- Department of Biological Sciences, College of Science, Clemson University, Clemson, SC 29634 USA
| | - Brett Probert
- Department of Biological Sciences, College of Science, Clemson University, Clemson, SC 29634 USA
| | - Carson Moore
- Department of Biological Sciences, College of Science, Clemson University, Clemson, SC 29634 USA
| | - Elizabeth Lawson
- Department of Biological Sciences, College of Science, Clemson University, Clemson, SC 29634 USA
| | | | - Brook T. Russell
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, SC 29634 USA
| | - Vincent P. Richards
- Department of Biological Sciences, College of Science, Clemson University, Clemson, SC 29634 USA
| |
Collapse
|
12
|
Carr CE. Resolving the History of Life on Earth by Seeking Life As We Know It on Mars. ASTROBIOLOGY 2022; 22:880-888. [PMID: 35467949 PMCID: PMC9298492 DOI: 10.1089/ast.2021.0043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2023]
Abstract
An origin of Earth life on Mars would resolve significant inconsistencies between the inferred history of life and Earth's geologic history. Life as we know it utilizes amino acids, nucleic acids, and lipids for the metabolic, informational, and compartment-forming subsystems of a cell. Such building blocks may have formed simultaneously from cyanosulfidic chemical precursors in a planetary surface scenario involving ultraviolet light, wet-dry cycling, and volcanism. On the inferred water world of early Earth, such an origin would have been limited to volcanic island hotspots. A cyanosulfidic origin of life could have taken place on Mars via photoredox chemistry, facilitated by orders-of-magnitude more sub-aerial crust than early Earth, and an earlier transition to oxidative conditions that could have been involved in final fixation of the genetic code. Meteoritic bombardment may have generated transient habitable environments and ejected and transferred life to Earth. Ongoing and future missions to Mars offer an unprecedented opportunity to confirm or refute evidence consistent with a cyanosulfidic origin of life on Mars, search for evidence of ancient life, and constrain the evolution of Mars' oxidation state over time. We should seek to prove or refute a martian origin for life on Earth alongside other possibilities.
Collapse
Affiliation(s)
- Christopher E. Carr
- Daniel Guggenheim School of Aerospace Engineering, Georgia Institute of Technology, Atlanta, Georgia, USA
- School of Earth and Atmospheric Sciences, Georgia Institute of Technology, Atlanta, Georgia, USA
- Address correspondence to: Christopher E. Carr, ESM Building, Room G10, 620 Cherry St NW, Atlanta, GA 30332, USA
| |
Collapse
|
13
|
The Coevolution of Biomolecules and Prebiotic Information Systems in the Origin of Life: A Visualization Model for Assembling the First Gene. Life (Basel) 2022; 12:life12060834. [PMID: 35743865 PMCID: PMC9225589 DOI: 10.3390/life12060834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Revised: 05/23/2022] [Accepted: 06/01/2022] [Indexed: 11/24/2022] Open
Abstract
Prebiotic information systems exist in three forms: analog, hybrid, and digital. The Analog Information System (AIS), manifested early in abiogenesis, was expressed in the chiral selection, nucleotide formation, self-assembly, polymerization, encapsulation of polymers, and division of protocells. It created noncoding RNAs by polymerizing nucleotides that gave rise to the Hybrid Information System (HIS). The HIS employed different species of noncoding RNAs, such as ribozymes, pre-tRNA and tRNA, ribosomes, and functional enzymes, including bridge peptides, pre-aaRS, and aaRS (aminoacyl-tRNA synthetase). Some of these hybrid components build the translation machinery step-by-step. The HIS ushered in the Digital Information System (DIS), where tRNA molecules become molecular architects for designing mRNAs step-by-step, employing their two distinct genetic codes. First, they created codons of mRNA by the base pair interaction (anticodon–codon mapping). Secondly, each charged tRNA transferred its amino acid information to the corresponding codon (codon–amino acid mapping), facilitated by an aaRS enzyme. With the advent of encoded mRNA molecules, the first genes emerged before DNA. With the genetic memory residing in the digital sequences of mRNA, a mapping mechanism was developed between each codon and its cognate amino acid. As more and more codons ‘remembered’ their respective amino acids, this mapping system developed the genetic code in their memory bank. We compared three kinds of biological information systems with similar types of human-made computer systems.
Collapse
|
14
|
Ye S, Lehmann J. Genetic code degeneracy is established by the decoding center of the ribosome. Nucleic Acids Res 2022; 50:4113-4126. [PMID: 35325219 PMCID: PMC9023292 DOI: 10.1093/nar/gkac171] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Revised: 02/10/2022] [Accepted: 03/23/2022] [Indexed: 11/21/2022] Open
Abstract
The degeneracy of the genetic code confers a wide array of properties to coding sequences. Yet, its origin is still unclear. A structural analysis has shown that the stability of the Watson–Crick base pair at the second position of the anticodon–codon interaction is a critical parameter controlling the extent of non-specific pairings accepted at the third position by the ribosome, a flexibility at the root of degeneracy. Based on recent cryo-EM analyses, the present work shows that residue A1493 of the decoding center provides a significant contribution to the stability of this base pair, revealing that the ribosome is directly involved in the establishment of degeneracy. Building on existing evolutionary models, we show the evidence that the early appearance of A1493 and A1492 established the basis of degeneracy when an elementary kinetic scheme of translation was prevailing. Logical considerations on the expansion of this kinetic scheme indicate that the acquisition of the peptidyl transferase center was the next major evolutionary step, while the induced-fit mechanism, that enables a sharp selection of the tRNAs, necessarily arose later when G530 was acquired by the decoding center.
Collapse
Affiliation(s)
- Shixin Ye
- INSERM U1195 unit, University of Paris-Saclay, 94276 Le Kremlin Bicêtre, France
| | - Jean Lehmann
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, University of Paris-Saclay, 91198 Gif-sur-Yvette, France
| |
Collapse
|
15
|
Proteomic Analysis of the Antibacterial Effect of Improved Dian Dao San against Propionibacterium acnes. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE 2022; 2022:3855702. [PMID: 35186097 PMCID: PMC8849895 DOI: 10.1155/2022/3855702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/03/2021] [Revised: 01/09/2022] [Accepted: 01/13/2022] [Indexed: 12/01/2022]
Abstract
Propionibacterium acnes (P. acnes) is a major pathogen of acne vulgaris. The traditional Chinese medicine (TCM) compound prescription, Dian Dao San (DDS), is effective for treating P. acnes. Previous clinical work by our team demonstrated that improved Dian Dao San (IDDS) has better antibacterial effects. However, the mechanism of IDDS inhibition of P. acnes is still unknown. Hence, the isobaric tags for relative and absolute quantitation (iTRAQ) technology was applied to explore the antibacterial mechanism of IDDS against P. acnes. Our results suggested that the antibacterial mechanism of IDDS was related to the glycolytic pathway. gap, pgk, and tpiA enzymes were found to be potential target proteins in the bacterial glycolytic pathway as an antibacterial mechanism of inhibition. In addition, SEM and TEM analyses revealed that IDDS may destruct bacterial plasma membrane and cell wall. The results provide a reliable, direct, and scientific theoretical basis for wide application of IDDS.
Collapse
|
16
|
Farias STD, Prosdocimi F. RNP-world: The ultimate essence of life is a ribonucleoprotein process. Genet Mol Biol 2022; 45:e20220127. [PMID: 36190700 PMCID: PMC9528728 DOI: 10.1590/1678-4685-gmb-2022-0127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Accepted: 06/03/2022] [Indexed: 11/22/2022] Open
Abstract
The fundamental essence of life is based on process of interaction between nucleic acids and proteins. In a prebiotic world, amino acids, peptides, ions, and other metabolites acted in protobiotic routes at the same time on which RNAs performed catalysis and self-replication. Nevertheless, it was only when nucleic acids and peptides started to interact together in an organized process that life emerged. First, the ignition was sparked with the formation of a Peptidyl Transferase Center (PTC), possibly by concatenation of proto-tRNAs. This molecule that would become the catalytic site of ribosomes started a process of self-organization that gave origin to a protoorganism named FUCA, a ribonucleic ribosomal-like apparatus capable to polymerize amino acids. In that sense, we review hypotheses about the origin and early evolution of the genetic code. Next, populations of open biological systems named progenotes were capable of accumulating and exchanging genetic material, producing the first genomes. Progenotes then evolved in two paths: some presented their own ribosomes and others used available ribosomes in the medium to translate their encoded information. At some point, two different types of organisms emerged from populations of progenotes: the ribosome-encoding organisms (cells) and the capsid-encoding organisms (viruses).
Collapse
Affiliation(s)
- Sávio Torres de Farias
- Universidade Federal da Paraíba, Brazil; Network of Researchers on the Chemical Evolution of Life, UK
| | | |
Collapse
|
17
|
Mahato C, Menon S, Singh A, Afrose SP, Mondal J, Das D. Short Peptide-based Cross-β Amyloids Exploit Dual Residues for Phosphoesterase like Activity. Chem Sci 2022; 13:9225-9231. [PMID: 36092997 PMCID: PMC9384705 DOI: 10.1039/d2sc03205h] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Accepted: 07/17/2022] [Indexed: 11/21/2022] Open
Abstract
Herein, we report that short peptides are capable of exploiting their anti-parallel registry to access cross-β stacks to expose more than one catalytic residue, exhibiting the traits of advanced binding pockets of enzymes. Binding pockets decorated with more than one catalytic residue facilitate substrate binding and process kinetically unfavourable chemical transformations. The solvent-exposed guanidinium and imidazole moieties on the cross-β microphases synergistically bind to polarise and hydrolyse diverse kinetically stable model substrates of nucleases and phosphatase. Mutation of either histidine or arginine results in a drastic decline in the rate of hydrolysis. These results not only support the argument of short amyloid peptides as the earliest protein folds but also suggest their interactions with nucleic acid congeners, foreshadowing the mutualistic biopolymer relationships that fueled the chemical emergence of life. Amyloid based short peptide assemblies use antiparallel registry to expose multiple catalytic residues to bind and cleave kinetically stable phosphoester bonds of nucleic acid congeners, foreshadowing interactions of protein folds with nucleic acids.![]()
Collapse
Affiliation(s)
- Chiranjit Mahato
- Department of Chemical Sciences & Centre for Advanced Functional Materials, Indian Institute of Science Education and Research (IISER) Kolkata Mohanpur West Bengal 741246 India
| | - Sneha Menon
- Tata Institute of Fundamental Research Hyderabad Telangana 500046 India
| | - Abhishek Singh
- Department of Chemical Sciences & Centre for Advanced Functional Materials, Indian Institute of Science Education and Research (IISER) Kolkata Mohanpur West Bengal 741246 India
| | - Syed Pavel Afrose
- Department of Chemical Sciences & Centre for Advanced Functional Materials, Indian Institute of Science Education and Research (IISER) Kolkata Mohanpur West Bengal 741246 India
| | - Jagannath Mondal
- Tata Institute of Fundamental Research Hyderabad Telangana 500046 India
| | - Dibyendu Das
- Department of Chemical Sciences & Centre for Advanced Functional Materials, Indian Institute of Science Education and Research (IISER) Kolkata Mohanpur West Bengal 741246 India
| |
Collapse
|
18
|
Caetano-Anollés G, Aziz MF, Mughal F, Caetano-Anollés D. Tracing protein and proteome history with chronologies and networks: folding recapitulates evolution. Expert Rev Proteomics 2021; 18:863-880. [PMID: 34628994 DOI: 10.1080/14789450.2021.1992277] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
INTRODUCTION While the origin and evolution of proteins remain mysterious, advances in evolutionary genomics and systems biology are facilitating the historical exploration of the structure, function and organization of proteins and proteomes. Molecular chronologies are series of time events describing the history of biological systems and subsystems and the rise of biological innovations. Together with time-varying networks, these chronologies provide a window into the past. AREAS COVERED Here, we review molecular chronologies and networks built with modern methods of phylogeny reconstruction. We discuss how chronologies of structural domain families uncover the explosive emergence of metabolism, the late rise of translation, the co-evolution of ribosomal proteins and rRNA, and the late development of the ribosomal exit tunnel; events that coincided with a tendency to shorten folding time. Evolving networks described the early emergence of domains and a late 'big bang' of domain combinations. EXPERT OPINION Two processes, folding and recruitment appear central to the evolutionary progression. The former increases protein persistence. The later fosters diversity. Chronologically, protein evolution mirrors folding by combining supersecondary structures into domains, developing translation machinery to facilitate folding speed and stability, and enhancing structural complexity by establishing long-distance interactions in novel structural and architectural designs.
Collapse
Affiliation(s)
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, USA.,C. R. Woese Institute for Genomic Biology, University of Illinois, Urbana, Illinois, USA
| | - M Fayez Aziz
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, USA
| | - Fizza Mughal
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, Illinois, USA
| | - Derek Caetano-Anollés
- Data Science Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| |
Collapse
|
19
|
Sun F, Caetano-Anollés G. Menzerath-Altmann's Law of Syntax in RNA Accretion History. Life (Basel) 2021; 11:489. [PMID: 34071925 PMCID: PMC8228408 DOI: 10.3390/life11060489] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Revised: 05/25/2021] [Accepted: 05/26/2021] [Indexed: 01/13/2023] Open
Abstract
RNA evolves by adding substructural parts to growing molecules. Molecular accretion history can be dissected with phylogenetic methods that exploit structural and functional evidence. Here, we explore the statistical behaviors of lengths of double-stranded and single-stranded segments of growing tRNA, 5S rRNA, RNase P RNA, and rRNA molecules. The reconstruction of character state changes along branches of phylogenetic trees of molecules and trees of substructures revealed strong pushes towards an economy of scale. In addition, statistically significant negative correlations and strong associations between the average lengths of helical double-stranded stems and their time of origin (age) were identified with the Pearson's correlation and Spearman's rho methods. The ages of substructures were derived directly from published rooted trees of substructures. A similar negative correlation was detected in unpaired segments of rRNA but not for the other molecules studied. These results suggest a principle of diminishing returns in RNA accretion history. We show this principle follows a tendency of substructural parts to decrease their size when molecular systems enlarge that follows the Menzerath-Altmann's law of language in full generality and without interference from the details of molecular growth.
Collapse
Affiliation(s)
- Fengjie Sun
- School of Science and Technology, Georgia Gwinnett College, Lawrenceville, GA 30043, USA;
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL 61801, USA
| | - Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois, Urbana, IL 61801, USA
| |
Collapse
|
20
|
Chu XY, Zhang HY. Protein Homochirality May Be Derived from Primitive Peptide Synthesis by RNA. ASTROBIOLOGY 2021; 21:628-635. [PMID: 33600215 DOI: 10.1089/ast.2020.2324] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Homochirality is a feature of life, but its origin is still disputed. Recent theories indicate that the origin of homochirality coincided with that of the RNA world, but proteins have not yet been incorporated into the story. Ribosome is considered a living fossil that survived the RNA world and records the oldest interaction between RNA and proteins. Inspired by several ribosome-related findings, we propose a hypothesis as follows: the substrate chirality preference of some primitive peptide synthesis ribozymes can mediate the chirality transmission from RNA to protein. In return, the chiral preference of protective peptide-RNA interaction can bring these ribozymes an evolutionary advantage and facilitate the expansion of enantiomeric excess in peptides. Monte Carlo simulation results show that this system's chemistry model is plausible. This model can be further tested through investigation of the chirality preference for the interactions between d/l-ribose-composed rRNA homologs and l/d-amino acid-composed peptides.
Collapse
Affiliation(s)
- Xin-Yi Chu
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, P. R. China
| | - Hong-Yu Zhang
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, Wuhan, P. R. China
| |
Collapse
|
21
|
Villarreal LP, Witzany G. Social Networking of Quasi-Species Consortia drive Virolution via Persistence. AIMS Microbiol 2021; 7:138-162. [PMID: 34250372 PMCID: PMC8255905 DOI: 10.3934/microbiol.2021010] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Accepted: 04/25/2021] [Indexed: 12/31/2022] Open
Abstract
The emergence of cooperative quasi-species consortia (QS-C) thinking from the more accepted quasispecies equations of Manfred Eigen, provides a conceptual foundation from which concerted action of RNA agents can now be understood. As group membership becomes a basic criteria for the emergence of living systems, we also start to understand why the history and context of social RNA networks become crucial for survival and function. History and context of social RNA networks also lead to the emergence of a natural genetic code. Indeed, this QS-C thinking can also provide us with a transition point between the chemical world of RNA replicators and the living world of RNA agents that actively differentiate self from non-self and generate group identity with membership roles. Importantly the social force of a consortia to solve complex, multilevel problems also depend on using opposing and minority functions. The consortial action of social networks of RNA stem-loops subsequently lead to the evolution of cellular organisms representing a tree of life.
Collapse
|
22
|
de Farias ST, Rêgo TG, José MV. Origin of the 16S Ribosomal Molecule from Ancestor tRNAs. J Mol Evol 2021; 89:249-256. [PMID: 33760964 DOI: 10.1007/s00239-021-10002-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Accepted: 03/06/2021] [Indexed: 12/15/2022]
Abstract
We tested the hypothesis that concatemers of ancestral tRNAs gave rise to the 16S ribosomal RNA. We built an ancestral sequence of proto-tRNAs that showed a significant identity of 51.69% and a percentage of structural identity of 0.941 with the 3' upper domain of 16S ribosomal molecule. We also propose a hypothesis in which the small ribosomal subunit emerged by proto-tRNA fusion and worked as a point to bind RNAs in an open structure configuration. In this context, the two ribosomal subunits initially worked independently, and that the subunit junction, with consequent primitive ribosome formation, was mediated by interactions with tRNA molecules during the primordial genetic code formation.
Collapse
Affiliation(s)
- Savio Torres de Farias
- Laboratório de Genética Evolutiva Paulo Leminsk, Departamento de Biologia Molecular, Universidade Federal da Paraíba, João Pessoa, 58051-900, Brazil. .,Network of Researchers on the Chemical Evolution of Life (NoRCEL), Leeds, LS7 3RB, UK.
| | - Thais Gaudêncio Rêgo
- Departamento de Informática, Universidade Federal da Paraíba, João Pessoa, 58051-900, Brazil
| | - Marco V José
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, C.P. 04510, Mexico, D.F., Mexico
| |
Collapse
|
23
|
Is it possible that cells have had more than one origin? Biosystems 2021; 202:104371. [PMID: 33524470 DOI: 10.1016/j.biosystems.2021.104371] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 01/22/2021] [Accepted: 01/22/2021] [Indexed: 01/03/2023]
Abstract
Cells occupy a prominent place in the history of life in Earth. The central role of cellular organization can be understood by the fact that "cellular life" is often used as a synonym for life itself. Thus, most characteristics used to define cell overlap with those ones used to define life. However, innovative scenarios for the origin of life are bringing alternative views to describe how cells may have evolved from the open biological systems named progenotes. Here, using a logical and conceptual analysis, we re-evaluate the characteristics used to infer a single origin for cells. We argue that some evidences used to support cell monophyly, such as the presence of elements from the translation mechanism together with the universality of the genetic code, actually indicate a unique origin for all "biological systems", a term used to define not only cells, but also viruses and progenotes. Besides, we present evidence that at least two biochemical pathways as important as (i) DNA replication and (ii) lipid biosynthesis are not homologous between Bacteria and Archaea. The identities observed between the proteins involved in those pathways along representatives of these two ancestral domains of life are too low to indicate common genic ancestry. Altogether these facts can be seen as an indication that cellular organization has possibly evolved two or more times and that LUCA (the Last Universal Common Ancestor) may not have existed as a cellular entity. Thus, we aim to consider the possibility that different strategies acquired by biological systems to exist, such as viral, bacterial and archaeal were most likely originated independently from the evolution of different progenote populations.
Collapse
|
24
|
Matzov D, Taoka M, Nobe Y, Yamauchi Y, Halfon Y, Asis N, Zimermann E, Rozenberg H, Bashan A, Bhushan S, Isobe T, Gray MW, Yonath A, Shalev-Benami M. Cryo-EM structure of the highly atypical cytoplasmic ribosome of Euglena gracilis. Nucleic Acids Res 2020; 48:11750-11761. [PMID: 33091122 PMCID: PMC7672448 DOI: 10.1093/nar/gkaa893] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2020] [Revised: 09/21/2020] [Accepted: 10/21/2020] [Indexed: 12/11/2022] Open
Abstract
Ribosomal RNA is the central component of the ribosome, mediating its functional and architectural properties. Here, we report the cryo-EM structure of a highly divergent cytoplasmic ribosome from the single-celled eukaryotic alga Euglena gracilis. The Euglena large ribosomal subunit is distinct in that it contains 14 discrete rRNA fragments that are assembled non-covalently into the canonical ribosome structure. The rRNA is substantially enriched in post-transcriptional modifications that are spread far beyond the catalytic RNA core, contributing to the stabilization of this highly fragmented ribosome species. A unique cluster of five adenosine base methylations is found in an expansion segment adjacent to the protein exit tunnel, such that it is positioned for interaction with the nascent peptide. As well as featuring distinctive rRNA expansion segments, the Euglena ribosome contains four novel ribosomal proteins, localized to the ribosome surface, three of which do not have orthologs in other eukaryotes.
Collapse
Affiliation(s)
- Donna Matzov
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Masato Taoka
- Department of Chemistry, Graduate School of Science, Tokyo Metropolitan University, Minami-osawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan
| | - Yuko Nobe
- Department of Chemistry, Graduate School of Science, Tokyo Metropolitan University, Minami-osawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan
| | - Yoshio Yamauchi
- Department of Chemistry, Graduate School of Science, Tokyo Metropolitan University, Minami-osawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan
| | - Yehuda Halfon
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Nofar Asis
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Ella Zimermann
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Haim Rozenberg
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Anat Bashan
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Shashi Bhushan
- School of Biological Sciences, Nanyang Technological University, Singapore
| | - Toshiaki Isobe
- Department of Chemistry, Graduate School of Science, Tokyo Metropolitan University, Minami-osawa 1-1, Hachioji-shi, Tokyo 192-0397, Japan
| | - Michael W Gray
- Department of Biochemistry and Molecular Biology and Centre for Comparative Genomics and Evolutionary Bioinformatics, Dalhousie University, Halifax, Nova Scotia, Canada B3H 1X5
| | - Ada Yonath
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Moran Shalev-Benami
- Department of Structural Biology, Weizmann Institute of Science, Rehovot 7610001, Israel
| |
Collapse
|
25
|
Kunnev D. Origin of Life: The Point of No Return. Life (Basel) 2020; 10:life10110269. [PMID: 33153087 PMCID: PMC7693465 DOI: 10.3390/life10110269] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2020] [Revised: 11/01/2020] [Accepted: 11/01/2020] [Indexed: 12/13/2022] Open
Abstract
Origin of life research is one of the greatest scientific frontiers of mankind. Many hypotheses have been proposed to explain how life began. Although different hypotheses emphasize different initial phenomena, all of them agree around one important concept: at some point, along with the chain of events toward life, Darwinian evolution emerged. There is no consensus, however, how this occurred. Frequently, the mechanism leading to Darwinian evolution is not addressed and it is assumed that this problem could be solved later, with experimental proof of the hypothesis. Here, the author first defines the minimum components required for Darwinian evolution and then from this standpoint, analyzes some of the hypotheses for the origin of life. Distinctive features of Darwinian evolution and life rooted in the interaction between information and its corresponding structure/function are then reviewed. Due to the obligatory dependency of the information and structure subject to Darwinian evolution, these components must be locked in their origin. One of the most distinctive characteristics of Darwinian evolution in comparison with all other processes is the establishment of a fundamentally new level of matter capable of evolving and adapting. Therefore, the initiation of Darwinian evolution is the "point of no return" after which life begins. In summary: a definition and a mechanism for Darwinian evolution are provided together with a critical analysis of some of the hypotheses for the origin of life.
Collapse
Affiliation(s)
- Dimiter Kunnev
- Department of Oral Biology, University at Buffalo, Buffalo, NY 14263, USA
| |
Collapse
|
26
|
Demongeot J, Seligmann H. Codon assignment evolvability in theoretical minimal RNA rings. Gene 2020; 769:145208. [PMID: 33031892 DOI: 10.1016/j.gene.2020.145208] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2020] [Revised: 09/28/2020] [Accepted: 09/29/2020] [Indexed: 12/28/2022]
Abstract
Genetic code codon-amino acid assignments evolve for 15 (AAA, AGA, AGG, ATA, CGG, CTA, CTG. CTC, CTT, TAA, TAG, TCA, TCG, TGA and TTA (GNN codons notably absent)) among 64 codons (23.4%) across the 31 genetic codes (NCBI list completed with recently suggested green algal mitochondrial genetic codes). Their usage in 25 theoretical minimal RNA rings is examined. RNA rings are designed in silico to code once over the shortest length for all 22 coding signals (start and stop codons and each amino acid according to the standard genetic code). Though designed along coding constraints, RNA rings resemble ancestral tRNA loops, assigning to each RNA ring a putative anticodon, a cognate amino acid and an evolutionary genetic code integration rank for that cognate amino acid. Analyses here show 1. biases against/for evolvable codons in the two first vs last thirds of RNA ring coding sequences, 2. RNA rings with evolvable codons have recent cognates, and 3. evolvable codon and cytosine numbers in RNA ring compositions are positively correlated. Applying alternative genetic codes to RNA rings designed for nonredundant coding according to the standard genetic code reveals unsuspected properties of the standard genetic code and of RNA rings, notably on codon assignment evolvability and the special role of cytosine in relation to codon assignment evolvability and of the genetic code's coding structure.
Collapse
Affiliation(s)
- Jacques Demongeot
- Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical, F-38700 La Tronche, France
| | - Hervé Seligmann
- The National Natural History Collections, The Hebrew University of Jerusalem, 91404 Jerusalem, Israel.
| |
Collapse
|
27
|
Rivas M, Fox GE. Further Characterization of the Pseudo-Symmetrical Ribosomal Region. Life (Basel) 2020; 10:life10090201. [PMID: 32937913 PMCID: PMC7555685 DOI: 10.3390/life10090201] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Revised: 09/09/2020] [Accepted: 09/11/2020] [Indexed: 12/17/2022] Open
Abstract
The peptidyl transferase center of the modern ribosome has been found to encompass an area of twofold pseudosymmetry (SymR). This observation strongly suggests that the very core of the ribosome arose from a dimerization event between two modest-sized RNAs. It was previously shown that at least four non-standard interactions exist between the two halves of SymR. Herein, we verify that the structure of the SymR is highly conserved with respect to both ribosome transition state and phylogenetic diversity. These comparisons also reveal two additional sites of interaction between the two halves of SymR and refine our understanding of the previously known interactions. In addition, the possible role that magnesium may have in the coordination, stabilization, association, and evolutionary history of the two halves (A-region and P-region) was examined. Together, the results identify a likely site where structural elements and Mg2+ ions may have facilitated the ligation of two aboriginal RNAs into a single unit.
Collapse
|
28
|
Abstract
We tested the hypothesis that concatemers of ancestral tRNAs gave rise to the 16S ribosomal RNA. We built an ancestral sequence of proto-tRNAs that showed a significant identity of 51.69% and a percentage of structural identity of 0.941 with the 16S ribosomal molecule. We also propose a hypothesis for the emergence of translation.
Collapse
|
29
|
Gospodinov A, Kunnev D. Universal Codons with Enrichment from GC to AU Nucleotide Composition Reveal a Chronological Assignment from Early to Late Along with LUCA Formation. Life (Basel) 2020; 10:life10060081. [PMID: 32516985 PMCID: PMC7345086 DOI: 10.3390/life10060081] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2020] [Revised: 05/30/2020] [Accepted: 06/03/2020] [Indexed: 12/14/2022] Open
Abstract
The emergence of a primitive genetic code should be considered the most essential event during the origin of life. Almost a complete set of codons (as we know them) should have been established relatively early during the evolution of the last universal common ancestor (LUCA) from which all known organisms descended. Many hypotheses have been proposed to explain the driving forces and chronology of the evolution of the genetic code; however, none is commonly accepted. In the current paper, we explore the features of the genetic code that, in our view, reflect the mechanism and the chronological order of the origin of the genetic code. Our hypothesis postulates that the primordial RNA was mostly GC-rich, and this bias was reflected in the order of amino acid codon assignment. If we arrange the codons and their corresponding amino acids from GC-rich to AU-rich, we find that: 1. The amino acids encoded by GC-rich codons (Ala, Gly, Arg, and Pro) are those that contribute the most to the interactions with RNA (if incorporated into short peptides). 2. This order correlates with the addition of novel functions necessary for the evolution from simple to longer folded peptides. 3. The overlay of aminoacyl-tRNA synthetases (aaRS) to the amino acid order produces a distinctive zonal distribution for class I and class II suggesting an interdependent origin. These correlations could be explained by the active role of the bridge peptide (BP), which we proposed earlier in the evolution of the genetic code.
Collapse
Affiliation(s)
- Anastas Gospodinov
- Roumen Tsanev Institute of Molecular Biology, Bulgarian Academy of Sciences, Acad. G. Bonchev Str. 21, Sofia 1113, Bulgaria;
| | - Dimiter Kunnev
- Department of Molecular & Cellular Biology, Roswell Park Cancer Institute, Buffalo, NY 14263, USA
- Correspondence:
| |
Collapse
|
30
|
Bowman JC, Petrov AS, Frenkel-Pinter M, Penev PI, Williams LD. Root of the Tree: The Significance, Evolution, and Origins of the Ribosome. Chem Rev 2020; 120:4848-4878. [PMID: 32374986 DOI: 10.1021/acs.chemrev.9b00742] [Citation(s) in RCA: 86] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
The ribosome is an ancient molecular fossil that provides a telescope to the origins of life. Made from RNA and protein, the ribosome translates mRNA to coded protein in all living systems. Universality, economy, centrality and antiquity are ingrained in translation. The translation machinery dominates the set of genes that are shared as orthologues across the tree of life. The lineage of the translation system defines the universal tree of life. The function of a ribosome is to build ribosomes; to accomplish this task, ribosomes make ribosomal proteins, polymerases, enzymes, and signaling proteins. Every coded protein ever produced by life on Earth has passed through the exit tunnel, which is the birth canal of biology. During the root phase of the tree of life, before the last common ancestor of life (LUCA), exit tunnel evolution is dominant and unremitting. Protein folding coevolved with evolution of the exit tunnel. The ribosome shows that protein folding initiated with intrinsic disorder, supported through a short, primitive exit tunnel. Folding progressed to thermodynamically stable β-structures and then to kinetically trapped α-structures. The latter were enabled by a long, mature exit tunnel that partially offset the general thermodynamic tendency of all polypeptides to form β-sheets. RNA chaperoned the evolution of protein folding from the very beginning. The universal common core of the ribosome, with a mass of nearly 2 million Daltons, was finalized by LUCA. The ribosome entered stasis after LUCA and remained in that state for billions of years. Bacterial ribosomes never left stasis. Archaeal ribosomes have remained near stasis, except for the superphylum Asgard, which has accreted rRNA post LUCA. Eukaryotic ribosomes in some lineages appear to be logarithmically accreting rRNA over the last billion years. Ribosomal expansion in Asgard and Eukarya has been incremental and iterative, without substantial remodeling of pre-existing basal structures. The ribosome preserves information on its history.
Collapse
Affiliation(s)
- Jessica C Bowman
- Center for the Origins of Life, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Anton S Petrov
- Center for the Origins of Life, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Moran Frenkel-Pinter
- Center for the Origins of Life, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Petar I Penev
- Center for the Origins of Life, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Loren Dean Williams
- Center for the Origins of Life, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| |
Collapse
|
31
|
Demongeot J, Seligmann H. Comparisons between small ribosomal RNA and theoretical minimal RNA ring secondary structures confirm phylogenetic and structural accretion histories. Sci Rep 2020; 10:7693. [PMID: 32376895 PMCID: PMC7203183 DOI: 10.1038/s41598-020-64627-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Accepted: 04/01/2020] [Indexed: 12/16/2022] Open
Abstract
Ribosomal RNAs are complex structures that presumably evolved by tRNA accretions. Statistical properties of tRNA secondary structures correlate with genetic code integration orders of their cognate amino acids. Ribosomal RNA secondary structures resemble those of tRNAs with recent cognates. Hence, rRNAs presumably evolved from ancestral tRNAs. Here, analyses compare secondary structure subcomponents of small ribosomal RNA subunits with secondary structures of theoretical minimal RNA rings, presumed proto-tRNAs. Two independent methods determined different accretion orders of rRNA structural subelements: (a) classical comparative homology and phylogenetic reconstruction, and (b) a structural hypothesis assuming an inverted onion ring growth where the three-dimensional ribosome's core is most ancient and peripheral elements most recent. Comparisons between (a) and (b) accretions orders with RNA ring secondary structure scales show that recent rRNA subelements are: 1. more like RNA rings with recent cognates, indicating ongoing coevolution between tRNA and rRNA secondary structures; 2. less similar to theoretical minimal RNA rings with ancient cognates. Our method fits (a) and (b) in all examined organisms, more with (a) than (b). Results stress the need to integrate independent methods. Theoretical minimal RNA rings are potential evolutionary references for any sequence-based evolutionary analyses, independent of the focal data from that study.
Collapse
Affiliation(s)
- Jacques Demongeot
- Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecoms4Health, F-38700, La Tronche, France.
| | - Hervé Seligmann
- Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecoms4Health, F-38700, La Tronche, France
- The National Natural History Collections, The Hebrew University of Jerusalem, 91404, Jerusalem, Israel
| |
Collapse
|
32
|
Seligmann H. First arrived, first served: competition between codons for codon-amino acid stereochemical interactions determined early genetic code assignments. Naturwissenschaften 2020; 107:20. [PMID: 32367155 DOI: 10.1007/s00114-020-01676-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Revised: 03/10/2020] [Accepted: 04/05/2020] [Indexed: 12/12/2022]
Abstract
Stereochemical nucleotide-amino acid interactions, in the form of noncovalent nucleotide-amino acid interactions, potentially produced the genetic code's codon-amino acid assignments. Empirical estimates of single nucleotide-amino acid affinities on surfaces and in solution are used to test whether trinucleotide-amino acid affinities determined genetic code assignments pending the principle "first arrived, first served": presumed early amino acids have greater codon-amino acid affinities than ulterior ones. Here, these single nucleotide affinities are used to approximate all 64 × 20 trinucleotide-amino acid affinities. Analyses show that (1) on surfaces, genetic code codon-amino acid assignments tend to match high affinities for the amino acids that integrated earliest the genetic code (according to Wong's metabolic coevolution hypothesis between nucleotides and amino acids) and (2) in solution, the same principle holds for the anticodon-amino acid assignments. Affinity analyses match best genetic code assignments when assuming that trinucleotides competed for amino acids, rather than amino acids for trinucleotides. Codon-amino acid affinities stick better to genetic code assignments than anticodon-amino acid affinities. Presumably, two independent coding systems, on surfaces and in solution, converged, and formed the current translation system. Proto-translation on surfaces by direct codon-amino acid interactions without tRNA-like adaptors coadapted with a system emerging in solution by proto-tRNA anticodon-amino acid interactions. These systems assigned identical or similar cognates to codons on surfaces and to anticodons in solution. Results indicate that a prebiotic metabolism predated genetic code self-organization.
Collapse
Affiliation(s)
- Hervé Seligmann
- The National Natural History Collections, The Hebrew University of Jerusalem, 91904, Jerusalem, Israel. .,Faculty of Medicine, Université Grenoble Alpes, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecoms4Health, F-38700, La Tronche, France.
| |
Collapse
|
33
|
de Farias ST, José MV. Transfer RNA: The molecular demiurge in the origin of biological systems. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2020; 153:28-34. [PMID: 32105652 DOI: 10.1016/j.pbiomolbio.2020.02.006] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2019] [Revised: 02/03/2020] [Accepted: 02/11/2020] [Indexed: 01/24/2023]
Abstract
Herein, we review recent works on the role that the tRNA molecule played in the early origins of biological systems. tRNAs gave origin to the first genes (mRNA), the peptidyl transferase center (PTC), the 16S ribosomal molecule, proto-tRNAs were at the core of a proto-translation system, and the anticodon and operational codes appeared in tRNAs molecules. Metabolic pathways emerged from evolutionary pressures of the decoding systems. The transitions from the RNA world to the ribonucleoprotein world to modern biological systems were driven by two kinds of tRNAs transitions, to wit, tRNAs leading to both mRNA and rRNA.
Collapse
Affiliation(s)
- Sávio Torres de Farias
- Laboratório de Genética Evolutiva Paulo Leminsk, Departamento de Biologia Molecular, Universidade Federal da Paraíba, João Pessoa, Brazil.
| | - Marco V José
- Theoretical Biology Group, Instituto de Investigaciones Biomédicas, Universidad Nacional Autónoma de México, Ciudad de México CDMX, C.P. 04510, Mexico.
| |
Collapse
|
34
|
Demongeot J, Seligmann H. Deamination gradients within codons after 1<->2 position swap predict amino acid hydrophobicity and parallel β-sheet conformational preference. Biosystems 2020; 191-192:104116. [PMID: 32081715 DOI: 10.1016/j.biosystems.2020.104116] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Revised: 12/04/2019] [Accepted: 02/10/2020] [Indexed: 12/16/2022]
Abstract
Deaminations C->T and A->G are frequent mutations producing nucleotide content gradients across genomes proportional to singlestrandedness during replication/transcription. Hence, within single codons, deamination risks increase from first to third codon positions, while second codon positions are functionally most crucial. Here genetic codes are analyzed assuming that after anticodons protected codons from deaminations, first and second codon positions swapped (N2N1N3->N1N2N3), with lowest deamination risks for N2 in presumed primitive N2N1N3 codons. N2N1N3, not standard N1N2N3, codon structure minimizes deaminations inversely proportionally to cognate amino acid hydrophobicity and parallel betasheet conformational preference. For N1N2N3, deamination minimization increases with genetic code integration order of cognate amino acids: during the presumed N2N1N3->N1N2N3 codon structure transition, protein synthesis combined direct codon-amino acid interactions for late amino acids and tRNA-based translation for early amino acids. Hence N2N1N3 codons would correspond to tRNA-free translation by spontaneous codon-amino acid affinities, and tRNA-mediated translation presumably caused N2N1N3->N1N2N3 swaps. Results show that rational, not arbitrary rules link codon and amino acid structures. Some analyses detect mitochondrial RNAs and peptides in public data corresponding to systematic position swaps, suggesting occasional swapping polymerase activity.
Collapse
Affiliation(s)
- Jacques Demongeot
- Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical, F-38700, La Tronche, France.
| | - Hervé Seligmann
- Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical, F-38700, La Tronche, France; The National Natural History Collections, The Hebrew University of Jerusalem, 91404, Jerusalem, Israel.
| |
Collapse
|
35
|
Michel CJ, Thompson JD. Identification of a circular code periodicity in the bacterial ribosome: origin of codon periodicity in genes? RNA Biol 2020; 17:571-583. [PMID: 31960748 DOI: 10.1080/15476286.2020.1719311] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023] Open
Abstract
Three-base periodicity (TBP), where nucleotides and higher order n-tuples are preferentially spaced by 3, 6, 9, etc. bases, is a well-known intrinsic property of protein-coding DNA sequences. However, its origins are still not fully understood. One hypothesis is that the periodicity reflects a primordial coding system that was used before the emergence of the modern standard genetic code (SGC). Recent evidence suggests that the X circular code, a set of 20 trinucleotides allowing the reading frames in genes to be retrieved locally, represents a possible ancestor of the SGC. Motifs from the X circular code have been found in the reading frame of protein-coding regions in extant organisms from bacteria to eukaryotes, in many transfer RNA (tRNA) genes and in important functional regions of the ribosomal RNA (rRNA), notably in the peptidyl transferase centre and the decoding centre. Here, we have used a powerful correlation function to search for periodicity patterns involving the 20 trinucleotides of the X circular code in a large set of bacterial protein-coding genes, as well as in the translation machinery, including rRNA and tRNA sequences. As might be expected, we found a strong circular code periodicity 0 modulo 3 in the protein-coding genes. More surprisingly, we also identified a similar circular code periodicity in a large region of the 16S rRNA. This region includes the 3' major domain corresponding to the primordial proto-ribosome decoding centre and containing numerous sites that interact with the tRNA and messenger RNA (mRNA) during translation. Furthermore, 3D structural analysis shows that the periodicity region surrounds the mRNA channel that lies between the head and the body of the SSU. Our results support the hypothesis that the X circular code may constitute an ancestral translation code involved in reading frame retrieval and maintenance, traces of which persist in modern mRNA, tRNA and rRNA despite their long evolution and adaptation to the SGC.
Collapse
Affiliation(s)
- Christian J Michel
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg, France
| | - Julie D Thompson
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg, France
| |
Collapse
|
36
|
Demongeot J, Seligmann H. Accretion history of large ribosomal subunits deduced from theoretical minimal RNA rings is congruent with histories derived from phylogenetic and structural methods. Gene 2020; 738:144436. [PMID: 32027954 DOI: 10.1016/j.gene.2020.144436] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2019] [Revised: 01/24/2020] [Accepted: 02/01/2020] [Indexed: 12/17/2022]
Abstract
Accretions of tRNAs presumably formed the large complex ribosomal RNA structures. Similarities of tRNA secondary structures with rRNA secondary structures increase with the integration order of their cognate amino acid in the genetic code, indicating tRNA evolution towards rRNA-like structures. Here analyses rank secondary structure subelements of three large ribosomal RNAs (Prokaryota: Archaea: Thermus thermophilus; Bacteria: Escherichia coli; Eukaryota: Saccharomyces cerevisiae) in relation to their similarities with secondary structures formed by presumed proto-tRNAs, represented by 25 theoretical minimal RNA rings. These ranks are compared to those derived from two independent methods (ranks provide a relative evolutionary age to the rRNA substructure), (a) cladistic phylogenetic analyses and (b) 3D-crystallography where core subelements are presumed ancient and peripheral ones recent. Comparisons of rRNA secondary structure subelements with RNA ring secondary structures show congruence between ranks deduced by this method and both (a) and (b) (more with (a) than (b)), especially for RNA rings with predicted ancient cognate amino acid. Reconstruction of accretion histories of large rRNAs will gain from adequately integrating information from independent methods. Theoretical minimal RNA rings, sequences deterministically designed in silico according to specific coding constraints, might produce adequate scales for prebiotic and early life molecular evolution.
Collapse
Affiliation(s)
- Jacques Demongeot
- Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecoms4Health, F-38700 La Tronche, France.
| | - Hervé Seligmann
- Université Grenoble Alpes, Faculty of Medicine, Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecoms4Health, F-38700 La Tronche, France; The National Natural History Collections, The Hebrew University of Jerusalem, 91404 Jerusalem, Israel.
| |
Collapse
|
37
|
|
38
|
Pilla SP, Bahadur RP. Residue conservation elucidates the evolution of r-proteins in ribosomal assembly and function. Int J Biol Macromol 2019; 140:323-329. [PMID: 31421176 DOI: 10.1016/j.ijbiomac.2019.08.127] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2019] [Revised: 08/14/2019] [Accepted: 08/14/2019] [Indexed: 02/08/2023]
Abstract
Ribosomes are the translational machineries having two unequal subunits, small subunit (SSU) and large subunit (LSU) across all the domains of life. Origin and evolution of ribosome are encoded in its structure, and the core of the ribosome is highly conserved. Here, we have used Shannon entropy to analyze the evolution of ribosomal proteins (r-proteins) across the three domains of life. Moreover, we have analyzed the residue conservation at protein-protein (PP) and protein-RNA (PR) interfaces in SSU and LSU. Furthermore, we have studied the evolution of early, intermediate and late binding r-proteins. We show that the r-proteins of Thermus thermophilus are better conserved during the evolution. Furthermore, we find the late binders are better conserved than the early and the intermediate binders. The residues at the interior of the r-proteins are the most conserved followed by those at the interface and the solvent accessible surface. Additionally, we show that the residues at the PP interfaces are better conserved than those at the PR interfaces. However, between PR and PP interfaces, the multi-interface residues at the former are better conserved than those at the latter ones. Our findings may provide insights into the evolution of r-proteins in ribosomal assembly and function.
Collapse
Affiliation(s)
- Smita P Pilla
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
| | - Ranjit Prasad Bahadur
- Computational Structural Biology Laboratory, Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur 721302, India.
| |
Collapse
|
39
|
Mughal F, Caetano-Anollés G. MANET 3.0: Hierarchy and modularity in evolving metabolic networks. PLoS One 2019; 14:e0224201. [PMID: 31648227 PMCID: PMC6812854 DOI: 10.1371/journal.pone.0224201] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2019] [Accepted: 10/08/2019] [Indexed: 11/30/2022] Open
Abstract
Enzyme recruitment is a fundamental evolutionary driver of modern metabolism. We see evidence of recruitment at work in the metabolic Molecular Ancestry Networks (MANET) database, an online resource that integrates data from KEGG, SCOP and structural phylogenomic reconstruction. The database, which was introduced in 2006, traces the deep history of the structural domains of enzymes in metabolic pathways. Here we release version 3.0 of MANET, which updates data from KEGG and SCOP, links enzyme and PDB information with PDBsum, and traces evolutionary information of domains defined at fold family level of SCOP classification in metabolic subnetwork diagrams. Compared to SCOP folds used in the previous versions, fold families are cohesive units of functional similarity that are highly conserved at sequence level and offer a 10-fold increase of data entries. We surveyed enzymatic, functional and catalytic site distributions among superkingdoms showing that ancient enzymatic innovations followed a biphasic temporal pattern of diversification typical of module innovation. We grouped enzymatic activities of MANET into a hierarchical system of subnetworks and mesonetworks matching KEGG classification. The evolutionary growth of these modules of metabolic activity was studied using bipartite networks and their one-mode projections at enzyme, subnetwork and mesonetwork levels of organization. Evolving metabolic networks revealed patterns of enzyme sharing that transcended mesonetwork boundaries and supported the patchwork model of metabolic evolution. We also explored the scale-freeness, randomness and small-world properties of evolving networks as possible organizing principles of network growth and diversification. The network structure shows an increase in hierarchical modularity and scale-free behavior as metabolic networks unfold in evolutionary time. Remarkably, this evolutionary constraint on structure was stronger at lower levels of metabolic organization. Evolving metabolic structure reveals a 'principle of granularity', an evolutionary increase of the cohesiveness of lower-level parts of a hierarchical system. MANET is available at http://manet.illinois.edu.
Collapse
Affiliation(s)
- Fizza Mughal
- Illinois Informatics Institute, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| | - Gustavo Caetano-Anollés
- Illinois Informatics Institute, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois at Urbana-Champaign, Urbana, Illinois, United States of America
| |
Collapse
|
40
|
Caetano-Anollés G, Aziz MF, Mughal F, Gräter F, Koç I, Caetano-Anollés K, Caetano-Anollés D. Emergence of Hierarchical Modularity in Evolving Networks Uncovered by Phylogenomic Analysis. Evol Bioinform Online 2019; 15:1176934319872980. [PMID: 31523127 PMCID: PMC6728656 DOI: 10.1177/1176934319872980] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Accepted: 08/08/2019] [Indexed: 01/15/2023] Open
Abstract
Networks describe how parts associate with each other to form integrated systems which often have modular and hierarchical structure. In biology, network growth involves two processes, one that unifies and the other that diversifies. Here, we propose a biphasic (bow-tie) theory of module emergence. In the first phase, parts are at first weakly linked and associate variously. As they diversify, they compete with each other and are often selected for performance. The emerging interactions constrain their structure and associations. This causes parts to self-organize into modules with tight linkage. In the second phase, variants of the modules diversify and become new parts for a new generative cycle of higher level organization. The paradigm predicts the rise of hierarchical modularity in evolving networks at different timescales and complexity levels. Remarkably, phylogenomic analyses uncover this emergence in the rewiring of metabolomic and transcriptome-informed metabolic networks, the nanosecond dynamics of proteins, and evolving networks of metabolism, elementary functionomes, and protein domain organization.
Collapse
Affiliation(s)
- Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory,
Department of Crop Sciences, C.R. Woese Institute for Genomic Biology, and Illinois
Informatics Institute, University of Illinois, Urbana, IL, USA
| | - M Fayez Aziz
- Evolutionary Bioinformatics Laboratory,
Department of Crop Sciences, C.R. Woese Institute for Genomic Biology, and Illinois
Informatics Institute, University of Illinois, Urbana, IL, USA
| | - Fizza Mughal
- Evolutionary Bioinformatics Laboratory,
Department of Crop Sciences, C.R. Woese Institute for Genomic Biology, and Illinois
Informatics Institute, University of Illinois, Urbana, IL, USA
| | - Frauke Gräter
- Heidelberg Institute for Theoretical
Studies, Heidelberg, Germany
| | - Ibrahim Koç
- Department of Molecular Biology and
Genetics, Gebze Technical University, Gebze, Turkey
| | - Kelsey Caetano-Anollés
- Division of Biomedical Informatics,
College of Medicine, Seoul National University, Seoul, Republic of Korea
| | | |
Collapse
|
41
|
Seligmann H, Warthi G. Chimeric Translation for Mitochondrial Peptides: Regular and Expanded Codons. Comput Struct Biotechnol J 2019; 17:1195-1202. [PMID: 31534643 PMCID: PMC6742854 DOI: 10.1016/j.csbj.2019.08.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Revised: 08/19/2019] [Accepted: 08/21/2019] [Indexed: 02/07/2023] Open
Abstract
Frameshifting protein translation occasionally results from insertion of amino acids at isolated mono- or dinucleotide-expanded codons by tRNAs with expanded anticodons. Previous analyses of two different types of human mitochondrial MS proteomic data (Fisher and Waters technologies) detect peptides entirely corresponding to expanded codon translation. Here, these proteomic data are reanalyzed searching for peptides consisting of at least eight consecutive amino acids translated according to regular tricodons, and at least eight adjacent consecutive amino acids translated according to expanded codons. Both datasets include chimerically translated peptides (mono- and dinucleotide expansions, 42 and 37, respectively). The regular tricodon-encoded part of some chimeric peptides corresponds to standard human mitochondrial proteins (mono- and dinucleotide expansions, six (AT6, CytB, ND1, 2xND2, ND5) and one (ND1), respectively). Chimeric translation probably increases the diversity of mitogenome-encoded proteins, putatively producing functional proteins. These might result from translation by tRNAs with expanded anticodons, or from regular tricodon translation of RNAs where transcription/posttranscriptional edition systematically deleted mono- or dinucleotides after each trinucleotide. The pairwise matched combination of adjacent peptide parts translated from regular and expanded codons strengthens the hypothesis that translation of stretches of consecutive expanded codons occurs. Results indicate statistical translation producing distributions of alternative proteins. Genetic engineering should account for potential unexpected, unwanted secondary products.
Collapse
Affiliation(s)
- Hervé Seligmann
- The National Natural History Collections, The Hebrew University of Jerusalem, 91404 Jerusalem, Israel
| | - Ganesh Warthi
- Aix-Marseille University, IRD, VITROME, Institut Hospitalo-Universitaire Méditerranée-Infection, Marseille, France
| |
Collapse
|
42
|
Abstract
We tested the hypothesis that concatemers of ancestral tRNAs gave rise to the 16S ribosomal RNA. We built an ancestral sequence of proto-tRNAs that showed a significant identity of 51.69% and a percentage of structural identity of 0.941 with the 16S ribosomal molecule. We also propose a hypothesis for the emergence of translation.
Collapse
|
43
|
Ariza-Mateos A, Briones C, Perales C, Domingo E, Gómez J. The archaeology of coding RNA. Ann N Y Acad Sci 2019; 1447:119-134. [PMID: 31237363 DOI: 10.1111/nyas.14173] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2019] [Revised: 05/18/2019] [Accepted: 05/29/2019] [Indexed: 12/16/2022]
Abstract
Different theories concerning the origin of RNA (and, in particular, mRNA) point to the concatenation and expansion of proto-tRNA-like structures. Different biochemical and biophysical tools have been used to search for ancient-like RNA elements with a specific structure in genomic viral RNAs, including that of the hepatitis C virus, as well as in cellular mRNA populations, in particular those of human hepatocytes. We define this method as "archaeological," and it has been designed to discover evolutionary patterns through a nonphylogenetic and nonrepresentational strategy. tRNA-like elements were found in structurally or functionally relevant positions both in viral RNA and in one of the liver mRNAs examined, the antagonist interferon-alpha subtype 5 (IFNA5) mRNA. Additionally, tRNA-like elements are highly represented within the hepatic mRNA population, which suggests that they could have participated in the formation of coding RNAs in the distant past. Expanding on this finding, we have observed a recurring dsRNA-like motif next to the tRNA-like elements in both viral RNAs and IFNA5 mRNA. This suggested that the concatenation of these RNA motifs was an activity present in the RNA pools that might have been relevant in the RNA world. The extensive alteration of sequences that likely triggered the transition from the predecessors of coding RNAs to the first fully functional mRNAs (which was not the case in the stepwise construction of noncoding rRNAs) hinders the phylogeny-based identification of RNA elements (both sequences and structures) that might have been active before the advent of protein synthesis. Therefore, our RNA archaeological method is presented as a way to better understand the structural/functional versatility of a variety of RNA elements, which might represent "the losers" in the process of RNA evolution as they had to adapt to the selective pressures favoring the coding capacity of the progressively longer mRNAs.
Collapse
Affiliation(s)
- Ascensión Ariza-Mateos
- Laboratory of RNA Archaeology, Instituto de Parasitología y Biomedicina "López-Neyra" (CSIC), Granada, Spain.,Centro de Biología Molecular "Severo Ochoa" (CSIC-UAM), Campus de Cantoblanco, Madrid, Spain
| | - Carlos Briones
- Department of Molecular Evolution, Centro de Astrobiología (CSIC-INTA), Madrid, Spain.,Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, Madrid, Spain
| | - Celia Perales
- Centro de Biología Molecular "Severo Ochoa" (CSIC-UAM), Campus de Cantoblanco, Madrid, Spain.,Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, Madrid, Spain.,Department of Clinical Microbiology, IIS-Fundación Jiménez Díaz, UAM, Madrid, Spain
| | - Esteban Domingo
- Centro de Biología Molecular "Severo Ochoa" (CSIC-UAM), Campus de Cantoblanco, Madrid, Spain.,Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, Madrid, Spain
| | - Jordi Gómez
- Laboratory of RNA Archaeology, Instituto de Parasitología y Biomedicina "López-Neyra" (CSIC), Granada, Spain.,Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas (CIBERehd), Instituto de Salud Carlos III, Madrid, Spain
| |
Collapse
|
44
|
Abstract
The search for extraterrestrial life, recently fueled by the discovery of exoplanets, requires defined biosignatures. Current biomarkers include those of extremophilic organisms, typically archaea. Yet these cellular organisms are highly complex, which makes it unlikely that similar life forms evolved on other planets. Earlier forms of life on Earth may serve as better models for extraterrestrial life. On modern Earth, the simplest and most abundant biological entities are viroids and viruses that exert many properties of life, such as the abilities to replicate and undergo Darwinian evolution. Viroids have virus-like features, and are related to ribozymes, consisting solely of non-coding RNA, and may serve as more universal models for early life than do cellular life forms. Among the various proposed concepts, such as “proteins-first” or “metabolism-first”, we think that “viruses-first” can be specified to “viroids-first” as the most likely scenario for the emergence of life on Earth, and possibly elsewhere. With this article we intend to inspire the integration of virus research and the biosignatures of viroids and viruses into the search for extraterrestrial life.
Collapse
|
45
|
Moelling K, Broecker F. Viruses and Evolution - Viruses First? A Personal Perspective. Front Microbiol 2019; 10:523. [PMID: 30941110 PMCID: PMC6433886 DOI: 10.3389/fmicb.2019.00523] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2018] [Accepted: 02/28/2019] [Indexed: 01/08/2023] Open
Abstract
The discovery of exoplanets within putative habitable zones revolutionized astrobiology in recent years. It stimulated interest in the question about the origin of life and its evolution. Here, we discuss what the roles of viruses might have been at the beginning of life and during evolution. Viruses are the most abundant biological entities on Earth. They are present everywhere, in our surrounding, the oceans, the soil and in every living being. Retroviruses contributed to about half of our genomic sequences and to the evolution of the mammalian placenta. Contemporary viruses reflect evolution ranging from the RNA world to the DNA-protein world. How far back can we trace their contribution? Earliest replicating and evolving entities are the ribozymes or viroids fulfilling several criteria of life. RNA can perform many aspects of life and influences our gene expression until today. The simplest structures with non-protein-coding information may represent models of life built on structural, not genetic information. Viruses today are obligatory parasites depending on host cells. Examples of how an independent lifestyle might have been lost include mitochondria, chloroplasts, Rickettsia and others, which used to be autonomous bacteria and became intracellular parasites or endosymbionts, thereby losing most of their genes. Even in vitro the loss of genes can be recapitulated all the way from coding to non-coding RNA. Furthermore, the giant viruses may indicate that there is no sharp border between living and non-living entities but an evolutionary continuum. Here, it is discussed how viruses can lose and gain genes, and that they are essential drivers of evolution. This discussion may stimulate the thinking about viruses as early possible forms of life. Apart from our view “viruses first”, there are others such as “proteins first” and “metabolism first.”
Collapse
Affiliation(s)
- Karin Moelling
- Institute of Medical Microbiology, University of Zurich, Zurich, Switzerland.,Max Planck Institute for Molecular Genetics, Berlin, Germany
| | - Felix Broecker
- Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, NY, United States
| |
Collapse
|
46
|
Chatterjee S, Yadav S. The Origin of Prebiotic Information System in the Peptide/RNA World: A Simulation Model of the Evolution of Translation and the Genetic Code. Life (Basel) 2019; 9:E25. [PMID: 30832272 PMCID: PMC6463137 DOI: 10.3390/life9010025] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Revised: 01/09/2019] [Accepted: 02/25/2019] [Indexed: 12/20/2022] Open
Abstract
Information is the currency of life, but the origin of prebiotic information remains a mystery. We propose transitional pathways from the cosmic building blocks of life to the complex prebiotic organic chemistry that led to the origin of information systems. The prebiotic information system, specifically the genetic code, is segregated, linear, and digital, and it appeared before the emergence of DNA. In the peptide/RNA world, lipid membranes randomly encapsulated amino acids, RNA, and peptide molecules, which are drawn from the prebiotic soup, to initiate a molecular symbiosis inside the protocells. This endosymbiosis led to the hierarchical emergence of several requisite components of the translation machine: transfer RNAs (tRNAs), aminoacyl-tRNA synthetase (aaRS), messenger RNAs (mRNAs), ribosomes, and various enzymes. When assembled in the right order, the translation machine created proteins, a process that transferred information from mRNAs to assemble amino acids into polypeptide chains. This was the beginning of the prebiotic information age. The origin of the genetic code is enigmatic; herein, we propose an evolutionary explanation: the demand for a wide range of protein enzymes over peptides in the prebiotic reactions was the main selective pressure for the origin of information-directed protein synthesis. The molecular basis of the genetic code manifests itself in the interaction of aaRS and their cognate tRNAs. In the beginning, aminoacylated ribozymes used amino acids as a cofactor with the help of bridge peptides as a process for selection between amino acids and their cognate codons/anticodons. This process selects amino acids and RNA species for the next steps. The ribozymes would give rise to pre-tRNA and the bridge peptides to pre-aaRS. Later, variants would appear and evolution would produce different but specific aaRS-tRNA-amino acid combinations. Pre-tRNA designed and built pre-mRNA for the storage of information regarding its cognate amino acid. Each pre-mRNA strand became the storage device for the genetic information that encoded the amino acid sequences in triplet nucleotides. As information appeared in the digital languages of the codon within pre-mRNA and mRNA, and the genetic code for protein synthesis evolved, the prebiotic chemistry then became more organized and directional with the emergence of the translation and genetic code. The genetic code developed in three stages that are coincident with the refinement of the translation machines: the GNC code that was developed by the pre-tRNA/pre-aaRS /pre-mRNA machine, SNS code by the tRNA/aaRS/mRNA machine, and finally the universal genetic code by the tRNA/aaRS/mRNA/ribosome machine. We suggest the coevolution of translation machines and the genetic code. The emergence of the translation machines was the beginning of the Darwinian evolution, an interplay between information and its supporting structure. Our hypothesis provides the logical and incremental steps for the origin of the programmed protein synthesis. In order to better understand the prebiotic information system, we converted letter codons into numerical codons in the Universal Genetic Code Table. We have developed a software, called CATI (Codon-Amino Acid-Translator-Imitator), to translate randomly chosen numerical codons into corresponding amino acids and vice versa. This conversion has granted us insight into how the genetic code might have evolved in the peptide/RNA world. There is great potential in the application of numerical codons to bioinformatics, such as barcoding, DNA mining, or DNA fingerprinting. We constructed the likely biochemical pathways for the origin of translation and the genetic code using the Model-View-Controller (MVC) software framework, and the translation machinery step-by-step. While using AnyLogic software, we were able to simulate and visualize the entire evolution of the translation machines, amino acids, and the genetic code.
Collapse
Affiliation(s)
- Sankar Chatterjee
- Department of Geosciences, Museum of Texas Tech University, Box 43191, 3301 4th Street, Lubbock, TX 79409, USA.
| | - Surya Yadav
- Rawls College of Business, Texas Tech University, Box 42101, 703 Flint Avenue, Lubbock, TX 79409, USA.
| |
Collapse
|
47
|
Rogers SO. Evolution of the genetic code based on conservative changes of codons, amino acids, and aminoacyl tRNA synthetases. J Theor Biol 2019; 466:1-10. [PMID: 30658052 DOI: 10.1016/j.jtbi.2019.01.022] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2018] [Revised: 01/10/2019] [Accepted: 01/14/2019] [Indexed: 11/30/2022]
Abstract
The genetic code, as arranged in the standard tabular form, displays a non-random structure relating to the characteristics of the amino acids. An alternative arrangement can be made by organizing the code according to aminoacyl-tRNA synthetases (aaRSs), codons, and reverse complement codons, which illuminates a coevolutionary process that led to the contemporary genetic code. As amino acids were added to the genetic code, they were recognized by aaRSs that interact with stereochemically similar amino acids. Single nucleotide changes in the codons and anticodons were favored over more extensive changes, such that there was a logical stepwise progression in the evolution of the genetic code. The model presented traces the evolution of the genetic code accounting for these steps. Amino acid frequencies in ancient proteins and the preponderance of GNN codons in mRNAs for ancient proteins indicate that the genetic code began with alanine, aspartate, glutamate, glycine, and valine, with alanine being in the highest proportions. In addition to being consistent in terms of conservative changes in codon nucleotides, the model also is consistent with respect to aaRS classes, aaRS attachment to the tRNA, amino acid stereochemistry, and to a large extent with amino acid physicochemistry, and biochemical pathways.
Collapse
Affiliation(s)
- Scott O Rogers
- Department of Biological Sciences, Bowling Green State University, Bowling Green, OH, United States.
| |
Collapse
|
48
|
Rogers SO. Integrated evolution of ribosomal RNAs, introns, and intron nurseries. Genetica 2018; 147:103-119. [PMID: 30578455 DOI: 10.1007/s10709-018-0050-y] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2018] [Accepted: 12/13/2018] [Indexed: 12/21/2022]
Abstract
The initial components of ribosomes first appeared more than 3.8 billion years ago during a time when many types of RNAs were evolving. While modern ribosomes are complex molecular machines consisting of rRNAs and proteins, they were assembled during early evolution by the association and joining of small functional RNA units. Introns may have provided the means to ligate many of these pieces together. All four classes of introns (group I, group II, spliceosomal, and archaeal) are present in many rRNA gene loci over a broad phylogenetic range. A survey of rRNA intron sequences across the three major life domains suggests that some of the classes of introns may have diverged from one another within rRNA gene loci. Analyses of rRNA sequences revealed self-splicing group I and group II introns are present in ancestral regions of the SSU (small subunit) and LSU (large subunit), whereas spliceosomal and archaeal introns appeared in sections of the rRNA that evolved later. Most classes of introns increased in number for approximately 1 billion years. However, their frequencies are low in the most recently evolved regions added to the SSU and LSU rRNAs. Furthermore, many of the introns appear to have been in the same locations for billions of years, suggesting an ancient origin for these sequences. In this Perspectives paper, I reviewed and analyzed rRNA intron sequences, locations, structural characteristics, and splicing mechanisms; and suggest that rRNA gene loci may have served as evolutionary nurseries for intron formation and diversification.
Collapse
Affiliation(s)
- Scott O Rogers
- Department of Biological Sciences, Bowling Green State University, Bowling Green, OH, 43403, USA.
| |
Collapse
|
49
|
Colson P, Levasseur A, La Scola B, Sharma V, Nasir A, Pontarotti P, Caetano-Anollés G, Raoult D. Ancestrality and Mosaicism of Giant Viruses Supporting the Definition of the Fourth TRUC of Microbes. Front Microbiol 2018; 9:2668. [PMID: 30538677 PMCID: PMC6277510 DOI: 10.3389/fmicb.2018.02668] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Accepted: 10/18/2018] [Indexed: 12/20/2022] Open
Abstract
Giant viruses of amoebae were discovered in 2003. Since then, their diversity has greatly expanded. They were suggested to form a fourth branch of life, collectively named ‘TRUC’ (for “Things Resisting Uncompleted Classifications”) alongside Bacteria, Archaea, and Eukarya. Their origin and ancestrality remain controversial. Here, we specify the evolution and definition of giant viruses. Phylogenetic and phenetic analyses of informational gene repertoires of giant viruses and selected bacteria, archaea and eukaryota were performed, including structural phylogenomics based on protein structural domains grouped into 289 universal fold superfamilies (FSFs). Hierarchical clustering analysis was performed based on a binary presence/absence matrix constructed using 727 informational COGs from cellular organisms. The presence/absence of ‘universal’ FSF domains was used to generate an unrooted maximum parsimony phylogenomic tree. Comparison of the gene content of a giant virus with those of a bacterium, an archaeon, and a eukaryote with small genomes was also performed. Overall, both cladistic analyses based on gene sequences of very central and ancient proteins and on highly conserved protein fold structures as well as phenetic analyses were congruent regarding the delineation of a fourth branch of microbes comprised by giant viruses. Giant viruses appeared as a basal group in the tree of all proteomes. A pangenome and core genome determined for Rickettsia bellii (bacteria), Methanomassiliicoccus luminyensis (archaeon), Encephalitozoon intestinalis (eukaryote), and Tupanvirus (giant virus) showed a substantial proportion of Tupanvirus genes that overlap with those of the cellular microbes. In addition, a substantial genome mosaicism was observed, with 51, 11, 8, and 0.2% of Tupanvirus genes best matching with viruses, eukaryota, bacteria, and archaea, respectively. Finally, we found that genes themselves may be subject to lateral sequence transfers. In summary, our data highlight the quantum leap between classical and giant viruses. Phylogenetic and phyletic analyses and the study of protein fold superfamilies confirm previous evidence of the existence of a fourth TRUC of life that includes giant viruses, and highlight its ancestrality and mosaicism. They also point out that best evolutionary representations for giant viruses and cellular microorganisms are rhizomes, and that sequence transfers rather than gene transfers have to be considered.
Collapse
Affiliation(s)
- Philippe Colson
- Aix-Marseille Université, Institut de Recherche pour le Développement (IRD), Assistance Publique - Hôpitaux de Marseille (AP-HM); Microbes, Evolution, Phylogeny and Infection (MEΦI); Institut Hospitalo-Universitaire (IHU) - Méditerranée Infection, Marseille, France
| | - Anthony Levasseur
- Aix-Marseille Université, Institut de Recherche pour le Développement (IRD), Assistance Publique - Hôpitaux de Marseille (AP-HM); Microbes, Evolution, Phylogeny and Infection (MEΦI); Institut Hospitalo-Universitaire (IHU) - Méditerranée Infection, Marseille, France
| | - Bernard La Scola
- Aix-Marseille Université, Institut de Recherche pour le Développement (IRD), Assistance Publique - Hôpitaux de Marseille (AP-HM); Microbes, Evolution, Phylogeny and Infection (MEΦI); Institut Hospitalo-Universitaire (IHU) - Méditerranée Infection, Marseille, France
| | - Vikas Sharma
- Aix-Marseille Université, Institut de Recherche pour le Développement (IRD), Assistance Publique - Hôpitaux de Marseille (AP-HM); Microbes, Evolution, Phylogeny and Infection (MEΦI); Institut Hospitalo-Universitaire (IHU) - Méditerranée Infection, Marseille, France.,Centre National de la Recherche Scientifique, Marseille, France
| | - Arshan Nasir
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois Urbana-Champaign, Urbana, IL, United States.,Department of Biosciences, COMSATS University Islamabad, Islamabad, Pakistan
| | - Pierre Pontarotti
- Aix-Marseille Université, Institut de Recherche pour le Développement (IRD), Assistance Publique - Hôpitaux de Marseille (AP-HM); Microbes, Evolution, Phylogeny and Infection (MEΦI); Institut Hospitalo-Universitaire (IHU) - Méditerranée Infection, Marseille, France.,Centre National de la Recherche Scientifique, Marseille, France
| | - Gustavo Caetano-Anollés
- Evolutionary Bioinformatics Laboratory, Department of Crop Sciences, University of Illinois Urbana-Champaign, Urbana, IL, United States
| | - Didier Raoult
- Aix-Marseille Université, Institut de Recherche pour le Développement (IRD), Assistance Publique - Hôpitaux de Marseille (AP-HM); Microbes, Evolution, Phylogeny and Infection (MEΦI); Institut Hospitalo-Universitaire (IHU) - Méditerranée Infection, Marseille, France
| |
Collapse
|
50
|
Youkharibache P, Veretnik S, Li Q, Stanek KA, Mura C, Bourne PE. The Small β-Barrel Domain: A Survey-Based Structural Analysis. Structure 2018; 27:6-26. [PMID: 30393050 DOI: 10.1016/j.str.2018.09.012] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Revised: 06/12/2018] [Accepted: 09/19/2018] [Indexed: 11/27/2022]
Abstract
The small β-barrel (SBB) is an ancient protein structural domain characterized by extremes: it features a broad range of structural varieties, a deeply intricate evolutionary history, and it is associated with a bewildering array of cellular pathways. Here, we present a thorough, survey-based analysis of the structural properties of SBBs. We first consider the defining properties of the SBB, including various systems of nomenclature used to describe it, and we introduce the unifying concept of an "urfold." To begin elucidating how vast functional diversity can be achieved by a relatively simple domain, we explore the anatomy of the SBB and its representative structural variants. Many SBB proteins assemble into cyclic oligomers as the biologically functional units; these oligomers often bind RNA, and typically exhibit great quaternary structural plasticity (homomeric and heteromeric rings, variable subunit stoichiometries, etc.). We conclude with three themes that emerge from the rich structure ↔ function versatility of the SBB.
Collapse
Affiliation(s)
- Philippe Youkharibache
- National Center for Biotechnology Information, The National Library of Medicine, The National Institutes of Health, Bethesda, MD 20894, USA
| | - Stella Veretnik
- National Center for Biotechnology Information, The National Library of Medicine, The National Institutes of Health, Bethesda, MD 20894, USA.
| | - Qingliang Li
- National Center for Biotechnology Information, The National Library of Medicine, The National Institutes of Health, Bethesda, MD 20894, USA
| | - Kimberly A Stanek
- Department of Chemistry, University of Virginia, Charlottesville, VA 22904, USA
| | - Cameron Mura
- Department of Chemistry, University of Virginia, Charlottesville, VA 22904, USA.
| | - Philip E Bourne
- National Center for Biotechnology Information, The National Library of Medicine, The National Institutes of Health, Bethesda, MD 20894, USA.
| |
Collapse
|