1
|
Gondhalekar R, Kempes CP, McGlynn SE. Scaling of Protein Function across the Tree of Life. Genome Biol Evol 2023; 15:evad214. [PMID: 38007693 PMCID: PMC10715193 DOI: 10.1093/gbe/evad214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Revised: 11/07/2023] [Accepted: 11/12/2023] [Indexed: 11/28/2023] Open
Abstract
Scaling laws are a powerful way to compare genomes because they put all organisms onto a single curve and reveal nontrivial generalities as genomes change in size. The abundance of functional categories across genomes has previously been found to show power law scaling with respect to the total number of functional categories, suggesting that universal constraints shape genomic category abundance. Here, we look across the tree of life to understand how genome evolution may be related to functional scaling. We revisit previous observations of functional genome scaling with an expanded taxonomy by analyzing 3,726 bacterial, 220 archaeal, and 79 unicellular eukaryotic genomes. We find that for some functional classes, scaling is best described by multiple exponents, revealing previously unobserved shifts in scaling as genome-encoded protein annotations increase or decrease. Furthermore, we find that scaling varies between phyletic groups at both the domain and phyla levels and is less universal than previously thought. This variability in functional scaling is not related to taxonomic phylogeny resolved at the phyla level, suggesting that differences in cell plan or physiology outweigh broad patterns of taxonomic evolution. Since genomes are maintained and replicated by the functional proteins encoded by them, these results point to functional degeneracy between taxonomic groups and unique evolutionary trajectories toward these. We also find that individual phyla frequently span scaling exponents of functional classes, revealing that individual clades can move across scaling exponents. Together, our results reveal unique shifts in functions across the tree of life and highlight that as genomes grow or shrink, proteins of various functions may be added or lost.
Collapse
Affiliation(s)
- Riddhi Gondhalekar
- Earth-Life Science Institute, Tokyo Institute of Technology, Tokyo, Japan
- School of Life Sciences and Technology, Tokyo Institute of Technology, Tokyo, Japan
| | | | - Shawn Erin McGlynn
- Earth-Life Science Institute, Tokyo Institute of Technology, Tokyo, Japan
- School of Life Sciences and Technology, Tokyo Institute of Technology, Tokyo, Japan
- Blue Marble Space Institute of Science, Seattle, Washington, USA
- Center for Sustainable Resource Science, RIKEN, Saitama, Japan
| |
Collapse
|
2
|
Tirumalai MR, Sivaraman RV, Kutty LA, Song EL, Fox GE. Ribosomal Protein Cluster Organization in Asgard Archaea. ARCHAEA (VANCOUVER, B.C.) 2023; 2023:5512414. [PMID: 38314098 PMCID: PMC10833476 DOI: 10.1155/2023/5512414] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Revised: 08/31/2023] [Accepted: 09/08/2023] [Indexed: 02/06/2024]
Abstract
It has been proposed that the superphylum of Asgard Archaea may represent a historical link between the Archaea and Eukarya. Following the discovery of the Archaea, it was soon appreciated that archaeal ribosomes were more similar to those of Eukarya rather than Bacteria. Coupled with other eukaryotic-like features, it has been suggested that the Asgard Archaea may be directly linked to eukaryotes. However, the genomes of Bacteria and non-Asgard Archaea generally organize ribosome-related genes into clusters that likely function as operons. In contrast, eukaryotes typically do not employ an operon strategy. To gain further insight into conservation of the r-protein genes, the genome order of conserved ribosomal protein (r-protein) coding genes was identified in 17 Asgard genomes (thirteen complete genomes and four genomes with less than 20 contigs) and compared with those found previously in non-Asgard archaeal and bacterial genomes. A universal core of two clusters of 14 and 4 cooccurring r-proteins, respectively, was identified in both the Asgard and non-Asgard Archaea. The equivalent genes in the E. coli version of the cluster are found in the S10 and spc operons. The large cluster of 14 r-protein genes (uS19-uL22-uS3-uL29-uS17 from the S10 operon and uL14-uL24-uL5-uS14-uS8-uL6-uL18-uS5-uL30-uL15 from the spc operon) occurs as a complete set in the genomes of thirteen Asgard genomes (five Lokiarchaeotes, three Heimdallarchaeotes, one Odinarchaeote, and four Thorarchaeotes). Four less conserved clusters with partial bacterial equivalents were found in the Asgard. These were the L30e (str operon in Bacteria) cluster, the L18e (alpha operon in Bacteria) cluster, the S24e-S27ae-rpoE1 cluster, and the L31e, L12..L1 cluster. Finally, a new cluster referred to as L7ae was identified. In many cases, r-protein gene clusters/operons are less conserved in their organization in the Asgard group than in other Archaea. If this is generally true for nonribosomal gene clusters, the results may have implications for the history of genome organization. In particular, there may have been an early transition to or from the operon approach to genome organization. Other nonribosomal cellular features may support different relationships. For this reason, it may be important to consider ribosome features separately.
Collapse
Affiliation(s)
- Madhan R. Tirumalai
- Department of Biology and Biochemistry, University of Houston, Houston, TX 77204-5001, USA
| | | | | | | | - George E. Fox
- Department of Biology and Biochemistry, University of Houston, Houston, TX 77204-5001, USA
| |
Collapse
|
3
|
Villarreal L, Witzany G. Self-empowerment of life through RNA networks, cells and viruses. F1000Res 2023; 12:138. [PMID: 36785664 PMCID: PMC9918806 DOI: 10.12688/f1000research.130300.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/20/2023] [Indexed: 01/05/2024] Open
Abstract
Our understanding of the key players in evolution and of the development of all organisms in all domains of life has been aided by current knowledge about RNA stem-loop groups, their proposed interaction motifs in an early RNA world and their regulative roles in all steps and substeps of nearly all cellular processes, such as replication, transcription, translation, repair, immunity and epigenetic marking. Cooperative evolution was enabled by promiscuous interactions between single-stranded regions in the loops of naturally forming stem-loop structures in RNAs. It was also shown that cooperative RNA stem-loops outcompete selfish ones and provide foundational self-constructive groups (ribosome, editosome, spliceosome, etc.). Self-empowerment from abiotic matter to biological behavior does not just occur at the beginning of biological evolution; it is also essential for all levels of socially interacting RNAs, cells and viruses.
Collapse
Affiliation(s)
- Luis Villarreal
- Center for Virus Research, University of California, Irvine, California, USA
| | - Guenther Witzany
- Telos - Philosophische Praxis, Buermoos, Salzburg, 5111, Austria
| |
Collapse
|
4
|
Villarreal L, Witzany G. Self-empowerment of life through RNA networks, cells and viruses. F1000Res 2023; 12:138. [PMID: 36785664 PMCID: PMC9918806 DOI: 10.12688/f1000research.130300.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/23/2023] [Indexed: 03/08/2023] Open
Abstract
Our understanding of the key players in evolution and of the development of all organisms in all domains of life has been aided by current knowledge about RNA stem-loop groups, their proposed interaction motifs in an early RNA world and their regulative roles in all steps and substeps of nearly all cellular processes, such as replication, transcription, translation, repair, immunity and epigenetic marking. Cooperative evolution was enabled by promiscuous interactions between single-stranded regions in the loops of naturally forming stem-loop structures in RNAs. It was also shown that cooperative RNA stem-loops outcompete selfish ones and provide foundational self-constructive groups (ribosome, editosome, spliceosome, etc.). Self-empowerment from abiotic matter to biological behavior does not just occur at the beginning of biological evolution; it is also essential for all levels of socially interacting RNAs, cells and viruses.
Collapse
Affiliation(s)
- Luis Villarreal
- Center for Virus Research, University of California, Irvine, California, USA
| | - Guenther Witzany
- Telos - Philosophische Praxis, Buermoos, Salzburg, 5111, Austria
| |
Collapse
|
5
|
A Short Tale of the Origin of Proteins and Ribosome Evolution. Microorganisms 2022; 10:microorganisms10112115. [DOI: 10.3390/microorganisms10112115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2022] [Revised: 09/30/2022] [Accepted: 10/19/2022] [Indexed: 11/16/2022] Open
Abstract
Proteins are the workhorses of the cell and have been key players throughout the evolution of all organisms, from the origin of life to the present era. How might life have originated from the prebiotic chemistry of early Earth? This is one of the most intriguing unsolved questions in biology. Currently, however, it is generally accepted that amino acids, the building blocks of proteins, were abiotically available on primitive Earth, which would have made the formation of early peptides in a similar fashion possible. Peptides are likely to have coevolved with ancestral forms of RNA. The ribosome is the most evident product of this coevolution process, a sophisticated nanomachine that performs the synthesis of proteins codified in genomes. In this general review, we explore the evolution of proteins from their peptide origins to their folding and regulation based on the example of superoxide dismutase (SOD1), a key enzyme in oxygen metabolism on modern Earth.
Collapse
|
6
|
Deng ZL, Münch PC, Mreches R, McHardy AC. Rapid and accurate identification of ribosomal RNA sequences via deep learning. Nucleic Acids Res 2022; 50:e60. [PMID: 35188571 PMCID: PMC9177968 DOI: 10.1093/nar/gkac112] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Revised: 01/28/2022] [Accepted: 02/07/2022] [Indexed: 11/14/2022] Open
Abstract
Advances in transcriptomic and translatomic techniques enable in-depth studies of RNA activity profiles and RNA-based regulatory mechanisms. Ribosomal RNA (rRNA) sequences are highly abundant among cellular RNA, but if the target sequences do not include polyadenylation, these cannot be easily removed in library preparation, requiring their post-hoc removal with computational techniques to accelerate and improve downstream analyses. Here, we describe RiboDetector, a novel software based on a Bi-directional Long Short-Term Memory (BiLSTM) neural network, which rapidly and accurately identifies rRNA reads from transcriptomic, metagenomic, metatranscriptomic, noncoding RNA, and ribosome profiling sequence data. Compared with state-of-the-art approaches, RiboDetector produced at least six times fewer misclassifications on the benchmark datasets. Importantly, the few false positives of RiboDetector were not enriched in certain Gene Ontology (GO) terms, suggesting a low bias for downstream functional profiling. RiboDetector also demonstrated a remarkable generalizability for detecting novel rRNA sequences that are divergent from the training data with sequence identities of <90%. On a personal computer, RiboDetector processed 40M reads in less than 6 min, which was ∼50 times faster in GPU mode and ∼15 times in CPU mode than other methods. RiboDetector is available under a GPL v3.0 license at https://github.com/hzi-bifo/RiboDetector.
Collapse
Affiliation(s)
- Zhi-Luo Deng
- Department for Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| | - Philipp C Münch
- Department for Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| | - René Mreches
- Department for Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| | - Alice C McHardy
- Department for Computational Biology of Infection Research, Helmholtz Center for Infection Research, Braunschweig, Germany
- Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany
| |
Collapse
|
7
|
Farias STD, Prosdocimi F. RNP-world: The ultimate essence of life is a ribonucleoprotein process. Genet Mol Biol 2022; 45:e20220127. [PMID: 36190700 PMCID: PMC9528728 DOI: 10.1590/1678-4685-gmb-2022-0127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Accepted: 06/03/2022] [Indexed: 11/22/2022] Open
Abstract
The fundamental essence of life is based on process of interaction between nucleic acids and proteins. In a prebiotic world, amino acids, peptides, ions, and other metabolites acted in protobiotic routes at the same time on which RNAs performed catalysis and self-replication. Nevertheless, it was only when nucleic acids and peptides started to interact together in an organized process that life emerged. First, the ignition was sparked with the formation of a Peptidyl Transferase Center (PTC), possibly by concatenation of proto-tRNAs. This molecule that would become the catalytic site of ribosomes started a process of self-organization that gave origin to a protoorganism named FUCA, a ribonucleic ribosomal-like apparatus capable to polymerize amino acids. In that sense, we review hypotheses about the origin and early evolution of the genetic code. Next, populations of open biological systems named progenotes were capable of accumulating and exchanging genetic material, producing the first genomes. Progenotes then evolved in two paths: some presented their own ribosomes and others used available ribosomes in the medium to translate their encoded information. At some point, two different types of organisms emerged from populations of progenotes: the ribosome-encoding organisms (cells) and the capsid-encoding organisms (viruses).
Collapse
Affiliation(s)
- Sávio Torres de Farias
- Universidade Federal da Paraíba, Brazil; Network of Researchers on the Chemical Evolution of Life, UK
| | | |
Collapse
|
8
|
Marijuán PC, Navarro J. The biological information flow: From cell theory to a new evolutionary synthesis. Biosystems 2022; 213:104631. [DOI: 10.1016/j.biosystems.2022.104631] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Revised: 01/19/2022] [Accepted: 01/23/2022] [Indexed: 12/19/2022]
|
9
|
Smethurst DGJ, Shcherbik N. Interchangeable utilization of metals: New perspectives on the impacts of metal ions employed in ancient and extant biomolecules. J Biol Chem 2021; 297:101374. [PMID: 34732319 PMCID: PMC8633580 DOI: 10.1016/j.jbc.2021.101374] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Revised: 10/25/2021] [Accepted: 10/28/2021] [Indexed: 02/08/2023] Open
Abstract
Metal ions provide considerable functionality across biological systems, and their utilization within biomolecules has adapted through changes in the chemical environment to maintain the activity they facilitate. While ancient earth's atmosphere was rich in iron and manganese and low in oxygen, periods of atmospheric oxygenation significantly altered the availability of certain metal ions, resulting in ion replacement within biomolecules. This adaptation mechanism has given rise to the phenomenon of metal cofactor interchangeability, whereby contemporary proteins and nucleic acids interact with multiple metal ions interchangeably, with different coordinated metals influencing biological activity, stability, and toxic potential. The ability of extant organisms to adapt to fluctuating metal availability remains relevant in a number of crucial biomolecules, including the superoxide dismutases of the antioxidant defense systems and ribonucleotide reductases. These well-studied and ancient enzymes illustrate the potential for metal interchangeability and adaptive utilization. More recently, the ribosome has also been demonstrated to exhibit interchangeable interactions with metal ions with impacts on function, stability, and stress adaptation. Using these and other examples, here we review the biological significance of interchangeable metal ions from a new angle that combines both biochemical and evolutionary viewpoints. The geochemical pressures and chemical properties that underlie biological metal utilization are discussed in the context of their impact on modern disease states and treatments.
Collapse
Affiliation(s)
- Daniel G J Smethurst
- Department for Cell Biology and Neuroscience, School of Osteopathic Medicine, Rowan University, Stratford, New Jersey, USA.
| | - Natalia Shcherbik
- Department for Cell Biology and Neuroscience, School of Osteopathic Medicine, Rowan University, Stratford, New Jersey, USA.
| |
Collapse
|
10
|
Timsit Y, Grégoire SP. Towards the Idea of Molecular Brains. Int J Mol Sci 2021; 22:ijms222111868. [PMID: 34769300 PMCID: PMC8584932 DOI: 10.3390/ijms222111868] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 10/24/2021] [Accepted: 10/28/2021] [Indexed: 02/06/2023] Open
Abstract
How can single cells without nervous systems perform complex behaviours such as habituation, associative learning and decision making, which are considered the hallmark of animals with a brain? Are there molecular systems that underlie cognitive properties equivalent to those of the brain? This review follows the development of the idea of molecular brains from Darwin’s “root brain hypothesis”, through bacterial chemotaxis, to the recent discovery of neuron-like r-protein networks in the ribosome. By combining a structural biology view with a Bayesian brain approach, this review explores the evolutionary labyrinth of information processing systems across scales. Ribosomal protein networks open a window into what were probably the earliest signalling systems to emerge before the radiation of the three kingdoms. While ribosomal networks are characterised by long-lasting interactions between their protein nodes, cell signalling networks are essentially based on transient interactions. As a corollary, while signals propagated in persistent networks may be ephemeral, networks whose interactions are transient constrain signals diffusing into the cytoplasm to be durable in time, such as post-translational modifications of proteins or second messenger synthesis. The duration and nature of the signals, in turn, implies different mechanisms for the integration of multiple signals and decision making. Evolution then reinvented networks with persistent interactions with the development of nervous systems in metazoans. Ribosomal protein networks and simple nervous systems display architectural and functional analogies whose comparison could suggest scale invariance in information processing. At the molecular level, the significant complexification of eukaryotic ribosomal protein networks is associated with a burst in the acquisition of new conserved aromatic amino acids. Knowing that aromatic residues play a critical role in allosteric receptors and channels, this observation suggests a general role of π systems and their interactions with charged amino acids in multiple signal integration and information processing. We think that these findings may provide the molecular basis for designing future computers with organic processors.
Collapse
Affiliation(s)
- Youri Timsit
- Aix Marseille Université, Université de Toulon, CNRS, IRD, MIO UM110, 13288 Marseille, France
- Research Federation for the Study of Global Ocean Systems Ecology and Evolution, FR2022/Tara GOSEE, 3 rue Michel-Ange, 75016 Paris, France
- Correspondence:
| | - Sergeant-Perthuis Grégoire
- Institut de Mathématiques de Jussieu—Paris Rive Gauche (IMJ-PRG), UMR 7586, CNRS-Université Paris Diderot, 75013 Paris, France;
| |
Collapse
|
11
|
Villarreal LP, Witzany G. Social Networking of Quasi-Species Consortia drive Virolution via Persistence. AIMS Microbiol 2021; 7:138-162. [PMID: 34250372 PMCID: PMC8255905 DOI: 10.3934/microbiol.2021010] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Accepted: 04/25/2021] [Indexed: 12/31/2022] Open
Abstract
The emergence of cooperative quasi-species consortia (QS-C) thinking from the more accepted quasispecies equations of Manfred Eigen, provides a conceptual foundation from which concerted action of RNA agents can now be understood. As group membership becomes a basic criteria for the emergence of living systems, we also start to understand why the history and context of social RNA networks become crucial for survival and function. History and context of social RNA networks also lead to the emergence of a natural genetic code. Indeed, this QS-C thinking can also provide us with a transition point between the chemical world of RNA replicators and the living world of RNA agents that actively differentiate self from non-self and generate group identity with membership roles. Importantly the social force of a consortia to solve complex, multilevel problems also depend on using opposing and minority functions. The consortial action of social networks of RNA stem-loops subsequently lead to the evolution of cellular organisms representing a tree of life.
Collapse
|
12
|
Demongeot J, Moreira A, Seligmann H. Negative CG dinucleotide bias: An explanation based on feedback loops between Arginine codon assignments and theoretical minimal RNA rings. Bioessays 2020; 43:e2000071. [PMID: 33319381 DOI: 10.1002/bies.202000071] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Revised: 11/23/2020] [Accepted: 11/26/2020] [Indexed: 01/05/2023]
Abstract
Theoretical minimal RNA rings are candidate primordial genes evolved for non-redundant coding of the genetic code's 22 coding signals (one codon per biogenic amino acid, a start and a stop codon) over the shortest possible length: 29520 22-nucleotide-long RNA rings solve this min-max constraint. Numerous RNA ring properties are reminiscent of natural genes. Here we present analyses showing that all RNA rings lack dinucleotide CG (a mutable, chemically instable dinucleotide coding for Arginine), bearing a resemblance to known CG-depleted genomes. CG in "incomplete" RNA rings (not coding for all coding signals, with only 3-12 nucleotides) gradually decreases towards CG absence in complete, 22-nucleotide-long RNA rings. Presumably, feedback loops during RNA ring growth during evolution (when amino acid assignment fixed the genetic code) assigned Arg to codons lacking CG (AGR) to avoid CG. Hence, as a chemical property of base pairs, CG mutability restructured the genetic code, thereby establishing itself as genetically encoded biological information.
Collapse
Affiliation(s)
- Jacques Demongeot
- Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecom4Health, Faculty of Medicine, Université Grenoble Alpes, La Tronche, France
| | - Andrés Moreira
- Departamento de Informática, Universidad Técnica Federico Santa María, Santiago, Chile
| | - Hervé Seligmann
- Laboratory AGEIS EA 7407, Team Tools for e-Gnosis Medical & Labcom CNRS/UGA/OrangeLabs Telecom4Health, Faculty of Medicine, Université Grenoble Alpes, La Tronche, France.,The National Natural History Collections, The Hebrew University of Jerusalem, Jerusalem, Israel.,Institute of Microstructure Technology, Karlsruhe Institute of Technology (KIT), Eggenstein-Leopoldshafen, Germany
| |
Collapse
|
13
|
Rivas M, Fox GE. Further Characterization of the Pseudo-Symmetrical Ribosomal Region. Life (Basel) 2020; 10:life10090201. [PMID: 32937913 PMCID: PMC7555685 DOI: 10.3390/life10090201] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2020] [Revised: 09/09/2020] [Accepted: 09/11/2020] [Indexed: 12/17/2022] Open
Abstract
The peptidyl transferase center of the modern ribosome has been found to encompass an area of twofold pseudosymmetry (SymR). This observation strongly suggests that the very core of the ribosome arose from a dimerization event between two modest-sized RNAs. It was previously shown that at least four non-standard interactions exist between the two halves of SymR. Herein, we verify that the structure of the SymR is highly conserved with respect to both ribosome transition state and phylogenetic diversity. These comparisons also reveal two additional sites of interaction between the two halves of SymR and refine our understanding of the previously known interactions. In addition, the possible role that magnesium may have in the coordination, stabilization, association, and evolutionary history of the two halves (A-region and P-region) was examined. Together, the results identify a likely site where structural elements and Mg2+ ions may have facilitated the ligation of two aboriginal RNAs into a single unit.
Collapse
|
14
|
Bowman JC, Petrov AS, Frenkel-Pinter M, Penev PI, Williams LD. Root of the Tree: The Significance, Evolution, and Origins of the Ribosome. Chem Rev 2020; 120:4848-4878. [PMID: 32374986 DOI: 10.1021/acs.chemrev.9b00742] [Citation(s) in RCA: 86] [Impact Index Per Article: 21.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
The ribosome is an ancient molecular fossil that provides a telescope to the origins of life. Made from RNA and protein, the ribosome translates mRNA to coded protein in all living systems. Universality, economy, centrality and antiquity are ingrained in translation. The translation machinery dominates the set of genes that are shared as orthologues across the tree of life. The lineage of the translation system defines the universal tree of life. The function of a ribosome is to build ribosomes; to accomplish this task, ribosomes make ribosomal proteins, polymerases, enzymes, and signaling proteins. Every coded protein ever produced by life on Earth has passed through the exit tunnel, which is the birth canal of biology. During the root phase of the tree of life, before the last common ancestor of life (LUCA), exit tunnel evolution is dominant and unremitting. Protein folding coevolved with evolution of the exit tunnel. The ribosome shows that protein folding initiated with intrinsic disorder, supported through a short, primitive exit tunnel. Folding progressed to thermodynamically stable β-structures and then to kinetically trapped α-structures. The latter were enabled by a long, mature exit tunnel that partially offset the general thermodynamic tendency of all polypeptides to form β-sheets. RNA chaperoned the evolution of protein folding from the very beginning. The universal common core of the ribosome, with a mass of nearly 2 million Daltons, was finalized by LUCA. The ribosome entered stasis after LUCA and remained in that state for billions of years. Bacterial ribosomes never left stasis. Archaeal ribosomes have remained near stasis, except for the superphylum Asgard, which has accreted rRNA post LUCA. Eukaryotic ribosomes in some lineages appear to be logarithmically accreting rRNA over the last billion years. Ribosomal expansion in Asgard and Eukarya has been incremental and iterative, without substantial remodeling of pre-existing basal structures. The ribosome preserves information on its history.
Collapse
Affiliation(s)
- Jessica C Bowman
- Center for the Origins of Life, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Anton S Petrov
- Center for the Origins of Life, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Moran Frenkel-Pinter
- Center for the Origins of Life, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Petar I Penev
- Center for the Origins of Life, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Loren Dean Williams
- Center for the Origins of Life, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| |
Collapse
|
15
|
Michel CJ, Thompson JD. Identification of a circular code periodicity in the bacterial ribosome: origin of codon periodicity in genes? RNA Biol 2020; 17:571-583. [PMID: 31960748 DOI: 10.1080/15476286.2020.1719311] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023] Open
Abstract
Three-base periodicity (TBP), where nucleotides and higher order n-tuples are preferentially spaced by 3, 6, 9, etc. bases, is a well-known intrinsic property of protein-coding DNA sequences. However, its origins are still not fully understood. One hypothesis is that the periodicity reflects a primordial coding system that was used before the emergence of the modern standard genetic code (SGC). Recent evidence suggests that the X circular code, a set of 20 trinucleotides allowing the reading frames in genes to be retrieved locally, represents a possible ancestor of the SGC. Motifs from the X circular code have been found in the reading frame of protein-coding regions in extant organisms from bacteria to eukaryotes, in many transfer RNA (tRNA) genes and in important functional regions of the ribosomal RNA (rRNA), notably in the peptidyl transferase centre and the decoding centre. Here, we have used a powerful correlation function to search for periodicity patterns involving the 20 trinucleotides of the X circular code in a large set of bacterial protein-coding genes, as well as in the translation machinery, including rRNA and tRNA sequences. As might be expected, we found a strong circular code periodicity 0 modulo 3 in the protein-coding genes. More surprisingly, we also identified a similar circular code periodicity in a large region of the 16S rRNA. This region includes the 3' major domain corresponding to the primordial proto-ribosome decoding centre and containing numerous sites that interact with the tRNA and messenger RNA (mRNA) during translation. Furthermore, 3D structural analysis shows that the periodicity region surrounds the mRNA channel that lies between the head and the body of the SSU. Our results support the hypothesis that the X circular code may constitute an ancestral translation code involved in reading frame retrieval and maintenance, traces of which persist in modern mRNA, tRNA and rRNA despite their long evolution and adaptation to the SGC.
Collapse
Affiliation(s)
- Christian J Michel
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg, France
| | - Julie D Thompson
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg, France
| |
Collapse
|
16
|
RNA Rings Strengthen Hairpin Accretion Hypotheses for tRNA Evolution: A Reply to Commentaries by Z.F. Burton and M. Di Giulio. J Mol Evol 2020; 88:243-252. [PMID: 32025759 DOI: 10.1007/s00239-020-09929-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2019] [Accepted: 01/07/2020] [Indexed: 01/08/2023]
Abstract
Theoretical minimal RNA ring design ensures coding over the shortest length once for each coding signal (start and stop codons, and each amino acid) and their hairpin configuration. These constraints define 25 RNA rings which surprisingly resemble ancestral tRNA loops, suggesting commonalities between RNA ring design and proto-tRNAs. RNA rings share several other properties with tRNAs, suggesting that primordial RNAs were multifunctional peptide coding sequences and structural RNAs. Two hypotheses, respectively, by M. Di Giulio and Z.F. Burton, derived from cloverleaf structural symmetries suggest that two and three, respectively, stem-loop hairpins agglutinated into tRNAs. Their authors commented that their respective structure-based hypotheses reflect better tRNA structure than RNA rings. Unlike these hypotheses, RNA ring design uses no tRNA-derived information, rendering model predictive power comparisons senseless. Some analyses of RNA ring primary and secondary structures stress RNA ring splicing in their predicted anticodon's midst, indicating ancestrality of split tRNAs, as the two-piece model predicts. Advancement of knowledge, rather than of specific hypotheses, gains foremost by examining independent hypotheses for commonalities, and only secondarily for discordances. RNA rings mimick ancestral biomolecules including tRNAs, and their evolution, and constitute an interesting synthetic system for early prebiotic evolution tests/simulations.
Collapse
|
17
|
Dila G, Ripp R, Mayer C, Poch O, Michel CJ, Thompson JD. Circular code motifs in the ribosome: a missing link in the evolution of translation? RNA (NEW YORK, N.Y.) 2019; 25:1714-1730. [PMID: 31506380 PMCID: PMC6859856 DOI: 10.1261/rna.072074.119] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2019] [Accepted: 09/06/2019] [Indexed: 05/29/2023]
Abstract
The origin of the genetic code remains enigmatic five decades after it was elucidated, although there is growing evidence that the code coevolved progressively with the ribosome. A number of primordial codes were proposed as ancestors of the modern genetic code, including comma-free codes such as the RRY, RNY, or GNC codes (R = G or A, Y = C or T, N = any nucleotide), and the X circular code, an error-correcting code that also allows identification and maintenance of the reading frame. It was demonstrated previously that motifs of the X circular code are significantly enriched in the protein-coding genes of most organisms, from bacteria to eukaryotes. Here, we show that imprints of this code also exist in the ribosomal RNA (rRNA). In a large-scale study involving 133 organisms representative of the three domains of life, we identified 32 universal X motifs that are conserved in the rRNA of >90% of the organisms. Intriguingly, most of the universal X motifs are located in rRNA regions involved in important ribosome functions, notably in the peptidyl transferase center and the decoding center that form the original "proto-ribosome." Building on the existing accretion models for ribosome evolution, we propose that error-correcting circular codes represented an important step in the emergence of the modern genetic code. Thus, circular codes would have allowed the simultaneous coding of amino acids and synchronization of the reading frame in primitive translation systems, prior to the emergence of more sophisticated start codon recognition and translation initiation mechanisms.
Collapse
Affiliation(s)
- Gopal Dila
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg 67000, France
| | - Raymond Ripp
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg 67000, France
| | - Claudine Mayer
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg 67000, France
- Unité de Microbiologie Structurale, Institut Pasteur, CNRS, 75724 Paris Cedex 15, France
- Université Paris Diderot, Sorbonne Paris Cité, 75724 Paris Cedex 15, France
| | - Olivier Poch
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg 67000, France
| | - Christian J Michel
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg 67000, France
| | - Julie D Thompson
- Department of Computer Science, ICube, CNRS, University of Strasbourg, Strasbourg 67000, France
| |
Collapse
|
18
|
Demongeot J, Norris V. Emergence of a "Cyclosome" in a Primitive Network Capable of Building "Infinite" Proteins. Life (Basel) 2019; 9:E51. [PMID: 31216720 PMCID: PMC6617141 DOI: 10.3390/life9020051] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2019] [Revised: 06/08/2019] [Accepted: 06/13/2019] [Indexed: 01/02/2023] Open
Abstract
We argue for the existence of an RNA sequence, called the AL (for ALpha) sequence, which may have played a role at the origin of life; this role entailed the AL sequence helping generate the first peptide assemblies via a primitive network. These peptide assemblies included "infinite" proteins. The AL sequence was constructed on an economy principle as the smallest RNA ring having one representative of each codon's synonymy class and capable of adopting a non-functional but nevertheless evolutionarily stable hairpin form that resisted denaturation due to environmental changes in pH, hydration, temperature, etc. Long subsequences from the AL ring resemble sequences from tRNAs and 5S rRNAs of numerous species like the proteobacterium, Rhodobacter sphaeroides. Pentameric subsequences from the AL are present more frequently than expected in current genomes, in particular, in genes encoding some of the proteins associated with ribosomes like tRNA synthetases. Such relics may help explain the existence of universal sequences like exon/intron frontier regions, Shine-Dalgarno sequence (present in bacterial and archaeal mRNAs), CRISPR and mitochondrial loop sequences.
Collapse
Affiliation(s)
- Jacques Demongeot
- Faculty of Medicine, Université Grenoble Alpes, AGEIS EA 7407 Tools for e-Gnosis Medical, 38700 La Tronche, France.
| | - Vic Norris
- Laboratory of Microbiology Signals and Microenvironment, Université de Rouen, 76821 Mont-Saint-Aignan CEDEX, France.
| |
Collapse
|
19
|
Timsit Y, Bennequin D. Nervous-Like Circuits in the Ribosome Facts, Hypotheses and Perspectives. Int J Mol Sci 2019; 20:ijms20122911. [PMID: 31207893 PMCID: PMC6627100 DOI: 10.3390/ijms20122911] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Revised: 06/08/2019] [Accepted: 06/10/2019] [Indexed: 12/16/2022] Open
Abstract
In the past few decades, studies on translation have converged towards the metaphor of a “ribosome nanomachine”; they also revealed intriguing ribosome properties challenging this view. Many studies have shown that to perform an accurate protein synthesis in a fluctuating cellular environment, ribosomes sense, transfer information and even make decisions. This complex “behaviour” that goes far beyond the skills of a simple mechanical machine has suggested that the ribosomal protein networks could play a role equivalent to nervous circuits at a molecular scale to enable information transfer and processing during translation. We analyse here the significance of this analogy and establish a preliminary link between two fields: ribosome structure-function studies and the analysis of information processing systems. This cross-disciplinary analysis opens new perspectives about the mechanisms of information transfer and processing in ribosomes and may provide new conceptual frameworks for the understanding of the behaviours of unicellular organisms.
Collapse
Affiliation(s)
- Youri Timsit
- Mediterranean Institute of Oceanography UM 110, Aix-Marseille Université, CNRS, IRD, Campus de Luminy, 13288 Marseille, France.
| | - Daniel Bennequin
- Institut de Mathématiques de Jussieu - Paris Rive Gauche (IMJ-PRG) Université Paris Diderot, bâtiment Sophie-Germain, 8, place Aurélie Nemours, 75013 Paris, France.
| |
Collapse
|