1
|
Joiret M, Kerff F, Rapino F, Close P, Geris L. Reversing the relative time courses of the peptide bond reaction with oligopeptides of different lengths and charged amino acid distributions in the ribosome exit tunnel. Comput Struct Biotechnol J 2024; 23:2453-2464. [PMID: 38882677 PMCID: PMC11179572 DOI: 10.1016/j.csbj.2024.05.045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 05/10/2024] [Accepted: 05/27/2024] [Indexed: 06/18/2024] Open
Abstract
The kinetics of the protein elongation cycle by the ribosome depends on intertwined factors. One of these factors is the electrostatic interaction of the nascent protein with the ribosome exit tunnel. In this computational biology theoretical study, we focus on the rate of the peptide bond formation and its dependence on the ribosome exit tunnel electrostatic potential profile. We quantitatively predict how oligopeptides of variable lengths can affect the peptide bond formation rate. We applied the Michaelis-Menten model as previously extended to incorporate the mechano-biochemical effects of forces on the rate of reaction at the catalytic site of the ribosome. For a given pair of carboxy-terminal amino acid substrate at the P- and an aminoacyl-tRNA at the A-sites, the relative time courses of the peptide bond formation reaction can be reversed depending on the oligopeptide sequence embedded in the tunnel and their variable lengths from the P-site. The reversal is predicted to occur from a shift in positions of charged amino acids upstream in the oligopeptidyl-tRNA at the P-site. The position shift must be adjusted by clever design of the oligopeptide probes using the electrostatic potential profile along the exit tunnel axial path. These predicted quantitative results bring strong evidence of the importance and relative contribution of the electrostatic interaction of the ribosome exit tunnel with the nascent peptide chain during elongation.
Collapse
Affiliation(s)
- Marc Joiret
- Biomechanics Research Unit, GIGA In Silico Medicine, Liège University, CHU-B34(+5) 1 Avenue de l'Hôpital, 4000 Liège, Belgium
| | - Frederic Kerff
- UR InBios Centre d'Ingénierie des Protéines, Liège University, Bât B6a, Allèe du 6 Août, 19, B-4000 Liège, Belgium
| | - Francesca Rapino
- Cancer Signaling, GIGA Stem Cells, Liège University, CHU-B34(+2) 1 Avenue de l'Hôpital, B-4000 Liège, Belgium
| | - Pierre Close
- Cancer Signaling, GIGA Stem Cells, Liège University, CHU-B34(+2) 1 Avenue de l'Hôpital, B-4000 Liège, Belgium
| | - Liesbet Geris
- Biomechanics Research Unit, GIGA In Silico Medicine, Liège University, CHU-B34(+5) 1 Avenue de l'Hôpital, 4000 Liège, Belgium
- Skeletal Biology & Engineering Research Center, KU Leuven, ON I Herestraat 49 - Box 813, 3000 Leuven, Belgium
- Biomechanics Section, KU Leuven, Celestijnenlaan 300C - Box 2419, B-3001 Heverlee, Belgium
| |
Collapse
|
2
|
Zheng D, Wang J, Persyn L, Liu Y, Montoya FU, Cenik C, Agarwal V. Predicting the translation efficiency of messenger RNA in mammalian cells. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.11.607362. [PMID: 39149337 PMCID: PMC11326250 DOI: 10.1101/2024.08.11.607362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/17/2024]
Abstract
The degree to which translational control is specified by mRNA sequence is poorly understood in mammalian cells. Here, we constructed and leveraged a compendium of 3,819 ribosomal profiling datasets, distilling them into a transcriptome-wide atlas of translation efficiency (TE) measurements encompassing >140 human and mouse cell types. We subsequently developed RiboNN, a multitask deep convolutional neural network, and classic machine learning models to predict TEs in hundreds of cell types from sequence-encoded mRNA features, achieving state-of-the-art performance (r=0.79 in human and r=0.78 in mouse for mean TE across cell types). While the majority of earlier models solely considered 5' UTR sequence, RiboNN integrates contributions from the full-length mRNA sequence, learning that the 5' UTR, CDS, and 3' UTR respectively possess ~67%, 31%, and 2% per-nucleotide information density in the specification of mammalian TEs. Interpretation of RiboNN revealed that the spatial positioning of low-level di- and tri-nucleotide features (i.e., including codons) largely explain model performance, capturing mechanistic principles such as how ribosomal processivity and tRNA abundance control translational output. RiboNN is predictive of the translational behavior of base-modified therapeutic RNA, and can explain evolutionary selection pressures in human 5' UTRs. Finally, it detects a common language governing mRNA regulatory control and highlights the interconnectedness of mRNA translation, stability, and localization in mammalian organisms.
Collapse
Affiliation(s)
- Dinghai Zheng
- mRNA Center of Excellence, Sanofi, Waltham, MA 02451, USA
| | - Jun Wang
- mRNA Center of Excellence, Sanofi, Waltham, MA 02451, USA
| | - Logan Persyn
- Department of Molecular Biosciences, University of Texas at Austin, Austin, TX 78712, USA
| | - Yue Liu
- Department of Molecular Biosciences, University of Texas at Austin, Austin, TX 78712, USA
| | | | - Can Cenik
- Department of Molecular Biosciences, University of Texas at Austin, Austin, TX 78712, USA
| | - Vikram Agarwal
- mRNA Center of Excellence, Sanofi, Waltham, MA 02451, USA
| |
Collapse
|
3
|
Petrášek Z, Nidetzky B. Model of Processive Catalysis with Site Clustering and Blocking and Its Application to Cellulose Hydrolysis. J Phys Chem B 2022; 126:8472-8485. [PMID: 36251767 PMCID: PMC9623590 DOI: 10.1021/acs.jpcb.2c05956] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Interactions between particles moving on a linear track and their possible blocking by obstacles can lead to crowding, impeding the particles' transport kinetics. When the particles are enzymes processively catalyzing a reaction along a linear polymeric substrate, these crowding and blocking effects may substantially reduce the overall catalytic rate. Cellulose hydrolysis by exocellulases processively moving along cellulose chains assembled into insoluble cellulose particles is an example of such a catalytic transport process. The details of the kinetics of cellulose hydrolysis and the causes of the often observed reduction of hydrolysis rate over time are not yet fully understood. Crowding and blocking of enzyme particles are thought to be one of the important factors affecting the cellulose hydrolysis, but its exact role and mechanism are not clear. Here, we introduce a simple model based on an elementary transport process that incorporates the crowding and blocking effects in a straightforward way. This is achieved by making a distinction between binding and non-binding sites on the chain. The model reproduces a range of experimental results, mainly related to the early phase of cellulose hydrolysis. Our results indicate that the combined effects of clustering of binding sites together with the occupancy pattern of these sites by the enzyme molecules play a decisive role in the overall kinetics of cellulose hydrolysis. It is suggested that periodic desorption and rebinding of enzyme molecules could be a basis of a strategy to partially counter the clustering of and blocking by the binding sites and so enhance the rate of cellulose hydrolysis. The general nature of the model means that it could be applicable also to other transport processes that make a distinction between binding and non-binding sites, where crowding and blocking are expected to be relevant.
Collapse
Affiliation(s)
- Zdeněk Petrášek
- Institute
of Biotechnology and Biochemical Engineering, Graz University of Technology, NAWI Graz, Petersgasse 12, A-8010Graz, Austria,
| | - Bernd Nidetzky
- Institute
of Biotechnology and Biochemical Engineering, Graz University of Technology, NAWI Graz, Petersgasse 12, A-8010Graz, Austria,Austrian
Centre of Industrial Biotechnology, Petersgasse 14, A-8010Graz, Austria,. Phone: +43 (0)316 8738409, +43 (0)316 8738400
| |
Collapse
|
4
|
Lokdarshi A, von Arnim AG. Review: Emerging roles of the signaling network of the protein kinase GCN2 in the plant stress response. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2022; 320:111280. [PMID: 35643606 PMCID: PMC9197246 DOI: 10.1016/j.plantsci.2022.111280] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/28/2021] [Revised: 03/07/2022] [Accepted: 03/30/2022] [Indexed: 06/15/2023]
Abstract
The pan-eukaryotic protein kinase GCN2 (General Control Nonderepressible2) regulates the translation of mRNAs in response to external and metabolic conditions. Although GCN2 and its substrate, translation initiation factor 2 (eIF2) α, and several partner proteins are substantially conserved in plants, this kinase has assumed novel functions in plants, including in innate immunity and retrograde signaling between the chloroplast and cytosol. How exactly some of the biochemical paradigms of the GCN2 system have diverged in the green plant lineage is only partially resolved. Specifically, conflicting data underscore and cast doubt on whether GCN2 regulates amino acid biosynthesis; also whether phosphorylation of eIF2α can in fact repress global translation or activate mRNA specific translation via upstream open reading frames; and whether GCN2 is controlled in vivo by the level of uncharged tRNA. This review examines the status of research on the eIF2α kinase, GCN2, its function in the response to xenobiotics, pathogens, and abiotic stress conditions, and its rather tenuous role in the translational control of mRNAs.
Collapse
Affiliation(s)
- Ansul Lokdarshi
- Department of Biology, Valdosta State University, Valdosta, GA 31698, USA.
| | - Albrecht G von Arnim
- Department of Biochemistry & Cellular and Molecular Biology, The University of Tennessee, Knoxville, TN 37996-1939, USA; UT-ORNL Graduate School of Genome Science and Technology, The University of Tennessee, Knoxville, TN 37996-1939, USA.
| |
Collapse
|
5
|
Flanagan K, Baradaran-Heravi A, Yin Q, Dao Duc K, Spradling AC, Greenblatt EJ. FMRP-dependent production of large dosage-sensitive proteins is highly conserved. Genetics 2022; 221:6613139. [PMID: 35731217 PMCID: PMC9339308 DOI: 10.1093/genetics/iyac094] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 06/07/2022] [Indexed: 12/01/2022] Open
Abstract
Mutations in FMR1 are the most common heritable cause of autism spectrum disorder. FMR1 encodes an RNA-binding protein, FMRP, which binds to long, autism-relevant transcripts and is essential for normal neuronal and ovarian development. In contrast to the prevailing model that FMRP acts to block translation elongation, we previously found that FMRP activates the translation initiation of large proteins in Drosophila oocytes. We now provide evidence that FMRP-dependent translation is conserved and occurs in the mammalian brain. Our comparisons of the mammalian cortex and Drosophila oocyte ribosome profiling data show that translation of FMRP-bound mRNAs decreases to a similar magnitude in FMRP-deficient tissues from both species. The steady-state levels of several FMRP targets were reduced in the Fmr1 KO mouse cortex, including a ∼50% reduction of Auts2, a gene implicated in an autosomal dominant autism spectrum disorder. To distinguish between effects on elongation and initiation, we used a novel metric to detect the rate-limiting ribosome stalling. We found no evidence that FMRP target protein production is governed by translation elongation rates. FMRP translational activation of large proteins may be critical for normal human development, as more than 20 FMRP targets including Auts2 are dosage sensitive and are associated with neurodevelopmental disorders caused by haploinsufficiency.
Collapse
Affiliation(s)
- Keegan Flanagan
- Department of Biochemistry and Molecular Biology, University of British Columbia, 2350 Health Sciences Mall, Vancouver, British Columbia, V6T 1Z3 Canada.,Department of Mathematics, University of British Columbia, 1984 Mathematics Road, Vancouver, British Columbia, BC V6T 1Z2
| | - Alireza Baradaran-Heravi
- Department of Biochemistry and Molecular Biology, University of British Columbia, 2350 Health Sciences Mall, Vancouver, British Columbia, V6T 1Z3 Canada
| | - Qi Yin
- Howard Hughes Medical Institute Research Laboratories, Department of Embryology, Carnegie Institution for Science, 3520 San Martin Dr., Baltimore, Maryland 21218 USA
| | - Khanh Dao Duc
- Department of Mathematics, University of British Columbia, 1984 Mathematics Road, Vancouver, British Columbia, BC V6T 1Z2
| | - Allan C Spradling
- Howard Hughes Medical Institute Research Laboratories, Department of Embryology, Carnegie Institution for Science, 3520 San Martin Dr., Baltimore, Maryland 21218 USA
| | - Ethan J Greenblatt
- Department of Biochemistry and Molecular Biology, University of British Columbia, 2350 Health Sciences Mall, Vancouver, British Columbia, V6T 1Z3 Canada.,Howard Hughes Medical Institute Research Laboratories, Department of Embryology, Carnegie Institution for Science, 3520 San Martin Dr., Baltimore, Maryland 21218 USA
| |
Collapse
|
6
|
Juritz J, Poulton JM, Ouldridge TE. Minimal mechanism for cyclic templating of length-controlled copolymers under isothermal conditions. J Chem Phys 2022; 156:074103. [DOI: 10.1063/5.0077865] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Affiliation(s)
- Jordan Juritz
- Department of Bioengineering and Centre for Synthetic Biology, Imperial College London, London SW7 2AZ, United Kingdom
| | - Jenny M. Poulton
- Foundation for Fundamental Research on Matter (FOM), Institute for Atomic and Molecular Physics (AMOLF), 1098 XE Amsterdam, The Netherlands
| | - Thomas E. Ouldridge
- Department of Bioengineering and Centre for Synthetic Biology, Imperial College London, London SW7 2AZ, United Kingdom
| |
Collapse
|
7
|
Joiret M, Kerff F, Rapino F, Close P, Geris L. Ribosome exit tunnel electrostatics. Phys Rev E 2022; 105:014409. [PMID: 35193250 DOI: 10.1103/physreve.105.014409] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 12/08/2021] [Indexed: 06/14/2023]
Abstract
The impact of ribosome exit tunnel electrostatics on the protein elongation rate or on forces acting upon the nascent polypeptide chain are currently not fully elucidated. In the past, researchers have measured the electrostatic potential inside the ribosome polypeptide exit tunnel at a limited number of spatial points, at least in rabbit reticulocytes. Here we present a basic electrostatic model of the exit tunnel of the ribosome, providing a quantitative physical description of the tunnel interaction with the nascent proteins at all centro-axial points inside the tunnel. We show that a strong electrostatic screening is due to water molecules (not mobile ions) attracted to the ribosomal nucleic acid phosphate moieties buried in the immediate vicinity of the tunnel wall. We also show how the tunnel wall components and local ribosomal protein protrusions impact on the electrostatic potential profile and impede charged amino acid residues from progressing through the tunnel, affecting the elongation rate in a range of -40% to +85% when compared to the average elongation rate. The time spent by the ribosome to decode the genetic encrypted message is constrained accordingly. We quantitatively derive, at single-residue resolution, the axial forces acting on the nascent peptide from its particular sequence embedded in the tunnel. The model sheds light on how the experimental data point measurements of the potential are linked to the local structural chemistry of the inner wall, shape, and size of the tunnel. The model consistently connects experimental observations coming from different fields in molecular biology, x-ray crystallography, physical chemistry, biomechanics, and synthetic and multiomics biology. Our model should be a valuable tool to gain insight into protein synthesis dynamics, translational control, and the role of the ribosome's mechanochemistry in the cotranslational protein folding.
Collapse
Affiliation(s)
- Marc Joiret
- Biomechanics Research Unit, GIGA In Silico Medicine, Liège University, CHU-B34(+5) 1 Avenue de l'Hôpital, 4000 Liège, Belgium
| | - Frederic Kerff
- UR InBios, Centre d'Ingénierie des Protéines, Bât B6a, Allée du 6 Août, 19, B-4000 Liège, Belgium
| | - Francesca Rapino
- Cancer Signaling, GIGA Stem Cells, CHU-B34(+2) 1 Avenue de l'Hôpital, B-4000 Liège, Belgium
| | - Pierre Close
- Cancer Signaling, GIGA Stem Cells, CHU-B34(+2) 1 Avenue de l'Hôpital, B-4000 Liège, Belgium
| | - Liesbet Geris
- Biomechanics Research Unit, GIGA In Silico Medicine, Liège University, CHU-B34(+5) 1 Avenue de l'Hôpital, 4000 Liège, Belgium
- Skeletal Biology & Engineering Research Center, KU Leuven, ON I Herestraat 49 - box 813, 3000 Leuven, Belgium
- Biomechanics Section, KU Leuven, Celestijnenlaan 300C box 2419, B-3001 Heverlee, Belgium
| |
Collapse
|
8
|
Shallom D, Naiger D, Weiss S, Tuller T. Accelerating Whole-Cell Simulations of mRNA Translation Using a Dedicated Hardware. ACS Synth Biol 2021; 10:3489-3506. [PMID: 34813269 PMCID: PMC8689694 DOI: 10.1021/acssynbio.1c00415] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
In recent years, intracellular biophysical simulations have been used with increasing frequency not only for answering basic scientific questions but also in the field of synthetic biology. However, since these models include networks of interaction between millions of components, they are extremely time-consuming and cannot run easily on parallel computers. In this study, we demonstrate for the first time a novel approach addressing this challenge by using a dedicated hardware designed specifically to simulate such processes. As a proof of concept, we specifically focus on mRNA translation, which is the process consuming most of the energy in the cell. We design a hardware that simulates translation in Escherichia coli and Saccharomyces cerevisiae for thousands of mRNAs and ribosomes, which is in orders of magnitude faster than a similar software solution. With the sharp increase in the amount of genomic data available today and the complexity of the corresponding models inferred from them, we believe that the strategy suggested here will become common and can be used among others for simulating entire cells with all gene expression steps.
Collapse
Affiliation(s)
- David Shallom
- School of Electrical Engineering, Tel Aviv University, Tel Aviv 69978, Israel
| | - Danny Naiger
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv 69978, Israel
| | - Shlomo Weiss
- School of Electrical Engineering, Tel Aviv University, Tel Aviv 69978, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv 69978, Israel
| |
Collapse
|
9
|
Variability in mRNA translation: a random matrix theory approach. Sci Rep 2021; 11:5300. [PMID: 33674667 PMCID: PMC7970873 DOI: 10.1038/s41598-021-84738-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Accepted: 02/19/2021] [Indexed: 01/31/2023] Open
Abstract
The rate of mRNA translation depends on the initiation, elongation, and termination rates of ribosomes along the mRNA. These rates depend on many "local" factors like the abundance of free ribosomes and tRNA molecules in the vicinity of the mRNA molecule. All these factors are stochastic and their experimental measurements are also noisy. An important question is how protein production in the cell is affected by this considerable variability. We develop a new theoretical framework for addressing this question by modeling the rates as identically and independently distributed random variables and using tools from random matrix theory to analyze the steady-state production rate. The analysis reveals a principle of universality: the average protein production rate depends only on the of the set of possible values that the random variable may attain. This explains how total protein production can be stabilized despite the overwhelming stochasticticity underlying cellular processes.
Collapse
|
10
|
EGGTART: A tool to visualize the dynamics of biophysical transport under the inhomogeneous l-TASEP. Biophys J 2021; 120:1309-1313. [PMID: 33582139 DOI: 10.1016/j.bpj.2021.02.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Revised: 01/31/2021] [Accepted: 02/03/2021] [Indexed: 11/21/2022] Open
Abstract
The totally asymmetric simple exclusion process (TASEP), which describes the stochastic dynamics of interacting particles on a lattice, has been actively studied over the past several decades and applied to model important biological transport processes. Here, we present a software package, called EGGTART (Extensive GUI gives TASEP-realization in Real Time), which quantifies and visualizes the dynamics associated with a generalized version of the TASEP with an extended particle size and heterogeneous jump rates. This computational tool is based on analytic formulas obtained from deriving and solving the hydrodynamic limit of the process. It allows an immediate quantification of the particle density, flux, and phase diagram, as a function of a few key parameters associated with the system, which would be difficult to achieve via conventional stochastic simulations. Our software should therefore be of interest to biophysicists studying general transport processes and can in particular be used in the context of gene expression to model and quantify mRNA translation of different coding sequences.
Collapse
|
11
|
Signal Recognition Particle Suppressor Screening Reveals the Regulation of Membrane Protein Targeting by the Translation Rate. mBio 2021; 12:mBio.02373-20. [PMID: 33436432 PMCID: PMC7844537 DOI: 10.1128/mbio.02373-20] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
The signal recognition particle (SRP) is conserved in all living organisms, and it cotranslationally delivers proteins to the inner membrane or endoplasmic reticulum. Recently, SRP loss was found not to be lethal in either the eukaryote Saccharomyces cerevisiae or the prokaryote Streptococcus mutans In Escherichia coli, the role of SRP in mediating inner membrane protein (IMP) targeting has long been studied. However, the essentiality of SRP remains a controversial topic, partly hindered by the lack of strains in which SRP is completely absent. Here we show that the SRP was nonessential in E. coli by suppressor screening. We identified two classes of extragenic suppressors-two translation initiation factors and a ribosomal protein-all of which are involved in translation initiation. The translation rate and inner membrane proteomic analyses were combined to define the mechanism that compensates for the lack of SRP. The primary factor that contributes to the efficiency of IMP targeting is the extension of the time window for targeting by pausing the initiation of translation, which further reduces translation initiation and elongation rates. Furthermore, we found that easily predictable features in the nascent chain determine the specificity of protein targeting. Our results show why the loss of the SRP pathway does not lead to lethality. We report a new paradigm in which the time delay in translation initiation is beneficial during protein targeting in the absence of SRP.IMPORTANCE Inner membrane proteins (IMPs) are cotranslationally inserted into the inner membrane or endoplasmic reticulum by the signal recognition particle (SRP). Generally, the deletion of SRP can result in protein targeting defects in Escherichia coli Suppressor screening for loss of SRP reveals that pausing at the translation start site is likely to be critical in allowing IMP targeting and avoiding aggregation. In this work, we found for the first time that SRP is nonessential in E. coli The time delay in initiation is different from the previous mechanism that only slows down the elongation rate. It not only maximizes the opportunity for untranslated ribosomes to be near the inner membrane but also extends the time window for targeting translating ribosomes by decreasing the speed of translation. We anticipate that our work will be a starting point for a more delicate regulatory mechanism of protein targeting.
Collapse
|
12
|
Sarvari P, Ingram D, Stan GB. A Modelling Framework Linking Resource-Based Stochastic Translation to the Optimal Design of Synthetic Constructs. BIOLOGY 2021; 10:biology10010037. [PMID: 33430483 PMCID: PMC7826857 DOI: 10.3390/biology10010037] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2020] [Revised: 12/26/2020] [Accepted: 12/31/2020] [Indexed: 12/04/2022]
Abstract
Simple Summary In synthetic biology, it is commonplace to design and insert gene expression constructs into cells for the production of useful proteins. In order to maximise production yield, it is useful to predict the performance of these “engineered cells” in advance of conducting experiments. This is typically a complex task, which in recent years has motivated the use of “whole-cell models” (WCMs) that act as computational tools for predicting different aspects of cell growth. Many useful WCMs exist, however a common problem is their over-simplification of ribosome movement on mRNA transcripts during translation. WCMs typically don’t consider that, for constructs with inefficient (“slow”) codons, ribosomes can stall and form “traffic jams”, thereby becoming unavailable for translation of other proteins. To more accurately address these scenarios, we have built a computational framework that combines whole-cell modelling with a detailed account of ribosome movement on mRNA. We show how our framework can be used to link the modular design of a gene expression construct (via its promoter, ribosome binding site and codon composition) to protein yield during continuous cell culture, with a particular focus on how the optimal design can change over time in the presence or absence of “slow” codons. Abstract The effect of gene expression burden on engineered cells has motivated the use of “whole-cell models” (WCMs) that use shared cellular resources to predict how unnatural gene expression affects cell growth. A common problem with many WCMs is their inability to capture translation in sufficient detail to consider the impact of ribosomal queue formation on mRNA transcripts. To address this, we have built a “stochastic cell calculator” (StoCellAtor) that combines a modified TASEP with a stochastic implementation of an existing WCM. We show how our framework can be used to link a synthetic construct’s modular design (promoter, ribosome binding site (RBS) and codon composition) to protein yield during continuous culture, with a particular focus on the effects of low-efficiency codons and their impact on ribosomal queues. Through our analysis, we recover design principles previously established in our work on burden-sensing strategies, namely that changing promoter strength is often a more efficient way to increase protein yield than RBS strength. Importantly, however, we show how these design implications can change depending on both the duration of protein expression, and on the presence of ribosomal queues.
Collapse
Affiliation(s)
- Peter Sarvari
- Quantitative and Computational Biology, Dornsife College of Letters, Arts and Sciences, University of Southern California, Los Angeles, CA 90089, USA;
| | - Duncan Ingram
- Imperial College Centre for Synthetic Biology, Imperial College London, London SW7 2BU, UK;
- Department of Bioengineering, Imperial College London, London SW7 2BU, UK
| | - Guy-Bart Stan
- Imperial College Centre for Synthetic Biology, Imperial College London, London SW7 2BU, UK;
- Department of Bioengineering, Imperial College London, London SW7 2BU, UK
- Correspondence: ; Tel.: +44-020-7594-6375
| |
Collapse
|
13
|
Abstract
How did life begin on Earth? And is there life elsewhere in the Cosmos? Challenging questions, indeed. The series of conferences established by NoR CEL in 2013 addresses these very questions. This paper comprises a summary report of oral presentations that were delivered by NoR CEL’s network members during the 2018 Athens conference and, as such, disseminates the latest research which they have put forward. More in depth material can be found by consulting the contributors referenced papers. Overall, the outcome of this conspectus on the conference demonstrates a case for the existence of “probable chemistry” during the prebiotic epoch.
Collapse
|
14
|
Szavits-Nossan J, Ciandrini L. Inferring efficiency of translation initiation and elongation from ribosome profiling. Nucleic Acids Res 2020; 48:9478-9490. [PMID: 32821926 PMCID: PMC7515720 DOI: 10.1093/nar/gkaa678] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2020] [Revised: 07/29/2020] [Accepted: 08/15/2020] [Indexed: 01/13/2023] Open
Abstract
One of the main goals of ribosome profiling is to quantify the rate of protein synthesis at the level of translation. Here, we develop a method for inferring translation elongation kinetics from ribosome profiling data using recent advances in mathematical modelling of mRNA translation. Our method distinguishes between the elongation rate intrinsic to the ribosome’s stepping cycle and the actual elongation rate that takes into account ribosome interference. This distinction allows us to quantify the extent of ribosomal collisions along the transcript and identify individual codons where ribosomal collisions are likely. When examining ribosome profiling in yeast, we observe that translation initiation and elongation are close to their optima and traffic is minimized at the beginning of the transcript to favour ribosome recruitment. However, we find many individual sites of congestion along the mRNAs where the probability of ribosome interference can reach \documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{upgreek}
\usepackage{mathrsfs}
\setlength{\oddsidemargin}{-69pt}
\begin{document}
}{}$50\%$\end{document}. Our work provides new measures of translation initiation and elongation efficiencies, emphasizing the importance of rating these two stages of translation separately.
Collapse
Affiliation(s)
- Juraj Szavits-Nossan
- SUPA, School of Physics and Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh EH9 3FD, UK
| | - Luca Ciandrini
- Centre de Biologie Structurale (CBS), CNRS, INSERM, Univ Montpellier, Montpellier 34090, France
| |
Collapse
|
15
|
Szavits-Nossan J, Waclaw B. Current-density relation in the exclusion process with dynamic obstacles. Phys Rev E 2020; 102:042117. [PMID: 33212664 DOI: 10.1103/physreve.102.042117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2020] [Accepted: 09/17/2020] [Indexed: 06/11/2023]
Abstract
We investigate the totally asymmetric simple exclusion process (TASEP) in the presence of obstacles that dynamically bind and unbind from the lattice. The model is motivated by biological processes such as transcription in the presence of DNA-binding proteins. Similar models have been studied before using the mean-field approximation, but the exact relation between the particle current and density remains elusive. Here, we first show using extensive Monte Carlo simulations that the current-density relation in this model assumes a quasiparabolic form similar to that of the ordinary TASEP without obstacles. We then attempt to explain this relation using exact calculations in the limit of low and high density of particles. Our results suggest that the symmetric, quasiparabolic current-density relation arises through a nontrivial cancellation of higher-order terms, similarly as in the standard TASEP.
Collapse
Affiliation(s)
- J Szavits-Nossan
- School of Physics and Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh EH9 3FD, United Kingdom
| | - B Waclaw
- School of Physics and Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh EH9 3FD, United Kingdom
- Centre for Synthetic and Systems Biology, University of Edinburgh, Edinburgh EH9 3BF, United Kingdom
| |
Collapse
|
16
|
Computational discovery and modeling of novel gene expression rules encoded in the mRNA. Biochem Soc Trans 2020; 48:1519-1528. [PMID: 32662820 DOI: 10.1042/bst20191048] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Revised: 06/15/2020] [Accepted: 06/17/2020] [Indexed: 11/17/2022]
Abstract
The transcript is populated with numerous overlapping codes that regulate all steps of gene expression. Deciphering these codes is very challenging due to the large number of variables involved, the non-modular nature of the codes, biases and limitations in current experimental approaches, our limited knowledge in gene expression regulation across the tree of life, and other factors. In recent years, it has been shown that computational modeling and algorithms can significantly accelerate the discovery of novel gene expression codes. Here, we briefly summarize the latest developments and different approaches in the field.
Collapse
|
17
|
Szavits-Nossan J, Evans MR. Dynamics of ribosomes in mRNA translation under steady- and nonsteady-state conditions. Phys Rev E 2020; 101:062404. [PMID: 32688522 DOI: 10.1103/physreve.101.062404] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2020] [Accepted: 05/20/2020] [Indexed: 11/07/2022]
Abstract
Recent advances in DNA sequencing and fluorescence imaging have made it possible to monitor the dynamics of ribosomes actively engaged in messenger RNA (mRNA) translation. Here, we model these experiments within the inhomogeneous totally asymmetric simple exclusion process (TASEP) using realistic kinetic parameters. In particular, we present analytic expressions to describe the following three cases: (a) translation of a newly transcribed mRNA, (b) translation in the steady state and, specifically, the dynamics of individual (tagged) ribosomes, and (c) runoff translation after inhibition of translation initiation. In cases (b) and (c) we develop an effective medium approximation to describe many-ribosome dynamics in terms of a single tagged ribosome in an effective medium. The predictions are in good agreement with stochastic simulations.
Collapse
Affiliation(s)
- Juraj Szavits-Nossan
- SUPA, School of Physics and Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh EH9 3FD, United Kingdom
| | - Martin R Evans
- SUPA, School of Physics and Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh EH9 3FD, United Kingdom
| |
Collapse
|
18
|
Levin D, Tuller T. Whole cell biophysical modeling of codon-tRNA competition reveals novel insights related to translation dynamics. PLoS Comput Biol 2020; 16:e1008038. [PMID: 32649657 PMCID: PMC7375613 DOI: 10.1371/journal.pcbi.1008038] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Revised: 07/22/2020] [Accepted: 06/10/2020] [Indexed: 11/19/2022] Open
Abstract
The importance of mRNA translation models has been demonstrated across many fields of science and biotechnology. However, a whole cell model with codon resolution and biophysical dynamics is still lacking. We describe a whole cell model of translation for E. coli. The model simulates all major translation components in the cell: ribosomes, mRNAs and tRNAs. It also includes, for the first time, fundamental aspects of translation, such as competition for ribosomes and tRNAs at a codon resolution while considering tRNAs wobble interactions and tRNA recycling. The model uses parameters that are tightly inferred from large scale measurements of translation. Furthermore, we demonstrate a robust modelling approach which relies on state-of-the-art practices of translation modelling and also provides a framework for easy generalizations. This novel approach allows simulation of thousands of mRNAs that undergo translation in the same cell with common resources such as ribosomes and tRNAs in feasible time. Based on this model, we demonstrate, for the first time, the direct importance of competition for resources on translation and its accurate modelling. An effective supply-demand ratio (ESDR) measure, which is related to translation factors such as tRNAs, has been devised and utilized to show superior predictive power in complex scenarios of heterologous gene expression. The devised model is not only more accurate than the existing models, but, more importantly, provides a framework for analyzing complex whole cell translation problems and variables that haven't been explored before, making it important in various biomedical fields. mRNA translation is a fundamental process in all living organisms and the importance of its modeling has been demonstrated across many fields of science and biotechnology. Specifically, modeling a whole cell context with a high resolution has been a great challenge in the field, making many important problems un-addressable. In this study we devised a novel model, which allows, for the first time, simultaneous simulation of thousands of mRNAs, along with various bio-physical aspects that affect translation (such as codon-resolution dynamics and shared resources pool of both ribosomes and tRNAs). We demonstrated (using experimental data) that this model is more accurate than existing ones, and, more importantly, provides a framework for addressing complex translation problems (such as heterologous expression) at whole cell scale and in reasonable time. We demonstrated the model using E. coli data, but the model can be easily tailored to other organisms as well. Our model addresses an urgent unmet need for biophysically accurate whole cell translation model with resources coupling and has potential applications in many fields, including medicine and biotechnology.
Collapse
Affiliation(s)
- Doron Levin
- Biomedical Engineering Dept., Tel Aviv University, Tel Aviv, Israel
| | - Tamir Tuller
- Biomedical Engineering Dept., Tel Aviv University, Tel Aviv, Israel
- The Sagol School of Neuroscience, Tel Aviv University, Tel Aviv, Israel
- * E-mail:
| |
Collapse
|
19
|
Sabi R, Tuller T. Modelling and measuring intracellular competition for finite resources during gene expression. J R Soc Interface 2020; 16:20180887. [PMID: 31113334 DOI: 10.1098/rsif.2018.0887] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Dissecting the competition between genes for shared expressional resources is of fundamental importance for understanding the interplay between cellular components. Owing to the relationship between gene expression and cellular fitness, genomes are shaped by evolution to improve resource allocation. Whereas experimental approaches to investigate intracellular competition require technical resources and human expertise, computational models and in silico simulations allow vast numbers of experiments to be carried out and controlled easily, and with significantly reduced costs. Thus, modelling competition has a pivotal role in understanding the effects of competition on the biophysics of the cell. In this article, we review various computational models proposed to describe the different types of competition during gene expression. We also present relevant synthetic biology experiments and their biotechnological implications, and discuss the open questions in the field.
Collapse
Affiliation(s)
- Renana Sabi
- 1 Department of Biomedical Engineering, Tel Aviv University , Israel
| | - Tamir Tuller
- 1 Department of Biomedical Engineering, Tel Aviv University , Israel.,2 The Sagol School of Neuroscience, Tel Aviv University , Israel
| |
Collapse
|
20
|
Abstract
Messenger RNAs (mRNAs) consist of a coding region (open reading frame (ORF)) and two untranslated regions (UTRs), 5'UTR and 3'UTR. Ribosomes travel along the coding region, translating nucleotide triplets (called codons) to a chain of amino acids. The coding region was long believed to mainly encode the amino acid content of proteins, whereas regulatory signals reside in the UTRs and in other genomic regions. However, in recent years we have learned that the ORF is expansively populated with various regulatory signals, or codes, which are related to all gene expression steps and additional intracellular aspects. In this paper, we review the current knowledge related to overlapping codes inside the coding regions, such as the influence of synonymous codon usage on translation speed (and, in turn, the effect of translation speed on protein folding), ribosomal frameshifting, mRNA stability, methylation, splicing, transcription and more. All these codes come together and overlap in the ORF sequence, ensuring production of the right protein at the right time.
Collapse
Affiliation(s)
- Shaked Bergman
- Department of Biomedical Engineering, Tel-Aviv University, Tel Aviv, Israel
| | | |
Collapse
|
21
|
Molecules to Microbes. SCI 2020. [DOI: 10.3390/sci2020020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
How did life begin on Earth? And is there life elsewhere in the Cosmos? Challenging questions, indeed. The series of conferences established by NoR CEL in 2013, addresses these very same questions. The basis for this paper is the summary report of oral presentations that were delivered by NoR CEL’s network members during the 2018 Athens conference and, as such, disseminates the latest research which they have put forward. More in depth material can be found by consulting the contributors referenced papers. Overall, the outcome of this conspectus on the conference demonstrates a case for the existence of “probable chemistry” during the prebiotic epoch.
Collapse
|
22
|
Dykeman EC. A stochastic model for simulating ribosome kinetics in vivo. PLoS Comput Biol 2020; 16:e1007618. [PMID: 32049979 PMCID: PMC7015319 DOI: 10.1371/journal.pcbi.1007618] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2019] [Accepted: 12/19/2019] [Indexed: 12/15/2022] Open
Abstract
Computational modelling of in vivo protein synthesis is highly complicated, as it requires the simulation of ribosomal movement over the entire transcriptome, as well as consideration of the concentration effects from 40+ different types of tRNAs and numerous other protein factors. Here I report on the development of a stochastic model for protein translation that is capable of simulating the dynamical process of in vivo protein synthesis in a prokaryotic cell containing several thousand unique mRNA sequences, with explicit nucleotide information for each, and report on a number of biological predictions which are beyond the scope of existing models. In particular, I show that, when the complex network of concentration dependent interactions between elongation factors, tRNAs, ribosomes, and other factors required for protein synthesis are included in full detail, several biological phenomena, such as the increasing peptide elongation rate with bacterial growth rate, are predicted as emergent properties of the model. The stochastic model presented here demonstrates the importance of considering the translational process at this level of detail, and provides a platform to interrogate various aspects of translation that are difficult to study in more coarse-grained models.
Collapse
|
23
|
Erdmann-Pham DD, Dao Duc K, Song YS. The Key Parameters that Govern Translation Efficiency. Cell Syst 2020; 10:183-192.e6. [PMID: 31954660 DOI: 10.1016/j.cels.2019.12.003] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2019] [Revised: 08/29/2019] [Accepted: 12/17/2019] [Indexed: 11/16/2022]
Abstract
Translation of mRNA into protein is a fundamental yet complex biological process with multiple factors that can potentially affect its efficiency. Here, we study a stochastic model describing the traffic flow of ribosomes along the mRNA and identify the key parameters that govern the overall rate of protein synthesis, sensitivity to initiation rate changes, and efficiency of ribosome usage. By analyzing a continuum limit of the model, we obtain closed-form expressions for stationary currents and ribosomal densities, which agree well with Monte Carlo simulations. Furthermore, we completely characterize the phase transitions in the system, and by applying our theoretical results, we formulate design principles that detail how to tune the key parameters we identified to optimize translation efficiency. Using ribosome profiling data from S. cerevisiae, we show that its translation system is generally consistent with these principles. Our theoretical results have implications for evolutionary biology, as well as for synthetic biology.
Collapse
Affiliation(s)
- Dan D Erdmann-Pham
- Department of Mathematics, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Khanh Dao Duc
- Computer Science Division, University of California, Berkeley, Berkeley, CA 94720, USA
| | - Yun S Song
- Computer Science Division, University of California, Berkeley, Berkeley, CA 94720, USA; Department of Statistics, University of California, Berkeley, Berkeley, CA 94720, USA; Chan Zuckerberg Biohub, San Francisco, CA 94158, USA.
| |
Collapse
|
24
|
Scott S, Szavits-Nossan J. Power series method for solving TASEP-based models of mRNA translation. Phys Biol 2019; 17:015004. [PMID: 31726446 DOI: 10.1088/1478-3975/ab57a0] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
We develop a method for solving mathematical models of messenger RNA (mRNA) translation based on the totally asymmetric simple exclusion process (TASEP). Our main goal is to demonstrate that the method is versatile and applicable to realistic models of translation. To this end we consider the TASEP with codon-dependent elongation rates, premature termination due to ribosome drop-off and translation reinitiation due to circularisation of the mRNA. We apply the method to the model organism Saccharomyces cerevisiae under physiological conditions and find an excellent agreement with the results of stochastic simulations. Our findings suggest that the common view on translation as being rate-limited by initiation is oversimplistic. Instead we find theoretical evidence for ribosome interference and also theoretical support for the ramp hypothesis which argues that codons at the beginning of genes have slower elongation rates in order to reduce ribosome density and jamming.
Collapse
Affiliation(s)
- S Scott
- SUPA, School of Physics and Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh EH9 3FD, United Kingdom
| | | |
Collapse
|
25
|
Park H, Subramaniam AR. Inverted translational control of eukaryotic gene expression by ribosome collisions. PLoS Biol 2019; 17:e3000396. [PMID: 31532761 PMCID: PMC6750593 DOI: 10.1371/journal.pbio.3000396] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2019] [Accepted: 08/05/2019] [Indexed: 11/19/2022] Open
Abstract
The canonical model of eukaryotic translation posits that efficient translation initiation increases protein expression and mRNA stability. Contrary to this model, we find that increasing initiation rate can decrease both protein expression and stability of certain mRNAs in the budding yeast Saccharomyces cerevisiae. These mRNAs encode a stretch of polybasic residues that cause ribosome stalling. Our computational modeling predicts that the observed decrease in gene expression at high initiation rates occurs when ribosome collisions at stalls stimulate abortive termination of the leading ribosome or cause endonucleolytic mRNA cleavage. Consistent with this prediction, the collision-associated quality-control factors Asc1 and Hel2 (orthologs of human RACK1 and ZNF598, respectively) decrease gene expression from stall-containing mRNAs only at high initiation rates. Remarkably, hundreds of S. cerevisiae mRNAs that contain ribosome stall sequences also exhibit lower translation efficiency. We propose that inefficient translation initiation allows these stall-containing endogenous mRNAs to escape collision-stimulated reduction in gene expression. Higher rates of translation counterintuitively lead to lower protein levels from eukaryotic mRNAs that encode ribosome stalls; modelling suggests that this occurs when ribosome collisions at stalls trigger abortive termination of the leading ribosome or cause endonucleolytic mRNA cleavage.
Collapse
Affiliation(s)
- Heungwon Park
- Basic Sciences Division and Computational Biology Section of Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - Arvind R. Subramaniam
- Basic Sciences Division and Computational Biology Section of Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- * E-mail:
| |
Collapse
|
26
|
Abstract
How did life begin on Earth? And is there life elsewhere in the Cosmos? Challenging questions, indeed. The series of conferences established by NoR CEL in 2013, addresses these very same questions. The basis for this paper is the summary report of oral presentations that were delivered by NoR CEL’s network members during the 2018 Athens conference and, as such, disseminates the latest research which they have put forward. More in depth material can be found by consulting the contributors referenced papers. Overall, the outcome of this conspectus on the conference demonstrates a case for the existence of “probable chemistry” during the prebiotic epoch.
Collapse
|
27
|
Abstract
Heterologously expressed genes require adaptation to the host organism to ensure adequate levels of protein synthesis, which is typically approached by replacing codons by the target organism’s preferred codons. In view of frequently encountered suboptimal outcomes we introduce the codon-specific elongation model (COSEM) as an alternative concept. COSEM simulates ribosome dynamics during mRNA translation and informs about protein synthesis rates per mRNA in an organism- and context-dependent way. Protein synthesis rates from COSEM are integrated with further relevant covariates such as translation accuracy into a protein expression score that we use for codon optimization. The scoring algorithm further enables fine-tuning of protein expression including deoptimization and is implemented in the software OCTOPOS. The protein expression score produces competitive predictions on proteomic data from prokaryotic, eukaryotic, and human expression systems. In addition, we optimized and tested heterologous expression of manA and ova genes in Salmonella enterica serovar Typhimurium. Superiority over standard methodology was demonstrated by a threefold increase in protein yield compared to wildtype and commercially optimized sequences.
Collapse
|
28
|
Sharma AK, Sormanni P, Ahmed N, Ciryam P, Friedrich UA, Kramer G, O’Brien EP. A chemical kinetic basis for measuring translation initiation and elongation rates from ribosome profiling data. PLoS Comput Biol 2019; 15:e1007070. [PMID: 31120880 PMCID: PMC6559674 DOI: 10.1371/journal.pcbi.1007070] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Revised: 06/11/2019] [Accepted: 05/06/2019] [Indexed: 01/23/2023] Open
Abstract
Analysis methods based on simulations and optimization have been previously developed to estimate relative translation rates from next-generation sequencing data. Translation involves molecules and chemical reactions, hence bioinformatics methods consistent with the laws of chemistry and physics are more likely to produce accurate results. Here, we derive simple equations based on chemical kinetic principles to measure the translation-initiation rate, transcriptome-wide elongation rate, and individual codon translation rates from ribosome profiling experiments. Our methods reproduce the known rates from ribosome profiles generated from detailed simulations of translation. By applying our methods to data from S. cerevisiae and mouse embryonic stem cells, we find that the extracted rates reproduce expected correlations with various molecular properties, and we also find that mouse embryonic stem cells have a global translation speed of 5.2 AA/s, in agreement with previous reports that used other approaches. Our analysis further reveals that a codon can exhibit up to 26-fold variability in its translation rate depending upon its context within a transcript. This broad distribution means that the average translation rate of a codon is not representative of the rate at which most instances of that codon are translated, and it suggests that translational regulation might be used by cells to a greater degree than previously thought.
Collapse
Affiliation(s)
- Ajeet K. Sharma
- Department of Chemistry, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Pietro Sormanni
- Centre for Misfolding Diseases, Department of Chemistry, University of Cambridge, Cambridge, United Kingdom
| | - Nabeel Ahmed
- Bioinformatics and Genomics Graduate Program, The Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania, United States of America
| | - Prajwal Ciryam
- Centre for Misfolding Diseases, Department of Chemistry, University of Cambridge, Cambridge, United Kingdom
| | - Ulrike A. Friedrich
- Center for Molecular Biology of the Heidelberg University (ZMBH), DKFZ-ZMBH Alliance, Heidelberg, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Günter Kramer
- Center for Molecular Biology of the Heidelberg University (ZMBH), DKFZ-ZMBH Alliance, Heidelberg, Germany
- German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Edward P. O’Brien
- Department of Chemistry, Pennsylvania State University, University Park, Pennsylvania, United States of America
- Bioinformatics and Genomics Graduate Program, The Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, Pennsylvania, United States of America
- Institute for CyberScience, Pennsylvania State University, University Park, Pennsylvania, United States of America
| |
Collapse
|
29
|
Nanikashvili I, Zarai Y, Ovseevich A, Tuller T, Margaliot M. Networks of ribosome flow models for modeling and analyzing intracellular traffic. Sci Rep 2019; 9:1703. [PMID: 30737417 PMCID: PMC6368613 DOI: 10.1038/s41598-018-37864-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2018] [Accepted: 12/17/2018] [Indexed: 11/20/2022] Open
Abstract
The ribosome flow model with input and output (RFMIO) is a deterministic dynamical system that has been used to study the flow of ribosomes during mRNA translation. The input of the RFMIO controls its initiation rate and the output represents the ribosome exit rate (and thus the protein production rate) at the 3′ end of the mRNA molecule. The RFMIO and its variants encapsulate important properties that are relevant to modeling ribosome flow such as the possible evolution of “traffic jams” and non-homogeneous elongation rates along the mRNA molecule, and can also be used for studying additional intracellular processes such as transcription, transport, and more. Here we consider networks of interconnected RFMIOs as a fundamental tool for modeling, analyzing and re-engineering the complex mechanisms of protein production. In these networks, the output of each RFMIO may be divided, using connection weights, between several inputs of other RFMIOs. We show that under quite general feedback connections the network has two important properties: (1) it admits a unique steady-state and every trajectory converges to this steady-state; and (2) the problem of how to determine the connection weights so that the network steady-state output is maximized is a convex optimization problem. These mathematical properties make these networks highly suitable as models of various phenomena: property (1) means that the behavior is predictable and ordered, and property (2) means that determining the optimal weights is numerically tractable even for large-scale networks. For the specific case of a feed-forward network of RFMIOs we prove an additional useful property, namely, that there exists a spectral representation for the network steady-state, and thus it can be determined without any numerical simulations of the dynamics. We describe the implications of these results to several fundamental biological phenomena and biotechnological objectives.
Collapse
Affiliation(s)
- Itzik Nanikashvili
- School of Electrical Engineering, Tel-Aviv University, Tel-Aviv, 69978, Israel
| | - Yoram Zarai
- Department of Biomedical Engineering, Tel-Aviv University, Tel-Aviv, 69978, Israel
| | - Alexander Ovseevich
- Ishlinsky Institute for Problems in Mechanics, Russian Academy of Sciences and the Russian Quantum Center, Moscow, Russia
| | - Tamir Tuller
- Sagol School of Neuroscience, Tel-Aviv University, Tel-Aviv, 69978, Israel. .,Department of Biomedical Engineering, Tel-Aviv University, Tel-Aviv, 69978, Israel.
| | - Michael Margaliot
- School of Electrical Engineering, Tel-Aviv University, Tel-Aviv, 69978, Israel.,Sagol School of Neuroscience, Tel-Aviv University, Tel-Aviv, 69978, Israel
| |
Collapse
|
30
|
Ding W, Cheng J, Guo D, Mao L, Li J, Lu L, Zhang Y, Yang J, Jiang H. Engineering the 5' UTR-Mediated Regulation of Protein Abundance in Yeast Using Nucleotide Sequence Activity Relationships. ACS Synth Biol 2018; 7:2709-2714. [PMID: 30525473 DOI: 10.1021/acssynbio.8b00127] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The 5' untranslated region (5'UTR) plays a key role in post-transcriptional regulation, but interaction between nucleotides and directed evolution of 5'UTRs as synthetic regulatory elements remain unclear. By constructing a library of synthesized random 5'UTRs of 24 nucleotides in Saccharomyces cerevisiae, we observed strong epistatic interactions among bases from different positions in the 5'UTR. Taking into account these base interactions, we constructed a mathematical model to predict protein abundance with a precision of R2 = 0.60. On the basis of this model, we developed an approach to engineer 5'UTRs according to nucleotide sequence activity relationships (NuSAR), in which 5'UTRs were engineered stepwise through repeated cycles of backbone design, directed screening, and model reconstruction. After three rounds of NuSAR, the predictive accuracy of our model was improved to R2 = 0.71, and a strong 5'UTR was obtained with 5-fold higher protein abundance than the starting 5'UTR. Our findings provide new insights into the mechanism of 5'UTR regulation and contribute to a new translational elements engineering approach in synthetic biology.
Collapse
Affiliation(s)
- Wentao Ding
- Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
- Beijing Advanced Innovation Center for Soft Matter Science and Engineering, Beijing University of Chemical Technology, Beijing 100029, China
| | - Jian Cheng
- Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| | - Dan Guo
- Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| | - Ling Mao
- College of Biology and Pharmaceutical Engineering, Wuhan Polytechnic University, Wuhan 430023, China
| | - Jingwei Li
- Laboratory of Mathematics for Nonlinear Science, Shanghai Key Laboratory for Contemporary Applied Mathematics, Centre for Computational Systems Biology, School of Mathematical Sciences, Fudan University, Shanghai 200433, China
| | - Lina Lu
- Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| | - Yunxin Zhang
- Laboratory of Mathematics for Nonlinear Science, Shanghai Key Laboratory for Contemporary Applied Mathematics, Centre for Computational Systems Biology, School of Mathematical Sciences, Fudan University, Shanghai 200433, China
| | - Jiangke Yang
- College of Biology and Pharmaceutical Engineering, Wuhan Polytechnic University, Wuhan 430023, China
| | - Huifeng Jiang
- Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, China
| |
Collapse
|
31
|
Dao Duc K, Saleem ZH, Song YS. Theoretical analysis of the distribution of isolated particles in totally asymmetric exclusion processes: Application to mRNA translation rate estimation. Phys Rev E 2018; 97:012106. [PMID: 29448386 DOI: 10.1103/physreve.97.012106] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2017] [Indexed: 11/07/2022]
Abstract
The Totally Asymmetric Exclusion Process (TASEP) is a classical stochastic model for describing the transport of interacting particles, such as ribosomes moving along the messenger ribonucleic acid (mRNA) during translation. Although this model has been widely studied in the past, the extent of collision between particles and the average distance between a particle to its nearest neighbor have not been quantified explicitly. We provide here a theoretical analysis of such quantities via the distribution of isolated particles. In the classical form of the model in which each particle occupies only a single site, we obtain an exact analytic solution using the matrix ansatz. We then employ a refined mean-field approach to extend the analysis to a generalized TASEP with particles of an arbitrary size. Our theoretical study has direct applications in mRNA translation and the interpretation of experimental ribosome profiling data. In particular, our analysis of data from Saccharomyces cerevisiae suggests a potential bias against the detection of nearby ribosomes with a gap distance of less than approximately three codons, which leads to some ambiguity in estimating the initiation rate and protein production flux for a substantial fraction of genes. Despite such ambiguity, however, we demonstrate theoretically that the interference rate associated with collisions can be robustly estimated and show that approximately 1% of the translating ribosomes get obstructed.
Collapse
Affiliation(s)
- Khanh Dao Duc
- Computer Science Division, University of California, Berkeley, California 94720, USA
| | - Zain H Saleem
- Department of Mathematics, University of Pennsylvania, Pennsylvania 19104, USA
| | - Yun S Song
- Computer Science Division and Department of Statistics, University of California, Berkeley, California 94720, USA
| |
Collapse
|
32
|
Szavits-Nossan J, Romano MC, Ciandrini L. Power series solution of the inhomogeneous exclusion process. Phys Rev E 2018; 97:052139. [PMID: 29906846 DOI: 10.1103/physreve.97.052139] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2018] [Indexed: 11/07/2022]
Abstract
We develop a power series method for the nonequilibrium steady state of the inhomogeneous one-dimensional totally asymmetric simple exclusion process (TASEP) in contact with two particle reservoirs and with site-dependent hopping rates in the bulk. The power series is performed in the entrance or exit rates governing particle exchange with the reservoirs, and the corresponding particle current is computed analytically up to the cubic term in the entry or exit rate, respectively. We also show how to compute higher-order terms using combinatorial objects known as Young tableaux. Our results address the long outstanding problem of finding the exact nonequilibrium steady state of the inhomogeneous TASEP. The findings are particularly relevant to the modeling of mRNA translation in which the rate of translation initiation, corresponding to the entrance rate in the TASEP, is typically small.
Collapse
Affiliation(s)
- Juraj Szavits-Nossan
- SUPA, School of Physics and Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh EH9 3FD, United Kingdom
| | - M Carmen Romano
- SUPA, Institute for Complex Systems and Mathematical Biology, Department of Physics, Aberdeen AB24 3UE, United Kingdom and Institute of Medical Sciences, University of Aberdeen, Foresterhill, Aberdeen AB24 3FX, United Kingdom
| | - Luca Ciandrini
- DIMNP, Université de Montpellier, CNRS, Montpellier, France and L2C, Université de Montpellier, CNRS, Montpellier, France
| |
Collapse
|
33
|
Zarai Y, Margaliot M, Sontag ED, Tuller T. Controllability Analysis and Control Synthesis for the Ribosome Flow Model. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1351-1364. [PMID: 28541906 PMCID: PMC5778923 DOI: 10.1109/tcbb.2017.2707420] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
The ribosomal density along different parts of the coding regions of the mRNA molecule affects various fundamental intracellular phenomena including: protein production rates, global ribosome allocation and organismal fitness, ribosomal drop off, co-translational protein folding, mRNA degradation, and more. Thus, regulating translation in order to obtain a desired ribosomal profile along the mRNA molecule is an important biological problem. We study this problem by using a dynamical model for mRNA translation, called the ribosome flow model (RFM). In the RFM, the mRNA molecule is modeled as an ordered chain of $n$ sites. The RFM includes $n$ state-variables describing the ribosomal density profile along the mRNA molecule, and the transition rates from each site to the next are controlled by $n+1$ positive constants. To study the problem of controlling the density profile, we consider some or all of the transition rates as time-varying controls. We consider the following problem: given an initial and a desired ribosomal density profile in the RFM, determine the time-varying values of the transition rates that steer the system to the desired density profile, if they exist. More specifically, we consider two control problems. In the first, all transition rates can be regulated separately, and the goal is to steer the ribosomal density profile and the protein production rate from a given initial value to a desired value. In the second problem, one or more transition rates are jointly regulated by a single scalar control, and the goal is to steer the production rate to a desired value within a certain set of feasible values. In the first case, we show that the system is controllable, i.e., the control is powerful enough to steer the system to any desired value in finite time, and provide simple closed-form expressions for constant positive control functions (or transition rates) that asymptotically steer the system to the desired value. In the second case, we show that the system is controllable, and provide a simple algorithm for determining the constant positive control value that asymptotically steers the system to the desired value. We discuss some of the biological implications of these results.
Collapse
|
34
|
Computational analysis of the oscillatory behavior at the translation level induced by mRNA levels oscillations due to finite intracellular resources. PLoS Comput Biol 2018; 14:e1006055. [PMID: 29614119 PMCID: PMC5898785 DOI: 10.1371/journal.pcbi.1006055] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2017] [Revised: 04/13/2018] [Accepted: 02/15/2018] [Indexed: 11/22/2022] Open
Abstract
Recent studies have demonstrated how the competition for the finite pool of available gene expression factors has important effect on fundamental gene expression aspects. In this study, based on a whole-cell model simulation of translation in S. cerevisiae, we evaluate for the first time the expected effect of mRNA levels fluctuations on translation due to the finite pool of ribosomes. We show that fluctuations of a single gene or a group of genes mRNA levels induce periodic behavior in all S. cerevisiae translation factors and aspects: the ribosomal densities and the translation rates of all S. cerevisiae mRNAs oscillate. We numerically measure the oscillation amplitudes demonstrating that fluctuations of endogenous and heterologous genes can cause a significant fluctuation of up to 50% in the steady-state translation rates of the rest of the genes. Furthermore, we demonstrate by synonymous mutations that oscillating the levels of mRNAs that experience high ribosomal occupancy (e.g. ribosomal “traffic jam”) induces the largest impact on the translation of the S. cerevisiae genome. The results reported here should provide novel insights and principles related to the design of synthetic gene expression circuits and related to the evolutionary constraints shaping gene expression of endogenous genes. Each cell contains a limited number of macromolecules and factors that participate in the gene expression process. These expression resources are shared between the different molecules that encode the genetic code, resulting in non-trivial couplings and competitions between the different gene expression stages. Such competitions should be considered when analyzing the cellular economy of the cell, the genome evolution, and the design of synthetic expression circuits. Here we study the effect of couplings and competitions for ribosomes by performing a whole-cell simulation of translation of S. cerevisiae, with parameters estimated from experimental data. We demonstrate that by periodically changing the mRNA levels of a single gene (endogenous or heterologous) or a set of genes, the translation of all S. cerevisiae genes are affected in a periodic manner. We numerically estimate the exact impact of the mRNA levels periodicity on the translation process dynamics, as well as on the dynamics of the free ribosomal pool and the way it is affected by parameters such as the codon composition of the oscillating gene, its initiation rate and mRNA levels. Furthermore, we show that the codon compositions of synthetically highly expressed heterologous genes that are expected to oscillate must be carefully considered. For example, synonymous mutations resulting in “traffic jams” of ribosomes along the fluctuated mRNAs may cause significant fluctuations of up to 50% in the steady-state translation rates of all genes.
Collapse
|
35
|
Shaham G, Tuller T. Genome scale analysis of Escherichia coli with a comprehensive prokaryotic sequence-based biophysical model of translation initiation and elongation. DNA Res 2018; 25:195-205. [PMID: 29161365 PMCID: PMC6012489 DOI: 10.1093/dnares/dsx049] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2017] [Accepted: 11/04/2017] [Indexed: 11/17/2022] Open
Abstract
Translation initiation in prokaryotes is affected by the mRNA folding and interaction of the ribosome binding site with the ribosomal RNA. The elongation rate is affected, among other factors, by the local biophysical properties of the coding regions, the decoding rates of different codons, and the interactions among ribosomes. Currently, there is no comprehensive biophysical model of translation that enables the prediction of mRNA translation dynamics based only on the transcript sequence and while considering all of these fundamental aspects of translation. In this study, we provide, for the first time, a computational simulative biophysical model of both translation initiation and elongation with all aspects mentioned above. We demonstrate our model performance and advantages focusing on Escherichia coli genes. We further show that the model enables prediction of translation rate, protein levels, and ribosome densities. In addition, our model enables quantifying the effect of silent mutations on translation rate in different parts of the transcript, the relative effect of mutations on translation initiation and elongation, and the effect of mutations on ribosome traffic jams. Thus, unlike previous models, the proposed one provides comprehensive information, facilitating future research in disciplines such as molecular evolution, synthetic biology, and functional genomics. A toolkit to estimate translation dynamics of transcripts is available at: https://www.cs.tau.ac.il/∼tamirtul/transim.
Collapse
Affiliation(s)
- Gilad Shaham
- Department of Biomedical Engineering, The Engineering Faculty, Tel Aviv University, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, The Engineering Faculty, Tel Aviv University, Israel
- The Sagol School of Neuroscience, Tel-Aviv University, Tel-Aviv, Israel
| |
Collapse
|
36
|
Szavits-Nossan J, Ciandrini L, Romano MC. Deciphering mRNA Sequence Determinants of Protein Production Rate. PHYSICAL REVIEW LETTERS 2018; 120:128101. [PMID: 29694095 DOI: 10.1103/physrevlett.120.128101] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/02/2017] [Revised: 01/18/2018] [Indexed: 06/08/2023]
Abstract
One of the greatest challenges in biophysical models of translation is to identify coding sequence features that affect the rate of translation and therefore the overall protein production in the cell. We propose an analytic method to solve a translation model based on the inhomogeneous totally asymmetric simple exclusion process, which allows us to unveil simple design principles of nucleotide sequences determining protein production rates. Our solution shows an excellent agreement when compared to numerical genome-wide simulations of S. cerevisiae transcript sequences and predicts that the first 10 codons, which is the ribosome footprint length on the mRNA, together with the value of the initiation rate, are the main determinants of protein production rate under physiological conditions. Finally, we interpret the obtained analytic results based on the evolutionary role of the codons' choice for regulating translation rates and ribosome densities.
Collapse
Affiliation(s)
- Juraj Szavits-Nossan
- SUPA, School of Physics and Astronomy, University of Edinburgh, Peter Guthrie Tait Road, Edinburgh EH9 3FD, United Kingdom
| | - Luca Ciandrini
- L2C, Université de Montpellier, CNRS, Montpellier, France and DIMNP, Université de Montpellier, CNRS, Montpellier, France
| | - M Carmen Romano
- SUPA, Institute for Complex Systems and Mathematical Biology, Department of Physics, Aberdeen AB24 3UE, United Kingdom and Institute of Medical Sciences, University of Aberdeen, Foresterhill, Aberdeen AB24 3FX, United Kingdom
| |
Collapse
|
37
|
Fang H, Huang YF, Radhakrishnan A, Siepel A, Lyon GJ, Schatz MC. Scikit-ribo Enables Accurate Estimation and Robust Modeling of Translation Dynamics at Codon Resolution. Cell Syst 2018; 6:180-191.e4. [PMID: 29361467 PMCID: PMC5832574 DOI: 10.1016/j.cels.2017.12.007] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2017] [Revised: 09/24/2017] [Accepted: 12/08/2017] [Indexed: 10/18/2022]
Abstract
Ribosome profiling (Ribo-seq) is a powerful technique for measuring protein translation; however, sampling errors and biological biases are prevalent and poorly understood. Addressing these issues, we present Scikit-ribo (https://github.com/schatzlab/scikit-ribo), an open-source analysis package for accurate genome-wide A-site prediction and translation efficiency (TE) estimation from Ribo-seq and RNA sequencing data. Scikit-ribo accurately identifies A-site locations and reproduces codon elongation rates using several digestion protocols (r = 0.99). Next, we show that the commonly used reads per kilobase of transcript per million mapped reads-derived TE estimation is prone to biases, especially for low-abundance genes. Scikit-ribo introduces a codon-level generalized linear model with ridge penalty that correctly estimates TE, while accommodating variable codon elongation rates and mRNA secondary structure. This corrects the TE errors for over 2,000 genes in S. cerevisiae, which we validate using mass spectrometry of protein abundances (r = 0.81), and allows us to determine the Kozak-like sequence directly from Ribo-seq. We conclude with an analysis of coverage requirements needed for robust codon-level analysis and quantify the artifacts that can occur from cycloheximide treatment.
Collapse
Affiliation(s)
- Han Fang
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA; Department of Applied Mathematics & Statistics, Stony Brook University, Stony Brook, NY 11794, USA
| | - Yi-Fei Huang
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Aditya Radhakrishnan
- Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD 21205, USA
| | - Adam Siepel
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Gholson J Lyon
- Stanley Institute for Cognitive Genomics, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
| | - Michael C Schatz
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA; Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, MD 21211, USA.
| |
Collapse
|
38
|
Diament A, Feldman A, Schochet E, Kupiec M, Arava Y, Tuller T. The extent of ribosome queuing in budding yeast. PLoS Comput Biol 2018; 14:e1005951. [PMID: 29377894 PMCID: PMC5805374 DOI: 10.1371/journal.pcbi.1005951] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2017] [Revised: 02/08/2018] [Accepted: 01/05/2018] [Indexed: 11/18/2022] Open
Abstract
Ribosome queuing is a fundamental phenomenon suggested to be related to topics such as genome evolution, synthetic biology, gene expression regulation, intracellular biophysics, and more. However, this phenomenon hasn't been quantified yet at a genomic level. Nevertheless, methodologies for studying translation (e.g. ribosome footprints) are usually calibrated to capture only single ribosome protected footprints (mRPFs) and thus limited in their ability to detect ribosome queuing. On the other hand, most of the models in the field assume and analyze a certain level of queuing. Here we present an experimental-computational approach for studying ribosome queuing based on sequencing of RNA footprints extracted from pairs of ribosomes (dRPFs) using a modified ribosome profiling protocol. We combine our approach with traditional ribosome profiling to generate a detailed profile of ribosome traffic. The data are analyzed using computational models of translation dynamics. The approach was implemented on the Saccharomyces cerevisiae transcriptome. Our data shows that ribosome queuing is more frequent than previously thought: the measured ratio of ribosomes within dRPFs to mRPFs is 0.2–0.35, suggesting that at least one to five translating ribosomes is in a traffic jam; these queued ribosomes cannot be captured by traditional methods. We found that specific regions are enriched with queued ribosomes, such as the 5’-end of ORFs, and regions upstream to mRPF peaks, among others. While queuing is related to higher density of ribosomes on the transcript (characteristic of highly translated genes), we report cases where traffic jams are relatively more severe in lowly expressed genes and possibly even selected for. In addition, our analysis demonstrates that higher adaptation of the coding region to the intracellular tRNA levels is associated with lower queuing levels. Our analysis also suggests that the Saccharomyces cerevisiae transcriptome undergoes selection for eliminating traffic jams. Thus, our proposed approach is an essential tool for high resolution analysis of ribosome traffic during mRNA translation and understanding its evolution. During translation, multiple ribosomes may translate the same mRNA. The density of ribosomal traffic across the transcript poses several open questions, such as how often a ribosome’s path is blocked by a second ribosome, do queues of multiple ribosomes typically form on mRNAs and what is their effect on the overall translation rate of an mRNA. However, this phenomenon hasn't been quantified yet at a genomic level. Nevertheless, methodologies for monitoring translation are limited in their ability to detect ribosome queuing. On the other hand, most of the models in the field assume and analyze a certain level of queuing. Here we present an experimental-computational approach for studying ribosome queuing based on sequencing of RNA footprints extracted from pairs of adjacent translating ribosomes, and a computational model of translation dynamics. Our data shows that ribosome queuing in Saccharomyces cerevisiae is more frequent than previously thought, suggesting that at least one to five translating ribosomes is in a traffic jam; these queued ribosomes cannot be captured by traditional methods. Our analysis also suggests that the S. cerevisiae transcriptome undergoes selection for eliminating traffic jams, while specific regions and genes may possibly be under selection for increased queuing.
Collapse
Affiliation(s)
- Alon Diament
- Biomedical Engineering Dept., Tel Aviv University, Tel Aviv, Israel
| | - Anna Feldman
- Biomedical Engineering Dept., Tel Aviv University, Tel Aviv, Israel
| | - Elisheva Schochet
- The Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel
| | - Martin Kupiec
- Dept. of Molecular Microbiology and Biotechnology, Tel Aviv University, Tel Aviv, Israel
| | - Yoav Arava
- Biology Dept., Technion-Israel Institute of Technology, Haifa, Israel
| | - Tamir Tuller
- Biomedical Engineering Dept., Tel Aviv University, Tel Aviv, Israel
- The Sagol School of Neuroscience, Tel Aviv University, Tel Aviv, Israel
- * E-mail:
| |
Collapse
|
39
|
The impact of ribosomal interference, codon usage, and exit tunnel interactions on translation elongation rate variation. PLoS Genet 2018; 14:e1007166. [PMID: 29337993 PMCID: PMC5786338 DOI: 10.1371/journal.pgen.1007166] [Citation(s) in RCA: 61] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2017] [Revised: 01/26/2018] [Accepted: 12/25/2017] [Indexed: 11/19/2022] Open
Abstract
Previous studies have shown that translation elongation is regulated by multiple factors, but the observed heterogeneity remains only partially explained. To dissect quantitatively the different determinants of elongation speed, we use probabilistic modeling to estimate initiation and local elongation rates from ribosome profiling data. This model-based approach allows us to quantify the extent of interference between ribosomes on the same transcript. We show that neither interference nor the distribution of slow codons is sufficient to explain the observed heterogeneity. Instead, we find that electrostatic interactions between the ribosomal exit tunnel and specific parts of the nascent polypeptide govern the elongation rate variation as the polypeptide makes its initial pass through the tunnel. Once the N-terminus has escaped the tunnel, the hydropathy of the nascent polypeptide within the ribosome plays a major role in modulating the speed. We show that our results are consistent with the biophysical properties of the tunnel.
Collapse
|
40
|
Abstract
Most biological mechanisms involve more than one type of biomolecule, and hence operate not solely at the level of either genome, transcriptome, proteome, metabolome or ionome. Datasets resulting from single-omic analysis are rapidly increasing in throughput and quality, rendering multi-omic studies feasible. These should offer a comprehensive, structured and interactive overview of a biological mechanism. However, combining single-omic datasets in a meaningful manner has so far proved challenging, and the discovery of new biological information lags behind expectation. One reason is that experiments conducted in different laboratories can typically not to be combined without restriction. Second, the interpretation of multi-omic datasets represents a significant challenge by nature, as the biological datasets are heterogeneous not only for technical, but also for biological, chemical, and physical reasons. Here, multi-layer network theory and methods of artificial intelligence might contribute to solve these problems. For the efficient application of machine learning however, biological datasets need to become more systematic, more precise - and much larger. We conclude our review with basic guidelines for the successful set-up of a multi-omic experiment.
Collapse
|
41
|
Cheng J, Maier KC, Avsec Ž, Rus P, Gagneur J. Cis-regulatory elements explain most of the mRNA stability variation across genes in yeast. RNA (NEW YORK, N.Y.) 2017; 23:1648-1659. [PMID: 28802259 PMCID: PMC5648033 DOI: 10.1261/rna.062224.117] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2017] [Accepted: 07/31/2017] [Indexed: 05/09/2023]
Abstract
The stability of mRNA is one of the major determinants of gene expression. Although a wealth of sequence elements regulating mRNA stability has been described, their quantitative contributions to half-life are unknown. Here, we built a quantitative model for Saccharomyces cerevisiae based on functional mRNA sequence features that explains 59% of the half-life variation between genes and predicts half-life at a median relative error of 30%. The model revealed a new destabilizing 3' UTR motif, ATATTC, which we functionally validated. Codon usage proves to be the major determinant of mRNA stability. Nonetheless, single-nucleotide variations have the largest effect when occurring on 3' UTR motifs or upstream AUGs. Analyzing mRNA half-life data of 34 knockout strains showed that the effect of codon usage not only requires functional decapping and deadenylation, but also the 5'-to-3' exonuclease Xrn1, the nonsense-mediated decay genes, but not no-go decay. Altogether, this study quantitatively delineates the contributions of mRNA sequence features on stability in yeast, reveals their functional dependencies on degradation pathways, and allows accurate prediction of half-life from mRNA sequence.
Collapse
Affiliation(s)
- Jun Cheng
- Department of Informatics, Technical University of Munich, 85748 Garching, Germany
- Graduate School of Quantitative Biosciences (QBM), Ludwig-Maximilians-Universität München, 81377 München, Germany
| | - Kerstin C Maier
- Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, 37077 Göttingen, Germany
| | - Žiga Avsec
- Department of Informatics, Technical University of Munich, 85748 Garching, Germany
- Graduate School of Quantitative Biosciences (QBM), Ludwig-Maximilians-Universität München, 81377 München, Germany
| | - Petra Rus
- Department of Molecular Biology, Max Planck Institute for Biophysical Chemistry, 37077 Göttingen, Germany
| | - Julien Gagneur
- Department of Informatics, Technical University of Munich, 85748 Garching, Germany
- Graduate School of Quantitative Biosciences (QBM), Ludwig-Maximilians-Universität München, 81377 München, Germany
| |
Collapse
|
42
|
Zarai Y, Margaliot M, Tuller T. Ribosome flow model with extended objects. J R Soc Interface 2017; 14:rsif.2017.0128. [PMID: 29021157 DOI: 10.1098/rsif.2017.0128] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2017] [Accepted: 09/18/2017] [Indexed: 02/06/2023] Open
Abstract
We study a deterministic mechanistic model for the flow of ribosomes along the mRNA molecule, called the ribosome flow model with extended objects (RFMEO). This model encapsulates many realistic features of translation including non-homogeneous transition rates along mRNA, the fact that every ribosome covers several codons, and the fact that ribosomes cannot overtake one another. The RFMEO is a mean-field approximation of an important model from statistical mechanics called the totally asymmetric simple exclusion process with extended objects (TASEPEO). We demonstrate that the RFMEO describes biophysical aspects of translation better than previous mean-field approximations, and that its predictions correlate well with those of TASEPEO. However, unlike TASEPEO, the RFMEO is amenable to rigorous analysis using tools from systems and control theory. We show that the ribosome density profile along the mRNA in the RFMEO converges to a unique steady-state density that depends on the length of the mRNA, the transition rates along it, and the number of codons covered by every ribosome, but not on the initial density of ribosomes along the mRNA. In particular, the protein production rate also converges to a unique steady state. Furthermore, if the transition rates along the mRNA are periodic with a common period T then the ribosome density along the mRNA and the protein production rate converge to a unique periodic pattern with period T, that is, the model entrains to periodic excitations in the transition rates. Analysis and simulations of the RFMEO demonstrate several counterintuitive results. For example, increasing the ribosome footprint may sometimes lead to an increase in the production rate. Also, for large values of the footprint the steady-state density along the mRNA may be quite complex (e.g. with quasi-periodic patterns) even for relatively simple (and non-periodic) transition rates along the mRNA. This implies that inferring the transition rates from the ribosome density may be non-trivial. We believe that the RFMEO could be useful for modelling, understanding and re-engineering translation as well as other important biological processes.
Collapse
Affiliation(s)
- Yoram Zarai
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel
| | - Michael Margaliot
- Department of Electrical Engineering Systems, Tel Aviv University, Tel Aviv, Israel
| | - Tamir Tuller
- Department of Biomedical Engineering, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
43
|
Abstract
A general means of viral attenuation involves the extensive recoding of synonymous codons in the viral genome. The mechanistic underpinnings of this approach remain unclear, however. Using quantitative proteomics and RNA sequencing, we explore the molecular basis of attenuation in a strain of bacteriophage T7 whose major capsid gene was engineered to carry 182 suboptimal codons. We do not detect transcriptional effects from recoding. Proteomic observations reveal that translation is halved for the recoded major capsid gene, and a more modest reduction applies to several coexpressed downstream genes. We observe no changes in protein abundances of other coexpressed genes that are encoded upstream. Viral burst size, like capsid protein abundance, is also decreased by half. Together, these observations suggest that, in this virus, reduced translation of an essential polycistronic transcript and diminished virion assembly form the molecular basis of attenuation.
Collapse
|
44
|
Abstract
The ribosome flow model on a ring (RFMR) is a deterministic model for ribosome flow along a circularized mRNA. We derive a new spectral representation for the optimal steady-state production rate and the corresponding optimal steady-state ribosomal density in the RFMR. This representation has several important advantages. First, it provides a simple and numerically stable algorithm for determining the optimal values even in very long rings. Second, it enables efficient computation of the sensitivity of the optimal production rate to small changes in the transition rates along the mRNA. Third, it implies that the optimal steady-state production rate is a strictly concave function of the transition rates. Maximizing the optimal steady-state production rate with respect to the rates under an affine constraint on the rates thus becomes a convex optimization problem that admits a unique solution. This solution can be determined numerically using highly efficient algorithms. This optimization problem is important, for example, when re-engineering heterologous genes in a host organism. We describe the implications of our results to this and other aspects of translation.
Collapse
|
45
|
Sharma AK, O'Brien EP. Increasing Protein Production Rates Can Decrease the Rate at Which Functional Protein Is Produced and Their Steady-State Levels. J Phys Chem B 2017. [PMID: 28650169 DOI: 10.1021/acs.jpcb.7b01700] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The rate at which soluble, functional protein is produced by the ribosome has recently been found to vary in complex and unexplained ways as various translation-associated rates are altered through synonymous codon substitutions. To understand this phenomenon, here, we combine a well-established ribosome-traffic model with a master-equation model of cotranslational domain folding to explore the scenarios that are possible for the protein production rate, J, and the functional-nascent protein production rate, F, as the rates of various translation processes are altered for five different E. coli proteins. We find that while J monotonically increases as the rates of translation-initiation, -elongation, and -termination increase, F can either increase or decrease. We show that F's nonmonotonic behavior arises within the model from two opposing trends: the tendency for increased translation rates to produce more total protein but less cotranslationally folded protein. We further demonstrate that under certain conditions these nonmonotonic changes in F can result in nonmonotonic variations in post-translational, steady-state levels of functional protein. These results provide a potential explanation for recent experimental observations in which the specific activity of enzymatic proteins decreased with increased synthesis rates. Additionally our model has the potential to be used to rationally design transcripts to maximize the production of functional nascent protein by simultaneously optimizing translation initiation, elongation, and termination rates.
Collapse
Affiliation(s)
- Ajeet K Sharma
- Department of Chemistry, Pennsylvania State University , University Park, Pennsylvania 16802, United States
| | - Edward P O'Brien
- Department of Chemistry, Pennsylvania State University , University Park, Pennsylvania 16802, United States
| |
Collapse
|