1
|
Potera K, Tomala K. Using yeasts for the studies of nonfunctional factors in protein evolution. Yeast 2024. [PMID: 38895906 DOI: 10.1002/yea.3970] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 05/08/2024] [Accepted: 06/06/2024] [Indexed: 06/21/2024] Open
Abstract
The evolution of protein sequence is driven not only by factors directly related to protein function and shape but also by nonfunctional factors. Such factors in protein evolution might be categorized as those connected to energetic costs, synthesis efficiency, and avoidance of misfolding and toxicity. A common approach to studying them is correlational analysis contrasting them with some characteristics of the protein, like amino acid composition, but these features are interdependent. To avoid possible bias, empirical studies are needed, and not enough work has been done to date. In this review, we describe the role of nonfunctional factors in protein evolution and present an experimental approach using yeast as a suitable model organism. The focus of the proposed approach is on the potential negative impact on the fitness of mutations that change protein properties not related to function and the frequency of mutations that change these properties. Experimental results of testing the misfolding avoidance hypothesis as an explanation for why highly expressed proteins evolve slowly are inconsistent with correlational research results. Therefore, more efforts should be made to empirically test the effects of nonfunctional factors in protein evolution and to contrast these results with the results of the correlational analysis approach.
Collapse
Affiliation(s)
- Katarzyna Potera
- Faculty of Biology, Institute of Environmental Sciences, Jagiellonian University, Krakow, Poland
- Doctoral School of Exact and Natural Sciences, Jagiellonian University, Krakow, Poland
| | - Katarzyna Tomala
- Faculty of Biology, Institute of Environmental Sciences, Jagiellonian University, Krakow, Poland
| |
Collapse
|
2
|
Roeselová A, Maslen SL, Shivakumaraswamy S, Pellowe GA, Howell S, Joshi D, Redmond J, Kjær S, Skehel JM, Balchin D. Mechanism of chaperone coordination during cotranslational protein folding in bacteria. Mol Cell 2024:S1097-2765(24)00480-5. [PMID: 38908370 DOI: 10.1016/j.molcel.2024.06.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Revised: 04/12/2024] [Accepted: 06/01/2024] [Indexed: 06/24/2024]
Abstract
Protein folding is assisted by molecular chaperones that bind nascent polypeptides during mRNA translation. Several structurally distinct classes of chaperones promote de novo folding, suggesting that their activities are coordinated at the ribosome. We used biochemical reconstitution and structural proteomics to explore the molecular basis for cotranslational chaperone action in bacteria. We found that chaperone binding is disfavored close to the ribosome, allowing folding to precede chaperone recruitment. Trigger factor recognizes compact folding intermediates that expose an extensive unfolded surface, and dictates DnaJ access to nascent chains. DnaJ uses a large surface to bind structurally diverse intermediates and recruits DnaK to sequence-diverse solvent-accessible sites. Neither Trigger factor, DnaJ, nor DnaK destabilize cotranslational folding intermediates. Instead, the chaperones collaborate to protect incipient structure in the nascent polypeptide well beyond the ribosome exit tunnel. Our findings show how the chaperone network selects and modulates cotranslational folding intermediates.
Collapse
Affiliation(s)
- Alžběta Roeselová
- Protein Biogenesis Laboratory, The Francis Crick Institute, London NW1 1AT, UK
| | - Sarah L Maslen
- Proteomics Science Technology Platform, The Francis Crick Institute, London NW1 1AT, UK
| | | | - Grant A Pellowe
- Protein Biogenesis Laboratory, The Francis Crick Institute, London NW1 1AT, UK
| | - Steven Howell
- Proteomics Science Technology Platform, The Francis Crick Institute, London NW1 1AT, UK
| | - Dhira Joshi
- Chemical Biology Science Technology Platform, The Francis Crick Institute, London NW1 1AT, UK
| | - Joanna Redmond
- Chemical Biology Science Technology Platform, The Francis Crick Institute, London NW1 1AT, UK
| | - Svend Kjær
- Structural Biology Science Technology Platform, The Francis Crick Institute, London NW1 1AT, UK
| | - J Mark Skehel
- Proteomics Science Technology Platform, The Francis Crick Institute, London NW1 1AT, UK
| | - David Balchin
- Protein Biogenesis Laboratory, The Francis Crick Institute, London NW1 1AT, UK.
| |
Collapse
|
3
|
Luzuriaga-Neira AR, Ritchie AM, Payne BL, Carrillo-Parramon O, Liberles DA, Alvarez-Ponce D. Highly Abundant Proteins Are Highly Thermostable. Genome Biol Evol 2023; 15:evad112. [PMID: 37399326 DOI: 10.1093/gbe/evad112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/08/2023] [Indexed: 07/05/2023] Open
Abstract
Highly abundant proteins tend to evolve slowly (a trend called E-R anticorrelation), and a number of hypotheses have been proposed to explain this phenomenon. The misfolding avoidance hypothesis attributes the E-R anticorrelation to the abundance-dependent toxic effects of protein misfolding. To avoid these toxic effects, protein sequences (particularly those of highly expressed proteins) would be under selection to fold properly. One prediction of the misfolding avoidance hypothesis is that highly abundant proteins should exhibit high thermostability (i.e., a highly negative free energy of folding, ΔG). Thus far, only a handful of analyses have tested for a relationship between protein abundance and thermostability, producing contradictory results. These analyses have been limited by 1) the scarcity of ΔG data, 2) the fact that these data have been obtained by different laboratories and under different experimental conditions, 3) the problems associated with using proteins' melting energy (Tm) as a proxy for ΔG, and 4) the difficulty of controlling for potentially confounding variables. Here, we use computational methods to compare the free energy of folding of pairs of human-mouse orthologous proteins with different expression levels. Even though the effect size is limited, the most highly expressed ortholog is often the one with a more negative ΔG of folding, indicating that highly expressed proteins are often more thermostable.
Collapse
Affiliation(s)
| | - Andrew M Ritchie
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, Pennsylvania, USA
| | | | | | - David A Liberles
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, Pennsylvania, USA
| | | |
Collapse
|
4
|
Christensen S, Rämisch S, André I. DnaK response to expression of protein mutants is dependent on translation rate and stability. Commun Biol 2022; 5:597. [PMID: 35710941 PMCID: PMC9203555 DOI: 10.1038/s42003-022-03542-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2021] [Accepted: 05/31/2022] [Indexed: 11/09/2022] Open
Abstract
Chaperones play a central part in the quality control system in cells by clearing misfolded and aggregated proteins. The chaperone DnaK acts as a sensor for molecular stress by recognising short hydrophobic stretches of misfolded proteins. As the level of unfolded protein is a function of protein stability, we hypothesised that the level of DnaK response upon overexpression of recombinant proteins would be correlated to stability. Using a set of mutants of the λ-repressor with varying thermal stabilities and a fluorescent reporter system, the effect of stability on DnaK response and protein abundance was investigated. Our results demonstrate that the initial DnaK response is largely dependent on protein synthesis rate but as the recombinantly expressed protein accumulates and homeostasis is approached the response correlates strongly with stability. Furthermore, we observe a large degree of cell-cell variation in protein abundance and DnaK response in more stable proteins.
Collapse
Affiliation(s)
- Signe Christensen
- Department of Biochemistry and Structural Biology, Lund University, Lund, Sweden.
| | | | - Ingemar André
- Department of Biochemistry and Structural Biology, Lund University, Lund, Sweden.
| |
Collapse
|
5
|
Maddamsetti R. Universal Constraints on Protein Evolution in the Long-Term Evolution Experiment with Escherichia coli. Genome Biol Evol 2021; 13:evab070. [PMID: 33856016 PMCID: PMC8233687 DOI: 10.1093/gbe/evab070] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/30/2021] [Indexed: 12/18/2022] Open
Abstract
Although it is well known that abundant proteins evolve slowly across the tree of life, there is little consensus for why this is true. Here, I report that abundant proteins evolve slowly in the hypermutator populations of Lenski's long-term evolution experiment with Escherichia coli (LTEE). Specifically, the density of all observed mutations per gene, as measured in metagenomic time series covering 60,000 generations of the LTEE, significantly anticorrelates with mRNA abundance, protein abundance, and degree of protein-protein interaction. The same pattern holds for nonsynonymous mutation density. However, synonymous mutation density, measured across the LTEE hypermutator populations, positively correlates with protein abundance. These results show that universal constraints on protein evolution are visible in data spanning three decades of experimental evolution. Therefore, it should be possible to design experiments to answer why abundant proteins evolve slowly.
Collapse
Affiliation(s)
- Rohan Maddamsetti
- Department of Biomedical Engineering, Duke University, Durham, North Carolina, USA
| |
Collapse
|
6
|
Dubreuil B, Levy ED. Abundance Imparts Evolutionary Constraints of Similar Magnitude on the Buried, Surface, and Disordered Regions of Proteins. Front Mol Biosci 2021; 8:626729. [PMID: 33996892 PMCID: PMC8119896 DOI: 10.3389/fmolb.2021.626729] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 03/29/2021] [Indexed: 12/02/2022] Open
Abstract
An understanding of the forces shaping protein conservation is key, both for the fundamental knowledge it represents and to allow for optimal use of evolutionary information in practical applications. Sequence conservation is typically examined at one of two levels. The first is a residue-level, where intra-protein differences are analyzed and the second is a protein-level, where inter-protein differences are studied. At a residue level, we know that solvent-accessibility is a prime determinant of conservation. By inverting this logic, we inferred that disordered regions are slightly more solvent-accessible on average than the most exposed surface residues in domains. By integrating abundance information with evolutionary data within and across proteins, we confirmed a previously reported strong surface-core association in the evolution of structured regions, but we found a comparatively weak association between disordered and structured regions. The facts that disordered and structured regions experience different structural constraints and evolve independently provide a unique setup to examine an outstanding question: why is a protein’s abundance the main determinant of its sequence conservation? Indeed, any structural or biophysical property linked to the abundance-conservation relationship should increase the relative conservation of regions concerned with that property (e.g., disordered residues with mis-interactions, domain residues with misfolding). Surprisingly, however, we found the conservation of disordered and structured regions to increase in equal proportion with abundance. This observation implies that either abundance-related constraints are structure-independent, or multiple constraints apply to different regions and perfectly balance each other.
Collapse
Affiliation(s)
- Benjamin Dubreuil
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Emmanuel D Levy
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
7
|
Razban RM, Dasmeh P, Serohijos AWR, Shakhnovich EI. Avoidance of protein unfolding constrains protein stability in long-term evolution. Biophys J 2021; 120:2413-2424. [PMID: 33932438 PMCID: PMC8390877 DOI: 10.1016/j.bpj.2021.03.042] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Revised: 02/24/2021] [Accepted: 03/17/2021] [Indexed: 11/28/2022] Open
Abstract
Every amino acid residue can influence a protein's overall stability, making stability highly susceptible to change throughout evolution. We consider the distribution of protein stabilities evolutionarily permittable under two previously reported protein fitness functions: flux dynamics and misfolding avoidance. We develop an evolutionary dynamics theory and find that it agrees better with an extensive protein stability data set for dihydrofolate reductase orthologs under the misfolding avoidance fitness function rather than the flux dynamics fitness function. Further investigation with ribonuclease H data demonstrates that not any misfolded state is avoided; rather, it is only the unfolded state. At the end, we discuss how our work pertains to the universal protein abundance-evolutionary rate correlation seen across organisms' proteomes. We derive a closed-form expression relating protein abundance to evolutionary rate that captures Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens experimental trends without fitted parameters.
Collapse
Affiliation(s)
- Rostam M Razban
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - Pouria Dasmeh
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts; Departement de Biochimie, Université de Montréal, Montreal, Quebec, Canada
| | | | - Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts.
| |
Collapse
|
8
|
Usmanova DR, Plata G, Vitkup D. The Relationship between the Misfolding Avoidance Hypothesis and Protein Evolutionary Rates in the Light of Empirical Evidence. Genome Biol Evol 2021; 13:6081017. [PMID: 33432359 PMCID: PMC7874998 DOI: 10.1093/gbe/evab006] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/07/2021] [Indexed: 12/14/2022] Open
Abstract
For more than a decade, the misfolding avoidance hypothesis (MAH) and related theories have dominated evolutionary discussions aimed at explaining the variance of the molecular clock across cellular proteins. In this study, we use various experimental data to further investigate the consistency of the MAH predictions with empirical evidence. We also critically discuss experimental results that motivated the MAH development and that are often viewed as evidence of its major contribution to the variability of protein evolutionary rates. We demonstrate, in Escherichia coli and Homo sapiens, the lack of a substantial negative correlation between protein evolutionary rates and Gibbs free energies of unfolding, a direct measure of protein stability. We then analyze multiple new genome-scale data sets characterizing protein aggregation and interaction propensities, the properties that are likely optimized in evolution to alleviate deleterious effects associated with toxic protein misfolding and misinteractions. Our results demonstrate that the propensity of proteins to aggregate, the fraction of charged amino acids, and protein stickiness do correlate with protein abundances. Nevertheless, across multiple organisms and various data sets we do not observe substantial correlations between proteins’ aggregation- and stability-related properties and evolutionary rates. Therefore, diverse empirical data support the conclusion that the MAH and similar hypotheses do not play a major role in mediating a strong negative correlation between protein expression and the molecular clock, and thus in explaining the variability of evolutionary rates across cellular proteins.
Collapse
Affiliation(s)
- Dinara R Usmanova
- Department of Systems Biology, Columbia University, New York, NY, USA
| | - Germán Plata
- Department of Systems Biology, Columbia University, New York, NY, USA.,Elanco Animal Health, Greenfield, IN, USA
| | - Dennis Vitkup
- Department of Systems Biology, Columbia University, New York, NY, USA.,Department of Biomedical Informatics, Columbia University, New York, NY, USA
| |
Collapse
|
9
|
Abstract
Darwin's theory of evolution emphasized that positive selection of functional proficiency provides the fitness that ultimately determines the structure of life, a view that has dominated biochemical thinking of enzymes as perfectly optimized for their specific functions. The 20th-century modern synthesis, structural biology, and the central dogma explained the machinery of evolution, and nearly neutral theory explained how selection competes with random fixation dynamics that produce molecular clocks essential e.g. for dating evolutionary histories. However, quantitative proteomics revealed that selection pressures not relating to optimal function play much larger roles than previously thought, acting perhaps most importantly via protein expression levels. This paper first summarizes recent progress in the 21st century toward recovering this universal selection pressure. Then, the paper argues that proteome cost minimization is the dominant, underlying 'non-function' selection pressure controlling most of the evolution of already functionally adapted living systems. A theory of proteome cost minimization is described and argued to have consequences for understanding evolutionary trade-offs, aging, cancer, and neurodegenerative protein-misfolding diseases.
Collapse
|
10
|
Razban RM. Protein Melting Temperature Cannot Fully Assess Whether Protein Folding Free Energy Underlies the Universal Abundance-Evolutionary Rate Correlation Seen in Proteins. Mol Biol Evol 2019; 36:1955-1963. [PMID: 31093676 PMCID: PMC6736436 DOI: 10.1093/molbev/msz119] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The protein misfolding avoidance hypothesis explains the universal negative correlation between protein abundance and sequence evolutionary rate across the proteome by identifying protein folding free energy (ΔG) as the confounding variable. Abundant proteins resist toxic misfolding events by being more stable, and more stable proteins evolve slower because their mutations are more destabilizing. Direct supporting evidence consists only of computer simulations. A study taking advantage of a recent experimental breakthrough in measuring protein stability proteome-wide through melting temperature (Tm) (Leuenberger et al. 2017), found weak misfolding avoidance hypothesis support for the Escherichia coli proteome, and no support for the Saccharomyces cerevisiae, Homo sapiens, and Thermus thermophilus proteomes (Plata and Vitkup 2018). I find that the nontrivial relationship between Tm and ΔG and inaccuracy in Tm measurements by Leuenberger et al. 2017 can be responsible for not observing strong positive abundance-Tm and strong negative Tm-evolutionary rate correlations.
Collapse
Affiliation(s)
- Rostam M Razban
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA
| |
Collapse
|
11
|
Sanchez de Groot N, Torrent Burgas M, Ravarani CN, Trusina A, Ventura S, Babu MM. The fitness cost and benefit of phase-separated protein deposits. Mol Syst Biol 2019; 15:e8075. [PMID: 30962358 PMCID: PMC6452874 DOI: 10.15252/msb.20178075] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Phase separation of soluble proteins into insoluble deposits is associated with numerous diseases. However, protein deposits can also function as membrane-less compartments for many cellular processes. What are the fitness costs and benefits of forming such deposits in different conditions? Using a model protein that phase-separates into deposits, we distinguish and quantify the fitness contribution due to the loss or gain of protein function and deposit formation in yeast. The environmental condition and the cellular demand for the protein function emerge as key determinants of fitness. Protein deposit formation can influence cell-to-cell variation in free protein abundance between individuals of a cell population (i.e., gene expression noise). This results in variable manifestation of protein function and a continuous range of phenotypes in a cell population, favoring survival of some individuals in certain environments. Thus, protein deposit formation by phase separation might be a mechanism to sense protein concentration in cells and to generate phenotypic variability. The selectable phenotypic variability, previously described for prions, could be a general property of proteins that can form phase-separated assemblies and may influence cell fitness.
Collapse
Affiliation(s)
- Natalia Sanchez de Groot
- Medical Research Council Laboratory of Molecular Biology, Cambridge, UK .,Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG), Barcelona, Spain.,Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Marc Torrent Burgas
- Medical Research Council Laboratory of Molecular Biology, Cambridge, UK.,Systems Biology of Infection Lab, Department of Biochemistry and Molecular Biology, Universitat Autònoma de Barcelona, Barcelona, Spain
| | | | - Ala Trusina
- Niels Bohr Institute, University of Copenhagen, Copenhagen, Denmark
| | - Salvador Ventura
- Institut de Biotecnologia i Biomedicina and Departament de Bioquímica i Biologia Molecular, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - M Madan Babu
- Medical Research Council Laboratory of Molecular Biology, Cambridge, UK
| |
Collapse
|
12
|
Farkas Z, Kalapis D, Bódi Z, Szamecz B, Daraba A, Almási K, Kovács K, Boross G, Pál F, Horváth P, Balassa T, Molnár C, Pettkó-Szandtner A, Klement É, Rutkai E, Szvetnik A, Papp B, Pál C. Hsp70-associated chaperones have a critical role in buffering protein production costs. eLife 2018; 7:29845. [PMID: 29377792 PMCID: PMC5788500 DOI: 10.7554/elife.29845] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2017] [Accepted: 12/23/2017] [Indexed: 02/01/2023] Open
Abstract
Proteins are necessary for cellular growth. Concurrently, however, protein production has high energetic demands associated with transcription and translation. Here, we propose that activity of molecular chaperones shape protein burden, that is the fitness costs associated with expression of unneeded proteins. To test this hypothesis, we performed a genome-wide genetic interaction screen in baker's yeast. Impairment of transcription, translation, and protein folding rendered cells hypersensitive to protein burden. Specifically, deletion of specific regulators of the Hsp70-associated chaperone network increased protein burden. In agreement with expectation, temperature stress, increased mistranslation and a chemical misfolding agent all substantially enhanced protein burden. Finally, unneeded protein perturbed interactions between key components of the Hsp70-Hsp90 network involved in folding of native proteins. We conclude that specific chaperones contribute to protein burden. Our work indicates that by minimizing the damaging impact of gratuitous protein overproduction, chaperones enable tolerance to massive changes in genomic expression.
Collapse
Affiliation(s)
- Zoltán Farkas
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Dorottya Kalapis
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Zoltán Bódi
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Béla Szamecz
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Andreea Daraba
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Karola Almási
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Károly Kovács
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Gábor Boross
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Ferenc Pál
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Péter Horváth
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Tamás Balassa
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Csaba Molnár
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Aladár Pettkó-Szandtner
- Institute of Plant Biology, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary.,Laboratory of Proteomic Research, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Éva Klement
- Laboratory of Proteomic Research, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Edit Rutkai
- Division for Biotechnology, Bay Zoltán Nonprofit Ltd, Budapest, Hungary
| | - Attila Szvetnik
- Division for Biotechnology, Bay Zoltán Nonprofit Ltd, Budapest, Hungary
| | - Balázs Papp
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| | - Csaba Pál
- Synthetic and Systems Biology Unit, Institute of Biochemistry, Biological Research Centre of the Hungarian Academy of Sciences, Szeged, Hungary
| |
Collapse
|
13
|
Plata G, Vitkup D. Protein Stability and Avoidance of Toxic Misfolding Do Not Explain the Sequence Constraints of Highly Expressed Proteins. Mol Biol Evol 2017; 35:700-703. [PMID: 29309671 DOI: 10.1093/molbev/msx323] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The avoidance of cytotoxic effects associated with protein misfolding has been proposed as a dominant constraint on the sequence evolution and molecular clock of highly expressed proteins. Recently, Leuenberger et al. developed an elegant experimental approach to measure protein thermal stability at the proteome scale. The collected data allow us to rigorously test the predictions of the misfolding avoidance hypothesis that highly expressed proteins have evolved to be more stable, and that maintaining thermodynamic stability significantly constrains their evolution. Notably, reanalysis of the Leuenberger et al. data across four different organisms reveals no substantial correlation between protein stability and protein abundance. Therefore, the key predictions of the misfolding toxicity and related hypotheses are not supported by available empirical data. The data also suggest that, regardless of protein expression, protein stability does not substantially affect the protein molecular clock across organisms.
Collapse
Affiliation(s)
- Germán Plata
- Department of Systems Biology, Columbia University, New York, NY
| | - Dennis Vitkup
- Department of Systems Biology, Columbia University, New York, NY.,Department of Biomedical Informatics, Columbia University, New York, NY
| |
Collapse
|
14
|
Omer S, Harlow TJ, Gogarten JP. Does Sequence Conservation Provide Evidence for Biological Function? Trends Microbiol 2017; 25:11-18. [DOI: 10.1016/j.tim.2016.09.010] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2016] [Revised: 09/16/2016] [Accepted: 09/22/2016] [Indexed: 01/14/2023]
|
15
|
Abstract
Protein metabolism is one of the most costly processes in the cell and is therefore expected to be under the effective control of natural selection. We stimulated yeast strains to overexpress each single gene product to approximately 1% of the total protein content. Consistent with previous reports, we found that excessive expression of proteins containing disordered or membrane-protruding regions resulted in an especially high fitness cost. We estimated these costs to be nearly twice as high as for other proteins. There was a ten-fold difference in cost if, instead of entire proteins, only the disordered or membrane-embedded regions were compared with other segments. Although the cost of processing bulk protein was measurable, it could not be explained by several tested protein features, including those linked to translational efficiency or intensity of physical interactions after maturation. It most likely included a number of individually indiscernible effects arising during protein synthesis, maturation, maintenance, (mal)functioning, and disposal. When scaled to the levels normally achieved by proteins in the cell, the fitness cost of dealing with one amino acid in a standard protein appears to be generally very low. Many single amino acid additions or deletions are likely to be neutral even if the effective population size is as large as that of the budding yeast. This should also apply to substitutions. Selection is much more likely to operate if point mutations affect protein structure by, for example, extending or creating stretches that tend to unfold or interact improperly with membranes.
Collapse
|
16
|
Bienick MS, Young KW, Klesmith JR, Detwiler EE, Tomek KJ, Whitehead TA. The interrelationship between promoter strength, gene expression, and growth rate. PLoS One 2014; 9:e109105. [PMID: 25286161 DOI: 10.1371/journal.pone.0109105.s008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2014] [Accepted: 09/03/2014] [Indexed: 05/27/2023] Open
Abstract
In exponentially growing bacteria, expression of heterologous protein impedes cellular growth rates. Quantitative understanding of the relationship between expression and growth rate will advance our ability to forward engineer bacteria, important for metabolic engineering and synthetic biology applications. Recently, a work described a scaling model based on optimal allocation of ribosomes for protein translation. This model quantitatively predicts a linear relationship between microbial growth rate and heterologous protein expression with no free parameters. With the aim of validating this model, we have rigorously quantified the fitness cost of gene expression by using a library of synthetic constitutive promoters to drive expression of two separate proteins (eGFP and amiE) in E. coli in different strains and growth media. In all cases, we demonstrate that the fitness cost is consistent with the previous findings. We expand upon the previous theory by introducing a simple promoter activity model to quantitatively predict how basal promoter strength relates to growth rate and protein expression. We then estimate the amount of protein expression needed to support high flux through a heterologous metabolic pathway and predict the sizable fitness cost associated with enzyme production. This work has broad implications across applied biological sciences because it allows for prediction of the interplay between promoter strength, protein expression, and the resulting cost to microbial growth rates.
Collapse
Affiliation(s)
- Matthew S Bienick
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, Michigan, United States of America
| | - Katherine W Young
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, Michigan, United States of America; High School Honors Science/Mathematics/Engineering Program, Michigan State University, East Lansing, Michigan, United States of America
| | - Justin R Klesmith
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan, United States of America
| | - Emily E Detwiler
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, Michigan, United States of America
| | - Kyle J Tomek
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, Michigan, United States of America
| | - Timothy A Whitehead
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, Michigan, United States of America; Department of Biosystems and Agricultural Engineering, Michigan State University, East Lansing, Michigan, United States of America
| |
Collapse
|
17
|
Bienick MS, Young KW, Klesmith JR, Detwiler EE, Tomek KJ, Whitehead TA. The interrelationship between promoter strength, gene expression, and growth rate. PLoS One 2014; 9:e109105. [PMID: 25286161 PMCID: PMC4186888 DOI: 10.1371/journal.pone.0109105] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2014] [Accepted: 09/03/2014] [Indexed: 11/19/2022] Open
Abstract
In exponentially growing bacteria, expression of heterologous protein impedes cellular growth rates. Quantitative understanding of the relationship between expression and growth rate will advance our ability to forward engineer bacteria, important for metabolic engineering and synthetic biology applications. Recently, a work described a scaling model based on optimal allocation of ribosomes for protein translation. This model quantitatively predicts a linear relationship between microbial growth rate and heterologous protein expression with no free parameters. With the aim of validating this model, we have rigorously quantified the fitness cost of gene expression by using a library of synthetic constitutive promoters to drive expression of two separate proteins (eGFP and amiE) in E. coli in different strains and growth media. In all cases, we demonstrate that the fitness cost is consistent with the previous findings. We expand upon the previous theory by introducing a simple promoter activity model to quantitatively predict how basal promoter strength relates to growth rate and protein expression. We then estimate the amount of protein expression needed to support high flux through a heterologous metabolic pathway and predict the sizable fitness cost associated with enzyme production. This work has broad implications across applied biological sciences because it allows for prediction of the interplay between promoter strength, protein expression, and the resulting cost to microbial growth rates.
Collapse
Affiliation(s)
- Matthew S. Bienick
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, Michigan, United States of America
| | - Katherine W. Young
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, Michigan, United States of America
- High School Honors Science/Mathematics/Engineering Program, Michigan State University, East Lansing, Michigan, United States of America
| | - Justin R. Klesmith
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan, United States of America
| | - Emily E. Detwiler
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, Michigan, United States of America
| | - Kyle J. Tomek
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, Michigan, United States of America
| | - Timothy A. Whitehead
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, Michigan, United States of America
- Department of Biosystems and Agricultural Engineering, Michigan State University, East Lansing, Michigan, United States of America
| |
Collapse
|
18
|
Margaliot M, Sontag ED, Tuller T. Entrainment to periodic initiation and transition rates in a computational model for gene translation. PLoS One 2014; 9:e96039. [PMID: 24800863 PMCID: PMC4011696 DOI: 10.1371/journal.pone.0096039] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2013] [Accepted: 04/02/2014] [Indexed: 01/09/2023] Open
Abstract
Periodic oscillations play an important role in many biomedical systems. Proper functioning of biological systems that respond to periodic signals requires the ability to synchronize with the periodic excitation. For example, the sleep/wake cycle is a manifestation of an internal timing system that synchronizes to the solar day. In the terminology of systems theory, the biological system must entrain or phase-lock to the periodic excitation. Entrainment is also important in synthetic biology. For example, connecting several artificial biological systems that entrain to a common clock may lead to a well-functioning modular system. The cell-cycle is a periodic program that regulates DNA synthesis and cell division. Recent biological studies suggest that cell-cycle related genes entrain to this periodic program at the gene translation level, leading to periodically-varying protein levels of these genes. The ribosome flow model (RFM) is a deterministic model obtained via a mean-field approximation of a stochastic model from statistical physics that has been used to model numerous processes including ribosome flow along the mRNA. Here we analyze the RFM under the assumption that the initiation and/or transition rates vary periodically with a common period . We show that the ribosome distribution profile in the RFM entrains to this periodic excitation. In particular, the protein synthesis pattern converges to a unique periodic solution with period . To the best of our knowledge, this is the first proof of entrainment in a mathematical model for translation that encapsulates aspects such as initiation and termination rates, ribosomal movement and interactions, and non-homogeneous elongation speeds along the mRNA. Our results support the conjecture that periodic oscillations in tRNA levels and other factors related to the translation process can induce periodic oscillations in protein levels, and may suggest a new approach for re-engineering genetic systems to obtain a desired, periodic, protein synthesis rate.
Collapse
Affiliation(s)
- Michael Margaliot
- School of Electrical Engineering and the Sagol School of Neuroscience, Tel-Aviv University, Tel-Aviv, Israel
| | - Eduardo D. Sontag
- Dept. of Mathematics and Cancer Institute of New Jersey, Rutgers University, Piscataway, New Jersey, United States of America
| | - Tamir Tuller
- Dept. of Biomedical Engineering and the Sagol School of Neuroscience, Tel-Aviv University, Tel-Aviv, Israel
- * E-mail:
| |
Collapse
|
19
|
Navarro S, Villar-Piqué A, Ventura S. Selection against toxic aggregation-prone protein sequences in bacteria. BIOCHIMICA ET BIOPHYSICA ACTA-MOLECULAR CELL RESEARCH 2014; 1843:866-74. [DOI: 10.1016/j.bbamcr.2014.01.020] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2013] [Revised: 01/16/2014] [Accepted: 01/21/2014] [Indexed: 01/23/2023]
|
20
|
Kepp KP, Dasmeh P. A model of proteostatic energy cost and its use in analysis of proteome trends and sequence evolution. PLoS One 2014; 9:e90504. [PMID: 24587382 PMCID: PMC3938754 DOI: 10.1371/journal.pone.0090504] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2013] [Accepted: 02/03/2014] [Indexed: 12/25/2022] Open
Abstract
A model of proteome-associated chemical energetic costs of cells is derived from protein-turnover kinetics and protein folding. Minimization of the proteostatic maintenance cost can explain a range of trends of proteomes and combines both protein function, stability, size, proteostatic cost, temperature, resource availability, and turnover rates in one simple framework. We then explore the ansatz that the chemical energy remaining after proteostatic maintenance is available for reproduction (or cell division) and thus, proportional to organism fitness. Selection for lower proteostatic costs is then shown to be significant vs. typical effective population sizes of yeast. The model explains and quantifies evolutionary conservation of highly abundant proteins as arising both from functional mutations and from changes in other properties such as stability, cost, or turnover rates. We show that typical hypomorphic mutations can be selected against due to increased cost of compensatory protein expression (both in the mutated gene and in related genes, i.e. epistasis) rather than compromised function itself, although this compensation depends on the protein's importance. Such mutations exhibit larger selective disadvantage in abundant, large, synthetically costly, and/or short-lived proteins. Selection against increased turnover costs of less stable proteins rather than misfolding toxicity per se can explain equilibrium protein stability distributions, in agreement with recent findings in E. coli. The proteostatic selection pressure is stronger at low metabolic rates (i.e. scarce environments) and in hot habitats, explaining proteome adaptations towards rough environments as a question of energy. The model may also explain several trade-offs observed in protein evolution and suggests how protein properties can coevolve to maintain low proteostatic cost.
Collapse
Affiliation(s)
- Kasper P. Kepp
- Department of Chemistry, Technical University of Denmark, Kongens Lyngby, Denmark
- * E-mail:
| | - Pouria Dasmeh
- Department of Chemistry, Technical University of Denmark, Kongens Lyngby, Denmark
| |
Collapse
|
21
|
Tomala K, Pogoda E, Jakubowska A, Korona R. Fitness costs of minimal sequence alterations causing protein instability and toxicity. Mol Biol Evol 2013; 31:703-7. [PMID: 24361995 DOI: 10.1093/molbev/mst264] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Destabilization of a protein impairs its metabolic efficiency. It is less clear how often destabilization also results in a gain of toxicity. We derived collections of temperature-sensitive, and thus structurally unstable, mutants of the yeast ADE2 and LYS2 genes by introducing single or very few amino acids substitutions. Overexpression of these mutant proteins led to a common, although unequal, fitness decrease. Interestingly, although the mutant proteins were functionally redundant, higher expression levels were associated with higher fitness. This result suggests that growth was hampered not by the accumulation of damaged chains but by the activities needed to remove them or by the damage caused before they were removed. Our results support the idea that any protein can become toxic when destabilized by a point mutation.
Collapse
Affiliation(s)
- Katarzyna Tomala
- Institute of Environmental Sciences, Jagiellonian University, Krakow, Poland
| | | | | | | |
Collapse
|
22
|
Kochanowski K, Sauer U, Chubukov V. Somewhat in control--the role of transcription in regulating microbial metabolic fluxes. Curr Opin Biotechnol 2013; 24:987-93. [PMID: 23571096 DOI: 10.1016/j.copbio.2013.03.014] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2012] [Revised: 03/13/2013] [Accepted: 03/14/2013] [Indexed: 10/27/2022]
Abstract
The most common way for microbes to control their metabolism is by controlling enzyme levels through transcriptional regulation. Yet recent studies have shown that in many cases, perturbations to the transcriptional regulatory network do not result in altered metabolic phenotypes on the level of the flux distribution. We suggest that this may be a consequence of cells protecting their metabolism against stochastic fluctuations in expression as well as enabling a fast response for those fluxes that may need to be changed quickly. Furthermore, it is impossible for a regulatory program to guarantee optimal expression levels in all conditions. Several studies have found examples of demonstrably suboptimal regulation of gene expression, and improvements to the regulatory network have been investigated in laboratory evolution experiments.
Collapse
Affiliation(s)
- Karl Kochanowski
- Institute of Molecular Systems Biology, ETH Zurich, Wolfgang-Pauli-Str. 16, CH-8093 Zurich, Switzerland; Life Science Zurich PhD Program on Systems Biology, Zurich, Switzerland
| | | | | |
Collapse
|
23
|
Margaliot M, Tuller T. On the steady-state distribution in the homogeneous ribosome flow model. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012; 9:1724-1736. [PMID: 23221086 DOI: 10.1109/tcbb.2012.120] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
A central biological process in all living organisms is gene translation. Developing a deeper understanding of this complex process may have ramifications to almost every biomedical discipline. Reuveni et al. recently proposed a new computational model of gene translation called the Ribosome Flow Model (RFM). In this paper, we consider a particular case of this model, called the Homogeneous Ribosome Flow Model (HRFM). From a biological viewpoint, this corresponds to the case where the transition rates of all the coding sequence codons are identical. This regime has been suggested recently based on experiments in mouse embryonic cells. We consider the steady-state distribution of the HRFM. We provide formulas that relate the different parameters of the model in steady state. We prove the following properties: 1) the ribosomal density profile is monotonically decreasing along the coding sequence; 2) the ribosomal density at each codon monotonically increases with the initiation rate; and 3) for a constant initiation rate, the translation rate monotonically decreases with the length of the coding sequence. In addition, we analyze the translation rate of the HRFM at the limit of very high and very low initiation rate, and provide explicit formulas for the translation rate in these two cases. We discuss the relationship between these theoretical results and biological findings on the translation process.
Collapse
Affiliation(s)
- Michael Margaliot
- School of Electrical Engineering-Systems, Tel-Aviv University, Tel-Aviv.
| | | |
Collapse
|
24
|
Margaliot M, Tuller T. Stability analysis of the ribosome flow model. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2012; 9:1545-52. [PMID: 22732691 DOI: 10.1109/tcbb.2012.88] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
Gene translation is a central process in all living organisms. Developing a better understanding of this complex process may have ramifications to almost every biomedical discipline. Recently, Reuveni et al. proposed a new computational model of this process called the ribosome flow model (RFM). In this study, we show that the dynamical behavior of the RFM is relatively simple. There exists a unique equilibrium point e and every trajectory converges to e. Furthermore, convergence is monotone in the sense that the distance to e can never increase. This qualitative behavior is maintained for any feasible set of parameter values, suggesting that the RFM is highly robust. Our analysis is based on a contraction principle and the theory of monotone dynamical systems. These analysis tools may prove useful in studying other properties of the RFM as well as additional intracellular biological processes.
Collapse
Affiliation(s)
- Michael Margaliot
- School of Electrical Engineering and the Sagol School of Neuroscience, Tel-Aviv University, Tel-Aviv 69978, Israel.
| | | |
Collapse
|
25
|
Protein misinteraction avoidance causes highly expressed proteins to evolve slowly. Proc Natl Acad Sci U S A 2012; 109:E831-40. [PMID: 22416125 DOI: 10.1073/pnas.1117408109] [Citation(s) in RCA: 129] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The tempo and mode of protein evolution have been central questions in biology. Genomic data have shown a strong influence of the expression level of a protein on its rate of sequence evolution (E-R anticorrelation), which is currently explained by the protein misfolding avoidance hypothesis. Here, we show that this hypothesis does not fully explain the E-R anticorrelation, especially for protein surface residues. We propose that natural selection against protein-protein misinteraction, which wastes functional molecules and is potentially toxic, constrains the evolution of surface residues. Because highly expressed proteins are under stronger pressures to avoid misinteraction, surface residues are expected to show an E-R anticorrelation. Our molecular-level evolutionary simulation and yeast genomic analysis confirm multiple predictions of the hypothesis. These findings show a pluralistic origin of the E-R anticorrelation and reveal the role of protein misinteraction, an inherent property of complex cellular systems, in constraining protein evolution.
Collapse
|
26
|
Large-scale experimental studies show unexpected amino acid effects on protein expression and solubility in vivo in E. coli. MICROBIAL INFORMATICS AND EXPERIMENTATION 2011; 1:6. [PMID: 22587847 PMCID: PMC3372292 DOI: 10.1186/2042-5783-1-6] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/17/2011] [Accepted: 06/27/2011] [Indexed: 11/10/2022]
Abstract
The biochemical and physical factors controlling protein expression level and solubility in vivo remain incompletely characterized. To gain insight into the primary sequence features influencing these outcomes, we performed statistical analyses of results from the high-throughput protein-production pipeline of the Northeast Structural Genomics Consortium. Proteins expressed in E. coli and consistently purified were scored independently for expression and solubility levels. These parameters nonetheless show a very strong positive correlation. We used logistic regressions to determine whether they are systematically influenced by fractional amino acid composition or several bulk sequence parameters including hydrophobicity, sidechain entropy, electrostatic charge, and predicted backbone disorder. Decreasing hydrophobicity correlates with higher expression and solubility levels, but this correlation apparently derives solely from the beneficial effect of three charged amino acids, at least for bacterial proteins. In fact, the three most hydrophobic residues showed very different correlations with solubility level. Leu showed the strongest negative correlation among amino acids, while Ile showed a slightly positive correlation in most data segments. Several other amino acids also had unexpected effects. Notably, Arg correlated with decreased expression and, most surprisingly, solubility of bacterial proteins, an effect only partially attributable to rare codons. However, rare codons did significantly reduce expression despite use of a codon-enhanced strain. Additional analyses suggest that positively but not negatively charged amino acids may reduce translation efficiency in E. coli irrespective of codon usage. While some observed effects may reflect indirect evolutionary correlations, others may reflect basic physicochemical phenomena. We used these results to construct and validate predictors of expression and solubility levels and overall protein usability, and we propose new strategies to be explored for engineering improved protein expression and solubility.
Collapse
|
27
|
Stewart FJ, Sharma AK, Bryant JA, Eppley JM, DeLong EF. Community transcriptomics reveals universal patterns of protein sequence conservation in natural microbial communities. Genome Biol 2011; 12:R26. [PMID: 21426537 PMCID: PMC3129676 DOI: 10.1186/gb-2011-12-3-r26] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2011] [Revised: 02/28/2011] [Accepted: 03/22/2011] [Indexed: 12/02/2022] Open
Abstract
Background Combined metagenomic and metatranscriptomic datasets make it possible to study the molecular evolution of diverse microbial species recovered from their native habitats. The link between gene expression level and sequence conservation was examined using shotgun pyrosequencing of microbial community DNA and RNA from diverse marine environments, and from forest soil. Results Across all samples, expressed genes with transcripts in the RNA sample were significantly more conserved than non-expressed gene sets relative to best matches in reference databases. This discrepancy, observed for many diverse individual genomes and across entire communities, coincided with a shift in amino acid usage between these gene fractions. Expressed genes trended toward GC-enriched amino acids, consistent with a hypothesis of higher levels of functional constraint in this gene pool. Highly expressed genes were significantly more likely to fall within an orthologous gene set shared between closely related taxa (core genes). However, non-core genes, when expressed above the level of detection, were, on average, significantly more highly expressed than core genes based on transcript abundance normalized to gene abundance. Finally, expressed genes showed broad similarities in function across samples, being relatively enriched in genes of energy metabolism and underrepresented by genes of cell growth. Conclusions These patterns support the hypothesis, predicated on studies of model organisms, that gene expression level is a primary correlate of evolutionary rate across diverse microbial taxa from natural environments. Despite their complexity, meta-omic datasets can reveal broad evolutionary patterns across taxonomically, functionally, and environmentally diverse communities.
Collapse
Affiliation(s)
- Frank J Stewart
- School of Biology, Georgia Institute of Technology, Ford ES&T Building, Rm 1242, 311 Ferst Drive, Atlanta, GA 30332, USA
| | | | | | | | | |
Collapse
|
28
|
Misfolded proteins impose a dosage-dependent fitness cost and trigger a cytosolic unfolded protein response in yeast. Proc Natl Acad Sci U S A 2010; 108:680-5. [PMID: 21187411 DOI: 10.1073/pnas.1017570108] [Citation(s) in RCA: 210] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Evolving lineages face a constant intracellular threat: most new coding sequence mutations destabilize the folding of the encoded protein. Misfolded proteins form insoluble aggregates and are hypothesized to be intrinsically cytotoxic. Here, we experimentally isolate a fitness cost caused by toxicity of misfolded proteins. We exclude other costs of protein misfolding, such as loss of functional protein or attenuation of growth-limiting protein synthesis resources, by comparing growth rates of budding yeast expressing folded or misfolded variants of a gratuitous protein, YFP, at equal levels. We quantify a fitness cost that increases with misfolded protein abundance, up to as much as a 3.2% growth rate reduction when misfolded YFP represents less than 0.1% of total cellular protein. Comparable experiments on variants of the yeast gene orotidine-5'-phosphate decarboxylase (URA3) produce similar results. Quantitative proteomic measurements reveal that, within the cell, misfolded YFP induces coordinated synthesis of interacting cytosolic chaperone proteins in the absence of a wider stress response, providing evidence for an evolved modular response to misfolded proteins in the cytosol. These results underscore the distinct and evolutionarily relevant molecular threat of protein misfolding, independent of protein function. Assuming that most misfolded proteins impose similar costs, yeast cells express almost all proteins at steady-state levels sufficient to expose their encoding genes to selection against misfolding, lending credibility to the recent suggestion that such selection imposes a global constraint on molecular evolution.
Collapse
|