1
|
Abstract
Flamholz and Shih explain how photosynthetic organisms on earth have evolved carbon dioxide concentrating mechanisms to contend with an increased abundance of oxygen in the atmosphere. This critical innovation has occurred numerous times and comes in two basic flavors.
Collapse
|
2
|
Abstract
Quantitatively describing the time course of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection within an infected individual is important for understanding the current global pandemic and possible ways to combat it. Here we integrate the best current knowledge about the typical viral load of SARS-CoV-2 in bodily fluids and host tissues to estimate the total number and mass of SARS-CoV-2 virions in an infected person. We estimate that each infected person carries 109 to 1011 virions during peak infection, with a total mass in the range of 1 μg to 100 μg, which curiously implies that all SARS-CoV-2 virions currently circulating within human hosts have a collective mass of only 0.1 kg to 10 kg. We combine our estimates with the available literature on host immune response and viral mutation rates to demonstrate how antibodies markedly outnumber the spike proteins, and the genetic diversity of virions in an infected host covers all possible single nucleotide substitutions.
Collapse
|
3
|
The total number and mass of SARS-CoV-2 virions. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2021:2020.11.16.20232009. [PMID: 33236021 PMCID: PMC7685332 DOI: 10.1101/2020.11.16.20232009] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Quantitatively describing the time course of the SARS-CoV-2 infection within an infected individual is important for understanding the current global pandemic and possible ways to combat it. Here we integrate the best current knowledge about the typical viral load of SARS-CoV-2 in bodily fluids and host tissues to estimate the total number and mass of SARS-CoV-2 virions in an infected person. We estimate that each infected person carries 109-1011 virions during peak infection, with a total mass in the range of 1-100 μg, which curiously implies that all SARS-CoV-2 virions currently circulating within human hosts have a collective mass of only 0.1-10 kg. We combine our estimates with the available literature on host immune response and viral mutation rates to demonstrate how antibodies markedly outnumber the spike proteins and the genetic diversity of virions in an infected host covers all possible single nucleotide substitutions.
Collapse
|
4
|
Genome Sequencing of Sewage Detects Regionally Prevalent SARS-CoV-2 Variants. mBio 2021; 12:e02703-20. [PMID: 33468686 PMCID: PMC7845645 DOI: 10.1128/mbio.02703-20] [Citation(s) in RCA: 213] [Impact Index Per Article: 71.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Accepted: 12/15/2020] [Indexed: 12/15/2022] Open
Abstract
Viral genome sequencing has guided our understanding of the spread and extent of genetic diversity of SARS-CoV-2 during the COVID-19 pandemic. SARS-CoV-2 viral genomes are usually sequenced from nasopharyngeal swabs of individual patients to track viral spread. Recently, RT-qPCR of municipal wastewater has been used to quantify the abundance of SARS-CoV-2 in several regions globally. However, metatranscriptomic sequencing of wastewater can be used to profile the viral genetic diversity across infected communities. Here, we sequenced RNA directly from sewage collected by municipal utility districts in the San Francisco Bay Area to generate complete and nearly complete SARS-CoV-2 genomes. The major consensus SARS-CoV-2 genotypes detected in the sewage were identical to clinical genomes from the region. Using a pipeline for single nucleotide variant calling in a metagenomic context, we characterized minor SARS-CoV-2 alleles in the wastewater and detected viral genotypes which were also found within clinical genomes throughout California. Observed wastewater variants were more similar to local California patient-derived genotypes than they were to those from other regions within the United States or globally. Additional variants detected in wastewater have only been identified in genomes from patients sampled outside California, indicating that wastewater sequencing can provide evidence for recent introductions of viral lineages before they are detected by local clinical sequencing. These results demonstrate that epidemiological surveillance through wastewater sequencing can aid in tracking exact viral strains in an epidemic context.
Collapse
|
5
|
Abstract
Viral genome sequencing has guided our understanding of the spread and extent of genetic diversity of SARS-CoV-2 during the COVID-19 pandemic. SARS-CoV-2 viral genomes are usually sequenced from nasopharyngeal swabs of individual patients to track viral spread. Recently, RT-qPCR of municipal wastewater has been used to quantify the abundance of SARS-CoV-2 in several regions globally. However, metatranscriptomic sequencing of wastewater can be used to profile the viral genetic diversity across infected communities. Here, we sequenced RNA directly from sewage collected by municipal utility districts in the San Francisco Bay Area to generate complete and nearly complete SARS-CoV-2 genomes. The major consensus SARS-CoV-2 genotypes detected in the sewage were identical to clinical genomes from the region. Using a pipeline for single nucleotide variant calling in a metagenomic context, we characterized minor SARS-CoV-2 alleles in the wastewater and detected viral genotypes which were also found within clinical genomes throughout California. Observed wastewater variants were more similar to local California patient-derived genotypes than they were to those from other regions within the United States or globally. Additional variants detected in wastewater have only been identified in genomes from patients sampled outside California, indicating that wastewater sequencing can provide evidence for recent introductions of viral lineages before they are detected by local clinical sequencing. These results demonstrate that epidemiological surveillance through wastewater sequencing can aid in tracking exact viral strains in an epidemic context.
Collapse
|
6
|
Abstract
Viral genome sequencing has guided our understanding of the spread and extent of genetic diversity of SARS-CoV-2 during the COVID-19 pandemic. SARS-CoV-2 viral genomes are usually sequenced from nasopharyngeal swabs of individual patients to track viral spread. Recently, RT-qPCR of municipal wastewater has been used to quantify the abundance of SARS-CoV-2 in several regions globally. However, metatranscriptomic sequencing of wastewater can be used to profile the viral genetic diversity across infected communities. Here, we sequenced RNA directly from sewage collected by municipal utility districts in the San Francisco Bay Area to generate complete and nearly complete SARS-CoV-2 genomes. The major consensus SARS-CoV-2 genotypes detected in the sewage were identical to clinical genomes from the region. Using a pipeline for single nucleotide variant calling in a metagenomic context, we characterized minor SARS-CoV-2 alleles in the wastewater and detected viral genotypes which were also found within clinical genomes throughout California. Observed wastewater variants were more similar to local California patient-derived genotypes than they were to those from other regions within the United States or globally. Additional variants detected in wastewater have only been identified in genomes from patients sampled outside California, indicating that wastewater sequencing can provide evidence for recent introductions of viral lineages before they are detected by local clinical sequencing. These results demonstrate that epidemiological surveillance through wastewater sequencing can aid in tracking exact viral strains in an epidemic context.
Collapse
|
7
|
Highly active rubiscos discovered by systematic interrogation of natural sequence diversity. EMBO J 2020; 39:e104081. [PMID: 32500941 PMCID: PMC7507306 DOI: 10.15252/embj.2019104081] [Citation(s) in RCA: 55] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2019] [Revised: 04/30/2020] [Accepted: 05/07/2020] [Indexed: 11/09/2022] Open
Abstract
CO2 is converted into biomass almost solely by the enzyme rubisco. The poor carboxylation properties of plant rubiscos have led to efforts that made it the most kinetically characterized enzyme, yet these studies focused on < 5% of its natural diversity. Here, we searched for fast-carboxylating variants by systematically mining genomic and metagenomic data. Approximately 33,000 unique rubisco sequences were identified and clustered into ≈ 1,000 similarity groups. We then synthesized, purified, and biochemically tested the carboxylation rates of 143 representatives, spanning all clusters of form-II and form-II/III rubiscos. Most variants (> 100) were active in vitro, with the fastest having a turnover number of 22 ± 1 s-1 -sixfold faster than the median plant rubisco and nearly twofold faster than the fastest measured rubisco to date. Unlike rubiscos from plants and cyanobacteria, the fastest variants discovered here are homodimers and exhibit a much simpler folding and activation kinetics. Our pipeline can be utilized to explore the kinetic space of other enzymes of interest, allowing us to get a better view of the biosynthetic potential of the biosphere.
Collapse
|
8
|
Abstract
The COVID-19 pandemic is a harsh reminder of the fact that, whether in a single human host or a wave of infection across continents, viral dynamics is often a story about the numbers. In this article we provide a one-stop, curated graphical source for the key numbers (based mostly on the peer-reviewed literature) about the SARS-CoV-2 virus that is responsible for the pandemic. The discussion is framed around two broad themes: i) the biology of the virus itself; ii) the characteristics of the infection of a single human host.
Collapse
|
9
|
Quantum chemistry reveals thermodynamic principles of redox biochemistry. PLoS Comput Biol 2018; 14:e1006471. [PMID: 30356318 PMCID: PMC6218094 DOI: 10.1371/journal.pcbi.1006471] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2018] [Revised: 11/05/2018] [Accepted: 08/29/2018] [Indexed: 01/28/2023] Open
Abstract
Thermodynamics dictates the structure and function of metabolism. Redox reactions drive cellular energy and material flow. Hence, accurately quantifying the thermodynamics of redox reactions should reveal design principles that shape cellular metabolism. However, only few redox potentials have been measured, and mostly with inconsistent experimental setups. Here, we develop a quantum chemistry approach to calculate redox potentials of biochemical reactions and demonstrate our method predicts experimentally measured potentials with unparalleled accuracy. We then calculate the potentials of all redox pairs that can be generated from biochemically relevant compounds and highlight fundamental trends in redox biochemistry. We further address the question of why NAD/NADP are used as primary electron carriers, demonstrating how their physiological potential range fits the reactions of central metabolism and minimizes the concentration of reactive carbonyls. The use of quantum chemistry can revolutionize our understanding of biochemical phenomena by enabling fast and accurate calculation of thermodynamic values. Redox reactions define the energetic constraints within which life can exist. However, measurements of reduction potentials are scarce and unstandardized, and current prediction methods fall short of desired accuracy and coverage. Here, we harness quantum chemistry tools to enable the high-throughput prediction of reduction potentials with unparalleled accuracy. We calculate the reduction potentials of all redox pairs that can be generated using known biochemical compounds. This high-resolution dataset enables us to uncover global trends in metabolism, including the differences between and within oxidoreductase groups. We further demonstrate that the redox potential of NAD(P) optimally satisfies two constraints: reversibly reducing and oxidizing the vast majority of redox reactions in central metabolism while keeping the concentration of reactive carbonyl intermediates in check.
Collapse
|
10
|
The Protein Cost of Metabolic Fluxes: Prediction from Enzymatic Rate Laws and Cost Minimization. PLoS Comput Biol 2016; 12:e1005167. [PMID: 27812109 PMCID: PMC5094713 DOI: 10.1371/journal.pcbi.1005167] [Citation(s) in RCA: 99] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2016] [Accepted: 09/27/2016] [Indexed: 02/03/2023] Open
Abstract
Bacterial growth depends crucially on metabolic fluxes, which are limited by the cell’s capacity to maintain metabolic enzymes. The necessary enzyme amount per unit flux is a major determinant of metabolic strategies both in evolution and bioengineering. It depends on enzyme parameters (such as kcat and KM constants), but also on metabolite concentrations. Moreover, similar amounts of different enzymes might incur different costs for the cell, depending on enzyme-specific properties such as protein size and half-life. Here, we developed enzyme cost minimization (ECM), a scalable method for computing enzyme amounts that support a given metabolic flux at a minimal protein cost. The complex interplay of enzyme and metabolite concentrations, e.g. through thermodynamic driving forces and enzyme saturation, would make it hard to solve this optimization problem directly. By treating enzyme cost as a function of metabolite levels, we formulated ECM as a numerically tractable, convex optimization problem. Its tiered approach allows for building models at different levels of detail, depending on the amount of available data. Validating our method with measured metabolite and protein levels in E. coli central metabolism, we found typical prediction fold errors of 4.1 and 2.6, respectively, for the two kinds of data. This result from the cost-optimized metabolic state is significantly better than randomly sampled metabolite profiles, supporting the hypothesis that enzyme cost is important for the fitness of E. coli. ECM can be used to predict enzyme levels and protein cost in natural and engineered pathways, and could be a valuable computational tool to assist metabolic engineering projects. Furthermore, it establishes a direct connection between protein cost and thermodynamics, and provides a physically plausible and computationally tractable way to include enzyme kinetics into constraint-based metabolic models, where kinetics have usually been ignored or oversimplified. “Enzyme cost”, the amount of protein needed for a given metabolic flux, is crucial for the metabolic choices cells have to make. However, due to the technical limitations of linear optimization methods, this cost has traditionally been ignored by constraint-based metabolic models such as Flux Balance Analysis. On the other hand, more detailed kinetic models which use ordinary differential equations to simulate fluxes for different choices of enzyme allocation, are computationally demanding and not scalable enough. In this work, we developed a method which utilizes the full kinetic model to predict steady-state enzyme costs, using a scalable and robust algorithm based on convex optimization. We show that the minimization of enzyme cost is a meaningful optimality principle by comparing our predictions to measured enzyme and metabolite levels in exponentially growing E. coli. This method could be used to quantify the enzyme cost of many other pathways and explain why evolution has selected some low-yield metabolic strategies, including aerobic fermentation in yeast and cancer cells. Furthermore, future metabolic engineering projects could benefit from our method by choosing pathways that reduce the total amount of enzyme required for the synthesis of a value-added product.
Collapse
|
11
|
Abstract
The microscopic world of a cell can be as alien to our human-centered intuition as the confinement of quarks within protons or the event horizon of a black hole. We are prone to thinking by analogy—Golgi cisternae stack like pancakes, red blood cells look like donuts—but very little in our human experience is truly comparable to the immensely crowded, membrane-subdivided interior of a eukaryotic cell or the intricately layered structures of a mammalian tissue. So in our daily efforts to understand how cells work, we are faced with a challenge: how do we develop intuition that works at the microscopic scale?
Collapse
|
12
|
Pathway thermodynamics highlights kinetic obstacles in central metabolism. PLoS Comput Biol 2014; 10:e1003483. [PMID: 24586134 PMCID: PMC3930492 DOI: 10.1371/journal.pcbi.1003483] [Citation(s) in RCA: 189] [Impact Index Per Article: 18.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2013] [Accepted: 01/08/2014] [Indexed: 12/05/2022] Open
Abstract
In metabolism research, thermodynamics is usually used to determine the directionality of a reaction or the feasibility of a pathway. However, the relationship between thermodynamic potentials and fluxes is not limited to questions of directionality: thermodynamics also affects the kinetics of reactions through the flux-force relationship, which states that the logarithm of the ratio between the forward and reverse fluxes is directly proportional to the change in Gibbs energy due to a reaction (ΔrG′). Accordingly, if an enzyme catalyzes a reaction with a ΔrG′ of -5.7 kJ/mol then the forward flux will be roughly ten times the reverse flux. As ΔrG′ approaches equilibrium (ΔrG′ = 0 kJ/mol), exponentially more enzyme counterproductively catalyzes the reverse reaction, reducing the net rate at which the reaction proceeds. Thus, the enzyme level required to achieve a given flux increases dramatically near equilibrium. Here, we develop a framework for quantifying the degree to which pathways suffer these thermodynamic limitations on flux. For each pathway, we calculate a single thermodynamically-derived metric (the Max-min Driving Force, MDF), which enables objective ranking of pathways by the degree to which their flux is constrained by low thermodynamic driving force. Our framework accounts for the effect of pH, ionic strength and metabolite concentration ranges and allows us to quantify how alterations to the pathway structure affect the pathway's thermodynamics. Applying this methodology to pathways of central metabolism sheds light on some of their features, including metabolic bypasses (e.g., fermentation pathways bypassing substrate-level phosphorylation), substrate channeling (e.g., of oxaloacetate from malate dehydrogenase to citrate synthase), and use of alternative cofactors (e.g., quinone as an electron acceptor instead of NAD). The methods presented here place another arrow in metabolic engineers' quiver, providing a simple means of evaluating the thermodynamic and kinetic quality of different pathway chemistries that produce the same molecules. Given data about enzyme kinetics and reaction thermodynamics, traditional metabolic control analysis (MCA) can pinpoint the enzymes whose expression will have the largest effect on steady-state flux through the pathway. These analyses can aid experimentalists in tuning enzyme expression levels along a metabolic pathway. In this work, we offer a framework that is complementary to MCA. Rather than focusing on the relationship between enzyme levels and pathway flux, we examine a pathway's stoichiometry and thermodynamics and ask whether it is likely to support high flux in cellular conditions. Our framework calculates a single thermodynamically-derived metric (the MDF) for each pathway, which is convenient for selecting the promising pathways from a large collection. This approach has several advantages. First, enzyme kinetic properties are laborious to measure and differ between organisms and isozymes, but no kinetic data is required to calculate the MDF. Second, as our framework accounts for pH, ionic strength and allowed concentration ranges, it is simple to model the effect of these parameters on the MDF. Finally, as it can be difficult to control the exact expression level of enzymes within cells, the MDF helps identify alternative pathways that are less sensitive to the levels of their constituent enzymes.
Collapse
|
13
|
A note on the kinetics of enzyme action: A decomposition that highlights thermodynamic effects. FEBS Lett 2013; 587:2772-7. [DOI: 10.1016/j.febslet.2013.07.028] [Citation(s) in RCA: 80] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2013] [Revised: 07/07/2013] [Accepted: 07/16/2013] [Indexed: 11/25/2022]
|
14
|
Abstract
Protein levels are a dominant factor shaping natural and synthetic biological systems. Although proper functioning of metabolic pathways relies on precise control of enzyme levels, the experimental ability to balance the levels of many genes in parallel is a major outstanding challenge. Here, we introduce a rapid and modular method to span the expression space of several proteins in parallel. By combinatorially pairing genes with a compact set of ribosome-binding sites, we modulate protein abundance by several orders of magnitude. We demonstrate our strategy by using a synthetic operon containing fluorescent proteins to span a 3D color space. Using the same approach, we modulate a recombinant carotenoid biosynthesis pathway in Escherichia coli to reveal a diversity of phenotypes, each characterized by a distinct carotenoid accumulation profile. In a single combinatorial assembly, we achieve a yield of the industrially valuable compound astaxanthin 4-fold higher than previously reported. The methodology presented here provides an efficient tool for exploring a high-dimensional expression space to locate desirable phenotypes.
Collapse
|
15
|
Design and analysis of metabolic pathways supporting formatotrophic growth for electricity-dependent cultivation of microbes. BIOCHIMICA ET BIOPHYSICA ACTA-BIOENERGETICS 2012; 1827:1039-47. [PMID: 23123556 DOI: 10.1016/j.bbabio.2012.10.013] [Citation(s) in RCA: 112] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2012] [Revised: 10/05/2012] [Accepted: 10/25/2012] [Indexed: 11/29/2022]
Abstract
Electrosynthesis is a promising approach that enables the biological production of commodities, like fuels and fine chemicals, using renewably produced electricity. Several techniques have been proposed to mediate the transfer of electrons from the cathode to living cells. Of these, the electroproduction of formate as a mediator seems especially promising: formate is readily soluble, of low toxicity and can be produced at relatively high efficiency and at reasonable current density. While organisms that are capable of formatotrophic growth, i.e. growth on formate, exist naturally, they are generally less suitable for bulk cultivation and industrial needs. Hence, it may be helpful to engineer a model organism of industrial relevance, such as E. coli, for growth on formate. There are numerous metabolic pathways that can potentially support formatotrophic growth. Here we analyze these diverse pathways according to various criteria including biomass yield, thermodynamic favorability, chemical motive force, kinetics and the practical challenges posed by their expression. We find that the reductive glycine pathway, composed of the tetrahydrofolate system, the glycine cleavage system, serine hydroxymethyltransferase and serine deaminase, is a promising candidate to support electrosynthesis in E. coli. The approach presented here exemplifies how combining different computational approaches into a systematic analysis methodology provides assistance in redesigning metabolism. This article is part of a Special Issue entitled: Metals in Bioenergetics and Biomimetics Systems.
Collapse
|
16
|
An integrated open framework for thermodynamics of reactions that combines accuracy and coverage. ACTA ACUST UNITED AC 2012; 28:2037-44. [PMID: 22645166 PMCID: PMC3400964 DOI: 10.1093/bioinformatics/bts317] [Citation(s) in RCA: 94] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
Motivation: The laws of thermodynamics describe a direct, quantitative relationship between metabolite concentrations and reaction directionality. Despite great efforts, thermodynamic data suffer from limited coverage, scattered accessibility and non-standard annotations. We present a framework for unifying thermodynamic data from multiple sources and demonstrate two new techniques for extrapolating the Gibbs energies of unmeasured reactions and conditions. Results: Both methods account for changes in cellular conditions (pH, ionic strength, etc.) by using linear regression over the ΔG○ of pseudoisomers and reactions. The Pseudoisomeric Reactant Contribution method systematically infers compound formation energies using measured K′ and pKa data. The Pseudoisomeric Group Contribution method extends the group contribution method and achieves a high coverage of unmeasured reactions. We define a continuous index that predicts the reversibility of a reaction under a given physiological concentration range. In the characteristic physiological range 3μM–3mM, we find that roughly half of the reactions in Escherichia coli's metabolism are reversible. These new tools can increase the accuracy of thermodynamic-based models, especially in non-standard pH and ionic strengths. The reversibility index can help modelers decide which reactions are reversible in physiological conditions. Availability: Freely available on the web at: http://equilibrator.weizmann.ac.il. Website implemented in Python, MySQL, Apache and Django, with all major browsers supported. The framework is open-source (code.google.com/p/milo-lab), implemented in pure Python and tested mainly on Linux. Contact:ron.milo@weizmann.ac.il Supplementary Information:Supplementary data are available at Bioinformatics online.
Collapse
|
17
|
Abstract
Metabolic pathways may seem arbitrary and unnecessarily complex. In many cases, a chemist might devise a simpler route for the biochemical transformation, so why has nature chosen such complex solutions? In this review, we distill lessons from a century of metabolic research and introduce new observations suggesting that the intricate structure of metabolic pathways can be explained by a small set of biochemical principles. Using glycolysis as an example, we demonstrate how three key biochemical constraints--thermodynamic favorability, availability of enzymatic mechanisms and the physicochemical properties of pathway intermediates--eliminate otherwise plausible metabolic strategies. Considering these constraints, glycolysis contains no unnecessary steps and represents one of the very few pathway structures that meet cellular demands. The analysis presented here can be applied to metabolic engineering efforts for the rational design of pathways that produce a desired product while satisfying biochemical constraints.
Collapse
|
18
|
Abstract
The laws of thermodynamics constrain the action of biochemical systems. However, thermodynamic data on biochemical compounds can be difficult to find and is cumbersome to perform calculations with manually. Even simple thermodynamic questions like ‘how much Gibbs energy is released by ATP hydrolysis at pH 5?’ are complicated excessively by the search for accurate data. To address this problem, eQuilibrator couples a comprehensive and accurate database of thermodynamic properties of biochemical compounds and reactions with a simple and powerful online search and calculation interface. The web interface to eQuilibrator (http://equilibrator.weizmann.ac.il) enables easy calculation of Gibbs energies of compounds and reactions given arbitrary pH, ionic strength and metabolite concentrations. The eQuilibrator code is open-source and all thermodynamic source data are freely downloadable in standard formats. Here we describe the database characteristics and implementation and demonstrate its use.
Collapse
|
19
|
Hydrophobicity and charge shape cellular metabolite concentrations. PLoS Comput Biol 2011; 7:e1002166. [PMID: 21998563 PMCID: PMC3188480 DOI: 10.1371/journal.pcbi.1002166] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2011] [Accepted: 07/04/2011] [Indexed: 11/18/2022] Open
Abstract
What governs the concentrations of metabolites within living cells? Beyond specific metabolic and enzymatic considerations, are there global trends that affect their values? We hypothesize that the physico-chemical properties of metabolites considerably affect their in-vivo concentrations. The recently achieved experimental capability to measure the concentrations of many metabolites simultaneously has made the testing of this hypothesis possible. Here, we analyze such recently available data sets of metabolite concentrations within E. coli, S. cerevisiae, B. subtilis and human. Overall, these data sets encompass more than twenty conditions, each containing dozens (28-108) of simultaneously measured metabolites. We test for correlations with various physico-chemical properties and find that the number of charged atoms, non-polar surface area, lipophilicity and solubility consistently correlate with concentration. In most data sets, a change in one of these properties elicits a ~100 fold increase in metabolite concentrations. We find that the non-polar surface area and number of charged atoms account for almost half of the variation in concentrations in the most reliable and comprehensive data set. Analyzing specific groups of metabolites, such as amino-acids or phosphorylated nucleotides, reveals even a higher dependence of concentration on hydrophobicity. We suggest that these findings can be explained by evolutionary constraints imposed on metabolite concentrations and discuss possible selective pressures that can account for them. These include the reduction of solute leakage through the lipid membrane, avoidance of deleterious aggregates and reduction of non-specific hydrophobic binding. By highlighting the global constraints imposed on metabolic pathways, future research could shed light onto aspects of biochemical evolution and the chemical constraints that bound metabolic engineering efforts.
Collapse
|