1
|
Zhang Q, Cao W, Wang J, Yin Y, Sun R, Tian Z, Hu Y, Tan Y, Zhang BG. Transcriptional bursting dynamics in gene expression. Front Genet 2024; 15:1451461. [PMID: 39346775 PMCID: PMC11437526 DOI: 10.3389/fgene.2024.1451461] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2024] [Accepted: 08/30/2024] [Indexed: 10/01/2024] Open
Abstract
Gene transcription is a stochastic process that occurs in all organisms. Transcriptional bursting, a critical molecular dynamics mechanism, creates significant heterogeneity in mRNA and protein levels. This heterogeneity drives cellular phenotypic diversity. Currently, the lack of a comprehensive quantitative model limits the research on transcriptional bursting. This review examines various gene expression models and compares their strengths and weaknesses to guide researchers in selecting the most suitable model for their research context. We also provide a detailed summary of the key metrics related to transcriptional bursting. We compared the temporal dynamics of transcriptional bursting across species and the molecular mechanisms influencing these bursts, and highlighted the spatiotemporal patterns of gene expression differences by utilizing metrics such as burst size and burst frequency. We summarized the strategies for modeling gene expression from both biostatistical and biochemical reaction network perspectives. Single-cell sequencing data and integrated multiomics approaches drive our exploration of cutting-edge trends in transcriptional bursting mechanisms. Moreover, we examined classical methods for parameter estimation that help capture dynamic parameters in gene expression data, assessing their merits and limitations to facilitate optimal parameter estimation. Our comprehensive summary and review of the current transcriptional burst dynamics theories provide deeper insights for promoting research on the nature of cell processes, cell fate determination, and cancer diagnosis.
Collapse
Affiliation(s)
- Qiuyu Zhang
- Research Center of Nonlinear Sciences, School of Mathematical & Physical Sciences, Wuhan Textile University, Wu Han, China
| | - Wenjie Cao
- School of Mathematics, Sun Yat-sen University, Guangzhou, China
| | - Jiaqi Wang
- Research Center of Nonlinear Sciences, School of Mathematical & Physical Sciences, Wuhan Textile University, Wu Han, China
| | - Yihao Yin
- Research Center of Nonlinear Sciences, School of Mathematical & Physical Sciences, Wuhan Textile University, Wu Han, China
| | - Rui Sun
- Research Center of Nonlinear Sciences, School of Mathematical & Physical Sciences, Wuhan Textile University, Wu Han, China
| | - Zunyi Tian
- Research Center of Nonlinear Sciences, School of Mathematical & Physical Sciences, Wuhan Textile University, Wu Han, China
| | - Yuhan Hu
- Research Center of Nonlinear Sciences, School of Mathematical & Physical Sciences, Wuhan Textile University, Wu Han, China
| | - Yalan Tan
- School of Bioengineering & Health, Wuhan Textile University, Wu Han, China
| | - Ben-Gong Zhang
- Research Center of Nonlinear Sciences, School of Mathematical & Physical Sciences, Wuhan Textile University, Wu Han, China
| |
Collapse
|
2
|
Volteras D, Shahrezaei V, Thomas P. Global transcription regulation revealed from dynamical correlations in time-resolved single-cell RNA sequencing. Cell Syst 2024; 15:694-708.e12. [PMID: 39121860 DOI: 10.1016/j.cels.2024.07.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Revised: 02/29/2024] [Accepted: 07/11/2024] [Indexed: 08/12/2024]
Abstract
Single-cell transcriptomics reveals significant variations in transcriptional activity across cells. Yet, it remains challenging to identify mechanisms of transcription dynamics from static snapshots. It is thus still unknown what drives global transcription dynamics in single cells. We present a stochastic model of gene expression with cell size- and cell cycle-dependent rates in growing and dividing cells that harnesses temporal dimensions of single-cell RNA sequencing through metabolic labeling protocols and cel lcycle reporters. We develop a parallel and highly scalable approximate Bayesian computation method that corrects for technical variation and accurately quantifies absolute burst frequency, burst size, and degradation rate along the cell cycle at a transcriptome-wide scale. Using Bayesian model selection, we reveal scaling between transcription rates and cell size and unveil waves of gene regulation across the cell cycle-dependent transcriptome. Our study shows that stochastic modeling of dynamical correlations identifies global mechanisms of transcription regulation. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Dimitris Volteras
- Department of Mathematics, Faculty of Natural Sciences, Imperial College London, London, SW7 2AZ, UK
| | - Vahid Shahrezaei
- Department of Mathematics, Faculty of Natural Sciences, Imperial College London, London, SW7 2AZ, UK.
| | - Philipp Thomas
- Department of Mathematics, Faculty of Natural Sciences, Imperial College London, London, SW7 2AZ, UK.
| |
Collapse
|
3
|
Wang Q, Lin J. Homeostasis of mRNA concentrations through coupling transcription, export, and degradation. iScience 2024; 27:110531. [PMID: 39175768 PMCID: PMC11338957 DOI: 10.1016/j.isci.2024.110531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 04/16/2024] [Accepted: 07/15/2024] [Indexed: 08/24/2024] Open
Abstract
Many experiments showed that eukaryotic cells maintain a constant mRNA concentration upon various perturbations by actively regulating mRNA production and degradation rates, known as mRNA buffering. However, the underlying mechanism is still unknown. In this work, we unveil a mechanistic model of mRNA buffering: the releasing-shuttling (RS) model. The model incorporates two crucial proteins, X and Y, which play several roles, including transcription, decay, and export factors, in the different stages of mRNA metabolism. The RS model predicts the constant mRNA concentration under genome-wide genetic perturbations and cell volume changes, the slowed-down mRNA degradation after Pol II depletion, and the temporal transcription dynamics after exonuclease depletion, in agreement with multiple experiments. Finally, we present a list of X and Y candidates and propose an experimental method to identify X. Our work uncovers potentially universal pathways coupling transcription, export, and degradation that help cells maintain mRNA homeostasis.
Collapse
Affiliation(s)
- Qirun Wang
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Jie Lin
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| |
Collapse
|
4
|
Zhang Z, Zabaikina I, Nieto C, Vahdat Z, Bokes P, Singh A. Stochastic Gene Expression in Proliferating Cells: Differing Noise Intensity in Single-Cell and Population Perspectives. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.28.601263. [PMID: 38979195 PMCID: PMC11230457 DOI: 10.1101/2024.06.28.601263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Random fluctuations (noise) in gene expression can be studied from two complementary perspectives: following expression in a single cell over time or comparing expression between cells in a proliferating population at a given time. Here, we systematically investigated scenarios where both perspectives lead to different levels of noise in a given gene product. We first consider a stable protein, whose concentration is diluted by cellular growth, and the protein inhibits growth at high concentrations, establishing a positive feedback loop. For a stochastic model with molecular bursting of gene products, we analytically predict and contrast the steady-state distributions of protein concentration in both frameworks. Although positive feedback amplifies the noise in expression, this amplification is much higher in the population framework compared to following a single cell over time. We also study other processes that lead to different noise levels even in the absence of such dilution-based feedback. When considering randomness in the partitioning of molecules between daughters during mitosis, we find that in the single-cell perspective, the noise in protein concentration is independent of noise in the cell cycle duration. In contrast, partitioning noise is amplified in the population perspective by increasing randomness in cell-cycle time. Overall, our results show that the commonly used single-cell framework that does not account for proliferating cells can, in some cases, underestimate the noise in gene product levels. These results have important implications for studying the inter-cellular variation of different stress-related expression programs across cell types that are known to inhibit cellular growth.
Collapse
Affiliation(s)
- Zhanhao Zhang
- Department of Electrical and Computer Engineering, University of Delaware. Newark, DE 19716, USA
| | - Iryna Zabaikina
- Department of Applied Mathematics and Statistics, Comenius University, Bratislava 84248, Slovakia
| | - César Nieto
- Department of Electrical and Computer Engineering, University of Delaware. Newark, DE 19716, USA
| | - Zahra Vahdat
- Department of Electrical and Computer Engineering, University of Delaware. Newark, DE 19716, USA
| | - Pavol Bokes
- Department of Applied Mathematics and Statistics, Comenius University, Bratislava 84248, Slovakia
| | - Abhyudai Singh
- Department of Electrical and Computer Engineering, University of Delaware. Newark, DE 19716, USA
| |
Collapse
|
5
|
Piho P, Thomas P. Feedback between stochastic gene networks and population dynamics enables cellular decision-making. SCIENCE ADVANCES 2024; 10:eadl4895. [PMID: 38787956 PMCID: PMC11122677 DOI: 10.1126/sciadv.adl4895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 04/24/2024] [Indexed: 05/26/2024]
Abstract
Phenotypic selection occurs when genetically identical cells are subject to different reproductive abilities due to cellular noise. Such noise arises from fluctuations in reactions synthesizing proteins and plays a crucial role in how cells make decisions and respond to stress or drugs. We propose a general stochastic agent-based model for growing populations capturing the feedback between gene expression and cell division dynamics. We devise a finite state projection approach to analyze gene expression and division distributions and infer selection from single-cell data in mother machines and lineage trees. We use the theory to quantify selection in multi-stable gene expression networks and elucidate that the trade-off between phenotypic switching and selection enables robust decision-making essential for synthetic circuits and developmental lineage decisions. Using live-cell data, we demonstrate that combining theory and inference provides quantitative insights into bet-hedging-like response to DNA damage and adaptation during antibiotic exposure in Escherichia coli.
Collapse
Affiliation(s)
- Paul Piho
- Department of Mathematics, Imperial College London, London, UK
| | | |
Collapse
|
6
|
Jiao F, Li J, Liu T, Zhu Y, Che W, Bleris L, Jia C. What can we learn when fitting a simple telegraph model to a complex gene expression model? PLoS Comput Biol 2024; 20:e1012118. [PMID: 38743803 PMCID: PMC11125521 DOI: 10.1371/journal.pcbi.1012118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Revised: 05/24/2024] [Accepted: 04/27/2024] [Indexed: 05/16/2024] Open
Abstract
In experiments, the distributions of mRNA or protein numbers in single cells are often fitted to the random telegraph model which includes synthesis and decay of mRNA or protein, and switching of the gene between active and inactive states. While commonly used, this model does not describe how fluctuations are influenced by crucial biological mechanisms such as feedback regulation, non-exponential gene inactivation durations, and multiple gene activation pathways. Here we investigate the dynamical properties of four relatively complex gene expression models by fitting their steady-state mRNA or protein number distributions to the simple telegraph model. We show that despite the underlying complex biological mechanisms, the telegraph model with three effective parameters can accurately capture the steady-state gene product distributions, as well as the conditional distributions in the active gene state, of the complex models. Some effective parameters are reliable and can reflect realistic dynamic behaviors of the complex models, while others may deviate significantly from their real values in the complex models. The effective parameters can also be applied to characterize the capability for a complex model to exhibit multimodality. Using additional information such as single-cell data at multiple time points, we provide an effective method of distinguishing the complex models from the telegraph model. Furthermore, using measurements under varying experimental conditions, we show that fitting the mRNA or protein number distributions to the telegraph model may even reveal the underlying gene regulation mechanisms of the complex models. The effectiveness of these methods is confirmed by analysis of single-cell data for E. coli and mammalian cells. All these results are robust with respect to cooperative transcriptional regulation and extrinsic noise. In particular, we find that faster relaxation speed to the steady state results in more precise parameter inference under large extrinsic noise.
Collapse
Affiliation(s)
- Feng Jiao
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, China
| | - Jing Li
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, China
| | - Ting Liu
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, China
| | - Yifeng Zhu
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, China
| | - Wenhao Che
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, China
| | - Leonidas Bleris
- Bioengineering Department, The University of Texas at Dallas, Richardson, Texas, United States of America
- Center for Systems Biology, The University of Texas at Dallas, Richardson, Texas, United States of America
- Department of Biological Sciences, The University of Texas at Dallas, Richardson, Texas, United States of America
| | - Chen Jia
- Applied and Computational Mathematics Division, Beijing Computational Science Research Center, Beijing, China
| |
Collapse
|
7
|
Bianucci T, Zechner C. A local polynomial moment approximation for compartmentalized biochemical systems. Math Biosci 2024; 367:109110. [PMID: 38035996 DOI: 10.1016/j.mbs.2023.109110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 10/10/2023] [Accepted: 11/15/2023] [Indexed: 12/02/2023]
Abstract
Compartmentalized biochemical reactions are a ubiquitous building block of biological systems. The interplay between chemical and compartmental dynamics can drive rich and complex dynamical behaviors that are difficult to analyze mathematically - especially in the presence of stochasticity. We have recently proposed an effective moment equation approach to study the statistical properties of compartmentalized biochemical systems. So far, however, this approach is limited to polynomial rate laws and moreover, it relies on suitable moment closure approximations, which can be difficult to find in practice. In this work we propose a systematic method to derive closed moment dynamics for compartmentalized biochemical systems. We show that for the considered class of systems, the moment equations involve expectations over functions that factorize into two parts, one depending on the molecular content of the compartments and one depending on the compartment number distribution. Our method exploits this structure and approximates each function with suitable polynomial expansions, leading to a closed system of moment equations. We demonstrate the method using three systems inspired by cell populations and organelle networks and study its accuracy across different dynamical regimes.
Collapse
Affiliation(s)
- Tommaso Bianucci
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307, Dresden, Germany; Center for Systems Biology Dresden, Pfotenhauerstraße 108, 01307, Dresden, Germany; Cluster of Excellence Physics of Life, TU Dresden, Arnoldstraße 18, 01307, Dresden, Germany
| | - Christoph Zechner
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstraße 108, 01307, Dresden, Germany; Center for Systems Biology Dresden, Pfotenhauerstraße 108, 01307, Dresden, Germany; Cluster of Excellence Physics of Life, TU Dresden, Arnoldstraße 18, 01307, Dresden, Germany.
| |
Collapse
|
8
|
Golding I, Amir A. Gene expression in growing cells: A biophysical primer. ARXIV 2023:arXiv:2311.12143v1. [PMID: 38045483 PMCID: PMC10690283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
Cell growth and gene expression, two essential elements of all living systems, have long been the focus of biophysical interrogation. Advances in experimental single-cell methods have invigorated theoretical studies into these processes. However, until recently, there was little dialog between the two areas of study. In particular, most theoretical models for gene regulation assumed gene activity to be oblivious to the progression of the cell cycle between birth and division. But, in fact, there are numerous ways in which the periodic character of all cellular observables can modulate gene expression. The molecular factors required for transcription and translation-RNA polymerase, transcription factors, ribosomes-increase in number during the cell cycle, but are also diluted due to the continuous increase in cell volume. The replication of the genome changes the dosage of those same cellular players but also provides competing targets for regulatory binding. Finally, cell division reduces their number again, and so forth. Stochasticity is inherent to all these biological processes, manifested in fluctuations in the synthesis and degradation of new cellular components as well as the random partitioning of molecules at each cell division event. The notion of gene expression as stationary is thus hard to justify. In this review, we survey the emerging paradigm of cell-cycle regulated gene expression, with an emphasis on the global expression patterns rather than gene-specific regulation. We discuss recent experimental reports where cell growth and gene expression were simultaneously measured in individual cells, providing first glimpses into the coupling between the two, and motivating several questions. How do the levels of gene expression products - mRNA and protein - scale with the cell volume and cell-cycle progression? What are the molecular origins of the observed scaling laws, and when do they break down to yield non-canonical behavior? What are the consequences of cell-cycle dependence for the heterogeneity ("noise") in gene expression within a cell population? While the experimental findings, not surprisingly, differ among genes, organisms, and environmental conditions, several theoretical models have emerged that attempt to reconcile these differences and form a unifying framework for understanding gene expression in growing cells.
Collapse
Affiliation(s)
- Ido Golding
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Department of Microbiology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Ariel Amir
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
- Department of Physics of Complex Systems, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
9
|
Ruess J, Ballif G, Aditya C. Stochastic chemical kinetics of cell fate decision systems: From single cells to populations and back. J Chem Phys 2023; 159:184103. [PMID: 37937934 DOI: 10.1063/5.0160529] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 10/14/2023] [Indexed: 11/09/2023] Open
Abstract
Stochastic chemical kinetics is a widely used formalism for studying stochasticity of chemical reactions inside single cells. Experimental studies of reaction networks are generally performed with cells that are part of a growing population, yet the population context is rarely taken into account when models are developed. Models that neglect the population context lose their validity whenever the studied system influences traits of cells that can be selected in the population, a property that naturally arises in the complex interplay between single-cell and population dynamics of cell fate decision systems. Here, we represent such systems as absorbing continuous-time Markov chains. We show that conditioning on non-absorption allows one to derive a modified master equation that tracks the time evolution of the expected population composition within a growing population. This allows us to derive consistent population dynamics models from a specification of the single-cell process. We use this approach to classify cell fate decision systems into two types that lead to different characteristic phases in emerging population dynamics. Subsequently, we deploy the gained insights to experimentally study a recurrent problem in biology: how to link plasmid copy number fluctuations and plasmid loss events inside single cells to growth of cell populations in dynamically changing environments.
Collapse
Affiliation(s)
- Jakob Ruess
- Inria Saclay, 91120 Palaiseau, France
- Institut Pasteur, Université Paris Cité, 75015 Paris, France
| | | | - Chetan Aditya
- Institut Pasteur, Université Paris Cité, 75015 Paris, France
- Department of Molecular Biology, Princeton University, Princeton, New Jersey 08544, USA
| |
Collapse
|
10
|
Allcroft TJ, Duong JT, Skardal PS, Kovarik ML. Microfluidic single-cell measurements of oxidative stress as a function of cell cycle position. Anal Bioanal Chem 2023; 415:6481-6490. [PMID: 37682313 DOI: 10.1007/s00216-023-04924-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 07/24/2023] [Accepted: 08/28/2023] [Indexed: 09/09/2023]
Abstract
Single-cell measurements routinely demonstrate high levels of variation between cells, but fewer studies provide insight into the analytical and biological sources of this variation. This is particularly true of chemical cytometry, in which individual cells are lysed and their contents separated, compared to more established single-cell measurements of the genome and transcriptome. To characterize population-level variation and its sources, we analyzed oxidative stress levels in 1278 individual Dictyostelium discoideum cells as a function of exogenous stress level and cell cycle position. Cells were exposed to varying levels of oxidative stress via singlet oxygen generation using the photosensitizer Rose Bengal. Single-cell data reproduced the dose-response observed in ensemble measurements by CE-LIF, superimposed with high levels of heterogeneity. Through experiments and data analysis, we explored possible biological sources of this heterogeneity. No trend was observed between population variation and oxidative stress level, but cell cycle position was a major contributor to heterogeneity in oxidative stress. Cells synchronized to the same stage of cell division were less heterogeneous than unsynchronized cells (RSD of 37-51% vs 93%), and mitotic cells had higher levels of reactive oxygen species than interphase cells. While past research has proposed changes in cell size during the cell cycle as a source of biological noise, the measurements presented here use an internal standard to normalize for effects of cell volume, suggesting a more complex contribution of cell cycle to heterogeneity of oxidative stress.
Collapse
|
11
|
Wang X, Li Y, Jia C. Poisson representation: a bridge between discrete and continuous models of stochastic gene regulatory networks. J R Soc Interface 2023; 20:20230467. [PMID: 38016635 PMCID: PMC10684348 DOI: 10.1098/rsif.2023.0467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 11/08/2023] [Indexed: 11/30/2023] Open
Abstract
Stochastic gene expression dynamics can be modelled either discretely or continuously. Previous studies have shown that the mRNA or protein number distributions of some simple discrete and continuous gene expression models are related by Gardiner's Poisson representation. Here, we systematically investigate the Poisson representation in complex stochastic gene regulatory networks. We show that when the gene of interest is unregulated, the discrete and continuous descriptions of stochastic gene expression are always related by the Poisson representation, no matter how complex the model is. This generalizes the results obtained in Dattani & Barahona (Dattani & Barahona 2017 J. R. Soc. Interface 14, 20160833 (doi:10.1098/rsif.2016.0833)). In addition, using a simple counter-example, we find that the Poisson representation in general fails to link the two descriptions when the gene is regulated. However, for a general stochastic gene regulatory network, we demonstrate that the discrete and continuous models are approximately related by the Poisson representation in the limit of large protein numbers. These theoretical results are further applied to analytically solve many complex gene expression models whose exact distributions are previously unknown.
Collapse
Affiliation(s)
- Xinyu Wang
- Applied and Computational Mathematics Division, Beijing Computational Science Research Center, Beijing 100193, People’s Republic of China
| | - Youming Li
- School of Mathematical Sciences, University of Electronic Science and Technology of China, Chengdu 611731, People’s Republic of China
| | - Chen Jia
- Applied and Computational Mathematics Division, Beijing Computational Science Research Center, Beijing 100193, People’s Republic of China
| |
Collapse
|
12
|
Gorin G, Vastola JJ, Pachter L. Studying stochastic systems biology of the cell with single-cell genomics data. Cell Syst 2023; 14:822-843.e22. [PMID: 37751736 PMCID: PMC10725240 DOI: 10.1016/j.cels.2023.08.004] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Revised: 08/16/2023] [Accepted: 08/25/2023] [Indexed: 09/28/2023]
Abstract
Recent experimental developments in genome-wide RNA quantification hold considerable promise for systems biology. However, rigorously probing the biology of living cells requires a unified mathematical framework that accounts for single-molecule biological stochasticity in the context of technical variation associated with genomics assays. We review models for a variety of RNA transcription processes, as well as the encapsulation and library construction steps of microfluidics-based single-cell RNA sequencing, and present a framework to integrate these phenomena by the manipulation of generating functions. Finally, we use simulated scenarios and biological data to illustrate the implications and applications of the approach.
Collapse
Affiliation(s)
- Gennady Gorin
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | - John J Vastola
- Department of Neurobiology, Harvard Medical School, Boston, MA 02115, USA
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA; Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA 91125, USA.
| |
Collapse
|
13
|
Gorin G, Yoshida S, Pachter L. Assessing Markovian and Delay Models for Single-Nucleus RNA Sequencing. Bull Math Biol 2023; 85:114. [PMID: 37828255 DOI: 10.1007/s11538-023-01213-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 09/11/2023] [Indexed: 10/14/2023]
Abstract
The serial nature of reactions involved in the RNA life-cycle motivates the incorporation of delays in models of transcriptional dynamics. The models couple a transcriptional process to a fairly general set of delayed monomolecular reactions with no feedback. We provide numerical strategies for calculating the RNA copy number distributions induced by these models, and solve several systems with splicing, degradation, and catalysis. An analysis of single-cell and single-nucleus RNA sequencing data using these models reveals that the kinetics of nuclear export do not appear to require invocation of a non-Markovian waiting time.
Collapse
Affiliation(s)
- Gennady Gorin
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
| | - Shawn Yoshida
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA.
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, 91125, USA.
| |
Collapse
|
14
|
Tang W, Jørgensen ACS, Marguerat S, Thomas P, Shahrezaei V. Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics. Bioinformatics 2023; 39:btad395. [PMID: 37354494 PMCID: PMC10318389 DOI: 10.1093/bioinformatics/btad395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 05/18/2023] [Accepted: 06/22/2023] [Indexed: 06/26/2023] Open
Abstract
MOTIVATION Gene expression is characterized by stochastic bursts of transcription that occur at brief and random periods of promoter activity. The kinetics of gene expression burstiness differs across the genome and is dependent on the promoter sequence, among other factors. Single-cell RNA sequencing (scRNA-seq) has made it possible to quantify the cell-to-cell variability in transcription at a global genome-wide level. However, scRNA-seq data are prone to technical variability, including low and variable capture efficiency of transcripts from individual cells. RESULTS Here, we propose a novel mathematical theory for the observed variability in scRNA-seq data. Our method captures burst kinetics and variability in both the cell size and capture efficiency, which allows us to propose several likelihood-based and simulation-based methods for the inference of burst kinetics from scRNA-seq data. Using both synthetic and real data, we show that the simulation-based methods provide an accurate, robust and flexible tool for inferring burst kinetics from scRNA-seq data. In particular, in a supervised manner, a simulation-based inference method based on neural networks proves to be accurate and useful when applied to both allele and nonallele-specific scRNA-seq data. AVAILABILITY AND IMPLEMENTATION The code for Neural Network and Approximate Bayesian Computation inference is available at https://github.com/WT215/nnRNA and https://github.com/WT215/Julia_ABC, respectively.
Collapse
Affiliation(s)
- Wenhao Tang
- Department of Mathematics, Imperial College London, London SW7 2BX, United Kingdom
| | - Andreas Christ Sølvsten Jørgensen
- Department of Mathematics, Imperial College London, London SW7 2BX, United Kingdom
- I-X Centre for AI in Science, Imperial College London, White City Campus, London W12 0BZ, United Kingdom
| | - Samuel Marguerat
- MRC London Institute of Medical Sciences (LMS), London W12 0NN, United Kingdom
- Institute of Clinical Sciences (ICS), Faculty of Medicine, Imperial College London, London W12 0NN, United Kingdom
| | - Philipp Thomas
- Department of Mathematics, Imperial College London, London SW7 2BX, United Kingdom
| | - Vahid Shahrezaei
- Department of Mathematics, Imperial College London, London SW7 2BX, United Kingdom
| |
Collapse
|
15
|
Nieto C, Blanco SC, Vargas-García C, Singh A, Manuel PJ. PyEcoLib: a python library for simulating stochastic cell size dynamics. Phys Biol 2023; 20:10.1088/1478-3975/acd897. [PMID: 37224818 PMCID: PMC10665115 DOI: 10.1088/1478-3975/acd897] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Accepted: 05/24/2023] [Indexed: 05/26/2023]
Abstract
Recently, there has been an increasing need for tools to simulate cell size regulation due to important applications in cell proliferation and gene expression. However, implementing the simulation usually presents some difficulties, as the division has a cycle-dependent occurrence rate. In this article, we gather a recent theoretical framework inPyEcoLib, a python-based library to simulate the stochastic dynamics of the size of bacterial cells. This library can simulate cell size trajectories with an arbitrarily small sampling period. In addition, this simulator can include stochastic variables, such as the cell size at the beginning of the experiment, the cycle duration timing, the growth rate, and the splitting position. Furthermore, from a population perspective, the user can choose between tracking a single lineage or all cells in a colony. They can also simulate the most common division strategies (adder, timer, and sizer) using the division rate formalism and numerical methods. As an example of PyecoLib applications, we explain how to couple size dynamics with gene expression predicting, from simulations, how the noise in protein levels increases by increasing the noise in division timing, the noise in growth rate and the noise in cell splitting position. The simplicity of this library and its transparency about the underlying theoretical framework yield the inclusion of cell size stochasticity in complex models of gene expression.
Collapse
Affiliation(s)
- César Nieto
- Department of Electrical and Computer Engineering, University of Delaware, Newark, DE 19716, United States of America
- Department of Physics. Universidad de los Andes, Bogotá, Colombia
| | - Sergio Camilo Blanco
- Department of Mathematics and Engineering. Fundacion Universitaria Konrad Lorenz, Bogota, Colombia
| | | | - Abhyudai Singh
- Department of Electrical and Computer Engineering, Department of Biomedical Engineering and Department of Mathematical Sciences, University of Delaware, Newark, DE 19716, United States of America
| | | |
Collapse
|
16
|
Gorin G, Vastola JJ, Pachter L. Studying stochastic systems biology of the cell with single-cell genomics data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.17.541250. [PMID: 37292934 PMCID: PMC10245677 DOI: 10.1101/2023.05.17.541250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Recent experimental developments in genome-wide RNA quantification hold considerable promise for systems biology. However, rigorously probing the biology of living cells requires a unified mathematical framework that accounts for single-molecule biological stochasticity in the context of technical variation associated with genomics assays. We review models for a variety of RNA transcription processes, as well as the encapsulation and library construction steps of microfluidics-based single-cell RNA sequencing, and present a framework to integrate these phenomena by the manipulation of generating functions. Finally, we use simulated scenarios and biological data to illustrate the implications and applications of the approach.
Collapse
Affiliation(s)
- Gennady Gorin
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA, 91125
| | - John J. Vastola
- Department of Neurobiology, Harvard Medical School, Boston, MA, 02115
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, 91125
| |
Collapse
|
17
|
Fan R, Hilfinger A. The effect of microRNA on protein variability and gene expression fidelity. Biophys J 2023; 122:905-923. [PMID: 36698314 PMCID: PMC10027439 DOI: 10.1016/j.bpj.2023.01.027] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2022] [Revised: 12/23/2022] [Accepted: 01/20/2023] [Indexed: 01/27/2023] Open
Abstract
Small regulatory RNA molecules such as microRNA modulate gene expression through inhibiting the translation of messenger RNA (mRNA). Such posttranscriptional regulation has been recently hypothesized to reduce the stochastic variability of gene expression around average levels. Here, we quantify noise in stochastic gene expression models with and without such regulation. Our results suggest that silencing mRNA posttranscriptionally will always increase, rather than decrease, gene expression noise when the silencing of mRNA also increases its degradation, as is expected for microRNA interactions with mRNA. In that regime, we also find that silencing mRNA generally reduces the fidelity of signal transmission from deterministically varying upstream factors to protein levels. These findings suggest that microRNA binding to mRNA does not generically confer precision to protein expression.
Collapse
Affiliation(s)
- Raymond Fan
- Department of Physics, University of Toronto, Toronto, Ontario, Canada; Department of Chemical & Physical Sciences, University of Toronto, Mississauga, Ontario, Canada.
| | - Andreas Hilfinger
- Department of Physics, University of Toronto, Toronto, Ontario, Canada; Department of Chemical & Physical Sciences, University of Toronto, Mississauga, Ontario, Canada; Department of Cell & Systems Biology, University of Toronto, , Toronto, Ontario, Canada; Department of Mathematics, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
18
|
Lunz D, Bonnans JF, Ruess J. Optimal control of bioproduction in the presence of population heterogeneity. J Math Biol 2023; 86:43. [PMID: 36745224 DOI: 10.1007/s00285-023-01876-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 01/08/2023] [Accepted: 01/18/2023] [Indexed: 02/07/2023]
Abstract
Cell-to-cell variability, born of stochastic chemical kinetics, persists even in large isogenic populations. In the study of single-cell dynamics this is typically accounted for. However, on the population level this source of heterogeneity is often sidelined to avoid the inevitable complexity it introduces. The homogeneous models used instead are more tractable but risk disagreeing with their heterogeneous counterparts and may thus lead to severely suboptimal control of bioproduction. In this work, we introduce a comprehensive mathematical framework for solving bioproduction optimal control problems in the presence of heterogeneity. We study population-level models in which such heterogeneity is retained, and propose order-reduction approximation techniques. The reduced-order models take forms typical of homogeneous bioproduction models, making them a useful benchmark by which to study the importance of heterogeneity. Moreover, the derivation from the heterogeneous setting sheds light on parameter selection in ways a direct homogeneous outlook cannot, and reveals the source of approximation error. With view to optimally controlling bioproduction in microbial communities, we ask the question: when does optimising the reduced-order models produce strategies that work well in the presence of population heterogeneity? We show that, in some cases, homogeneous approximations provide remarkably accurate surrogate models. Nevertheless, we also demonstrate that this is not uniformly true: overlooking the heterogeneity can lead to significantly suboptimal control strategies. In these cases, the heterogeneous tools and perspective are crucial to optimise bioproduction.
Collapse
Affiliation(s)
- Davin Lunz
- Inria Paris, 2 Rue Simone Iff, 75012, Paris, France. .,Institut Pasteur, 28 Rue du Docteur Roux, 75015, Paris, France.
| | - J Frédéric Bonnans
- CNRS, CentraleSupélec, Inria, Laboratory of Signals and Systems, Université Paris-Saclay, 91190, Gif-sur-Yvette, France
| | - Jakob Ruess
- Inria Paris, 2 Rue Simone Iff, 75012, Paris, France.,Institut Pasteur, 28 Rue du Docteur Roux, 75015, Paris, France
| |
Collapse
|
19
|
Jia C, Grima R. Coupling gene expression dynamics to cell size dynamics and cell cycle events: Exact and approximate solutions of the extended telegraph model. iScience 2023; 26:105746. [PMID: 36619980 PMCID: PMC9813732 DOI: 10.1016/j.isci.2022.105746] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 11/02/2022] [Accepted: 12/02/2022] [Indexed: 12/12/2022] Open
Abstract
The standard model describing the fluctuations of mRNA numbers in single cells is the telegraph model which includes synthesis and degradation of mRNA, and switching of the gene between active and inactive states. While commonly used, this model does not describe how fluctuations are influenced by the cell cycle phase, cellular growth and division, and other crucial aspects of cellular biology. Here, we derive the analytical time-dependent solution of an extended telegraph model that explicitly considers the doubling of gene copy numbers upon DNA replication, dependence of the mRNA synthesis rate on cellular volume, gene dosage compensation, partitioning of molecules during cell division, cell-cycle duration variability, and cell-size control strategies. Based on the time-dependent solution, we obtain the analytical distributions of transcript numbers for lineage and population measurements in steady-state growth and also find a linear relation between the Fano factor of mRNA fluctuations and cell volume fluctuations. We show that generally the lineage and population distributions in steady-state growth cannot be accurately approximated by the steady-state solution of extrinsic noise models, i.e. a telegraph model with parameters drawn from probability distributions. This is because the mRNA lifetime is often not small enough compared to the cell cycle duration to erase the memory of division and replication. Accurate approximations are possible when this memory is weak, e.g. for genes with bursty expression and for which there is sufficient gene dosage compensation when replication occurs.
Collapse
Affiliation(s)
- Chen Jia
- Applied and Computational Mathematics Division, Beijing Computational Science Research Center, Beijing 100193, China
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, UK
| |
Collapse
|
20
|
Gorin G, Vastola JJ, Fang M, Pachter L. Interpretable and tractable models of transcriptional noise for the rational design of single-molecule quantification experiments. Nat Commun 2022; 13:7620. [PMID: 36494337 PMCID: PMC9734650 DOI: 10.1038/s41467-022-34857-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Accepted: 11/09/2022] [Indexed: 12/13/2022] Open
Abstract
The question of how cell-to-cell differences in transcription rate affect RNA count distributions is fundamental for understanding biological processes underlying transcription. Answering this question requires quantitative models that are both interpretable (describing concrete biophysical phenomena) and tractable (amenable to mathematical analysis). This enables the identification of experiments which best discriminate between competing hypotheses. As a proof of principle, we introduce a simple but flexible class of models involving a continuous stochastic transcription rate driving a discrete RNA transcription and splicing process, and compare and contrast two biologically plausible hypotheses about transcription rate variation. One assumes variation is due to DNA experiencing mechanical strain, while the other assumes it is due to regulator number fluctuations. We introduce a framework for numerically and analytically studying such models, and apply Bayesian model selection to identify candidate genes that show signatures of each model in single-cell transcriptomic data from mouse glutamatergic neurons.
Collapse
Affiliation(s)
- Gennady Gorin
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
| | - John J Vastola
- Department of Neurobiology, Harvard Medical School, Boston, MA, 02115, USA
| | - Meichen Fang
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA.
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, 91125, USA.
| |
Collapse
|
21
|
Concentration fluctuations in growing and dividing cells: Insights into the emergence of concentration homeostasis. PLoS Comput Biol 2022; 18:e1010574. [PMID: 36194626 PMCID: PMC9565450 DOI: 10.1371/journal.pcbi.1010574] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Revised: 10/14/2022] [Accepted: 09/14/2022] [Indexed: 11/19/2022] Open
Abstract
Intracellular reaction rates depend on concentrations and hence their levels are often regulated. However classical models of stochastic gene expression lack a cell size description and cannot be used to predict noise in concentrations. Here, we construct a model of gene product dynamics that includes a description of cell growth, cell division, size-dependent gene expression, gene dosage compensation, and size control mechanisms that can vary with the cell cycle phase. We obtain expressions for the approximate distributions and power spectra of concentration fluctuations which lead to insight into the emergence of concentration homeostasis. We find that (i) the conditions necessary to suppress cell division-induced concentration oscillations are difficult to achieve; (ii) mRNA concentration and number distributions can have different number of modes; (iii) two-layer size control strategies such as sizer-timer or adder-timer are ideal because they maintain constant mean concentrations whilst minimising concentration noise; (iv) accurate concentration homeostasis requires a fine tuning of dosage compensation, replication timing, and size-dependent gene expression; (v) deviations from perfect concentration homeostasis show up as deviations of the concentration distribution from a gamma distribution. Some of these predictions are confirmed using data for E. coli, fission yeast, and budding yeast.
Collapse
|
22
|
Quantifying biochemical reaction rates from static population variability within incompletely observed complex networks. PLoS Comput Biol 2022; 18:e1010183. [PMID: 35731728 PMCID: PMC9216546 DOI: 10.1371/journal.pcbi.1010183] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 05/07/2022] [Indexed: 11/19/2022] Open
Abstract
Quantifying biochemical reaction rates within complex cellular processes remains a key challenge of systems biology even as high-throughput single-cell data have become available to characterize snapshots of population variability. That is because complex systems with stochastic and non-linear interactions are difficult to analyze when not all components can be observed simultaneously and systems cannot be followed over time. Instead of using descriptive statistical models, we show that incompletely specified mechanistic models can be used to translate qualitative knowledge of interactions into reaction rate functions from covariability data between pairs of components. This promises to turn a globally intractable problem into a sequence of solvable inference problems to quantify complex interaction networks from incomplete snapshots of their stochastic fluctuations.
Collapse
|
23
|
Ciechonska M, Sturrock M, Grob A, Larrouy-Maumus G, Shahrezaei V, Isalan M. Emergent expression of fitness-conferring genes by phenotypic selection. PNAS NEXUS 2022; 1:pgac069. [PMID: 36741458 PMCID: PMC9896880 DOI: 10.1093/pnasnexus/pgac069] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/20/2021] [Accepted: 05/23/2022] [Indexed: 02/07/2023]
Abstract
Genotypic and phenotypic adaptation is the consequence of ongoing natural selection in populations and is key to predicting and preventing drug resistance. Whereas classic antibiotic persistence is all-or-nothing, here we demonstrate that an antibiotic resistance gene displays linear dose-responsive selection for increased expression in proportion to rising antibiotic concentration in growing Escherichia coli populations. Furthermore, we report the potentially wide-spread nature of this form of emergent gene expression (EGE) by instantaneous phenotypic selection process under bactericidal and bacteriostatic antibiotic treatment, as well as an amino acid synthesis pathway enzyme under a range of auxotrophic conditions. We propose an analogy to Ohm's law in electricity (V = IR), where selection pressure acts similarly to voltage (V), gene expression to current (I), and resistance (R) to cellular machinery constraints and costs. Lastly, mathematical modeling using agent-based models of stochastic gene expression in growing populations and Bayesian model selection reveal that the EGE mechanism requires variability in gene expression within an isogenic population, and a cellular "memory" from positive feedbacks between growth and expression of any fitness-conferring gene. Finally, we discuss the connection of the observed phenomenon to a previously described general fluctuation-response relationship in biology.
Collapse
Affiliation(s)
| | | | - Alice Grob
- Department of Life Sciences, Imperial College London, London SW7 2AZ, UK
| | | | | | | |
Collapse
|
24
|
Mechanisms of cellular mRNA transcript homeostasis. Trends Cell Biol 2022; 32:655-668. [PMID: 35660047 DOI: 10.1016/j.tcb.2022.05.003] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Revised: 05/08/2022] [Accepted: 05/09/2022] [Indexed: 11/20/2022]
Abstract
For most genes, mRNA transcript abundance scales with cell size to ensure a constant concentration. Scaling of mRNA synthesis rates with cell size plays an important role, with regulation of the activity and abundance of RNA polymerase II (Pol II) now emerging as a key point of control. However, there is also considerable evidence for feedback mechanisms that kinetically couple the rates of mRNA synthesis, nuclear export, and degradation to allow cells to compensate for changes in one by adjusting the others. Researchers are beginning to integrate results from these different fields to reveal the mechanisms underlying transcript homeostasis. This will be crucial for moving beyond our current understanding of relative gene expression towards an appreciation of how absolute transcript levels are linked to other aspects of the cellular phenotype.
Collapse
|