1
|
Tang J, Du W, Shu Z, Cao Z. A generative benchmark for evaluating the performance of fluorescent cell image segmentation. Synth Syst Biotechnol 2024; 9:627-637. [PMID: 38798889 PMCID: PMC11127598 DOI: 10.1016/j.synbio.2024.05.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/25/2023] [Revised: 04/13/2024] [Accepted: 05/08/2024] [Indexed: 05/29/2024] Open
Abstract
Fluorescent cell imaging technology is fundamental in life science research, offering a rich source of image data crucial for understanding cell spatial positioning, differentiation, and decision-making mechanisms. As the volume of this data expands, precise image analysis becomes increasingly critical. Cell segmentation, a key analysis step, significantly influences quantitative analysis outcomes. However, selecting the most effective segmentation method is challenging, hindered by existing evaluation methods' inaccuracies, lack of graded evaluation, and narrow assessment scope. Addressing this, we developed a novel framework with two modules: StyleGAN2-based contour generation and Pix2PixHD-based image rendering, producing diverse, graded-density cell images. Using this dataset, we evaluated three leading cell segmentation methods: DeepCell, CellProfiler, and CellPose. Our comprehensive comparison revealed CellProfiler's superior accuracy in segmenting cytoplasm and nuclei. Our framework diversifies cell image data generation and systematically addresses evaluation challenges in cell segmentation technologies, establishing a solid foundation for advancing research and applications in cell image analysis.
Collapse
Affiliation(s)
- Jun Tang
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, 200237, China
- MOE Key Laboratory of Smart Manufacturing in Energy Chemical Process, East China University of Science and Technology, Shanghai, 200237, China
| | - Wei Du
- MOE Key Laboratory of Smart Manufacturing in Energy Chemical Process, East China University of Science and Technology, Shanghai, 200237, China
| | - Zhanpeng Shu
- College of Electrical Engineering, Shanghai Dianji University, Shanghai, 201306, China
| | - Zhixing Cao
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, 200237, China
| |
Collapse
|
2
|
Ma M, Szavits-Nossan J, Singh A, Grima R. Analysis of a detailed multi-stage model of stochastic gene expression using queueing theory and model reduction. Math Biosci 2024; 373:109204. [PMID: 38710441 DOI: 10.1016/j.mbs.2024.109204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 04/03/2024] [Accepted: 04/29/2024] [Indexed: 05/08/2024]
Abstract
We introduce a biologically detailed, stochastic model of gene expression describing the multiple rate-limiting steps of transcription, nuclear pre-mRNA processing, nuclear mRNA export, cytoplasmic mRNA degradation and translation of mRNA into protein. The processes in sub-cellular compartments are described by an arbitrary number of processing stages, thus accounting for a significantly finer molecular description of gene expression than conventional models such as the telegraph, two-stage and three-stage models of gene expression. We use two distinct tools, queueing theory and model reduction using the slow-scale linear-noise approximation, to derive exact or approximate analytic expressions for the moments or distributions of nuclear mRNA, cytoplasmic mRNA and protein fluctuations, as well as lower bounds for their Fano factors in steady-state conditions. We use these to study the phase diagram of the stochastic model; in particular we derive parametric conditions determining three types of transitions in the properties of mRNA fluctuations: from sub-Poissonian to super-Poissonian noise, from high noise in the nucleus to high noise in the cytoplasm, and from a monotonic increase to a monotonic decrease of the Fano factor with the number of processing stages. In contrast, protein fluctuations are always super-Poissonian and show weak dependence on the number of mRNA processing stages. Our results delineate the region of parameter space where conventional models give qualitatively incorrect results and provide insight into how the number of processing stages, e.g. the number of rate-limiting steps in initiation, splicing and mRNA degradation, shape stochastic gene expression by modulation of molecular memory.
Collapse
Affiliation(s)
- Muhan Ma
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3BF, UK
| | | | - Abhyudai Singh
- Department of Electrical and Computer Engineering, University of Delaware, Newark DE 19716, USA
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3BF, UK.
| |
Collapse
|
3
|
Ham L, Coomer MA, Öcal K, Grima R, Stumpf MPH. A stochastic vs deterministic perspective on the timing of cellular events. Nat Commun 2024; 15:5286. [PMID: 38902228 PMCID: PMC11190182 DOI: 10.1038/s41467-024-49624-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Accepted: 06/12/2024] [Indexed: 06/22/2024] Open
Abstract
Cells are the fundamental units of life, and like all life forms, they change over time. Changes in cell state are driven by molecular processes; of these many are initiated when molecule numbers reach and exceed specific thresholds, a characteristic that can be described as "digital cellular logic". Here we show how molecular and cellular noise profoundly influence the time to cross a critical threshold-the first-passage time-and map out scenarios in which stochastic dynamics result in shorter or longer average first-passage times compared to noise-less dynamics. We illustrate the dependence of the mean first-passage time on noise for a set of exemplar models of gene expression, auto-regulatory feedback control, and enzyme-mediated catalysis. Our theory provides intuitive insight into the origin of these effects and underscores two important insights: (i) deterministic predictions for cellular event timing can be highly inaccurate when molecule numbers are within the range known for many cells; (ii) molecular noise can significantly shift mean first-passage times, particularly within auto-regulatory genetic feedback circuits.
Collapse
Affiliation(s)
- Lucy Ham
- School of BioSciences, University of Melbourne, Parkville, Australia
- School of Mathematics and Statistics, University of Melbourne, Parkville, Australia
| | - Megan A Coomer
- School of BioSciences, University of Melbourne, Parkville, Australia
- School of Mathematics and Statistics, University of Melbourne, Parkville, Australia
| | - Kaan Öcal
- School of Informatics, University of Edinburgh, Edinburgh, UK
- School of BioSciences, University of Melbourne, Parkville, Australia
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - Michael P H Stumpf
- School of BioSciences, University of Melbourne, Parkville, Australia.
- School of Mathematics and Statistics, University of Melbourne, Parkville, Australia.
| |
Collapse
|
4
|
Fang Z, Gupta A, Kumar S, Khammash M. Advanced methods for gene network identification and noise decomposition from single-cell data. Nat Commun 2024; 15:4911. [PMID: 38851792 PMCID: PMC11162465 DOI: 10.1038/s41467-024-49177-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Accepted: 05/24/2024] [Indexed: 06/10/2024] Open
Abstract
Central to analyzing noisy gene expression systems is solving the Chemical Master Equation (CME), which characterizes the probability evolution of the reacting species' copy numbers. Solving CMEs for high-dimensional systems suffers from the curse of dimensionality. Here, we propose a computational method for improved scalability through a divide-and-conquer strategy that optimally decomposes the whole system into a leader system and several conditionally independent follower subsystems. The CME is solved by combining Monte Carlo estimation for the leader system with stochastic filtering procedures for the follower subsystems. We demonstrate this method with high-dimensional numerical examples and apply it to identify a yeast transcription system at the single-cell resolution, leveraging mRNA time-course experimental data. The identification results enable an accurate examination of the heterogeneity in rate parameters among isogenic cells. To validate this result, we develop a noise decomposition technique exploiting time-course data but requiring no supplementary components, e.g., dual-reporters.
Collapse
Affiliation(s)
- Zhou Fang
- Department of Biosystems Science and Engineering, ETH Zurich, CH-4056, Basel, Switzerland
| | - Ankit Gupta
- Department of Biosystems Science and Engineering, ETH Zurich, CH-4056, Basel, Switzerland
| | - Sant Kumar
- Department of Biosystems Science and Engineering, ETH Zurich, CH-4056, Basel, Switzerland
| | - Mustafa Khammash
- Department of Biosystems Science and Engineering, ETH Zurich, CH-4056, Basel, Switzerland.
| |
Collapse
|
5
|
Piho P, Thomas P. Feedback between stochastic gene networks and population dynamics enables cellular decision-making. SCIENCE ADVANCES 2024; 10:eadl4895. [PMID: 38787956 PMCID: PMC11122677 DOI: 10.1126/sciadv.adl4895] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 04/24/2024] [Indexed: 05/26/2024]
Abstract
Phenotypic selection occurs when genetically identical cells are subject to different reproductive abilities due to cellular noise. Such noise arises from fluctuations in reactions synthesizing proteins and plays a crucial role in how cells make decisions and respond to stress or drugs. We propose a general stochastic agent-based model for growing populations capturing the feedback between gene expression and cell division dynamics. We devise a finite state projection approach to analyze gene expression and division distributions and infer selection from single-cell data in mother machines and lineage trees. We use the theory to quantify selection in multi-stable gene expression networks and elucidate that the trade-off between phenotypic switching and selection enables robust decision-making essential for synthetic circuits and developmental lineage decisions. Using live-cell data, we demonstrate that combining theory and inference provides quantitative insights into bet-hedging-like response to DNA damage and adaptation during antibiotic exposure in Escherichia coli.
Collapse
Affiliation(s)
- Paul Piho
- Department of Mathematics, Imperial College London, London, UK
| | | |
Collapse
|
6
|
Miles CE, McKinley SA, Ding F, Lehoucq RB. Inferring Stochastic Rates from Heterogeneous Snapshots of Particle Positions. Bull Math Biol 2024; 86:74. [PMID: 38740619 DOI: 10.1007/s11538-024-01301-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Accepted: 04/20/2024] [Indexed: 05/16/2024]
Abstract
Many imaging techniques for biological systems-like fixation of cells coupled with fluorescence microscopy-provide sharp spatial resolution in reporting locations of individuals at a single moment in time but also destroy the dynamics they intend to capture. These snapshot observations contain no information about individual trajectories, but still encode information about movement and demographic dynamics, especially when combined with a well-motivated biophysical model. The relationship between spatially evolving populations and single-moment representations of their collective locations is well-established with partial differential equations (PDEs) and their inverse problems. However, experimental data is commonly a set of locations whose number is insufficient to approximate a continuous-in-space PDE solution. Here, motivated by popular subcellular imaging data of gene expression, we embrace the stochastic nature of the data and investigate the mathematical foundations of parametrically inferring demographic rates from snapshots of particles undergoing birth, diffusion, and death in a nuclear or cellular domain. Toward inference, we rigorously derive a connection between individual particle paths and their presentation as a Poisson spatial process. Using this framework, we investigate the properties of the resulting inverse problem and study factors that affect quality of inference. One pervasive feature of this experimental regime is the presence of cell-to-cell heterogeneity. Rather than being a hindrance, we show that cell-to-cell geometric heterogeneity can increase the quality of inference on dynamics for certain parameter regimes. Altogether, the results serve as a basis for more detailed investigations of subcellular spatial patterns of RNA molecules and other stochastically evolving populations that can only be observed for single instants in their time evolution.
Collapse
Affiliation(s)
| | - Scott A McKinley
- Department of Mathematics, Tulane University, New Orleans, LA, USA
| | - Fangyuan Ding
- Departments of Biomedical Engineering, Developmental and Cell Biology, University of California, Irvine, Irvine, USA
| | - Richard B Lehoucq
- Discrete Math and Optimization, Sandia National Laboratories, Albuquerque, NM, USA
| |
Collapse
|
7
|
Banerjee M, Srivastava S, Rai SN, States JC. Chronic arsenic exposure induces malignant transformation of human HaCaT cells through both deterministic and stochastic changes in transcriptome expression. Toxicol Appl Pharmacol 2024; 484:116865. [PMID: 38373578 PMCID: PMC10994602 DOI: 10.1016/j.taap.2024.116865] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Revised: 02/11/2024] [Accepted: 02/13/2024] [Indexed: 02/21/2024]
Abstract
Biological processes are inherently stochastic, i.e., are partially driven by hard to predict random probabilistic processes. Carcinogenesis is driven both by stochastic and deterministic (predictable non-random) changes. However, very few studies systematically examine the contribution of stochastic events leading to cancer development. In differential gene expression studies, the established data analysis paradigms incentivize expression changes that are uniformly different across the experimental versus control groups, introducing preferential inclusion of deterministic changes at the expense of stochastic processes that might also play a crucial role in the process of carcinogenesis. In this study, we applied simple computational techniques to quantify: (i) The impact of chronic arsenic (iAs) exposure as well as passaging time on stochastic gene expression and (ii) Which genes were expressed deterministically and which were expressed stochastically at each of the three stages of cancer development. Using biological coefficient of variation as an empirical measure of stochasticity we demonstrate that chronic iAs exposure consistently suppressed passaging related stochastic gene expression at multiple time points tested, selecting for a homogenous cell population that undergo transformation. Employing multiple balanced removal of outlier data, we show that chronic iAs exposure induced deterministic and stochastic changes in the expression of unique set of genes, that populate largely unique biological pathways. Together, our data unequivocally demonstrate that both deterministic and stochastic changes in transcriptome-wide expression are critical in driving biological processes, pathways and networks towards clonal selection, carcinogenesis, and tumor heterogeneity.
Collapse
Affiliation(s)
- Mayukh Banerjee
- Department of Pharmacology and Toxicology, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA; Center for Integrative Environmental Health Sciences, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA
| | - Sudhir Srivastava
- Department of Bioinformatics and Biostatistics, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA
| | - Shesh N Rai
- Department of Bioinformatics and Biostatistics, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA; Biostatistics and Bioinformatics Facility, James Graham Brown Cancer Center, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA; Biostatistics and Informatics Facility Core, Center for Integrative Environmental Health Sciences, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA
| | - J Christopher States
- Department of Pharmacology and Toxicology, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA; Center for Integrative Environmental Health Sciences, University of Louisville, 505, S. Hancock Street, Louisville, KY 40202, USA.
| |
Collapse
|
8
|
Zhang C, Jiao F. Using steady-state formula to estimate time-dependent parameters of stochastic gene transcription models. Biosystems 2024; 236:105128. [PMID: 38280446 DOI: 10.1016/j.biosystems.2024.105128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Revised: 01/20/2024] [Accepted: 01/21/2024] [Indexed: 01/29/2024]
Abstract
When studying stochastic gene transcription, it is important to understand how system parameters are temporally modulated in response to varying environments. Experimentally, the dynamic distribution data of RNA copy numbers measured at multiple time points are often fitted to stochastic transcription models to estimate time-dependent parameters. However, current methods require determining which parameters are time-dependent, as well as their analytical formulas, before the optimal fit. In this study, we developed a method to estimate time-dependent parameters in a classical two-state model without prior assumptions regarding the system parameters. At each measured time point, the method fitted the dynamic distribution data using a steady-state distribution formula, in which the estimated constant parameters were approximated as time-dependent parameter values at the measured time point. The accuracy of this method can be guaranteed for RNA molecules with relatively high degradation rates and genes with relatively slow responses to induction. We quantify the accuracy of the method and implemented this method on two sets of dynamic distribution data from prokaryotic and eukaryotic cells, and revealed the temporal modulation of transcription burst size in response to environmental changes.
Collapse
Affiliation(s)
- Congrun Zhang
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, 510006, PR China; College of Mathematics and Information Sciences, Guangzhou University, Guangzhou 51006, China
| | - Feng Jiao
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, 510006, PR China; College of Mathematics and Information Sciences, Guangzhou University, Guangzhou 51006, China.
| |
Collapse
|
9
|
Grima R, Esmenjaud PM. Quantifying and correcting bias in transcriptional parameter inference from single-cell data. Biophys J 2024; 123:4-30. [PMID: 37885177 PMCID: PMC10808030 DOI: 10.1016/j.bpj.2023.10.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 09/12/2023] [Accepted: 10/19/2023] [Indexed: 10/28/2023] Open
Abstract
The snapshot distribution of mRNA counts per cell can be measured using single-molecule fluorescence in situ hybridization or single-cell RNA sequencing. These distributions are often fit to the steady-state distribution of the two-state telegraph model to estimate the three transcriptional parameters for a gene of interest: mRNA synthesis rate, the switching on rate (the on state being the active transcriptional state), and the switching off rate. This model assumes no extrinsic noise, i.e., parameters do not vary between cells, and thus estimated parameters are to be understood as approximating the average values in a population. The accuracy of this approximation is currently unclear. Here, we develop a theory that explains the size and sign of estimation bias when inferring parameters from single-cell data using the standard telegraph model. We find specific bias signatures depending on the source of extrinsic noise (which parameter is most variable across cells) and the mode of transcriptional activity. If gene expression is not bursty then the population averages of all three parameters are overestimated if extrinsic noise is in the synthesis rate; underestimation occurs if extrinsic noise is in the switching on rate; both underestimation and overestimation can occur if extrinsic noise is in the switching off rate. We find that some estimated parameters tend to infinity as the size of extrinsic noise approaches a critical threshold. In contrast when gene expression is bursty, we find that in all cases the mean burst size (ratio of the synthesis rate to the switching off rate) is overestimated while the mean burst frequency (the switching on rate) is underestimated. We estimate the size of extrinsic noise from the covariance matrix of sequencing data and use this together with our theory to correct published estimates of transcriptional parameters for mammalian genes.
Collapse
Affiliation(s)
- Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom.
| | - Pierre-Marie Esmenjaud
- Biology Department, Ecole Polytechnique, Institut Polytechnique de Paris, Palaiseau, France
| |
Collapse
|
10
|
Hong L, Wang Z, Zhang Z, Luo S, Zhou T, Zhang J. Phase separation reduces cell-to-cell variability of transcriptional bursting. Math Biosci 2024; 367:109127. [PMID: 38070763 DOI: 10.1016/j.mbs.2023.109127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Revised: 12/05/2023] [Accepted: 12/06/2023] [Indexed: 12/25/2023]
Abstract
Gene expression is a stochastic and noisy process often occurring in "bursts". Experiments have shown that the compartmentalization of proteins by liquid-liquid phase separation is conducive to reducing the noise of gene expression. Therefore, an important goal is to explore the role of bursts in phase separation noise reduction processes. We propose a coupled model that includes phase separation and a two-state gene expression process. Using the timescale separation method, we obtain approximate solutions for the expectation, variance, and noise strength of the dilute phase. We find that a higher burst frequency weakens the ability of noise reduction by phase separation, but as the burst size increases, this ability first increases and then decreases. This study provides a deeper understanding of phase separation to reduce noise in the stochastic gene expression with burst kinetics.
Collapse
Affiliation(s)
- Lijun Hong
- Guangdong Province Key Laboratory of Computational Science, Sun Yat-sen University, Guangzhou 510275, PR China; School of Mathematics, Sun Yat-sen University, Guangzhou, Guangdong Province, 510275, PR China
| | - Zihao Wang
- Guangdong Province Key Laboratory of Computational Science, Sun Yat-sen University, Guangzhou 510275, PR China; School of Mathematics, Sun Yat-sen University, Guangzhou, Guangdong Province, 510275, PR China
| | - Zhenquan Zhang
- Guangdong Province Key Laboratory of Computational Science, Sun Yat-sen University, Guangzhou 510275, PR China; School of Mathematics, Sun Yat-sen University, Guangzhou, Guangdong Province, 510275, PR China
| | - Songhao Luo
- Guangdong Province Key Laboratory of Computational Science, Sun Yat-sen University, Guangzhou 510275, PR China; School of Mathematics, Sun Yat-sen University, Guangzhou, Guangdong Province, 510275, PR China; Department of Mathematics, University of California, Irvine, Irvine, CA 92697, USA
| | - Tianshou Zhou
- Guangdong Province Key Laboratory of Computational Science, Sun Yat-sen University, Guangzhou 510275, PR China; School of Mathematics, Sun Yat-sen University, Guangzhou, Guangdong Province, 510275, PR China
| | - Jiajun Zhang
- Guangdong Province Key Laboratory of Computational Science, Sun Yat-sen University, Guangzhou 510275, PR China; School of Mathematics, Sun Yat-sen University, Guangzhou, Guangdong Province, 510275, PR China.
| |
Collapse
|
11
|
Wang Y, Yu Z, Grima R, Cao Z. Exact solution of a three-stage model of stochastic gene expression including cell-cycle dynamics. J Chem Phys 2023; 159:224102. [PMID: 38063222 DOI: 10.1063/5.0173742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 10/04/2023] [Indexed: 12/18/2023] Open
Abstract
The classical three-stage model of stochastic gene expression predicts the statistics of single cell mRNA and protein number fluctuations as a function of the rates of promoter switching, transcription, translation, degradation and dilution. While this model is easily simulated, its analytical solution remains an unsolved problem. Here we modify this model to explicitly include cell-cycle dynamics and then derive an exact solution for the time-dependent joint distribution of mRNA and protein numbers. We show large differences between this model and the classical model which captures cell-cycle effects implicitly via effective first-order dilution reactions. In particular we find that the Fano factor of protein numbers calculated from a population snapshot measurement are underestimated by the classical model whereas the correlation between mRNA and protein can be either over- or underestimated, depending on the timescales of mRNA degradation and promoter switching relative to the mean cell-cycle duration time.
Collapse
Affiliation(s)
- Yiling Wang
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education, East China University of Science and Technology, Shanghai 200237, China
| | - Zhenhua Yu
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education, East China University of Science and Technology, Shanghai 200237, China
| | - Ramon Grima
- School of Biological Sciences, The University of Edinburgh, Max Born Crescent, Edinburgh EH9 3BF, Scotland, United Kingdom
| | - Zhixing Cao
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education, East China University of Science and Technology, Shanghai 200237, China
- Department of Chemical Engineering, Queen's University, Kingston, Ontario K7L 3N6, Canada
| |
Collapse
|
12
|
Miles CE, McKinley SA, Ding F, Lehoucq RB. Inferring stochastic rates from heterogeneous snapshots of particle positions. ARXIV 2023:arXiv:2311.04880v1. [PMID: 37986720 PMCID: PMC10659442] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 11/22/2023]
Abstract
Many imaging techniques for biological systems - like fixation of cells coupled with fluorescence microscopy - provide sharp spatial resolution in reporting locations of individuals at a single moment in time but also destroy the dynamics they intend to capture. These snapshot observations contain no information about individual trajectories, but still encode information about movement and demographic dynamics, especially when combined with a well-motivated biophysical model. The relationship between spatially evolving populations and single-moment representations of their collective locations is well-established with partial differential equations (PDEs) and their inverse problems. However, experimental data is commonly a set of locations whose number is insufficient to approximate a continuous-in-space PDE solution. Here, motivated by popular subcellular imaging data of gene expression, we embrace the stochastic nature of the data and investigate the mathematical foundations of parametrically inferring demographic rates from snapshots of particles undergoing birth, diffusion, and death in a nuclear or cellular domain. Toward inference, we rigorously derive a connection between individual particle paths and their presentation as a Poisson spatial process. Using this framework, we investigate the properties of the resulting inverse problem and study factors that affect quality of inference. One pervasive feature of this experimental regime is the presence of cell-to-cell heterogeneity. Rather than being a hindrance, we show that cell-to-cell geometric heterogeneity can increase the quality of inference on dynamics for certain parameter regimes. Altogether, the results serve as a basis for more detailed investigations of subcellular spatial patterns of RNA molecules and other stochastically evolving populations that can only be observed for single instants in their time evolution.
Collapse
Affiliation(s)
| | | | - Fangyuan Ding
- Department of Biomedical Engineering, University of California, Irvine
| | | |
Collapse
|
13
|
Wang X, Li Y, Jia C. Poisson representation: a bridge between discrete and continuous models of stochastic gene regulatory networks. J R Soc Interface 2023; 20:20230467. [PMID: 38016635 PMCID: PMC10684348 DOI: 10.1098/rsif.2023.0467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 11/08/2023] [Indexed: 11/30/2023] Open
Abstract
Stochastic gene expression dynamics can be modelled either discretely or continuously. Previous studies have shown that the mRNA or protein number distributions of some simple discrete and continuous gene expression models are related by Gardiner's Poisson representation. Here, we systematically investigate the Poisson representation in complex stochastic gene regulatory networks. We show that when the gene of interest is unregulated, the discrete and continuous descriptions of stochastic gene expression are always related by the Poisson representation, no matter how complex the model is. This generalizes the results obtained in Dattani & Barahona (Dattani & Barahona 2017 J. R. Soc. Interface 14, 20160833 (doi:10.1098/rsif.2016.0833)). In addition, using a simple counter-example, we find that the Poisson representation in general fails to link the two descriptions when the gene is regulated. However, for a general stochastic gene regulatory network, we demonstrate that the discrete and continuous models are approximately related by the Poisson representation in the limit of large protein numbers. These theoretical results are further applied to analytically solve many complex gene expression models whose exact distributions are previously unknown.
Collapse
Affiliation(s)
- Xinyu Wang
- Applied and Computational Mathematics Division, Beijing Computational Science Research Center, Beijing 100193, People’s Republic of China
| | - Youming Li
- School of Mathematical Sciences, University of Electronic Science and Technology of China, Chengdu 611731, People’s Republic of China
| | - Chen Jia
- Applied and Computational Mathematics Division, Beijing Computational Science Research Center, Beijing 100193, People’s Republic of China
| |
Collapse
|
14
|
Shi C, Yang X, Zhang J, Zhou T. Stochastic modeling of the mRNA life process: A generalized master equation. Biophys J 2023; 122:4023-4041. [PMID: 37653725 PMCID: PMC10598292 DOI: 10.1016/j.bpj.2023.08.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 06/29/2023] [Accepted: 08/29/2023] [Indexed: 09/02/2023] Open
Abstract
The mRNA life cycle is a complex biochemical process, involving transcription initiation, elongation, termination, splicing, and degradation. Each of these molecular events is multistep and can create a memory. The effect of this molecular memory on gene expression is not clear, although there are many related yet scattered experimental reports. To address this important issue, we develop a general theoretical framework formulated as a master equation in the sense of queue theory, which can reduce to multiple previously studied gene models in limiting cases. This framework allows us to interpret experimental observations, extract kinetic parameters from experimental data, and identify how the mRNA kinetics vary under regulatory influences. Notably, it allows us to evaluate the influences of elongation processes on mature RNA distribution; e.g., we find that the non-exponential elongation time can induce the bimodal mRNA expression and there is an optimal elongation noise intensity such that the mature RNA noise achieves the lowest level. In a word, our framework can not only provide insight into complex mRNA life processes but also bridge a dialogue between theoretical studies and experimental data.
Collapse
Affiliation(s)
- Changhong Shi
- State Key Laboratory of Respiratory Disease, School of Public Health, Guangzhou Medical University, Guangzhou, China
| | - Xiyan Yang
- School of Financial Mathematics and Statistics, Guangdong University of Finance, Guangzhou, China
| | - Jiajun Zhang
- School of Mathematics and Computational Science and Guangdong Province Key Laboratory of Computational Science, Sun Yat-Sen University, Guangzhou, China.
| | - Tianshou Zhou
- School of Mathematics and Computational Science and Guangdong Province Key Laboratory of Computational Science, Sun Yat-Sen University, Guangzhou, China.
| |
Collapse
|
15
|
Szavits-Nossan J, Grima R. Uncovering the effect of RNA polymerase steric interactions on gene expression noise: Analytical distributions of nascent and mature RNA numbers. Phys Rev E 2023; 108:034405. [PMID: 37849194 DOI: 10.1103/physreve.108.034405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 08/24/2023] [Indexed: 10/19/2023]
Abstract
The telegraph model is the standard model of stochastic gene expression, which can be solved exactly to obtain the distribution of mature RNA numbers per cell. A modification of this model also leads to an analytical distribution of nascent RNA numbers. These solutions are routinely used for the analysis of single-cell data, including the inference of transcriptional parameters. However, these models neglect important mechanistic features of transcription elongation, such as the stochastic movement of RNA polymerases and their steric (excluded-volume) interactions. Here we construct a model of gene expression describing promoter switching between inactive and active states, binding of RNA polymerases in the active state, their stochastic movement including steric interactions along the gene, and their unbinding leading to a mature transcript that subsequently decays. We derive the steady-state distributions of the nascent and mature RNA numbers in two important limiting cases: constitutive expression and slow promoter switching. We show that RNA fluctuations are suppressed by steric interactions between RNA polymerases, and that this suppression can in some instances even lead to sub-Poissonian fluctuations; these effects are most pronounced for nascent RNA and less prominent for mature RNA, since the latter is not a direct sensor of transcription. We find a relationship between the parameters of our microscopic mechanistic model and those of the standard models that ensures excellent consistency in their prediction of the first and second RNA number moments over vast regions of parameter space, encompassing slow, intermediate, and rapid promoter switching, provided the RNA number distributions are Poissonian or super-Poissonian. Furthermore, we identify the limitations of inference from mature RNA data, specifically showing that it cannot differentiate between highly distinct RNA polymerase traffic patterns on a gene.
Collapse
Affiliation(s)
- Juraj Szavits-Nossan
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, United Kingdom
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, United Kingdom
| |
Collapse
|
16
|
Gao NP, Gandrillon O, Páldi A, Herbach U, Gunawan R. Single-cell transcriptional uncertainty landscape of cell differentiation. F1000Res 2023; 12:426. [PMID: 37545651 PMCID: PMC10400935 DOI: 10.12688/f1000research.131861.2] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/18/2023] [Indexed: 08/08/2023] Open
Abstract
Background: Single-cell studies have demonstrated the presence of significant cell-to-cell heterogeneity in gene expression. Whether such heterogeneity is only a bystander or has a functional role in the cell differentiation process is still hotly debated. Methods: In this study, we quantified and followed single-cell transcriptional uncertainty - a measure of gene transcriptional stochasticity in single cells - in 10 cell differentiation systems of varying cell lineage progressions, from single to multi-branching trajectories, using the stochastic two-state gene transcription model. Results: By visualizing the transcriptional uncertainty as a landscape over a two-dimensional representation of the single-cell gene expression data, we observed universal features in the cell differentiation trajectories that include: (i) a peak in single-cell uncertainty during transition states, and in systems with bifurcating differentiation trajectories, each branching point represents a state of high transcriptional uncertainty; (ii) a positive correlation of transcriptional uncertainty with transcriptional burst size and frequency; (iii) an increase in RNA velocity preceding the increase in the cell transcriptional uncertainty. Conclusions: Our findings suggest a possible universal mechanism during the cell differentiation process, in which stem cells engage stochastic exploratory dynamics of gene expression at the start of the cell differentiation by increasing gene transcriptional bursts, and disengage such dynamics once cells have decided on a particular terminal cell identity. Notably, the peak of single-cell transcriptional uncertainty signifies the decision-making point in the cell differentiation process.
Collapse
Affiliation(s)
- Nan Papili Gao
- Institute for Chemical and Bioengineering, ETH Zurich, Zurich, Zurich, 8093, Switzerland
| | - Olivier Gandrillon
- Laboratoire de Biologie et Modélisation de la Cellule, École Normale Supérieure de Lyon, CNRS, Université Claude Bernard Lyon 1, F69364, France
- Équipe Dracula, Inria Center Lyon, Villeurbanne, F69100, France
| | - András Páldi
- St-Antoine Research Center, Ecole Pratique des Hautes Etudes PSL, Paris, F-75012, France
| | - Ulysse Herbach
- CNRS, Inria, IECL, Université de Lorraine, Nancy, F-54000, France
| | - Rudiyanto Gunawan
- Institute for Chemical and Bioengineering, ETH Zurich, Zurich, Zurich, 8093, Switzerland
- Department of Chemical and Biological Engineering, University at Buffalo - SUNY, Buffalo, NY, 14260, USA
| |
Collapse
|
17
|
Karamched BR, Miles CE. Stochastic switching of delayed feedback suppresses oscillations in genetic regulatory systems. J R Soc Interface 2023; 20:20230059. [PMID: 37376870 DOI: 10.1098/rsif.2023.0059] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Accepted: 06/06/2023] [Indexed: 06/29/2023] Open
Abstract
Delays and stochasticity have both served as crucially valuable ingredients in mathematical descriptions of control, physical and biological systems. In this work, we investigate how explicitly dynamical stochasticity in delays modulates the effect of delayed feedback. To do so, we consider a hybrid model where stochastic delays evolve by a continuous-time Markov chain, and between switching events, the system of interest evolves via a deterministic delay equation. Our main contribution is the calculation of an effective delay equation in the fast switching limit. This effective equation maintains the influence of all subsystem delays and cannot be replaced with a single effective delay. To illustrate the relevance of this calculation, we investigate a simple model of stochastically switching delayed feedback motivated by gene regulation. We show that sufficiently fast switching between two oscillatory subsystems can yield stable dynamics.
Collapse
Affiliation(s)
- Bhargav R Karamched
- Department of Mathematics, Florida State University, Tallahassee, FL 32304, USA
- Institute of Molecular Biophysics, Florida State University, Tallahassee, FL 32304, USA
- Program in Neuroscience, Florida State University, Tallahassee, FL 32304, USA
| | | |
Collapse
|
18
|
Vo HD, Forero-Quintero LS, Aguilera LU, Munsky B. Analysis and design of single-cell experiments to harvest fluctuation information while rejecting measurement noise. Front Cell Dev Biol 2023; 11:1133994. [PMID: 37305680 PMCID: PMC10250612 DOI: 10.3389/fcell.2023.1133994] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Accepted: 05/10/2023] [Indexed: 06/13/2023] Open
Abstract
Introduction: Despite continued technological improvements, measurement errors always reduce or distort the information that any real experiment can provide to quantify cellular dynamics. This problem is particularly serious for cell signaling studies to quantify heterogeneity in single-cell gene regulation, where important RNA and protein copy numbers are themselves subject to the inherently random fluctuations of biochemical reactions. Until now, it has not been clear how measurement noise should be managed in addition to other experiment design variables (e.g., sampling size, measurement times, or perturbation levels) to ensure that collected data will provide useful insights on signaling or gene expression mechanisms of interest. Methods: We propose a computational framework that takes explicit consideration of measurement errors to analyze single-cell observations, and we derive Fisher Information Matrix (FIM)-based criteria to quantify the information value of distorted experiments. Results and Discussion: We apply this framework to analyze multiple models in the context of simulated and experimental single-cell data for a reporter gene controlled by an HIV promoter. We show that the proposed approach quantitatively predicts how different types of measurement distortions affect the accuracy and precision of model identification, and we demonstrate that the effects of these distortions can be mitigated through explicit consideration during model inference. We conclude that this reformulation of the FIM could be used effectively to design single-cell experiments to optimally harvest fluctuation information while mitigating the effects of image distortion.
Collapse
Affiliation(s)
- Huy D. Vo
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO, United States
| | - Linda S. Forero-Quintero
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO, United States
| | - Luis U. Aguilera
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO, United States
| | - Brian Munsky
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO, United States
- School of Biomedical Engineering, Colorado State University, Fort Collins, CO, United States
| |
Collapse
|
19
|
Gautam P, Sinha SK. Theoretical investigation of functional responses of bio-molecular assembly networks. SOFT MATTER 2023; 19:3803-3817. [PMID: 37191191 DOI: 10.1039/d2sm01530g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/17/2023]
Abstract
Cooperative protein-protein and protein-DNA interactions form programmable complex assemblies, often performing non-linear gene regulatory operations involved in signal transductions and cell fate determination. The apparent structure of those complex assemblies is very similar, but their functional response strongly depends on the topology of the protein-DNA interaction networks. Here, we demonstrate how the coordinated self-assembly creates gene regulatory network motifs that corroborate the existence of a precise functional response at the molecular level using thermodynamic and dynamic analyses. Our theoretical and Monte Carlo simulations show that a complex network of interactions can form a decision-making loop, such as feedback and feed-forward circuits, only by a few molecular mechanisms. We characterize each possible network of interactions by systematic variations of free energy parameters associated with the binding among biomolecules and DNA looping. We also find that the higher-order networks exhibit alternative steady states from the stochastic dynamics of each network. We capture this signature by calculating stochastic potentials and attributing their multi-stability features. We validate our findings against the Gal promoter system in yeast cells. Overall, we show that the network topology is vital in phenotype diversity in regulatory circuits.
Collapse
Affiliation(s)
- Pankaj Gautam
- Theoretical and Computational Biophysical Chemistry Group, Department of Chemistry, Indian Institute of Technology Ropar, Rupnagar, Punjab 140001, India.
| | - Sudipta Kumar Sinha
- Theoretical and Computational Biophysical Chemistry Group, Department of Chemistry, Indian Institute of Technology Ropar, Rupnagar, Punjab 140001, India.
| |
Collapse
|
20
|
Wang Y, He S. Inference on autoregulation in gene expression with variance-to-mean ratio. J Math Biol 2023; 86:87. [PMID: 37131095 PMCID: PMC10154285 DOI: 10.1007/s00285-023-01924-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 04/14/2023] [Accepted: 04/18/2023] [Indexed: 05/04/2023]
Abstract
Some genes can promote or repress their own expressions, which is called autoregulation. Although gene regulation is a central topic in biology, autoregulation is much less studied. In general, it is extremely difficult to determine the existence of autoregulation with direct biochemical approaches. Nevertheless, some papers have observed that certain types of autoregulations are linked to noise levels in gene expression. We generalize these results by two propositions on discrete-state continuous-time Markov chains. These two propositions form a simple but robust method to infer the existence of autoregulation from gene expression data. This method only needs to compare the mean and variance of the gene expression level. Compared to other methods for inferring autoregulation, our method only requires non-interventional one-time data, and does not need to estimate parameters. Besides, our method has few restrictions on the model. We apply this method to four groups of experimental data and find some genes that might have autoregulation. Some inferred autoregulations have been verified by experiments or other theoretical works.
Collapse
Affiliation(s)
- Yue Wang
- Department of Computational Medicine, University of California, Los Angeles, CA, 90095, USA.
- Institut des Hautes Études Scientifiques (IHÉS), Bures-sur-Yvette, 91440, Essonne, France.
| | - Siqi He
- Simons Center for Geometry and Physics, Stony Brook University, Stony Brook, NY, 11794, USA
| |
Collapse
|
21
|
Carilli M, Gorin G, Choi Y, Chari T, Pachter L. Biophysical modeling with variational autoencoders for bimodal, single-cell RNA sequencing data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.13.523995. [PMID: 36712140 PMCID: PMC9882246 DOI: 10.1101/2023.01.13.523995] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
We motivate and present biVI, which combines the variational autoencoder framework of scVI with biophysically motivated, bivariate models for nascent and mature RNA distributions. While previous approaches to integrate bimodal data via the variational autoencoder framework ignore the causal relationship between measurements, biVI models the biophysical processes that give rise to observations. We demonstrate through simulated benchmarking that biVI captures cell type structure in a low-dimensional space and accurately recapitulates parameter values and copy number distributions. On biological data, biVI provides a scalable route for identifying the biophysical mechanisms underlying gene expression. This analytical approach outlines a generalizable strategy for treating multimodal datasets generated by high-throughput, single-cell genomic assays.
Collapse
Affiliation(s)
- Maria Carilli
- Division of Biology and Biological Engineering, California Institute of Technology
| | - Gennady Gorin
- Division of Chemistry and Chemical Engineering, California Institute of Technology
| | - Yongin Choi
- Biomedical Engineering Graduate Group, University of California, Davis
- Genome Center, University of California, Davis
| | - Tara Chari
- Division of Biology and Biological Engineering, California Institute of Technology
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology
- Department of Computing and Mathematical Sciences, California Institute of Technology
| |
Collapse
|
22
|
Fralix B, Holmes M, Löpker A. A Markovian arrival stream approach to stochastic gene expression in cells. J Math Biol 2023; 86:79. [PMID: 37086292 DOI: 10.1007/s00285-023-01913-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2022] [Revised: 12/22/2022] [Accepted: 03/31/2023] [Indexed: 04/23/2023]
Abstract
We analyse a generalisation of the stochastic gene expression model studied recently in Fromion et al. (SIAM J Appl Math 73:195-211, 2013) and Robert (Probab Surv 16:277-332, 2019) that keeps track of the production of both mRNA and protein molecules, using techniques from the theory of point processes, as well as ideas from the theory of matrix-analytic methods. Here, both the activity of a gene and the creation of mRNA are modelled with an arbitrary Markovian Arrival Process governed by finitely many phases, and each mRNA molecule during its lifetime gives rise to protein molecules in accordance with a Poisson process. This modification is important, as Markovian Arrival Processes can be used to approximate many types of point processes on the nonnegative real line, meaning this framework allows us to further relax our assumptions on the overall process of transcription.
Collapse
Affiliation(s)
- Brian Fralix
- School of Mathematical and Statistical Sciences, Clemson University, Clemson, USA.
| | - Mark Holmes
- School of Mathematics and Statistics, The University of Melbourne, Melbourne, Australia
| | - Andreas Löpker
- Department of Computer Science and Mathematics, HTW Dresden, University of Applied Sciences, Dresden, Germany
| |
Collapse
|
23
|
Wang Y, He S. Using Fano factors to determine certain types of gene autoregulation. ARXIV 2023:arXiv:2301.06692v2. [PMID: 36713249 PMCID: PMC9882590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The expression of one gene might be regulated by its corresponding protein, which is called autoregulation. Although gene regulation is a central topic in biology, autoregulation is much less studied. In general, it is extremely difficult to determine the existence of autoregulation with direct biochemical approaches. Nevertheless, some papers have observed that certain types of autoregulations are linked to noise levels in gene expression. We generalize these results by two propositions on discrete-state continuous-time Markov chains. These two propositions form a simple but robust method to infer the existence of autoregulation in certain scenarios from gene expression data. This method only depends on the Fano factor, namely the ratio of variance and mean of the gene expression level. Compared to other methods for inferring autoregulation, our method only requires non-interventional one-time data, and does not need to estimate parameters. Besides, our method has few restrictions on the model. We apply this method to four groups of experimental data and find some genes that might have autoregulation. Some inferred autoregulations have been verified by experiments or other theoretical works.
Collapse
Affiliation(s)
- Yue Wang
- Department of Computational Medicine, University of California, Los Angeles, California, United States of America
- Institut des Hautes Études Scientifiques, Bures-sur-Yvette, Essonne, France
| | - Siqi He
- Simons Center for Geometry and Physics, Stony Brook University, Stony Brook, New York, United States of America
| |
Collapse
|
24
|
Tsakiroglou M, Evans A, Pirmohamed M. Leveraging transcriptomics for precision diagnosis: Lessons learned from cancer and sepsis. Front Genet 2023; 14:1100352. [PMID: 36968610 PMCID: PMC10036914 DOI: 10.3389/fgene.2023.1100352] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 02/20/2023] [Indexed: 03/12/2023] Open
Abstract
Diagnostics require precision and predictive ability to be clinically useful. Integration of multi-omic with clinical data is crucial to our understanding of disease pathogenesis and diagnosis. However, interpretation of overwhelming amounts of information at the individual level requires sophisticated computational tools for extraction of clinically meaningful outputs. Moreover, evolution of technical and analytical methods often outpaces standardisation strategies. RNA is the most dynamic component of all -omics technologies carrying an abundance of regulatory information that is least harnessed for use in clinical diagnostics. Gene expression-based tests capture genetic and non-genetic heterogeneity and have been implemented in certain diseases. For example patients with early breast cancer are spared toxic unnecessary treatments with scores based on the expression of a set of genes (e.g., Oncotype DX). The ability of transcriptomics to portray the transcriptional status at a moment in time has also been used in diagnosis of dynamic diseases such as sepsis. Gene expression profiles identify endotypes in sepsis patients with prognostic value and a potential to discriminate between viral and bacterial infection. The application of transcriptomics for patient stratification in clinical environments and clinical trials thus holds promise. In this review, we discuss the current clinical application in the fields of cancer and infection. We use these paradigms to highlight the impediments in identifying useful diagnostic and prognostic biomarkers and propose approaches to overcome them and aid efforts towards clinical implementation.
Collapse
Affiliation(s)
- Maria Tsakiroglou
- Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, United Kingdom
- *Correspondence: Maria Tsakiroglou,
| | - Anthony Evans
- Computational Biology Facility, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, United Kingdom
| | - Munir Pirmohamed
- Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, United Kingdom
| |
Collapse
|
25
|
Fan R, Hilfinger A. The effect of microRNA on protein variability and gene expression fidelity. Biophys J 2023; 122:905-923. [PMID: 36698314 PMCID: PMC10027439 DOI: 10.1016/j.bpj.2023.01.027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2022] [Revised: 12/23/2022] [Accepted: 01/20/2023] [Indexed: 01/27/2023] Open
Abstract
Small regulatory RNA molecules such as microRNA modulate gene expression through inhibiting the translation of messenger RNA (mRNA). Such posttranscriptional regulation has been recently hypothesized to reduce the stochastic variability of gene expression around average levels. Here, we quantify noise in stochastic gene expression models with and without such regulation. Our results suggest that silencing mRNA posttranscriptionally will always increase, rather than decrease, gene expression noise when the silencing of mRNA also increases its degradation, as is expected for microRNA interactions with mRNA. In that regime, we also find that silencing mRNA generally reduces the fidelity of signal transmission from deterministically varying upstream factors to protein levels. These findings suggest that microRNA binding to mRNA does not generically confer precision to protein expression.
Collapse
Affiliation(s)
- Raymond Fan
- Department of Physics, University of Toronto, Toronto, Ontario, Canada; Department of Chemical & Physical Sciences, University of Toronto, Mississauga, Ontario, Canada.
| | - Andreas Hilfinger
- Department of Physics, University of Toronto, Toronto, Ontario, Canada; Department of Chemical & Physical Sciences, University of Toronto, Mississauga, Ontario, Canada; Department of Cell & Systems Biology, University of Toronto, , Toronto, Ontario, Canada; Department of Mathematics, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
26
|
Kilic Z, Schweiger M, Moyer C, Shepherd D, Pressé S. Gene expression model inference from snapshot RNA data using Bayesian non-parametrics. NATURE COMPUTATIONAL SCIENCE 2023; 3:174-183. [PMID: 38125199 PMCID: PMC10732567 DOI: 10.1038/s43588-022-00392-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 12/15/2022] [Indexed: 12/23/2023]
Abstract
Gene expression models, which are key towards understanding cellular regulatory response, underlie observations of single-cell transcriptional dynamics. Although RNA expression data encode information on gene expression models, existing computational frameworks do not perform simultaneous Bayesian inference of gene expression models and parameters from such data. Rather, gene expression models-composed of gene states, their connectivities and associated parameters-are currently deduced by pre-specifying gene state numbers and connectivity before learning associated rate parameters. Here we propose a method to learn full distributions over gene states, state connectivities and associated rate parameters, simultaneously and self-consistently from single-molecule RNA counts. We propagate noise from fluctuating RNA counts over models by treating models themselves as random variables. We achieve this within a Bayesian non-parametric paradigm. We demonstrate our method on the Escherichia coli lacZ pathway and the Saccharomyces cerevisiae STL1 pathway, and verify its robustness on synthetic data.
Collapse
Affiliation(s)
- Zeliha Kilic
- Department of Structural Biology, St. Jude Children’s Research Hospital, Memphis, TN, USA
- These authors contributed equally: Zeliha Kilic, Max Schweiger
| | - Max Schweiger
- Center for Biological Physics, ASU, Tempe, AZ, USA
- Department of Physics, ASU, Tempe, AZ, USA
- These authors contributed equally: Zeliha Kilic, Max Schweiger
| | - Camille Moyer
- Center for Biological Physics, ASU, Tempe, AZ, USA
- School of Mathematics and Statistical Sciences, ASU, Tempe, AZ, USA
| | - Douglas Shepherd
- Center for Biological Physics, ASU, Tempe, AZ, USA
- Department of Physics, ASU, Tempe, AZ, USA
| | - Steve Pressé
- Center for Biological Physics, ASU, Tempe, AZ, USA
- Department of Physics, ASU, Tempe, AZ, USA
- School of Molecular Sciences, ASU, Tempe, AZ, USA
| |
Collapse
|
27
|
A dynamical stochastic model of yeast translation across the cell cycle. Heliyon 2023; 9:e13101. [PMID: 36793957 PMCID: PMC9922973 DOI: 10.1016/j.heliyon.2023.e13101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Revised: 01/04/2023] [Accepted: 01/16/2023] [Indexed: 01/27/2023] Open
Abstract
Translation is a central step in gene expression, however its quantitative and time-resolved regulation is poorly understood. We developed a discrete, stochastic model for protein translation in S. cerevisiae in a whole-transcriptome, single-cell context. A "base case" scenario representing an average cell highlights translation initiation rates as the main co-translational regulatory parameters. Codon usage bias emerges as a secondary regulatory mechanism through ribosome stalling. Demand for anticodons with low abundancy is shown to cause above-average ribosome dwelling times. Codon usage bias correlates strongly both with protein synthesis rates and elongation rates. Applying the model to a time-resolved transcriptome estimated by combining data from FISH and RNA-Seq experiments, it could be shown that increased total transcript abundance during the cell cycle decreases translation efficiency at single transcript level. Translation efficiency grouped by gene function shows highest values for ribosomal and glycolytic genes. Ribosomal proteins peak in S phase while glycolytic proteins rank highest in later cell cycle phases.
Collapse
|
28
|
Ling MY, Chiu LJ, Hsieh CC, Shu CC. Dimerization induces bimodality in protein number distributions. Biosystems 2023; 223:104812. [PMID: 36427705 DOI: 10.1016/j.biosystems.2022.104812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Revised: 11/10/2022] [Accepted: 11/10/2022] [Indexed: 11/26/2022]
Abstract
We examined gene expression with DNA switching between two states, active and inactive. Subpopulations emerge from mechanisms that do not arise from trivial transcriptional heterogeneity. Although the RNA demonstrates a unimodal distribution, dimerization intriguingly causes protein bimodality. No control loop or deterministic bistability are present. In such a situation, increasing the degradation rate of the protein does not lead to bimodality. The bimodality is achieved through the interplay between the protein monomer and the formation of protein dimer. We applied Stochastic Simulation Algorithm (SSA) and found that cells spontaneously change states at the protein level. While sweeping parameters, decreasing the rate constant of dimerization severely impairs the bimodality. We also examined the influence of DNA switching. To have bimodality, the system requires a proper ratio of DNA in the active state to the inactive state. In addition to bimodality of the monomer, tetramerization also causes the bimodality of the dimer.
Collapse
Affiliation(s)
- Ming-Yang Ling
- Department of Chemical Engineering and Biotechnology, National Taipei University of Technology, Taipei City, Taiwan
| | - Lin-Jie Chiu
- Department of Chemical Engineering and Biotechnology, National Taipei University of Technology, Taipei City, Taiwan
| | - Ching-Chu Hsieh
- Department of Chemical Engineering and Biotechnology, National Taipei University of Technology, Taipei City, Taiwan
| | - Che-Chi Shu
- Department of Chemical Engineering and Biotechnology, National Taipei University of Technology, Taipei City, Taiwan.
| |
Collapse
|
29
|
Luo S, Wang Z, Zhang Z, Zhou T, Zhang J. Genome-wide inference reveals that feedback regulations constrain promoter-dependent transcriptional burst kinetics. Nucleic Acids Res 2022; 51:68-83. [PMID: 36583343 PMCID: PMC9874261 DOI: 10.1093/nar/gkac1204] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 11/06/2022] [Accepted: 12/06/2022] [Indexed: 12/31/2022] Open
Abstract
Gene expression in mammalian cells is highly variable and episodic, resulting in a series of discontinuous bursts of mRNAs. A challenge is to understand how static promoter architecture and dynamic feedback regulations dictate bursting on a genome-wide scale. Although single-cell RNA sequencing (scRNA-seq) provides an opportunity to address this challenge, effective analytical methods are scarce. We developed an interpretable and scalable inference framework, which combined experimental data with a mechanistic model to infer transcriptional burst kinetics (sizes and frequencies) and feedback regulations. Applying this framework to scRNA-seq data generated from embryonic mouse fibroblast cells, we found Simpson's paradoxes, i.e. genome-wide burst kinetics exhibit different characteristics in two cases without and with distinguishing feedback regulations. We also showed that feedbacks differently modulate burst frequencies and sizes and conceal the effects of transcription start site distributions on burst kinetics. Notably, only in the presence of positive feedback, TATA genes are expressed with high burst frequencies and enhancer-promoter interactions mainly modulate burst frequencies. The developed inference method provided a flexible and efficient way to investigate transcriptional burst kinetics and the obtained results would be helpful for understanding cell development and fate decision.
Collapse
Affiliation(s)
| | | | - Zhenquan Zhang
- Guangdong Province Key Laboratory of Computational Science, Sun Yat-sen University, Guangzhou, 510275, P. R. China,School of Mathematics, Sun Yat-sen University, Guangzhou, Guangdong Province, 510275, P. R. China
| | - Tianshou Zhou
- Correspondence may also be addressed to Tianshou Zhou. Tel: +86 20 84134958;
| | - Jiajun Zhang
- To whom correspondence should be addressed. Tel: +86 20 84111829;
| |
Collapse
|
30
|
Zhdanov VP. Interplay of Cellular mRNA, miRNA and Viral miRNA during Infection of a Cell. Int J Mol Sci 2022; 24:ijms24010122. [PMID: 36613566 PMCID: PMC9820072 DOI: 10.3390/ijms24010122] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2022] [Revised: 11/29/2022] [Accepted: 12/15/2022] [Indexed: 12/24/2022] Open
Abstract
The understanding of the kinetics of gene expression in cells infected by viruses is currently limited. As a rule, the corresponding models do not take viral microRNAs (miRNAs) into account. Such RNAs are, however, operative during the replication of some viruses, including, e.g., herpesvirus. To clarify the kinetics of this category (with emphasis on the information available for herpesvirus), I introduce a generic model describing the transient interplay of cellular mRNA, protein, miRNA and viral miRNA. In the absence of viral miRNA, the cellular miRNA is considered to suppress the populations of mRNA and protein due to association with mRNA and subsequent degradation. During infection, the viral miRNA suppresses the population of cellular miRNA and via this pathway makes the mRNA and protein populations larger. This effect becomes appreciable with the progress of intracellular viral replication. Using biologically reasonable parameters, I investigate the corresponding mean-field kinetics and show the scale of the effect of viral miRNAs on cellular miRNA and mRNA. The scale of fluctuations of the populations of these species is illustrated as well by employing Monte Carlo simulations.
Collapse
Affiliation(s)
- Vladimir P Zhdanov
- Boreskov Institute of Catalysis, Russian Academy of Sciences, Novosibirsk 630090, Russia
| |
Collapse
|
31
|
Gorin G, Vastola JJ, Fang M, Pachter L. Interpretable and tractable models of transcriptional noise for the rational design of single-molecule quantification experiments. Nat Commun 2022; 13:7620. [PMID: 36494337 PMCID: PMC9734650 DOI: 10.1038/s41467-022-34857-7] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Accepted: 11/09/2022] [Indexed: 12/13/2022] Open
Abstract
The question of how cell-to-cell differences in transcription rate affect RNA count distributions is fundamental for understanding biological processes underlying transcription. Answering this question requires quantitative models that are both interpretable (describing concrete biophysical phenomena) and tractable (amenable to mathematical analysis). This enables the identification of experiments which best discriminate between competing hypotheses. As a proof of principle, we introduce a simple but flexible class of models involving a continuous stochastic transcription rate driving a discrete RNA transcription and splicing process, and compare and contrast two biologically plausible hypotheses about transcription rate variation. One assumes variation is due to DNA experiencing mechanical strain, while the other assumes it is due to regulator number fluctuations. We introduce a framework for numerically and analytically studying such models, and apply Bayesian model selection to identify candidate genes that show signatures of each model in single-cell transcriptomic data from mouse glutamatergic neurons.
Collapse
Affiliation(s)
- Gennady Gorin
- grid.20861.3d0000000107068890Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125 USA
| | - John J. Vastola
- grid.38142.3c000000041936754XDepartment of Neurobiology, Harvard Medical School, Boston, MA 02115 USA
| | - Meichen Fang
- grid.20861.3d0000000107068890Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125 USA
| | - Lior Pachter
- grid.20861.3d0000000107068890Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125 USA ,grid.20861.3d0000000107068890Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA 91125 USA
| |
Collapse
|
32
|
Jia C, Grima R. Coupling gene expression dynamics to cell size dynamics and cell cycle events: Exact and approximate solutions of the extended telegraph model. iScience 2022; 26:105746. [PMID: 36619980 PMCID: PMC9813732 DOI: 10.1016/j.isci.2022.105746] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 11/02/2022] [Accepted: 12/02/2022] [Indexed: 12/12/2022] Open
Abstract
The standard model describing the fluctuations of mRNA numbers in single cells is the telegraph model which includes synthesis and degradation of mRNA, and switching of the gene between active and inactive states. While commonly used, this model does not describe how fluctuations are influenced by the cell cycle phase, cellular growth and division, and other crucial aspects of cellular biology. Here, we derive the analytical time-dependent solution of an extended telegraph model that explicitly considers the doubling of gene copy numbers upon DNA replication, dependence of the mRNA synthesis rate on cellular volume, gene dosage compensation, partitioning of molecules during cell division, cell-cycle duration variability, and cell-size control strategies. Based on the time-dependent solution, we obtain the analytical distributions of transcript numbers for lineage and population measurements in steady-state growth and also find a linear relation between the Fano factor of mRNA fluctuations and cell volume fluctuations. We show that generally the lineage and population distributions in steady-state growth cannot be accurately approximated by the steady-state solution of extrinsic noise models, i.e. a telegraph model with parameters drawn from probability distributions. This is because the mRNA lifetime is often not small enough compared to the cell cycle duration to erase the memory of division and replication. Accurate approximations are possible when this memory is weak, e.g. for genes with bursty expression and for which there is sufficient gene dosage compensation when replication occurs.
Collapse
Affiliation(s)
- Chen Jia
- Applied and Computational Mathematics Division, Beijing Computational Science Research Center, Beijing 100193, China
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, UK,Corresponding author
| |
Collapse
|
33
|
Concentration fluctuations in growing and dividing cells: Insights into the emergence of concentration homeostasis. PLoS Comput Biol 2022; 18:e1010574. [PMID: 36194626 PMCID: PMC9565450 DOI: 10.1371/journal.pcbi.1010574] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Revised: 10/14/2022] [Accepted: 09/14/2022] [Indexed: 11/19/2022] Open
Abstract
Intracellular reaction rates depend on concentrations and hence their levels are often regulated. However classical models of stochastic gene expression lack a cell size description and cannot be used to predict noise in concentrations. Here, we construct a model of gene product dynamics that includes a description of cell growth, cell division, size-dependent gene expression, gene dosage compensation, and size control mechanisms that can vary with the cell cycle phase. We obtain expressions for the approximate distributions and power spectra of concentration fluctuations which lead to insight into the emergence of concentration homeostasis. We find that (i) the conditions necessary to suppress cell division-induced concentration oscillations are difficult to achieve; (ii) mRNA concentration and number distributions can have different number of modes; (iii) two-layer size control strategies such as sizer-timer or adder-timer are ideal because they maintain constant mean concentrations whilst minimising concentration noise; (iv) accurate concentration homeostasis requires a fine tuning of dosage compensation, replication timing, and size-dependent gene expression; (v) deviations from perfect concentration homeostasis show up as deviations of the concentration distribution from a gamma distribution. Some of these predictions are confirmed using data for E. coli, fission yeast, and budding yeast.
Collapse
|
34
|
Gorin G, Fang M, Chari T, Pachter L. RNA velocity unraveled. PLoS Comput Biol 2022; 18:e1010492. [PMID: 36094956 PMCID: PMC9499228 DOI: 10.1371/journal.pcbi.1010492] [Citation(s) in RCA: 50] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Revised: 09/22/2022] [Accepted: 08/14/2022] [Indexed: 11/24/2022] Open
Abstract
We perform a thorough analysis of RNA velocity methods, with a view towards understanding the suitability of the various assumptions underlying popular implementations. In addition to providing a self-contained exposition of the underlying mathematics, we undertake simulations and perform controlled experiments on biological datasets to assess workflow sensitivity to parameter choices and underlying biology. Finally, we argue for a more rigorous approach to RNA velocity, and present a framework for Markovian analysis that points to directions for improvement and mitigation of current problems. Single-cell sequencing data are snapshots of biological processes, making it challenging to infer dynamic relationships between cell types. RNA velocity attempts to bypass this challenge by treating the unspliced RNA content as a proxy for spliced RNA content in the near future, and using this “extrapolation” to build directional relationships. However, the method, as implemented in several software packages, is not yet reliable enough to be actionable, in part due to the large number of arbitrary, user-set hyperparameters, as well as fundamental incompatibilities between the biophysics of transcription in the living cell and the models used throughout the velocity workflows. In this study, we review these issues, and use existing results from the fields of stochastic modeling and fluorescence transcriptomics to develop an alternative theoretical framework. We show that our framework can facilitate the development and inference of physically consistent models for sequencing data, as well as the unification of single-cell analyses to self-consistently treat variation due to cell type dynamics and identities, the stochasticity inherent to single-molecule processes, and the uncertainty introduced by sequencing experiments.
Collapse
Affiliation(s)
- Gennady Gorin
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California, United States of America
| | - Meichen Fang
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
| | - Tara Chari
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, California, United States of America
- * E-mail:
| |
Collapse
|
35
|
Sukys A, Öcal K, Grima R. Approximating Solutions of the Chemical Master Equation using Neural Networks. iScience 2022; 25:105010. [PMID: 36117994 PMCID: PMC9474291 DOI: 10.1016/j.isci.2022.105010] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Revised: 06/13/2022] [Accepted: 08/18/2022] [Indexed: 10/27/2022] Open
|
36
|
Dean J, Ganesh A. Noise dissipation in gene regulatory networks via second order statistics of networks of infinite server queues. J Math Biol 2022; 85:14. [DOI: 10.1007/s00285-022-01781-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 06/10/2022] [Accepted: 07/05/2022] [Indexed: 10/16/2022]
|
37
|
Öcal K, Gutmann MU, Sanguinetti G, Grima R. Inference and uncertainty quantification of stochastic gene expression via synthetic models. JOURNAL OF THE ROYAL SOCIETY, INTERFACE 2022; 19:20220153. [PMID: 35858045 PMCID: PMC9277240 DOI: 10.1098/rsif.2022.0153] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Estimating uncertainty in model predictions is a central task in quantitative biology. Biological models at the single-cell level are intrinsically stochastic and nonlinear, creating formidable challenges for their statistical estimation which inevitably has to rely on approximations that trade accuracy for tractability. Despite intensive interest, a sweet spot in this trade-off has not been found yet. We propose a flexible procedure for uncertainty quantification in a wide class of reaction networks describing stochastic gene expression including those with feedback. The method is based on creating a tractable coarse-graining of the model that is learned from simulations, a synthetic model, to approximate the likelihood function. We demonstrate that synthetic models can substantially outperform state-of-the-art approaches on a number of non-trivial systems and datasets, yielding an accurate and computationally viable solution to uncertainty quantification in stochastic models of gene expression.
Collapse
Affiliation(s)
- Kaan Öcal
- School of Informatics, University of Edinburgh, Edinburgh EH9 3JH, UK.,School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, UK
| | - Michael U Gutmann
- School of Informatics, University of Edinburgh, Edinburgh EH9 3JH, UK
| | - Guido Sanguinetti
- Scuola Internazionale Superiore di Studi Avanzati, 34136 Trieste, Italy
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, UK
| |
Collapse
|
38
|
Kolbe N, Hexemer L, Bammert LM, Loewer A, Lukáčová-Medvid’ová M, Legewie S. Data-based stochastic modeling reveals sources of activity bursts in single-cell TGF-β signaling. PLoS Comput Biol 2022; 18:e1010266. [PMID: 35759468 PMCID: PMC9269928 DOI: 10.1371/journal.pcbi.1010266] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 07/08/2022] [Accepted: 05/30/2022] [Indexed: 11/30/2022] Open
Abstract
Cells sense their surrounding by employing intracellular signaling pathways that transmit hormonal signals from the cell membrane to the nucleus. TGF-β/SMAD signaling encodes various cell fates, controls tissue homeostasis and is deregulated in diseases such as cancer. The pathway shows strong heterogeneity at the single-cell level, but quantitative insights into mechanisms underlying fluctuations at various time scales are still missing, partly due to inefficiency in the calibration of stochastic models that mechanistically describe signaling processes. In this work we analyze single-cell TGF-β/SMAD signaling and show that it exhibits temporal stochastic bursts which are dose-dependent and whose number and magnitude correlate with cell migration. We propose a stochastic modeling approach to mechanistically describe these pathway fluctuations with high computational efficiency. Employing high-order numerical integration and fitting to burst statistics we enable efficient quantitative parameter estimation and discriminate models that assume noise in different reactions at the receptor level. This modeling approach suggests that stochasticity in the internalization of TGF-β receptors into endosomes plays a key role in the observed temporal bursting. Further, the model predicts the single-cell dynamics of TGF-β/SMAD signaling in untested conditions, e.g., successfully reflects memory effects of signaling noise and cellular sensitivity towards repeated stimulation. Taken together, our computational framework based on burst analysis, noise modeling and path computation scheme is a suitable tool for the data-based modeling of complex signaling pathways, capable of identifying the source of temporal noise. Fluctuations in molecular networks give rise to heterogeneity in cellular behavior and therefore promote the diversification of tissues. For a better understanding of cellular decision making, it is important to identify sources of molecular fluctuations and to quantitatively describe them by predictive mathematical models. In this work, we focused on temporal fluctuations of the TGF-β signaling pathway that is important for controlling cell division and migration. We characterized a single-cell dataset comprising hundreds of cells using time series analysis and a large-scale stochastic model. By fitting several model variants to the data, we identified the stochastic internalization of cell surface receptors into endosomes as a main source of temporal fluctuations (’bursts’) in the signaling pathway. The corresponding model accurately predicted novel experimental data, and provided insights into the long-term memory of signaling fluctuations. In summary, we propose a modeling approach to quantitatively describe heterogeneous behavior in large-scale single-cell datasets and to identify the underlying biological mechanisms.
Collapse
Affiliation(s)
- Niklas Kolbe
- Institute of Geometry and Practical Mathematics, RWTH Aachen University, Aachen, Germany
- * E-mail: (NK); (SL)
| | - Lorenz Hexemer
- Institute of Molecular Biology (IMB), Mainz, Germany
- Department of Systems Biology, Institute for Biomedical Genetics (IBMG), University of Stuttgart, Stuttgart, Germany
| | | | - Alexander Loewer
- Systems Biology of the Stress Response, Department of Biology, Technical University of Darmstadt, Darmstadt, Germany
| | | | - Stefan Legewie
- Department of Systems Biology, Institute for Biomedical Genetics (IBMG), University of Stuttgart, Stuttgart, Germany
- Stuttgart Research Center for Systems Biology (SRCSB), University of Stuttgart, Stuttgart, Germany
- * E-mail: (NK); (SL)
| |
Collapse
|
39
|
Stochastic Transcription with Alterable Synthesis Rates. MATHEMATICS 2022. [DOI: 10.3390/math10132189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Background: Gene transcription is a random bursting process that leads to large variability in mRNA numbers in single cells. The main cause is largely attributed to random switching between periods of active and inactive gene transcription. In some experiments, it has been observed that variation in the number of active transcription sites causes the initiation rate to vary during elongation. Results: We established a mathematical model based on the molecular reaction mechanism in single cells and studied a stochastic transcription system consisting of two active states and one inactive state, in which mRNA molecules are produced with two different synthesis rates. Conclusions: By calculation, we obtained the average mRNA expression level, the noise strength, and the skewness of transcripts. We gave a necessary and sufficient condition that causes the average mRNA level to peak at a limited time. The model could help us to distinguish an appropriate mechanism that may be employed by cells to transcribe mRNA molecules. Our simulations were in agreement with some experimental data and showed that the skewness can measure the deviation of the distribution of transcripts from the mean value. Especially for mature mRNAs, their distributions were almost able to be determined by the mean, the noise (or the noise strength), and the skewness.
Collapse
|
40
|
Quantifying biochemical reaction rates from static population variability within incompletely observed complex networks. PLoS Comput Biol 2022; 18:e1010183. [PMID: 35731728 PMCID: PMC9216546 DOI: 10.1371/journal.pcbi.1010183] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Accepted: 05/07/2022] [Indexed: 11/19/2022] Open
Abstract
Quantifying biochemical reaction rates within complex cellular processes remains a key challenge of systems biology even as high-throughput single-cell data have become available to characterize snapshots of population variability. That is because complex systems with stochastic and non-linear interactions are difficult to analyze when not all components can be observed simultaneously and systems cannot be followed over time. Instead of using descriptive statistical models, we show that incompletely specified mechanistic models can be used to translate qualitative knowledge of interactions into reaction rate functions from covariability data between pairs of components. This promises to turn a globally intractable problem into a sequence of solvable inference problems to quantify complex interaction networks from incomplete snapshots of their stochastic fluctuations.
Collapse
|
41
|
Xu L, Zhong W, Lu J, Gao F, Qian F, Cao Z. Learning of Iterative Learning Control for Flexible Manufacturing of Batch Processes. ACS OMEGA 2022; 7:19939-19947. [PMID: 35721960 PMCID: PMC9202061 DOI: 10.1021/acsomega.2c01741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Accepted: 05/13/2022] [Indexed: 06/15/2023]
Abstract
Flexible manufacturing as an essential component of smart manufacturing implements the customized production mode, thereby requesting fast controller adaptation for producing different goods but still with high precision. This problem becomes even more acute for batch processes. Here we present a solution called learning of iterative learning control (ILC) based on neural networks. It is able to recommend control parameters for ILC controllers accordingly, so as to yield fast tracking error convergence and smaller steady-state error for disparate set-point profiles, which is deemed an abstraction of different production needs. The method substantially outperforms a benchmark ILC on a variety of systems and cases, thereby showing its potential for deployment in the industrial Internet of Things.
Collapse
Affiliation(s)
- Libin Xu
- MOE
Key Laboratory of Smart Manufacturing in Energy Chemical Process, East China University of Science and Technology, Shanghai 200237, China
| | - Weimin Zhong
- MOE
Key Laboratory of Smart Manufacturing in Energy Chemical Process, East China University of Science and Technology, Shanghai 200237, China
| | - Jingyi Lu
- MOE
Key Laboratory of Smart Manufacturing in Energy Chemical Process, East China University of Science and Technology, Shanghai 200237, China
- Department
of Electrical Engineering and Information Technology, Paderborn University, 33098, Paderborn, Germany
| | - Furong Gao
- Department
of Chemical and Biological Engineering, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong
| | - Feng Qian
- MOE
Key Laboratory of Smart Manufacturing in Energy Chemical Process, East China University of Science and Technology, Shanghai 200237, China
| | - Zhixing Cao
- MOE
Key Laboratory of Smart Manufacturing in Energy Chemical Process, East China University of Science and Technology, Shanghai 200237, China
| |
Collapse
|
42
|
Kynaston JC, Guiver C, Yates CA. Equivalence framework for an age-structured multistage representation of the cell cycle. Phys Rev E 2022; 105:064411. [PMID: 35854597 DOI: 10.1103/physreve.105.064411] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2021] [Accepted: 05/26/2022] [Indexed: 06/15/2023]
Abstract
We develop theoretical equivalences between stochastic and deterministic models for populations of individual cells stratified by age. Specifically, we develop a hierarchical system of equations describing the full dynamics of an age-structured multistage Markov process for approximating cell cycle time distributions. We further demonstrate that the resulting mean behavior is equivalent, over large timescales, to the classical McKendrick-von Foerster integropartial differential equation. We conclude by extending this framework to a spatial context, facilitating the modeling of traveling wave phenomena and cell-mediated pattern formation. More generally, this methodology may be extended to myriad reaction-diffusion processes for which the age of individuals is relevant to the dynamics.
Collapse
Affiliation(s)
- Joshua C Kynaston
- Department of Mathematical Sciences, University of Bath, Claverton Down, Bath BA2 7AY, United Kingdom
| | - Chris Guiver
- School of Engineering and The Built Environment, Edinburgh Napier University, 10 Colinton Road, Edinburgh EH10 5DT, United Kingdom
| | - Christian A Yates
- Department of Mathematical Sciences, University of Bath, Claverton Down, Bath BA2 7AY, United Kingdom
| |
Collapse
|
43
|
A Novel Dynamical Regulation of mRNA Distribution by Cross-Talking Pathways. MATHEMATICS 2022. [DOI: 10.3390/math10091515] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
In this paper, we use a similar approach to the one proposed by Chen and Jiao to calculate the mathematical formulas of the generating function V(z,t) and the mass function Pm(t) of a cross-talking pathways model in large parameter regions. Together with kinetic rates from yeast and mouse genes, our numerical examples reveal novel bimodal mRNA distributions for intermediate times, whereby the mode of distribution Pm(t) displays unimodality with the peak at m=0 for initial and long times, which has not been obtained in previous works. Such regulation of mRNA distribution exactly matches the transcriptional dynamics for the osmosensitive genes in Saccharomyces cerevisiae, which has not been generated by those models with one single pathway or feedback loops. This paper may provide us with a novel observation on transcriptional distribution dynamics regulated by multiple signaling pathways in response to environmental changes and genetic perturbations.
Collapse
|
44
|
Jiao F, Tang M. Quantification of transcription noise’s impact on cell fate commitment with digital resolutions. Bioinformatics 2022; 38:3062-3069. [DOI: 10.1093/bioinformatics/btac277] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2021] [Revised: 02/18/2022] [Accepted: 04/12/2022] [Indexed: 11/14/2022] Open
Abstract
Abstract
Motivation
Gene transcription is a random and noisy process. Tremendous efforts in single cell studies have been mapping transcription noises to phenotypic variabilities between isogenic cells. However, the exact role of the noise in cell fate commitment remains largely descriptive or even controversial.
Results
For a specified cell fate, we define the jumping digit I of a critical gene as a statistical threshold that a single cell has approximately an equal chance to commit the fate as to have at least I transcripts of the gene. When the transcription is perturbed by a noise enhancer without changing the basal transcription level E 0, we find a crossing digit k such that the noise catalyzes cell fate change when I > k while stabilizes the current state when I < k; k remains stable against enormous variations of kinetic rates. We further test the reactivation of latent HIV in 22 integration sites by noise enhancers paired with transcriptional activators. Strong synergistic actions are observed when the activators increase transcription burst frequency, whereas no synergism, but antagonism, is often observed if activators increase burst size. The synergistic efficiency can be predicted accurately by the ratio I / E0. When the noise enhancers double the noise, the activators double the burst frequency, and I / E0 ≥ 7, their combination is 10 times more effective than their additive effects across all 22 sites.
Availability and implementation
The jumping digit I may provide a novel probe to explore the phenotypic consequences of transcription noise in cell functions. Code is freely available at http://cam.gzhu.edu.cn/info/1014/1223.htm.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Feng Jiao
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, 510006, P. R. China
- College of Mathematics and Information Sciences, Guangzhou University, Guangzhou, 510006, P. R. China
| | - Moxun Tang
- Department of Mathematics, Michigan State University, East Lansing, MI 48824, USA
| |
Collapse
|
45
|
Filatova T, Popović N, Grima R. Modulation of nuclear and cytoplasmic mRNA fluctuations by time-dependent stimuli: Analytical distributions. Math Biosci 2022; 347:108828. [DOI: 10.1016/j.mbs.2022.108828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 04/15/2022] [Accepted: 04/15/2022] [Indexed: 10/18/2022]
|
46
|
Kim DW, Hong H, Kim JK. Systematic inference identifies a major source of heterogeneity in cell signaling dynamics: The rate-limiting step number. SCIENCE ADVANCES 2022; 8:eabl4598. [PMID: 35302852 PMCID: PMC8932658 DOI: 10.1126/sciadv.abl4598] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Accepted: 01/26/2022] [Indexed: 06/14/2023]
Abstract
Identifying the sources of cell-to-cell variability in signaling dynamics is essential to understand drug response variability and develop effective therapeutics. However, it is challenging because not all signaling intermediate reactions can be experimentally measured simultaneously. This can be overcome by replacing them with a single random time delay, but the resulting process is non-Markovian, making it difficult to infer cell-to-cell heterogeneity in reaction rates and time delays. To address this, we developed an efficient and scalable moment-based Bayesian inference method (MBI) with a user-friendly computational package that infers cell-to-cell heterogeneity in the non-Markovian signaling process. We applied MBI to single-cell expression profiles from promoters responding to antibiotics and discovered a major source of cell-to-cell variability in antibiotic stress response: the number of rate-limiting steps in signaling cascades. This knowledge can help identify effective therapies that destroy all pathogenic or cancer cells, and the approach can be applied to precision medicine.
Collapse
Affiliation(s)
- Dae Wook Kim
- Department of Mathematical Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
- Biomedical Mathematics Group, Institute for Basic Science, Daejeon 34126, Republic of Korea
| | - Hyukpyo Hong
- Department of Mathematical Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
- Biomedical Mathematics Group, Institute for Basic Science, Daejeon 34126, Republic of Korea
| | - Jae Kyoung Kim
- Department of Mathematical Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
- Biomedical Mathematics Group, Institute for Basic Science, Daejeon 34126, Republic of Korea
| |
Collapse
|
47
|
Gorin G, Pachter L. Modeling bursty transcription and splicing with the chemical master equation. Biophys J 2022; 121:1056-1069. [PMID: 35143775 PMCID: PMC8943761 DOI: 10.1016/j.bpj.2022.02.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 11/29/2021] [Accepted: 02/03/2022] [Indexed: 11/16/2022] Open
Abstract
Splicing cascades that alter gene products posttranscriptionally also affect expression dynamics. We study a class of processes and associated distributions that emerge from models of bursty promoters coupled to directed acyclic graphs of splicing. These solutions provide full time-dependent joint distributions for an arbitrary number of species with general noise behaviors and transient phenomena, offering qualitative and quantitative insights about how splicing can regulate expression dynamics. Finally, we derive a set of quantitative constraints on the minimum complexity necessary to reproduce gene coexpression patterns using synchronized burst models. We validate these findings by analyzing long-read sequencing data, where we find evidence of expression patterns largely consistent with these constraints.
Collapse
Affiliation(s)
- Gennady Gorin
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California
| | - Lior Pachter
- Division of Biology and Biological Engineering & Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, California.
| |
Collapse
|
48
|
Lentiviral Nef Proteins Differentially Govern the Establishment of Viral Latency. J Virol 2022; 96:e0220621. [PMID: 35266804 DOI: 10.1128/jvi.02206-21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Despite the clinical importance of latent human immunodeficiency virus type 1 (HIV-1) infection, our understanding of the biomolecular processes involved in HIV-1 latency control is still limited. This study was designed to address whether interactions between viral proteins, specifically HIV Nef, and the host cell could affect latency establishment. The study was driven by three reported observations. First, early reports suggested that human immunodeficiency virus type 2 (HIV-2) infection in patients produces a lower viral RNA/DNA ratio than HIV-1 infection, potentially indicating an increased propensity of HIV-2 to produce latent infection. Second, Nef, an early viral gene product, has been shown to alter the activation state of infected cells in a lentiviral lineage-dependent manner. Third, it has been demonstrated that the ability of HIV-1 to establish latent infection is a function of the activation state of the host cell at the time of infection. Based on these observations, we reasoned that HIV-2 Nef may have the ability to promote latency establishment. We demonstrate that HIV-1 latency establishment in T cell lines and primary T cells is indeed differentially modulated by Nef proteins. In the context of an HIV-1 backbone, HIV-1 Nef promoted active HIV-1 infection, while HIV-2 Nef strongly promoted latency establishment. Given that Nef represents the only difference in these HIV-1 vectors and is known to interact with numerous cellular factors, these data add support to the idea that latency establishment is a host cell-virus interaction phenomenon, but they also suggest that the HIV-1 lineage may have evolved mechanisms to counteract host cell suppression. IMPORTANCE Therapeutic attempts to eliminate the latent HIV-1 reservoir have failed, at least in part due to our incomplete biomolecular understanding of how latent HIV-1 infection is established and maintained. We here address the fundamental question of whether all lentiviruses actually possess a similar capacity to establish latent infections or whether there are differences between the lentiviral lineages driving differential latency establishment that could be exploited to develop improved latency reversal agents. Research investigating the viral RNA/DNA ratio in HIV-1 and HIV-2 patients could suggest that HIV-2 indeed has a much higher propensity to establish latent infections, a trait that we found, at least in part, to be attributable to the HIV-2 Nef protein. Reported Nef-mediated effects on host cell activation thus also affect latency establishment, and HIV-1 vectors that carry different lentiviral nef genes should become key tools to develop a better understanding of the biomolecular basis of HIV-1 latency establishment.
Collapse
|
49
|
Chen L, Lin G, Jiao F. Using average transcription level to understand the regulation of stochastic gene activation. ROYAL SOCIETY OPEN SCIENCE 2022; 9:211757. [PMID: 35223065 PMCID: PMC8847896 DOI: 10.1098/rsos.211757] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Accepted: 01/24/2022] [Indexed: 05/03/2023]
Abstract
Gene activation is a random process, modelled as a framework of multiple rate-limiting steps listed sequentially, in parallel or in combination. Together with suitably assumed processes of gene inactivation, transcript birth and death, the step numbers and parameters in activation frameworks can be estimated by fitting single-cell transcription data. However, current algorithms require computing master equations that are tightly correlated with prior hypothetical frameworks of gene activation. We found that prior estimation of the framework can be facilitated by the traditional dynamical data of mRNA average level M(t), presenting discriminated dynamical features. Rigorous theory regarding M(t) profiles allows to confidently rule out the frameworks that fail to capture M(t) features and to test potential competent frameworks by fitting M(t) data. We implemented this procedure for a large number of mouse fibroblast genes under tumour necrosis factor induction and determined exactly the 'cross-talking n-state' framework; the cross-talk between the signalling and basal pathways is crucial to trigger the first peak of M(t), while the following damped gentle M(t) oscillation is regulated by the multi-step basal pathway. This framework can be used to fit sophisticated single-cell data and may facilitate a more accurate understanding of stochastic activation of mouse fibroblast genes.
Collapse
Affiliation(s)
- Liang Chen
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, People’s Republic of China
- School of Mathematics and Information Sciences, Guangzhou University, Guangzhou, People’s Republic of China
| | - Genghong Lin
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, People’s Republic of China
- School of Mathematics and Information Sciences, Guangzhou University, Guangzhou, People’s Republic of China
| | - Feng Jiao
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, People’s Republic of China
- School of Mathematics and Information Sciences, Guangzhou University, Guangzhou, People’s Republic of China
| |
Collapse
|
50
|
Wu Y, Pegoraro AF, Weitz DA, Janmey P, Sun SX. The correlation between cell and nucleus size is explained by an eukaryotic cell growth model. PLoS Comput Biol 2022; 18:e1009400. [PMID: 35180215 PMCID: PMC8893647 DOI: 10.1371/journal.pcbi.1009400] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Revised: 03/03/2022] [Accepted: 01/12/2022] [Indexed: 12/19/2022] Open
Abstract
In eukaryotes, the cell volume is observed to be strongly correlated with the nuclear volume. The slope of this correlation depends on the cell type, growth condition, and the physical environment of the cell. We develop a computational model of cell growth and proteome increase, incorporating the kinetics of amino acid import, protein/ribosome synthesis and degradation, and active transport of proteins between the cytoplasm and the nucleoplasm. We also include a simple model of ribosome biogenesis and assembly. Results show that the cell volume is tightly correlated with the nuclear volume, and the cytoplasm-nucleoplasm transport rates strongly influence the cell growth rate as well as the cell/nucleus volume ratio (C/N ratio). Ribosome assembly and the ratio of ribosomal proteins to mature ribosomes also influence the cell volume and the cell growth rate. We find that in order to regulate the cell growth rate and the cell/nucleus volume ratio, the cell must optimally control groups of kinetic and transport parameters together, which could explain the quantitative roles of canonical growth pathways. Finally, although not explicitly demonstrated in this work, we point out that it is possible to construct a detailed proteome distribution using our model and RNAseq data, provided that a quantitative cell division mechanism is known.
Collapse
Affiliation(s)
- Yufei Wu
- Department of Mechanical Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
- Institute for NanoBioTechnology, Johns Hopkins University, Baltimore, Maryland, United States of America
| | | | - David A. Weitz
- Department of Physics, Harvard University, Boston, Massachusetts, United States of America
| | - Paul Janmey
- Department of Cell and Developmental Biology, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania, United States of America
| | - Sean X. Sun
- Department of Mechanical Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
- Institute for NanoBioTechnology, Johns Hopkins University, Baltimore, Maryland, United States of America
- Center for Cell Dynamics, Johns Hopkins School of Medicine, Baltimore, Maryland, United States of America
- * E-mail:
| |
Collapse
|