1
|
Wang Y, Zheng P, Cheng YC, Wang Z, Aravkin A. WENDY: Covariance dynamics based gene regulatory network inference. Math Biosci 2024; 377:109284. [PMID: 39168402 DOI: 10.1016/j.mbs.2024.109284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Revised: 06/25/2024] [Accepted: 08/16/2024] [Indexed: 08/23/2024]
Abstract
Determining gene regulatory network (GRN) structure is a central problem in biology, with a variety of inference methods available for different types of data. For a widely prevalent and challenging use case, namely single-cell gene expression data measured after intervention at multiple time points with unknown joint distributions, there is only one known specifically developed method, which does not fully utilize the rich information contained in this data type. We develop an inference method for the GRN in this case, netWork infErence by covariaNce DYnamics, dubbed WENDY. The core idea of WENDY is to model the dynamics of the covariance matrix, and solve this dynamics as an optimization problem to determine the regulatory relationships. To evaluate its effectiveness, we compare WENDY with other inference methods using synthetic data and experimental data. Our results demonstrate that WENDY performs well across different data sets.
Collapse
Affiliation(s)
- Yue Wang
- Irving Institute for Cancer Dynamics and Department of Statistics, Columbia University, New York, 10027, NY, USA.
| | - Peng Zheng
- Institute for Health Metrics and Evaluation, Seattle, 98195, WA, USA; Department of Health Metrics Sciences, University of Washington, Seattle, 98195, WA, USA
| | - Yu-Chen Cheng
- Department of Data Science, Dana-Farber Cancer Institute, Boston, 02215, MA, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, 02115, MA, USA; Center for Cancer Evolution, Dana-Farber Cancer Institute, Boston, 02215, MA, USA; Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, 02138, MA, USA
| | - Zikun Wang
- Laboratory of Genetics, The Rockefeller University, New York, 10065, NY, USA
| | - Aleksandr Aravkin
- Department of Applied Mathematics, University of Washington, Seattle, 98195, WA, USA
| |
Collapse
|
2
|
Sun R, Cao W, Li S, Jiang J, Shi Y, Zhang B. scGRN-Entropy: Inferring cell differentiation trajectories using single-cell data and gene regulation network-based transfer entropy. PLoS Comput Biol 2024; 20:e1012638. [PMID: 39585902 PMCID: PMC11627384 DOI: 10.1371/journal.pcbi.1012638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2024] [Revised: 12/09/2024] [Accepted: 11/12/2024] [Indexed: 11/27/2024] Open
Abstract
Research on cell differentiation facilitates a deeper understanding of the fundamental processes of life, elucidates the intrinsic mechanisms underlying diseases such as cancer, and advances the development of therapeutics and precision medicine. Existing methods for inferring cell differentiation trajectories from single-cell RNA sequencing (scRNA-seq) data primarily rely on static gene expression data to measure distances between cells and subsequently infer pseudotime trajectories. In this work, we introduce a novel method, scGRN-Entropy, for inferring cell differentiation trajectories and pseudotime from scRNA-seq data. Unlike existing approaches, scGRN-Entropy improves inference accuracy by incorporating dynamic changes in gene regulatory networks (GRN). In scGRN-Entropy, an undirected graph representing state transitions between cells is constructed by integrating both static relationships in gene expression space and dynamic relationships in the GRN space. The edges of the undirected graph are then refined using pseudotime inferred based on cell entropy in the GRN space. Finally, the Minimum Spanning Tree (MST) algorithm is applied to derive the cell differentiation trajectory. We validate the accuracy of scGRN-Entropy on eight different real scRNA-seq datasets, demonstrating its superior performance in inferring cell differentiation trajectories through comparative analysis with existing state-of-the-art methods.
Collapse
Affiliation(s)
- Rui Sun
- School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan, Hubei, China
- Center for Applied Mathematics and Interdisciplinary Studies, Wuhan Textile University, Wuhan, Hubei, China
| | - Wenjie Cao
- School of Mathematics, Sun Yat-sen University, Guangzhou, Guangdong, China
| | - ShengXuan Li
- School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan, Hubei, China
- Center for Applied Mathematics and Interdisciplinary Studies, Wuhan Textile University, Wuhan, Hubei, China
| | - Jian Jiang
- School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan, Hubei, China
- Center for Applied Mathematics and Interdisciplinary Studies, Wuhan Textile University, Wuhan, Hubei, China
| | - Yazhou Shi
- School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan, Hubei, China
- Center for Applied Mathematics and Interdisciplinary Studies, Wuhan Textile University, Wuhan, Hubei, China
| | - Bengong Zhang
- School of Mathematical & Physical Sciences, Wuhan Textile University, Wuhan, Hubei, China
- Center for Applied Mathematics and Interdisciplinary Studies, Wuhan Textile University, Wuhan, Hubei, China
| |
Collapse
|
3
|
Silkwood K, Dollinger E, Gervin J, Atwood S, Nie Q, Lander AD. Leveraging gene correlations in single cell transcriptomic data. BMC Bioinformatics 2024; 25:305. [PMID: 39294560 PMCID: PMC11411778 DOI: 10.1186/s12859-024-05926-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Accepted: 09/09/2024] [Indexed: 09/20/2024] Open
Abstract
BACKGROUND Many approaches have been developed to overcome technical noise in single cell RNA-sequencing (scRNAseq). As researchers dig deeper into data-looking for rare cell types, subtleties of cell states, and details of gene regulatory networks-there is a growing need for algorithms with controllable accuracy and fewer ad hoc parameters and thresholds. Impeding this goal is the fact that an appropriate null distribution for scRNAseq cannot simply be extracted from data in which ground truth about biological variation is unknown (i.e., usually). RESULTS We approach this problem analytically, assuming that scRNAseq data reflect only cell heterogeneity (what we seek to characterize), transcriptional noise (temporal fluctuations randomly distributed across cells), and sampling error (i.e., Poisson noise). We analyze scRNAseq data without normalization-a step that skews distributions, particularly for sparse data-and calculate p values associated with key statistics. We develop an improved method for selecting features for cell clustering and identifying gene-gene correlations, both positive and negative. Using simulated data, we show that this method, which we call BigSur (Basic Informatics and Gene Statistics from Unnormalized Reads), captures even weak yet significant correlation structures in scRNAseq data. Applying BigSur to data from a clonal human melanoma cell line, we identify thousands of correlations that, when clustered without supervision into gene communities, align with known cellular components and biological processes, and highlight potentially novel cell biological relationships. CONCLUSIONS New insights into functionally relevant gene regulatory networks can be obtained using a statistically grounded approach to the identification of gene-gene correlations.
Collapse
Affiliation(s)
- Kai Silkwood
- Center for Complex Biological Systems, University of California, Irvine, Irvine, CA, USA
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA
| | - Emmanuel Dollinger
- Center for Complex Biological Systems, University of California, Irvine, Irvine, CA, USA
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA
- Department of Mathematics, University of California, Irvine, Irvine, CA, USA
| | - Joshua Gervin
- Center for Complex Biological Systems, University of California, Irvine, Irvine, CA, USA
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA
| | - Scott Atwood
- Center for Complex Biological Systems, University of California, Irvine, Irvine, CA, USA
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA
| | - Qing Nie
- Center for Complex Biological Systems, University of California, Irvine, Irvine, CA, USA
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA
- Department of Mathematics, University of California, Irvine, Irvine, CA, USA
| | - Arthur D Lander
- Center for Complex Biological Systems, University of California, Irvine, Irvine, CA, USA.
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA.
| |
Collapse
|
4
|
Barcenas M, Bocci F, Nie Q. Tipping points in epithelial-mesenchymal lineages from single-cell transcriptomics data. Biophys J 2024; 123:2849-2859. [PMID: 38504523 PMCID: PMC11393678 DOI: 10.1016/j.bpj.2024.03.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 02/09/2024] [Accepted: 03/15/2024] [Indexed: 03/21/2024] Open
Abstract
Understanding cell fate decision-making during complex biological processes is an open challenge that is now aided by high-resolution single-cell sequencing technologies. Specifically, it remains challenging to identify and characterize transition states corresponding to "tipping points" whereby cells commit to new cell states. Here, we present a computational method that takes advantage of single-cell transcriptomics data to infer the stability and gene regulatory networks (GRNs) along cell lineages. Our method uses the unspliced and spliced counts from single-cell RNA sequencing data and cell ordering along lineage trajectories to train an RNA splicing multivariate model, from which cell-state stability along the lineage is inferred based on spectral analysis of the model's Jacobian matrix. Moreover, the model infers the RNA cross-species interactions resulting in GRNs and their variation along the cell lineage. When applied to epithelial-mesenchymal transition in ovarian and lung cancer-derived cell lines, our model predicts a saddle-node transition between the epithelial and mesenchymal states passing through an unstable, intermediate cell state. Furthermore, we show that the underlying GRN controlling epithelial-mesenchymal transition rearranges during the transition, resulting in denser and less modular networks in the intermediate state. Overall, our method represents a flexible tool to study cell lineages with a combination of theory-driven modeling and single-cell transcriptomics data.
Collapse
Affiliation(s)
- Manuel Barcenas
- Department of Mathematics, University of California Irvine, Irvine, California
| | - Federico Bocci
- Department of Mathematics, University of California Irvine, Irvine, California; NSF-Simons Center for Multiscale Cell Fate Research, University of California Irvine, Irvine, California.
| | - Qing Nie
- Department of Mathematics, University of California Irvine, Irvine, California; NSF-Simons Center for Multiscale Cell Fate Research, University of California Irvine, Irvine, California.
| |
Collapse
|
5
|
Ramirez DA, Lu M. Dissecting reversible and irreversible single cell state transitions from gene regulatory networks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.30.610498. [PMID: 39257745 PMCID: PMC11384016 DOI: 10.1101/2024.08.30.610498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2024]
Abstract
Understanding cell state transitions and their governing regulatory mechanisms remains one of the fundamental questions in biology. We develop a computational method, state transition inference using cross-cell correlations (STICCC), for predicting reversible and irreversible cell state transitions at single-cell resolution by using gene expression data and a set of gene regulatory interactions. The method is inspired by the fact that the gene expression time delays between regulators and targets can be exploited to infer past and future gene expression states. From applications to both simulated and experimental single-cell gene expression data, we show that STICCC-inferred vector fields capture basins of attraction and irreversible fluxes. By connecting regulatory information with systems' dynamical behaviors, STICCC reveals how network interactions influence reversible and irreversible state transitions. Compared to existing methods that infer pseudotime and RNA velocity, STICCC provides complementary insights into the gene regulation of cell state transitions.
Collapse
Affiliation(s)
- Daniel A. Ramirez
- Center for Theoretical Biological Physics, Northeastern University, Boston, MA 02115, USA
- Department of Bioengineering, Northeastern University, Boston, MA 02115, USA
| | - Mingyang Lu
- Center for Theoretical Biological Physics, Northeastern University, Boston, MA 02115, USA
- Department of Bioengineering, Northeastern University, Boston, MA 02115, USA
| |
Collapse
|
6
|
Johnston KG, Grieco SF, Nie Q, Theis FJ, Xu X. Small data methods in omics: the power of one. Nat Methods 2024; 21:1597-1602. [PMID: 39174710 DOI: 10.1038/s41592-024-02390-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2023] [Accepted: 07/24/2024] [Indexed: 08/24/2024]
Abstract
Over the last decade, biology has begun utilizing 'big data' approaches, resulting in large, comprehensive atlases in modalities ranging from transcriptomics to neural connectomics. However, these approaches must be complemented and integrated with 'small data' approaches to efficiently utilize data from individual labs. Integration of smaller datasets with major reference atlases is critical to provide context to individual experiments, and approaches toward integration of large and small data have been a major focus in many fields in recent years. Here we discuss progress in integration of small data with consortium-sized atlases across multiple modalities, and its potential applications. We then examine promising future directions for utilizing the power of small data to maximize the information garnered from small-scale experiments. We envision that, in the near future, international consortia comprising many laboratories will work together to collaboratively build reference atlases and foundation models using small data methods.
Collapse
Affiliation(s)
- Kevin G Johnston
- Department of Mathematics, University of California, Irvine, Irvine, CA, USA
- Department of Anatomy and Neurobiology, School of Medicine, University of California, Irvine, Irvine, CA, USA
| | - Steven F Grieco
- Department of Anatomy and Neurobiology, School of Medicine, University of California, Irvine, Irvine, CA, USA
- Center for Neural Circuit Mapping, University of California, Irvine, Irvine, CA, USA
| | - Qing Nie
- Department of Mathematics, University of California, Irvine, Irvine, CA, USA.
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA, USA.
| | - Fabian J Theis
- Helmholtz Center Munich-German Research Center for Environmental Health, Institute of Computational Biology, Neuherberg, Germany.
- School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany.
- Department of Mathematics, Technical University of Munich, Munich, Germany.
| | - Xiangmin Xu
- Department of Anatomy and Neurobiology, School of Medicine, University of California, Irvine, Irvine, CA, USA.
- Center for Neural Circuit Mapping, University of California, Irvine, Irvine, CA, USA.
| |
Collapse
|
7
|
Zhu L, Yang S, Zhang K, Wang H, Fang X, Wang J. Uncovering underlying physical principles and driving forces of cell differentiation and reprogramming from single-cell transcriptomics. Proc Natl Acad Sci U S A 2024; 121:e2401540121. [PMID: 39150785 PMCID: PMC11348339 DOI: 10.1073/pnas.2401540121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2024] [Accepted: 06/28/2024] [Indexed: 08/18/2024] Open
Abstract
Recent advances in single-cell sequencing technology have revolutionized our ability to acquire whole transcriptome data. However, uncovering the underlying transcriptional drivers and nonequilibrium driving forces of cell function directly from these data remains challenging. We address this by learning cell state vector fields from discrete single-cell RNA velocity to quantify the single-cell global nonequilibrium driving forces as landscape and flux. From single-cell data, we quantified the Waddington landscape, showing that optimal paths for differentiation and reprogramming deviate from the naively expected landscape gradient paths and may not pass through landscape saddles at finite fluctuations, challenging conventional transition state estimation of kinetic rate for cell fate decisions due to the presence of the flux. A key insight from our study is that stem/progenitor cells necessitate greater energy dissipation for rapid cell cycles and self-renewal, maintaining pluripotency. We predict optimal developmental pathways and elucidate the nucleation mechanism of cell fate decisions, with transition states as nucleation sites and pioneer genes as nucleation seeds. The concept of loop flux quantifies the contributions of each cycle flux to cell state transitions, facilitating the understanding of cell dynamics and thermodynamic cost, and providing insights into optimizing biological functions. We also infer cell-cell interactions and cell-type-specific gene regulatory networks, encompassing feedback mechanisms and interaction intensities, predicting genetic perturbation effects on cell fate decisions from single-cell omics data. Essentially, our methodology validates the landscape and flux theory, along with its associated quantifications, offering a framework for exploring the physical principles underlying cellular differentiation and reprogramming and broader biological processes through high-throughput single-cell sequencing experiments.
Collapse
Affiliation(s)
- Ligang Zhu
- College of Physics, Jilin University, Changchun130021, China
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun130022, China
| | - Songlin Yang
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun130022, China
| | - Kun Zhang
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun130022, China
| | - Hong Wang
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun130022, China
| | - Xiaona Fang
- College of Chemistry, Northeast Normal University, Changchun130024, China
| | - Jin Wang
- Center for Theoretical Interdisciplinary Sciences, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou325001, China
- Department of Chemistry, Physics and Astronomy, Stony Brook University, Stony Brook, NY11794
| |
Collapse
|
8
|
Ba T, Miao H, Zhang L, Gao C, Wang Y. ClusterMatch aligns single-cell RNA-sequencing data at the multi-scale cluster level via stable matching. Bioinformatics 2024; 40:btae480. [PMID: 39073888 PMCID: PMC11520419 DOI: 10.1093/bioinformatics/btae480] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Revised: 07/04/2024] [Accepted: 07/28/2024] [Indexed: 07/31/2024] Open
Abstract
MOTIVATION Unsupervised clustering of single-cell RNA sequencing (scRNA-seq) data holds the promise of characterizing known and novel cell type in various biological and clinical contexts. However, intrinsic multi-scale clustering resolutions poses challenges to deal with multiple sources of variability in the high-dimensional and noisy data. RESULTS We present ClusterMatch, a stable match optimization model to align scRNA-seq data at the cluster level. In one hand, ClusterMatch leverages the mutual correspondence by canonical correlation analysis and multi-scale Louvain clustering algorithms to identify cluster with optimized resolutions. In the other hand, it utilizes stable matching framework to align scRNA-seq data in the latent space while maintaining interpretability with overlapped marker gene set. Through extensive experiments, we demonstrate the efficacy of ClusterMatch in data integration, cell type annotation, and cross-species/timepoint alignment scenarios. Our results show ClusterMatch's ability to utilize both global and local information of scRNA-seq data, sets the appropriate resolution of multi-scale clustering, and offers interpretability by utilizing marker genes. AVAILABILITY AND IMPLEMENTATION The code of ClusterMatch software is freely available at https://github.com/AMSSwanglab/ClusterMatch.
Collapse
Affiliation(s)
- Teer Ba
- School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China
- School of Mathematical Sciences, Inner Mongolia University, Hohhot 010021, China
| | - Hao Miao
- CEMS, NCMIS, HCMS, MDIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China
- School of Mathematics, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Beijing 100049, China
| | - Lirong Zhang
- School of Physical Science and Technology, Inner Mongolia University, Hohhot 010021, China
| | - Caixia Gao
- School of Mathematical Sciences, Inner Mongolia University, Hohhot 010021, China
| | - Yong Wang
- CEMS, NCMIS, HCMS, MDIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China
- School of Mathematics, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Beijing 100049, China
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming 650223, China
- Key Laboratory of Systems Biology, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Hangzhou 330106, China
| |
Collapse
|
9
|
Zhou P, Bocci F, Li T, Nie Q. Spatial transition tensor of single cells. Nat Methods 2024; 21:1053-1062. [PMID: 38755322 PMCID: PMC11166574 DOI: 10.1038/s41592-024-02266-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Accepted: 04/02/2024] [Indexed: 05/18/2024]
Abstract
Spatial transcriptomics and messenger RNA splicing encode extensive spatiotemporal information for cell states and transitions. The current lineage-inference methods either lack spatial dynamics for state transition or cannot capture different dynamics associated with multiple cell states and transition paths. Here we present spatial transition tensor (STT), a method that uses messenger RNA splicing and spatial transcriptomes through a multiscale dynamical model to characterize multistability in space. By learning a four-dimensional transition tensor and spatial-constrained random walk, STT reconstructs cell-state-specific dynamics and spatial state transitions via both short-time local tensor streamlines between cells and long-time transition paths among attractors. Benchmarking and applications of STT on several transcriptome datasets via multiple technologies on epithelial-mesenchymal transitions, blood development, spatially resolved mouse brain and chicken heart development, indicate STT's capability in recovering cell-state-specific dynamics and their associated genes not seen using existing methods. Overall, STT provides a consistent multiscale description of single-cell transcriptome data across multiple spatiotemporal scales.
Collapse
Affiliation(s)
- Peijie Zhou
- Department of Mathematics, University of California, Irvine, Irvine, CA, USA
- Center for Machine Learning Research, Peking University, Beijing, China
- AI for Science Institute, Beijing, China
- National Engineering Laboratory for Big Data Analysis and Applications, Beijing, China
| | - Federico Bocci
- Department of Mathematics, University of California, Irvine, Irvine, CA, USA
| | - Tiejun Li
- LMAM and School of Mathematical Sciences, Peking University, Beijing, China
| | - Qing Nie
- Department of Mathematics, University of California, Irvine, Irvine, CA, USA.
- Department of Cell and Developmental Biology, University of California, Irvine, Irvine, CA, USA.
| |
Collapse
|
10
|
Hong T, Xing J. Data- and theory-driven approaches for understanding paths of epithelial-mesenchymal transition. Genesis 2024; 62:e23591. [PMID: 38553870 PMCID: PMC11017362 DOI: 10.1002/dvg.23591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Revised: 02/16/2024] [Accepted: 03/16/2024] [Indexed: 04/02/2024]
Abstract
Reversible transitions between epithelial and mesenchymal cell states are a crucial form of epithelial plasticity for development and disease progression. Recent experimental data and mechanistic models showed multiple intermediate epithelial-mesenchymal transition (EMT) states as well as trajectories of EMT underpinned by complex gene regulatory networks. In this review, we summarize recent progress in quantifying EMT and characterizing EMT paths with computational methods and quantitative experiments including omics-level measurements. We provide perspectives on how these studies can help relating fundamental cell biology to physiological and pathological outcomes of EMT.
Collapse
Affiliation(s)
- Tian Hong
- Department of Biochemistry & Cellular and Molecular Biology, The University of Tennessee, Knoxville, Knoxville TN, USA
| | - Jianhua Xing
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA
- UPMC-Hillman Cancer Center, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Physics and Astronomy, University of Pittsburgh, Pittsburgh, PA, USA
| |
Collapse
|
11
|
Zhu L, Wang J. Quantifying Landscape-Flux via Single-Cell Transcriptomics Uncovers the Underlying Mechanism of Cell Cycle. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2308879. [PMID: 38353329 DOI: 10.1002/advs.202308879] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/18/2023] [Revised: 01/23/2024] [Indexed: 04/25/2024]
Abstract
Recent developments in single-cell sequencing technology enable the acquisition of entire transcriptome data. Understanding the underlying mechanism and identifying the driving force of transcriptional regulation governing cell function directly from these data remains challenging. This study reconstructs a continuous vector field of the cell cycle based on discrete single-cell RNA velocity to quantify the single-cell global nonequilibrium dynamic landscape-flux. It reveals that large fluctuations disrupt the global landscape and genetic perturbations alter landscape-flux, thus identifying key genes in maintaining cell cycle dynamics and predicting associated functional effects. Additionally, it quantifies the fundamental energy cost of the cell cycle initiation and unveils that sustaining the cell cycle requires curl flux and dissipation to maintain the oscillatory phase coherence. This study enables the inference of the cell cycle gene regulatory networks directly from the single-cell transcriptomic data, including the feedback mechanisms and interaction intensity. This provides a golden opportunity to experimentally verify the landscape-flux theory and also obtain its associated quantifications. It also offers a unique framework for combining the landscape-flux theory and single-cell high-through sequencing experiments for understanding the underlying mechanisms of the cell cycle and can be extended to other nonequilibrium biological processes, such as differentiation development and disease pathogenesis.
Collapse
Affiliation(s)
- Ligang Zhu
- College of Physics, Jilin University, Changchun, 130021, P. R. China
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, 130022, P. R. China
| | - Jin Wang
- Center for Theoretical Interdisciplinary Sciences, Wenzhou Institute, University of Chinese Academy of Sciences, Wenzhou, 325001, P. R. China
- Department of Chemistry, Physics and Astronomy, Stony Brook University, Stony Brook, NY, 11794, USA
| |
Collapse
|
12
|
Liu J, Li C. Data-driven energy landscape reveals critical genes in cancer progression. NPJ Syst Biol Appl 2024; 10:27. [PMID: 38459043 PMCID: PMC10923824 DOI: 10.1038/s41540-024-00354-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 02/26/2024] [Indexed: 03/10/2024] Open
Abstract
The evolution of cancer is a complex process characterized by stable states and transitions among them. Studying the dynamic evolution of cancer and revealing the mechanisms of cancer progression based on experimental data is an important topic. In this study, we aim to employ a data-driven energy landscape approach to analyze the dynamic evolution of cancer. We take Kidney renal clear cell carcinoma (KIRC) as an example. From the energy landscape, we introduce two quantitative indicators (transition probability and barrier height) to study critical shifts in KIRC cancer evolution, including cancer onset and progression, and identify critical genes involved in these transitions. Our results successfully identify crucial genes that either promote or inhibit these transition processes in KIRC. We also conduct a comprehensive biological function analysis on these genes, validating the accuracy and reliability of our predictions. This work has implications for discovering new biomarkers, drug targets, and cancer treatment strategies in KIRC.
Collapse
Affiliation(s)
- Juntan Liu
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, 200433, China
- Shanghai Center for Mathematical Sciences, Fudan University, Shanghai, 200433, China
| | - Chunhe Li
- Institute of Science and Technology for Brain-Inspired Intelligence, Fudan University, Shanghai, 200433, China.
- Shanghai Center for Mathematical Sciences, Fudan University, Shanghai, 200433, China.
- School of Mathematical Sciences and MOE Frontiers Center for Brain Science, Fudan University, Shanghai, 200433, China.
| |
Collapse
|
13
|
Wang Y, Shtylla B, Chou T. Order-of-Mutation Effects on Cancer Progression: Models for Myeloproliferative Neoplasm. Bull Math Biol 2024; 86:32. [PMID: 38363386 PMCID: PMC10873249 DOI: 10.1007/s11538-024-01257-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 01/08/2024] [Indexed: 02/17/2024]
Abstract
In some patients with myeloproliferative neoplasms (MPN), two genetic mutations are often found: JAK2 V617F and one in the TET2 gene. Whether one mutation is present influences how the other subsequent mutation will affect the regulation of gene expression. In other words, when a patient carries both mutations, the order of when they first arose has been shown to influence disease progression and prognosis. We propose a nonlinear ordinary differential equation, the Moran process, and Markov chain models to explain the non-additive and non-commutative mutation effects on recent clinical observations of gene expression patterns, proportions of cells with different mutations, and ages at diagnosis of MPN. Combined, these observations are used to shape our modeling framework. Our key proposal is that bistability in gene expression provides a natural explanation for many observed order-of-mutation effects. We also propose potential experimental measurements that can be used to confirm or refute predictions of our models.
Collapse
Affiliation(s)
- Yue Wang
- Department of Computational Medicine, UCLA, Los Angeles, CA, 90095, USA
- Department of Statistics, Irving Institute for Cancer Dynamics, Columbia University, New York, NY, 10027, USA
| | - Blerta Shtylla
- Mathematics Department, Pomona College, Claremont, CA, 91711, USA
- Pharmacometrics and Systems Pharmacology, Pfizer Research and Development, San Diego, CA, 92121, USA
| | - Tom Chou
- Department of Computational Medicine, UCLA, Los Angeles, CA, 90095, USA.
- Department of Mathematics, UCLA, Los Angeles, CA, 90095, USA.
| |
Collapse
|
14
|
He X, Tang R, Lou J, Wang R. Identifying key factors in cell fate decisions by machine learning interpretable strategies. J Biol Phys 2023; 49:443-462. [PMID: 37458834 PMCID: PMC10651582 DOI: 10.1007/s10867-023-09640-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 06/15/2023] [Indexed: 11/16/2023] Open
Abstract
Cell fate decisions and transitions are common in almost all developmental processes. Therefore, it is important to identify the decision-making mechanisms and important individual molecules behind the fate decision processes. In this paper, we propose an interpretable strategy based on systematic perturbation, unsupervised hierarchical cluster analysis (HCA), machine learning (ML), and Shapley additive explanation (SHAP) analysis for inferring the contribution and importance of individual molecules in cell fate decision and transition processes. In order to verify feasibility of the approach, we apply it to the core epithelial to mesenchymal transition (EMT)-metastasis network. The key factors identified in EMT-metastasis are consistent with relevant experimental observations. The approach presented here can be applied to other biological networks to identify important factors related to cell fate decisions and transitions.
Collapse
Affiliation(s)
- Xinyu He
- Department of Mathematics, Shanghai University, Shanghai, 200444, China
| | - Ruoyu Tang
- Department of Mathematics, Shanghai University, Shanghai, 200444, China
| | - Jie Lou
- Department of Mathematics, Shanghai University, Shanghai, 200444, China.
- Newtouch Center for Mathematics of Shanghai University, Shanghai, 200444, China.
| | - Ruiqi Wang
- Department of Mathematics, Shanghai University, Shanghai, 200444, China.
- Newtouch Center for Mathematics of Shanghai University, Shanghai, 200444, China.
| |
Collapse
|
15
|
Sha Y, Qiu Y, Zhou P, Nie Q. Reconstructing growth and dynamic trajectories from single-cell transcriptomics data. NAT MACH INTELL 2023; 6:25-39. [PMID: 38274364 PMCID: PMC10805654 DOI: 10.1038/s42256-023-00763-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Accepted: 10/25/2023] [Indexed: 01/27/2024]
Abstract
Time-series single-cell RNA sequencing (scRNA-seq) datasets provide unprecedented opportunities to learn dynamic processes of cellular systems. Due to the destructive nature of sequencing, it remains challenging to link the scRNA-seq snapshots sampled at different time points. Here we present TIGON, a dynamic, unbalanced optimal transport algorithm that reconstructs dynamic trajectories and population growth simultaneously as well as the underlying gene regulatory network from multiple snapshots. To tackle the high-dimensional optimal transport problem, we introduce a deep learning method using a dimensionless formulation based on the Wasserstein-Fisher-Rao (WFR) distance. TIGON is evaluated on simulated data and compared with existing methods for its robustness and accuracy in predicting cell state transition and cell population growth. Using three scRNA-seq datasets, we show the importance of growth in the temporal inference, TIGON's capability in reconstructing gene expression at unmeasured time points and its applications to temporal gene regulatory networks and cell-cell communication inference.
Collapse
Affiliation(s)
- Yutong Sha
- Department of Mathematics, University of California, Irvine, Irvine, CA USA
| | - Yuchi Qiu
- Department of Mathematics, Michigan State University, East Lansing, MI USA
| | - Peijie Zhou
- Department of Mathematics, University of California, Irvine, Irvine, CA USA
| | - Qing Nie
- Department of Mathematics, University of California, Irvine, Irvine, CA USA
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine, CA USA
- The NSF-Simons Center for Multiscale Cell Fate Research, University of California, Irvine, Irvine, CA USA
| |
Collapse
|
16
|
Silkwood K, Dollinger E, Gervin J, Atwood S, Nie Q, Lander AD. Leveraging gene correlations in single cell transcriptomic data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.14.532643. [PMID: 36993765 PMCID: PMC10055147 DOI: 10.1101/2023.03.14.532643] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
BACKGROUND Many approaches have been developed to overcome technical noise in single cell RNA-sequencing (scRNAseq). As researchers dig deeper into data-looking for rare cell types, subtleties of cell states, and details of gene regulatory networks-there is a growing need for algorithms with controllable accuracy and fewer ad hoc parameters and thresholds. Impeding this goal is the fact that an appropriate null distribution for scRNAseq cannot simply be extracted from data when ground truth about biological variation is unknown (i.e., usually). RESULTS We approach this problem analytically, assuming that scRNAseq data reflect only cell heterogeneity (what we seek to characterize), transcriptional noise (temporal fluctuations randomly distributed across cells), and sampling error (i.e., Poisson noise). We analyze scRNAseq data without normalization-a step that skews distributions, particularly for sparse data-and calculate p-values associated with key statistics. We develop an improved method for selecting features for cell clustering and identifying gene-gene correlations, both positive and negative. Using simulated data, we show that this method, which we call BigSur (Basic Informatics and Gene Statistics from Unnormalized Reads), captures even weak yet significant correlation structures in scRNAseq data. Applying BigSur to data from a clonal human melanoma cell line, we identify thousands of correlations that, when clustered without supervision into gene communities, align with known cellular components and biological processes, and highlight potentially novel cell biological relationships. CONCLUSIONS New insights into functionally relevant gene regulatory networks can be obtained using a statistically grounded approach to the identification of gene-gene correlations.
Collapse
Affiliation(s)
- Kai Silkwood
- Center for Complex Biological Systems, University of California, Irvine, Irvine CA
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine CA
| | - Emmanuel Dollinger
- Center for Complex Biological Systems, University of California, Irvine, Irvine CA
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine CA
- Department of Mathematics, University of California, Irvine, Irvine CA
| | - Josh Gervin
- Center for Complex Biological Systems, University of California, Irvine, Irvine CA
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine CA
| | - Scott Atwood
- Center for Complex Biological Systems, University of California, Irvine, Irvine CA
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine CA
| | - Qing Nie
- Center for Complex Biological Systems, University of California, Irvine, Irvine CA
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine CA
- Department of Mathematics, University of California, Irvine, Irvine CA
| | - Arthur D. Lander
- Center for Complex Biological Systems, University of California, Irvine, Irvine CA
- Department of Developmental and Cell Biology, University of California, Irvine, Irvine CA
| |
Collapse
|
17
|
Wang Y, Shtylla B, Chou T. Order-of-mutation effects on cancer progression: models for myeloproliferative neoplasm. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.08.16.23294177. [PMID: 37662184 PMCID: PMC10473807 DOI: 10.1101/2023.08.16.23294177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2023]
Abstract
In some patients with myeloproliferative neoplasms (MPN), two genetic mutations are often found, JAK2 V617F and one in the TET2 gene. Whether or not one mutation is present will influence how the other subsequent mutation affects the regulation of gene expression. When both mutations are present, the order of their occurrence has been shown to influence disease progression and prognosis. We propose a nonlinear ordinary differential equation (ODE), Moran process, and Markov chain models to explain the non-additive and non-commutative mutation effects on recent clinical observations of gene expression patterns, proportions of cells with different mutations, and ages at diagnosis of MPN. These observations consistently shape our modeling framework. Our key proposal is that bistability in gene expression provides a natural explanation for many observed order-of-mutation effects. We also propose potential experimental measurements that can be used to confirm or refute predictions of our models.
Collapse
Affiliation(s)
- Yue Wang
- Dept. of Computational Medicine, UCLA, Los Angeles, CA 90095
- Irving Institute for Cancer Dynamics and Department of Statistics, Columbia University, New York, NY 10027
| | - Blerta Shtylla
- Mathematics Department, Pomona College, Claremont, CA, 91711
- Quantitative Systems Pharmacology, Oncology, Pfizer, San Diego, CA 92121
| | - Tom Chou
- Dept. of Computational Medicine, UCLA, Los Angeles, CA 90095
- Dept. of Mathematics, UCLA, Los Angeles, CA 90095
| |
Collapse
|
18
|
Bocci F, Jia D, Nie Q, Jolly MK, Onuchic J. Theoretical and computational tools to model multistable gene regulatory networks. REPORTS ON PROGRESS IN PHYSICS. PHYSICAL SOCIETY (GREAT BRITAIN) 2023; 86:10.1088/1361-6633/acec88. [PMID: 37531952 PMCID: PMC10521208 DOI: 10.1088/1361-6633/acec88] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Accepted: 08/02/2023] [Indexed: 08/04/2023]
Abstract
The last decade has witnessed a surge of theoretical and computational models to describe the dynamics of complex gene regulatory networks, and how these interactions can give rise to multistable and heterogeneous cell populations. As the use of theoretical modeling to describe genetic and biochemical circuits becomes more widespread, theoreticians with mathematical and physical backgrounds routinely apply concepts from statistical physics, non-linear dynamics, and network theory to biological systems. This review aims at providing a clear overview of the most important methodologies applied in the field while highlighting current and future challenges. It also includes hands-on tutorials to solve and simulate some of the archetypical biological system models used in the field. Furthermore, we provide concrete examples from the existing literature for theoreticians that wish to explore this fast-developing field. Whenever possible, we highlight the similarities and differences between biochemical and regulatory networks and 'classical' systems typically studied in non-equilibrium statistical and quantum mechanics.
Collapse
Affiliation(s)
- Federico Bocci
- The NSF-Simons Center for Multiscale Cell Fate Research, University of California, Irvine, CA 92697, USA
- Department of Mathematics, University of California, Irvine, CA 92697, USA
| | - Dongya Jia
- Center for Theoretical Biological Physics, Rice University, Houston, TX 77005, USA
| | - Qing Nie
- The NSF-Simons Center for Multiscale Cell Fate Research, University of California, Irvine, CA 92697, USA
- Department of Mathematics, University of California, Irvine, CA 92697, USA
| | - Mohit Kumar Jolly
- Centre for BioSystems Science and Engineering, Indian Institute of Science, Bangalore 560012, India
| | - José Onuchic
- Center for Theoretical Biological Physics, Rice University, Houston, TX 77005, USA
- Department of Physics and Astronomy, Rice University, Houston, TX 77005, USA
- Department of Chemistry, Rice University, Houston, TX 77005, USA
- Department of Biosciences, Rice University, Houston, TX 77005, USA
| |
Collapse
|
19
|
Wang Y, Shtylla B, Chou T. Order-of-mutation effects on cancer progression: models for myeloproliferative neoplasm. ARXIV 2023:arXiv:2308.09941v1. [PMID: 37645049 PMCID: PMC10462171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
In some patients with myeloproliferative neoplasms (MPN), two genetic mutations are often found, JAK2 V617F and one in the TET2 gene. Whether or not one mutation is present will influence how the other subsequent mutation affects the regulation of gene expression. When both mutations are present, the order of their occurrence has been shown to influence disease progression and prognosis. We propose a nonlinear ordinary differential equation (ODE), Moran process, and Markov chain models to explain the non-additive and non-commutative mutation effects on recent clinical observations of gene expression patterns, proportions of cells with different mutations, and ages at diagnosis of MPN. These observations consistently shape our modeling framework. Our key proposal is that bistability in gene expression provides a natural explanation for many observed order-of-mutation effects. We also propose potential experimental measurements that can be used to confirm or refute predictions of our models.
Collapse
Affiliation(s)
- Yue Wang
- Dept. of Computational Medicine, UCLA, Los Angeles, CA 90095
- Irving Institute for Cancer Dynamics and Department of Statistics, Columbia University, New York, NY 10027
| | - Blerta Shtylla
- Mathematics Department, Pomona College, Claremont, CA, 91711
- Quantitative Systems Pharmacology, Oncology, Pfizer, San Diego, CA 92121
| | - Tom Chou
- Dept. of Computational Medicine, UCLA, Los Angeles, CA 90095
- Dept. of Mathematics, UCLA, Los Angeles, CA 90095
| |
Collapse
|
20
|
Marku M, Pancaldi V. From time-series transcriptomics to gene regulatory networks: A review on inference methods. PLoS Comput Biol 2023; 19:e1011254. [PMID: 37561790 PMCID: PMC10414591 DOI: 10.1371/journal.pcbi.1011254] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/12/2023] Open
Abstract
Inference of gene regulatory networks has been an active area of research for around 20 years, leading to the development of sophisticated inference algorithms based on a variety of assumptions and approaches. With the ever increasing demand for more accurate and powerful models, the inference problem remains of broad scientific interest. The abstract representation of biological systems through gene regulatory networks represents a powerful method to study such systems, encoding different amounts and types of information. In this review, we summarize the different types of inference algorithms specifically based on time-series transcriptomics, giving an overview of the main applications of gene regulatory networks in computational biology. This review is intended to give an updated reference of regulatory networks inference tools to biologists and researchers new to the topic and guide them in selecting the appropriate inference method that best fits their questions, aims, and experimental data.
Collapse
Affiliation(s)
- Malvina Marku
- CRCT, Université de Toulouse, Inserm, CNRS, Université Toulouse III-Paul Sabatier, Centre de Recherches en Cancérologie de Toulouse, Toulouse, France
| | - Vera Pancaldi
- CRCT, Université de Toulouse, Inserm, CNRS, Université Toulouse III-Paul Sabatier, Centre de Recherches en Cancérologie de Toulouse, Toulouse, France
- Barcelona Supercomputing Center, Barcelona, Spain
| |
Collapse
|
21
|
Bocci F, Jia D, Nie Q, Jolly MK, Onuchic J. Theoretical and computational tools to model multistable gene regulatory networks. ARXIV 2023:arXiv:2302.07401v2. [PMID: 36824430 PMCID: PMC9949162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]
Abstract
The last decade has witnessed a surge of theoretical and computational models to describe the dynamics of complex gene regulatory networks, and how these interactions can give rise to multistable and heterogeneous cell populations. As the use of theoretical modeling to describe genetic and biochemical circuits becomes more widespread, theoreticians with mathematical and physical backgrounds routinely apply concepts from statistical physics, non-linear dynamics, and network theory to biological systems. This review aims at providing a clear overview of the most important methodologies applied in the field while highlighting current and future challenges. It also includes hands-on tutorials to solve and simulate some of the archetypical biological system models used in the field. Furthermore, we provide concrete examples from the existing literature for theoreticians that wish to explore this fast-developing field. Whenever possible, we highlight the similarities and differences between biochemical and regulatory networks and 'classical' systems typically studied in non-equilibrium statistical and quantum mechanics.
Collapse
|
22
|
He C, Zhou P, Nie Q. exFINDER: identify external communication signals using single-cell transcriptomics data. Nucleic Acids Res 2023; 51:e58. [PMID: 37026478 PMCID: PMC10250247 DOI: 10.1093/nar/gkad262] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 03/22/2023] [Accepted: 03/28/2023] [Indexed: 04/08/2023] Open
Abstract
Cells make decisions through their communication with other cells and receiving signals from their environment. Using single-cell transcriptomics, computational tools have been developed to infer cell-cell communication through ligands and receptors. However, the existing methods only deal with signals sent by the measured cells in the data, the received signals from the external system are missing in the inference. Here, we present exFINDER, a method that identifies such external signals received by the cells in the single-cell transcriptomics datasets by utilizing the prior knowledge of signaling pathways. In particular, exFINDER can uncover external signals that activate the given target genes, infer the external signal-target signaling network (exSigNet), and perform quantitative analysis on exSigNets. The applications of exFINDER to scRNA-seq datasets from different species demonstrate the accuracy and robustness of identifying external signals, revealing critical transition-related signaling activities, inferring critical external signals and targets, clustering signal-target paths, and evaluating relevant biological events. Overall, exFINDER can be applied to scRNA-seq data to reveal the external signal-associated activities and maybe novel cells that send such signals.
Collapse
Affiliation(s)
- Changhan He
- Department of Mathematics, University of California, Irvine, Irvine, CA 92697, USA
| | - Peijie Zhou
- Department of Mathematics, University of California, Irvine, Irvine, CA 92697, USA
| | - Qing Nie
- Department of Mathematics, University of California, Irvine, Irvine, CA 92697, USA
- Department of Cell and Developmental Biology, University of California, Irvine, Irvine, CA 92697, USA
| |
Collapse
|
23
|
Haghverdi L, Ludwig LS. Single-cell multi-omics and lineage tracing to dissect cell fate decision-making. Stem Cell Reports 2023; 18:13-25. [PMID: 36630900 PMCID: PMC9860164 DOI: 10.1016/j.stemcr.2022.12.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Revised: 12/07/2022] [Accepted: 12/07/2022] [Indexed: 01/12/2023] Open
Abstract
The concept of cell fate relates to the future identity of a cell, and its daughters, which is obtained via cell differentiation and division. Understanding, predicting, and manipulating cell fate has been a long-sought goal of developmental and regenerative biology. Recent insights obtained from single-cell genomic and integrative lineage-tracing approaches have further aided to identify molecular features predictive of cell fate. In this perspective, we discuss these approaches with a focus on theoretical concepts and future directions of the field to dissect molecular mechanisms underlying cell fate.
Collapse
Affiliation(s)
- Laleh Haghverdi
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany.
| | - Leif S Ludwig
- Max-Delbrück-Center for Molecular Medicine in the Helmholtz Association (MDC), Berlin Institute for Medical Systems Biology (BIMSB), Berlin, Germany; Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany.
| |
Collapse
|