1
|
Purice MD, Lago‐Baldaia I, Fernandes VM, Singhvi A. Molecular profiling of invertebrate glia. Glia 2025; 73:632-656. [PMID: 39415317 PMCID: PMC11784859 DOI: 10.1002/glia.24623] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2024] [Revised: 09/06/2024] [Accepted: 09/18/2024] [Indexed: 10/18/2024]
Abstract
Caenorhabditis elegans and Drosophila melanogaster are powerful experimental models for uncovering fundamental tenets of nervous system organization and function. Findings over the last two decades show that molecular and cellular features are broadly conserved between invertebrates and vertebrates, indicating that insights derived from invertebrate models can broadly inform our understanding of glial operating principles across diverse species. In recent years, these model systems have led to exciting discoveries in glial biology and mechanisms of glia-neuron interactions. Here, we summarize studies that have applied current state-of-the-art "-omics" techniques to C. elegans and D. melanogaster glia. Coupled with the remarkable acceleration in the pace of mechanistic studies of glia biology in recent years, these indicate that invertebrate glia also exhibit striking molecular complexity, specificity, and heterogeneity. We provide an overview of these studies and discuss their implications as well as emerging questions where C. elegans and D. melanogaster are well-poised to fill critical knowledge gaps in our understanding of glial biology.
Collapse
Affiliation(s)
- Maria D. Purice
- Division of Basic SciencesFred Hutchinson Cancer CenterSeattleWashingtonUSA
- Department of Biological StructureSchool of Medicine, University of WashingtonSeattleWashingtonUSA
| | - Inês Lago‐Baldaia
- Department of Cell and Developmental BiologyUniversity College LondonLondonUK
| | | | - Aakanksha Singhvi
- Division of Basic SciencesFred Hutchinson Cancer CenterSeattleWashingtonUSA
- Department of Biological StructureSchool of Medicine, University of WashingtonSeattleWashingtonUSA
| |
Collapse
|
2
|
Naqvi S, Kim S, Tabatabaee S, Pampari A, Kundaje A, Pritchard JK, Wysocka J. Transfer learning reveals sequence determinants of the quantitative response to transcription factor dosage. CELL GENOMICS 2025:100780. [PMID: 40020686 DOI: 10.1016/j.xgen.2025.100780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2024] [Revised: 01/29/2025] [Accepted: 01/30/2025] [Indexed: 03/03/2025]
Abstract
Deep learning models have advanced our ability to predict cell-type-specific chromatin patterns from transcription factor (TF) binding motifs, but their application to perturbed contexts remains limited. We applied transfer learning to predict how concentrations of the dosage-sensitive TFs TWIST1 and SOX9 affect regulatory element (RE) chromatin accessibility in facial progenitor cells, achieving near-experimental accuracy. High-affinity motifs that allow for heterotypic TF co-binding and are concentrated at the center of REs buffer against quantitative changes in TF dosage and predict unperturbed accessibility. Conversely, low-affinity or homotypic binding motifs distributed throughout REs drive sensitive responses with minimal impact on unperturbed accessibility. Both buffering and sensitizing features display purifying selection signatures. We validated these sequence features through reporter assays and demonstrated that TF-nucleosome competition can explain low-affinity motifs' sensitizing effects. This combination of transfer learning and quantitative chromatin response measurements provides a novel approach for uncovering additional layers of the cis-regulatory code.
Collapse
Affiliation(s)
- Sahin Naqvi
- Departments of Chemical and Systems Biology and Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305, USA; Department of Genetics, Stanford University, Stanford, CA 94305, USA; Division of Gastroenterology, Hepatology, and Nutrition, Boston Children's Hospital, Boston, MA 02115, USA; Department of Pediatrics, Harvard Medical School, Boston, MA 02115, USA.
| | - Seungsoo Kim
- Departments of Chemical and Systems Biology and Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305, USA; Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Saman Tabatabaee
- Departments of Chemical and Systems Biology and Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Anusri Pampari
- Department of Computer Science, Stanford University, Stanford, CA 94305, USA
| | - Anshul Kundaje
- Department of Genetics, Stanford University, Stanford, CA 94305, USA; Department of Computer Science, Stanford University, Stanford, CA 94305, USA
| | - Jonathan K Pritchard
- Department of Genetics, Stanford University, Stanford, CA 94305, USA; Department of Biology, Stanford University, Stanford, CA 94305, USA.
| | - Joanna Wysocka
- Departments of Chemical and Systems Biology and Developmental Biology, Stanford University School of Medicine, Stanford, CA 94305, USA; Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305, USA.
| |
Collapse
|
3
|
Lee JY, Huang N, Samuels TJ, Davis I. Imp/IGF2BP and Syp/SYNCRIP temporal RNA interactomes uncover combinatorial networks of regulators of Drosophila brain development. SCIENCE ADVANCES 2025; 11:eadr6682. [PMID: 39919181 PMCID: PMC11804933 DOI: 10.1126/sciadv.adr6682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2024] [Accepted: 01/07/2025] [Indexed: 02/09/2025]
Abstract
Temporal patterning of neural progenitors is an evolutionarily conserved mechanism generating neural diversity. In Drosophila, postembryonic neurogenesis requires the RNA binding proteins (RBPs) Imp/IGF2BP and Syp/SYNCRIP. However, how they coachieve their function is not well understood. Here, we elucidate the in vivo temporal RNA interactome landscapes of Imp and Syp during larval brain development. Imp and Syp bind a highly overlapping set of conserved mRNAs encoding proteins involved in neurodevelopment. We identify transcripts differentially occupied by Imp/Syp over time, featuring a network of known and previously unknown candidate temporal regulators that are post-transcriptionally regulated by Imp/Syp. Furthermore, the physical and coevolutionary relationships between Imp and Syp binding sites reveal a combinatorial, rather than competitive, mode of molecular interplay. Our study establishes an in vivo framework for dissecting the temporal coregulation of RBP networks as well as providing a resource for understanding neural fate specification.
Collapse
Affiliation(s)
- Jeffrey Y. Lee
- School of Molecular Biosciences, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow G12 8QQ, UK
- Department of Biochemistry, University of Oxford, Oxford OX1 3QU, UK
| | - Niles Huang
- Department of Biochemistry, University of Oxford, Oxford OX1 3QU, UK
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA
| | - Tamsin J. Samuels
- Department of Genetics, University of Cambridge, Cambridge CB2 3EH, UK
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge CB2 3EL, UK
| | - Ilan Davis
- School of Molecular Biosciences, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow G12 8QQ, UK
- Department of Biochemistry, University of Oxford, Oxford OX1 3QU, UK
| |
Collapse
|
4
|
Wang J, Ye F, Chai H, Jiang Y, Wang T, Ran X, Xia Q, Xu Z, Fu Y, Zhang G, Wu H, Guo G, Guo H, Ruan Y, Wang Y, Xing D, Xu X, Zhang Z. Advances and applications in single-cell and spatial genomics. SCIENCE CHINA. LIFE SCIENCES 2024:10.1007/s11427-024-2770-x. [PMID: 39792333 DOI: 10.1007/s11427-024-2770-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2024] [Accepted: 10/10/2024] [Indexed: 01/12/2025]
Abstract
The applications of single-cell and spatial technologies in recent times have revolutionized the present understanding of cellular states and the cellular heterogeneity inherent in complex biological systems. These advancements offer unprecedented resolution in the examination of the functional genomics of individual cells and their spatial context within tissues. In this review, we have comprehensively discussed the historical development and recent progress in the field of single-cell and spatial genomics. We have reviewed the breakthroughs in single-cell multi-omics technologies, spatial genomics methods, and the computational strategies employed toward the analyses of single-cell atlas data. Furthermore, we have highlighted the advances made in constructing cellular atlases and their clinical applications, particularly in the context of disease. Finally, we have discussed the emerging trends, challenges, and opportunities in this rapidly evolving field.
Collapse
Affiliation(s)
- Jingjing Wang
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Fang Ye
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Haoxi Chai
- Life Sciences Institute and The Second Affiliated Hospital, Zhejiang University, Hangzhou, 310058, China
| | - Yujia Jiang
- BGI Research, Shenzhen, 518083, China
- BGI Research, Hangzhou, 310030, China
| | - Teng Wang
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, China
| | - Xia Ran
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Institute of Hematology, Zhejiang University, Hangzhou, 310000, China
| | - Qimin Xia
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China
| | - Ziye Xu
- Department of Laboratory Medicine of The First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Yuting Fu
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Guodong Zhang
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Hanyu Wu
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
| | - Guoji Guo
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
- Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China.
- Zhejiang Provincial Key Lab for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou, 310058, China.
- Institute of Hematology, Zhejiang University, Hangzhou, 310000, China.
| | - Hongshan Guo
- Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
- Institute of Hematology, Zhejiang University, Hangzhou, 310000, China.
| | - Yijun Ruan
- Life Sciences Institute and The Second Affiliated Hospital, Zhejiang University, Hangzhou, 310058, China.
| | - Yongcheng Wang
- Department of Laboratory Medicine of The First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
| | - Dong Xing
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China.
- Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, 100871, China.
| | - Xun Xu
- BGI Research, Shenzhen, 518083, China.
- BGI Research, Hangzhou, 310030, China.
- Guangdong Provincial Key Laboratory of Genome Read and Write, BGI Research, Shenzhen, 518083, China.
| | - Zemin Zhang
- Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China.
| |
Collapse
|
5
|
Chu SY, Lai YW, Hsu TC, Lu TM, Yu HH. Isoforms of terminal selector Mamo control axon guidance during adult Drosophila memory center construction via Semaphorin-1a. Dev Biol 2024; 515:1-6. [PMID: 38906235 DOI: 10.1016/j.ydbio.2024.06.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Revised: 06/16/2024] [Accepted: 06/18/2024] [Indexed: 06/23/2024]
Abstract
In animals undergoing metamorphosis, the appearance of the nervous system is coincidently transformed by the morphogenesis of neurons. Such morphogenic alterations are exemplified in three types of intrinsic neurons in the Drosophila memory center. In contrast to the well-characterized remodeling of γ neurons, the morphogenesis of α/β and α'/β' neurons has not been adequately explored. Here, we show that mamo, a BTB-zinc finger transcription factor that acts as a terminal selector for α'/β' neurons, controls the formation of the correct axonal pattern of α'/β' neurons. Intriguingly, specific Mamo isoforms are preferentially expressed in α'/β' neurons to regulate the expression of axon guidance molecule Semaphorin-1a. This action directs proper axon guidance in α'/β' neurons, which is also crucial for wiring of α'/β' neurons with downstream neurons. Taken together, our results provide molecular insights into how neurons establish correct axonal patterns in circuitry assembly during adult memory center construction.
Collapse
Affiliation(s)
- Sao-Yu Chu
- Institute of Cellular and Organismic Biology, Academia Sinica, Taipei, Taiwan
| | - Yen-Wei Lai
- Institute of Cellular and Organismic Biology, Academia Sinica, Taipei, Taiwan
| | - Tsai-Chi Hsu
- Institute of Cellular and Organismic Biology, Academia Sinica, Taipei, Taiwan
| | - Tsai-Ming Lu
- Institute of Cellular and Organismic Biology, Academia Sinica, Taipei, Taiwan
| | - Hung-Hsiang Yu
- Institute of Cellular and Organismic Biology, Academia Sinica, Taipei, Taiwan.
| |
Collapse
|
6
|
Johansen NJ, Kempynck N, Zemke NR, Somasundaram S, De Winter S, Hooper M, Dwivedi D, Lohia R, Wehbe F, Li B, Abaffyová D, Armand EJ, De Man J, Eksi EC, Hecker N, Hulselmans G, Konstantakos V, Mauduit D, Mich JK, Partel G, Daigle TL, Levi BP, Zhang K, Tanaka Y, Gillis J, Ting JT, Ben-Simon Y, Miller J, Ecker JR, Ren B, Aerts S, Lein ES, Tasic B, Bakken TE. Evaluating Methods for the Prediction of Cell Type-Specific Enhancers in the Mammalian Cortex. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.21.609075. [PMID: 39229027 PMCID: PMC11370467 DOI: 10.1101/2024.08.21.609075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 09/05/2024]
Abstract
Identifying cell type-specific enhancers in the brain is critical to building genetic tools for investigating the mammalian brain. Computational methods for functional enhancer prediction have been proposed and validated in the fruit fly and not yet the mammalian brain. We organized the 'Brain Initiative Cell Census Network (BICCN) Challenge: Predicting Functional Cell Type-Specific Enhancers from Cross-Species Multi-Omics' to assess machine learning and feature-based methods designed to nominate enhancer DNA sequences to target cell types in the mouse cortex. Methods were evaluated based on in vivo validation data from hundreds of cortical cell type-specific enhancers that were previously packaged into individual AAV vectors and retro-orbitally injected into mice. We find that open chromatin was a key predictor of functional enhancers, and sequence models improved prediction of non-functional enhancers that can be deprioritized as opposed to pursued for in vivo testing. Sequence models also identified cell type-specific transcription factor codes that can guide designs of in silico enhancers. This community challenge establishes a benchmark for enhancer prioritization algorithms and reveals computational approaches and molecular information that are crucial for the identification of functional enhancers for mammalian cortical cell types. The results of this challenge bring us closer to understanding the complex gene regulatory landscape of the mammalian brain and help us design more efficient genetic tools and potential gene therapies for human neurological diseases.
Collapse
Affiliation(s)
- Nelson J Johansen
- Allen Institute for Brain Science, Seattle, WA 98109
- These authors contributed equally
| | - Niklas Kempynck
- VIB Center for AI & Computational Biology, VIB-KU Leuven Center for Brain and Disease Research & KU Leuven Department of Human Genetics, Leuven, Belgium
- These authors contributed equally
| | - Nathan R Zemke
- Center for Epigenomics, Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA 92093
| | | | - Seppe De Winter
- VIB Center for AI & Computational Biology, VIB-KU Leuven Center for Brain and Disease Research & KU Leuven Department of Human Genetics, Leuven, Belgium
| | - Marcus Hooper
- Allen Institute for Brain Science, Seattle, WA 98109
| | | | - Ruchi Lohia
- Physiology Department and Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Fabien Wehbe
- Maisonneuve-Rosemont Hospital Research Centre, University of Montreal, Montreal, Quebec, Canada
| | - Bocheng Li
- School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China
| | - Darina Abaffyová
- VIB Center for AI & Computational Biology, VIB-KU Leuven Center for Brain and Disease Research & KU Leuven Department of Human Genetics, Leuven, Belgium
| | - Ethan J Armand
- Bioinformatics and Systems Biology Program, University of California, San Diego, La Jolla, CA 92093
| | - Julie De Man
- VIB Center for AI & Computational Biology, VIB-KU Leuven Center for Brain and Disease Research & KU Leuven Department of Human Genetics, Leuven, Belgium
| | - Eren Can Eksi
- VIB Center for AI & Computational Biology, VIB-KU Leuven Center for Brain and Disease Research & KU Leuven Department of Human Genetics, Leuven, Belgium
| | - Nikolai Hecker
- VIB Center for AI & Computational Biology, VIB-KU Leuven Center for Brain and Disease Research & KU Leuven Department of Human Genetics, Leuven, Belgium
| | - Gert Hulselmans
- VIB Center for AI & Computational Biology, VIB-KU Leuven Center for Brain and Disease Research & KU Leuven Department of Human Genetics, Leuven, Belgium
| | - Vasilis Konstantakos
- VIB Center for AI & Computational Biology, VIB-KU Leuven Center for Brain and Disease Research & KU Leuven Department of Human Genetics, Leuven, Belgium
| | - David Mauduit
- VIB Center for AI & Computational Biology, VIB-KU Leuven Center for Brain and Disease Research & KU Leuven Department of Human Genetics, Leuven, Belgium
| | - John K Mich
- Allen Institute for Brain Science, Seattle, WA 98109
| | - Gabriele Partel
- VIB Center for AI & Computational Biology, VIB-KU Leuven Center for Brain and Disease Research & KU Leuven Department of Human Genetics, Leuven, Belgium
| | | | - Boaz P Levi
- Allen Institute for Brain Science, Seattle, WA 98109
| | - Kai Zhang
- School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China
- Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang, China
| | - Yoshiaki Tanaka
- Maisonneuve-Rosemont Hospital Research Centre, University of Montreal, Montreal, Quebec, Canada
| | - Jesse Gillis
- Physiology Department and Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, Ontario, Canada
| | - Jonathan T Ting
- Allen Institute for Brain Science, Seattle, WA 98109
- Department of Physiology and Biophysics, University of Washington, Seattle, WA 98195
| | | | - Jeremy Miller
- Allen Institute for Brain Science, Seattle, WA 98109
| | - Joseph R Ecker
- Salk Institute for Biological Studies, La Jolla, CA 92037
| | - Bing Ren
- Center for Epigenomics, Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA 92093
| | - Stein Aerts
- VIB Center for AI & Computational Biology, VIB-KU Leuven Center for Brain and Disease Research & KU Leuven Department of Human Genetics, Leuven, Belgium
| | - Ed S Lein
- Allen Institute for Brain Science, Seattle, WA 98109
| | | | - Trygve E Bakken
- Allen Institute for Brain Science, Seattle, WA 98109
- Lead contact
| |
Collapse
|
7
|
Wang H, Bollepogu Raja KK, Yeung K, Morrison CA, Terrizzano A, Khodadadi-Jamayran A, Chen P, Jordan A, Fritsch C, Sprecher SG, Mardon G, Treisman JE. Synergistic activation by Glass and Pointed promotes neuronal identity in the Drosophila eye disc. Nat Commun 2024; 15:7091. [PMID: 39154080 PMCID: PMC11330500 DOI: 10.1038/s41467-024-51429-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2024] [Accepted: 08/06/2024] [Indexed: 08/19/2024] Open
Abstract
The integration of extrinsic signaling with cell-intrinsic transcription factors can direct progenitor cells to differentiate into distinct cell fates. In the developing Drosophila eye, differentiation of photoreceptors R1-R7 requires EGFR signaling mediated by the transcription factor Pointed, and our single-cell RNA-Seq analysis shows that the same photoreceptors require the eye-specific transcription factor Glass. We find that ectopic expression of Glass and activation of EGFR signaling synergistically induce neuronal gene expression in the wing disc in a Pointed-dependent manner. Targeted DamID reveals that Glass and Pointed share many binding sites in the genome of developing photoreceptors. Comparison with transcriptomic data shows that Pointed and Glass induce photoreceptor differentiation through intermediate transcription factors, including the redundant homologs Scratch and Scrape, as well as directly activating neuronal effector genes. Our data reveal synergistic activation of a multi-layered transcriptional network as the mechanism by which EGFR signaling induces neuronal identity in Glass-expressing cells.
Collapse
Affiliation(s)
- Hongsu Wang
- Department of Cell Biology, NYU Grossman School of Medicine, New York, NY, USA
| | | | - Kelvin Yeung
- Department of Pathology and Immunology, Baylor College of Medicine, Houston, TX, USA
| | - Carolyn A Morrison
- Department of Cell Biology, NYU Grossman School of Medicine, New York, NY, USA
- 10x Genomics, Pleasanton, CA, 94588, USA
| | - Antonia Terrizzano
- Department of Cell Biology, NYU Grossman School of Medicine, New York, NY, USA
- Biology of Centrosomes and Genetic Instability Team, Curie Institute, PSL Research University, CNRS, UMR144, 12 rue Lhomond, Paris, 75005, France
| | | | - Phoenix Chen
- Department of Cell Biology, NYU Grossman School of Medicine, New York, NY, USA
- Department of Biology, Boston University, Boston, MA, USA
| | - Ashley Jordan
- Department of Cell Biology, NYU Grossman School of Medicine, New York, NY, USA
| | - Cornelia Fritsch
- Department of Biology, Université de Fribourg, Fribourg, Switzerland
| | - Simon G Sprecher
- Department of Biology, Université de Fribourg, Fribourg, Switzerland
| | - Graeme Mardon
- Department of Pathology and Immunology, Baylor College of Medicine, Houston, TX, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Jessica E Treisman
- Department of Cell Biology, NYU Grossman School of Medicine, New York, NY, USA.
| |
Collapse
|
8
|
Li Y, Ma A, Wang Y, Guo Q, Wang C, Fu H, Liu B, Ma Q. Enhancer-driven gene regulatory networks inference from single-cell RNA-seq and ATAC-seq data. Brief Bioinform 2024; 25:bbae369. [PMID: 39082647 PMCID: PMC11289686 DOI: 10.1093/bib/bbae369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Revised: 06/19/2024] [Accepted: 07/15/2024] [Indexed: 08/03/2024] Open
Abstract
Deciphering the intricate relationships between transcription factors (TFs), enhancers, and genes through the inference of enhancer-driven gene regulatory networks (eGRNs) is crucial in understanding gene regulatory programs in a complex biological system. This study introduces STREAM, a novel method that leverages a Steiner forest problem model, a hybrid biclustering pipeline, and submodular optimization to infer eGRNs from jointly profiled single-cell transcriptome and chromatin accessibility data. Compared to existing methods, STREAM demonstrates enhanced performance in terms of TF recovery, TF-enhancer linkage prediction, and enhancer-gene relation discovery. Application of STREAM to an Alzheimer's disease dataset and a diffuse small lymphocytic lymphoma dataset reveals its ability to identify TF-enhancer-gene relations associated with pseudotime, as well as key TF-enhancer-gene relations and TF cooperation underlying tumor cells.
Collapse
Affiliation(s)
- Yang Li
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, United States
| | - Anjun Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, United States
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210, United States
| | - Yizhong Wang
- School of Mathematics, Shandong University, Jinan, Shandong 250100, China
| | - Qi Guo
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, United States
| | - Cankun Wang
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, United States
| | - Hongjun Fu
- Department of Neuroscience, College of Medicine, The Ohio State University, Columbus, OH 43210, United States
| | - Bingqiang Liu
- School of Mathematics, Shandong University, Jinan, Shandong 250100, China
| | - Qin Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH 43210, United States
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210, United States
| |
Collapse
|
9
|
Mulet-Lazaro R, van Herk S, Nuetzel M, Sijs-Szabo A, Díaz N, Kelly K, Erpelinck-Verschueren C, Schwarzfischer-Pfeilschifter L, Stanewsky H, Ackermann U, Glatz D, Raithel J, Fischer A, Pohl S, Rijneveld A, Vaquerizas JM, Thiede C, Plass C, Wouters BJ, Delwel R, Rehli M, Gebhard C. Epigenetic alterations affecting hematopoietic regulatory networks as drivers of mixed myeloid/lymphoid leukemia. Nat Commun 2024; 15:5693. [PMID: 38972954 PMCID: PMC11228033 DOI: 10.1038/s41467-024-49811-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 06/19/2024] [Indexed: 07/09/2024] Open
Abstract
Leukemias with ambiguous lineage comprise several loosely defined entities, often without a clear mechanistic basis. Here, we extensively profile the epigenome and transcriptome of a subgroup of such leukemias with CpG Island Methylator Phenotype. These leukemias exhibit comparable hybrid myeloid/lymphoid epigenetic landscapes, yet heterogeneous genetic alterations, suggesting they are defined by their shared epigenetic profile rather than common genetic lesions. Gene expression enrichment reveals similarity with early T-cell precursor acute lymphoblastic leukemia and a lymphoid progenitor cell of origin. In line with this, integration of differential DNA methylation and gene expression shows widespread silencing of myeloid transcription factors. Moreover, binding sites for hematopoietic transcription factors, including CEBPA, SPI1 and LEF1, are uniquely inaccessible in these leukemias. Hypermethylation also results in loss of CTCF binding, accompanied by changes in chromatin interactions involving key transcription factors. In conclusion, epigenetic dysregulation, and not genetic lesions, explains the mixed phenotype of this group of leukemias with ambiguous lineage. The data collected here constitute a useful and comprehensive epigenomic reference for subsequent studies of acute myeloid leukemias, T-cell acute lymphoblastic leukemias and mixed-phenotype leukemias.
Collapse
Affiliation(s)
- Roger Mulet-Lazaro
- Department of Hematology, Erasmus MC Cancer Institute, Rotterdam, the Netherlands
- Oncode Institute, Utrecht, the Netherlands
| | - Stanley van Herk
- Department of Hematology, Erasmus MC Cancer Institute, Rotterdam, the Netherlands
- Oncode Institute, Utrecht, the Netherlands
| | - Margit Nuetzel
- Department of Internal Medicine III, University Hospital Regensburg, Regensburg, Germany
| | - Aniko Sijs-Szabo
- Department of Hematology, Erasmus MC Cancer Institute, Rotterdam, the Netherlands
| | - Noelia Díaz
- Max Planck Institute for Molecular Biomedicine, Muenster, Germany
- Renewable Marine Resources Department, Institute of Marine Sciences (ICM-CSIC), Barcelona, Spain
| | - Katherine Kelly
- Division of Cancer Epigenomics, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Claudia Erpelinck-Verschueren
- Department of Hematology, Erasmus MC Cancer Institute, Rotterdam, the Netherlands
- Oncode Institute, Utrecht, the Netherlands
| | | | - Hanna Stanewsky
- Department of Internal Medicine III, University Hospital Regensburg, Regensburg, Germany
| | - Ute Ackermann
- Department of Internal Medicine III, University Hospital Regensburg, Regensburg, Germany
| | - Dagmar Glatz
- Department of Internal Medicine III, University Hospital Regensburg, Regensburg, Germany
| | - Johanna Raithel
- Department of Internal Medicine III, University Hospital Regensburg, Regensburg, Germany
| | - Alexander Fischer
- Department of Internal Medicine III, University Hospital Regensburg, Regensburg, Germany
| | - Sandra Pohl
- Department of Internal Medicine III, University Hospital Regensburg, Regensburg, Germany
- Department of Conservative Dentistry and Periodontology, University Hospital Regensburg, Regensburg, Germany
| | - Anita Rijneveld
- Department of Hematology, Erasmus MC Cancer Institute, Rotterdam, the Netherlands
| | - Juan M Vaquerizas
- Max Planck Institute for Molecular Biomedicine, Muenster, Germany
- MRC London Institute of Medical Sciences, London, United Kingdom
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital 8 Campus, London, United Kingdom
| | - Christian Thiede
- Medizinische Klinik und Poliklinik I, Universitätsklinikum Carl Gustav Carus, Dresden, Germany
| | - Christoph Plass
- Division of Cancer Epigenomics, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Bas J Wouters
- Department of Hematology, Erasmus MC Cancer Institute, Rotterdam, the Netherlands.
- Oncode Institute, Utrecht, the Netherlands.
| | - Ruud Delwel
- Department of Hematology, Erasmus MC Cancer Institute, Rotterdam, the Netherlands.
- Oncode Institute, Utrecht, the Netherlands.
| | - Michael Rehli
- Department of Internal Medicine III, University Hospital Regensburg, Regensburg, Germany.
- Leibniz Institute for Immunotherapy (LIT), Regensburg, Germany.
| | - Claudia Gebhard
- Department of Internal Medicine III, University Hospital Regensburg, Regensburg, Germany.
- Leibniz Institute for Immunotherapy (LIT), Regensburg, Germany.
| |
Collapse
|
10
|
Hwang H, Jeon H, Yeo N, Baek D. Big data and deep learning for RNA biology. Exp Mol Med 2024; 56:1293-1321. [PMID: 38871816 PMCID: PMC11263376 DOI: 10.1038/s12276-024-01243-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 02/27/2024] [Accepted: 03/05/2024] [Indexed: 06/15/2024] Open
Abstract
The exponential growth of big data in RNA biology (RB) has led to the development of deep learning (DL) models that have driven crucial discoveries. As constantly evidenced by DL studies in other fields, the successful implementation of DL in RB depends heavily on the effective utilization of large-scale datasets from public databases. In achieving this goal, data encoding methods, learning algorithms, and techniques that align well with biological domain knowledge have played pivotal roles. In this review, we provide guiding principles for applying these DL concepts to various problems in RB by demonstrating successful examples and associated methodologies. We also discuss the remaining challenges in developing DL models for RB and suggest strategies to overcome these challenges. Overall, this review aims to illuminate the compelling potential of DL for RB and ways to apply this powerful technology to investigate the intriguing biology of RNA more effectively.
Collapse
Affiliation(s)
- Hyeonseo Hwang
- School of Biological Sciences, Seoul National University, Seoul, Republic of Korea
| | - Hyeonseong Jeon
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
- Genome4me Inc., Seoul, Republic of Korea
| | - Nagyeong Yeo
- School of Biological Sciences, Seoul National University, Seoul, Republic of Korea
| | - Daehyun Baek
- School of Biological Sciences, Seoul National University, Seoul, Republic of Korea.
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea.
- Genome4me Inc., Seoul, Republic of Korea.
| |
Collapse
|
11
|
Benchmarking of single-cell ATAC sequencing tools. Nat Biotechnol 2024; 42:856-857. [PMID: 37537503 DOI: 10.1038/s41587-023-01897-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/05/2023]
|
12
|
Liu W, Li Q. Single-cell transcriptomics dissecting the development and evolution of nervous system in insects. CURRENT OPINION IN INSECT SCIENCE 2024; 63:101201. [PMID: 38608931 DOI: 10.1016/j.cois.2024.101201] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 04/07/2024] [Accepted: 04/08/2024] [Indexed: 04/14/2024]
Abstract
Insects can display a vast repertoire of complex and adaptive behaviors crucial for survival and reproduction. Yet, how the neural circuits underlying insect behaviors are assembled throughout development and remodeled during evolution remains largely obscure. The advent of single-cell transcriptomics has opened new paths to illuminate these historically intractable questions. Insect behavior is governed by its brain, whose functional complexity is realized through operations across multiple levels, from the molecular and cellular to the circuit and organ. Single-cell transcriptomics enables dissecting brain functions across all these levels and allows tracking regulatory dynamics throughout development and under perturbation. In this review, we mainly focus on the achievements of single-cell transcriptomics in dissecting the molecular and cellular architectures of nervous systems in representative insects, then discuss its applications in tracking the developmental trajectory and functional evolution of insect brains.
Collapse
Affiliation(s)
- Weiwei Liu
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Yunnan Key Laboratory of Biodiversity Information, Kunming, China.
| | - Qiye Li
- BGI Research, Shenzhen 518083, China; BGI Research, Wuhan 430074, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
13
|
Mannens CCA, Hu L, Lönnerberg P, Schipper M, Reagor CC, Li X, He X, Barker RA, Sundström E, Posthuma D, Linnarsson S. Chromatin accessibility during human first-trimester neurodevelopment. Nature 2024:10.1038/s41586-024-07234-1. [PMID: 38693260 DOI: 10.1038/s41586-024-07234-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Accepted: 02/02/2024] [Indexed: 05/03/2024]
Abstract
The human brain develops through a tightly organized cascade of patterning events, induced by transcription factor expression and changes in chromatin accessibility. Although gene expression across the developing brain has been described at single-cell resolution1, similar atlases of chromatin accessibility have been primarily focused on the forebrain2-4. Here we describe chromatin accessibility and paired gene expression across the entire developing human brain during the first trimester (6-13 weeks after conception). We defined 135 clusters and used multiomic measurements to link candidate cis-regulatory elements to gene expression. The number of accessible regions increased both with age and along neuronal differentiation. Using a convolutional neural network, we identified putative functional transcription factor-binding sites in enhancers characterizing neuronal subtypes. We applied this model to cis-regulatory elements linked to ESRRB to elucidate its activation mechanism in the Purkinje cell lineage. Finally, by linking disease-associated single nucleotide polymorphisms to cis-regulatory elements, we validated putative pathogenic mechanisms in several diseases and identified midbrain-derived GABAergic neurons as being the most vulnerable to major depressive disorder-related mutations. Our findings provide a more detailed view of key gene regulatory mechanisms underlying the emergence of brain cell types during the first trimester and a comprehensive reference for future studies related to human neurodevelopment.
Collapse
Affiliation(s)
- Camiel C A Mannens
- Division of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institute, Solna, Sweden
| | - Lijuan Hu
- Division of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institute, Solna, Sweden
| | - Peter Lönnerberg
- Division of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institute, Solna, Sweden
| | - Marijn Schipper
- Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive Research (CNCR), Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Caleb C Reagor
- Howard Hughes Medical Institute and Laboratory of Sensory Neuroscience, The Rockefeller University, New York, NY, USA
| | - Xiaofei Li
- Division of Neurodegeneration, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Solna, Sweden
| | - Xiaoling He
- John van Geest Centre for Brain Repair, Department of Clinical Neurosciences, Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK
| | - Roger A Barker
- John van Geest Centre for Brain Repair, Department of Clinical Neurosciences, Wellcome-MRC Cambridge Stem Cell Institute, University of Cambridge, Cambridge, UK
| | - Erik Sundström
- Division of Neurodegeneration, Department of Neurobiology, Care Sciences and Society, Karolinska Institutet, Solna, Sweden
| | - Danielle Posthuma
- Department of Complex Trait Genetics, Center for Neurogenomics and Cognitive Research (CNCR), Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Sten Linnarsson
- Division of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institute, Solna, Sweden.
| |
Collapse
|
14
|
Yuan Y, Chen Q, Brovkina M, Clowney EJ, Yadlapalli S. Clock-dependent chromatin accessibility rhythms regulate circadian transcription. PLoS Genet 2024; 20:e1011278. [PMID: 38805552 PMCID: PMC11161047 DOI: 10.1371/journal.pgen.1011278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Revised: 06/07/2024] [Accepted: 04/29/2024] [Indexed: 05/30/2024] Open
Abstract
Chromatin organization plays a crucial role in gene regulation by controlling the accessibility of DNA to transcription machinery. While significant progress has been made in understanding the regulatory role of clock proteins in circadian rhythms, how chromatin organization affects circadian rhythms remains poorly understood. Here, we employed ATAC-seq (Assay for Transposase-Accessible Chromatin with Sequencing) on FAC-sorted Drosophila clock neurons to assess genome-wide chromatin accessibility at dawn and dusk over the circadian cycle. We observed significant oscillations in chromatin accessibility at promoter and enhancer regions of hundreds of genes, with enhanced accessibility either at dusk or dawn, which correlated with their peak transcriptional activity. Notably, genes with enhanced accessibility at dusk were enriched with E-box motifs, while those more accessible at dawn were enriched with VRI/PDP1-box motifs, indicating that they are regulated by the core circadian feedback loops, PER/CLK and VRI/PDP1, respectively. Further, we observed a complete loss of chromatin accessibility rhythms in per01 null mutants, with chromatin consistently accessible at both dawn and dusk, underscoring the critical role of Period protein in driving chromatin compaction during the repression phase at dawn. Together, this study demonstrates the significant role of chromatin organization in circadian regulation, revealing how the interplay between clock proteins and chromatin structure orchestrates the precise timing of biological processes throughout the day. This work further implies that variations in chromatin accessibility might play a central role in the generation of diverse circadian gene expression patterns in clock neurons.
Collapse
Affiliation(s)
- Ye Yuan
- Cellular and Molecular Biology Program, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Qianqian Chen
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Margarita Brovkina
- Cellular and Molecular Biology Program, University of Michigan, Ann Arbor, Michigan, United States of America
| | - E Josephine Clowney
- Cellular and Molecular Biology Program, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Molecular, Cellular and Developmental Biology, University of Michigan, Ann Arbor, Michigan, United States of America
- Michigan Neuroscience Institute, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Swathi Yadlapalli
- Cellular and Molecular Biology Program, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, Michigan, United States of America
- Michigan Neuroscience Institute, University of Michigan, Ann Arbor, Michigan, United States of America
| |
Collapse
|
15
|
Merrill CB, Titos I, Pabon MA, Montgomery AB, Rodan AR, Rothenfluh A. Iterative assay for transposase-accessible chromatin by sequencing to isolate functionally relevant neuronal subtypes. SCIENCE ADVANCES 2024; 10:eadi4393. [PMID: 38536919 PMCID: PMC10971406 DOI: 10.1126/sciadv.adi4393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 02/21/2024] [Indexed: 04/18/2024]
Abstract
The Drosophila brain contains tens of thousands of distinct cell types. Thousands of different transgenic lines reproducibly target specific neuron subsets, yet most still express in several cell types. Furthermore, most lines were developed without a priori knowledge of where the transgenes would be expressed. To aid in the development of cell type-specific tools for neuronal identification and manipulation, we developed an iterative assay for transposase-accessible chromatin (ATAC) approach. Open chromatin regions (OCRs) enriched in neurons, compared to whole bodies, drove transgene expression preferentially in subsets of neurons. A second round of ATAC-seq from these specific neuron subsets revealed additional enriched OCR2s that further restricted transgene expression within the chosen neuron subset. This approach allows for continued refinement of transgene expression, and we used it to identify neurons relevant for sleep behavior. Furthermore, this approach is widely applicable to other cell types and to other organisms.
Collapse
Affiliation(s)
- Collin B. Merrill
- Department of Psychiatry, Huntsman Mental Health Institute, University of Utah, Salt Lake City, UT 84108, USA
| | - Iris Titos
- Department of Psychiatry, Huntsman Mental Health Institute, University of Utah, Salt Lake City, UT 84108, USA
| | - Miguel A. Pabon
- Molecular Medicine Program, University of Utah, Salt Lake City, UT 84112, USA
| | | | - Aylin R. Rodan
- Molecular Medicine Program, University of Utah, Salt Lake City, UT 84112, USA
- Division of Nephrology and Hypertension, Department of Internal Medicine, University of Utah, Salt Lake City, UT 84132, USA
- Department of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA
- Medical Service, Veterans Affairs Salt Lake City Health Care System, Salt Lake City, UT, USA
| | - Adrian Rothenfluh
- Department of Psychiatry, Huntsman Mental Health Institute, University of Utah, Salt Lake City, UT 84108, USA
- Molecular Medicine Program, University of Utah, Salt Lake City, UT 84112, USA
- Department of Human Genetics, University of Utah, Salt Lake City, UT 84112, USA
- Department of Neurobiology, University of Utah, Salt Lake City, UT 84112, USA
| |
Collapse
|
16
|
Zhang G, Fu Y, Yang L, Ye F, Zhang P, Zhang S, Ma L, Li J, Wu H, Han X, Wang J, Guo G. Construction of single-cell cross-species chromatin accessibility landscapes with combinatorial-hybridization-based ATAC-seq. Dev Cell 2024; 59:793-811.e8. [PMID: 38330939 DOI: 10.1016/j.devcel.2024.01.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 11/03/2023] [Accepted: 01/18/2024] [Indexed: 02/10/2024]
Abstract
Despite recent advances in single-cell genomics, the lack of maps for single-cell candidate cis-regulatory elements (cCREs) in non-mammal species has limited our exploration of conserved regulatory programs across vertebrates and invertebrates. Here, we developed a combinatorial-hybridization-based method for single-cell assay for transposase-accessible chromatin using sequencing (scATAC-seq) named CH-ATAC-seq, enabling the construction of single-cell accessible chromatin landscapes for zebrafish, Drosophila, and earthworms (Eisenia andrei). By integrating scATAC censuses of humans, monkeys, and mice, we systematically identified 152 distinct main cell types and around 0.8 million cell-type-specific cCREs. Our analysis provided insights into the conservation of neural, muscle, and immune lineages across species, while epithelial cells exhibited a higher organ-origin heterogeneity. Additionally, a large-scale gene regulatory network (GRN) was constructed in four vertebrates by integrating scRNA-seq censuses. Overall, our study provides a valuable resource for comparative epigenomics, identifying the evolutionary conservation and divergence of gene regulation across different species.
Collapse
Affiliation(s)
- Guodong Zhang
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China; Liangzhu Laboratory, Zhejiang University, Hangzhou 311121, China
| | - Yuting Fu
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China
| | - Lei Yang
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China
| | - Fang Ye
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China; Liangzhu Laboratory, Zhejiang University, Hangzhou 311121, China
| | - Peijing Zhang
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China
| | - Shuang Zhang
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China
| | - Lifeng Ma
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China
| | - Jiaqi Li
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China
| | - Hanyu Wu
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China
| | - Xiaoping Han
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China; Zhejiang Provincial Key Laboratory for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou 310058, China.
| | - Jingjing Wang
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China; Liangzhu Laboratory, Zhejiang University, Hangzhou 311121, China.
| | - Guoji Guo
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou 310000, China; Liangzhu Laboratory, Zhejiang University, Hangzhou 311121, China; Zhejiang Provincial Key Laboratory for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou 310058, China; Institute of Hematology, Zhejiang University, Hangzhou, China.
| |
Collapse
|
17
|
Shi T, Zhang H, Cui S, Liu J, Gu Z, Wang Z, Yan X, Liu Q. Stochastic neuro-fuzzy system implemented in memristor crossbar arrays. SCIENCE ADVANCES 2024; 10:eadl3135. [PMID: 38517972 PMCID: PMC10959402 DOI: 10.1126/sciadv.adl3135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 02/16/2024] [Indexed: 03/24/2024]
Abstract
Neuro-symbolic artificial intelligence has garnered considerable attention amid increasing industry demands for high-performance neural networks that are interpretable and adaptable to previously unknown problem domains with minimal reconfiguration. However, implementing neuro-symbolic hardware is challenging due to the complexity in symbolic knowledge representation and calculation. We experimentally demonstrated a memristor-based neuro-fuzzy hardware based on TiN/TaOx/HfOx/TiN chips that is superior to its silicon-based counterpart in terms of throughput and energy efficiency by using array topological structure for knowledge representation and physical laws for computing. Intrinsic memristor variability is fully exploited to increase robustness in knowledge representation. A hybrid in situ training strategy is proposed for error minimizing in training. The hardware adapts easier to a previously unknown environment, achieving ~6.6 times faster convergence and ~6 times lower error than deep learning. The hardware energy efficiency is over two orders of magnitude greater than field-programmable gate arrays. This research greatly extends the capability of memristor-based neuromorphic computing systems in artificial intelligence.
Collapse
Affiliation(s)
- Tuo Shi
- State Key Lab of Fabrication Technologies for Integrated Circuits, Institute of Microelectronics, Chinese Academy of Sciences, Beijing 100029, China
- Research Center for Intelligent Computing Hardware, Zhejiang Laboratory, Hangzhou 311122, China
| | - Hui Zhang
- Research Center for Intelligent Computing Hardware, Zhejiang Laboratory, Hangzhou 311122, China
| | - Shiyu Cui
- Research Center for Intelligent Computing Hardware, Zhejiang Laboratory, Hangzhou 311122, China
| | - Jinchang Liu
- Research Center for Intelligent Computing Hardware, Zhejiang Laboratory, Hangzhou 311122, China
| | - Zixi Gu
- Research Center for Intelligent Computing Hardware, Zhejiang Laboratory, Hangzhou 311122, China
| | - Zhanfeng Wang
- Key Laboratory of Brain-like Neuromorphic Devices and Systems of Hebei Province, Hebei University, Baoding 071002, P. R. China
| | - Xiaobing Yan
- Key Laboratory of Brain-like Neuromorphic Devices and Systems of Hebei Province, Hebei University, Baoding 071002, P. R. China
| | - Qi Liu
- Frontier Institute of Chip and System, Fudan University, Shanghai 200433, China
| |
Collapse
|
18
|
Goshisht MK. Machine Learning and Deep Learning in Synthetic Biology: Key Architectures, Applications, and Challenges. ACS OMEGA 2024; 9:9921-9945. [PMID: 38463314 PMCID: PMC10918679 DOI: 10.1021/acsomega.3c05913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 01/19/2024] [Accepted: 01/30/2024] [Indexed: 03/12/2024]
Abstract
Machine learning (ML), particularly deep learning (DL), has made rapid and substantial progress in synthetic biology in recent years. Biotechnological applications of biosystems, including pathways, enzymes, and whole cells, are being probed frequently with time. The intricacy and interconnectedness of biosystems make it challenging to design them with the desired properties. ML and DL have a synergy with synthetic biology. Synthetic biology can be employed to produce large data sets for training models (for instance, by utilizing DNA synthesis), and ML/DL models can be employed to inform design (for example, by generating new parts or advising unrivaled experiments to perform). This potential has recently been brought to light by research at the intersection of engineering biology and ML/DL through achievements like the design of novel biological components, best experimental design, automated analysis of microscopy data, protein structure prediction, and biomolecular implementations of ANNs (Artificial Neural Networks). I have divided this review into three sections. In the first section, I describe predictive potential and basics of ML along with myriad applications in synthetic biology, especially in engineering cells, activity of proteins, and metabolic pathways. In the second section, I describe fundamental DL architectures and their applications in synthetic biology. Finally, I describe different challenges causing hurdles in the progress of ML/DL and synthetic biology along with their solutions.
Collapse
Affiliation(s)
- Manoj Kumar Goshisht
- Department of Chemistry, Natural and
Applied Sciences, University of Wisconsin—Green
Bay, Green
Bay, Wisconsin 54311-7001, United States
| |
Collapse
|
19
|
Michielsen L, Reinders MJT, Mahfouz A. Predicting cell population-specific gene expression from genomic sequence. FRONTIERS IN BIOINFORMATICS 2024; 4:1347276. [PMID: 38501113 PMCID: PMC10944912 DOI: 10.3389/fbinf.2024.1347276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Accepted: 01/23/2024] [Indexed: 03/20/2024] Open
Abstract
Most regulatory elements, especially enhancer sequences, are cell population-specific. One could even argue that a distinct set of regulatory elements is what defines a cell population. However, discovering which non-coding regions of the DNA are essential in which context, and as a result, which genes are expressed, is a difficult task. Some computational models tackle this problem by predicting gene expression directly from the genomic sequence. These models are currently limited to predicting bulk measurements and mainly make tissue-specific predictions. Here, we present a model that leverages single-cell RNA-sequencing data to predict gene expression. We show that cell population-specific models outperform tissue-specific models, especially when the expression profile of a cell population and the corresponding tissue are dissimilar. Further, we show that our model can prioritize GWAS variants and learn motifs of transcription factor binding sites. We envision that our model can be useful for delineating cell population-specific regulatory elements.
Collapse
Affiliation(s)
- Lieke Michielsen
- Department of Human Genetics, Leiden University Medical Center, Leiden, Netherlands
- Leiden Computational Biology Center, Leiden University Medical Center, Leiden, Netherlands
- Delft Bioinformatics Lab, Delft University of Technology, Delft, Netherlands
| | - Marcel J. T. Reinders
- Department of Human Genetics, Leiden University Medical Center, Leiden, Netherlands
- Leiden Computational Biology Center, Leiden University Medical Center, Leiden, Netherlands
- Delft Bioinformatics Lab, Delft University of Technology, Delft, Netherlands
| | - Ahmed Mahfouz
- Department of Human Genetics, Leiden University Medical Center, Leiden, Netherlands
- Leiden Computational Biology Center, Leiden University Medical Center, Leiden, Netherlands
- Delft Bioinformatics Lab, Delft University of Technology, Delft, Netherlands
| |
Collapse
|
20
|
Lim B, Domsch K, Mall M, Lohmann I. Canalizing cell fate by transcriptional repression. Mol Syst Biol 2024; 20:144-161. [PMID: 38302581 PMCID: PMC10912439 DOI: 10.1038/s44320-024-00014-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Revised: 11/28/2023] [Accepted: 12/15/2023] [Indexed: 02/03/2024] Open
Abstract
Precision in the establishment and maintenance of cellular identities is crucial for the development of multicellular organisms and requires tight regulation of gene expression. While extensive research has focused on understanding cell type-specific gene activation, the complex mechanisms underlying the transcriptional repression of alternative fates are not fully understood. Here, we provide an overview of the repressive mechanisms involved in cell fate regulation. We discuss the molecular machinery responsible for suppressing alternative fates and highlight the crucial role of sequence-specific transcription factors (TFs) in this process. Depletion of these TFs can result in unwanted gene expression and increased cellular plasticity. We suggest that these TFs recruit cell type-specific repressive complexes to their cis-regulatory elements, enabling them to modulate chromatin accessibility in a context-dependent manner. This modulation effectively suppresses master regulators of alternative fate programs and their downstream targets. The modularity and dynamic behavior of these repressive complexes enables a limited number of repressors to canalize and maintain major and minor cell fate decisions at different stages of development.
Collapse
Affiliation(s)
- Bryce Lim
- Cell Fate Engineering and Disease Modeling Group, German Cancer Research Center (DKFZ) and DKFZ-ZMBH Alliance, 69120, Heidelberg, Germany
- HITBR Hector Institute for Translational Brain Research gGmbH, 69120, Heidelberg, Germany
- Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, 68159, Mannheim, Germany
| | - Katrin Domsch
- Heidelberg University, Centre for Organismal Studies (COS) Heidelberg, Department of Developmental Biology and Cell Networks - Cluster of Excellence, Heidelberg, Germany
| | - Moritz Mall
- Cell Fate Engineering and Disease Modeling Group, German Cancer Research Center (DKFZ) and DKFZ-ZMBH Alliance, 69120, Heidelberg, Germany.
- HITBR Hector Institute for Translational Brain Research gGmbH, 69120, Heidelberg, Germany.
- Central Institute of Mental Health, Medical Faculty Mannheim, Heidelberg University, 68159, Mannheim, Germany.
| | - Ingrid Lohmann
- Heidelberg University, Centre for Organismal Studies (COS) Heidelberg, Department of Developmental Biology and Cell Networks - Cluster of Excellence, Heidelberg, Germany.
| |
Collapse
|
21
|
Taskiran II, Spanier KI, Dickmänken H, Kempynck N, Pančíková A, Ekşi EC, Hulselmans G, Ismail JN, Theunis K, Vandepoel R, Christiaens V, Mauduit D, Aerts S. Cell-type-directed design of synthetic enhancers. Nature 2024; 626:212-220. [PMID: 38086419 PMCID: PMC10830415 DOI: 10.1038/s41586-023-06936-2] [Citation(s) in RCA: 38] [Impact Index Per Article: 38.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 12/05/2023] [Indexed: 01/19/2024]
Abstract
Transcriptional enhancers act as docking stations for combinations of transcription factors and thereby regulate spatiotemporal activation of their target genes1. It has been a long-standing goal in the field to decode the regulatory logic of an enhancer and to understand the details of how spatiotemporal gene expression is encoded in an enhancer sequence. Here we show that deep learning models2-6, can be used to efficiently design synthetic, cell-type-specific enhancers, starting from random sequences, and that this optimization process allows detailed tracing of enhancer features at single-nucleotide resolution. We evaluate the function of fully synthetic enhancers to specifically target Kenyon cells or glial cells in the fruit fly brain using transgenic animals. We further exploit enhancer design to create 'dual-code' enhancers that target two cell types and minimal enhancers smaller than 50 base pairs that are fully functional. By examining the state space searches towards local optima, we characterize enhancer codes through the strength, combination and arrangement of transcription factor activator and transcription factor repressor motifs. Finally, we apply the same strategies to successfully design human enhancers, which adhere to enhancer rules similar to those of Drosophila enhancers. Enhancer design guided by deep learning leads to better understanding of how enhancers work and shows that their code can be exploited to manipulate cell states.
Collapse
Affiliation(s)
- Ibrahim I Taskiran
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology (VIB.AI), Leuven, Belgium
- VIB-KULeuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Katina I Spanier
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology (VIB.AI), Leuven, Belgium
- VIB-KULeuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Hannah Dickmänken
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology (VIB.AI), Leuven, Belgium
- VIB-KULeuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Niklas Kempynck
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology (VIB.AI), Leuven, Belgium
- VIB-KULeuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Alexandra Pančíková
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology (VIB.AI), Leuven, Belgium
- VIB-KULeuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
- VIB-KULeuven Center for Cancer Biology, Leuven, Belgium
| | - Eren Can Ekşi
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology (VIB.AI), Leuven, Belgium
- VIB-KULeuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Gert Hulselmans
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology (VIB.AI), Leuven, Belgium
- VIB-KULeuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Joy N Ismail
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology (VIB.AI), Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
- UK Dementia Research Institute at Imperial College London, London, UK
| | - Koen Theunis
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology (VIB.AI), Leuven, Belgium
- VIB-KULeuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Roel Vandepoel
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology (VIB.AI), Leuven, Belgium
- VIB-KULeuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Valerie Christiaens
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology (VIB.AI), Leuven, Belgium
- VIB-KULeuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - David Mauduit
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology (VIB.AI), Leuven, Belgium
- VIB-KULeuven Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Stein Aerts
- Laboratory of Computational Biology, VIB Center for AI & Computational Biology (VIB.AI), Leuven, Belgium.
- VIB-KULeuven Center for Brain & Disease Research, Leuven, Belgium.
- Department of Human Genetics, KU Leuven, Leuven, Belgium.
| |
Collapse
|
22
|
Ye F, Wang J, Li J, Mei Y, Guo G. Mapping Cell Atlases at the Single-Cell Level. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2305449. [PMID: 38145338 PMCID: PMC10885669 DOI: 10.1002/advs.202305449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Revised: 12/01/2023] [Indexed: 12/26/2023]
Abstract
Recent advancements in single-cell technologies have led to rapid developments in the construction of cell atlases. These atlases have the potential to provide detailed information about every cell type in different organisms, enabling the characterization of cellular diversity at the single-cell level. Global efforts in developing comprehensive cell atlases have profound implications for both basic research and clinical applications. This review provides a broad overview of the cellular diversity and dynamics across various biological systems. In addition, the incorporation of machine learning techniques into cell atlas analyses opens up exciting prospects for the field of integrative biology.
Collapse
Affiliation(s)
- Fang Ye
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative MedicineZhejiang University School of MedicineHangzhouZhejiang310000China
- Liangzhu LaboratoryZhejiang UniversityHangzhouZhejiang311121China
| | - Jingjing Wang
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative MedicineZhejiang University School of MedicineHangzhouZhejiang310000China
- Liangzhu LaboratoryZhejiang UniversityHangzhouZhejiang311121China
| | - Jiaqi Li
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative MedicineZhejiang University School of MedicineHangzhouZhejiang310000China
| | - Yuqing Mei
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative MedicineZhejiang University School of MedicineHangzhouZhejiang310000China
| | - Guoji Guo
- Bone Marrow Transplantation Center of the First Affiliated Hospital, and Center for Stem Cell and Regenerative MedicineZhejiang University School of MedicineHangzhouZhejiang310000China
- Liangzhu LaboratoryZhejiang UniversityHangzhouZhejiang311121China
- Zhejiang Provincial Key Lab for Tissue Engineering and Regenerative MedicineDr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative MedicineHangzhouZhejiang310058China
- Institute of HematologyZhejiang UniversityHangzhouZhejiang310000China
| |
Collapse
|
23
|
de Almeida BP, Schaub C, Pagani M, Secchia S, Furlong EEM, Stark A. Targeted design of synthetic enhancers for selected tissues in the Drosophila embryo. Nature 2024; 626:207-211. [PMID: 38086418 PMCID: PMC10830412 DOI: 10.1038/s41586-023-06905-9] [Citation(s) in RCA: 23] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 11/28/2023] [Indexed: 01/19/2024]
Abstract
Enhancers control gene expression and have crucial roles in development and homeostasis1-3. However, the targeted de novo design of enhancers with tissue-specific activities has remained challenging. Here we combine deep learning and transfer learning to design tissue-specific enhancers for five tissues in the Drosophila melanogaster embryo: the central nervous system, epidermis, gut, muscle and brain. We first train convolutional neural networks using genome-wide single-cell assay for transposase-accessible chromatin with sequencing (ATAC-seq) datasets and then fine-tune the convolutional neural networks with smaller-scale data from in vivo enhancer activity assays, yielding models with 13% to 76% positive predictive value according to cross-validation. We designed and experimentally assessed 40 synthetic enhancers (8 per tissue) in vivo, of which 31 (78%) were active and 27 (68%) functioned in the target tissue (100% for central nervous system and muscle). The strategy of combining genome-wide and small-scale functional datasets by transfer learning is generally applicable and should enable the design of tissue-, cell type- and cell state-specific enhancers in any system.
Collapse
Affiliation(s)
- Bernardo P de Almeida
- Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Vienna, Austria
- Vienna BioCenter PhD Program, Doctoral School of the University of Vienna and Medical University of Vienna, Vienna, Austria
- InstaDeep, Paris, France
| | - Christoph Schaub
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Michaela Pagani
- Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Vienna, Austria
| | - Stefano Secchia
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Eileen E M Furlong
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, Heidelberg, Germany
| | - Alexander Stark
- Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Vienna, Austria.
- Medical University of Vienna, Vienna BioCenter (VBC), Vienna, Austria.
| |
Collapse
|
24
|
Wang M, Liu Y, Sun R, Liu F, Li J, Yan L, Zhang J, Xie X, Li D, Wang Y, Li S, Zhu X, Li R, Lu F, Xiao Z, Wang H. Single-nucleus multi-omic profiling of human placental syncytiotrophoblasts identifies cellular trajectories during pregnancy. Nat Genet 2024; 56:294-305. [PMID: 38267607 PMCID: PMC10864176 DOI: 10.1038/s41588-023-01647-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Accepted: 12/11/2023] [Indexed: 01/26/2024]
Abstract
The human placenta has a vital role in ensuring a successful pregnancy. Despite the growing body of knowledge about its cellular compositions and functions, there has been limited research on the heterogeneity of the billions of nuclei within the syncytiotrophoblast (STB), a multinucleated entity primarily responsible for placental function. Here we conducted integrated single-nucleus RNA sequencing and single-nucleus ATAC sequencing analyses of human placentas from early and late pregnancy. Our findings demonstrate the dynamic heterogeneity and developmental trajectories of STB nuclei and their correspondence with human trophoblast stem cell (hTSC)-derived STB. Furthermore, we identified transcription factors associated with diverse STB nuclear lineages through their gene regulatory networks and experimentally confirmed their function in hTSC and trophoblast organoid-derived STBs. Together, our data provide insights into the heterogeneity of human STB and represent a valuable resource for interpreting associated pregnancy complications.
Collapse
Affiliation(s)
- Meijiao Wang
- The Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Beijing Institute for Stem Cell and Regenerative Medicine, Beijing, China
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - Yawei Liu
- The Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Beijing Institute for Stem Cell and Regenerative Medicine, Beijing, China
- Dushu Lake Hospital Affiliated to Soochow University, Suzhou, China
- Medical Center of Soochow University, Suzhou, China
- Suzhou Dushu Lake Hospital, Suzhou, China
| | - Run Sun
- The Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Beijing Institute for Stem Cell and Regenerative Medicine, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Fenting Liu
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, China
- National Clinical Research Center for Obstetrics and Gynecology, Peking University Third Hospital, Beijing, China
- Key Laboratory of Assisted Reproduction (Peking University), Ministry of Education, Beijing, China
- Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproductive Technology, Peking University Third Hospital, Beijing, China
| | - Jiaqian Li
- School of Life Science, Beijing Institute of Technology, Beijing, China
| | - Long Yan
- The Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Beijing Institute for Stem Cell and Regenerative Medicine, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jixiang Zhang
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Xinwei Xie
- School of Life Science, Beijing Institute of Technology, Beijing, China
| | - Dongxu Li
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - Yiming Wang
- The Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Beijing Institute for Stem Cell and Regenerative Medicine, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Shiwen Li
- The Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Beijing Institute for Stem Cell and Regenerative Medicine, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Xili Zhu
- The Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China
- Beijing Institute for Stem Cell and Regenerative Medicine, Beijing, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Rong Li
- Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, China.
- National Clinical Research Center for Obstetrics and Gynecology, Peking University Third Hospital, Beijing, China.
- Key Laboratory of Assisted Reproduction (Peking University), Ministry of Education, Beijing, China.
- Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproductive Technology, Peking University Third Hospital, Beijing, China.
| | - Falong Lu
- State Key Laboratory of Molecular Developmental Biology, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
| | - Zhenyu Xiao
- Beijing Institute for Stem Cell and Regenerative Medicine, Beijing, China.
- School of Life Science, Beijing Institute of Technology, Beijing, China.
| | - Hongmei Wang
- The Key Laboratory of Organ Regeneration and Reconstruction, State Key Laboratory of Stem Cell and Reproductive Biology, Institute of Zoology, Chinese Academy of Sciences, Beijing, China.
- Beijing Institute for Stem Cell and Regenerative Medicine, Beijing, China.
- University of Chinese Academy of Sciences, Beijing, China.
| |
Collapse
|
25
|
Jumde G, Spanjaard B, Junker JP. Inference of differentiation trajectories by transfer learning across biological processes. Cell Syst 2024; 15:75-82.e5. [PMID: 38128536 DOI: 10.1016/j.cels.2023.12.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 07/28/2023] [Accepted: 12/06/2023] [Indexed: 12/23/2023]
Abstract
Stem cells differentiate into distinct fates by transitioning through a series of transcriptional states. Current computational approaches allow reconstruction of differentiation trajectories from single-cell transcriptomics data, but it remains unknown to what degree differentiation can be predicted across biological processes. Here, we use transfer learning to infer differentiation processes and quantify predictability in early embryonic development and adult hematopoiesis. Overall, we find that non-linear methods outperform linear approaches, and we achieved the best predictions with a custom variational autoencoder that explicitly models changes in transcriptional variance. We observed a high accuracy of predictions in embryonic development, but we found somewhat lower agreement with the real data in adult hematopoiesis. We demonstrate that this discrepancy can be explained by a higher degree of concordant transcriptional processes along embryonic differentiation compared with adult homeostasis. In summary, we establish a framework for quantifying and exploiting predictability of cellular differentiation trajectories.
Collapse
Affiliation(s)
- Gaurav Jumde
- Max Delbrück Center for Molecular Medicine, Berlin Institute for Medical Systems Biology, 10115 Berlin, Germany; Humboldt Universität zu Berlin, Faculty of Life Sciences, Department of Biology, 10115 Berlin, Germany
| | - Bastiaan Spanjaard
- Max Delbrück Center for Molecular Medicine, Berlin Institute for Medical Systems Biology, 10115 Berlin, Germany; Charité Universitätsmedizin Berlin, 10117 Berlin, Germany.
| | - Jan Philipp Junker
- Max Delbrück Center for Molecular Medicine, Berlin Institute for Medical Systems Biology, 10115 Berlin, Germany; Charité Universitätsmedizin Berlin, 10117 Berlin, Germany.
| |
Collapse
|
26
|
Wang X, Duan M, Li J, Ma A, Xin G, Xu D, Li Z, Liu B, Ma Q. MarsGT: Multi-omics analysis for rare population inference using single-cell graph transformer. Nat Commun 2024; 15:338. [PMID: 38184630 PMCID: PMC10771517 DOI: 10.1038/s41467-023-44570-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Accepted: 12/14/2023] [Indexed: 01/08/2024] Open
Abstract
Rare cell populations are key in neoplastic progression and therapeutic response, offering potential intervention targets. However, their computational identification and analysis often lag behind major cell types. To fill this gap, we introduce MarsGT: Multi-omics Analysis for Rare population inference using a Single-cell Graph Transformer. It identifies rare cell populations using a probability-based heterogeneous graph transformer on single-cell multi-omics data. MarsGT outperforms existing tools in identifying rare cells across 550 simulated and four real human datasets. In mouse retina data, it reveals unique subpopulations of rare bipolar cells and a Müller glia cell subpopulation. In human lymph node data, MarsGT detects an intermediate B cell population potentially acting as lymphoma precursors. In human melanoma data, it identifies a rare MAIT-like population impacted by a high IFN-I response and reveals the mechanism of immunotherapy. Hence, MarsGT offers biological insights and suggests potential strategies for early detection and therapeutic intervention of disease.
Collapse
Affiliation(s)
- Xiaoying Wang
- School of Mathematics, Shandong University, Jinan, Shandong, 250100, China
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, 43210, USA
| | - Maoteng Duan
- School of Mathematics, Shandong University, Jinan, Shandong, 250100, China
| | - Jingxian Li
- School of Mathematics, Shandong University, Jinan, Shandong, 250100, China
| | - Anjun Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, 43210, USA
| | - Gang Xin
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, 43210, USA
| | - Dong Xu
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, 65211, USA
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, 65211, USA
| | - Zihai Li
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, 43210, USA
| | - Bingqiang Liu
- School of Mathematics, Shandong University, Jinan, Shandong, 250100, China.
| | - Qin Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA.
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, 43210, USA.
| |
Collapse
|
27
|
Alanis-Lobato G, Bartlett TE, Huang Q, Simon CS, McCarthy A, Elder K, Snell P, Christie L, Niakan KK. MICA: a multi-omics method to predict gene regulatory networks in early human embryos. Life Sci Alliance 2024; 7:e202302415. [PMID: 37879938 PMCID: PMC10599980 DOI: 10.26508/lsa.202302415] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 10/12/2023] [Accepted: 10/13/2023] [Indexed: 10/27/2023] Open
Abstract
Recent advances in single-cell omics have transformed characterisation of cell types in challenging-to-study biological contexts. In contexts with limited single-cell samples, such as the early human embryo inference of transcription factor-gene regulatory network (GRN) interactions is especially difficult. Here, we assessed application of different linear or non-linear GRN predictions to single-cell simulated and human embryo transcriptome datasets. We also compared how expression normalisation impacts on GRN predictions, finding that transcripts per million reads outperformed alternative methods. GRN inferences were more reproducible using a non-linear method based on mutual information (MI) applied to single-cell transcriptome datasets refined with chromatin accessibility (CA) (called MICA), compared with alternative network prediction methods tested. MICA captures complex non-monotonic dependencies and feedback loops. Using MICA, we generated the first GRN inferences in early human development. MICA predicted co-localisation of the AP-1 transcription factor subunit proto-oncogene JUND and the TFAP2C transcription factor AP-2γ in early human embryos. Overall, our comparative analysis of GRN prediction methods defines a pipeline that can be applied to single-cell multi-omics datasets in especially challenging contexts to infer interactions between transcription factor expression and target gene regulation.
Collapse
Affiliation(s)
| | | | - Qiulin Huang
- Human Embryo and Stem Cell Laboratory, The Francis Crick Institute, London, UK
- Department of Physiology, Development and Neuroscience, The Centre for Trophoblast Research, University of Cambridge, Cambridge, UK
| | - Claire S Simon
- Human Embryo and Stem Cell Laboratory, The Francis Crick Institute, London, UK
| | - Afshan McCarthy
- Human Embryo and Stem Cell Laboratory, The Francis Crick Institute, London, UK
| | | | | | | | - Kathy K Niakan
- Human Embryo and Stem Cell Laboratory, The Francis Crick Institute, London, UK
- Department of Physiology, Development and Neuroscience, The Centre for Trophoblast Research, University of Cambridge, Cambridge, UK
- Wellcome - Medical Research Council Cambridge Stem Cell Institute, Jeffrey Cheah Biomedical Centre, University of Cambridge, Cambridge, UK
- Epigenetics Programme, Babraham Institute, Cambridge, UK
| |
Collapse
|
28
|
de Boer CG, Taipale J. Hold out the genome: a roadmap to solving the cis-regulatory code. Nature 2024; 625:41-50. [PMID: 38093018 DOI: 10.1038/s41586-023-06661-w] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 09/20/2023] [Indexed: 01/05/2024]
Abstract
Gene expression is regulated by transcription factors that work together to read cis-regulatory DNA sequences. The 'cis-regulatory code' - how cells interpret DNA sequences to determine when, where and how much genes should be expressed - has proven to be exceedingly complex. Recently, advances in the scale and resolution of functional genomics assays and machine learning have enabled substantial progress towards deciphering this code. However, the cis-regulatory code will probably never be solved if models are trained only on genomic sequences; regions of homology can easily lead to overestimation of predictive performance, and our genome is too short and has insufficient sequence diversity to learn all relevant parameters. Fortunately, randomly synthesized DNA sequences enable testing a far larger sequence space than exists in our genomes, and designed DNA sequences enable targeted queries to maximally improve the models. As the same biochemical principles are used to interpret DNA regardless of its source, models trained on these synthetic data can predict genomic activity, often better than genome-trained models. Here we provide an outlook on the field, and propose a roadmap towards solving the cis-regulatory code by a combination of machine learning and massively parallel assays using synthetic DNA.
Collapse
Affiliation(s)
- Carl G de Boer
- School of Biomedical Engineering, University of British Columbia, Vancouver, British Columbia, Canada.
| | - Jussi Taipale
- Applied Tumor Genomics Research Program, Faculty of Medicine, University of Helsinki, Helsinki, Finland.
- Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden.
- Department of Biochemistry, University of Cambridge, Cambridge, UK.
| |
Collapse
|
29
|
Bravo González-Blas C, Matetovici I, Hillen H, Taskiran II, Vandepoel R, Christiaens V, Sansores-García L, Verboven E, Hulselmans G, Poovathingal S, Demeulemeester J, Psatha N, Mauduit D, Halder G, Aerts S. Single-cell spatial multi-omics and deep learning dissect enhancer-driven gene regulatory networks in liver zonation. Nat Cell Biol 2024; 26:153-167. [PMID: 38182825 PMCID: PMC10791584 DOI: 10.1038/s41556-023-01316-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 11/15/2023] [Indexed: 01/07/2024]
Abstract
In the mammalian liver, hepatocytes exhibit diverse metabolic and functional profiles based on their location within the liver lobule. However, it is unclear whether this spatial variation, called zonation, is governed by a well-defined gene regulatory code. Here, using a combination of single-cell multiomics, spatial omics, massively parallel reporter assays and deep learning, we mapped enhancer-gene regulatory networks across mouse liver cell types. We found that zonation affects gene expression and chromatin accessibility in hepatocytes, among other cell types. These states are driven by the repressors TCF7L1 and TBX3, alongside other core hepatocyte transcription factors, such as HNF4A, CEBPA, FOXA1 and ONECUT1. To examine the architecture of the enhancers driving these cell states, we trained a hierarchical deep learning model called DeepLiver. Our study provides a multimodal understanding of the regulatory code underlying hepatocyte identity and their zonation state that can be used to engineer enhancers with specific activity levels and zonation patterns.
Collapse
Affiliation(s)
- Carmen Bravo González-Blas
- VIB Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Irina Matetovici
- VIB Center for Brain & Disease Research, Leuven, Belgium
- VIB Center for AI and Computational Biology (VIB.AI), Leuven, Belgium
- VIB Tech Watch, VIB Headquarters, Ghent, Belgium
| | - Hanne Hillen
- VIB Center for Cancer Biology, Leuven, Belgium
- Department of Oncology, KU Leuven, Leuven, Belgium
| | - Ibrahim Ihsan Taskiran
- VIB Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
- VIB Center for AI and Computational Biology (VIB.AI), Leuven, Belgium
| | - Roel Vandepoel
- VIB Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
- VIB Center for AI and Computational Biology (VIB.AI), Leuven, Belgium
| | - Valerie Christiaens
- VIB Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
- VIB Center for AI and Computational Biology (VIB.AI), Leuven, Belgium
| | - Leticia Sansores-García
- VIB Center for Cancer Biology, Leuven, Belgium
- Department of Oncology, KU Leuven, Leuven, Belgium
| | - Elisabeth Verboven
- VIB Center for Cancer Biology, Leuven, Belgium
- Department of Oncology, KU Leuven, Leuven, Belgium
| | - Gert Hulselmans
- VIB Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
- VIB Center for AI and Computational Biology (VIB.AI), Leuven, Belgium
| | | | - Jonas Demeulemeester
- VIB Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Nikoleta Psatha
- VIB Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - David Mauduit
- VIB Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
- VIB Center for AI and Computational Biology (VIB.AI), Leuven, Belgium
| | - Georg Halder
- VIB Center for Cancer Biology, Leuven, Belgium
- Department of Oncology, KU Leuven, Leuven, Belgium
| | - Stein Aerts
- VIB Center for Brain & Disease Research, Leuven, Belgium.
- Department of Human Genetics, KU Leuven, Leuven, Belgium.
- VIB Center for AI and Computational Biology (VIB.AI), Leuven, Belgium.
| |
Collapse
|
30
|
Martens LD, Fischer DS, Yépez VA, Theis FJ, Gagneur J. Modeling fragment counts improves single-cell ATAC-seq analysis. Nat Methods 2024; 21:28-31. [PMID: 38049697 PMCID: PMC10776385 DOI: 10.1038/s41592-023-02112-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Accepted: 10/25/2023] [Indexed: 12/06/2023]
Abstract
Single-cell ATAC sequencing coverage in regulatory regions is typically binarized as an indicator of open chromatin. Here we show that binarization is an unnecessary step that neither improves goodness of fit, clustering, cell type identification nor batch integration. Fragment counts, but not read counts, should instead be modeled, which preserves quantitative regulatory information. These results have immediate implications for single-cell ATAC sequencing analysis.
Collapse
Affiliation(s)
- Laura D Martens
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
- Computational Health Center, Helmholtz Center Munich, Neuherberg, Germany
- Helmholtz Association, Munich School for Data Science (MUDS), Munich, Germany
| | - David S Fischer
- Computational Health Center, Helmholtz Center Munich, Neuherberg, Germany
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Vicente A Yépez
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany
| | - Fabian J Theis
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany.
- Computational Health Center, Helmholtz Center Munich, Neuherberg, Germany.
- Helmholtz Association, Munich School for Data Science (MUDS), Munich, Germany.
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany.
| | - Julien Gagneur
- School of Computation, Information and Technology, Technical University of Munich, Garching, Germany.
- Computational Health Center, Helmholtz Center Munich, Neuherberg, Germany.
- Helmholtz Association, Munich School for Data Science (MUDS), Munich, Germany.
- Institute of Human Genetics, School of Medicine, Technical University of Munich, Munich, Germany.
| |
Collapse
|
31
|
Xiong L, Liu J, Han SY, Koppitch K, Guo JJ, Rommelfanger M, Miao Z, Gao F, Hallgrimsdottir IB, Pachter L, Kim J, MacLean AL, McMahon AP. Direct androgen receptor control of sexually dimorphic gene expression in the mammalian kidney. Dev Cell 2023; 58:2338-2358.e5. [PMID: 37673062 PMCID: PMC10873092 DOI: 10.1016/j.devcel.2023.08.010] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Revised: 07/20/2023] [Accepted: 08/04/2023] [Indexed: 09/08/2023]
Abstract
Mammalian organs exhibit distinct physiology, disease susceptibility, and injury responses between the sexes. In the mouse kidney, sexually dimorphic gene activity maps predominantly to proximal tubule (PT) segments. Bulk RNA sequencing (RNA-seq) data demonstrated that sex differences were established from 4 and 8 weeks after birth under gonadal control. Hormone injection studies and genetic removal of androgen and estrogen receptors demonstrated androgen receptor (AR)-mediated regulation of gene activity in PT cells as the regulatory mechanism. Interestingly, caloric restriction feminizes the male kidney. Single-nuclear multiomic analysis identified putative cis-regulatory regions and cooperating factors mediating PT responses to AR activity in the mouse kidney. In the human kidney, a limited set of genes showed conserved sex-linked regulation, whereas analysis of the mouse liver underscored organ-specific differences in the regulation of sexually dimorphic gene expression. These findings raise interesting questions on the evolution, physiological significance, disease, and metabolic linkage of sexually dimorphic gene activity.
Collapse
Affiliation(s)
- Lingyun Xiong
- Department of Stem Cell Biology and Regenerative Medicine, Eli and Edythe Broad Center for Regenerative Medicine and Stem Cell Research, Keck School of Medicine of the University of Southern California, Los Angeles, CA 90089, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Jing Liu
- Department of Stem Cell Biology and Regenerative Medicine, Eli and Edythe Broad Center for Regenerative Medicine and Stem Cell Research, Keck School of Medicine of the University of Southern California, Los Angeles, CA 90089, USA
| | - Seung Yub Han
- Graduate Program in Genomics and Computational Biology, Biomedical Graduate Studies, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Kari Koppitch
- Department of Stem Cell Biology and Regenerative Medicine, Eli and Edythe Broad Center for Regenerative Medicine and Stem Cell Research, Keck School of Medicine of the University of Southern California, Los Angeles, CA 90089, USA
| | - Jin-Jin Guo
- Department of Stem Cell Biology and Regenerative Medicine, Eli and Edythe Broad Center for Regenerative Medicine and Stem Cell Research, Keck School of Medicine of the University of Southern California, Los Angeles, CA 90089, USA
| | - Megan Rommelfanger
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Zhen Miao
- Graduate Program in Genomics and Computational Biology, Biomedical Graduate Studies, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Fan Gao
- Caltech Bioinformatics Resource Center at Beckman Institute, California Institute of Technology, Pasadena, CA 91125, USA
| | - Ingileif B Hallgrimsdottir
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA; Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA 91125, USA
| | - Junhyong Kim
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Adam L MacLean
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA 90089, USA
| | - Andrew P McMahon
- Department of Stem Cell Biology and Regenerative Medicine, Eli and Edythe Broad Center for Regenerative Medicine and Stem Cell Research, Keck School of Medicine of the University of Southern California, Los Angeles, CA 90089, USA.
| |
Collapse
|
32
|
Klie A, Laub D, Talwar JV, Stites H, Jores T, Solvason JJ, Farley EK, Carter H. Predictive analyses of regulatory sequences with EUGENe. NATURE COMPUTATIONAL SCIENCE 2023; 3:946-956. [PMID: 38177592 PMCID: PMC10768637 DOI: 10.1038/s43588-023-00544-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 09/27/2023] [Indexed: 01/06/2024]
Abstract
Deep learning has become a popular tool to study cis-regulatory function. Yet efforts to design software for deep-learning analyses in regulatory genomics that are findable, accessible, interoperable and reusable (FAIR) have fallen short of fully meeting these criteria. Here we present elucidating the utility of genomic elements with neural nets (EUGENe), a FAIR toolkit for the analysis of genomic sequences with deep learning. EUGENe consists of a set of modules and subpackages for executing the key functionality of a genomics deep learning workflow: (1) extracting, transforming and loading sequence data from many common file formats; (2) instantiating, initializing and training diverse model architectures; and (3) evaluating and interpreting model behavior. We designed EUGENe as a simple, flexible and extensible interface for streamlining and customizing end-to-end deep-learning sequence analyses, and illustrate these principles through application of the toolkit to three predictive modeling tasks. We hope that EUGENe represents a springboard towards a collaborative ecosystem for deep-learning applications in genomics research.
Collapse
Affiliation(s)
- Adam Klie
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA, USA
| | - David Laub
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA, USA
| | - James V Talwar
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA, USA
| | | | - Tobias Jores
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Joe J Solvason
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA, USA
- Department of Molecular Biology, University of California San Diego, La Jolla, CA, USA
| | - Emma K Farley
- Department of Medicine, University of California San Diego, La Jolla, CA, USA
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA, USA
- Department of Molecular Biology, University of California San Diego, La Jolla, CA, USA
| | - Hannah Carter
- Department of Medicine, University of California San Diego, La Jolla, CA, USA.
- Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
33
|
Badia-I-Mompel P, Wessels L, Müller-Dott S, Trimbour R, Ramirez Flores RO, Argelaguet R, Saez-Rodriguez J. Gene regulatory network inference in the era of single-cell multi-omics. Nat Rev Genet 2023; 24:739-754. [PMID: 37365273 DOI: 10.1038/s41576-023-00618-5] [Citation(s) in RCA: 93] [Impact Index Per Article: 46.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/12/2023] [Indexed: 06/28/2023]
Abstract
The interplay between chromatin, transcription factors and genes generates complex regulatory circuits that can be represented as gene regulatory networks (GRNs). The study of GRNs is useful to understand how cellular identity is established, maintained and disrupted in disease. GRNs can be inferred from experimental data - historically, bulk omics data - and/or from the literature. The advent of single-cell multi-omics technologies has led to the development of novel computational methods that leverage genomic, transcriptomic and chromatin accessibility information to infer GRNs at an unprecedented resolution. Here, we review the key principles of inferring GRNs that encompass transcription factor-gene interactions from transcriptomics and chromatin accessibility data. We focus on the comparison and classification of methods that use single-cell multimodal data. We highlight challenges in GRN inference, in particular with respect to benchmarking, and potential further developments using additional data modalities.
Collapse
Affiliation(s)
- Pau Badia-I-Mompel
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | - Lorna Wessels
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
- Department of Vascular Biology and Tumor Angiogenesis, European Center for Angioscience, Medical Faculty, MannHeim Heidelberg University, Mannheim, Germany
| | - Sophia Müller-Dott
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | - Rémi Trimbour
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
- Institut Pasteur, Université Paris Cité, CNRS UMR 3738, Machine Learning for Integrative Genomics Group, Paris, France
| | - Ricardo O Ramirez Flores
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | | | - Julio Saez-Rodriguez
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany.
| |
Collapse
|
34
|
Murugesan SN, Monteiro A. Butterfly eyespots exhibit unique patterns of open chromatin. F1000Res 2023; 12:1428. [PMID: 38778811 PMCID: PMC11109672 DOI: 10.12688/f1000research.133789.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 06/02/2023] [Indexed: 05/25/2024] Open
Abstract
Background: How the precise spatial regulation of genes is correlated with spatial variation in chromatin accessibilities is not yet clear. Previous studies that analysed chromatin from homogenates of whole-body parts of insects found little variation in chromatin accessibility across those parts, but single-cell studies of Drosophila brains showed extensive spatial variation in chromatin accessibility across that organ. In this work we studied the chromatin accessibility of butterfly wing tissue fated to differentiate distinct colors and patterns in pupal wings of Bicyclus anynana. Methods: We dissected small eyespot and adjacent control tissues from 3h pupae and performed ATAC-Seq to identify the chromatin accessibility differences between different sections of the wings. Results: We observed that three dissected wing regions showed unique chromatin accessibilities. Open chromatin regions specific to eyespot color patterns were highly enriched for binding motifs recognized by Suppressor of Hairless (Su(H)), Krüppel (Kr), Buttonhead (Btd) and Nubbin (Nub) transcription factors. Genes in the vicinity of the eyespot-specific open chromatin regions included those involved in wound healing and SMAD signal transduction pathways, previously proposed to be involved in eyespot development. Conclusions: We conclude that eyespot and non-eyespot tissue samples taken from the same wing have distinct patterns of chromatin accessibility, possibly driven by the eyespot-restricted expression of potential pioneer factors, such as Kr.
Collapse
Affiliation(s)
| | - Antónia Monteiro
- Biological Sciences, National University of Singapore, Singapore, 117558, Singapore
| |
Collapse
|
35
|
Nair S, Ameen M, Sundaram L, Pampari A, Schreiber J, Balsubramani A, Wang YX, Burns D, Blau HM, Karakikes I, Wang KC, Kundaje A. Transcription factor stoichiometry, motif affinity and syntax regulate single-cell chromatin dynamics during fibroblast reprogramming to pluripotency. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.04.560808. [PMID: 37873116 PMCID: PMC10592962 DOI: 10.1101/2023.10.04.560808] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Ectopic expression of OCT4, SOX2, KLF4 and MYC (OSKM) transforms differentiated cells into induced pluripotent stem cells. To refine our mechanistic understanding of reprogramming, especially during the early stages, we profiled chromatin accessibility and gene expression at single-cell resolution across a densely sampled time course of human fibroblast reprogramming. Using neural networks that map DNA sequence to ATAC-seq profiles at base-resolution, we annotated cell-state-specific predictive transcription factor (TF) motif syntax in regulatory elements, inferred affinity- and concentration-dependent dynamics of Tn5-bias corrected TF footprints, linked peaks to putative target genes, and elucidated rewiring of TF-to-gene cis-regulatory networks. Our models reveal that early in reprogramming, OSK, at supraphysiological concentrations, rapidly open transient regulatory elements by occupying non-canonical low-affinity binding sites. As OSK concentration falls, the accessibility of these transient elements decays as a function of motif affinity. We find that these OSK-dependent transient elements sequester the somatic TF AP-1. This redistribution is strongly associated with the silencing of fibroblast-specific genes within individual nuclei. Together, our integrated single-cell resource and models reveal insights into the cis-regulatory code of reprogramming at unprecedented resolution, connect TF stoichiometry and motif syntax to diversification of cell fate trajectories, and provide new perspectives on the dynamics and role of transient regulatory elements in somatic silencing.
Collapse
Affiliation(s)
- Surag Nair
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Mohamed Ameen
- Department of Cancer Biology, Stanford University, Stanford, CA, USA
- Cardiovascular Institute, Stanford University, Stanford, CA, USA
- Department of Dermatology, Stanford University, Stanford, CA, USA
- Program in Epithelial Biology, Stanford University, Stanford, CA, USA
| | | | - Anusri Pampari
- Department of Computer Science, Stanford University, Stanford, CA, USA
| | - Jacob Schreiber
- Department of Genetics, Stanford University, Stanford, CA, USA
| | | | - Yu Xin Wang
- Baxter Laboratory for Stem Cell Biology, Stanford University, Stanford, CA, USA
| | - David Burns
- Baxter Laboratory for Stem Cell Biology, Stanford University, Stanford, CA, USA
| | - Helen M Blau
- Baxter Laboratory for Stem Cell Biology, Stanford University, Stanford, CA, USA
- Department of Microbiology and Immunology, Stanford University, Stanford, CA, USA
| | - Ioannis Karakikes
- Cardiovascular Institute, Stanford University, Stanford, CA, USA
- Department of Cardiothoracic Surgery, Stanford University, Stanford, CA, USA
| | - Kevin C Wang
- Department of Dermatology, Stanford University, Stanford, CA, USA
- Program in Epithelial Biology, Stanford University, Stanford, CA, USA
- Veterans Affairs Palo Alto Healthcare System, Palo Alto, CA, USA
| | - Anshul Kundaje
- Department of Computer Science, Stanford University, Stanford, CA, USA
- Department of Genetics, Stanford University, Stanford, CA, USA
| |
Collapse
|
36
|
Tzaferis C, Karatzas E, Baltoumas FA, Pavlopoulos GA, Kollias G, Konstantopoulos D. SCALA: A complete solution for multimodal analysis of single-cell Next Generation Sequencing data. Comput Struct Biotechnol J 2023; 21:5382-5393. [PMID: 38022693 PMCID: PMC10651449 DOI: 10.1016/j.csbj.2023.10.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2023] [Revised: 10/16/2023] [Accepted: 10/16/2023] [Indexed: 12/01/2023] Open
Abstract
Analysis and interpretation of high-throughput transcriptional and chromatin accessibility data at single-cell (sc) resolution are still open challenges in the biomedical field. The existence of countless bioinformatics tools, for the different analytical steps, increases the complexity of data interpretation and the difficulty to derive biological insights. In this article, we present SCALA, a bioinformatics tool for analysis and visualization of single-cell RNA sequencing (scRNA-seq) and Assay for Transposase-Accessible Chromatin using sequencing (scATAC-seq) datasets, enabling either independent or integrative analysis of the two modalities. SCALA combines standard types of analysis by integrating multiple software packages varying from quality control to the identification of distinct cell populations and cell states. Additional analysis options enable functional enrichment, cellular trajectory inference, ligand-receptor analysis, and regulatory network reconstruction. SCALA is fully parameterizable, presenting data in tabular format and producing publication-ready visualizations. The different available analysis modules can aid biomedical researchers in exploring, analyzing, and visualizing their data without any prior experience in coding. We demonstrate the functionality of SCALA through two use-cases related to TNF-driven arthritic mice, handling both scRNA-seq and scATAC-seq datasets. SCALA is developed in R, Shiny and JavaScript and is mainly available as a standalone version, while an online service of more limited capacity can be found at http://scala.pavlopouloslab.info or https://scala.fleming.gr.
Collapse
Affiliation(s)
- Christos Tzaferis
- Institute for Bioinnovation, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
| | - Evangelos Karatzas
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
| | - Fotis A. Baltoumas
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
| | - Georgios A. Pavlopoulos
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
- Research Institute of New Biotechnologies and Precision Medicine, National and Kapodistrian University of Athens, Greece
| | - George Kollias
- Institute for Bioinnovation, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
- Research Institute of New Biotechnologies and Precision Medicine, National and Kapodistrian University of Athens, Greece
- Department of Physiology, Medical School, National and Kapodistrian University of Athens, Greece
| | - Dimitris Konstantopoulos
- Institute for Bioinnovation, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
| |
Collapse
|
37
|
Khodursky S, Zheng EB, Svetec N, Durkin SM, Benjamin S, Gadau A, Wu X, Zhao L. The evolution and mutational robustness of chromatin accessibility in Drosophila. Genome Biol 2023; 24:232. [PMID: 37845780 PMCID: PMC10578003 DOI: 10.1186/s13059-023-03079-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 09/29/2023] [Indexed: 10/18/2023] Open
Abstract
BACKGROUND The evolution of genomic regulatory regions plays a critical role in shaping the diversity of life. While this process is primarily sequence-dependent, the enormous complexity of biological systems complicates the understanding of the factors underlying regulation and its evolution. Here, we apply deep neural networks as a tool to investigate the sequence determinants underlying chromatin accessibility in different species and tissues of Drosophila. RESULTS We train hybrid convolution-attention neural networks to accurately predict ATAC-seq peaks using only local DNA sequences as input. We show that our models generalize well across substantially evolutionarily diverged species of insects, implying that the sequence determinants of accessibility are highly conserved. Using our model to examine species-specific gains in accessibility, we find evidence suggesting that these regions may be ancestrally poised for evolution. Using in silico mutagenesis, we show that accessibility can be accurately predicted from short subsequences in each example. However, in silico knock-out of these sequences does not qualitatively impair classification, implying that accessibility is mutationally robust. Subsequently, we show that accessibility is predicted to be robust to large-scale random mutation even in the absence of selection. Conversely, simulations under strong selection demonstrate that accessibility can be extremely malleable despite its robustness. Finally, we identify motifs predictive of accessibility, recovering both novel and previously known motifs. CONCLUSIONS These results demonstrate the conservation of the sequence determinants of accessibility and the general robustness of chromatin accessibility, as well as the power of deep neural networks to explore fundamental questions in regulatory genomics and evolution.
Collapse
Affiliation(s)
- Samuel Khodursky
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
| | - Eric B Zheng
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
| | - Nicolas Svetec
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
| | - Sylvia M Durkin
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
- Present Address: Department of Integrative Biology and Museum of Vertebrate Zoology, University of California, Berkeley, Berkeley, CA, USA
| | - Sigi Benjamin
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
| | - Alice Gadau
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
| | - Xia Wu
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA
| | - Li Zhao
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY, 10065, USA.
| |
Collapse
|
38
|
Zhang P, Wang H, Xu H, Wei L, Liu L, Hu Z, Wang X. Deep flanking sequence engineering for efficient promoter design using DeepSEED. Nat Commun 2023; 14:6309. [PMID: 37813854 PMCID: PMC10562447 DOI: 10.1038/s41467-023-41899-y] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2023] [Accepted: 09/20/2023] [Indexed: 10/11/2023] Open
Abstract
Designing promoters with desirable properties is essential in synthetic biology. Human experts are skilled at identifying strong explicit patterns in small samples, while deep learning models excel at detecting implicit weak patterns in large datasets. Biologists have described the sequence patterns of promoters via transcription factor binding sites (TFBSs). However, the flanking sequences of cis-regulatory elements, have long been overlooked and often arbitrarily decided in promoter design. To address this limitation, we introduce DeepSEED, an AI-aided framework that efficiently designs synthetic promoters by combining expert knowledge with deep learning techniques. DeepSEED has demonstrated success in improving the properties of Escherichia coli constitutive, IPTG-inducible, and mammalian cell doxycycline (Dox)-inducible promoters. Furthermore, our results show that DeepSEED captures the implicit features in flanking sequences, such as k-mer frequencies and DNA shape features, which are crucial for determining promoter properties.
Collapse
Affiliation(s)
- Pengcheng Zhang
- Ministry of Education Key Laboratory of Bioinformatics; Center for Synthetic and Systems Biology; Bioinformatics Division, Beijing National Research Center for Information Science and Technology; Department of Automation, Tsinghua University, Beijing, China
| | - Haochen Wang
- Ministry of Education Key Laboratory of Bioinformatics; Center for Synthetic and Systems Biology; Bioinformatics Division, Beijing National Research Center for Information Science and Technology; Department of Automation, Tsinghua University, Beijing, China
| | - Hanwen Xu
- Ministry of Education Key Laboratory of Bioinformatics; Center for Synthetic and Systems Biology; Bioinformatics Division, Beijing National Research Center for Information Science and Technology; Department of Automation, Tsinghua University, Beijing, China
| | - Lei Wei
- Ministry of Education Key Laboratory of Bioinformatics; Center for Synthetic and Systems Biology; Bioinformatics Division, Beijing National Research Center for Information Science and Technology; Department of Automation, Tsinghua University, Beijing, China
| | - Liyang Liu
- Ministry of Education Key Laboratory of Bioinformatics; Center for Synthetic and Systems Biology; Bioinformatics Division, Beijing National Research Center for Information Science and Technology; Department of Automation, Tsinghua University, Beijing, China
| | - Zhirui Hu
- Center for Statistical Science, Tsinghua University, Beijing, China
| | - Xiaowo Wang
- Ministry of Education Key Laboratory of Bioinformatics; Center for Synthetic and Systems Biology; Bioinformatics Division, Beijing National Research Center for Information Science and Technology; Department of Automation, Tsinghua University, Beijing, China.
| |
Collapse
|
39
|
Velten B, Stegle O. Principles and challenges of modeling temporal and spatial omics data. Nat Methods 2023; 20:1462-1474. [PMID: 37710019 DOI: 10.1038/s41592-023-01992-y] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 07/31/2023] [Indexed: 09/16/2023]
Abstract
Studies with temporal or spatial resolution are crucial to understand the molecular dynamics and spatial dependencies underlying a biological process or system. With advances in high-throughput omic technologies, time- and space-resolved molecular measurements at scale are increasingly accessible, providing new opportunities to study the role of timing or structure in a wide range of biological questions. At the same time, analyses of the data being generated in the context of spatiotemporal studies entail new challenges that need to be considered, including the need to account for temporal and spatial dependencies and compare them across different scales, biological samples or conditions. In this Review, we provide an overview of common principles and challenges in the analysis of temporal and spatial omics data. We discuss statistical concepts to model temporal and spatial dependencies and highlight opportunities for adapting existing analysis methods to data with temporal and spatial dimensions.
Collapse
Affiliation(s)
- Britta Velten
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany.
- Cellular Genetics Programme, Wellcome Sanger Institute, Hinxton, Cambridge, UK.
- Centre for Organismal Studies (COS) and Interdisciplinary Center for Scientific Computing (IWR), Heidelberg University, Heidelberg, Germany.
| | - Oliver Stegle
- Division of Computational Genomics and Systems Genetics, German Cancer Research Center (DKFZ), Heidelberg, Germany.
- Cellular Genetics Programme, Wellcome Sanger Institute, Hinxton, Cambridge, UK.
- Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany.
| |
Collapse
|
40
|
Ju L, Glastad KM, Sheng L, Gospocic J, Kingwell CJ, Davidson SM, Kocher SD, Bonasio R, Berger SL. Hormonal gatekeeping via the blood-brain barrier governs caste-specific behavior in ants. Cell 2023; 186:4289-4309.e23. [PMID: 37683635 PMCID: PMC10807403 DOI: 10.1016/j.cell.2023.08.002] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Revised: 05/10/2023] [Accepted: 08/01/2023] [Indexed: 09/10/2023]
Abstract
Here, we reveal an unanticipated role of the blood-brain barrier (BBB) in regulating complex social behavior in ants. Using scRNA-seq, we find localization in the BBB of a key hormone-degrading enzyme called juvenile hormone esterase (Jhe), and we show that this localization governs the level of juvenile hormone (JH3) entering the brain. Manipulation of the Jhe level reprograms the brain transcriptome between ant castes. Although ant Jhe is retained and functions intracellularly within the BBB, we show that Drosophila Jhe is naturally extracellular. Heterologous expression of ant Jhe into the Drosophila BBB alters behavior in fly to mimic what is seen in ants. Most strikingly, manipulation of Jhe levels in ants reprograms complex behavior between worker castes. Our study thus uncovers a remarkable, potentially conserved role of the BBB serving as a molecular gatekeeper for a neurohormonal pathway that regulates social behavior.
Collapse
Affiliation(s)
- Linyang Ju
- Department of Biology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA 19104, USA; Epigenetics Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Cell and Developmental Biology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
| | - Karl M Glastad
- Epigenetics Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Cell and Developmental Biology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA.
| | - Lihong Sheng
- Epigenetics Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Cell and Developmental Biology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
| | - Janko Gospocic
- Epigenetics Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Cell and Developmental Biology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Urology and Institute of Neuropathology, Medical Center-University of Freiburg, Freiburg, Germany
| | - Callum J Kingwell
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ 08544, USA
| | - Shawn M Davidson
- Lewis-Sigler Institute for Genomics, Princeton University, Princeton, NJ 08544, USA
| | - Sarah D Kocher
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ 08544, USA; Lewis-Sigler Institute for Genomics, Princeton University, Princeton, NJ 08544, USA
| | - Roberto Bonasio
- Epigenetics Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Cell and Developmental Biology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
| | - Shelley L Berger
- Department of Biology, School of Arts and Sciences, University of Pennsylvania, Philadelphia, PA 19104, USA; Epigenetics Institute, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Cell and Developmental Biology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA; Department of Genetics, Perelman School of Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA.
| |
Collapse
|
41
|
Ramalingam V, Yu X, Slaughter BD, Unruh JR, Brennan KJ, Onyshchenko A, Lange JJ, Natarajan M, Buck M, Zeitlinger J. Lola-I is a promoter pioneer factor that establishes de novo Pol II pausing during development. Nat Commun 2023; 14:5862. [PMID: 37735176 PMCID: PMC10514308 DOI: 10.1038/s41467-023-41408-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Accepted: 08/30/2023] [Indexed: 09/23/2023] Open
Abstract
While the accessibility of enhancers is dynamically regulated during development, promoters tend to be constitutively accessible and poised for activation by paused Pol II. By studying Lola-I, a Drosophila zinc finger transcription factor, we show here that the promoter state can also be subject to developmental regulation independently of gene activation. Lola-I is ubiquitously expressed at the end of embryogenesis and causes its target promoters to become accessible and acquire paused Pol II throughout the embryo. This promoter transition is required but not sufficient for tissue-specific target gene activation. Lola-I mediates this function by depleting promoter nucleosomes, similar to the action of pioneer factors at enhancers. These results uncover a level of regulation for promoters that is normally found at enhancers and reveal a mechanism for the de novo establishment of paused Pol II at promoters.
Collapse
Affiliation(s)
- Vivekanandan Ramalingam
- Stowers Institute for Medical Research, Kansas City, MO, USA
- Department of Pathology and Laboratory Medicine, University of Kansas Medical Center----, Kansas City, KS, USA
- Department of Genetics, Stanford University, Palo Alto, CA, USA
| | - Xinyang Yu
- Department of Biochemistry, State University of New York at Buffalo, Buffalo, NY, USA
| | | | - Jay R Unruh
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | | | | | - Jeffrey J Lange
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | | | - Michael Buck
- Department of Biochemistry, State University of New York at Buffalo, Buffalo, NY, USA
- Department of Biomedical Informatics, Jacobs School of Medicine & Biomedical Sciences, Buffalo, NY, USA
| | - Julia Zeitlinger
- Stowers Institute for Medical Research, Kansas City, MO, USA.
- Department of Pathology and Laboratory Medicine, University of Kansas Medical Center----, Kansas City, KS, USA.
| |
Collapse
|
42
|
Bravo González-Blas C, De Winter S, Hulselmans G, Hecker N, Matetovici I, Christiaens V, Poovathingal S, Wouters J, Aibar S, Aerts S. SCENIC+: single-cell multiomic inference of enhancers and gene regulatory networks. Nat Methods 2023; 20:1355-1367. [PMID: 37443338 PMCID: PMC10482700 DOI: 10.1038/s41592-023-01938-4] [Citation(s) in RCA: 166] [Impact Index Per Article: 83.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Accepted: 06/06/2023] [Indexed: 07/15/2023]
Abstract
Joint profiling of chromatin accessibility and gene expression in individual cells provides an opportunity to decipher enhancer-driven gene regulatory networks (GRNs). Here we present a method for the inference of enhancer-driven GRNs, called SCENIC+. SCENIC+ predicts genomic enhancers along with candidate upstream transcription factors (TFs) and links these enhancers to candidate target genes. To improve both recall and precision of TF identification, we curated and clustered a motif collection with more than 30,000 motifs. We benchmarked SCENIC+ on diverse datasets from different species, including human peripheral blood mononuclear cells, ENCODE cell lines, melanoma cell states and Drosophila retinal development. Next, we exploit SCENIC+ predictions to study conserved TFs, enhancers and GRNs between human and mouse cell types in the cerebral cortex. Finally, we use SCENIC+ to study the dynamics of gene regulation along differentiation trajectories and the effect of TF perturbations on cell state. SCENIC+ is available at scenicplus.readthedocs.io .
Collapse
Affiliation(s)
- Carmen Bravo González-Blas
- VIB Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Seppe De Winter
- VIB Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Gert Hulselmans
- VIB Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Nikolai Hecker
- VIB Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Irina Matetovici
- VIB Center for Brain & Disease Research, Leuven, Belgium
- VIB Tech Watch, VIB Headquarters, Ghent, Belgium
| | - Valerie Christiaens
- VIB Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | | | - Jasper Wouters
- VIB Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Sara Aibar
- VIB Center for Brain & Disease Research, Leuven, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Stein Aerts
- VIB Center for Brain & Disease Research, Leuven, Belgium.
- Department of Human Genetics, KU Leuven, Leuven, Belgium.
| |
Collapse
|
43
|
Mohana G, Dorier J, Li X, Mouginot M, Smith RC, Malek H, Leleu M, Rodriguez D, Khadka J, Rosa P, Cousin P, Iseli C, Restrepo S, Guex N, McCabe BD, Jankowski A, Levine MS, Gambetta MC. Chromosome-level organization of the regulatory genome in the Drosophila nervous system. Cell 2023; 186:3826-3844.e26. [PMID: 37536338 PMCID: PMC10529364 DOI: 10.1016/j.cell.2023.07.008] [Citation(s) in RCA: 25] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2022] [Revised: 03/31/2023] [Accepted: 07/06/2023] [Indexed: 08/05/2023]
Abstract
Previous studies have identified topologically associating domains (TADs) as basic units of genome organization. We present evidence of a previously unreported level of genome folding, where distant TAD pairs, megabases apart, interact to form meta-domains. Within meta-domains, gene promoters and structural intergenic elements present in distant TADs are specifically paired. The associated genes encode neuronal determinants, including those engaged in axonal guidance and adhesion. These long-range associations occur in a large fraction of neurons but support transcription in only a subset of neurons. Meta-domains are formed by diverse transcription factors that are able to pair over long and flexible distances. We present evidence that two such factors, GAF and CTCF, play direct roles in this process. The relative simplicity of higher-order meta-domain interactions in Drosophila, compared with those previously described in mammals, allowed the demonstration that genomes can fold into highly specialized cell-type-specific scaffolds that enable megabase-scale regulatory associations.
Collapse
Affiliation(s)
- Giriram Mohana
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
| | - Julien Dorier
- Bioinformatics Competence Center, University of Lausanne, 1015 Lausanne, Switzerland; Bioinformatics Competence Center, Swiss Federal Institute of Technology Lausanne, 1015 Lausanne, Switzerland
| | - Xiao Li
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA
| | - Marion Mouginot
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
| | - Rebecca C Smith
- Brain Mind Institute, Swiss Federal Institute of Technology Lausanne, 1015 Lausanne, Switzerland
| | - Héléna Malek
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
| | - Marion Leleu
- Bioinformatics Competence Center, University of Lausanne, 1015 Lausanne, Switzerland; Bioinformatics Competence Center, Swiss Federal Institute of Technology Lausanne, 1015 Lausanne, Switzerland
| | - Daniel Rodriguez
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
| | - Jenisha Khadka
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
| | - Patrycja Rosa
- Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, 02-097 Warsaw, Poland
| | - Pascal Cousin
- Center for Integrative Genomics, University of Lausanne, 1015 Lausanne, Switzerland
| | - Christian Iseli
- Bioinformatics Competence Center, University of Lausanne, 1015 Lausanne, Switzerland; Bioinformatics Competence Center, Swiss Federal Institute of Technology Lausanne, 1015 Lausanne, Switzerland
| | - Simon Restrepo
- Arcoris bio AG, Lüssirainstrasse 52, 6300 Zug, Switzerland
| | - Nicolas Guex
- Bioinformatics Competence Center, University of Lausanne, 1015 Lausanne, Switzerland; Bioinformatics Competence Center, Swiss Federal Institute of Technology Lausanne, 1015 Lausanne, Switzerland
| | - Brian D McCabe
- Brain Mind Institute, Swiss Federal Institute of Technology Lausanne, 1015 Lausanne, Switzerland
| | - Aleksander Jankowski
- Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, 02-097 Warsaw, Poland.
| | - Michael S Levine
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ, USA.
| | | |
Collapse
|
44
|
Wang X, Duan M, Li J, Ma A, Xu D, Li Z, Liu B, Ma Q. MarsGT: Multi-omics analysis for rare population inference using single-cell graph transformer. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.15.553454. [PMID: 37645917 PMCID: PMC10462017 DOI: 10.1101/2023.08.15.553454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
Rare cell populations are key in neoplastic progression and therapeutic response, offering potential intervention targets. However, their computational identification and analysis often lag behind major cell types. To fill this gap, we introduced MarsGT: Multi-omics Analysis for Rare population inference using Single-cell Graph Transformer. It identifies rare cell populations using a probability-based heterogeneous graph transformer on single-cell multi-omics data. MarsGT outperformed existing tools in identifying rare cells across 400 simulated and four real human datasets. In mouse retina data, it revealed unique subpopulations of rare bipolar cells and a Müller glia cell subpopulation. In human lymph node data, MarsGT detected an intermediate B cell population potentially acting as lymphoma precursors. In human melanoma data, it identified a rare MAIT-like population impacted by a high IFN-I response and revealed the mechanism of immunotherapy. Hence, MarsGT offers biological insights and suggests potential strategies for early detection and therapeutic intervention of disease.
Collapse
Affiliation(s)
- Xiaoying Wang
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210, USA
| | - Maoteng Duan
- School of Mathematics, Shandong University, Jinan, Shandong, 250100, China
| | - Jingxian Li
- School of Mathematics, Shandong University, Jinan, Shandong, 250100, China
| | - Anjun Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210, USA
| | - Dong Xu
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO 65211, USA
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO 65211, USA
| | - Zihai Li
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210, USA
| | - Bingqiang Liu
- School of Mathematics, Shandong University, Jinan, Shandong, 250100, China
| | - Qin Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, 43210, USA
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH 43210, USA
| |
Collapse
|
45
|
Yuan Y, Chen Q, Brovkina M, Clowney EJ, Yadlapalli S. Clock-dependent chromatin accessibility rhythms regulate circadian transcription. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.15.553315. [PMID: 37645872 PMCID: PMC10462003 DOI: 10.1101/2023.08.15.553315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
Chromatin organization plays a crucial role in gene regulation by controlling the accessibility of DNA to transcription machinery. While significant progress has been made in understanding the regulatory role of clock proteins in circadian rhythms, how chromatin organization affects circadian rhythms remains poorly understood. Here, we employed ATAC-seq (Assay for Transposase-Accessible Chromatin with Sequencing) on FAC-sorted Drosophila clock neurons to assess genome-wide chromatin accessibility over the circadian cycle. We observed significant circadian oscillations in chromatin accessibility at promoter and enhancer regions of hundreds of genes, with enhanced accessibility either at dusk or dawn, which correlated with their peak transcriptional activity. Notably, genes with enhanced accessibility at dusk were enriched with E-box motifs, while those more accessible at dawn were enriched with VRI/PDP1-box motifs, indicating that they are regulated by the core circadian feedback loops, PER/CLK and VRI/PDP1, respectively. Further, we observed a complete loss of chromatin accessibility rhythms in per01 null mutants, with chromatin consistently accessible throughout the circadian cycle, underscoring the critical role of Period protein in driving chromatin compaction during the repression phase. Together, this study demonstrates the significant role of chromatin organization in circadian regulation, revealing how the interplay between clock proteins and chromatin structure orchestrates the precise timing of biological processes throughout the day. This work further implies that variations in chromatin accessibility might play a central role in the generation of diverse circadian gene expression patterns in clock neurons.
Collapse
Affiliation(s)
- Ye Yuan
- Cellular and Molecular Biology Program, University of Michigan, Ann Arbor, MI 48109, USA
| | - Qianqian Chen
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, MI 48109, USA
| | - Margarita Brovkina
- Cellular and Molecular Biology Program, University of Michigan, Ann Arbor, MI 48109, USA
| | - E Josephine Clowney
- Cellular and Molecular Biology Program, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Molecular, Cellular and Developmental Biology, University of Michigan, Ann Arbor, MI 48109, USA
- Michigan Neuroscience Institute Affiliate, University of Michigan, Ann Arbor, MI 48109, USA
| | - Swathi Yadlapalli
- Cellular and Molecular Biology Program, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, MI 48109, USA
- Michigan Neuroscience Institute Affiliate, University of Michigan, Ann Arbor, MI 48109, USA
| |
Collapse
|
46
|
Marku M, Pancaldi V. From time-series transcriptomics to gene regulatory networks: A review on inference methods. PLoS Comput Biol 2023; 19:e1011254. [PMID: 37561790 PMCID: PMC10414591 DOI: 10.1371/journal.pcbi.1011254] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/12/2023] Open
Abstract
Inference of gene regulatory networks has been an active area of research for around 20 years, leading to the development of sophisticated inference algorithms based on a variety of assumptions and approaches. With the ever increasing demand for more accurate and powerful models, the inference problem remains of broad scientific interest. The abstract representation of biological systems through gene regulatory networks represents a powerful method to study such systems, encoding different amounts and types of information. In this review, we summarize the different types of inference algorithms specifically based on time-series transcriptomics, giving an overview of the main applications of gene regulatory networks in computational biology. This review is intended to give an updated reference of regulatory networks inference tools to biologists and researchers new to the topic and guide them in selecting the appropriate inference method that best fits their questions, aims, and experimental data.
Collapse
Affiliation(s)
- Malvina Marku
- CRCT, Université de Toulouse, Inserm, CNRS, Université Toulouse III-Paul Sabatier, Centre de Recherches en Cancérologie de Toulouse, Toulouse, France
| | - Vera Pancaldi
- CRCT, Université de Toulouse, Inserm, CNRS, Université Toulouse III-Paul Sabatier, Centre de Recherches en Cancérologie de Toulouse, Toulouse, France
- Barcelona Supercomputing Center, Barcelona, Spain
| |
Collapse
|
47
|
Jacobs J, Pagani M, Wenzl C, Stark A. Widespread regulatory specificities between transcriptional co-repressors and enhancers in Drosophila. Science 2023; 381:198-204. [PMID: 37440660 DOI: 10.1126/science.adf6149] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2022] [Accepted: 06/13/2023] [Indexed: 07/15/2023]
Abstract
Gene expression is controlled by the precise activation and repression of transcription. Repression is mediated by specialized transcription factors (TFs) that recruit co-repressors (CoRs) to silence transcription, even in the presence of activating cues. However, whether CoRs can dominantly silence all enhancers or display distinct specificities is unclear. In this work, we report that most enhancers in Drosophila can be repressed by only a subset of CoRs, and enhancers classified by CoR sensitivity show distinct chromatin features, function, TF motifs, and binding. Distinct TF motifs render enhancers more resistant or sensitive to specific CoRs, as we demonstrate by motif mutagenesis and addition. These CoR-enhancer compatibilities constitute an additional layer of regulatory specificity that allows differential regulation at close genomic distances and is indicative of distinct mechanisms of transcriptional repression.
Collapse
Affiliation(s)
- Jelle Jacobs
- Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Campus-Vienna-Biocenter 1, Vienna, Austria
| | - Michaela Pagani
- Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Campus-Vienna-Biocenter 1, Vienna, Austria
| | - Christoph Wenzl
- Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Campus-Vienna-Biocenter 1, Vienna, Austria
| | - Alexander Stark
- Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Campus-Vienna-Biocenter 1, Vienna, Austria
- Medical University of Vienna, Vienna BioCenter (VBC), Vienna, Austria
| |
Collapse
|
48
|
Hopkins BR, Barmina O, Kopp A. A single-cell atlas of the sexually dimorphic Drosophila foreleg and its sensory organs during development. PLoS Biol 2023; 21:e3002148. [PMID: 37379332 DOI: 10.1371/journal.pbio.3002148] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 05/03/2023] [Indexed: 06/30/2023] Open
Abstract
To respond to the world around them, animals rely on the input of a network of sensory organs distributed throughout the body. Distinct classes of sensory organs are specialized for the detection of specific stimuli such as strain, pressure, or taste. The features that underlie this specialization relate both to the neurons that innervate sensory organs and the accessory cells they comprise. To understand the genetic basis of this diversity of cell types, both within and between sensory organs, we performed single-cell RNA sequencing on the first tarsal segment of the male Drosophila melanogaster foreleg during pupal development. This tissue displays a wide variety of functionally and structurally distinct sensory organs, including campaniform sensilla, mechanosensory bristles, and chemosensory taste bristles, as well as the sex comb, a recently evolved male-specific structure. In this study, we characterize the cellular landscape in which the sensory organs reside, identify a novel cell type that contributes to the construction of the neural lamella, and resolve the transcriptomic differences among support cells within and between sensory organs. We identify the genes that distinguish between mechanosensory and chemosensory neurons, resolve a combinatorial transcription factor code that defines 4 distinct classes of gustatory neurons and several types of mechanosensory neurons, and match the expression of sensory receptor genes to specific neuron classes. Collectively, our work identifies core genetic features of a variety of sensory organs and provides a rich, annotated resource for studying their development and function.
Collapse
Affiliation(s)
- Ben R Hopkins
- Department of Evolution and Ecology, University of California, Davis, California, United States of America
| | - Olga Barmina
- Department of Evolution and Ecology, University of California, Davis, California, United States of America
| | - Artyom Kopp
- Department of Evolution and Ecology, University of California, Davis, California, United States of America
| |
Collapse
|
49
|
Khodursky S, Zheng EB, Svetec N, Durkin SM, Benjamin S, Gadau A, Wu X, Zhao L. The evolution and mutational robustness of chromatin accessibility in Drosophila. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.26.546587. [PMID: 37425760 PMCID: PMC10327059 DOI: 10.1101/2023.06.26.546587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
The evolution of regulatory regions in the genome plays a critical role in shaping the diversity of life. While this process is primarily sequence-dependent, the enormous complexity of biological systems has made it difficult to understand the factors underlying regulation and its evolution. Here, we apply deep neural networks as a tool to investigate the sequence determinants underlying chromatin accessibility in different tissues of Drosophila. We train hybrid convolution-attention neural networks to accurately predict ATAC-seq peaks using only local DNA sequences as input. We show that a model trained in one species has nearly identical performance when tested in another species, implying that the sequence determinants of accessibility are highly conserved. Indeed, model performance remains excellent even in distantly-related species. By using our model to examine species-specific gains in chromatin accessibility, we find that their orthologous inaccessible regions in other species have surprisingly similar model outputs, suggesting that these regions may be ancestrally poised for evolution. We then use in silico saturation mutagenesis to reveal evidence of selective constraint acting specifically on inaccessible chromatin regions. We further show that chromatin accessibility can be accurately predicted from short subsequences in each example. However, in silico knock-out of these sequences does not qualitatively impair classification, implying that chromatin accessibility is mutationally robust. Subsequently, we demonstrate that chromatin accessibility is predicted to be robust to large-scale random mutation even in the absence of selection. We also perform in silico evolution experiments under the regime of strong selection and weak mutation (SSWM) and show that chromatin accessibility can be extremely malleable despite its mutational robustness. However, selection acting in different directions in a tissue-specific manner can substantially slow adaptation. Finally, we identify motifs predictive of chromatin accessibility and recover motifs corresponding to known chromatin accessibility activators and repressors. These results demonstrate the conservation of the sequence determinants of accessibility and the general robustness of chromatin accessibility, as well as the power of deep neural networks as tools to answer fundamental questions in regulatory genomics and evolution.
Collapse
Affiliation(s)
- Samuel Khodursky
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
- These authors contributed equally
| | - Eric B Zheng
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
- These authors contributed equally
| | - Nicolas Svetec
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
| | - Sylvia M Durkin
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
- Current Address: Department of Integrative Biology and Museum of Vertebrate Zoology, University of California, Berkeley, Berkeley, CA, USA
| | - Sigi Benjamin
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
| | - Alice Gadau
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
| | - Xia Wu
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
| | - Li Zhao
- Laboratory of Evolutionary Genetics and Genomics, The Rockefeller University, New York, NY 10065, USA
| |
Collapse
|
50
|
Lu TC, Brbić M, Park YJ, Jackson T, Chen J, Kolluru SS, Qi Y, Katheder NS, Cai XT, Lee S, Chen YC, Auld N, Liang CY, Ding SH, Welsch D, D’Souza S, Pisco AO, Jones RC, Leskovec J, Lai EC, Bellen HJ, Luo L, Jasper H, Quake SR, Li H. Aging Fly Cell Atlas identifies exhaustive aging features at cellular resolution. Science 2023; 380:eadg0934. [PMID: 37319212 PMCID: PMC10829769 DOI: 10.1126/science.adg0934] [Citation(s) in RCA: 38] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Accepted: 05/04/2023] [Indexed: 06/17/2023]
Abstract
Aging is characterized by a decline in tissue function, but the underlying changes at cellular resolution across the organism remain unclear. Here, we present the Aging Fly Cell Atlas, a single-nucleus transcriptomic map of the whole aging Drosophila. We characterized 163 distinct cell types and performed an in-depth analysis of changes in tissue cell composition, gene expression, and cell identities. We further developed aging clock models to predict fly age and show that ribosomal gene expression is a conserved predictive factor for age. Combining all aging features, we find distinctive cell type-specific aging patterns. This atlas provides a valuable resource for studying fundamental principles of aging in complex organisms.
Collapse
Affiliation(s)
- Tzu-Chiao Lu
- Huffington Center on Aging, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Maria Brbić
- School of Computer and Communication Sciences, Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland
| | - Ye-Jin Park
- Huffington Center on Aging, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Program in Development, Disease Models & Therapeutics, Baylor College of Medicine, Houston, TX 77030, USA
- Jan and Dan Duncan Neurological Research Institute, Texas Children’s Hospital, Houston, TX 77030, USA
| | - Tyler Jackson
- Huffington Center on Aging, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Program in Cancer Cell Biology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Jiaye Chen
- Huffington Center on Aging, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Program in Quantitative & Computational Biosciences, Baylor College of Medicine, Houston, TX 77030, USA
| | - Sai Saroja Kolluru
- Department of Bioengineering, Stanford University, Stanford, CA, USA
- Department of Applied Physics, Stanford University, Stanford, CA, USA
- Chan Zuckerberg Biohub, San Francisco CA, USA
| | - Yanyan Qi
- Huffington Center on Aging, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | | | - Xiaoyu Tracy Cai
- Regenerative Medicine, Genentech, Inc., South San Francisco, CA 94080, USA
| | - Seungjae Lee
- Developmental Biology Program, Sloan Kettering Institute, 1275 York Ave, New York, NY 10065, USA
| | - Yen-Chung Chen
- Department of Biology, New York University, New York, NY 10013, USA
| | - Niccole Auld
- Huffington Center on Aging, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Program in Cancer Cell Biology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Chung-Yi Liang
- Huffington Center on Aging, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Institute of Biochemistry and Molecular Biology, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Sophia H. Ding
- Huffington Center on Aging, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Doug Welsch
- Huffington Center on Aging, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | | | | | - Robert C. Jones
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Jure Leskovec
- Department of Computer Science, Stanford University, Stanford, CA 94305, USA
| | - Eric C. Lai
- Developmental Biology Program, Sloan Kettering Institute, 1275 York Ave, New York, NY 10065, USA
| | - Hugo J. Bellen
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
- Program in Development, Disease Models & Therapeutics, Baylor College of Medicine, Houston, TX 77030, USA
- Jan and Dan Duncan Neurological Research Institute, Texas Children’s Hospital, Houston, TX 77030, USA
| | - Liqun Luo
- Howard Hughes Medical Institute, Department of Biology, Stanford University, Stanford, CA 94305, USA
| | - Heinrich Jasper
- Regenerative Medicine, Genentech, Inc., South San Francisco, CA 94080, USA
| | - Stephen R. Quake
- Department of Bioengineering, Stanford University, Stanford, CA, USA
- Department of Applied Physics, Stanford University, Stanford, CA, USA
- Chan Zuckerberg Biohub, San Francisco CA, USA
| | - Hongjie Li
- Huffington Center on Aging, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| |
Collapse
|