1
|
Kuang N, Ma Q, Zheng X, Meng X, Zhai Z, Li Q, Pan J. GeTeSEPdb: A comprehensive database and online tool for the identification and analysis of gene profiles with temporal-specific expression patterns. Comput Struct Biotechnol J 2024; 23:2488-2496. [PMID: 38939556 PMCID: PMC11208770 DOI: 10.1016/j.csbj.2024.06.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2024] [Revised: 05/29/2024] [Accepted: 06/04/2024] [Indexed: 06/29/2024] Open
Abstract
Gene expression is dynamic and varies at different stages of processes. The identification of gene profiles with temporal-specific expression patterns can provide valuable insights into ongoing biological processes, such as the cell cycle, cell development, circadian rhythms, or responses to external stimuli such as drug treatments or viral infections. However, currently, no database defines, identifies or archives gene profiles with temporal-specific expression patterns. Here, using a high-throughput regression analysis approach, eight linear and nonlinear parametric models were fitted to gene expression profiles from time-series experiments to identify eight types of gene profiles with temporal-specific expression patterns. We curated 2684 time-series transcriptome datasets and identified 2644,370 gene profiles exhibiting temporal-specific expression patterns. The results were stored in the database GeTeSEPdb (gene profiles with temporal-specific expression patterns database, http://www.inbirg.com/GeTeSEPdb/). Moreover, we implemented an online tool to identify gene profiles with temporal-specific expression patterns from user-submitted data. In summary, GeTeSEPdb is a comprehensive web service that can be used to identify and analyse gene profiles with temporal-specific expression patterns. This approach facilitates the exploration of transcriptional changes and temporal patterns of responses. We firmly believe that GeTeSEPdb will become a valuable resource for biologists and bioinformaticians.
Collapse
Affiliation(s)
- Ni Kuang
- Basic Medicine Research and Innovation Center for Novel Target and Therapeutic Intervention, Ministry of Education, Institute of Life Sciences, Chongqing Medical University, Chongqing 400016, China
| | - Qinfeng Ma
- Basic Medicine Research and Innovation Center for Novel Target and Therapeutic Intervention, Ministry of Education, Institute of Life Sciences, Chongqing Medical University, Chongqing 400016, China
| | - Xiao Zheng
- Basic Medicine Research and Innovation Center for Novel Target and Therapeutic Intervention, Ministry of Education, Institute of Life Sciences, Chongqing Medical University, Chongqing 400016, China
| | - Xuehang Meng
- Basic Medicine Research and Innovation Center for Novel Target and Therapeutic Intervention, Ministry of Education, Institute of Life Sciences, Chongqing Medical University, Chongqing 400016, China
| | - Zhaoyu Zhai
- Basic Medicine Research and Innovation Center for Novel Target and Therapeutic Intervention, Ministry of Education, Institute of Life Sciences, Chongqing Medical University, Chongqing 400016, China
| | - Qiang Li
- Basic Medicine Research and Innovation Center for Novel Target and Therapeutic Intervention, Ministry of Education, Institute of Life Sciences, Chongqing Medical University, Chongqing 400016, China
| | - Jianbo Pan
- Basic Medicine Research and Innovation Center for Novel Target and Therapeutic Intervention, Ministry of Education, Institute of Life Sciences, Chongqing Medical University, Chongqing 400016, China
- Precision Medicine Center, The Second Affiliated Hospital of Chongqing Medical University, Chongqing 400010, China
| |
Collapse
|
2
|
Trzaskoma P, Jung S, Pękowska A, Bohrer CH, Wang X, Naz F, Dell’Orso S, Dubois WD, Olivera A, Vartak SV, Zhao Y, Nayak S, Overmiller A, Morasso MI, Sartorelli V, Larson DR, Chow CC, Casellas R, O’Shea JJ. 3D chromatin architecture, BRD4, and Mediator have distinct roles in regulating genome-wide transcriptional bursting and gene network. SCIENCE ADVANCES 2024; 10:eadl4893. [PMID: 39121214 PMCID: PMC11313860 DOI: 10.1126/sciadv.adl4893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2023] [Accepted: 07/08/2024] [Indexed: 08/11/2024]
Abstract
Discontinuous transcription is evolutionarily conserved and a fundamental feature of gene regulation; yet, the exact mechanisms underlying transcriptional bursting are unresolved. Analyses of bursting transcriptome-wide have focused on the role of cis-regulatory elements, but other factors that regulate this process remain elusive. We applied mathematical modeling to single-cell RNA sequencing data to infer bursting dynamics transcriptome-wide under multiple conditions to identify possible molecular mechanisms. We found that Mediator complex subunit 26 (MED26) primarily regulates frequency, MYC regulates burst size, while cohesin and Bromodomain-containing protein 4 (BRD4) can modulate both. Despite comparable effects on RNA levels among these perturbations, acute depletion of MED26 had the most profound impact on the entire gene regulatory network, acting downstream of chromatin spatial architecture and without affecting TATA box-binding protein (TBP) recruitment. These results indicate that later steps in the initiation of transcriptional bursts are primary nodes for integrating gene networks in single cells.
Collapse
Affiliation(s)
- Pawel Trzaskoma
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD, USA
| | - SeolKyoung Jung
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Aleksandra Pękowska
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD, USA
- Dioscuri Centre for Chromatin Biology and Epigenomics, Nencki Institute of Experimental Biology, Polish Academy of Sciences, 3 Pasteur Street, 02-093 Warsaw, Poland
| | | | - Xiang Wang
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Faiza Naz
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Stefania Dell’Orso
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Wendy D. Dubois
- National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Ana Olivera
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Supriya V. Vartak
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Yongbing Zhao
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Subhashree Nayak
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Andrew Overmiller
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Maria I. Morasso
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Vittorio Sartorelli
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Daniel R. Larson
- National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Carson C. Chow
- National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Rafael Casellas
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD, USA
| | - John J. O’Shea
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD, USA
| |
Collapse
|
3
|
Hudaiberdiev S, Ovcharenko I. Functional characteristics and computational model of abundant hyperactive loci in the human genome. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.02.05.527203. [PMID: 36945558 PMCID: PMC10028745 DOI: 10.1101/2023.02.05.527203] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Enhancers and promoters are classically considered to be bound by a small set of TFs in a sequence-specific manner. This assumption has come under increasing skepticism as the datasets of ChIP-seq assays of TFs have expanded. In particular, high-occupancy target (HOT) loci attract hundreds of TFs with often no detectable correlation between ChIP-seq peaks and DNA-binding motif presence. Here, we used a set of 1,003 TF ChIP-seq datasets (HepG2, K562, H1) to analyze the patterns of ChIP-seq peak co-occurrence in combination with functional genomics datasets. We identified 43,891 HOT loci forming at the promoter (53%) and enhancer (47%) regions. HOT promoters regulate housekeeping genes, whereas HOT enhancers are involved in tissue-specific process regulation. HOT loci form the foundation of human super-enhancers and evolve under strong negative selection, with some of these loci being located in ultraconserved regions. Sequence-based classification analysis of HOT loci suggested that their formation is driven by the sequence features, and the density of mapped ChIP-seq peaks across TF-bound loci correlates with sequence features and the expression level of flanking genes. Based on the affinities to bind to promoters and enhancers we detected 5 distinct clusters of TFs that form the core of the HOT loci. We report an abundance of HOT loci in the human genome and a commitment of 51% of all TF ChIP-seq binding events to HOT locus formation thus challenging the classical model of enhancer activity and propose a model of HOT locus formation based on the existence of large transcriptional condensates.
Collapse
|
4
|
Mun DG, Bhat FA, Joshi N, Sandoval L, Ding H, Jain A, Peterson JA, Kang T, Pujari GP, Tomlinson JL, Budhraja R, Zenka RM, Kannan N, Kipp BR, Dasari S, Gaspar-Maia A, Smoot RL, Kandasamy RK, Pandey A. Diversity of post-translational modifications and cell signaling revealed by single cell and single organelle mass spectrometry. Commun Biol 2024; 7:884. [PMID: 39030393 PMCID: PMC11271535 DOI: 10.1038/s42003-024-06579-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Accepted: 07/11/2024] [Indexed: 07/21/2024] Open
Abstract
The rapid evolution of mass spectrometry-based single-cell proteomics now enables the cataloging of several thousand proteins from single cells. We investigated whether we could discover cellular heterogeneity beyond proteome, encompassing post-translational modifications (PTM), protein-protein interaction, and variants. By optimizing the mass spectrometry data interpretation strategy to enable the detection of PTMs and variants, we have generated a high-definition dataset of single-cell and nuclear proteomic-states. The data demonstrate the heterogeneity of cell-states and signaling dependencies at the single-cell level and reveal epigenetic drug-induced changes in single nuclei. This approach enables the exploration of previously uncharted single-cell and organellar proteomes revealing molecular characteristics that are inaccessible through RNA profiling.
Collapse
Affiliation(s)
- Dong-Gi Mun
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Firdous A Bhat
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Neha Joshi
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
- Manipal Academy of Higher Education, Manipal, 576104, Karnataka, India
| | - Leticia Sandoval
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
- Center for Individualized Medicine, Mayo Clinic, Rochester, MN, 55905, USA
| | - Husheng Ding
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Anu Jain
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
| | | | - Taewook Kang
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Ganesh P Pujari
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
| | | | - Rohit Budhraja
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Roman M Zenka
- Proteomics Core, Mayo Clinic, Rochester, MN, 55905, USA
| | - Nagarajan Kannan
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Benjamin R Kipp
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
| | - Surendra Dasari
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, 55905, USA
| | - Alexandre Gaspar-Maia
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA
- Center for Individualized Medicine, Mayo Clinic, Rochester, MN, 55905, USA
| | - Rory L Smoot
- Department of Surgery, Mayo Clinic, Rochester, MN, 55905, USA
- Department of Biochemistry and Molecular Biology, Mayo Clinic, Rochester, MN, USA
| | - Richard K Kandasamy
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA.
- Department of Quantitative Health Sciences, Mayo Clinic, Rochester, MN, 55905, USA.
| | - Akhilesh Pandey
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, 55905, USA.
- Manipal Academy of Higher Education, Manipal, 576104, Karnataka, India.
- Center for Individualized Medicine, Mayo Clinic, Rochester, MN, 55905, USA.
| |
Collapse
|
5
|
Chen Z, Wang C, Huang S, Shi Y, Xi R. Directly selecting cell-type marker genes for single-cell clustering analyses. CELL REPORTS METHODS 2024; 4:100810. [PMID: 38981475 PMCID: PMC11294843 DOI: 10.1016/j.crmeth.2024.100810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 03/16/2024] [Accepted: 06/12/2024] [Indexed: 07/11/2024]
Abstract
In single-cell RNA sequencing (scRNA-seq) studies, cell types and their marker genes are often identified by clustering and differentially expressed gene (DEG) analysis. A common practice is to select genes using surrogate criteria such as variance and deviance, then cluster them using selected genes and detect markers by DEG analysis assuming known cell types. The surrogate criteria can miss important genes or select unimportant genes, while DEG analysis has the selection-bias problem. We present Festem, a statistical method for the direct selection of cell-type markers for downstream clustering. Festem distinguishes marker genes with heterogeneous distribution across cells that are cluster informative. Simulation and scRNA-seq applications demonstrate that Festem can sensitively select markers with high precision and enables the identification of cell types often missed by other methods. In a large intrahepatic cholangiocarcinoma dataset, we identify diverse CD8+ T cell types and potential prognostic marker genes.
Collapse
Affiliation(s)
- Zihao Chen
- School of Mathematical Sciences and Center for Statistical Science, Peking University, Beijing 100871, China
| | - Changhu Wang
- School of Mathematical Sciences and Center for Statistical Science, Peking University, Beijing 100871, China
| | - Siyuan Huang
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Yang Shi
- BeiGene (Beijing) Co., Ltd., Beijing 100871, China
| | - Ruibin Xi
- School of Mathematical Sciences and Center for Statistical Science, Peking University, Beijing 100871, China.
| |
Collapse
|
6
|
Wang Q, Ding X, Xu Z, Wang B, Wang A, Wang L, Ding Y, Song S, Chen Y, Zhang S, Jiang L, Ding X. The mouse multi-organ proteome from infancy to adulthood. Nat Commun 2024; 15:5752. [PMID: 38982135 PMCID: PMC11233712 DOI: 10.1038/s41467-024-50183-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Accepted: 07/03/2024] [Indexed: 07/11/2024] Open
Abstract
The early-life organ development and maturation shape the fundamental blueprint for later-life phenotype. However, a multi-organ proteome atlas from infancy to adulthood is currently not available. Herein, we present a comprehensive proteomic analysis of ten mouse organs (brain, heart, lung, liver, kidney, spleen, stomach, intestine, muscle and skin) at three crucial developmental stages (1-, 4- and 8-weeks after birth) acquired using data-independent acquisition mass spectrometry. We detect and quantify 11,533 protein groups across the ten organs and obtain 115 age-related differentially expressed protein groups that are co-expressed in all organs from infancy to adulthood. We find that spliceosome proteins prevalently play crucial regulatory roles in the early-life development of multiple organs, and detect organ-specific expression patterns and sexual dimorphism. This multi-organ proteome atlas provides a fundamental resource for understanding the molecular mechanisms underlying early-life organ development and maturation.
Collapse
Affiliation(s)
- Qingwen Wang
- Department of Anesthesiology and Surgical Intensive Care Unit, Xinhua Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Xinwen Ding
- Department of Anesthesiology and Surgical Intensive Care Unit, Xinhua Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Zhixiao Xu
- Department of Anesthesiology and Surgical Intensive Care Unit, Xinhua Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Boqian Wang
- Department of Anesthesiology and Surgical Intensive Care Unit, Xinhua Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Aiting Wang
- Department of Anesthesiology and Surgical Intensive Care Unit, Xinhua Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Liping Wang
- Department of Anesthesiology and Surgical Intensive Care Unit, Xinhua Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
| | - Yi Ding
- Department of Anesthesiology and Surgical Intensive Care Unit, Xinhua Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Sunfengda Song
- Department of Anesthesiology and Surgical Intensive Care Unit, Xinhua Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Youming Chen
- Department of Anesthesiology and Surgical Intensive Care Unit, Xinhua Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Shuang Zhang
- Department of Anesthesiology and Surgical Intensive Care Unit, Xinhua Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Lai Jiang
- Department of Anesthesiology and Surgical Intensive Care Unit, Xinhua Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China
| | - Xianting Ding
- Department of Anesthesiology and Surgical Intensive Care Unit, Xinhua Hospital, School of Medicine and School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China.
- State Key Laboratory of Oncogenes and Related Genes, Institute for Personalized Medicine, Shanghai Jiao Tong University, Shanghai, China.
| |
Collapse
|
7
|
Pletenev I, Bazarevich M, Zagirova D, Kononkova A, Cherkasov A, Efimova O, Tiukacheva E, Morozov K, Ulianov K, Komkov D, Tvorogova A, Golimbet V, Kondratyev N, Razin S, Khaitovich P, Ulianov S, Khrameeva E. Extensive long-range polycomb interactions and weak compartmentalization are hallmarks of human neuronal 3D genome. Nucleic Acids Res 2024; 52:6234-6252. [PMID: 38647066 PMCID: PMC11194087 DOI: 10.1093/nar/gkae271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Revised: 03/21/2024] [Accepted: 04/06/2024] [Indexed: 04/25/2024] Open
Abstract
Chromatin architecture regulates gene expression and shapes cellular identity, particularly in neuronal cells. Specifically, polycomb group (PcG) proteins enable establishment and maintenance of neuronal cell type by reorganizing chromatin into repressive domains that limit the expression of fate-determining genes and sustain distinct gene expression patterns in neurons. Here, we map the 3D genome architecture in neuronal and non-neuronal cells isolated from the Wernicke's area of four human brains and comprehensively analyze neuron-specific aspects of chromatin organization. We find that genome segregation into active and inactive compartments is greatly reduced in neurons compared to other brain cells. Furthermore, neuronal Hi-C maps reveal strong long-range interactions, forming a specific network of PcG-mediated contacts in neurons that is nearly absent in other brain cells. These interacting loci contain developmental transcription factors with repressed expression in neurons and other mature brain cells. But only in neurons, they are rich in bivalent promoters occupied by H3K4me3 histone modification together with H3K27me3, which points to a possible functional role of PcG contacts in neurons. Importantly, other layers of chromatin organization also exhibit a distinct structure in neurons, characterized by an increase in short-range interactions and a decrease in long-range ones.
Collapse
Affiliation(s)
- Ilya A Pletenev
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Moscow 121205, Russia
| | - Maria Bazarevich
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Moscow 121205, Russia
| | - Diana R Zagirova
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Moscow 121205, Russia
- A.A. Kharkevich Institute for Information Transmission Problems, Moscow 127051, Russia
| | - Anna D Kononkova
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Moscow 121205, Russia
| | - Alexander V Cherkasov
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Moscow 121205, Russia
| | - Olga I Efimova
- Vladimir Zelman Center for Neurobiology and Brain Rehabilitation, Skolkovo Institute of Science and Technology, Moscow 121205, Russia
| | - Eugenia A Tiukacheva
- Department of Biological and Medical Physics, Moscow Institute of Physics and Technology, Moscow 141700, Russia
- Department of Molecular Biology, Faculty of Biology, M.V. Lomonosov Moscow State University, Moscow 119991, Russia
- CNRS UMR9018, Institut Gustave Roussy, Villejuif 94805, France
- Koltzov Institute of Developmental Biology, Russian Academy of Sciences, Moscow 119334, Russia
- Department of Cellular Genomics, Institute of Gene Biology, Russian Academy of Sciences, Moscow 119334, Russia
| | - Kirill V Morozov
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Moscow 121205, Russia
| | - Kirill A Ulianov
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Moscow 121205, Russia
| | - Dmitriy Komkov
- Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Institute of Gene Biology, Russian Academy of Sciences, Moscow 119334, Russia
| | - Anna V Tvorogova
- Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Institute of Gene Biology, Russian Academy of Sciences, Moscow 119334, Russia
| | - Vera E Golimbet
- Laboratory of Clinical Genetics, Mental Health Research Center, Moscow 115522, Russia
| | - Nikolay V Kondratyev
- Laboratory of Clinical Genetics, Mental Health Research Center, Moscow 115522, Russia
| | - Sergey V Razin
- Department of Molecular Biology, Faculty of Biology, M.V. Lomonosov Moscow State University, Moscow 119991, Russia
- Department of Cellular Genomics, Institute of Gene Biology, Russian Academy of Sciences, Moscow 119334, Russia
| | - Philipp Khaitovich
- Vladimir Zelman Center for Neurobiology and Brain Rehabilitation, Skolkovo Institute of Science and Technology, Moscow 121205, Russia
| | - Sergey V Ulianov
- Department of Molecular Biology, Faculty of Biology, M.V. Lomonosov Moscow State University, Moscow 119991, Russia
- Department of Cellular Genomics, Institute of Gene Biology, Russian Academy of Sciences, Moscow 119334, Russia
| | - Ekaterina E Khrameeva
- Center for Molecular and Cellular Biology, Skolkovo Institute of Science and Technology, Moscow 121205, Russia
| |
Collapse
|
8
|
Ergin EK, Myung JJ, Lange PF. Statistical Testing for Protein Equivalence Identifies Core Functional Modules Conserved across 360 Cancer Cell Lines and Presents a General Approach to Investigating Biological Systems. J Proteome Res 2024; 23:2169-2185. [PMID: 38804581 PMCID: PMC11166143 DOI: 10.1021/acs.jproteome.4c00131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Revised: 05/04/2024] [Accepted: 05/17/2024] [Indexed: 05/29/2024]
Abstract
Quantitative proteomics has enhanced our capability to study protein dynamics and their involvement in disease using various techniques, including statistical testing, to discern the significant differences between conditions. While most focus is on what is different between conditions, exploring similarities can provide valuable insights. However, exploring similarities directly from the analyte level, such as proteins, genes, or metabolites, is not a standard practice and is not widely adopted. In this study, we propose a statistical framework called QuEStVar (Quantitative Exploration of Stability and Variability through statistical hypothesis testing), enabling the exploration of quantitative stability and variability of features with a combined statistical framework. QuEStVar utilizes differential and equivalence testing to expand statistical classifications of analytes when comparing conditions. We applied our method to an extensive data set of cancer cell lines and revealed a quantitatively stable core proteome across diverse tissues and cancer subtypes. The functional analysis of this set of proteins highlighted the molecular mechanism of cancer cells to maintain constant conditions of the tumorigenic environment via biological processes, including transcription, translation, and nucleocytoplasmic transport.
Collapse
Affiliation(s)
- Enes K. Ergin
- Department
of Pathology, University of British Columbia, Vancouver, British Columbia V6T 1Z7, Canada
- Michael
Cuccione Childhood Cancer Research Program, BC Children’s Hospital Research Institute, Vancouver, British Columbia V5Z 2H4, Canada
| | - Junia J.K. Myung
- Department
of Pathology, University of British Columbia, Vancouver, British Columbia V6T 1Z7, Canada
- Michael
Cuccione Childhood Cancer Research Program, BC Children’s Hospital Research Institute, Vancouver, British Columbia V5Z 2H4, Canada
| | - Philipp F. Lange
- Department
of Pathology, University of British Columbia, Vancouver, British Columbia V6T 1Z7, Canada
- Michael
Cuccione Childhood Cancer Research Program, BC Children’s Hospital Research Institute, Vancouver, British Columbia V5Z 2H4, Canada
| |
Collapse
|
9
|
Park SJ, Nakai K. A computational approach for deciphering the interactions between proximal and distal gene regulators in GC B-cell response. NAR Genom Bioinform 2024; 6:lqae050. [PMID: 38711859 PMCID: PMC11071120 DOI: 10.1093/nargab/lqae050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2023] [Revised: 04/15/2024] [Accepted: 04/27/2024] [Indexed: 05/08/2024] Open
Abstract
Delineating the intricate interplay between promoter-proximal and -distal regulators is crucial for understanding the function of transcriptional mediator complexes implicated in the regulation of gene expression. The present study aimed to develop a computational method for accurately modeling the spatial proximal and distal regulatory interactions. Our method combined regression-based models to identify key regulators through gene expression prediction and a graph-embedding approach to detect coregulated genes. This approach enabled a detailed investigation of the gene regulatory mechanisms for germinal center B cells, accompanied by dramatic rearrangements of the genome structure. We found that while the promoter-proximal regulatory elements were the principal regulators of gene expression, the distal regulators fine-tuned transcription. Moreover, our approach unveiled the presence of modular regulators, such as cofactors and proximal/distal transcription factors, which were co-expressed with their target genes. Some of these modules exhibited abnormal expression patterns in lymphoma. These findings suggest that the dysregulation of interactions between transcriptional and architectural factors is associated with chromatin reorganization failure, which may increase the risk of malignancy. Therefore, our computational approach helps decipher the transcriptional cis-regulatory code spatially interacting.
Collapse
Affiliation(s)
- Sung-Joon Park
- Human Genome Center, The Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan
| | - Kenta Nakai
- Human Genome Center, The Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo 108-8639, Japan
| |
Collapse
|
10
|
Chen B, Ren C, Ouyang Z, Xu J, Xu K, Li Y, Guo H, Bai X, Tian M, Xu X, Wang Y, Li H, Bo X, Chen H. Stratifying TAD boundaries pinpoints focal genomic regions of regulation, damage, and repair. Brief Bioinform 2024; 25:bbae306. [PMID: 38935071 PMCID: PMC11210073 DOI: 10.1093/bib/bbae306] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Revised: 06/01/2024] [Accepted: 06/13/2024] [Indexed: 06/28/2024] Open
Abstract
Advances in chromatin mapping have exposed the complex chromatin hierarchical organization in mammals, including topologically associating domains (TADs) and their substructures, yet the functional implications of this hierarchy in gene regulation and disease progression are not fully elucidated. Our study delves into the phenomenon of shared TAD boundaries, which are pivotal in maintaining the hierarchical chromatin structure and regulating gene activity. By integrating high-resolution Hi-C data, chromatin accessibility, and DNA double-strand breaks (DSBs) data from various cell lines, we systematically explore the complex regulatory landscape at high-level TAD boundaries. Our findings indicate that these boundaries are not only key architectural elements but also vibrant hubs, enriched with functionally crucial genes and complex transcription factor binding site-clustered regions. Moreover, they exhibit a pronounced enrichment of DSBs, suggesting a nuanced interplay between transcriptional regulation and genomic stability. Our research provides novel insights into the intricate relationship between the 3D genome structure, gene regulation, and DNA repair mechanisms, highlighting the role of shared TAD boundaries in maintaining genomic integrity and resilience against perturbations. The implications of our findings extend to understanding the complexities of genomic diseases and open new avenues for therapeutic interventions targeting the structural and functional integrity of TAD boundaries.
Collapse
Affiliation(s)
- Bijia Chen
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Chao Ren
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Zhangyi Ouyang
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Jingxuan Xu
- Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Gastrointestinal Surgery, Peking University Cancer Hospital & Institute, Beijing 100142, China
| | - Kang Xu
- School of Software, Shandong University, Jinan 250101, China
| | - Yaru Li
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Hejiang Guo
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Xuemei Bai
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Mengge Tian
- The First Affiliated Hospital of Harbin Medical University, Harbin 150001, China
| | - Xiang Xu
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Yuyang Wang
- College of Computer and Data Science, Fuzhou University, Fuzhou 350108, China
| | - Hao Li
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Xiaochen Bo
- Academy of Military Medical Sciences, Beijing 100850, China
| | - Hebing Chen
- Academy of Military Medical Sciences, Beijing 100850, China
| |
Collapse
|
11
|
Hao J, Li T, Heinzelmann M, Moussaud-Lamodière E, Lebre F, Krjutškov K, Damdimopoulos A, Arnelo C, Pettersson K, Alfaro-Moreno E, Lindskog C, van Duursen M, Damdimopoulou P. Effects of chemical in vitro activation versus fragmentation on human ovarian tissue and follicle growth in culture. Hum Reprod Open 2024; 2024:hoae028. [PMID: 38803550 PMCID: PMC11128059 DOI: 10.1093/hropen/hoae028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 04/15/2024] [Indexed: 05/29/2024] Open
Abstract
STUDY QUESTION What is the effect of the chemical in vitro activation (cIVA) protocol compared with fragmentation only (Frag, also known as mechanical IVA) on gene expression, follicle activation and growth in human ovarian tissue in vitro? SUMMARY ANSWER Although histological assessment shows that cIVA significantly increases follicle survival and growth compared to Frag, both protocols stimulate extensive and nearly identical transcriptomic changes in cultured tissue compared to freshly collected ovarian tissue, including marked changes in energy metabolism and inflammatory responses. WHAT IS KNOWN ALREADY Treatments based on cIVA of the phosphatase and tensin homolog (PTEN)-phosphatidylinositol 3-kinase (PI3K) pathway in ovarian tissue followed by auto-transplantation have been administered to patients with refractory premature ovarian insufficiency (POI) and resulted in live births. However, comparable effects with mere tissue fragmentation have been shown, questioning the added value of chemical stimulation that could potentially activate oncogenic responses. STUDY DESIGN SIZE DURATION Fifty-nine ovarian cortical biopsies were obtained from consenting women undergoing elective caesarean section (C-section). The samples were fragmented for culture studies. Half of the fragments were exposed to bpV (HOpic)+740Y-P (Frag+cIVA group) during the first 24 h of culture, while the other half were cultured with medium only (Frag group). Subsequently, both groups were cultured with medium only for an additional 6 days. Tissue and media samples were collected for histological, transcriptomic, steroid hormone, and cytokine/chemokine analyses at various time points. PARTICIPANTS/MATERIALS SETTING METHODS Effects on follicles were evaluated by counting and scoring serial sections stained with hematoxylin and eosin before and after the 7-day culture. Follicle function was assessed by quantification of steroids by ultra-performance liquid chromatography tandem-mass spectrometry at different time points. Cytokines and chemokines were measured by multiplex assay. Transcriptomic effects were measured by RNA-sequencing (RNA-seq) of the tissue after the initial 24-h culture. Selected differentially expressed genes (DEGs) were validated by quantitative PCR and immunofluorescence in cultured ovarian tissue as well as in KGN cell (human ovarian granulosa-like tumor cell line) culture experiments. MAIN RESULTS AND THE ROLE OF CHANCE Compared to the Frag group, the Frag+cIVA group exhibited a significantly higher follicle survival rate, increased numbers of secondary follicles, and larger follicle sizes. Additionally, the tissue in the Frag+cIVA group produced less dehydroepiandrosterone compared to Frag. Cytokine measurement showed a strong inflammatory response at the start of the culture in both groups. The RNA-seq data revealed modest differences between the Frag+cIVA and Frag groups, with only 164 DEGs identified using a relaxed cut-off of false discovery rate (FDR) <0.1. Apart from the expected PI3K-protein kinase B (Akt) pathway, cIVA also regulated pathways related to hypoxia, cytokines, and inflammation. In comparison to freshly collected ovarian tissue, gene expression in general was markedly affected in both the Frag+cIVA and Frag groups, with a total of 3119 and 2900 DEGs identified (FDR < 0.001), respectively. The top enriched gene sets in both groups included several pathways known to modulate follicle growth such as mammalian target of rapamycin (mTOR)C1 signaling. Significant changes compared to fresh tissue were also observed in the expression of genes encoding for steroidogenesis enzymes and classical granulosa cell markers in both groups. Intriguingly, we discovered a profound upregulation of genes related to glycolysis and its upstream regulator in both Frag and Frag+cIVA groups, and these changes were further boosted by the cIVA treatment. Cell culture experiments confirmed glycolysis-related genes as direct targets of the cIVA drugs. In conclusion, cIVA enhances follicle growth, as expected, but the mechanisms may be more complex than PI3K-Akt-mTOR alone, and the impact on function and quality of the follicles after the culture period remains an open question. LARGE SCALE DATA Data were deposited in the GEO data base, accession number GSE234765. The code for sequencing analysis can be found in https://github.com/tialiv/IVA_project. LIMITATIONS REASONS FOR CAUTION Similar to the published IVA protocols, the first steps in our study were performed in an in vitro culture model where the ovarian tissue was isolated from the regulation of hypothalamic-pituitary-ovarian axis. Further in vivo experiments will be needed, for example in xeno-transplantation models, to explore the long-term impacts of the discovered effects. The tissue collected from patients undergoing C-section may not be comparable to tissue of patients with POI. WIDER IMPLICATIONS OF THE FINDINGS The general impact of fragmentation and short (24 h) in vitro culture on gene expression in ovarian tissue far exceeded the effects of cIVA. Yet, follicle growth was stimulated by cIVA, which may suggest effects on specific cell populations that may be diluted in bulk RNA-seq. Nevertheless, we confirmed the impact of cIVA on glycolysis using a cell culture model, suggesting impacts on cellular signaling beyond the PI3K pathway. The profound changes in inflammation and glycolysis following fragmentation and culture could contribute to follicle activation and loss in ovarian tissue culture, as well as in clinical applications, such as fertility preservation by ovarian tissue auto-transplantation. STUDY FUNDING/COMPETING INTERESTS This study was funded by research grants from European Union's Horizon 2020 Research and Innovation Programme (Project ERIN No. 952516, FREIA No. 825100), Swedish Research Council VR (2020-02132), StratRegen funding from Karolinska Institutet, KI-China Scholarship Council (CSC) Programme and the Natural Science Foundation of Hunan (2022JJ40782). International Iberian Nanotechnology Laboratory Research was funded by the European Union's H2020 Project Sinfonia (857253) and SbDToolBox (NORTE-01-0145-FEDER-000047), supported by Norte Portugal Regional Operational Programme (NORTE 2020), under the PORTUGAL 2020 Partnership Agreement, through the European Regional Development Fund. No competing interests are declared.
Collapse
Affiliation(s)
- Jie Hao
- Department of Reproductive Medicine, Xiangya Hospital, Central South University, Changsha, P.R. China
- Department of Gynecology and Reproductive Medicine, Karolinska University Hospital, Stockholm, Sweden
- Division of Obstetrics and Gynecology, Department of Clinical Science, Intervention and Technology, Karolinska Institutet, Stockholm, Sweden
| | - Tianyi Li
- Department of Gynecology and Reproductive Medicine, Karolinska University Hospital, Stockholm, Sweden
- Division of Obstetrics and Gynecology, Department of Clinical Science, Intervention and Technology, Karolinska Institutet, Stockholm, Sweden
| | - Manuel Heinzelmann
- Department of Environment and Health, Amsterdam Institute for Life and Environment, Amsterdam, The Netherlands
| | - Elisabeth Moussaud-Lamodière
- Department of Gynecology and Reproductive Medicine, Karolinska University Hospital, Stockholm, Sweden
- Division of Obstetrics and Gynecology, Department of Clinical Science, Intervention and Technology, Karolinska Institutet, Stockholm, Sweden
| | - Filipa Lebre
- Nanosafety Group, International Iberian Nanotechnology Laboratory, Braga, Portugal
| | - Kaarel Krjutškov
- Faculty of Medicine, Institute of Clinical Medicine, University of Tartu, Tartu, Estonia
- Competence Centre on Health Technologies, Tartu, Estonia
| | | | - Catarina Arnelo
- Division of Obstetrics and Gynecology, Department of Clinical Science, Intervention and Technology, Karolinska Institutet, Stockholm, Sweden
| | - Karin Pettersson
- Division of Obstetrics and Gynecology, Department of Clinical Science, Intervention and Technology, Karolinska Institutet, Stockholm, Sweden
| | | | - Cecilia Lindskog
- Department of Immunology, Genetics and Pathology, Cancer Precision Medicine Research Program, Uppsala University, Uppsala, Sweden
| | - Majorie van Duursen
- Department of Environment and Health, Amsterdam Institute for Life and Environment, Amsterdam, The Netherlands
| | - Pauliina Damdimopoulou
- Department of Gynecology and Reproductive Medicine, Karolinska University Hospital, Stockholm, Sweden
- Division of Obstetrics and Gynecology, Department of Clinical Science, Intervention and Technology, Karolinska Institutet, Stockholm, Sweden
| |
Collapse
|
12
|
Hounkpe BW, Sales LP, Ribeiro SCR, Perez MO, Caparbo VF, Domiciano DS, Figueiredo CP, Pereira RMR, Borba EF. Transcriptomic signatures of classical monocytes reveal pro-inflammatory modules and heterogeneity in polyarticular juvenile idiopathic arthritis. Front Immunol 2024; 15:1400036. [PMID: 38835762 PMCID: PMC11148224 DOI: 10.3389/fimmu.2024.1400036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Accepted: 04/30/2024] [Indexed: 06/06/2024] Open
Abstract
Introduction Polyarticular juvenile idiopathic arthritis (pJIA) is a childhood-onset autoimmune disease. Immune cells contribute to persistent inflammation observed in pJIA. Despite the crucial role of monocytes in arthritis, the precise involvement of classical monocytes in the pathogenesis of pJIA remains uncertain. Here, we aimed to uncover the transcriptomic patterns of classical monocytes in pJIA, focusing on their involvement in disease mechanism and heterogeneity. Methods A total of 17 healthy subjects and 18 premenopausal women with pJIA according to ILAR criteria were included. Classical monocytes were isolated, and RNA sequencing was performed. Differential expression analysis was used to compare pJIA patients and healthy control group. Differentially expressed genes (DEGs) were identified, and gene set enrichment analysis (GSEA) was performed. Using unsupervised learning approach, patients were clustered in two groups based on their similarities at transcriptomic level. Subsequently, these clusters underwent a comparative analysis to reveal differences at the transcriptomic level. Results We identified 440 DEGs in pJIA patients of which 360 were upregulated and 80 downregulated. GSEA highlighted TNF-α and IFN-γ response. Importantly, this analysis not only detected genes targeted by pJIA therapy but also identified new modulators of immuno-inflammation. PLAUR, IL1B, IL6, CDKN1A, PIM1, and ICAM1 were pointed as drivers of chronic hyperinflammation. Unsupervised learning approach revealed two clusters within pJIA, each exhibiting varying inflammation levels. Conclusion These findings indicate the pivotal role of immuno-inflammation driven by classical monocytes in pJIA and reveals the existence of two subclusters within pJIA, regardless the positivity of rheumatoid factor and anti-CCP, paving the way to precision medicine.
Collapse
Affiliation(s)
- Bidossessi W Hounkpe
- Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo (HCFMUSP), Sao Paulo, Brazil
| | - Lucas P Sales
- Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo (HCFMUSP), Sao Paulo, Brazil
| | - Surian C R Ribeiro
- Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo (HCFMUSP), Sao Paulo, Brazil
| | - Mariana O Perez
- Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo (HCFMUSP), Sao Paulo, Brazil
| | - Valéria F Caparbo
- Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo (HCFMUSP), Sao Paulo, Brazil
| | - Diogo Souza Domiciano
- Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo (HCFMUSP), Sao Paulo, Brazil
| | - Camille P Figueiredo
- Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo (HCFMUSP), Sao Paulo, Brazil
| | - Rosa M R Pereira
- Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo (HCFMUSP), Sao Paulo, Brazil
| | - Eduardo F Borba
- Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo (HCFMUSP), Sao Paulo, Brazil
| |
Collapse
|
13
|
Sadikan MZ, Abdul Nasir NA, Ibahim MJ, Iezhitsa I, Agarwal R. Identifying the stability of housekeeping genes to be used for the quantitative real-time PCR normalization in retinal tissue of streptozotocin-induced diabetic rats. Int J Ophthalmol 2024; 17:794-805. [PMID: 38766348 PMCID: PMC11074185 DOI: 10.18240/ijo.2024.05.02] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 02/23/2024] [Indexed: 05/22/2024] Open
Abstract
AIM To investigate the stability of the seven housekeeping genes: beta-actin (ActB), glyceraldehyde-3-phosphate dehydrogenase (GAPDH), 18s ribosomal unit 5 (18s), cyclophilin A (CycA), hypoxanthine-guanine phosphoribosyl transferase (HPRT), ribosomal protein large P0 (36B4) and terminal uridylyl transferase 1 (U6) in the diabetic retinal tissue of rat model. METHODS The expression of these seven genes in rat retinal tissues was determined using real-time quantitative reverse transcription polymerase chain reaction (RT-qPCR) in two groups; normal control rats and streptozotocin-induced diabetic rats. The stability analysis of gene expression was investigated using geNorm, NormFinder, BestKeeper, and comparative delta-Ct (ΔCt) algorithms. RESULTS The 36B4 gene was stably expressed in the retinal tissues of normal control animals; however, it was less stable in diabetic retinas. The 18s gene was expressed consistently in both normal control and diabetic rats' retinal tissue. That this gene was the best reference for data normalisation in RT-qPCR studies that used the retinal tissue of streptozotocin-induced diabetic rats. Furthermore, there was no ideal gene stably expressed for use in all experimental settings. CONCLUSION Identifying relevant genes is a need for achieving RT-qPCR validity and reliability and must be appropriately achieved based on a specific experimental setting.
Collapse
Affiliation(s)
- Muhammad Zulfiqah Sadikan
- Department of Pharmacology, Faculty of Medicine, Manipal University College Malaysia (MUCM), Bukit Baru, Melaka 75150, Malaysia
| | - Nurul Alimah Abdul Nasir
- Centre for Neuroscience Research (NeuRon), Faculty of Medicine, Universiti Teknologi MARA, Sungai Buloh Campus, Sungai Buloh, Selangor 47000, Malaysia
- Department of Medical Education, Faculty of Medicine, Universiti Teknologi MARA, Sungai Buloh Campus, Sungai Buloh, Selangor 47000, Malaysia
| | - Mohammad Johari Ibahim
- Department of Biochemistry and Molecular Medicine, Faculty of Medicine, Universiti Teknologi MARA, Sungai Buloh Campus, Sungai Buloh, Selangor 47000, Malaysia
| | - Igor Iezhitsa
- School of Medicine, International Medical University, Bukit Jalil, Kuala Lumpur 57000, Malaysia
- Department of Pharmacology and Bioinformatics, Volgograd State Medical University, Volgograd 400131, Russia
| | - Renu Agarwal
- School of Medicine, International Medical University, Bukit Jalil, Kuala Lumpur 57000, Malaysia
| |
Collapse
|
14
|
Tordai H, Torres O, Csepi M, Padányi R, Lukács GL, Hegedűs T. Analysis of AlphaMissense data in different protein groups and structural context. Sci Data 2024; 11:495. [PMID: 38744964 PMCID: PMC11094042 DOI: 10.1038/s41597-024-03327-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Accepted: 04/29/2024] [Indexed: 05/16/2024] Open
Abstract
Single amino acid substitutions can profoundly affect protein folding, dynamics, and function. The ability to discern between benign and pathogenic substitutions is pivotal for therapeutic interventions and research directions. Given the limitations in experimental examination of these variants, AlphaMissense has emerged as a promising predictor of the pathogenicity of missense variants. Since heterogenous performance on different types of proteins can be expected, we assessed the efficacy of AlphaMissense across several protein groups (e.g. soluble, transmembrane, and mitochondrial proteins) and regions (e.g. intramembrane, membrane interacting, and high confidence AlphaFold segments) using ClinVar data for validation. Our comprehensive evaluation showed that AlphaMissense delivers outstanding performance, with MCC scores predominantly between 0.6 and 0.74. We observed low performance on disordered datasets and ClinVar data related to the CFTR ABC protein. However, a superior performance was shown when benchmarked against the high quality CFTR2 database. Our results with CFTR emphasizes AlphaMissense's potential in pinpointing functional hot spots, with its performance likely surpassing benchmarks calculated from ClinVar and ProteinGym datasets.
Collapse
Affiliation(s)
- Hedvig Tordai
- Institute of Biophysics and Radiation Biology, Semmelweis University, Budapest, Hungary
| | - Odalys Torres
- Institute of Biophysics and Radiation Biology, Semmelweis University, Budapest, Hungary
| | - Máté Csepi
- Institute of Biophysics and Radiation Biology, Semmelweis University, Budapest, Hungary
| | - Rita Padányi
- Institute of Biophysics and Radiation Biology, Semmelweis University, Budapest, Hungary
| | - Gergely L Lukács
- Department of Physiology and Biochemistry, McGill University, Montréal, QC, Canada
| | - Tamás Hegedűs
- Institute of Biophysics and Radiation Biology, Semmelweis University, Budapest, Hungary.
- HUN-REN-SU Biophysical Virology Research Group, Budapest, Hungary.
| |
Collapse
|
15
|
González JT, Thrush K, Meer M, Levine ME, Higgins-Chen AT. Age-Invariant Genes: Multi-Tissue Identification and Characterization of Murine Reference Genes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.09.588721. [PMID: 38645168 PMCID: PMC11030416 DOI: 10.1101/2024.04.09.588721] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/23/2024]
Abstract
Studies of the aging transcriptome focus on genes that change with age. But what can we learn from age-invariant genes-those that remain unchanged throughout the aging process? These genes also have a practical application: they serve as reference genes (often called housekeeping genes) in expression studies. Reference genes have mostly been identified and validated in young organisms, and no systematic investigation has been done across the lifespan. Here, we build upon a common pipeline for identifying reference genes in RNA-seq datasets to identify age-invariant genes across seventeen C57BL/6 mouse tissues (brain, lung, bone marrow, muscle, white blood cells, heart, small intestine, kidney, liver, pancreas, skin, brown, gonadal, marrow, and subcutaneous adipose tissue) spanning 1 to 21+ months of age. We identify 9 pan-tissue age-invariant genes and many tissue-specific age-invariant genes. These genes are stable across the lifespan and are validated in independent bulk RNA-seq datasets and RT-qPCR. We find age-invariant genes have shorter transcripts on average and are enriched for CpG islands. Interestingly, pathway enrichment analysis for age-invariant genes identifies an overrepresentation of molecular functions associated with some, but not all, hallmarks of aging. Thus, though hallmarks of aging typically involve changes in cell maintenance mechanisms, select genes associated with these hallmarks resist fluctuations in expression with age. Finally, our analysis concludes no classical reference gene is appropriate for aging studies in all tissues. Instead, we provide tissue-specific and pan-tissue genes for assays utilizing reference gene normalization (i.e., RT-qPCR) that can be applied to animals across the lifespan.
Collapse
Affiliation(s)
- John T. González
- Department of Pathology, Yale University School of Medicine, New Haven, CT, USA
| | - Kyra Thrush
- Altos Labs, San Diego Institute of Sciences, San Diego, CA, USA
| | - Margarita Meer
- Altos Labs, San Diego Institute of Sciences, San Diego, CA, USA
| | - Morgan E. Levine
- Department of Pathology, Yale University School of Medicine, New Haven, CT, USA
- Altos Labs, San Diego Institute of Sciences, San Diego, CA, USA
| | - Albert T. Higgins-Chen
- Department of Pathology, Yale University School of Medicine, New Haven, CT, USA
- Department of Psychiatry, Yale University School of Medicine, New Haven CT, USA
| |
Collapse
|
16
|
Abnizova I, Stapel C, Boekhorst RT, Lee JTH, Hemberg M. Integrative analysis of transcriptomic and epigenomic data reveals distinct patterns for developmental and housekeeping gene regulation. BMC Biol 2024; 22:78. [PMID: 38600550 PMCID: PMC11005181 DOI: 10.1186/s12915-024-01869-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2023] [Accepted: 03/14/2024] [Indexed: 04/12/2024] Open
Abstract
BACKGROUND Regulation of transcription is central to the emergence of new cell types during development, and it often involves activation of genes via proximal and distal regulatory regions. The activity of regulatory elements is determined by transcription factors (TFs) and epigenetic marks, but despite extensive mapping of such patterns, the extraction of regulatory principles remains challenging. RESULTS Here we study differentially and similarly expressed genes along with their associated epigenomic profiles, chromatin accessibility and DNA methylation, during lineage specification at gastrulation in mice. Comparison of the three lineages allows us to identify genomic and epigenomic features that distinguish the two classes of genes. We show that differentially expressed genes are primarily regulated by distal elements, while similarly expressed genes are controlled by proximal housekeeping regulatory programs. Differentially expressed genes are relatively isolated within topologically associated domains, while similarly expressed genes tend to be located in gene clusters. Transcription of differentially expressed genes is associated with differentially open chromatin at distal elements including enhancers, while that of similarly expressed genes is associated with ubiquitously accessible chromatin at promoters. CONCLUSION Based on these associations of (linearly) distal genes' transcription start sites (TSSs) and putative enhancers for developmental genes, our findings allow us to link putative enhancers to their target promoters and to infer lineage-specific repertoires of putative driver transcription factors, within which we define subgroups of pioneers and co-operators.
Collapse
Affiliation(s)
- Irina Abnizova
- Epigenetics Programme, Babraham Institute, Cambridge, UK
- Wellcome Sanger Institute, Hinxton, UK
| | - Carine Stapel
- Epigenetics Programme, Babraham Institute, Cambridge, UK
| | | | | | - Martin Hemberg
- Wellcome Sanger Institute, Hinxton, UK.
- The Gene Lay Institute of Immunology and Inflammation Brigham & Women's Hospital and Harvard Medical School, Boston, USA.
| |
Collapse
|
17
|
Zhao Z, Chen S, Wei H, Ma W, Shi W, Si Y, Wang J, Wang L, Li X. Online application for the diagnosis of atherosclerosis by six genes. PLoS One 2024; 19:e0301912. [PMID: 38598492 PMCID: PMC11006159 DOI: 10.1371/journal.pone.0301912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Accepted: 03/22/2024] [Indexed: 04/12/2024] Open
Abstract
BACKGROUND Atherosclerosis (AS) is a primary contributor to cardiovascular disease, leading to significant global mortality rates. Developing effective diagnostic indicators and models for AS holds the potential to substantially reduce the fatalities and disabilities associated with cardiovascular disease. Blood sample analysis has emerged as a promising avenue for facilitating diagnosis and assessing disease prognosis. Nonetheless, it lacks an accurate model or tool for AS diagnosis. Hence, the principal objective of this study is to develop a convenient, simple, and accurate model for the early detection of AS. METHODS We downloaded the expression data of blood samples from GEO databases. By dividing the mean values of housekeeping genes (meanHGs) and applying the comBat function, we aimed to reduce the batch effect. After separating the datasets into training, evaluation, and testing sets, we applied differential expression analyses (DEA) between AS and control samples from the training dataset. Then, a gradient-boosting model was used to evaluate the importance of genes and identify the hub genes. Using different machine learning algorithms, we constructed a prediction model with the highest accuracy in the testing dataset. Finally, we make the machine learning models publicly accessible by shiny app construction. RESULTS Seven datasets (GSE9874, GSE12288, GSE20129, GSE23746, GSE27034, GSE90074, and GSE202625), including 403 samples with AS and 325 healthy subjects, were obtained by comprehensive searching and filtering by specific requirements. The batch effect was successfully removed by dividing the meanHGs and applying the comBat function. 331 genes were found to be related to atherosclerosis by the DEA analysis between AS and health samples. The top 6 genes with the highest importance values from the gradient boosting model were identified. Out of the seven machine learning algorithms tested, the random forest model exhibited the most impressive performance in the testing datasets, achieving an accuracy exceeding 0.8. While the batch effect reduction analysis in our study could have contributed to the increased accuracy values, our comparison results further highlight the superiority of our model over the genes provided in published studies. This underscores the effectiveness of our approach in delivering superior predictive performance. The machine-learning models were then uploaded to the Shiny app's server, making it easy for users to distinguish AS samples from normal samples. CONCLUSIONS A prognostic Shiny application, built upon six potential atherosclerosis-associated genes, has been developed, offering an accurate diagnosis of atherosclerosis.
Collapse
Affiliation(s)
- Zunlan Zhao
- Department of General Medicine, Henan Provincial People’s Hospital, People’s Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Shouhang Chen
- Department of Infectious Diseases, Children’s Hospital Affiliated to Zhengzhou University, Henan Children’s Hospital, Zhengzhou Children’s Hospital, Henan, China
| | - Hongzhao Wei
- Department of Oncology, Henan Provincial People’s Hospital, People’s Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Weile Ma
- Department of General Medicine, Henan Provincial People’s Hospital, People’s Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Weili Shi
- Department of General Medicine, Henan Provincial People’s Hospital, People’s Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Yixin Si
- Department of General Medicine, Henan Provincial People’s Hospital, People’s Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Jun Wang
- Department of General Medicine, Henan Provincial People’s Hospital, People’s Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Liuyi Wang
- Department of General Medicine, Henan Provincial People’s Hospital, People’s Hospital of Zhengzhou University, Zhengzhou, Henan, China
| | - Xiqing Li
- Department of Oncology, Henan Provincial People’s Hospital, People’s Hospital of Zhengzhou University, Zhengzhou, Henan, China
| |
Collapse
|
18
|
Basurto-Cayuela L, Guerrero-Martínez JA, Gómez-Marín E, Sánchez-Escabias E, Escaño-Maestre M, Ceballos-Chávez M, Reyes JC. SWI/SNF-dependent genes are defined by their chromatin landscape. Cell Rep 2024; 43:113855. [PMID: 38427563 DOI: 10.1016/j.celrep.2024.113855] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Revised: 11/23/2023] [Accepted: 02/08/2024] [Indexed: 03/03/2024] Open
Abstract
SWI/SNF complexes are evolutionarily conserved, ATP-dependent chromatin remodeling machines. Here, we characterize the features of SWI/SNF-dependent genes using BRM014, an inhibitor of the ATPase activity of the complexes. We find that SWI/SNF activity is required to maintain chromatin accessibility and nucleosome occupancy for most enhancers but not for most promoters. SWI/SNF activity is needed for expression of genes with low to medium levels of expression that have promoters with (1) low chromatin accessibility, (2) low levels of active histone marks, (3) high H3K4me1/H3K4me3 ratio, (4) low nucleosomal phasing, and (5) enrichment in TATA-box motifs. These promoters are mostly occupied by the canonical Brahma-related gene 1/Brahma-associated factor (BAF) complex. These genes are surrounded by SWI/SNF-dependent enhancers and mainly encode signal transduction, developmental, and cell identity genes (with almost no housekeeping genes). Machine-learning models trained with different chromatin characteristics of promoters and their surrounding regulatory regions indicate that the chromatin landscape is a determinant for establishing SWI/SNF dependency.
Collapse
Affiliation(s)
- Laura Basurto-Cayuela
- Genome Biology Department, Centro Andaluz de Biología Molecular y Medicina Regenerativa-CABIMER, Consejo Superior de Investigaciones Científicas-Universidad de Sevilla-Universidad Pablo de Olavide (CSIC-USE-UPO), Av. Americo Vespucio, 41092 Seville, Spain
| | - José A Guerrero-Martínez
- Genome Biology Department, Centro Andaluz de Biología Molecular y Medicina Regenerativa-CABIMER, Consejo Superior de Investigaciones Científicas-Universidad de Sevilla-Universidad Pablo de Olavide (CSIC-USE-UPO), Av. Americo Vespucio, 41092 Seville, Spain
| | - Elena Gómez-Marín
- Genome Biology Department, Centro Andaluz de Biología Molecular y Medicina Regenerativa-CABIMER, Consejo Superior de Investigaciones Científicas-Universidad de Sevilla-Universidad Pablo de Olavide (CSIC-USE-UPO), Av. Americo Vespucio, 41092 Seville, Spain
| | - Elena Sánchez-Escabias
- Genome Biology Department, Centro Andaluz de Biología Molecular y Medicina Regenerativa-CABIMER, Consejo Superior de Investigaciones Científicas-Universidad de Sevilla-Universidad Pablo de Olavide (CSIC-USE-UPO), Av. Americo Vespucio, 41092 Seville, Spain
| | - María Escaño-Maestre
- Genome Biology Department, Centro Andaluz de Biología Molecular y Medicina Regenerativa-CABIMER, Consejo Superior de Investigaciones Científicas-Universidad de Sevilla-Universidad Pablo de Olavide (CSIC-USE-UPO), Av. Americo Vespucio, 41092 Seville, Spain
| | - María Ceballos-Chávez
- Genome Biology Department, Centro Andaluz de Biología Molecular y Medicina Regenerativa-CABIMER, Consejo Superior de Investigaciones Científicas-Universidad de Sevilla-Universidad Pablo de Olavide (CSIC-USE-UPO), Av. Americo Vespucio, 41092 Seville, Spain
| | - José C Reyes
- Genome Biology Department, Centro Andaluz de Biología Molecular y Medicina Regenerativa-CABIMER, Consejo Superior de Investigaciones Científicas-Universidad de Sevilla-Universidad Pablo de Olavide (CSIC-USE-UPO), Av. Americo Vespucio, 41092 Seville, Spain.
| |
Collapse
|
19
|
Toneyan S, Koo PK. Interpreting Cis-Regulatory Interactions from Large-Scale Deep Neural Networks for Genomics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.07.03.547592. [PMID: 37461616 PMCID: PMC10349992 DOI: 10.1101/2023.07.03.547592] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/28/2023]
Abstract
The rise of large-scale, sequence-based deep neural networks (DNNs) for predicting gene expression has introduced challenges in their evaluation and interpretation. Current evaluations align DNN predictions with experimental perturbation assays, which provides insights into the generalization capabilities within the studied loci but offers a limited perspective of what drives their predictions. Moreover, existing model explainability tools focus mainly on motif analysis, which becomes complex when interpreting longer sequences. Here we introduce CREME, an in silico perturbation toolkit that interrogates large-scale DNNs to uncover rules of gene regulation that it learns. Using CREME, we investigate Enformer, a prominent DNN in gene expression prediction, revealing cis-regulatory elements (CREs) that directly enhance or silence target genes. We explore the intricate complexity of higher-order CRE interactions, the relationship between CRE distance from transcription start sites on gene expression, as well as the biochemical features of enhancers and silencers learned by Enformer. Moreover, we demonstrate the flexibility of CREME to efficiently uncover a higher-resolution view of functional sequence elements within CREs. This work demonstrates how CREME can be employed to translate the powerful predictions of large-scale DNNs to study open questions in gene regulation.
Collapse
|
20
|
Lin KZ, Qiu Y, Roeder K. eSVD-DE: cohort-wide differential expression in single-cell RNA-seq data using exponential-family embeddings. BMC Bioinformatics 2024; 25:113. [PMID: 38486150 PMCID: PMC10941434 DOI: 10.1186/s12859-024-05724-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Accepted: 02/28/2024] [Indexed: 03/17/2024] Open
Abstract
BACKGROUND Single-cell RNA-sequencing (scRNA) datasets are becoming increasingly popular in clinical and cohort studies, but there is a lack of methods to investigate differentially expressed (DE) genes among such datasets with numerous individuals. While numerous methods exist to find DE genes for scRNA data from limited individuals, differential-expression testing for large cohorts of case and control individuals using scRNA data poses unique challenges due to substantial effects of human variation, i.e., individual-level confounding covariates that are difficult to account for in the presence of sparsely-observed genes. RESULTS We develop the eSVD-DE, a matrix factorization that pools information across genes and removes confounding covariate effects, followed by a novel two-sample test in mean expression between case and control individuals. In general, differential testing after dimension reduction yields an inflation of Type-1 errors. However, we overcome this by testing for differences between the case and control individuals' posterior mean distributions via a hierarchical model. In previously published datasets of various biological systems, eSVD-DE has more accuracy and power compared to other DE methods typically repurposed for analyzing cohort-wide differential expression. CONCLUSIONS eSVD-DE proposes a novel and powerful way to test for DE genes among cohorts after performing a dimension reduction. Accurate identification of differential expression on the individual level, instead of the cell level, is important for linking scRNA-seq studies to our understanding of the human population.
Collapse
Affiliation(s)
- Kevin Z Lin
- Department of Biostatistics, University of Washington, Seattle, WA, USA.
| | - Yixuan Qiu
- School of Statistics and Management, Shanghai University of Finance and Economics, Shanghai, People's Republic of China
| | - Kathryn Roeder
- Department of Statistics and Data Science, Carnegie Mellon University, Pittsburgh, PA, USA
| |
Collapse
|
21
|
Hembrom PS, Deepthi M, Biswas G, Mappurath B, Babu A, Reeja N, Mano N, Grace T. Reference genes for qPCR expression in black tiger shrimp, Penaeus monodon. Mol Biol Rep 2024; 51:422. [PMID: 38485790 DOI: 10.1007/s11033-024-09409-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2023] [Accepted: 03/01/2024] [Indexed: 03/19/2024]
Abstract
BACKGROUND Gene expression profiling via qPCR is an essential tool for unraveling the intricate molecular mechanisms underlying growth and development. Identifying and validating the most appropriate reference genes is essential for qPCR experiments. Nevertheless, there exists a deficiency in a thorough assessment of reference genes concerning the expression of the genes in the research in the context of the growth and development of the Black Tiger Shrimp, P. monodon. This popular marine crustacean is extensively raised for human consumption. In this study, we assessed the expression stability of seven reference genes (ACTB, 18S, EF-1α, AK, PK, cox1, and CLTC) in adult tissues (hepatopancreas, gills, and stomach) of small and large polymorphs of P. monodon. METHODS AND RESULTS The stability of gene expressions was assessed utilizing NormFinder, BestKeeper, and geNorm, and a comprehensive ranking of these genes was conducted through the online tool RefFinder. In the overall ranking, 18S and CLTC emerged as the most stable genes in the hepatopancreas and stomach, while CLTC and AK exhibited significant statistical reliability in the gills of adult P. monodon. The validation of these identified stable genes was carried out using a growth-associated gene, insr-1. CONCLUSION The results indicated that 18S and CLTC stand out as the most versatile reference genes for conducting qPCR analysis focused on the growth of P. monodon. This study represents the first comprehensive exploration that identifies and assesses reference genes for qPCR analysis in P. monodon, providing valuable tools for research involving similar crustaceans.
Collapse
Affiliation(s)
- Preety Sweta Hembrom
- Department of Genomic Science, School of Biological Sciences, Central University of Kerala, Kasaragod, Kerala, 671316, India
| | - Mottakunja Deepthi
- Department of Genomic Science, School of Biological Sciences, Central University of Kerala, Kasaragod, Kerala, 671316, India
| | - Gourav Biswas
- Department of Genomic Science, School of Biological Sciences, Central University of Kerala, Kasaragod, Kerala, 671316, India
| | - Bhagya Mappurath
- Department of Genomic Science, School of Biological Sciences, Central University of Kerala, Kasaragod, Kerala, 671316, India
| | - Adon Babu
- Department of Genomic Science, School of Biological Sciences, Central University of Kerala, Kasaragod, Kerala, 671316, India
| | - Narchikundil Reeja
- Department of Genomic Science, School of Biological Sciences, Central University of Kerala, Kasaragod, Kerala, 671316, India
| | - Neeraja Mano
- Department of Genomic Science, School of Biological Sciences, Central University of Kerala, Kasaragod, Kerala, 671316, India
| | - Tony Grace
- Department of Genomic Science, School of Biological Sciences, Central University of Kerala, Kasaragod, Kerala, 671316, India.
| |
Collapse
|
22
|
Luo N, Huang Q, Dong L, Liu W, Song J, Sun H, Wu H, Gao Y, Yi C. Near-cognate tRNAs increase the efficiency and precision of pseudouridine-mediated readthrough of premature termination codons. Nat Biotechnol 2024:10.1038/s41587-024-02165-8. [PMID: 38448662 DOI: 10.1038/s41587-024-02165-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 02/02/2024] [Indexed: 03/08/2024]
Abstract
Programmable RNA pseudouridylation has emerged as a new type of RNA base editor to suppress premature termination codons (PTCs) that can lead to truncated and nonfunctional proteins. However, current methods to correct disease-associated PTCs suffer from low efficiency and limited precision. Here we develop RESTART v3, which uses near-cognate tRNAs to improve the readthrough efficiency of pseudouridine-modified PTCs. We show an average of ~5-fold (range: 2.1- to 9.5-fold) higher editing efficiency than RESTART v2 in cultured cells and achieve functional PTC readthrough in disease cell models of cystic fibrosis and Hurler syndrome. Furthermore, RESTART v3 enables accurate incorporation of the original amino acid for nearly half of the PTC sites, considering the naturally occurring frequencies of sense-to-nonsense codons, without affecting normal termination codons. Although off-target sites were detected, we did not observe changes to the coding information or the expression level of transcripts, and the overall natural tRNA abundance remained constant.
Collapse
Affiliation(s)
- Nan Luo
- State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing, China
| | - Qiang Huang
- State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing, China
| | - Liting Dong
- State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Wenqing Liu
- School of Life Sciences, Tsinghua University, Beijing, China
- Tsinghua-Peking Joint Center for Life Sciences, Tsinghua University, Beijing, China
| | - Jinghui Song
- State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing, China
| | - Hanxiao Sun
- State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing, China
| | - Hao Wu
- State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing, China
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
- Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Yuan Gao
- Modit Therapeutics Beijing Limited, K115 Beijing ATLATL International Innovation Platform, Beijing, China
| | - Chengqi Yi
- State Key Laboratory of Protein and Plant Gene Research, School of Life Sciences, Peking University, Beijing, China.
- Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China.
- Department of Chemical Biology and Synthetic and Functional Biomolecules Center, College of Chemistry and Molecular Engineering, Peking University, Beijing, China.
- Beijing Advanced Center of RNA Biology (BEACON), Peking University, Beijing, China.
| |
Collapse
|
23
|
Lin KZ, Qiu Y, Roeder K. eSVD-DE: Cohort-wide differential expression in single-cell RNA-seq data using exponential-family embeddings. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.22.568369. [PMID: 38045428 PMCID: PMC10690270 DOI: 10.1101/2023.11.22.568369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
Background Single-cell RNA-sequencing (scRNA) datasets are becoming increasingly popular in clinical and cohort studies, but there is a lack of methods to investigate differentially expressed (DE) genes among such datasets with numerous individuals. While numerous methods exist to find DE genes for scRNA data from limited individuals, differential-expression testing for large cohorts of case and control individuals using scRNA data poses unique challenges due to substantial effects of human variation, i.e., individual-level confounding covariates that are difficult to account for in the presence of sparsely-observed genes. Results We develop the eSVD-DE, a matrix factorization that pools information across genes and removes confounding covariate effects, followed by a novel two-sample test in mean expression between case and control individuals. In general, differential testing after dimension reduction yields an inflation of Type-1 errors. However, we overcome this by testing for differences between the case and control individuals' posterior mean distributions via a hierarchical model. In previously published datasets of various biological systems, eSVD-DE has more accuracy and power compared to other DE methods typically repurposed for analyzing cohort-wide differential expression. Conclusions eSVD-DE proposes a novel and powerful way to test for DE genes among cohorts after performing a dimension reduction. Accurate identification of differential expression on the individual level, instead of the cell level, is important for linking scRNA-seq studies to our understanding of the human population.
Collapse
Affiliation(s)
- Kevin Z Lin
- Department of Biostatistics, University of Washington, Seattle, Washington, United States of America
| | - Yixuan Qiu
- School of Statistics & Management, Shanghai University of Finance and Economics, Shanghai,People's Republic of China
| | - Kathryn Roeder
- Department of Statistics & Data Science, Carnegie Mellon University, Pittsburgh, Pennsylvania, United States of America
| |
Collapse
|
24
|
Lin Y, Dong ZH, Ye TY, Yang JM, Xie M, Luo JC, Gao J, Guo AY. Optimization of FFPE preparation and identification of gene attributes associated with RNA degradation. NAR Genom Bioinform 2024; 6:lqae008. [PMID: 38298182 PMCID: PMC10830353 DOI: 10.1093/nargab/lqae008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 12/30/2023] [Accepted: 01/16/2024] [Indexed: 02/02/2024] Open
Abstract
Formalin-fixed paraffin-embedded (FFPE) tissues are widely available specimens for clinical studies. However, RNA degradation in FFPE tissues often restricts their utility. In this study, we determined optimal FFPE preparation conditions, including tissue ischemia at 4°C (<48 h) or 25°C for a short time (0.5 h), 48-h fixation at 25°C and sampling from FFPE scrolls instead of sections. Notably, we observed an increase in intronic reads and a significant change in gene rank based on expression level in the FFPE as opposed to fresh-frozen (FF) samples. Additionally, we found that more reads were mapped to genes associated with chemical stimulus in FFPE samples. Furthermore, we demonstrated that more degraded genes in FFPE samples were enriched in genes with short transcripts and high free energy. Besides, we found 40 housekeeping genes exhibited stable expression in FF and FFPE samples across various tissues. Moreover, our study showed that FFPE samples yielded comparable results to FF samples in dimensionality reduction and pathway analyses between case and control samples. Our study established the optimal conditions for FFPE preparation and identified gene attributes associated with degradation, which would provide useful clues for the utility of FFPE tissues in clinical practice and research.
Collapse
Affiliation(s)
- Yu Lin
- Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
- Department of thoracic surgery, West China Biomedical Big Data Center, West China Hospital, Med-X Center for Informatics, Sichuan University, Chengdu 610041, China
| | - Zhou-Huan Dong
- The First Medical Center, Chinese People’s Liberation Army (PLA) General Hospital, Beijing 100853, China
| | - Ting-Yue Ye
- Nanjing Vazyme Biotech Co., Ltd., Nanjing 210000, China
| | - Jing-Min Yang
- Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
| | - Mei Xie
- Department of Respiratory and Critical Care, Chinese People’s Liberation Army (PLA) General Hospital, Beijing 100853, China
| | | | - Jie Gao
- The First Medical Center, Chinese People’s Liberation Army (PLA) General Hospital, Beijing 100853, China
| | - An-Yuan Guo
- Hubei Bioinformatics and Molecular Imaging Key Laboratory, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, China
- Department of thoracic surgery, West China Biomedical Big Data Center, West China Hospital, Med-X Center for Informatics, Sichuan University, Chengdu 610041, China
| |
Collapse
|
25
|
Barvaux S, Okawa S, Del Sol A. SinCMat: A single-cell-based method for predicting functional maturation transcription factors. Stem Cell Reports 2024; 19:270-284. [PMID: 38215756 PMCID: PMC10874865 DOI: 10.1016/j.stemcr.2023.12.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Revised: 12/13/2023] [Accepted: 12/13/2023] [Indexed: 01/14/2024] Open
Abstract
A major goal of regenerative medicine is to generate tissue-specific mature and functional cells. However, current cell engineering protocols are still unable to systematically produce fully mature functional cells. While existing computational approaches aim at predicting transcription factors (TFs) for cell differentiation/reprogramming, no method currently exists that specifically considers functional cell maturation processes. To address this challenge, here, we develop SinCMat, a single-cell RNA sequencing (RNA-seq)-based computational method for predicting cell maturation TFs. Based on a model of cell maturation, SinCMat identifies pairs of identity TFs and signal-dependent TFs that co-target genes driving functional maturation. A large-scale application of SinCMat to the Mouse Cell Atlas and Tabula Sapiens accurately recapitulates known maturation TFs and predicts novel candidates. We expect SinCMat to be an important resource, complementary to preexisting computational methods, for studies aiming at producing functionally mature cells.
Collapse
Affiliation(s)
- Sybille Barvaux
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6 Avenue du Swing, 4367 Esch-Belval Esch-sur-Alzette, Luxembourg
| | - Satoshi Okawa
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6 Avenue du Swing, 4367 Esch-Belval Esch-sur-Alzette, Luxembourg; University of Pittsburgh School of Medicine, Vascular Medicine Institute, Department of Computational and Systems Biology, McGowan Institute for Regenerative Medicine, Pittsburgh, PA, USA
| | - Antonio Del Sol
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6 Avenue du Swing, 4367 Esch-Belval Esch-sur-Alzette, Luxembourg; CIC bioGUNE-BRTA (Basque Research and Technology Alliance), Bizkaia Technology Park, 801 Building, 48160 Derio, Spain; IKERBASQUE, Basque Foundation for Science, 48013 Bilbao, Spain.
| |
Collapse
|
26
|
Loza M, Vandenbon A, Nakai K. Epigenetic characterization of housekeeping core promoters and their importance in tumor suppression. Nucleic Acids Res 2024; 52:1107-1119. [PMID: 38084904 PMCID: PMC10853790 DOI: 10.1093/nar/gkad1164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 11/09/2023] [Accepted: 11/20/2023] [Indexed: 02/10/2024] Open
Abstract
In this research, we elucidate the presence of around 11,000 housekeeping cis-regulatory elements (HK-CREs) and describe their main characteristics. Besides the trivial promoters of housekeeping genes, most HK-CREs reside in promoter regions and are involved in a broader role beyond housekeeping gene regulation. HK-CREs are conserved regions rich in unmethylated CpG sites. Their distribution highly correlates with that of protein-coding genes, and they interact with many genes over long distances. We observed reduced activity of a subset of HK-CREs in diverse cancer subtypes due to aberrant methylation, particularly those located in chromosome 19 and associated with zinc finger genes. Further analysis of samples from 17 cancer subtypes showed a significantly increased survival probability of patients with higher expression of these genes, suggesting them as housekeeping tumor suppressor genes. Overall, our work unravels the presence of housekeeping CREs indispensable for the maintenance and stability of cells.
Collapse
Affiliation(s)
- Martin Loza
- The Institute of Medical Science, The University of Tokyo, Japan
| | - Alexis Vandenbon
- Institute for Life and Medical Sciences, Kyoto University, Japan
| | - Kenta Nakai
- The Institute of Medical Science, The University of Tokyo, Japan
| |
Collapse
|
27
|
Lucena-Padros H, Bravo-Gil N, Tous C, Rojano E, Seoane-Zonjic P, Fernández RM, Ranea JAG, Antiñolo G, Borrego S. Bioinformatics Prediction for Network-Based Integrative Multi-Omics Expression Data Analysis in Hirschsprung Disease. Biomolecules 2024; 14:164. [PMID: 38397401 PMCID: PMC10886964 DOI: 10.3390/biom14020164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 01/15/2024] [Accepted: 01/27/2024] [Indexed: 02/25/2024] Open
Abstract
Hirschsprung's disease (HSCR) is a rare developmental disorder in which enteric ganglia are missing along a portion of the intestine. HSCR has a complex inheritance, with RET as the major disease-causing gene. However, the pathogenesis of HSCR is still not completely understood. Therefore, we applied a computational approach based on multi-omics network characterization and clustering analysis for HSCR-related gene/miRNA identification and biomarker discovery. Protein-protein interaction (PPI) and miRNA-target interaction (MTI) networks were analyzed by DPClusO and BiClusO, respectively, and finally, the biomarker potential of miRNAs was computationally screened by miRNA-BD. In this study, a total of 55 significant gene-disease modules were identified, allowing us to propose 178 new HSCR candidate genes and two biological pathways. Moreover, we identified 12 key miRNAs with biomarker potential among 137 predicted HSCR-associated miRNAs. Functional analysis of new candidates showed that enrichment terms related to gene ontology (GO) and pathways were associated with HSCR. In conclusion, this approach has allowed us to decipher new clues of the etiopathogenesis of HSCR, although molecular experiments are further needed for clinical validations.
Collapse
Affiliation(s)
- Helena Lucena-Padros
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville, University Hospital Virgen del Rocío/CSIC/University of Seville, 41013 Seville, Spain
| | - Nereida Bravo-Gil
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville, University Hospital Virgen del Rocío/CSIC/University of Seville, 41013 Seville, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER), 41013 Seville, Spain
| | - Cristina Tous
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville, University Hospital Virgen del Rocío/CSIC/University of Seville, 41013 Seville, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER), 41013 Seville, Spain
| | - Elena Rojano
- Department of Molecular Biology and Biochemistry, University of Malaga, 29010 Malaga, Spain
- Biomedical Research Institute of Malaga, IBIMA, 29010 Malaga, Spain
| | - Pedro Seoane-Zonjic
- Department of Molecular Biology and Biochemistry, University of Malaga, 29010 Malaga, Spain
- Biomedical Research Institute of Malaga, IBIMA, 29010 Malaga, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER), 29071 Malaga, Spain
| | - Raquel María Fernández
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville, University Hospital Virgen del Rocío/CSIC/University of Seville, 41013 Seville, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER), 41013 Seville, Spain
| | - Juan A. G. Ranea
- Department of Molecular Biology and Biochemistry, University of Malaga, 29010 Malaga, Spain
- Biomedical Research Institute of Malaga, IBIMA, 29010 Malaga, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER), 29071 Malaga, Spain
- Spanish National Bioinformatics Institute (INB/ELIXIR-ES), Instituto de Salud Carlos III (ISCIII), 28029 Madrid, Spain
| | - Guillermo Antiñolo
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville, University Hospital Virgen del Rocío/CSIC/University of Seville, 41013 Seville, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER), 41013 Seville, Spain
| | - Salud Borrego
- Department of Maternofetal Medicine, Genetics and Reproduction, Institute of Biomedicine of Seville, University Hospital Virgen del Rocío/CSIC/University of Seville, 41013 Seville, Spain
- Center for Biomedical Network Research on Rare Diseases (CIBERER), 41013 Seville, Spain
| |
Collapse
|
28
|
Bagger MM, Sjölund J, Kim J, Kohler KT, Villadsen R, Jafari A, Kassem M, Pietras K, Rønnov-Jessen L, Petersen OW. Evidence of steady-state fibroblast subtypes in the normal human breast as cells-of-origin for perturbed-state fibroblasts in breast cancer. Breast Cancer Res 2024; 26:11. [PMID: 38229104 PMCID: PMC10790388 DOI: 10.1186/s13058-024-01763-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 01/02/2024] [Indexed: 01/18/2024] Open
Abstract
BACKGROUND Human breast cancer most frequently originates within a well-defined anatomical structure referred to as the terminal duct lobular unit (TDLU). This structure is endowed with its very own lobular fibroblasts representing one out of two steady-state fibroblast subtypes-the other being interlobular fibroblasts. While cancer-associated fibroblasts (CAFs) are increasingly appreciated as covering a spectrum of perturbed states, we lack a coherent understanding of their relationship-if any-with the steady-state fibroblast subtypes. To address this, we here established two autologous CAF lines representing inflammatory CAFs (iCAFs) and myofibroblast CAFs (myCAFs) and compared them with already established interlobular- and lobular fibroblasts with respect to their origin and impact on tumor formation. METHODS Primary breast tumor-derived CAFs were transduced to express human telomerase reverse transcriptase (hTERT) and sorted into CD105low and CD105high populations using fluorescence-activated cell sorting (FACS). The two populations were tested for differentiation similarities to iCAF and myCAF states through transcriptome-wide RNA-Sequencing (RNA-Seq) including comparison to an available iCAF-myCAF cell state atlas. Inference of origin in interlobular and lobular fibroblasts relied on RNA-Seq profiles, immunocytochemistry and growth characteristics. Osteogenic differentiation and bone formation assays in culture and in vivo were employed to gauge for origin in bone marrow-derived mesenchymal stem cells (bMSCs). Functional characteristics were assessed with respect to contractility in culture and interaction with tumor cells in mouse xenografts. The cells' gene expression signatures were tested for association with clinical outcome of breast cancer patients using survival data from The Cancer Genome Atlas database. RESULTS We demonstrate that iCAFs have properties in common with interlobular fibroblasts while myCAFs and lobular fibroblasts are related. None of the CAFs qualify as bMSCs as revealed by lack of critical performance in bone formation assays. Functionally, myCAFs and lobular fibroblasts are almost equally tumor promoting as opposed to iCAFs and interlobular fibroblasts. A myCAF gene signature is found to associate with poor breast cancer-specific survival. CONCLUSIONS We propose that iCAFs and myCAFs originate in interlobular and lobular fibroblasts, respectively, and more importantly, that the tumor-promoting properties of lobular fibroblasts render the TDLU an epicenter for breast cancer evolution.
Collapse
Affiliation(s)
- Mikkel Morsing Bagger
- Division of Translational Cancer Research, Department of Laboratory Medicine, Lund University Cancer Centre, Lund University, Lund, Sweden.
- Department of Cellular and Molecular Medicine, University of Copenhagen, Copenhagen, Denmark.
| | - Jonas Sjölund
- Division of Translational Cancer Research, Department of Laboratory Medicine, Lund University Cancer Centre, Lund University, Lund, Sweden
| | - Jiyoung Kim
- Department of Cellular and Molecular Medicine, University of Copenhagen, Copenhagen, Denmark
| | | | - René Villadsen
- Department of Cellular and Molecular Medicine, University of Copenhagen, Copenhagen, Denmark
| | - Abbas Jafari
- Department of Cellular and Molecular Medicine, University of Copenhagen, Copenhagen, Denmark
| | - Moustapha Kassem
- Department of Cellular and Molecular Medicine, University of Copenhagen, Copenhagen, Denmark
- Laboratory of Molecular Endocrinology, KMEB, Department of Endocrinology, Odense University Hospital and University of Southern Denmark, Odense, Denmark
| | - Kristian Pietras
- Division of Translational Cancer Research, Department of Laboratory Medicine, Lund University Cancer Centre, Lund University, Lund, Sweden
| | - Lone Rønnov-Jessen
- Section for Cell Biology and Physiology, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Ole William Petersen
- Department of Cellular and Molecular Medicine, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
29
|
Dou C, Wu L, Zhang J, He H, Xu T, Yu Z, Su P, Zhang X, Wang J, Miao YL, Zhou J. The transcriptional activator Klf5 recruits p300-mediated H3K27ac for maintaining trophoblast stem cell pluripotency. J Mol Cell Biol 2024; 15:mjad045. [PMID: 37533201 PMCID: PMC10768793 DOI: 10.1093/jmcb/mjad045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2022] [Revised: 04/14/2023] [Accepted: 05/11/2023] [Indexed: 08/04/2023] Open
Abstract
The effective proliferation and differentiation of trophoblast stem cells (TSCs) is indispensable for the development of the placenta, which is the key to maintaining normal fetal growth during pregnancy. Kruppel-like factor 5 (Klf5) is implicated in the activation of pluripotency gene expression in embryonic stem cells (ESCs), yet its function in TSCs is poorly understood. Here, we showed that Klf5 knockdown resulted in the downregulation of core TSC-specific genes, consequently causing rapid differentiation of TSCs. Consistently, Klf5-depleted embryos lost the ability to establish TSCs in vitro. At the molecular level, Klf5 preferentially occupied the proximal promoter regions and maintained an open chromatin architecture of key TSC-specific genes. Deprivation of Klf5 impaired the enrichment of p300, a major histone acetyl transferase of H3 lysine 27 acetylation (H3K27ac), and further reduced the occupancy of H3K27ac at promoter regions, leading to decreased transcriptional activity of TSC pluripotency genes. Thus, our findings highlight a novel mechanism of Klf5 in regulating the self-renewal and differentiation of TSCs and provide a reference for understanding placental development and improving pregnancy rates.
Collapse
Affiliation(s)
- Chengli Dou
- Institute of Stem Cell and Regenerative Biology, College of Animal Science and Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction (Huazhong Agricultural University), Ministry of Education, Wuhan 430070, China
| | - Linhui Wu
- Institute of Stem Cell and Regenerative Biology, College of Animal Science and Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction (Huazhong Agricultural University), Ministry of Education, Wuhan 430070, China
| | - Jingjing Zhang
- Institute of Stem Cell and Regenerative Biology, College of Animal Science and Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction (Huazhong Agricultural University), Ministry of Education, Wuhan 430070, China
| | - Hainan He
- Institute of Stem Cell and Regenerative Biology, College of Animal Science and Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction (Huazhong Agricultural University), Ministry of Education, Wuhan 430070, China
| | - Tian Xu
- Institute of Stem Cell and Regenerative Biology, College of Animal Science and Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction (Huazhong Agricultural University), Ministry of Education, Wuhan 430070, China
| | - Zhisheng Yu
- Institute of Stem Cell and Regenerative Biology, College of Animal Science and Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction (Huazhong Agricultural University), Ministry of Education, Wuhan 430070, China
| | - Peng Su
- Institute of Stem Cell and Regenerative Biology, College of Animal Science and Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction (Huazhong Agricultural University), Ministry of Education, Wuhan 430070, China
| | - Xia Zhang
- Institute of Stem Cell and Regenerative Biology, College of Animal Science and Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction (Huazhong Agricultural University), Ministry of Education, Wuhan 430070, China
| | - Junling Wang
- Department of Reproductive Medicine, Huangshi Central Hospital, Affiliated Hospital of Hubei Polytechnic, Edong Healthcare Group, Huangshi 435000, China
| | - Yi-Liang Miao
- Institute of Stem Cell and Regenerative Biology, College of Animal Science and Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction (Huazhong Agricultural University), Ministry of Education, Wuhan 430070, China
- Hubei Hongshan Laboratory, Wuhan 430070, China
| | - Jilong Zhou
- Institute of Stem Cell and Regenerative Biology, College of Animal Science and Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
- Key Laboratory of Agricultural Animal Genetics, Breeding and Reproduction (Huazhong Agricultural University), Ministry of Education, Wuhan 430070, China
| |
Collapse
|
30
|
Gu X, Wang M, Zhang XO. TE-TSS: an integrated data resource of human and mouse transposable element (TE)-derived transcription start site (TSS). Nucleic Acids Res 2024; 52:D322-D333. [PMID: 37956335 PMCID: PMC10767810 DOI: 10.1093/nar/gkad1048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Revised: 10/21/2023] [Accepted: 10/23/2023] [Indexed: 11/15/2023] Open
Abstract
Transposable elements (TEs) are abundant in the genome and serve as crucial regulatory elements. Some TEs function as epigenetically regulated promoters, and these TE-derived transcription start sites (TSSs) play a crucial role in regulating genes associated with specific functions, such as cancer and embryogenesis. However, the lack of an accessible database that systematically gathers TE-derived TSS data is a current research gap. To address this, we established TE-TSS, an integrated data resource of human and mouse TE-derived TSSs (http://xozhanglab.com/TETSS). TE-TSS has compiled 2681 RNA sequencing datasets, spanning various tissues, cell lines and developmental stages. From these, we identified 5768 human TE-derived TSSs and 2797 mouse TE-derived TSSs, with 47% and 38% being experimentally validated, respectively. TE-TSS enables comprehensive exploration of TSS usage in diverse samples, providing insights into tissue-specific gene expression patterns and transcriptional regulatory elements. Furthermore, TE-TSS compares TE-derived TSS regions across 15 mammalian species, enhancing our understanding of their evolutionary and functional aspects. The establishment of TE-TSS facilitates further investigations into the roles of TEs in shaping the transcriptomic landscape and offers valuable resources for comprehending their involvement in diverse biological processes.
Collapse
Affiliation(s)
- Xiaobing Gu
- Shanghai Key Laboratory of Maternal and Fetal Medicine, Clinical and Translational Research Center of Shanghai First Maternity and Infant Hospital, Frontier Science Center for Stem Cell Research, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Mingdong Wang
- Shanghai Key Laboratory of Maternal and Fetal Medicine, Clinical and Translational Research Center of Shanghai First Maternity and Infant Hospital, Frontier Science Center for Stem Cell Research, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| | - Xiao-Ou Zhang
- Shanghai Key Laboratory of Maternal and Fetal Medicine, Clinical and Translational Research Center of Shanghai First Maternity and Infant Hospital, Frontier Science Center for Stem Cell Research, School of Life Sciences and Technology, Tongji University, Shanghai 200092, China
| |
Collapse
|
31
|
Murphy D, Salataj E, Di Giammartino DC, Rodriguez-Hernaez J, Kloetgen A, Garg V, Char E, Uyehara CM, Ee LS, Lee U, Stadtfeld M, Hadjantonakis AK, Tsirigos A, Polyzos A, Apostolou E. 3D Enhancer-promoter networks provide predictive features for gene expression and coregulation in early embryonic lineages. Nat Struct Mol Biol 2024; 31:125-140. [PMID: 38053013 PMCID: PMC10897904 DOI: 10.1038/s41594-023-01130-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Accepted: 09/18/2023] [Indexed: 12/07/2023]
Abstract
Mammalian embryogenesis commences with two pivotal and binary cell fate decisions that give rise to three essential lineages: the trophectoderm, the epiblast and the primitive endoderm. Although key signaling pathways and transcription factors that control these early embryonic decisions have been identified, the non-coding regulatory elements through which transcriptional regulators enact these fates remain understudied. Here, we characterize, at a genome-wide scale, enhancer activity and 3D connectivity in embryo-derived stem cell lines that represent each of the early developmental fates. We observe extensive enhancer remodeling and fine-scale 3D chromatin rewiring among the three lineages, which strongly associate with transcriptional changes, although distinct groups of genes are irresponsive to topological changes. In each lineage, a high degree of connectivity, or 'hubness', positively correlates with levels of gene expression and enriches for cell-type specific and essential genes. Genes within 3D hubs also show a significantly stronger probability of coregulation across lineages compared to genes in linear proximity or within the same contact domains. By incorporating 3D chromatin features, we build a predictive model for transcriptional regulation (3D-HiChAT) that outperforms models using only 1D promoter or proximal variables to predict levels and cell-type specificity of gene expression. Using 3D-HiChAT, we identify, in silico, candidate functional enhancers and hubs in each cell lineage, and with CRISPRi experiments, we validate several enhancers that control gene expression in their respective lineages. Our study identifies 3D regulatory hubs associated with the earliest mammalian lineages and describes their relationship to gene expression and cell identity, providing a framework to comprehensively understand lineage-specific transcriptional behaviors.
Collapse
Affiliation(s)
- Dylan Murphy
- Sanford I. Weill Department of Medicine, Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
- Physiology, Biophysics and Systems Biology Program, Weill Cornell Graduate School of Medical Sciences, Cornell University, New York, NY, USA
| | - Eralda Salataj
- Sanford I. Weill Department of Medicine, Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - Dafne Campigli Di Giammartino
- Sanford I. Weill Department of Medicine, Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
- 3D Chromatin Conformation and RNA Genomics Laboratory, Center for Human Technologies (CHT), Istituto Italiano di Tecnologia (IIT), Genova, Italy
| | - Javier Rodriguez-Hernaez
- Department of Pathology, New York University Langone Health, New York, NY, USA
- Department of Medicine, New York University Langone Health, New York, NY, USA
- Applied Bioinformatics Laboratory, New York University Langone Health, New York, NY, USA
| | - Andreas Kloetgen
- Department of Pathology, New York University Langone Health, New York, NY, USA
- Department of Medicine, New York University Langone Health, New York, NY, USA
- Applied Bioinformatics Laboratory, New York University Langone Health, New York, NY, USA
| | - Vidur Garg
- Developmental Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Biochemistry Cell and Molecular Biology Program, Weill Cornell Graduate School of Medical Sciences, Cornell University, New York, NY, USA
| | - Erin Char
- Tri-Institutional Training Program in Computational Biology and Medicine, Weill Cornell Medical College, New York, NY, USA
| | - Christopher M Uyehara
- Sanford I. Weill Department of Medicine, Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - Ly-Sha Ee
- Sanford I. Weill Department of Medicine, Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - UkJin Lee
- Sanford I. Weill Department of Medicine, Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
- Biochemistry Cell and Molecular Biology Program, Weill Cornell Graduate School of Medical Sciences, Cornell University, New York, NY, USA
| | - Matthias Stadtfeld
- Sanford I. Weill Department of Medicine, Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA
| | - Anna-Katerina Hadjantonakis
- Developmental Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Aristotelis Tsirigos
- Department of Pathology, New York University Langone Health, New York, NY, USA.
- Department of Medicine, New York University Langone Health, New York, NY, USA.
- Applied Bioinformatics Laboratory, New York University Langone Health, New York, NY, USA.
| | - Alexander Polyzos
- Sanford I. Weill Department of Medicine, Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA.
| | - Effie Apostolou
- Sanford I. Weill Department of Medicine, Sandra and Edward Meyer Cancer Center, Weill Cornell Medicine, New York, NY, USA.
| |
Collapse
|
32
|
Sur A, Wang Y, Capar P, Margolin G, Prochaska MK, Farrell JA. Single-cell analysis of shared signatures and transcriptional diversity during zebrafish development. Dev Cell 2023; 58:3028-3047.e12. [PMID: 37995681 PMCID: PMC11181902 DOI: 10.1016/j.devcel.2023.11.001] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 08/24/2023] [Accepted: 11/01/2023] [Indexed: 11/25/2023]
Abstract
During development, animals generate distinct cell populations with specific identities, functions, and morphologies. We mapped transcriptionally distinct populations across 489,686 cells from 62 stages during wild-type zebrafish embryogenesis and early larval development (3-120 h post-fertilization). Using these data, we identified the limited catalog of gene expression programs reused across multiple tissues and their cell-type-specific adaptations. We also determined the duration each transcriptional state is present during development and identify unexpected long-term cycling populations. Focused clustering and transcriptional trajectory analyses of non-skeletal muscle and endoderm identified transcriptional profiles and candidate transcriptional regulators of understudied cell types and subpopulations, including the pneumatic duct, individual intestinal smooth muscle layers, spatially distinct pericyte subpopulations, and recently discovered best4+ cells. To enable additional discoveries, we make this comprehensive transcriptional atlas of early zebrafish development available through our website, Daniocell.
Collapse
Affiliation(s)
- Abhinav Sur
- Division of Developmental Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, MD 20814, USA
| | - Yiqun Wang
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138, USA
| | - Paulina Capar
- Division of Developmental Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, MD 20814, USA
| | - Gennady Margolin
- Bioinformatics and Scientific Programming Core, Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, MD 20814, USA
| | - Morgan Kathleen Prochaska
- Division of Developmental Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, MD 20814, USA
| | - Jeffrey A Farrell
- Division of Developmental Biology, Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH, Bethesda, MD 20814, USA.
| |
Collapse
|
33
|
Rade M, Böhlen S, Neuhaus V, Löffler D, Blumert C, Merz M, Köhl U, Dehmel S, Sewald K, Reiche K. A time-resolved meta-analysis of consensus gene expression profiles during human T-cell activation. Genome Biol 2023; 24:287. [PMID: 38098113 PMCID: PMC10722659 DOI: 10.1186/s13059-023-03120-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Accepted: 11/22/2023] [Indexed: 12/17/2023] Open
Abstract
BACKGROUND The coordinated transcriptional regulation of activated T-cells is based on a complex dynamic behavior of signaling networks. Given an external stimulus, T-cell gene expression is characterized by impulse and sustained patterns over the course. Here, we analyze the temporal pattern of activation across different T-cell populations to develop consensus gene signatures for T-cell activation. RESULTS Here, we identify and verify general biomarker signatures robustly evaluating T-cell activation in a time-resolved manner. We identify time-resolved gene expression profiles comprising 521 genes of up to 10 disjunct time points during activation and different polarization conditions. The gene signatures include central transcriptional regulators of T-cell activation, representing successive waves as well as sustained patterns of induction. They cover sustained repressed, intermediate, and late response expression rates across multiple T-cell populations, thus defining consensus biomarker signatures for T-cell activation. In addition, intermediate and late response activation signatures in CAR T-cell infusion products are correlated to immune effector cell-associated neurotoxicity syndrome. CONCLUSION This study is the first to describe temporally resolved gene expression patterns across T-cell populations. These biomarker signatures are a valuable source, e.g., monitoring transcriptional changes during T-cell activation with a reasonable number of genes, annotating T-cell states in single-cell transcriptome studies, or assessing dysregulated functions of human T-cell immunity.
Collapse
Affiliation(s)
- Michael Rade
- Department of Diagnostics, Fraunhofer Institute for Cell Therapy and Immunology, Perlickstraße 1, 04103, Leipzig, Germany.
| | - Sebastian Böhlen
- Department of Preclinical Pharmacology and In-Vitro Toxicology, Fraunhofer Institute for Toxicology and Experimental Medicine, ITEM, Nikolai-Fuchs-Strasse 1, 30625, Hannover, Germany
| | - Vanessa Neuhaus
- Department of Preclinical Pharmacology and In-Vitro Toxicology, Fraunhofer Institute for Toxicology and Experimental Medicine, ITEM, Nikolai-Fuchs-Strasse 1, 30625, Hannover, Germany
| | - Dennis Löffler
- Department of Diagnostics, Fraunhofer Institute for Cell Therapy and Immunology, Perlickstraße 1, 04103, Leipzig, Germany
| | - Conny Blumert
- Department of Diagnostics, Fraunhofer Institute for Cell Therapy and Immunology, Perlickstraße 1, 04103, Leipzig, Germany
| | - Maximilian Merz
- Department of Hematology, Cellular Therapy, Hemostaseology and Infectiology, University Hospital of Leipzig, Leipzig, Germany
| | - Ulrike Köhl
- Department of Diagnostics, Fraunhofer Institute for Cell Therapy and Immunology, Perlickstraße 1, 04103, Leipzig, Germany
- Institute for Clinical Immunology, Leipzig University, Johannisallee 30, 04103, Leipzig, Germany
| | - Susann Dehmel
- Department of Preclinical Pharmacology and In-Vitro Toxicology, Fraunhofer Institute for Toxicology and Experimental Medicine, ITEM, Nikolai-Fuchs-Strasse 1, 30625, Hannover, Germany
| | - Katherina Sewald
- Department of Preclinical Pharmacology and In-Vitro Toxicology, Fraunhofer Institute for Toxicology and Experimental Medicine, ITEM, Nikolai-Fuchs-Strasse 1, 30625, Hannover, Germany
| | - Kristin Reiche
- Department of Diagnostics, Fraunhofer Institute for Cell Therapy and Immunology, Perlickstraße 1, 04103, Leipzig, Germany
- Institute for Clinical Immunology, Leipzig University, Johannisallee 30, 04103, Leipzig, Germany
- Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), Dresden, Leipzig, Germany
| |
Collapse
|
34
|
Spiegl B, Kapidzic F, Röner S, Kircher M, Speicher M. GCparagon: evaluating and correcting GC biases in cell-free DNA at the fragment level. NAR Genom Bioinform 2023; 5:lqad102. [PMID: 38025047 PMCID: PMC10657415 DOI: 10.1093/nargab/lqad102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Revised: 09/18/2023] [Accepted: 11/07/2023] [Indexed: 12/01/2023] Open
Abstract
Analyses of cell-free DNA (cfDNA) are increasingly being employed for various diagnostic and research applications. Many technologies aim to increase resolution, e.g. for detecting early-stage cancer or minimal residual disease. However, these efforts may be confounded by inherent base composition biases of cfDNA, specifically the over - and underrepresentation of guanine (G) and cytosine (C) sequences. Currently, there is no universally applicable tool to correct these effects on sequencing read-level data. Here, we present GCparagon, a two-stage algorithm for computing and correcting GC biases in cfDNA samples. In the initial step, length and GC base count parameters are determined. Here, our algorithm minimizes the inclusion of known problematic genomic regions, such as low-mappability regions, in its calculations. In the second step, GCparagon computes weights counterbalancing the distortion of cfDNA attributes (correction matrix). These fragment weights are added to a binary alignment map (BAM) file as alignment tags for individual reads. The GC correction matrix or the tagged BAM file can be used for downstream analyses. Parallel computing allows for a GC bias estimation below 1 min. We demonstrate that GCparagon vastly improves the analysis of regulatory regions, which frequently show specific GC composition patterns and will contribute to standardized cfDNA applications.
Collapse
Affiliation(s)
- Benjamin Spiegl
- Institute of Human Genetics, Diagnostic and Research Center for Molecular BioMedicine, Medical University of Graz, 8010 Graz, Austria
| | - Faruk Kapidzic
- Institute of Human Genetics, Diagnostic and Research Center for Molecular BioMedicine, Medical University of Graz, 8010 Graz, Austria
| | - Sebastian Röner
- Exploratory Diagnostic Sciences, Berlin Institute of Health (BIH) at Charité-Universitätsmedizin Berlin, 10117 Berlin, Germany
| | - Martin Kircher
- Exploratory Diagnostic Sciences, Berlin Institute of Health (BIH) at Charité-Universitätsmedizin Berlin, 10117 Berlin, Germany
- Institute of Human Genetics, University Medical Center Schleswig-Holstein (UKSH), University of Lübeck, 23562 Lübeck, Germany
| | - Michael R Speicher
- Institute of Human Genetics, Diagnostic and Research Center for Molecular BioMedicine, Medical University of Graz, 8010 Graz, Austria
- BioTechMed-Graz, 8010 Graz, Austria
| |
Collapse
|
35
|
Stepankiw N, Yang AWH, Hughes TR. The human genome contains over a million autonomous exons. Genome Res 2023; 33:1865-1878. [PMID: 37945377 PMCID: PMC10760453 DOI: 10.1101/gr.277792.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2023] [Accepted: 10/27/2023] [Indexed: 11/12/2023]
Abstract
Mammalian mRNA and lncRNA exons are often small compared to introns. The exon definition model predicts that exons splice autonomously, dependent on proximal exon sequence features, explaining their delineation within large introns. This model has not been examined on a genome-wide scale, however, leaving open the question of how often mRNA and lncRNA exons are autonomous. It is also unknown how frequently such exons can arise by chance. Here, we directly assayed large fragments (500-1000 bp) of the human genome by exon trapping, which detects exons spliced into a heterologous transgene, here designed with a large intron context. We define the trapped exons as "autonomous." We obtained ∼1.25 million trapped exons, including most known mRNA and well-annotated lncRNA internal exons, demonstrating that human exons are predominantly autonomous. mRNA exons are trapped with the highest efficiency. Nearly a million of the trapped exons are unannotated, most located in intergenic regions and antisense to mRNA, with depletion from the forward strand of introns. These exons are not conserved, suggesting they are nonfunctional and arose from random mutations. They are nonetheless highly enriched with known splicing promoting sequence features that delineate known exons. Novel autonomous exons are more numerous than annotated lncRNA exons, and computational models also indicate they will occur with similar frequency in any randomly generated sequence. These results show that most human coding exons splice autonomously, and provide an explanation for the existence of many unconserved lncRNAs, as well as a new annotation and inclusion levels of spliceable loci in the human genome.
Collapse
Affiliation(s)
- Nicholas Stepankiw
- Donnelly Centre, University of Toronto, Toronto, Ontario, Canada M5S 3E1
| | - Ally W H Yang
- Donnelly Centre, University of Toronto, Toronto, Ontario, Canada M5S 3E1
| | - Timothy R Hughes
- Donnelly Centre, University of Toronto, Toronto, Ontario, Canada M5S 3E1;
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada M5S 1A8
| |
Collapse
|
36
|
Moyers BA, Partridge EC, Mackiewicz M, Betti MJ, Darji R, Meadows SK, Newberry KM, Brandsmeier LA, Wold BJ, Mendenhall EM, Myers RM. Characterization of human transcription factor function and patterns of gene regulation in HepG2 cells. Genome Res 2023; 33:1879-1892. [PMID: 37852782 PMCID: PMC10760452 DOI: 10.1101/gr.278205.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Accepted: 10/13/2023] [Indexed: 10/20/2023]
Abstract
Transcription factors (TFs) are trans-acting proteins that bind cis-regulatory elements (CREs) in DNA to control gene expression. Here, we analyzed the genomic localization profiles of 529 sequence-specific TFs and 151 cofactors and chromatin regulators in the human cancer cell line HepG2, for a total of 680 broadly termed DNA-associated proteins (DAPs). We used this deep collection to model each TF's impact on gene expression, and identified a cohort of 26 candidate transcriptional repressors. We examine high occupancy target (HOT) sites in the context of three-dimensional genome organization and show biased motif placement in distal-promoter connections involving HOT sites. We also found a substantial number of closed chromatin regions with multiple DAPs bound, and explored their properties, finding that a MAFF/MAFK TF pair correlates with transcriptional repression. Altogether, these analyses provide novel insights into the regulatory logic of the human cell line HepG2 genome and show the usefulness of large genomic analyses for elucidation of individual TF functions.
Collapse
Affiliation(s)
- Belle A Moyers
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | | | - Mark Mackiewicz
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Michael J Betti
- Vanderbilt University Medical Center, Nashville, Tennessee 37232, USA
| | - Roshan Darji
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | - Sarah K Meadows
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA
| | | | | | - Barbara J Wold
- Merkin Institute for Translational Research, California Institute of Technology, Pasadena, California 91125, USA
| | - Eric M Mendenhall
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA;
| | - Richard M Myers
- HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806, USA;
| |
Collapse
|
37
|
Zhao Y, Liu L, Hassett R, Siepel A. Model-based characterization of the equilibrium dynamics of transcription initiation and promoter-proximal pausing in human cells. Nucleic Acids Res 2023; 51:e106. [PMID: 37889042 PMCID: PMC10681744 DOI: 10.1093/nar/gkad843] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 09/13/2023] [Accepted: 09/21/2023] [Indexed: 10/28/2023] Open
Abstract
In metazoans, both transcription initiation and the escape of RNA polymerase (RNAP) from promoter-proximal pausing are key rate-limiting steps in gene expression. These processes play out at physically proximal sites on the DNA template and appear to influence one another through steric interactions. Here, we examine the dynamics of these processes using a combination of statistical modeling, simulation, and analysis of real nascent RNA sequencing data. We develop a simple probabilistic model that jointly describes the kinetics of transcription initiation, pause-escape, and elongation, and the generation of nascent RNA sequencing read counts under steady-state conditions. We then extend this initial model to allow for variability across cells in promoter-proximal pause site locations and steric hindrance of transcription initiation from paused RNAPs. In an extensive series of simulations, we show that this model enables accurate estimation of initiation and pause-escape rates. Furthermore, we show by simulation and analysis of real data that pause-escape is often strongly rate-limiting and that steric hindrance can dramatically reduce initiation rates. Our modeling framework is applicable to a variety of inference problems, and our software for estimation and simulation is freely available.
Collapse
Affiliation(s)
- Yixin Zhao
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Lingjie Liu
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
- Graduate Program in Genetics, Stony Brook University, Stony Brook, NY, USA
| | - Rebecca Hassett
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Adam Siepel
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
- Graduate Program in Genetics, Stony Brook University, Stony Brook, NY, USA
| |
Collapse
|
38
|
Soshnikova NV, Azieva AM, Klimenko NS, Khamidullina AI, Feoktistov AV, Sheynov AA, Brechalov AV, Tatarskiy VV, Georgieva SG. A novel chromatin-remodeling complex variant, dcPBAF, is involved in maintaining transcription in differentiated neurons. Front Cell Dev Biol 2023; 11:1271598. [PMID: 38033872 PMCID: PMC10682186 DOI: 10.3389/fcell.2023.1271598] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Accepted: 10/19/2023] [Indexed: 12/02/2023] Open
Abstract
The Polybromo-associated BAF (BRG1- or BRM-associated factors) (PBAF) chromatin-remodeling complex is essential for transcription in mammalian cells. In this study, we describe a novel variant of the PBAF complex from differentiated neuronal cells, called dcPBAF, that differs from the canonical PBAF existing in proliferating neuroblasts. We describe that in differentiated adult neurons, a specific subunit of PBAF, PHF10, is replaced by a PHF10 isoform that lacks N- and C-terminal domains (called PHF10D). In addition, dcPBAF does not contain the canonical BRD7 subunit. dcPBAF binds promoters of the actively transcribed neuron-specific and housekeeping genes in terminally differentiated neurons of adult mice. Furthermore, in differentiated human neuronal cells, PHF10D-containing dcPBAF maintains a high transcriptional level at several neuron-specific genes.
Collapse
Affiliation(s)
- Nataliya V. Soshnikova
- Department of Transcription Factors, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
- Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
- Department of Eukaryotic Transcription Factors, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
| | - Asya M. Azieva
- Department of Eukaryotic Transcription Factors, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
- National Research Center “Kurchatov Institute”, Moscow, Russia
| | - Nataliya S. Klimenko
- Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
| | - Alvina I. Khamidullina
- Department of Molecular Oncobiology, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
| | - Alexey V. Feoktistov
- Department of Transcription Factors, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
- Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| | - Andrey A. Sheynov
- Department of Eukaryotic Transcription Factors, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
| | - Alexander V. Brechalov
- Department of Eukaryotic Transcription Factors, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
| | - Victor V. Tatarskiy
- Department of Molecular Oncobiology, Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia
| | - Sofia G. Georgieva
- Department of Transcription Factors, Engelhardt Institute of Molecular Biology, Russian Academy of Sciences, Moscow, Russia
| |
Collapse
|
39
|
Kim TK, Waldman J, Ibanez-Carrasco F, Tirloni L, Waltero C, Calixo C, Braz GR, Mulenga A, da Silva Vaz Junior I, Logullo C. Stable internal reference genes for quantitative RT-PCR analyses in Rhipicephalus microplus during embryogenesis. Ticks Tick Borne Dis 2023; 14:102251. [PMID: 37708803 PMCID: PMC10555470 DOI: 10.1016/j.ttbdis.2023.102251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 09/04/2023] [Accepted: 09/05/2023] [Indexed: 09/16/2023]
Abstract
Studies on the transcriptional control of gene expression are crucial to understand changes in organism's physiological or cellular conditions. To obtain reliable data on mRNA amounts and the estimation of gene expression levels, it is crucial to normalize the target gene with one or more internal reference gene(s). However, the use of constitutive genes as reference genes is controversial, as their expression patterns are sometimes more complex than previously thought. In various arthropod vectors, including ticks, several constitutive genes have been identified by studying gene expression in different tissues and life stages. The cattle tick Rhipicephalus microplus is a major vector for several pathogens and is widely distributed in tropical and subtropical regions globally. Tick developmental physiology is an essential aspect of research, particularly embryogenesis, where many important developmental events occur, thus the identification of stable reference genes is essential for the interpretation of reliable gene expression data. This study aimed to identify and select R. microplus housekeeping genes and evaluate their stability during embryogenesis. Reference genes used as internal control in molecular assays were selected based on previous studies. These genes were screened by quantitative PCR (qPCR) and tested for gene expression stability during embryogenesis. Results demonstrated that the relative stability of reference genes varied at different time points during the embryogenesis. The GeNorm tool showed that elongation factor 1α (Elf1a) and ribosomal protein L4 (Rpl4) were the most stable genes, while H3 histone family 3A (Hist3A) and ribosomal protein S18 (RpS18) were the least stable. The NormFinder tool showed that Rpl4 was the most stable gene, while the ranking of Elf1a was intermediate in all tested conditions. The BestKeeper tool showed that Rpl4 and cyclophilin A (CycA) were the more and less stable genes, respectively. These data collectively demonstrate that Rpl4, Elf1a, and GAPDH are suitable internal controls for normalizing qPCR during R. microplus embryogenesis. These genes were consistently identified as the most stable in various analysis methods employed in this study. Thus, findings presented in this study offer valuable information for the study of gene expression during embryogenesis in R. microplus.
Collapse
Affiliation(s)
- Tae Kwon Kim
- Department of Veterinary Pathobiology, College of Veterinary Medicine, Texas A&M University, College Station, TX, USA; Department of Diagnostic Medicine and Pathobiology, College of Veterinary Medicine, Kansas State University, Manhattan, KS, USA; Laboratório de Bioquímica de Artrópodes Hematófagos, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, RJ, Brazil
| | - Jéssica Waldman
- Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Freddy Ibanez-Carrasco
- Department of Entomology, Minnie Bell Heep Center, Texas A&M University, Suite 412, 2475 TAMU, 370 Olsen Blvd, College Station, TX, USA; Texas A&M AgriLife Research and Extension Center, 2415 East US Highway 83, Weslaco, TX 78596, USA
| | - Lucas Tirloni
- Tick-Pathogen Transmission Unit, Laboratory of Bacteriology, National Institute of Allergy and Infectious Diseases, Hamilton, MT, USA
| | - Camila Waltero
- Laboratório de Bioquímica de Artrópodes Hematófagos, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, RJ, Brazil
| | - Christiano Calixo
- Laboratório de Bioquímica de Artrópodes Hematófagos, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, RJ, Brazil
| | - Gloria R Braz
- Departamento de Bioquímica, Instituto de Química, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil; Instituto Nacional de Ciência e Tecnologia - Entomologia Molecular, Rio de Janeiro, RJ, Brazil
| | - Albert Mulenga
- Department of Veterinary Pathobiology, College of Veterinary Medicine, Texas A&M University, College Station, TX, USA
| | - Itabajara da Silva Vaz Junior
- Centro de Biotecnologia, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil; Instituto Nacional de Ciência e Tecnologia - Entomologia Molecular, Rio de Janeiro, RJ, Brazil; Faculdade de Veterinária, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil.
| | - Carlos Logullo
- Laboratório de Bioquímica de Artrópodes Hematófagos, Instituto de Bioquímica Médica Leopoldo de Meis, Universidade Federal do Rio de Janeiro, RJ, Brazil,; Instituto Nacional de Ciência e Tecnologia - Entomologia Molecular, Rio de Janeiro, RJ, Brazil
| |
Collapse
|
40
|
Gosch A, Bhardwaj A, Courts C. TrACES of time: Transcriptomic analyses for the contextualization of evidential stains - Identification of RNA markers for estimating time-of-day of bloodstain deposition. Forensic Sci Int Genet 2023; 67:102915. [PMID: 37598452 DOI: 10.1016/j.fsigen.2023.102915] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Revised: 07/20/2023] [Accepted: 08/01/2023] [Indexed: 08/22/2023]
Abstract
Obtaining forensically relevant information beyond who deposited a biological stain on how and under which circumstances it was deposited is a question of increasing importance in forensic molecular biology. In the past few years, several studies have been produced on the potential of gene expression analysis to deliver relevant contextualizing information, e.g. on nature and condition of a stain as well as aspects of stain deposition timing. However, previous attempts to predict the time-of-day of sample deposition were all based on and thus limited by previously described diurnal oscillators. Herein, we newly approached this goal by applying current sequencing technologies and statistical methods to identify novel candidate markers for forensic time-of-day predictions from whole transcriptome analyses. To this purpose, we collected whole blood samples from ten individuals at eight different time points throughout the day, performed whole transcriptome sequencing and applied biostatistical algorithms to identify 81 mRNA markers with significantly differential expression as candidates to predict the time of day. In addition, we performed qPCR analysis to assess the characteristics of a subset of 13 candidate predictors in dried and aged blood stains. While we demonstrated the general possibility of using the selected candidate markers to predict time-of-day of sample deposition, we also observed notable variation between different donors and storage conditions, highlighting the relevance of employing accurate quantification methods in combination with robust normalization procedures.This study's results are foundational and may be built upon when developing a targeted assay for time-of-day predictions from forensic blood samples in the future.
Collapse
Affiliation(s)
- A Gosch
- Institute of Legal Medicine, Medical Faculty, University Hospital Cologne, Cologne, Germany
| | - A Bhardwaj
- Institute of Clinical Molecular Biology, University of Kiel, Kiel, Germany
| | - C Courts
- Institute of Legal Medicine, Medical Faculty, University Hospital Cologne, Cologne, Germany.
| |
Collapse
|
41
|
Zhang R, Yang M, Schreiber J, O'Day DR, Turner JMA, Shendure J, Disteche CM, Deng X, Noble WS. Cross-species imputation and comparison of single-cell transcriptomic profiles. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.19.563173. [PMID: 37905060 PMCID: PMC10614954 DOI: 10.1101/2023.10.19.563173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
Cross-species comparison and prediction of gene expression profiles are important to understand regulatory changes during evolution and to transfer knowledge learned from model organisms to humans. Single-cell RNA-seq (scRNA-seq) profiles enable us to capture gene expression profiles with respect to variations among individual cells; however, cross-species comparison of scRNA-seq profiles is challenging because of data sparsity, batch effects, and the lack of one-to-one cell matching across species. Moreover, single-cell profiles are challenging to obtain in certain biological contexts, limiting the scope of hypothesis generation. Here we developed Icebear, a neural network framework that decomposes single-cell measurements into factors representing cell identity, species, and batch factors. Icebear enables accurate prediction of single-cell gene expression profiles across species, thereby providing high-resolution cell type and disease profiles in under-characterized contexts. Icebear also facilitates direct cross-species comparison of single-cell expression profiles for conserved genes that are located on the X chromosome in eutherian mammals but on autosomes in chicken. This comparison, for the first time, revealed evolutionary and diverse adaptations of X-chromosome upregulation in mammals.
Collapse
Affiliation(s)
- Ran Zhang
- Department of Genome Sciences, University of Washington
- eScience Institute, University of Washington
| | - Mu Yang
- Department of Biomedical Informatics and Medical Education, University of Washington
| | | | - Diana R O'Day
- Brotman Baty Institute for Precision Medicine, University of Washington
| | | | - Jay Shendure
- Department of Genome Sciences, University of Washington
- Brotman Baty Institute for Precision Medicine, University of Washington
- Howard Hughes Medical Institute
- Allen Center for Cell Lineage Tracing
| | - Christine M Disteche
- Department of Laboratory Medicine and Pathology, University of Washington
- Department of Medicine, University of Washington
| | - Xinxian Deng
- Department of Laboratory Medicine and Pathology, University of Washington
| | - William Stafford Noble
- Department of Genome Sciences, University of Washington
- Paul G. Allen School of Computer Science and Engineering, University of Washington
| |
Collapse
|
42
|
Huuki-Myers LA, Montgomery KD, Kwon SH, Page SC, Hicks SC, Maynard KR, Collado-Torres L. Data-driven identification of total RNA expression genes for estimation of RNA abundance in heterogeneous cell types highlighted in brain tissue. Genome Biol 2023; 24:233. [PMID: 37845779 PMCID: PMC10578035 DOI: 10.1186/s13059-023-03066-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Accepted: 09/20/2023] [Indexed: 10/18/2023] Open
Abstract
We define and identify a new class of control genes for next-generation sequencing called total RNA expression genes (TREGs), which correlate with total RNA abundance in cell types of different sizes and transcriptional activity. We provide a data-driven method to identify TREGs from single-cell RNA sequencing data, allowing the estimation of total amount of RNA when restricted to quantifying a limited number of genes. We demonstrate our method in postmortem human brain using multiplex single-molecule fluorescent in situ hybridization and compare candidate TREGs against classic housekeeping genes. We identify AKT3 as a top TREG across five brain regions.
Collapse
Affiliation(s)
- Louise A Huuki-Myers
- Lieber Institute for Brain Development, Johns Hopkins Medical Campus, Baltimore, MD, USA
| | - Kelsey D Montgomery
- Lieber Institute for Brain Development, Johns Hopkins Medical Campus, Baltimore, MD, USA
| | - Sang Ho Kwon
- Lieber Institute for Brain Development, Johns Hopkins Medical Campus, Baltimore, MD, USA
- The Solomon H. Snyder Department of Neuroscience, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Stephanie C Page
- Lieber Institute for Brain Development, Johns Hopkins Medical Campus, Baltimore, MD, USA
| | - Stephanie C Hicks
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Kristen R Maynard
- Lieber Institute for Brain Development, Johns Hopkins Medical Campus, Baltimore, MD, USA.
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins School of Medicine, Baltimore, MD, USA.
| | - Leonardo Collado-Torres
- Lieber Institute for Brain Development, Johns Hopkins Medical Campus, Baltimore, MD, USA.
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.
| |
Collapse
|
43
|
Liu H, Arsiè R, Schwabe D, Schilling M, Minia I, Alles J, Boltengagen A, Kocks C, Falcke M, Friedman N, Landthaler M, Rajewsky N. SLAM-Drop-seq reveals mRNA kinetic rates throughout the cell cycle. Mol Syst Biol 2023; 19:1-23. [PMID: 38778223 PMCID: PMC10568207 DOI: 10.15252/msb.202211427] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 07/24/2023] [Accepted: 08/04/2023] [Indexed: 05/25/2024] Open
Abstract
RNA abundance is tightly regulated in eukaryotic cells by modulating the kinetic rates of RNA production, processing, and degradation. To date, little is known about time‐dependent kinetic rates during dynamic processes. Here, we present SLAM‐Drop‐seq, a method that combines RNA metabolic labeling and alkylation of modified nucleotides in methanol‐fixed cells with droplet‐based sequencing to detect newly synthesized and preexisting mRNAs in single cells. As a first application, we sequenced 7280 HEK293 cells and calculated gene‐specific kinetic rates during the cell cycle using the novel package Eskrate. Of the 377 robust‐cycling genes that we identified, only a minor fraction is regulated solely by either dynamic transcription or degradation (6 and 4%, respectively). By contrast, the vast majority (89%) exhibit dynamically regulated transcription and degradation rates during the cell cycle. Our study thus shows that temporally regulated mRNA degradation is fundamental for the correct expression of a majority of cycling genes. SLAM‐Drop‐seq, combined with Eskrate, is a powerful approach to understanding the underlying mRNA kinetics of single‐cell gene expression dynamics in continuous biological processes.
Collapse
Affiliation(s)
- Haiyue Liu
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Roberto Arsiè
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Daniel Schwabe
- Mathematical Cell Physiology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Marcel Schilling
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
- Lübeck Interdisciplinary Platform for Genome Analytics (LIGA), University of Lübeck, Lübeck, Germany
| | - Igor Minia
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Jonathan Alles
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Anastasiya Boltengagen
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Christine Kocks
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Martin Falcke
- Mathematical Cell Physiology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
- Department of Physics, Humboldt University Berlin, Berlin, Germany
| | - Nir Friedman
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
- The Lautenberg Center for Immunology and Cancer Research, Institute of Medical Research Israel-Canada (IMRIC), Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel
- The Center for Computational Medicine, Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Markus Landthaler
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany.
- Institut für Biologie, Humboldt Universität zu Berlin, Berlin, Germany.
| | - Nikolaus Rajewsky
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany.
- Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany.
- German Center for Cardiovascular Research (DZHK), Berlin, Germany.
- NeuroCure Cluster of Excellence, Berlin, Germany.
- German Cancer Consortium (DKTK), Berlin, Germany.
- National Center for Tumor Diseases (NCT), Berlin, Germany.
| |
Collapse
|
44
|
Wang P, Paquet ÉR, Robert C. Comprehensive transcriptomic analysis of long non-coding RNAs in bovine ovarian follicles and early embryos. PLoS One 2023; 18:e0291761. [PMID: 37725621 PMCID: PMC10508637 DOI: 10.1371/journal.pone.0291761] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Accepted: 09/05/2023] [Indexed: 09/21/2023] Open
Abstract
Long non-coding RNAs (lncRNAs) have been the subject of numerous studies over the past decade. First thought to come from aberrant transcriptional events, lncRNAs are now considered a crucial component of the genome with roles in multiple cellular functions. However, the functional annotation and characterization of bovine lncRNAs during early development remain limited. In this comprehensive analysis, we review lncRNAs expression in bovine ovarian follicles and early embryos, based on a unique database comprising 468 microarray hybridizations from a single platform designed to target 7,724 lncRNA transcripts, of which 5,272 are intergenic (lincRNA), 958 are intronic, and 1,524 are antisense (lncNAT). Compared to translated mRNA, lncRNAs have been shown to be more tissue-specific and expressed in low copy numbers. This analysis revealed that protein-coding genes and lncRNAs are both expressed more in oocytes. Differences between the oocyte and the 2-cell embryo are also more apparent in terms of lncRNAs than mRNAs. Co-expression network analysis using WGCNA generated 25 modules with differing proportions of lncRNAs. The modules exhibiting a higher proportion of lncRNAs were found to be associated with fewer annotated mRNAs and housekeeping functions. Functional annotation of co-expressed mRNAs allowed attribution of lncRNAs to a wide array of key cellular events such as meiosis, translation initiation, immune response, and mitochondrial related functions. We thus provide evidence that lncRNAs play diverse physiological roles that are tissue-specific and associated with key cellular functions alongside mRNAs in bovine ovarian follicles and early embryos. This contributes to add lncRNAs as active molecules in the complex regulatory networks driving folliculogenesis, oogenesis and early embryogenesis all of which are necessary for reproductive success.
Collapse
Affiliation(s)
- Pengmin Wang
- Département des sciences animales, Faculté des sciences de l’agriculture et de l’alimentation, Université Laval, Québec City, Québec, Canada
| | - Éric R. Paquet
- Département des sciences animales, Faculté des sciences de l’agriculture et de l’alimentation, Université Laval, Québec City, Québec, Canada
| | - Claude Robert
- Département des sciences animales, Faculté des sciences de l’agriculture et de l’alimentation, Université Laval, Québec City, Québec, Canada
| |
Collapse
|
45
|
Wang L, Zheng Y, Sun Y, Mao S, Li H, Bo X, Li C, Chen H. TimeTalk uses single-cell RNA-seq datasets to decipher cell-cell communication during early embryo development. Commun Biol 2023; 6:901. [PMID: 37660148 PMCID: PMC10475079 DOI: 10.1038/s42003-023-05283-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Accepted: 08/24/2023] [Indexed: 09/04/2023] Open
Abstract
Early embryonic development is a dynamic process that relies on proper cell-cell communication to form a correctly patterned embryo. Early embryo development-related ligand-receptor pairs (eLRs) have been shown to guide cell fate decisions and morphogenesis. However, the scope of eLRs and their influence on early embryo development remain elusive. Here, we developed a computational framework named TimeTalk from integrated public time-course mouse scRNA-seq datasets to decipher the secret of eLRs. Extensive validations and analyses were performed to ensure the involvement of identified eLRs in early embryo development. Process analysis identified that eLRs could be divided into six temporal windows corresponding to sequential events in the early embryo development process. With the interpolation strategy, TimeTalk is powerful in revealing paracrine settings and studying cell-cell communication during early embryo development. Furthermore, by using TimeTalk in the blastocyst and blastoid models, we found that the blastoid models share the core communication pathways with the epiblast and primitive endoderm lineages in the blastocysts. This result suggests that TimeTalk has transferability to other bio-dynamic processes. We also curated eLRs recognized by TimeTalk, which may provide valuable clues for understanding early embryo development and relevant disorders.
Collapse
Affiliation(s)
- Longteng Wang
- Peking University-Tsinghua University-National Institute of Biological Sciences Joint Graduate Program, School of Life Sciences, Peking University, Beijing, 100871, China
- Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
| | - Yang Zheng
- Institute of Health Service and Transfusion Medicine, Beijing, 100850, China
| | - Yu Sun
- Institute of Health Service and Transfusion Medicine, Beijing, 100850, China
| | - Shulin Mao
- Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
- Yuanpei College, Peking University, Beijing, 100871, China
| | - Hao Li
- Institute of Health Service and Transfusion Medicine, Beijing, 100850, China
| | - Xiaochen Bo
- Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China
| | - Cheng Li
- Center for Bioinformatics, School of Life Sciences, Center for Statistical Science, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, 100871, China.
| | - Hebing Chen
- Institute of Health Service and Transfusion Medicine, Beijing, 100850, China.
| |
Collapse
|
46
|
Lin KZ, Zhang NR. Quantifying common and distinct information in single-cell multimodal data with Tilted Canonical Correlation Analysis. Proc Natl Acad Sci U S A 2023; 120:e2303647120. [PMID: 37523521 PMCID: PMC10410705 DOI: 10.1073/pnas.2303647120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 06/24/2023] [Indexed: 08/02/2023] Open
Abstract
Multimodal single-cell technologies profile multiple modalities for each cell simultaneously, enabling a more thorough characterization of cell populations. Existing dimension-reduction methods for multimodal data capture the "union of information," producing a lower-dimensional embedding that combines the information across modalities. While these tools are useful, we focus on a fundamentally different task of separating and quantifying the information among cells that is shared between the two modalities as well as unique to only one modality. Hence, we develop Tilted Canonical Correlation Analysis (Tilted-CCA), a method that decomposes a paired multimodal dataset into three lower-dimensional embeddings-one embedding captures the "intersection of information," representing the geometric relations among the cells that is common to both modalities, while the remaining two embeddings capture the "distinct information for a modality," representing the modality-specific geometric relations. We analyze single-cell multimodal datasets sequencing RNA along surface antibodies (i.e., CITE-seq) as well as RNA alongside chromatin accessibility (i.e., 10x) for blood cells and developing neurons via Tilted-CCA. These analyses show that Tilted-CCA enables meaningful visualization and quantification of the cross-modal information. Finally, Tilted-CCA's framework allows us to perform two specific downstream analyses. First, for single-cell datasets that simultaneously profile transcriptome and surface antibody markers, we show that Tilted-CCA helps design the target antibody panel to complement the transcriptome best. Second, for developmental single-cell datasets that simultaneously profile transcriptome and chromatin accessibility, we show that Tilted-CCA helps identify development-informative genes and distinguish between transient versus terminal cell types.
Collapse
Affiliation(s)
- Kevin Z. Lin
- Department of Statistics and Data Science, University of Pennsylvania, Philadelphia, PA19104
| | - Nancy R. Zhang
- Department of Statistics and Data Science, University of Pennsylvania, Philadelphia, PA19104
| |
Collapse
|
47
|
Vadnala RN, Hannenhalli S, Narlikar L, Siddharthan R. Transcription factors organize into functional groups on the linear genome and in 3D chromatin. Heliyon 2023; 9:e18211. [PMID: 37520992 PMCID: PMC10382302 DOI: 10.1016/j.heliyon.2023.e18211] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 07/11/2023] [Accepted: 07/11/2023] [Indexed: 08/01/2023] Open
Abstract
Transcription factors (TFs) and their binding sites have evolved to interact cooperatively or competitively with each other. Here we examine in detail, across multiple cell lines, such cooperation or competition among TFs both in sequential and spatial proximity (using chromatin conformation capture assays), considering in vivo binding data as well as TF binding motifs in DNA. We ascertain significantly co-occurring ("attractive") or avoiding ("repulsive") TF pairs using robust randomized models that retain the essential characteristics of the experimental data. Across human cell lines TFs organize into two groups, with intra-group attraction and inter-group repulsion. This is true for both sequential and spatial proximity, and for both in vivo binding and sequence motifs. Attractive TF pairs exhibit significantly more physical interactions suggesting an underlying mechanism. The two TF groups differ significantly in their genomic and network properties, as well in their function-while one group regulates housekeeping function, the other potentially regulates lineage-specific functions, that are disrupted in cancer. Weaker binding sites tend to occur in spatially interacting regions of the genome. Our results suggest that a complex pattern of spatial cooperativity of TFs and chromatin has evolved with the genome to support housekeeping and lineage-specific functions.
Collapse
Affiliation(s)
- Rakesh Netha Vadnala
- The Institute of Mathematical Sciences, Chennai, India
- Homi Bhabha National Institute, Mumbai, India
| | | | - Leelavati Narlikar
- Department of Data Science, Indian Institute of Science Education and Research, Pune, India
| | - Rahul Siddharthan
- The Institute of Mathematical Sciences, Chennai, India
- Homi Bhabha National Institute, Mumbai, India
| |
Collapse
|
48
|
Hu Qian S, Shi MW, Wang DY, Fear JM, Chen L, Tu YX, Liu HS, Zhang Y, Zhang SJ, Yu SS, Oliver B, Chen ZX. Integrating massive RNA-seq data to elucidate transcriptome dynamics in Drosophila melanogaster. Brief Bioinform 2023; 24:bbad177. [PMID: 37232385 PMCID: PMC10505420 DOI: 10.1093/bib/bbad177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 04/19/2023] [Accepted: 04/20/2023] [Indexed: 05/27/2023] Open
Abstract
The volume of ribonucleic acid (RNA)-seq data has increased exponentially, providing numerous new insights into various biological processes. However, due to significant practical challenges, such as data heterogeneity, it is still difficult to ensure the quality of these data when integrated. Although some quality control methods have been developed, sample consistency is rarely considered and these methods are susceptible to artificial factors. Here, we developed MassiveQC, an unsupervised machine learning-based approach, to automatically download and filter large-scale high-throughput data. In addition to the read quality used in other tools, MassiveQC also uses the alignment and expression quality as model features. Meanwhile, it is user-friendly since the cutoff is generated from self-reporting and is applicable to multimodal data. To explore its value, we applied MassiveQC to Drosophila RNA-seq data and generated a comprehensive transcriptome atlas across 28 tissues from embryogenesis to adulthood. We systematically characterized fly gene expression dynamics and found that genes with high expression dynamics were likely to be evolutionarily young and expressed at late developmental stages, exhibiting high nonsynonymous substitution rates and low phenotypic severity, and they were involved in simple regulatory programs. We also discovered that human and Drosophila had strong positive correlations in gene expression in orthologous organs, revealing the great potential of the Drosophila system for studying human development and disease.
Collapse
Affiliation(s)
- Sheng Hu Qian
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Meng-Wei Shi
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Dan-Yang Wang
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Justin M Fear
- Section of Developmental Genomics, National Institute of Diabetes and Kidney and Digestive Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Lu Chen
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Yi-Xuan Tu
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Hong-Shan Liu
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Yuan Zhang
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Shuai-Jie Zhang
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Shan-Shan Yu
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
| | - Brian Oliver
- Section of Developmental Genomics, National Institute of Diabetes and Kidney and Digestive Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Zhen-Xia Chen
- Hubei Hongshan Laboratory, College of Biomedicine and Health, Huazhong Agricultural University, Wuhan 430070, China
- Section of Developmental Genomics, National Institute of Diabetes and Kidney and Digestive Diseases, National Institutes of Health, Bethesda, MD 20892, USA
- Hubei Key Laboratory of Agricultural Bioinformatics, College of Life Science and Technology, Huazhong Agricultural University, Wuhan 430070, China
- Interdisciplinary Sciences Institute, Huazhong Agricultural University, Wuhan 430070, China
- Shenzhen Institute of Nutrition and Health, Huazhong Agricultural University, Shenzhen 518000, China
- Shenzhen Branch, Guangdong Laboratory for Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518000, China
| |
Collapse
|
49
|
Li Y, Yi Y, Lv J, Gao X, Yu Y, Babu S, Bruno I, Zhao D, Xia B, Peng W, Zhu J, Chen H, Zhang L, Cao Q, Chen K. Low RNA stability signifies increased post-transcriptional regulation of cell identity genes. Nucleic Acids Res 2023; 51:6020-6038. [PMID: 37125636 PMCID: PMC10325912 DOI: 10.1093/nar/gkad300] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Revised: 04/04/2023] [Accepted: 04/11/2023] [Indexed: 05/02/2023] Open
Abstract
Cell identity genes are distinct from other genes with respect to the epigenetic mechanisms to activate their transcription, e.g. by super-enhancers and broad H3K4me3 domains. However, it remains unclear whether their post-transcriptional regulation is also unique. We performed a systematic analysis of transcriptome-wide RNA stability in nine cell types and found that unstable transcripts were enriched in cell identity-related pathways while stable transcripts were enriched in housekeeping pathways. Joint analyses of RNA stability and chromatin state revealed significant enrichment of super-enhancers and broad H3K4me3 domains at the gene loci of unstable transcripts. Intriguingly, the RNA m6A methyltransferase, METTL3, preferentially binds to chromatin at super-enhancers, broad H3K4me3 domains and their associated genes. METTL3 binding intensity is positively correlated with RNA m6A methylation and negatively correlated with RNA stability of cell identity genes, probably due to co-transcriptional m6A modifications promoting RNA decay. Nanopore direct RNA-sequencing showed that METTL3 knockdown has a stronger effect on RNA m6A and mRNA stability for cell identity genes. Our data suggest a run-and-brake model, where cell identity genes undergo both frequent transcription and fast RNA decay to achieve precise regulation of RNA expression.
Collapse
Affiliation(s)
- Yanqiang Li
- Basic and Translational Research Division, Department of Cardiology, Boston Children's Hospital, Boston, MA 02115, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA 02115, USA
- Houston Methodist Research Institute, The Methodist Hospital System, Houston, TX 77030, USA
| | - Yang Yi
- Department of Urology, Robert H. Lurie Comprehensive Cancer Center, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, USA
| | - Jie Lv
- Houston Methodist Research Institute, The Methodist Hospital System, Houston, TX 77030, USA
| | - Xinlei Gao
- Basic and Translational Research Division, Department of Cardiology, Boston Children's Hospital, Boston, MA 02115, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA 02115, USA
- Houston Methodist Research Institute, The Methodist Hospital System, Houston, TX 77030, USA
| | - Yang Yu
- Basic and Translational Research Division, Department of Cardiology, Boston Children's Hospital, Boston, MA 02115, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA 02115, USA
| | - Sahana Suresh Babu
- Houston Methodist Research Institute, The Methodist Hospital System, Houston, TX 77030, USA
| | - Ivone Bruno
- Houston Methodist Research Institute, The Methodist Hospital System, Houston, TX 77030, USA
| | - Dongyu Zhao
- Basic and Translational Research Division, Department of Cardiology, Boston Children's Hospital, Boston, MA 02115, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA 02115, USA
- Houston Methodist Research Institute, The Methodist Hospital System, Houston, TX 77030, USA
| | - Bo Xia
- Houston Methodist Research Institute, The Methodist Hospital System, Houston, TX 77030, USA
| | - Weiqun Peng
- Department of Physics, The George Washington University, Washington, DC 20052, USA
| | - Jun Zhu
- Systems Biology Center, National Heart Lung and Blood Institute, NIH, Bethesda, MD 20892, USA
| | - Hong Chen
- Vascular Biology Program, Boston Children's Hospital, Boston, MA 02115, USA
- Department of Surgery, Harvard Medical School, Boston, MA 02115, USA
| | - Lili Zhang
- Basic and Translational Research Division, Department of Cardiology, Boston Children's Hospital, Boston, MA 02115, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA 02115, USA
- Houston Methodist Research Institute, The Methodist Hospital System, Houston, TX 77030, USA
| | - Qi Cao
- Department of Urology, Robert H. Lurie Comprehensive Cancer Center, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, USA
| | - Kaifu Chen
- Basic and Translational Research Division, Department of Cardiology, Boston Children's Hospital, Boston, MA 02115, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA 02115, USA
- Houston Methodist Research Institute, The Methodist Hospital System, Houston, TX 77030, USA
- Broad Institute of MIT and Harvard, Boston, MA 02115, USA
- Dana-Farber/Harvard Cancer Center, Boston, MA 02115, USA
| |
Collapse
|
50
|
Wolf S, Melo D, Garske KM, Pallares LF, Lea AJ, Ayroles JF. Characterizing the landscape of gene expression variance in humans. PLoS Genet 2023; 19:e1010833. [PMID: 37410774 DOI: 10.1371/journal.pgen.1010833] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Accepted: 06/15/2023] [Indexed: 07/08/2023] Open
Abstract
Gene expression variance has been linked to organismal function and fitness but remains a commonly neglected aspect of molecular research. As a result, we lack a comprehensive understanding of the patterns of transcriptional variance across genes, and how this variance is linked to context-specific gene regulation and gene function. Here, we use 57 large publicly available RNA-seq data sets to investigate the landscape of gene expression variance. These studies cover a wide range of tissues and allowed us to assess if there are consistently more or less variable genes across tissues and data sets and what mechanisms drive these patterns. We show that gene expression variance is broadly similar across tissues and studies, indicating that the pattern of transcriptional variance is consistent. We use this similarity to create both global and within-tissue rankings of variation, which we use to show that function, sequence variation, and gene regulatory signatures contribute to gene expression variance. Low-variance genes are associated with fundamental cell processes and have lower levels of genetic polymorphisms, have higher gene-gene connectivity, and tend to be associated with chromatin states associated with transcription. In contrast, high-variance genes are enriched for genes involved in immune response, environmentally responsive genes, immediate early genes, and are associated with higher levels of polymorphisms. These results show that the pattern of transcriptional variance is not noise. Instead, it is a consistent gene trait that seems to be functionally constrained in human populations. Furthermore, this commonly neglected aspect of molecular phenotypic variation harbors important information to understand complex traits and disease.
Collapse
Affiliation(s)
- Scott Wolf
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Diogo Melo
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey, United States of America
| | - Kristina M Garske
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Luisa F Pallares
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey, United States of America
- Friedrich Miescher Laboratory of the Max Planck Society, Tübingen, Germany
| | - Amanda J Lea
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
- Child and Brain Development, Canadian Institute for Advanced Research, Toronto, Canada
| | - Julien F Ayroles
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey, United States of America
| |
Collapse
|