1
|
Morgan D, DeMeo DL, Glass K. Using methylation data to improve transcription factor binding prediction. Epigenetics 2024; 19:2309826. [PMID: 38300850 PMCID: PMC10841018 DOI: 10.1080/15592294.2024.2309826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Accepted: 01/01/2024] [Indexed: 02/03/2024] Open
Abstract
Modelling the regulatory mechanisms that determine cell fate, response to external perturbation, and disease state depends on measuring many factors, a task made more difficult by the plasticity of the epigenome. Scanning the genome for the sequence patterns defined by Position Weight Matrices (PWM) can be used to estimate transcription factor (TF) binding locations. However, this approach does not incorporate information regarding the epigenetic context necessary for TF binding. CpG methylation is an epigenetic mark influenced by environmental factors that is commonly assayed in human cohort studies. We developed a framework to score inferred TF binding locations using methylation data. We intersected motif locations identified using PWMs with methylation information captured in both whole-genome bisulfite sequencing and Illumina EPIC array data for six cell lines, scored motif locations based on these data, and compared with experimental data characterizing TF binding (ChIP-seq). We found that for most TFs, binding prediction improves using methylation-based scoring compared to standard PWM-scores. We also illustrate that our approach can be generalized to infer TF binding when methylation information is only proximally available, i.e. measured for nearby CpGs that do not directly overlap with a motif location. Overall, our approach provides a framework for inferring context-specific TF binding using methylation data. Importantly, the availability of DNA methylation data in existing patient populations provides an opportunity to use our approach to understand the impact of methylation on gene regulatory processes in the context of human disease.
Collapse
Affiliation(s)
- Daniel Morgan
- Channing Division of Network Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
| | - Dawn L. DeMeo
- Channing Division of Network Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
| | - Kimberly Glass
- Channing Division of Network Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, USA
- Department of Biostatistics, Harvard Chan School of Public Health, Boston, MA, USA
| |
Collapse
|
2
|
Huo Q, Song R, Ma Z. Recent advances in exploring transcriptional regulatory landscape of crops. FRONTIERS IN PLANT SCIENCE 2024; 15:1421503. [PMID: 38903438 PMCID: PMC11188431 DOI: 10.3389/fpls.2024.1421503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Accepted: 05/23/2024] [Indexed: 06/22/2024]
Abstract
Crop breeding entails developing and selecting plant varieties with improved agronomic traits. Modern molecular techniques, such as genome editing, enable more efficient manipulation of plant phenotype by altering the expression of particular regulatory or functional genes. Hence, it is essential to thoroughly comprehend the transcriptional regulatory mechanisms that underpin these traits. In the multi-omics era, a large amount of omics data has been generated for diverse crop species, including genomics, epigenomics, transcriptomics, proteomics, and single-cell omics. The abundant data resources and the emergence of advanced computational tools offer unprecedented opportunities for obtaining a holistic view and profound understanding of the regulatory processes linked to desirable traits. This review focuses on integrated network approaches that utilize multi-omics data to investigate gene expression regulation. Various types of regulatory networks and their inference methods are discussed, focusing on recent advancements in crop plants. The integration of multi-omics data has been proven to be crucial for the construction of high-confidence regulatory networks. With the refinement of these methodologies, they will significantly enhance crop breeding efforts and contribute to global food security.
Collapse
Affiliation(s)
| | | | - Zeyang Ma
- State Key Laboratory of Maize Bio-breeding, Frontiers Science Center for Molecular Design Breeding, Joint International Research Laboratory of Crop Molecular Breeding, National Maize Improvement Center, College of Agronomy and Biotechnology, China Agricultural University, Beijing, China
| |
Collapse
|
3
|
Kuraz Abebe B, Wang J, Guo J, Wang H, Li A, Zan L. A review of the role of epigenetic studies for intramuscular fat deposition in beef cattle. Gene 2024; 908:148295. [PMID: 38387707 DOI: 10.1016/j.gene.2024.148295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 01/23/2024] [Accepted: 02/15/2024] [Indexed: 02/24/2024]
Abstract
Intramuscular fat (IMF) deposition profoundly influences meat quality and economic value in beef cattle production. Meanwhile, contemporary developments in epigenetics have opened new outlooks for understanding the molecular basics of IMF regulation, and it has become a key area of research for world scholars. Therefore, the aim of this paper was to provide insight and synthesis into the intricate relationship between epigenetic mechanisms and IMF deposition in beef cattle. The methodology involves a thorough analysis of existing literature, including pertinent books, academic journals, and online resources, to provide a comprehensive overview of the role of epigenetic studies in IMF deposition in beef cattle. This review summarizes the contemporary studies in epigenetic mechanisms in IMF regulation, high-resolution epigenomic mapping, single-cell epigenomics, multi-omics integration, epigenome editing approaches, longitudinal studies in cattle growth, environmental epigenetics, machine learning in epigenetics, ethical and regulatory considerations, and translation to industry practices from perspectives of IMF deposition in beef cattle. Moreover, this paper highlights DNA methylation, histone modifications, acetylation, phosphorylation, ubiquitylation, non-coding RNAs, DNA hydroxymethylation, epigenetic readers, writers, and erasers, chromatin immunoprecipitation followed by sequencing, whole genome bisulfite sequencing, epigenome-wide association studies, and their profound impact on the expression of crucial genes governing adipogenesis and lipid metabolism. Nutrition and stress also have significant influences on epigenetic modifications and IMF deposition. The key findings underscore the pivotal role of epigenetic studies in understanding and enhancing IMF deposition in beef cattle, with implications for precision livestock farming and ethical livestock management. In conclusion, this review highlights the crucial significance of epigenetic pathways and environmental factors in affecting IMF deposition in beef cattle, providing insightful information for improving the economics and meat quality of cattle production.
Collapse
Affiliation(s)
- Belete Kuraz Abebe
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China; Department of Animal Science, Werabe University, P.O. Box 46, Werabe, Ethiopia
| | - Jianfang Wang
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China
| | - Juntao Guo
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China
| | - Hongbao Wang
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China
| | - Anning Li
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China
| | - Linsen Zan
- College of Animal Science and Technology, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China; National Beef Cattle Improvement Center, Northwest A&F University, Yangling, Shaanxi 712100, People's Republic of China.
| |
Collapse
|
4
|
Liufu C, Luo L, Pang T, Zheng H, Yang L, Lu L, Chang S. Integration of multi-omics summary data reveals the role of N6-methyladenosine in neuropsychiatric disorders. Mol Psychiatry 2024:10.1038/s41380-024-02574-w. [PMID: 38684796 DOI: 10.1038/s41380-024-02574-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 04/18/2024] [Accepted: 04/19/2024] [Indexed: 05/02/2024]
Abstract
N6-methyladenosine (m6A) methylation regulates gene expression/protein by influencing numerous aspects of mRNA metabolism and contributes to neuropsychiatric diseases. Here, we integrated multi-omics data and genome-wide association study summary data of schizophrenia (SCZ), bipolar disorder (BP), attention deficit hyperactivity disorder (ADHD), autism spectrum disorder (ASD), major depressive disorder (MDD), Alzheimer's disease (AD), and Parkinson's disease (PD) to reveal the role of m6A in neuropsychiatric disorders by using transcriptome-wide association study (TWAS) tool and Summary-data-based Mendelian randomization (SMR). Our investigation identified 86 m6A sites associated with seven neuropsychiatric diseases and then revealed 7881 associations between m6A sites and gene expressions. Based on these results, we discovered 916 significant m6A-gene associations involving 82 disease-related m6A sites and 606 genes. Further integrating the 58 disease-related genes from TWAS and SMR analysis, we obtained 61, 8, 7, 3, and 2 associations linking m6A-disease, m6A-gene, and gene-disease for SCZ, BP, AD, MDD, and PD separately. Functional analysis showed the m6A mapped genes were enriched in "response to stimulus" pathway. In addition, we also analyzed the effect of gene expression on m6A and the post-transcription effect of m6A on protein. Our study provided new insights into the genetic component of m6A in neuropsychiatric disorders and unveiled potential pathogenic mechanisms where m6A exerts influences on disease through gene expression/protein regulation.
Collapse
Affiliation(s)
- Chao Liufu
- Peking University Sixth Hospital, Peking University Institute of Mental Health, NHC Key Laboratory of Mental Health (Peking University), National Clinical Research Center for Mental Disorders (Peking University Sixth Hospital), Beijing, 100191, China
| | - Lingxue Luo
- Peking University Sixth Hospital, Peking University Institute of Mental Health, NHC Key Laboratory of Mental Health (Peking University), National Clinical Research Center for Mental Disorders (Peking University Sixth Hospital), Beijing, 100191, China
| | - Tao Pang
- Peking University Sixth Hospital, Peking University Institute of Mental Health, NHC Key Laboratory of Mental Health (Peking University), National Clinical Research Center for Mental Disorders (Peking University Sixth Hospital), Beijing, 100191, China
| | - Haohao Zheng
- Peking University Sixth Hospital, Peking University Institute of Mental Health, NHC Key Laboratory of Mental Health (Peking University), National Clinical Research Center for Mental Disorders (Peking University Sixth Hospital), Beijing, 100191, China
| | - Li Yang
- Peking University Sixth Hospital, Peking University Institute of Mental Health, NHC Key Laboratory of Mental Health (Peking University), National Clinical Research Center for Mental Disorders (Peking University Sixth Hospital), Beijing, 100191, China
| | - Lin Lu
- Peking University Sixth Hospital, Peking University Institute of Mental Health, NHC Key Laboratory of Mental Health (Peking University), National Clinical Research Center for Mental Disorders (Peking University Sixth Hospital), Beijing, 100191, China
- Research Units of Diagnosis and Treatment of Mood Cognitive Disorder, Chinese Academy of Medical Sciences, Beijing, 100191, China
| | - Suhua Chang
- Peking University Sixth Hospital, Peking University Institute of Mental Health, NHC Key Laboratory of Mental Health (Peking University), National Clinical Research Center for Mental Disorders (Peking University Sixth Hospital), Beijing, 100191, China.
- Research Units of Diagnosis and Treatment of Mood Cognitive Disorder, Chinese Academy of Medical Sciences, Beijing, 100191, China.
| |
Collapse
|
5
|
De Marzio M, Glass K, Kuijjer ML. Single-sample network modeling on omics data. BMC Biol 2023; 21:296. [PMID: 38155351 PMCID: PMC10755944 DOI: 10.1186/s12915-023-01783-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Accepted: 11/27/2023] [Indexed: 12/30/2023] Open
Affiliation(s)
- Margherita De Marzio
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medicine School, Boston, MA, USA
| | - Kimberly Glass
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medicine School, Boston, MA, USA.
- Biostatistics Department, Harvard Chan School of Public Health, Boston, MA, USA.
| | - Marieke L Kuijjer
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, Oslo, Norway.
| |
Collapse
|
6
|
Kucharski R, Ellis N, Jurkowski TP, Hurd PJ, Maleszka R. The PWWP domain and the evolution of unique DNA methylation toolkits in Hymenoptera. iScience 2023; 26:108193. [PMID: 37920666 PMCID: PMC10618690 DOI: 10.1016/j.isci.2023.108193] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Revised: 08/11/2023] [Accepted: 10/10/2023] [Indexed: 11/04/2023] Open
Abstract
DNMT3 in Hymenoptera has a unique duplication of the essential PWWP domain. Using GST-tagged PWWP fusion proteins and histone arrays we show that these domains have gained new properties and represent the first case of PWWP domains binding to H3K27 chromatin modifications, including H3K27me3, a key modification that is important during development. Phylogenetic analyses of 107 genomes indicate that the duplicated PWWP domains separated into two sister clades, and their distinct binding capacities are supported by 3D modeling. Other features of this unique DNA methylation system include variable copies, losses, and duplications of DNMT1 and DNMT3, and combinatorial generations of DNMT3 isoforms including variants missing the catalytic domain. Some of these losses and duplications of are found only in parasitic wasps. We discuss our findings in the context of the crosstalk between DNA methylation and histone methylation, and the expanded potential of epigenomic modifications in Hymenoptera to drive evolutionary novelties.
Collapse
Affiliation(s)
- Robert Kucharski
- Research School of Biology, The Australian National University, Canberra, ACT 2601, Australia
| | - Nancy Ellis
- School of Biological & Behavioural Sciences, Queen Mary University of London, London, UK
| | | | - Paul J. Hurd
- School of Biological & Behavioural Sciences, Queen Mary University of London, London, UK
| | - Ryszard Maleszka
- Research School of Biology, The Australian National University, Canberra, ACT 2601, Australia
| |
Collapse
|
7
|
Badia-I-Mompel P, Wessels L, Müller-Dott S, Trimbour R, Ramirez Flores RO, Argelaguet R, Saez-Rodriguez J. Gene regulatory network inference in the era of single-cell multi-omics. Nat Rev Genet 2023; 24:739-754. [PMID: 37365273 DOI: 10.1038/s41576-023-00618-5] [Citation(s) in RCA: 48] [Impact Index Per Article: 48.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/12/2023] [Indexed: 06/28/2023]
Abstract
The interplay between chromatin, transcription factors and genes generates complex regulatory circuits that can be represented as gene regulatory networks (GRNs). The study of GRNs is useful to understand how cellular identity is established, maintained and disrupted in disease. GRNs can be inferred from experimental data - historically, bulk omics data - and/or from the literature. The advent of single-cell multi-omics technologies has led to the development of novel computational methods that leverage genomic, transcriptomic and chromatin accessibility information to infer GRNs at an unprecedented resolution. Here, we review the key principles of inferring GRNs that encompass transcription factor-gene interactions from transcriptomics and chromatin accessibility data. We focus on the comparison and classification of methods that use single-cell multimodal data. We highlight challenges in GRN inference, in particular with respect to benchmarking, and potential further developments using additional data modalities.
Collapse
Affiliation(s)
- Pau Badia-I-Mompel
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | - Lorna Wessels
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
- Department of Vascular Biology and Tumor Angiogenesis, European Center for Angioscience, Medical Faculty, MannHeim Heidelberg University, Mannheim, Germany
| | - Sophia Müller-Dott
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | - Rémi Trimbour
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
- Institut Pasteur, Université Paris Cité, CNRS UMR 3738, Machine Learning for Integrative Genomics Group, Paris, France
| | - Ricardo O Ramirez Flores
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany
| | | | - Julio Saez-Rodriguez
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg, Germany.
| |
Collapse
|
8
|
Recto K, Kachroo P, Huan T, Van Den Berg D, Lee GY, Bui H, Lee DH, Gereige J, Yao C, Hwang SJ, Joehanes R, Weiss ST, O'Connor GT, Levy D, DeMeo DL. Epigenome-wide DNA methylation association study of circulating IgE levels identifies novel targets for asthma. EBioMedicine 2023; 95:104758. [PMID: 37598461 PMCID: PMC10462855 DOI: 10.1016/j.ebiom.2023.104758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 08/01/2023] [Accepted: 08/02/2023] [Indexed: 08/22/2023] Open
Abstract
BACKGROUND Identifying novel epigenetic signatures associated with serum immunoglobulin E (IgE) may improve our understanding of molecular mechanisms underlying asthma and IgE-mediated diseases. METHODS We performed an epigenome-wide association study using whole blood from Framingham Heart Study (FHS; n = 3,471, 46% females) participants and validated results using the Childhood Asthma Management Program (CAMP; n = 674, 39% females) and the Genetic Epidemiology of Asthma in Costa Rica Study (CRA; n = 787, 41% females). Using the closest gene to each IgE-associated CpG, we highlighted biologically plausible pathways underlying IgE regulation and analyzed the transcription patterns linked to IgE-associated CpGs (expression quantitative trait methylation loci; eQTMs). Using prior UK Biobank summary data from genome-wide association studies of asthma and allergy, we performed Mendelian randomization (MR) for causal inference testing using the IgE-associated CpGs from FHS with methylation quantitative trait loci (mQTLs) as instrumental variables. FINDINGS We identified 490 statistically significant differentially methylated CpGs associated with IgE in FHS, of which 193 (39.3%) replicated in CAMP and CRA (FDR < 0.05). Gene ontology analysis revealed enrichment in pathways related to transcription factor binding, asthma, and other immunological processes. eQTM analysis identified 124 cis-eQTMs for 106 expressed genes (FDR < 0.05). MR in combination with drug-target analysis revealed CTSB and USP20 as putatively causal regulators of IgE levels (Bonferroni adjusted P < 7.94E-04) that can be explored as potential therapeutic targets. INTERPRETATION By integrating eQTM and MR analyses in general and clinical asthma populations, our findings provide a deeper understanding of the multidimensional inter-relations of DNA methylation, gene expression, and IgE levels. FUNDING US NIH/NHLBI grants: P01HL132825, K99HL159234. N01-HC-25195 and HHSN268201500001I.
Collapse
Affiliation(s)
- Kathryn Recto
- The Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; The Framingham Heart Study, Framingham, MA 01702, USA
| | - Priyadarshini Kachroo
- Brigham and Women's Hospital, Channing Division of Network Medicine, Boston, MA 02115, USA
| | - Tianxiao Huan
- The Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; The Framingham Heart Study, Framingham, MA 01702, USA
| | - David Van Den Berg
- University of Southern California Methylation Characterization Center, University of Southern California, Los Angeles, CA 90033, USA
| | - Gha Young Lee
- The Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; The Framingham Heart Study, Framingham, MA 01702, USA
| | - Helena Bui
- The Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; The Framingham Heart Study, Framingham, MA 01702, USA
| | - Dong Heon Lee
- The Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; The Framingham Heart Study, Framingham, MA 01702, USA
| | - Jessica Gereige
- Boston University School of Medicine, Pulmonary Center, Boston, MA 02118, USA
| | - Chen Yao
- The Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; The Framingham Heart Study, Framingham, MA 01702, USA
| | - Shih-Jen Hwang
- The Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; The Framingham Heart Study, Framingham, MA 01702, USA
| | - Roby Joehanes
- The Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; The Framingham Heart Study, Framingham, MA 01702, USA
| | - Scott T Weiss
- Brigham and Women's Hospital, Channing Division of Network Medicine, Boston, MA 02115, USA
| | - George T O'Connor
- The Framingham Heart Study, Framingham, MA 01702, USA; Boston University School of Medicine, Pulmonary Center, Boston, MA 02118, USA
| | - Daniel Levy
- The Population Sciences Branch, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD 20892, USA; The Framingham Heart Study, Framingham, MA 01702, USA.
| | - Dawn L DeMeo
- Brigham and Women's Hospital, Channing Division of Network Medicine, Boston, MA 02115, USA.
| |
Collapse
|
9
|
Ben Guebila M, Wang T, Lopes-Ramos CM, Fanfani V, Weighill D, Burkholz R, Schlauch D, Paulson JN, Altenbuchinger M, Shutta KH, Sonawane AR, Lim J, Calderer G, van IJzendoorn DGP, Morgan D, Marin A, Chen CY, Song Q, Saha E, DeMeo DL, Padi M, Platig J, Kuijjer ML, Glass K, Quackenbush J. The Network Zoo: a multilingual package for the inference and analysis of gene regulatory networks. Genome Biol 2023; 24:45. [PMID: 36894939 PMCID: PMC9999668 DOI: 10.1186/s13059-023-02877-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Accepted: 02/15/2023] [Indexed: 03/11/2023] Open
Abstract
Inference and analysis of gene regulatory networks (GRNs) require software that integrates multi-omic data from various sources. The Network Zoo (netZoo; netzoo.github.io) is a collection of open-source methods to infer GRNs, conduct differential network analyses, estimate community structure, and explore the transitions between biological states. The netZoo builds on our ongoing development of network methods, harmonizing the implementations in various computing languages and between methods to allow better integration of these tools into analytical pipelines. We demonstrate the utility using multi-omic data from the Cancer Cell Line Encyclopedia. We will continue to expand the netZoo to incorporate additional methods.
Collapse
Affiliation(s)
- Marouen Ben Guebila
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Tian Wang
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Present Address: Biology Department, Boston College, Chestnut Hill, MA, USA
| | - Camila M Lopes-Ramos
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Viola Fanfani
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Des Weighill
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Present Address: Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Rebekka Burkholz
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Present Address: CISPA Helmholtz Center for Information Security, Saarbrücken, Germany
| | - Daniel Schlauch
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Present Address: Genospace, LLC, Boston, MA, USA
| | - Joseph N Paulson
- Department of Biochemistry and Molecular Biology, Pennsylvania State University College of Medicine, Hershey, PA, USA
| | - Michael Altenbuchinger
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Present Address: Department of Medical Bioinformatics, University Medical Center Göttingen, Göttingen, Germany
| | - Katherine H Shutta
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Abhijeet R Sonawane
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Present Address: Center for Interdisciplinary Cardiovascular Sciences, Division of Cardiovascular Medicine, Department of Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - James Lim
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ, USA
- Present Address: Monoceros Biosystems, LLC, San Diego, CA, USA
| | - Genis Calderer
- Center for Molecular Medicine Norway, Nordic EMBL Partnership, University of Oslo, Oslo, Norway
| | - David G P van IJzendoorn
- Department of Pathology, Leiden University Medical Center, Leiden, The Netherlands
- Present Address: Department of Pathology, Stanford University School of Medicine, Palo Alto, CA, USA
| | - Daniel Morgan
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Present Address: School of Biomedical Sciences, Hong Kong University, Pokfulam, Hong Kong
| | | | - Cho-Yi Chen
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Dana-Farber Cancer Institute, Boston, MA, USA
- Present Address: Institute of Biomedical Informatics, National Yang Ming Chiao Tung University, Taipei, 112, Taiwan
| | - Qi Song
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Present Address: Computational Biology Department, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Enakshi Saha
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Dawn L DeMeo
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Megha Padi
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ, USA
| | - John Platig
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Marieke L Kuijjer
- Center for Molecular Medicine Norway, Nordic EMBL Partnership, University of Oslo, Oslo, Norway
- Department of Pathology, Leiden University Medical Center, Leiden, The Netherlands
- Leiden Center for Computational Oncology, Leiden University, Leiden, The Netherlands
| | - Kimberly Glass
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - John Quackenbush
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA.
- Dana-Farber Cancer Institute, Boston, MA, USA.
| |
Collapse
|
10
|
Computational approaches to understand transcription regulation in development. Biochem Soc Trans 2023; 51:1-12. [PMID: 36695505 PMCID: PMC9988001 DOI: 10.1042/bst20210145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 01/07/2023] [Accepted: 01/13/2023] [Indexed: 01/26/2023]
Abstract
Gene regulatory networks (GRNs) serve as useful abstractions to understand transcriptional dynamics in developmental systems. Computational prediction of GRNs has been successfully applied to genome-wide gene expression measurements with the advent of microarrays and RNA-sequencing. However, these inferred networks are inaccurate and mostly based on correlative rather than causative interactions. In this review, we highlight three approaches that significantly impact GRN inference: (1) moving from one genome-wide functional modality, gene expression, to multi-omics, (2) single cell sequencing, to measure cell type-specific signals and predict context-specific GRNs, and (3) neural networks as flexible models. Together, these experimental and computational developments have the potential to significantly impact the quality of inferred GRNs. Ultimately, accurately modeling the regulatory interactions between transcription factors and their target genes will be essential to understand the role of transcription factors in driving developmental gene expression programs and to derive testable hypotheses for validation.
Collapse
|
11
|
Ochoa S, Hernández-Lemus E. Functional impact of multi-omic interactions in breast cancer subtypes. Front Genet 2023; 13:1078609. [PMID: 36685900 PMCID: PMC9850112 DOI: 10.3389/fgene.2022.1078609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Accepted: 12/15/2022] [Indexed: 01/07/2023] Open
Abstract
Multi-omic approaches are expected to deliver a broader molecular view of cancer. However, the promised mechanistic explanations have not quite settled yet. Here, we propose a theoretical and computational analysis framework to semi-automatically produce network models of the regulatory constraints influencing a biological function. This way, we identified functions significantly enriched on the analyzed omics and described associated features, for each of the four breast cancer molecular subtypes. For instance, we identified functions sustaining over-representation of invasion-related processes in the basal subtype and DNA modification processes in the normal tissue. We found limited overlap on the omics-associated functions between subtypes; however, a startling feature intersection within subtype functions also emerged. The examples presented highlight new, potentially regulatory features, with sound biological reasons to expect a connection with the functions. Multi-omic regulatory networks thus constitute reliable models of the way omics are connected, demonstrating a capability for systematic generation of mechanistic hypothesis.
Collapse
Affiliation(s)
- Soledad Ochoa
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico,Programa de Doctorado en Ciencias Biomédicas, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Enrique Hernández-Lemus
- Computational Genomics Division, National Institute of Genomic Medicine, Mexico City, Mexico,Center for Complexity Sciences, Universidad Nacional Autónoma de México, Mexico City, Mexico,*Correspondence: Enrique Hernández-Lemus,
| |
Collapse
|
12
|
Sonawane AR, Aikawa E, Aikawa M. Connections for Matters of the Heart: Network Medicine in Cardiovascular Diseases. Front Cardiovasc Med 2022; 9:873582. [PMID: 35665246 PMCID: PMC9160390 DOI: 10.3389/fcvm.2022.873582] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Accepted: 04/19/2022] [Indexed: 01/18/2023] Open
Abstract
Cardiovascular diseases (CVD) are diverse disorders affecting the heart and vasculature in millions of people worldwide. Like other fields, CVD research has benefitted from the deluge of multiomics biomedical data. Current CVD research focuses on disease etiologies and mechanisms, identifying disease biomarkers, developing appropriate therapies and drugs, and stratifying patients into correct disease endotypes. Systems biology offers an alternative to traditional reductionist approaches and provides impetus for a comprehensive outlook toward diseases. As a focus area, network medicine specifically aids the translational aspect of in silico research. This review discusses the approach of network medicine and its application to CVD research.
Collapse
Affiliation(s)
- Abhijeet Rajendra Sonawane
- Center for Interdisciplinary Cardiovascular Sciences, Division of Cardiovascular Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, United States
- Center for Excellence in Vascular Biology, Division of Cardiovascular Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, United States
| | - Elena Aikawa
- Center for Interdisciplinary Cardiovascular Sciences, Division of Cardiovascular Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, United States
- Center for Excellence in Vascular Biology, Division of Cardiovascular Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, United States
| | - Masanori Aikawa
- Center for Interdisciplinary Cardiovascular Sciences, Division of Cardiovascular Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, United States
- Center for Excellence in Vascular Biology, Division of Cardiovascular Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, United States
| |
Collapse
|
13
|
Amit G, Vaknin Ben Porath D, Levy O, Hamdi O, Bashan A. Global coordination level in single-cell transcriptomic data. Sci Rep 2022; 12:7547. [PMID: 35534606 PMCID: PMC9085802 DOI: 10.1038/s41598-022-11507-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Accepted: 03/31/2022] [Indexed: 11/26/2022] Open
Abstract
Genes are linked by underlying regulatory mechanisms and by jointly implementing biological functions, working in coordination to apply different tasks in the cells. Assessing the coordination level between genes from single-cell transcriptomic data, without a priori knowledge of the map of gene regulatory interactions, is a challenge. A ‘top-down’ approach has recently been developed to analyze single-cell transcriptomic data by evaluating the global coordination level between genes (called GCL). Here, we systematically analyze the performance of the GCL in typical scenarios of single-cell RNA sequencing (scRNA-seq) data. We show that an individual anomalous cell can have a disproportionate effect on the GCL calculated over a cohort of cells. In addition, we demonstrate how the GCL is affected by the presence of clusters, which are very common in scRNA-seq data. Finally, we analyze the effect of the sampling size of the Jackknife procedure on the GCL statistics. The manuscript is accompanied by a description of a custom-built Python package for calculating the GCL. These results provide practical guidelines for properly pre-processing and applying the GCL measure in transcriptional data.
Collapse
|
14
|
Wu S, Zhou T, Tian T. A robust method for designing multistable systems by embedding bistable subsystems. NPJ Syst Biol Appl 2022; 8:10. [PMID: 35338169 PMCID: PMC8956579 DOI: 10.1038/s41540-022-00220-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Accepted: 02/15/2022] [Indexed: 12/21/2022] Open
Abstract
Although multistability is an important dynamic property of a wide range of complex systems, it is still a challenge to develop mathematical models for realising high order multistability using realistic regulatory mechanisms. To address this issue, we propose a robust method to develop multistable mathematical models by embedding bistable models together. Using the GATA1-GATA2-PU.1 module in hematopoiesis as the test system, we first develop a tristable model based on two bistable models without any high cooperative coefficients, and then modify the tristable model based on experimentally determined mechanisms. The modified model successfully realises four stable steady states and accurately reflects a recent experimental observation showing four transcriptional states. In addition, we develop a stochastic model, and stochastic simulations successfully realise the experimental observations in single cells. These results suggest that the proposed method is a general approach to develop mathematical models for realising multistability and heterogeneity in complex systems.
Collapse
Affiliation(s)
- Siyuan Wu
- School of Mathematics, Monash University, Melbourne, VIC, Australia
| | - Tianshou Zhou
- School of Mathematics and Statistics, Sun Yet-Sen University, Guangzhou, China
| | - Tianhai Tian
- School of Mathematics, Monash University, Melbourne, VIC, Australia.
| |
Collapse
|
15
|
Guebila MB, Morgan DC, Glass K, Kuijjer ML, DeMeo DL, Quackenbush J. gpuZoo: Cost-effective estimation of gene regulatory networks using the Graphics Processing Unit. NAR Genom Bioinform 2022; 4:lqac002. [PMID: 35156023 PMCID: PMC8826808 DOI: 10.1093/nargab/lqac002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Revised: 12/28/2021] [Accepted: 02/02/2022] [Indexed: 11/14/2022] Open
Abstract
Gene regulatory network inference allows for the modeling of genome-scale regulatory processes that are altered during development, in disease, and in response to perturbations. Our group has developed a collection of tools to model various regulatory processes, including transcriptional (PANDA, SPIDER) and post-transcriptional (PUMA) gene regulation, as well as gene regulation in individual samples (LIONESS). These methods work by postulating a network structure and then optimizing that structure to be consistent with multiple lines of biological evidence through repeated operations on data matrices. Although our methods are widely used, the corresponding computational complexity, and the associated costs and run times, do limit some applications. To improve the cost/time performance of these algorithms, we developed gpuZoo which implements GPU-accelerated calculations, dramatically improving the performance of these algorithms. The runtime of the gpuZoo implementation in MATLAB and Python is up to 61 times faster and 28 times less expensive than multi-core CPU implementation of the same methods. gpuZoo is available in MATLAB through the netZooM package https://github.com/netZoo/netZooM and in Python through the netZooPy package https://github.com/netZoo/netZooPy.
Collapse
Affiliation(s)
- Marouen Ben Guebila
- Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA
| | - Daniel C Morgan
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Kimberly Glass
- Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Marieke L Kuijjer
- Centre for Molecular Medicine Norway (NCMM), Nordic EMBL Partnership, University of Oslo, Oslo, Norway
- Department of Pathology, Leiden University Medical Center, Leiden, The Netherlands
- Leiden Center for Computational Oncology, Leiden University Medical Center, Leiden, The Netherlands
| | - Dawn L DeMeo
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - John Quackenbush
- Department of Biostatistics, Harvard School of Public Health, Boston, MA, USA
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| |
Collapse
|