1
|
Noh K, Choi H, Jo EH, Yoo W, Park KC. Role of SYT11 in human pan-cancer using comprehensive approaches. Eur J Med Res 2024; 29:338. [PMID: 38890718 PMCID: PMC11186215 DOI: 10.1186/s40001-024-01931-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 06/07/2024] [Indexed: 06/20/2024] Open
Abstract
BACKGROUND Synaptotagmin 11 (SYT11) plays a pivotal role in neuronal vesicular trafficking and exocytosis. However, no independent prognostic studies have focused on various cancers. In this study, we aimed to summarize the clinical significance and molecular landscape of SYT11 in various tumor types. METHODS Using several available public databases, we investigated abnormal SYT11 expression in different tumor types and its potential clinical association with prognosis, methylation profiling, immune infiltration, gene enrichment analysis, and protein-protein interaction analysis, and identified common pathways. RESULTS TCGA and Genotype-Tissue Expression (GTEx) showed that SYT11 was widely expressed across tumor and corresponding normal tissues. Survival analysis showed that SYT11 expression correlated with the prognosis of seven cancer types. Additionally, SYT11 mRNA expression was not affected by promoter methylation, but regulated by certain miRNAs and associated with cancer patient prognosis. In vitro experiments further verified a negative correlation between the expression of SYT11 and miR-19a-3p in human colorectal, lung, and renal cancer cell lines. Moreover, aberrant SYT11 expression was significantly associated with immune infiltration. Pathway enrichment analysis revealed that the biological and molecular processes of SYT11 were related to clathrin-mediated endocytosis, Rho GTPase signaling, and cell motility-related functions. CONCLUSIONS Our results provide a clear understanding of the role of SYT11 in various cancer types and suggest that SYT11 may be of prognostic and clinical significance.
Collapse
Affiliation(s)
- Kyunghee Noh
- Bionanotechnology Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea
- Department of Nanobiotechnology, University of Science and Technology (UST), Daejeon, 34141, Republic of Korea
| | - Hyunji Choi
- Personalized Genomic Medicine Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea
| | - Eun-Hye Jo
- Personalized Genomic Medicine Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea
| | - Wonbeak Yoo
- Personalized Genomic Medicine Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea.
| | - Kyung Chan Park
- Personalized Genomic Medicine Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea.
- Department of Functional Genomics, University of Science and Technology (UST), Daejeon, 34113, Republic of Korea.
| |
Collapse
|
2
|
Bolteau M, Chebouba L, David L, Bourdon J, Guziolowski C. Boolean Network Models of Human Preimplantation Development. J Comput Biol 2024; 31:513-523. [PMID: 38814745 DOI: 10.1089/cmb.2024.0517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2024] Open
Abstract
Single-cell transcriptomic studies of differentiating systems allow meaningful understanding, especially in human embryonic development and cell fate determination. We present an innovative method aimed at modeling these intricate processes by leveraging scRNAseq data from various human developmental stages. Our implemented method identifies pseudo-perturbations, since actual perturbations are unavailable due to ethical and technical constraints. By integrating these pseudo-perturbations with prior knowledge of gene interactions, our framework generates stage-specific Boolean networks (BNs). We apply our method to medium and late trophectoderm developmental stages and identify 20 pseudo-perturbations required to infer BNs. The resulting BN families delineate distinct regulatory mechanisms, enabling the differentiation between these developmental stages. We show that our program outperforms existing pseudo-perturbation identification tool. Our framework contributes to comprehending human developmental processes and holds potential applicability to diverse developmental stages and other research scenarios.
Collapse
Affiliation(s)
- Mathieu Bolteau
- Nantes Université, École Centrale Nantes, CNRS, LS2N, UMR 6004, F-44000, Nantes, France
| | - Lokmane Chebouba
- Department of Electronics, University of Frères Mentouri Constantine 1, Constantine, Algeria
- LRIA Laboratory, University of Science and Technology Houari Boumediene (USTHB), Bab-Ezzouar, Algeria
| | - Laurent David
- Nantes Université, CHU Nantes, INSERM, Center for Research in Transplantation and Translational Immunology, UMR 1064, F-44000, Nantes, France
| | - Jérémie Bourdon
- Nantes Université, École Centrale Nantes, CNRS, LS2N, UMR 6004, F-44000, Nantes, France
| | - Carito Guziolowski
- Nantes Université, École Centrale Nantes, CNRS, LS2N, UMR 6004, F-44000, Nantes, France
| |
Collapse
|
3
|
Inoue Y, Lee H, Fu T, Luna A. drGAT: Attention-Guided Gene Assessment of Drug Response Utilizing a Drug-Cell-Gene Heterogeneous Network. ARXIV 2024:arXiv:2405.08979v1. [PMID: 38800657 PMCID: PMC11118660] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Drug development is a lengthy process with a high failure rate. Increasingly, machine learning is utilized to facilitate the drug development processes. These models aim to enhance our understanding of drug characteristics, including their activity in biological contexts. However, a major challenge in drug response (DR) prediction is model interpretability as it aids in the validation of findings. This is important in biomedicine, where models need to be understandable in comparison with established knowledge of drug interactions with proteins. drGAT, a graph deep learning model, leverages a heterogeneous graph composed of relationships between proteins, cell lines, and drugs. drGAT is designed with two objectives: DR prediction as a binary sensitivity prediction and elucidation of drug mechanism from attention coefficients. drGAT has demonstrated superior performance over existing models, achieving 78% accuracy (and precision), and 76% F1 score for 269 DNA-damaging compounds of the NCI60 drug response dataset. To assess the model's interpretability, we conducted a review of drug-gene co-occurrences in Pubmed abstracts in comparison to the top 5 genes with the highest attention coefficients for each drug. We also examined whether known relationships were retained in the model by inspecting the neighborhoods of topoisomerase-related drugs. For example, our model retained TOP1 as a highly weighted predictive feature for irinotecan and topotecan, in addition to other genes that could potentially be regulators of the drugs. Our method can be used to accurately predict sensitivity to drugs and may be useful in the identification of biomarkers relating to the treatment of cancer patients.
Collapse
Affiliation(s)
- Yoshitaka Inoue
- Department of Computer Science and Engineering, University of Minnesota
- Computational Biology Branch, National Library of Medicine
| | - Hunmin Lee
- Department of Computer Science and Engineering, University of Minnesota
| | - Tianfan Fu
- Computer Science Department, Rensselaer Polytechnic Institute
| | - Augustin Luna
- Computational Biology Branch, National Library of Medicine
- Developmental Therapeutics Branch, National Cancer Institute
| |
Collapse
|
4
|
Wright SN, Colton S, Schaffer LV, Pillich RT, Churas C, Pratt D, Ideker T. State of the Interactomes: an evaluation of molecular networks for generating biological insights. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.26.587073. [PMID: 38746239 PMCID: PMC11092493 DOI: 10.1101/2024.04.26.587073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Advancements in genomic and proteomic technologies have powered the use of gene and protein networks ("interactomes") for understanding genotype-phenotype translation. However, the proliferation of interactomes complicates the selection of networks for specific applications. Here, we present a comprehensive evaluation of 46 current human interactomes, encompassing protein-protein interactions as well as gene regulatory, signaling, colocalization, and genetic interaction networks. Our analysis shows that large composite networks such as HumanNet, STRING, and FunCoup are most effective for identifying disease genes, while smaller networks such as DIP and SIGNOR demonstrate strong interaction prediction performance. These findings provide a benchmark for interactomes across diverse network biology applications and clarify factors that influence network performance. Furthermore, our evaluation pipeline paves the way for continued assessment of emerging and updated interaction networks in the future.
Collapse
|
5
|
Swindell WR. Meta-analysis of differential gene expression in lower motor neurons isolated by laser capture microdissection from post-mortem ALS spinal cords. Front Genet 2024; 15:1385114. [PMID: 38689650 PMCID: PMC11059082 DOI: 10.3389/fgene.2024.1385114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2024] [Accepted: 04/03/2024] [Indexed: 05/02/2024] Open
Abstract
Introduction ALS is a fatal neurodegenerative disease for which underlying mechanisms are incompletely understood. The motor neuron is a central player in ALS pathogenesis but different transcriptome signatures have been derived from bulk analysis of post-mortem tissue and iPSC-derived motor neurons (iPSC-MNs). Methods This study performed a meta-analysis of six gene expression studies (microarray and RNA-seq) in which laser capture microdissection (LCM) was used to isolate lower motor neurons from post-mortem spinal cords of ALS and control (CTL) subjects. Differentially expressed genes (DEGs) with consistent ALS versus CTL expression differences across studies were identified. Results The analysis identified 222 ALS-increased DEGs (FDR <0.10, SMD >0.80) and 278 ALS-decreased DEGs (FDR <0.10, SMD < -0.80). ALS-increased DEGs were linked to PI3K-AKT signaling, innate immunity, inflammation, motor neuron differentiation and extracellular matrix. ALS-decreased DEGs were associated with the ubiquitin-proteosome system, microtubules, axon growth, RNA-binding proteins and synaptic membrane. ALS-decreased DEG mRNAs frequently interacted with RNA-binding proteins (e.g., FUS, HuR). The complete set of DEGs (increased and decreased) overlapped significantly with genes near ALS-associated SNP loci (p < 0.01). Transcription factor target motifs with increased proximity to ALS-increased DEGs were identified, most notably DNA elements predicted to interact with forkhead transcription factors (e.g., FOXP1) and motor neuron and pancreas homeobox 1 (MNX1). Some of these DNA elements overlie ALS-associated SNPs within known enhancers and are predicted to have genotype-dependent MNX1 interactions. DEGs were compared to those identified from SOD1-G93A mice and bulk spinal cord segments or iPSC-MNs from ALS patients. There was good correspondence with transcriptome changes from SOD1-G93A mice (r ≤ 0.408) but most DEGs were not differentially expressed in bulk spinal cords or iPSC-MNs and transcriptome-wide effect size correlations were weak (bulk tissue: r ≤ 0.207, iPSC-MN: r ≤ 0.037). Conclusion This study defines a robust transcriptome signature from LCM-based motor neuron studies of post-mortem tissue from ALS and CTL subjects. This signature differs from those obtained from analysis of bulk spinal cord segments and iPSC-MNs. Results provide insight into mechanisms underlying gene dysregulation in ALS and highlight connections between these mechanisms, ALS genetics, and motor neuron biology.
Collapse
Affiliation(s)
- William R. Swindell
- Department of Internal Medicine, Division of Hospital Medicine, University of Texas Southwestern Medical Center, Dallas, TX, United States
| |
Collapse
|
6
|
Yashar WM, Estabrook J, Holly HD, Somers J, Nikolova O, Babur Ö, Braun TP, Demir E. Predicting transcription factor activity using prior biological information. iScience 2024; 27:109124. [PMID: 38455978 PMCID: PMC10918219 DOI: 10.1016/j.isci.2024.109124] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 10/20/2023] [Accepted: 01/31/2024] [Indexed: 03/09/2024] Open
Abstract
Dysregulation of normal transcription factor activity is a common driver of disease. Therefore, the detection of aberrant transcription factor activity is important to understand disease pathogenesis. We have developed Priori, a method to predict transcription factor activity from RNA sequencing data. Priori has two key advantages over existing methods. First, Priori utilizes literature-supported regulatory information to identify transcription factor-target gene relationships. It then applies linear models to determine the impact of transcription factor regulation on the expression of its target genes. Second, results from a third-party benchmarking pipeline reveals that Priori detects aberrant activity from 124 single-gene perturbation experiments with higher sensitivity and specificity than 11 other methods. We applied Priori and other top-performing methods to predict transcription factor activity from two large primary patient datasets. Our work demonstrates that Priori uniquely discovered significant determinants of survival in breast cancer and identified mediators of drug response in leukemia.
Collapse
Affiliation(s)
- William M. Yashar
- Knight Cancer Institute, Oregon Health & Science University, Portland, OR 97239, USA
- Division of Oncologic Sciences, Department of Medicine, Oregon Health & Science University, Portland, OR 97239, USA
- Department of Biomedical Engineering, Oregon Health & Science University, Portland, OR 97239, USA
| | - Joseph Estabrook
- Knight Cancer Institute, Oregon Health & Science University, Portland, OR 97239, USA
- Department of Molecular and Medical Genetics, Oregon Health and Science University, 3181 SW Sam Jackson Park Road, Portland, OR 97239, USA
- Cancer Early Detection Advanced Research Center, Oregon Health & Science University, Portland, OR 97239, USA
| | - Hannah D. Holly
- Department of Molecular and Medical Genetics, Oregon Health and Science University, 3181 SW Sam Jackson Park Road, Portland, OR 97239, USA
- Cancer Early Detection Advanced Research Center, Oregon Health & Science University, Portland, OR 97239, USA
| | - Julia Somers
- Department of Molecular and Medical Genetics, Oregon Health and Science University, 3181 SW Sam Jackson Park Road, Portland, OR 97239, USA
- Cancer Early Detection Advanced Research Center, Oregon Health & Science University, Portland, OR 97239, USA
| | - Olga Nikolova
- Knight Cancer Institute, Oregon Health & Science University, Portland, OR 97239, USA
- Division of Oncologic Sciences, Department of Medicine, Oregon Health & Science University, Portland, OR 97239, USA
- Department of Biomedical Engineering, Oregon Health & Science University, Portland, OR 97239, USA
| | - Özgün Babur
- Computer Science Department, University of Massachusetts, Boston, MA 02125, USA
| | - Theodore P. Braun
- Knight Cancer Institute, Oregon Health & Science University, Portland, OR 97239, USA
- Division of Oncologic Sciences, Department of Medicine, Oregon Health & Science University, Portland, OR 97239, USA
- Division of Hematology & Medical Oncology, Department of Medicine, Oregon Health & Science University, Portland, OR 97239, USA
| | - Emek Demir
- Knight Cancer Institute, Oregon Health & Science University, Portland, OR 97239, USA
- Division of Oncologic Sciences, Department of Medicine, Oregon Health & Science University, Portland, OR 97239, USA
- Department of Molecular and Medical Genetics, Oregon Health and Science University, 3181 SW Sam Jackson Park Road, Portland, OR 97239, USA
- Pacific Northwest National Laboratories, Richland, WA 99354, USA
| |
Collapse
|
7
|
Zhang Z, Wang S, Jiang L, Wei J, Lu C, Li S, Diao Y, Fang Z, He S, Tan T, Yang Y, Zou K, Shi J, Lin J, Chen L, Bao C, Fei J, Fang H. Priority index for critical Covid-19 identifies clinically actionable targets and drugs. Commun Biol 2024; 7:189. [PMID: 38366110 PMCID: PMC10873402 DOI: 10.1038/s42003-024-05897-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2023] [Accepted: 02/07/2024] [Indexed: 02/18/2024] Open
Abstract
While genome-wide studies have identified genomic loci in hosts associated with life-threatening Covid-19 (critical Covid-19), the challenge of resolving these loci hinders further identification of clinically actionable targets and drugs. Building upon our previous success, we here present a priority index solution designed to address this challenge, generating the target and drug resource that consists of two indexes: the target index and the drug index. The primary purpose of the target index is to identify clinically actionable targets by prioritising genes associated with Covid-19. We illustrate the validity of the target index by demonstrating its ability to identify pre-existing Covid-19 phase-III drug targets, with the majority of these targets being found at the leading prioritisation (leading targets). These leading targets have their evolutionary origins in Amniota ('four-leg vertebrates') and are predominantly involved in cytokine-cytokine receptor interactions and JAK-STAT signaling. The drug index highlights opportunities for repurposing clinically approved JAK-STAT inhibitors, either individually or in combination. This proposed strategic focus on the JAK-STAT pathway is supported by the active pursuit of therapeutic agents targeting this pathway in ongoing phase-II/III clinical trials for Covid-19.
Collapse
Affiliation(s)
- Zhiqiang Zhang
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Shan Wang
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Lulu Jiang
- Translational Health Sciences, University of Bristol, Bristol, BS1 3NY, UK
| | - Jianwen Wei
- Network and Information Center, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Chang Lu
- MRC London Institute of Medical Sciences, Imperial College London, London, W12 0HS, UK
| | - Shengli Li
- Precision Research Center for Refractory Diseases, Institute for Clinical Research, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 201620, China
| | - Yizhu Diao
- College of Finance and Statistics, Hunan University, Changsha, 410079, Hunan, China
| | - Zhongcheng Fang
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Shuo He
- College of Health Science and Technology, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Tingting Tan
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Yisheng Yang
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
| | - Kexin Zou
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Jiantao Shi
- Key Laboratory of RNA Science and Engineering, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science, Chinese Academy of Sciences, Shanghai, 200031, China
| | - James Lin
- Network and Information Center, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Liye Chen
- Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, OX3 7LD, UK.
| | - Chaohui Bao
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China.
- Department of General Surgery, Ruijin Hospital Luwan Branch, Shanghai Jiao Tong University School of Medicine, Shanghai, 200020, China.
| | - Jian Fei
- Department of General Surgery, Ruijin Hospital Luwan Branch, Shanghai Jiao Tong University School of Medicine, Shanghai, 200020, China.
- Department of General Surgery, Pancreatic Disease Center, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China.
| | - Hai Fang
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China.
| |
Collapse
|
8
|
Ogbodo AK, Mustafov D, Arora M, Lambrou GI, Braoudaki M, Siddiqui SS. Analysis of SIGLEC12 expression, immunomodulation and prognostic value in renal cancer using multiomic databases. Heliyon 2024; 10:e24286. [PMID: 38268823 PMCID: PMC10803920 DOI: 10.1016/j.heliyon.2024.e24286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2023] [Revised: 11/30/2023] [Accepted: 01/05/2024] [Indexed: 01/26/2024] Open
Abstract
Siglecs belong to a family of immune regulatory receptors predominantly found on hematopoietic cells. They interact with Sia, resulting in the activation or inhibition of the immune response. Previous reports have suggested that the SIGLEC12 gene, which encodes the Siglec-XII protein, is expressed in the epithelial tissues and upregulated in carcinomas. However, studies deciphering the role of Siglec-XII in renal cancer (RC) are still unavailable, and here we provide insights on this question. We conducted expression analysis using the Human Protein Atlas and UALCAN databases. The impact of SIGLEC12 on RC prognosis was determined using the KM plotter, and an assessment of immune infiltration with SIGLEC12 was performed using the TIMER database. GSEA was conducted to identify the pathways affected by SIGLEC12. Finally, using GeneMania, we identified Siglec-XII interacting proteins. Our findings indicated that macrophages express SIGLEC12 in the kidney. Furthermore, we hypothesize that Siglec-XII expression might be involved in the increase of primary RC, but this effect may not be dependent on the age of the patient. In the tumour microenvironment, oncogenic pathways appeared to be upregulated by SIGLEC12. Similarly, our analysis suggested that SIGLEC12-related kidney renal papillary cell carcinomas may be more suitable for targeted immunotherapy, such as CTLA-4 and PD-1/PD-L1 inhibitors. These preliminary results suggested that high expression of SIGLEC12 is associated with poor prognosis for RC. Future studies to assess its clinical utility are necessitated.
Collapse
Affiliation(s)
- Amobichukwu K. Ogbodo
- School of Life and Medical Sciences, University of Hertfordshire, College Lane Campus, Hatfield AL10 9AB, United Kingdom
- #Current Address: Nuffield Department of Population Health, University of Oxford, Richard Doll Building, Old Road Campus, Oxford OX3 7LF, United Kingdom
| | - Denis Mustafov
- School of Life and Medical Sciences, University of Hertfordshire, College Lane Campus, Hatfield AL10 9AB, United Kingdom
- College of Health, Medicine, and Life Science, Brunel University London UB8 3PH, United Kingdom
| | - Mohit Arora
- Department of Biochemistry, All India Institute of Medical Sciences, New Delhi 110029, India
| | - George I. Lambrou
- Choremeio Research Laboratory, First Department of Paediatrics, School of Medicine, National and Kapodistrian University of Athens, Athens, Greece, Thivon & Levadeias 8, 11527, Goudi, Athens, Greece
- University Research Institute of Maternal and Child Health & Precision Medicine, National and Kapodistrian University of Athens, Thivon & Levadeias 8, 11527 Athens, Greece
| | - Maria Braoudaki
- School of Life and Medical Sciences, University of Hertfordshire, College Lane Campus, Hatfield AL10 9AB, United Kingdom
| | - Shoib S. Siddiqui
- School of Life and Medical Sciences, University of Hertfordshire, College Lane Campus, Hatfield AL10 9AB, United Kingdom
| |
Collapse
|
9
|
Chang X, Yan S, Zhang Y, Zhang Y, Li L, Gao Z, Lin X, Chi X. GINv2.0: a comprehensive topological network integrating molecular interactions from multiple knowledge bases. NPJ Syst Biol Appl 2024; 10:4. [PMID: 38218959 PMCID: PMC10787761 DOI: 10.1038/s41540-024-00330-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2023] [Accepted: 01/02/2024] [Indexed: 01/15/2024] Open
Abstract
Knowledge bases have been instrumental in advancing biological research, facilitating pathway analysis and data visualization, which are now widely employed in the scientific community. Despite the establishment of several prominent knowledge bases focusing on signaling, metabolic networks, or both, integrating these networks into a unified topological network has proven to be challenging. The intricacy of molecular interactions and the diverse formats employed to store and display them contribute to the complexity of this task. In a prior study, we addressed this challenge by introducing a "meta-pathway" structure that integrated the advantages of the Simple Interaction Format (SIF) while accommodating reaction information. Nevertheless, the earlier Global Integrative Network (GIN) was limited to reliance on KEGG alone. Here, we present GIN version 2.0, which incorporates human molecular interaction data from ten distinct knowledge bases, including KEGG, Reactome, and HumanCyc, among others. We standardized the data structure, gene IDs, and chemical IDs, and conducted a comprehensive analysis of the consistency among the ten knowledge bases before combining all unified interactions into GINv2.0. Utilizing GINv2.0, we investigated the glycolysis process and its regulatory proteins, revealing coordinated regulations on glycolysis and autophagy, particularly under glucose starvation. The expanded scope and enhanced capabilities of GINv2.0 provide a valuable resource for comprehensive systems-level analyses in the field of biological research. GINv2.0 can be accessed at: https://github.com/BIGchix/GINv2.0 .
Collapse
Affiliation(s)
- Xiao Chang
- Department of Dermatology and Venereal Disease, Xuan Wu Hospital, Beijing, 100053, China
| | - Shen Yan
- Agricultural Information Institute, Chinese Academy of Agricultural Science, Beijing, 100081, China
| | - Yizheng Zhang
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Yingchun Zhang
- Key Laboratory of Plant Molecular Physiology, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
| | - Luyang Li
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Zhanyu Gao
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing, 100101, China
- University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Xuefei Lin
- Department of Dermatology and Venereal Disease, Xuan Wu Hospital, Beijing, 100053, China
| | - Xu Chi
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences and China National Center for Bioinformation, Beijing, 100101, China.
| |
Collapse
|
10
|
Solomon M, Song B, Govindarajah V, Good S, Arasu A, Hinton EB, Thakkar K, Bartram J, Filippi MD, Cancelas JA, Salomonis N, Grimes HL, Reynaud D. Slow cycling and durable Flt3+ progenitors contribute to hematopoiesis under native conditions. J Exp Med 2024; 221:e20231035. [PMID: 37910046 PMCID: PMC10620607 DOI: 10.1084/jem.20231035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Revised: 09/18/2023] [Accepted: 10/13/2023] [Indexed: 11/03/2023] Open
Abstract
The dynamics of the hematopoietic flux responsible for blood cell production in native conditions remains a matter of debate. Using CITE-seq analyses, we uncovered a distinct progenitor population that displays a cell cycle gene signature similar to the one found in quiescent hematopoietic stem cells. We further determined that the CD62L marker can be used to phenotypically enrich this population in the Flt3+ multipotent progenitor (MPP4) compartment. Functional in vitro and in vivo analyses validated the heterogeneity of the MPP4 compartment and established the quiescent/slow-cycling properties of the CD62L- MPP4 cells. Furthermore, studies under native conditions revealed a novel hierarchical organization of the MPP compartments in which quiescent/slow-cycling MPP4 cells sustain a prolonged hematopoietic activity at steady-state while giving rise to other lineage-biased MPP populations. Altogether, our data characterize a durable and productive quiescent/slow-cycling hematopoietic intermediary within the MPP4 compartment and highlight early paths of progenitor differentiation during unperturbed hematopoiesis.
Collapse
Affiliation(s)
- Michael Solomon
- Division of Experimental Hematology and Cancer Biology, Stem Cell Program, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
| | - Baobao Song
- Division of Immunobiology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
- Immunology Graduate Program, University of Cincinnati, Cincinnati, OH, USA
| | - Vinothini Govindarajah
- Division of Experimental Hematology and Cancer Biology, Stem Cell Program, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
| | - Samantha Good
- Division of Experimental Hematology and Cancer Biology, Stem Cell Program, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
| | - Ashok Arasu
- Division of Experimental Hematology and Cancer Biology, Stem Cell Program, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
| | - E. Broderick Hinton
- Division of Experimental Hematology and Cancer Biology, Stem Cell Program, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
| | - Kairavee Thakkar
- Division of Biomedical Informatics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
| | - James Bartram
- Division of Experimental Hematology and Cancer Biology, Stem Cell Program, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
| | - Marie-Dominique Filippi
- Division of Experimental Hematology and Cancer Biology, Stem Cell Program, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA
| | - Jose A. Cancelas
- Division of Experimental Hematology and Cancer Biology, Stem Cell Program, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA
- Hoxworth Blood Center, University of Cincinnati College of Medicine, Cincinnati, OH, USA
| | - Nathan Salomonis
- Division of Biomedical Informatics, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA
| | - H. Leighton Grimes
- Division of Immunobiology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA
| | - Damien Reynaud
- Division of Experimental Hematology and Cancer Biology, Stem Cell Program, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA
- Department of Pediatrics, University of Cincinnati College of Medicine, Cincinnati, OH, USA
| |
Collapse
|
11
|
Mecham A, Stephenson A, Quinteros BI, Brown GS, Piccolo SR. TidyGEO: preparing analysis-ready datasets from Gene Expression Omnibus. J Integr Bioinform 2023; 0:jib-2023-0021. [PMID: 38047898 DOI: 10.1515/jib-2023-0021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Accepted: 11/20/2023] [Indexed: 12/05/2023] Open
Abstract
TidyGEO is a Web-based tool for downloading, tidying, and reformatting data series from Gene Expression Omnibus (GEO). As a freely accessible repository with data from over 6 million biological samples across more than 4000 organisms, GEO provides diverse opportunities for secondary research. Although scientists may find assay data relevant to a given research question, most analyses require sample-level annotations. In GEO, such annotations are stored alongside assay data in delimited, text-based files. However, the structure and semantics of the annotations vary widely from one series to another, and many annotations are not useful for analysis purposes. Thus, every GEO series must be tidied before it is analyzed. Manual approaches may be used, but these are error prone and take time away from other research tasks. Custom computer scripts can be written, but many scientists lack the computational expertise to create such scripts. To address these challenges, we created TidyGEO, which supports essential data-cleaning tasks for sample-level annotations, such as selecting informative columns, renaming columns, splitting or merging columns, standardizing data values, and filtering samples. Additionally, users can integrate annotations with assay data, restructure assay data, and generate code that enables others to reproduce these steps.
Collapse
Affiliation(s)
- Avery Mecham
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA
| | - Ashlie Stephenson
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA
| | - Badi I Quinteros
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA
| | - Grace S Brown
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA
| | - Stephen R Piccolo
- Department of Biology, Brigham Young University, Provo, UT, 84602, USA
| |
Collapse
|
12
|
Yoo W, Kim S, Noh K. SAMD13 serves as a useful prognostic biomarker for hepatocellular carcinoma. Eur J Med Res 2023; 28:514. [PMID: 37968735 PMCID: PMC10648382 DOI: 10.1186/s40001-023-01347-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Accepted: 09/06/2023] [Indexed: 11/17/2023] Open
Abstract
Hepatocellular carcinoma (HCC) is the most common form of liver cancer and the 5-year relative overall survival (OS) rate is less than 20%. Since there are no specific symptoms, most patients with HCC are diagnosed in an advanced stage with poor prognosis. Therefore, identifying novel prognostic biomarkers to improve the survival of patients with HCC is urgently needed. In the present study, we attempted to identify SAMD13 (Sterile Alpha Motif Domain-Containing Protein 13) as a novel biomarker associated with the prognosis of HCC using various bioinformatics tools. SAMD13 was found to be highly expressed pan-cancer; however, the SAMD13 expression was significantly correlated with the worst prognosis in HCC. Clinicopathological analysis revealed that SAMD13 upregulation was significantly associated with advanced HCC stage and high-grade tumor type. Simultaneously, high SAMD13 expression resulted in association with various immune markers in the immune cell subsets by TIMER databases and efficacy of immunotherapy. Methylation analysis showed SAMD13 was remarkably associated with prognosis. Furthermore, a six-hub gene signature associated with poor prognosis was correlated with the cell cycle, transcription, and epigenetic regulation and this analysis may support the connection between SAMD13 expression and drug-resistance. Our study illustrated the characteristics of SAMD13 role in patients with HCC using various bioinformatics tools and highlights its potential role as a therapeutic target and promising biomarker for prognosis in HCC.
Collapse
Affiliation(s)
- Wonbeak Yoo
- Personalized Genomic Medicine Research Center, Korea Research Institute of Bioscience and Biotechnology (KRIBB), Daejeon, 34141, Republic of Korea
| | - Seokho Kim
- Department of Health Sciences, The Graduate School of Dong-A University, Busan, 49315, Republic of Korea.
- Department of Medicinal Biotechnology, College of Health Sciences, Dong-A University, 37, Nakdong-daero 550 beon-gil, Saha-gu, Busan, 49315, Republic of Korea.
| | - KyungHee Noh
- Bionanotechnology Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, 34141, Republic of Korea.
- Department of Nanobiotechnology, University of Science and Technology (UST), Daejeon, 34141, Republic of Korea.
| |
Collapse
|
13
|
Hatano N, Kamada M, Kojima R, Okuno Y. Network-based prediction approach for cancer-specific driver missense mutations using a graph neural network. BMC Bioinformatics 2023; 24:383. [PMID: 37817080 PMCID: PMC10565986 DOI: 10.1186/s12859-023-05507-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Accepted: 10/02/2023] [Indexed: 10/12/2023] Open
Abstract
BACKGROUND In cancer genomic medicine, finding driver mutations involved in cancer development and tumor growth is crucial. Machine-learning methods to predict driver missense mutations have been developed because variants are frequently detected by genomic sequencing. However, even though the abnormalities in molecular networks are associated with cancer, many of these methods focus on individual variants and do not consider molecular networks. Here we propose a new network-based method, Net-DMPred, to predict driver missense mutations considering molecular networks. Net-DMPred consists of the graph part and the prediction part. In the graph part, molecular networks are learned by a graph neural network (GNN). The prediction part learns whether variants are driver variants using features of individual variants combined with the graph features learned in the graph part. RESULTS Net-DMPred, which considers molecular networks, performed better than conventional methods. Furthermore, the prediction performance differed by the molecular network structure used in learning, suggesting that it is important to consider not only the local network related to cancer but also the large-scale network in living organisms. CONCLUSIONS We propose a network-based machine learning method, Net-DMPred, for predicting cancer driver missense mutations. Our method enables us to consider the entire graph architecture representing the molecular network because it uses GNN. Net-DMPred is expected to detect driver mutations from a lot of missense mutations that are not known to be associated with cancer.
Collapse
Affiliation(s)
- Narumi Hatano
- Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Mayumi Kamada
- Graduate School of Medicine, Kyoto University, Kyoto, Japan.
| | - Ryosuke Kojima
- Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Yasushi Okuno
- Graduate School of Medicine, Kyoto University, Kyoto, Japan.
- HPC- and AI-driven Drug Development Platform Division, RIKEN Center for Computational Science(R-CCS), Kobe, Japan.
| |
Collapse
|
14
|
Piroeva KV, McDonald C, Xanthopoulos C, Fox C, Clarkson CT, Mallm JP, Vainshtein Y, Ruje L, Klett LC, Stilgenbauer S, Mertens D, Kostareli E, Rippe K, Teif VB. Nucleosome repositioning in chronic lymphocytic leukemia. Genome Res 2023; 33:1649-1661. [PMID: 37699659 PMCID: PMC10691546 DOI: 10.1101/gr.277298.122] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 09/07/2023] [Indexed: 09/14/2023]
Abstract
The location of nucleosomes in the human genome determines the primary chromatin structure and regulates access to regulatory regions. However, genome-wide information on deregulated nucleosome occupancy and its implications in primary cancer cells is scarce. Here, we conducted a genome-wide comparison of high-resolution nucleosome maps in peripheral blood B cells from patients with chronic lymphocytic leukemia (CLL) and healthy individuals at single-base-pair resolution. Our investigation uncovered significant changes of nucleosome positioning in CLL. Globally, the spacing between nucleosomes-the nucleosome repeat length (NRL)-is shortened in CLL. This effect is stronger in the more aggressive IGHV-unmutated CLL subtype than in the IGHV-mutated CLL subtype. Changes in nucleosome occupancy at specific sites are linked to active chromatin remodeling and reduced DNA methylation. Nucleosomes lost or gained in CLL marks differential binding of 3D chromatin organizers such as CTCF as well as immune response-related transcription factors and delineated mechanisms of epigenetic deregulation. The principal component analysis of nucleosome occupancy in cancer-specific regions allowed the classification of samples between cancer subtypes and normal controls. Furthermore, patients could be better assigned to CLL subtypes according to differential nucleosome occupancy than based on DNA methylation or gene expression. Thus, nucleosome positioning constitutes a novel readout to dissect molecular mechanisms of disease progression and to stratify patients. Furthermore, we anticipate that the global nucleosome repositioning detected in our study, such as changes in the NRL, can be exploited for liquid biopsy applications based on cell-free DNA to stratify patients and monitor disease progression.
Collapse
Affiliation(s)
- Kristan V Piroeva
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester CO4 3SQ, United Kingdom
| | - Charlotte McDonald
- Wellcome-Wolfson Institute for Experimental Medicine, Queen's University Belfast, Belfast BT9 7BL, United Kingdom
| | - Charalampos Xanthopoulos
- Wellcome-Wolfson Institute for Experimental Medicine, Queen's University Belfast, Belfast BT9 7BL, United Kingdom
| | - Chelsea Fox
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester CO4 3SQ, United Kingdom
| | - Christopher T Clarkson
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester CO4 3SQ, United Kingdom
| | - Jan-Philipp Mallm
- German Cancer Research Center (DKFZ) Heidelberg, Single Cell Open Lab, 69120 Heidelberg, Germany
- German Cancer Research Center (DKFZ) Heidelberg, Division of Chromatin Networks, 69120 Heidelberg, Germany
- Center for Quantitative Analysis of Molecular and Cellular Biosystems (BioQuant), Heidelberg University, 69120 Heidelberg, Germany
| | - Yevhen Vainshtein
- Fraunhofer-Institut für Grenzflächen- und Bioverfahrenstechnik IGB, 70569 Stuttgart, Germany
| | - Luminita Ruje
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester CO4 3SQ, United Kingdom
| | - Lara C Klett
- German Cancer Research Center (DKFZ) Heidelberg, Division of Chromatin Networks, 69120 Heidelberg, Germany
- Center for Quantitative Analysis of Molecular and Cellular Biosystems (BioQuant), Heidelberg University, 69120 Heidelberg, Germany
| | - Stephan Stilgenbauer
- Division of CLL, University Hospital Ulm, Department of Internal Medicine III, 89081 Ulm, Germany
| | - Daniel Mertens
- Division of CLL, University Hospital Ulm, Department of Internal Medicine III, 89081 Ulm, Germany
- German Cancer Research Center (DKFZ) Heidelberg, Cooperation Unit Mechanisms of Leukemogenesis, 69120 Heidelberg, Germany
| | - Efterpi Kostareli
- Wellcome-Wolfson Institute for Experimental Medicine, Queen's University Belfast, Belfast BT9 7BL, United Kingdom;
| | - Karsten Rippe
- German Cancer Research Center (DKFZ) Heidelberg, Division of Chromatin Networks, 69120 Heidelberg, Germany;
- Center for Quantitative Analysis of Molecular and Cellular Biosystems (BioQuant), Heidelberg University, 69120 Heidelberg, Germany
| | - Vladimir B Teif
- School of Life Sciences, University of Essex, Wivenhoe Park, Colchester CO4 3SQ, United Kingdom;
| |
Collapse
|
15
|
Zhang F, Luna A, Tan T, Chen Y, Sander C, Guo T. COVIDpro: Database for Mining Protein Dysregulation in Patients with COVID-19. J Proteome Res 2023; 22:2847-2859. [PMID: 37555633 DOI: 10.1021/acs.jproteome.3c00092] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/10/2023]
Abstract
The ongoing pandemic of the coronavirus disease 2019 (COVID-19) caused by the severe acute respiratory syndrome coronavirus 2 still has limited treatment options. Our understanding of the molecular dysregulations that occur in response to infection remains incomplete. We developed a web application COVIDpro (https://www.guomics.com/covidPro/) that includes proteomics data obtained from 41 original studies conducted in 32 hospitals worldwide, involving 3077 patients and covering 19 types of clinical specimens, predominantly plasma and serum. The data set encompasses 53 protein expression matrices, comprising a total of 5434 samples and 14,403 unique proteins. We identified a panel of proteins that exhibit significant dysregulation, enabling the classification of COVID-19 patients into severe and non-severe disease categories. The proteomic signatures achieved promising results in distinguishing severe cases, with a mean area under the curve of 0.87 and accuracy of 0.80 across five independent test sets. COVIDpro serves as a valuable resource for testing hypotheses and exploring potential targets for novel treatments in COVID-19 patients.
Collapse
Affiliation(s)
- Fangfei Zhang
- Fudan University, 220 Handan Road, Shanghai 200433, China
- Westlake Laboratory of Life Sciences and Biomedicine, Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, Hangzhou, Zhejiang Province 310024, China
- Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, Hangzhou, Zhejiang Province 310024, China
- Research Center for Industries of the Future, Westlake University, 600 Dunyu Road, Hangzhou, Zhejiang 310030, China
| | - Augustin Luna
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts 02115, United States
- Broad Institute of MIT and Harvard, Cambridge, Cambridge, Massachusetts 02142, United States
| | - Tingting Tan
- Westlake Laboratory of Life Sciences and Biomedicine, Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, Hangzhou, Zhejiang Province 310024, China
- Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, Hangzhou, Zhejiang Province 310024, China
- Research Center for Industries of the Future, Westlake University, 600 Dunyu Road, Hangzhou, Zhejiang 310030, China
| | - Yingdan Chen
- Westlake Omics (Hangzhou) Biotechnology Company Limited, Hangzhou, Zhejiang Province 310024, China
| | - Chris Sander
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts 02115, United States
- Broad Institute of MIT and Harvard, Cambridge, Cambridge, Massachusetts 02142, United States
| | - Tiannan Guo
- Westlake Laboratory of Life Sciences and Biomedicine, Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, Hangzhou, Zhejiang Province 310024, China
- Institute of Basic Medical Sciences, Westlake Institute for Advanced Study, Hangzhou, Zhejiang Province 310024, China
- Research Center for Industries of the Future, Westlake University, 600 Dunyu Road, Hangzhou, Zhejiang 310030, China
| |
Collapse
|
16
|
Odaka M, Magnin M, Inoue K. Gene network inference from single-cell omics data and domain knowledge for constructing COVID-19-specific ICAM1-associated pathways. Front Genet 2023; 14:1250545. [PMID: 37719701 PMCID: PMC10501835 DOI: 10.3389/fgene.2023.1250545] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 08/16/2023] [Indexed: 09/19/2023] Open
Abstract
Introduction: Intercellular adhesion molecule 1 (ICAM-1) is a critical molecule responsible for interactions between cells. Previous studies have suggested that ICAM-1 triggers cell-to-cell transmission of HIV-1 or HTLV-1, that SARS-CoV-2 shares several features with these viruses via interactions between cells, and that SARS-CoV-2 cell-to-cell transmission is associated with COVID-19 severity. From these previous arguments, it is assumed that ICAM-1 can be related to SARS-CoV-2 cell-to-cell transmission in COVID-19 patients. Indeed, the time-dependent change of the ICAM-1 expression level has been detected in COVID-19 patients. However, signaling pathways that consist of ICAM-1 and other molecules interacting with ICAM-1 are not identified in COVID-19. For example, the current COVID-19 Disease Map has no entry for those pathways. Therefore, discovering unknown ICAM1-associated pathways will be indispensable for clarifying the mechanism of COVID-19. Materials and methods: This study builds ICAM1-associated pathways by gene network inference from single-cell omics data and multiple knowledge bases. First, single-cell omics data analysis extracts coexpressed genes with significant differences in expression levels with spurious correlations removed. Second, knowledge bases validate the models. Finally, mapping the models onto existing pathways identifies new ICAM1-associated pathways. Results: Comparison of the obtained pathways between different cell types and time points reproduces the known pathways and indicates the following two unknown pathways: (1) upstream pathway that includes proteins in the non-canonical NF-κB pathway and (2) downstream pathway that contains integrins and cytoskeleton or motor proteins for cell transformation. Discussion: In this way, data-driven and knowledge-based approaches are integrated into gene network inference for ICAM1-associated pathway construction. The results can contribute to repairing and completing the COVID-19 Disease Map, thereby improving our understanding of the mechanism of COVID-19.
Collapse
Affiliation(s)
- Mitsuhiro Odaka
- The Graduate University for Advanced Studies, SOKENDAI, Tokyo, Japan
- Principles of Informatics Research Division, National Institute of Informatics, Tokyo, Japan
- Laboratoire des Sciences du Numérique de Nantes, École Centrale de Nantes, Nantes Université, UMR 6004, Nantes, France
- Japan Society for the Promotion of Science, Tokyo, Japan
| | - Morgan Magnin
- Principles of Informatics Research Division, National Institute of Informatics, Tokyo, Japan
- Laboratoire des Sciences du Numérique de Nantes, École Centrale de Nantes, Nantes Université, UMR 6004, Nantes, France
| | - Katsumi Inoue
- The Graduate University for Advanced Studies, SOKENDAI, Tokyo, Japan
- Principles of Informatics Research Division, National Institute of Informatics, Tokyo, Japan
- Laboratoire des Sciences du Numérique de Nantes, École Centrale de Nantes, Nantes Université, UMR 6004, Nantes, France
| |
Collapse
|
17
|
Chodkowski M, Zielezinski A, Anbalagan S. A ligand-receptor interactome atlas of the zebrafish. iScience 2023; 26:107309. [PMID: 37539027 PMCID: PMC10393773 DOI: 10.1016/j.isci.2023.107309] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Revised: 05/25/2023] [Accepted: 07/04/2023] [Indexed: 08/05/2023] Open
Abstract
Studies in zebrafish can unravel the functions of cellular communication and thus identify novel bench-to-bedside drugs targeting cellular communication signaling molecules. Due to the incomplete annotation of zebrafish proteome, the knowledge of zebrafish receptors, ligands, and tools to explore their interactome is limited. To address this gap, we de novo predicted the cellular localization of zebrafish reference proteome using deep learning algorithm. We combined the predicted and existing annotations on cellular localization of zebrafish proteins and created repositories of zebrafish ligands, membrane receptome, and interactome as well as associated diseases and targeting drugs. Unlike other tools, our interactome atlas is based on both the physical interaction data of zebrafish proteome and existing human ligand-receptor pair databases. The resources are available as R and Python scripts. DanioTalk provides a novel resource for researchers interested in targeting cellular communication in zebrafish, as we demonstrate in applications studying synapse and axo-glial interactome. DanioTalk methodology can be applied to build and explore the ligand-receptor atlas of other non-mammalian model organisms.
Collapse
Affiliation(s)
- Milosz Chodkowski
- Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University in Poznań, Poznań, Poland
| | - Andrzej Zielezinski
- Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University in Poznań, Poznań, Poland
| | - Savani Anbalagan
- Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University in Poznań, Poznań, Poland
| |
Collapse
|
18
|
Gawron P, Hoksza D, Piñero J, Peña-Chilet M, Esteban-Medina M, Fernandez-Rueda JL, Colonna V, Smula E, Heirendt L, Ancien F, Groues V, Satagopam VP, Schneider R, Dopazo J, Furlong LI, Ostaszewski M. Visualization of automatically combined disease maps and pathway diagrams for rare diseases. FRONTIERS IN BIOINFORMATICS 2023; 3:1101505. [PMID: 37502697 PMCID: PMC10369067 DOI: 10.3389/fbinf.2023.1101505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 05/05/2023] [Indexed: 07/29/2023] Open
Abstract
Introduction: Investigation of molecular mechanisms of human disorders, especially rare diseases, require exploration of various knowledge repositories for building precise hypotheses and complex data interpretation. Recently, increasingly more resources offer diagrammatic representation of such mechanisms, including disease-dedicated schematics in pathway databases and disease maps. However, collection of knowledge across them is challenging, especially for research projects with limited manpower. Methods: In this article we present an automated workflow for construction of maps of molecular mechanisms for rare diseases. The workflow requires a standardized definition of a disease using Orphanet or HPO identifiers to collect relevant genes and variants, and to assemble a functional, visual repository of related mechanisms, including data overlays. The diagrams composing the final map are unified to a common systems biology format from CellDesigner SBML, GPML and SBML+layout+render. The constructed resource contains disease-relevant genes and variants as data overlays for immediate visual exploration, including embedded genetic variant browser and protein structure viewer. Results: We demonstrate the functionality of our workflow on two examples of rare diseases: Kawasaki disease and retinitis pigmentosa. Two maps are constructed based on their corresponding identifiers. Moreover, for the retinitis pigmentosa use-case, we include a list of differentially expressed genes to demonstrate how to tailor the workflow using omics datasets. Discussion: In summary, our work allows for an ad-hoc construction of molecular diagrams combined from different sources, preserving their layout and graphical style, but integrating them into a single resource. This allows to reduce time consuming tasks of prototyping of a molecular disease map, enabling visual exploration, hypothesis building, data visualization and further refinement. The code of the workflow is open and accessible at https://gitlab.lcsb.uni.lu/minerva/automap/.
Collapse
Affiliation(s)
- Piotr Gawron
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| | - David Hoksza
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
- Faculty of Mathematics and Physics, Charles University, Prague, Czechia
| | - Janet Piñero
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Barcelona, Spain
- Department of Experimental and Health Sciences, Pompeu Fabra University (UPF), Barcelona, Spain
- MedBioinformatics Solutions SL, Barcelona, Spain
| | - Maria Peña-Chilet
- Computational Medicine Platform, Fundacion Progreso y Salud, Sevilla, Spain
- Spanish Network of Research in Rare Diseases (CIBERER), Sevilla, Spain
| | | | | | - Vincenza Colonna
- Institute of Genetics and Biophysics, National Research Council of Italy, Naples, Rome
- Department of Genetics, Genomics and Informatics, College of Medicine, University of Tennessee Health Science Center, Memphis, TN, United States
| | - Ewa Smula
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| | - Laurent Heirendt
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| | - François Ancien
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| | - Valentin Groues
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| | - Venkata P. Satagopam
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| | - Reinhard Schneider
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| | - Joaquin Dopazo
- Computational Medicine Platform, Fundacion Progreso y Salud, Sevilla, Spain
- Spanish Network of Research in Rare Diseases (CIBERER), Sevilla, Spain
| | - Laura I. Furlong
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Barcelona, Spain
- Department of Experimental and Health Sciences, Pompeu Fabra University (UPF), Barcelona, Spain
- MedBioinformatics Solutions SL, Barcelona, Spain
| | - Marek Ostaszewski
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| |
Collapse
|
19
|
Lee S, Deng L, Wang Y, Wang K, Sartor MA, Wang XS. IndepthPathway: an integrated tool for in-depth pathway enrichment analysis based on single-cell sequencing data. Bioinformatics 2023; 39:btad325. [PMID: 37243667 PMCID: PMC10275909 DOI: 10.1093/bioinformatics/btad325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Revised: 04/29/2023] [Accepted: 05/26/2023] [Indexed: 05/29/2023] Open
Abstract
MOTIVATION Single-cell sequencing enables exploring the pathways and processes of cells, and cell populations. However, there is a paucity of pathway enrichment methods designed to tolerate the high noise and low gene coverage of this technology. When gene expression data are noisy and signals are sparse, testing pathway enrichment based on the genes expression may not yield statistically significant results, which is particularly problematic when detecting the pathways enriched in less abundant cells that are vulnerable to disturbances. RESULTS In this project, we developed a Weighted Concept Signature Enrichment Analysis specialized for pathway enrichment analysis from single-cell transcriptomics (scRNA-seq). Weighted Concept Signature Enrichment Analysis took a broader approach for assessing the functional relations of pathway gene sets to differentially expressed genes, and leverage the cumulative signature of molecular concepts characteristic of the highly differentially expressed genes, which we termed as the universal concept signature, to tolerate the high noise and low coverage of this technology. We then incorporated Weighted Concept Signature Enrichment Analysis into an R package called "IndepthPathway" for biologists to broadly leverage this method for pathway analysis based on bulk and single-cell sequencing data. Through simulating technical variability and dropouts in gene expression characteristic of scRNA-seq as well as benchmarking on a real dataset of matched single-cell and bulk RNAseq data, we demonstrate that IndepthPathway presents outstanding stability and depth in pathway enrichment results under stochasticity of the data, thus will substantially improve the scientific rigor of the pathway analysis for single-cell sequencing data. AVAILABILITY AND IMPLEMENTATION The IndepthPathway R package is available through: https://github.com/wangxlab/IndepthPathway.
Collapse
Affiliation(s)
- Sanghoon Lee
- UPMC Hillman Cancer Center, University of Pittsburgh, Pittsburgh, PA 15232, United States
- Department of Pathology, University of Pittsburgh, Pittsburgh, PA 15232, United States
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15206, United States
| | - Letian Deng
- UPMC Hillman Cancer Center, University of Pittsburgh, Pittsburgh, PA 15232, United States
- Department of Pathology, University of Pittsburgh, Pittsburgh, PA 15232, United States
| | - Yue Wang
- UPMC Hillman Cancer Center, University of Pittsburgh, Pittsburgh, PA 15232, United States
- Department of Pathology, University of Pittsburgh, Pittsburgh, PA 15232, United States
| | - Kai Wang
- Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, MI 48109, United States
| | - Maureen A Sartor
- Department of Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, MI 48109, United States
| | - Xiao-Song Wang
- UPMC Hillman Cancer Center, University of Pittsburgh, Pittsburgh, PA 15232, United States
- Department of Pathology, University of Pittsburgh, Pittsburgh, PA 15232, United States
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15206, United States
| |
Collapse
|
20
|
Feofanova EV, Brown MR, Alkis T, Manuel AM, Li X, Tahir UA, Li Z, Mendez KM, Kelly RS, Qi Q, Chen H, Larson MG, Lemaitre RN, Morrison AC, Grieser C, Wong KE, Gerszten RE, Zhao Z, Lasky-Su J, Yu B. Whole-Genome Sequencing Analysis of Human Metabolome in Multi-Ethnic Populations. Nat Commun 2023; 14:3111. [PMID: 37253714 PMCID: PMC10229598 DOI: 10.1038/s41467-023-38800-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 05/16/2023] [Indexed: 06/01/2023] Open
Abstract
Circulating metabolite levels may reflect the state of the human organism in health and disease, however, the genetic architecture of metabolites is not fully understood. We have performed a whole-genome sequencing association analysis of both common and rare variants in up to 11,840 multi-ethnic participants from five studies with up to 1666 circulating metabolites. We have discovered 1985 novel variant-metabolite associations, and validated 761 locus-metabolite associations reported previously. Seventy-nine novel variant-metabolite associations have been replicated, including three genetic loci located on the X chromosome that have demonstrated its involvement in metabolic regulation. Gene-based analysis have provided further support for seven metabolite-replicated loci pairs and their biologically plausible genes. Among those novel replicated variant-metabolite pairs, follow-up analyses have revealed that 26 metabolites have colocalized with 21 tissues, seven metabolite-disease outcome associations have been putatively causal, and 7 metabolites might be regulated by plasma protein levels. Our results have depicted the genetic contribution to circulating metabolite levels, providing additional insights into understanding human disease.
Collapse
Affiliation(s)
- Elena V Feofanova
- Human Genetics Center, School of Public Health, The University of Texas Health Science Center, Houston, TX, USA
| | - Michael R Brown
- Human Genetics Center, School of Public Health, The University of Texas Health Science Center, Houston, TX, USA
| | - Taryn Alkis
- Human Genetics Center, School of Public Health, The University of Texas Health Science Center, Houston, TX, USA
| | - Astrid M Manuel
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Xihao Li
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Usman A Tahir
- Division of Cardiovascular Medicine, Beth Israel Deaconess Medical Center, Harvard Medical School, Boston, MA, USA
| | - Zilin Li
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Department of Biostatistics and Health Data Science, Indiana University School of Medicine, Indianapolis, IN, USA
| | - Kevin M Mendez
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
- Retina Service, Massachusetts Eye and Ear, Harvard Medical School, 243 Charles Street, Boston, MA, USA
| | - Rachel S Kelly
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Qibin Qi
- Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Han Chen
- Human Genetics Center, School of Public Health, The University of Texas Health Science Center, Houston, TX, USA
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Martin G Larson
- Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
| | - Rozenn N Lemaitre
- Cardiovascular Health Research Unit, Departments of Medicine, Epidemiology, and Health Systems and Population Health, University of Washington, Seattle, WA, USA
| | - Alanna C Morrison
- Human Genetics Center, School of Public Health, The University of Texas Health Science Center, Houston, TX, USA
| | | | | | - Robert E Gerszten
- Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Zhongming Zhao
- Human Genetics Center, School of Public Health, The University of Texas Health Science Center, Houston, TX, USA
- Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Jessica Lasky-Su
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Bing Yu
- Human Genetics Center, School of Public Health, The University of Texas Health Science Center, Houston, TX, USA.
| |
Collapse
|
21
|
Slenter DN, Hemel IMGM, Evelo CT, Bierau J, Willighagen EL, Steinbusch LKM. Extending inherited metabolic disorder diagnostics with biomarker interaction visualizations. Orphanet J Rare Dis 2023; 18:95. [PMID: 37101200 PMCID: PMC10131334 DOI: 10.1186/s13023-023-02683-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 04/02/2023] [Indexed: 04/28/2023] Open
Abstract
BACKGROUND Inherited Metabolic Disorders (IMDs) are rare diseases where one impaired protein leads to a cascade of changes in the adjacent chemical conversions. IMDs often present with non-specific symptoms, a lack of a clear genotype-phenotype correlation, and de novo mutations, complicating diagnosis. Furthermore, products of one metabolic conversion can be the substrate of another pathway obscuring biomarker identification and causing overlapping biomarkers for different disorders. Visualization of the connections between metabolic biomarkers and the enzymes involved might aid in the diagnostic process. The goal of this study was to provide a proof-of-concept framework for integrating knowledge of metabolic interactions with real-life patient data before scaling up this approach. This framework was tested on two groups of well-studied and related metabolic pathways (the urea cycle and pyrimidine de-novo synthesis). The lessons learned from our approach will help to scale up the framework and support the diagnosis of other less-understood IMDs. METHODS Our framework integrates literature and expert knowledge into machine-readable pathway models, including relevant urine biomarkers and their interactions. The clinical data of 16 previously diagnosed patients with various pyrimidine and urea cycle disorders were visualized on the top 3 relevant pathways. Two expert laboratory scientists evaluated the resulting visualizations to derive a diagnosis. RESULTS The proof-of-concept platform resulted in varying numbers of relevant biomarkers (five to 48), pathways, and pathway interactions for each patient. The two experts reached the same conclusions for all samples with our proposed framework as with the current metabolic diagnostic pipeline. For nine patient samples, the diagnosis was made without knowledge about clinical symptoms or sex. For the remaining seven cases, four interpretations pointed in the direction of a subset of disorders, while three cases were found to be undiagnosable with the available data. Diagnosing these patients would require additional testing besides biochemical analysis. CONCLUSION The presented framework shows how metabolic interaction knowledge can be integrated with clinical data in one visualization, which can be relevant for future analysis of difficult patient cases and untargeted metabolomics data. Several challenges were identified during the development of this framework, which should be resolved before this approach can be scaled up and implemented to support the diagnosis of other (less understood) IMDs. The framework could be extended with other OMICS data (e.g. genomics, transcriptomics), and phenotypic data, as well as linked to other knowledge captured as Linked Open Data.
Collapse
Affiliation(s)
- Denise N Slenter
- Department of Bioinformatics (BiGCaT), NUTRIM, Maastricht University, Maastricht, The Netherlands.
| | - Irene M G M Hemel
- Department of Bioinformatics (BiGCaT), NUTRIM, Maastricht University, Maastricht, The Netherlands
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, The Netherlands
| | - Chris T Evelo
- Department of Bioinformatics (BiGCaT), NUTRIM, Maastricht University, Maastricht, The Netherlands
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, The Netherlands
| | - Jörgen Bierau
- Department of Clinical Genetics, Maastricht University Medical Center, Maastricht, The Netherlands
- Department of Clinical Genetics, Erasmus Medical Center, Rotterdam, The Netherlands
| | - Egon L Willighagen
- Department of Bioinformatics (BiGCaT), NUTRIM, Maastricht University, Maastricht, The Netherlands
| | - Laura K M Steinbusch
- Department of Clinical Genetics, Maastricht University Medical Center, Maastricht, The Netherlands
| |
Collapse
|
22
|
Sanjak J, Zhu Q, Mathé EA. Clustering rare diseases within an ontology-enriched knowledge graph. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.15.528673. [PMID: 36824742 PMCID: PMC9949046 DOI: 10.1101/2023.02.15.528673] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
Abstract
Objective Identifying sets of rare diseases with shared aspects of etiology and pathophysiology may enable drug repurposing and/or platform based therapeutic development. Toward that aim, we utilized an integrative knowledge graph-based approach to constructing clusters of rare diseases. Materials and Methods Data on 3,242 rare diseases were extracted from the National Center for Advancing Translational Science (NCATS) Genetic and Rare Diseases Information center (GARD) internal data resources. The rare disease data was enriched with additional biomedical data, including gene and phenotype ontologies, biological pathway data and small molecule-target activity data, to create a knowledge graph (KG). Node embeddings were used to convert nodes into vectors upon which k-means clustering was applied. We validated the disease clusters through semantic similarity and feature enrichment analysis. Results A node embedding model was trained on the ontology enriched rare disease KG and k-means clustering was applied to the embedding vectors resulting in 37 disease clusters with a mean size of 87 diseases. We validate the disease clusters quantitatively by looking at semantic similarity of clustered diseases, using the Orphanet Rare Disease Ontology. In addition, the clusters were analyzed for enrichment of associated genes, revealing that the enriched genes within clusters were shown to be highly related. Discussion We demonstrate that node embeddings are an effective method for clustering diseases within a heterogenous KG. Semantically similar diseases and relevant enriched genes have been uncovered within the clusters. Connections between disease clusters and approved or investigational drugs are enumerated for follow-up efforts. Conclusion Our study lays out a method for clustering rare diseases using the graph node embeddings. We develop an easy to maintain pipeline that can be updated when new data on rare diseases emerges. The embeddings themselves can be paired with other representation learning methods for other data types, such as drugs, to address other predictive modeling problems. Detailed subnetwork analysis and in-depth review of individual clusters may lead to translatable findings. Future work will focus on incorporation of additional data sources, with a particular focus on common disease data.
Collapse
Affiliation(s)
- Jaleal Sanjak
- Division of Pre-Clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD
| | - Qian Zhu
- Division of Pre-Clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD
| | - Ewy A Mathé
- Division of Pre-Clinical Innovation, National Center for Advancing Translational Sciences (NCATS), National Institutes of Health (NIH), Rockville, MD
| |
Collapse
|
23
|
Jain A, Gyori BM, Hakim S, Bunga S, Taub DG, Ruiz-Cantero MC, Tong-Li C, Andrews N, Sorger PK, Woolf CJ. Nociceptor neuroimmune interactomes reveal cell type- and injury-specific inflammatory pain pathways. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.01.526526. [PMID: 36778477 PMCID: PMC9915698 DOI: 10.1101/2023.02.01.526526] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Inflammatory pain associated with tissue injury and infections, results from the heightened sensitivity of the peripheral terminals of nociceptor sensory neurons in response to exposure to inflammatory mediators. Targeting immune-derived inflammatory ligands, like prostaglandin E2, has been effective in alleviating inflammatory pain. However, the diversity of immune cells and the vast array of ligands they produce make it challenging to systematically map all neuroimmune pathways that contribute to inflammatory pain. Here, we constructed a comprehensive and updatable database of receptor-ligand pairs and complemented it with single-cell transcriptomics of immune cells and sensory neurons in three distinct inflammatory pain conditions, to generate injury-specific neuroimmune interactomes. We identified cell-type-specific neuroimmune axes that are common, as well as unique, to different injury types. This approach successfully predicts neuroimmune pathways with established roles in inflammatory pain as well as ones not previously described. We found that thrombospondin-1 produced by myeloid cells in all three conditions, is a negative regulator of nociceptor sensitization, revealing a non-canonical role of immune ligands as an endogenous reducer of peripheral sensitization. This computational platform lays the groundwork to identify novel mechanisms of immune-mediated peripheral sensitization and the specific disease contexts in which they act.
Collapse
|
24
|
Takahama M, Patil A, Johnson K, Cipurko D, Miki Y, Taketomi Y, Carbonetto P, Plaster M, Richey G, Pandey S, Cheronis K, Ueda T, Gruenbaum A, Dudek SM, Stephens M, Murakami M, Chevrier N. Organism-Wide Analysis of Sepsis Reveals Mechanisms of Systemic Inflammation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.30.526342. [PMID: 36778287 PMCID: PMC9915512 DOI: 10.1101/2023.01.30.526342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
Sepsis is a systemic response to infection with life-threatening consequences. Our understanding of the impact of sepsis across organs of the body is rudimentary. Here, using mouse models of sepsis, we generate a dynamic, organism-wide map of the pathogenesis of the disease, revealing the spatiotemporal patterns of the effects of sepsis across tissues. These data revealed two interorgan mechanisms key in sepsis. First, we discover a simplifying principle in the systemic behavior of the cytokine network during sepsis, whereby a hierarchical cytokine circuit arising from the pairwise effects of TNF plus IL-18, IFN-γ, or IL-1β explains half of all the cellular effects of sepsis on 195 cell types across 9 organs. Second, we find that the secreted phospholipase PLA2G5 mediates hemolysis in blood, contributing to organ failure during sepsis. These results provide fundamental insights to help build a unifying mechanistic framework for the pathophysiological effects of sepsis on the body.
Collapse
|
25
|
Zhang J, Singh R. Investigating the Complexity of Gene Co-expression Estimation for Single-cell Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.24.525447. [PMID: 36747724 PMCID: PMC9900775 DOI: 10.1101/2023.01.24.525447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
With the rapid advance of single-cell RNA sequencing (scRNA-seq) technology, understanding biological processes at a more refined single-cell level is becoming possible. Gene co-expression estimation is an essential step in this direction. It can annotate functionalities of unknown genes or construct the basis of gene regulatory network inference. This study thoroughly tests the existing gene co-expression estimation methods on simulation datasets with known ground truth co-expression networks. We generate these novel datasets using two simulation processes that use the parameters learned from the experimental data. We demonstrate that these simulations better capture the underlying properties of the real-world single-cell datasets than previously tested simulations for the task. Our performance results on tens of simulated and eight experimental datasets show that all methods produce estimations with a high false discovery rate potentially caused by high-sparsity levels in the data. Finally, we find that commonly used pre-processing approaches, such as normalization and imputation, do not improve the co-expression estimation. Overall, our benchmark setup contributes to the co-expression estimator development, and our study provides valuable insights for the community of single-cell data analyses.
Collapse
Affiliation(s)
- Jiaqi Zhang
- Department of Computer Science, Brown University
| | - Ritambhara Singh
- Department of Computer Science, Center for Computational Molecular Biology, Brown University
| |
Collapse
|
26
|
Li P, Bai C, Zhan L, Zhang H, Zhang Y, Zhang W, Wang Y, Zhao J. Specific gene module pair-based target identification and drug discovery. Front Pharmacol 2023; 13:1089217. [PMID: 36726786 PMCID: PMC9886283 DOI: 10.3389/fphar.2022.1089217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Accepted: 12/28/2022] [Indexed: 01/18/2023] Open
Abstract
Identification of the biological targets of a compound is of paramount importance for the exploration of the mechanism of action of drugs and for the development of novel drugs. A concept of the Connectivity Map (CMap) was previously proposed to connect genes, drugs, and disease states based on the common gene-expression signatures. For a new query compound, the CMap-based method can infer its potential targets by searching similar drugs with known targets (reference drugs) and measuring the similarities into their specific transcriptional responses between the query compound and those reference drugs. However, the available methods are often inefficient due to the requirement of the reference drugs as a medium to link the query agent and targets. Here, we developed a general procedure to extract target-induced consensus gene modules from the transcriptional profiles induced by the treatment of perturbagens of a target. A specific transcriptional gene module pair (GMP) was automatically identified for each target and could be used as a direct target signature. Based on the GMPs, we built the target network and identified some target gene clusters with similar biological mechanisms. Moreover, a gene module pair-based target identification (GMPTI) approach was proposed to predict novel compound-target interactions. Using this method, we have discovered novel inhibitors for three PI3K pathway proteins PI3Kα/β/δ, including PU-H71, alvespimycin, reversine, astemizole, raloxifene HCl, and tamoxifen.
Collapse
Affiliation(s)
- Peng Li
- Shanxi key lab for modernization of TCVM, College of Basic Sciences, Shanxi Agricultural University, Jinzhong, Shanxi, China,*Correspondence: Peng Li,
| | - Chujie Bai
- Department of Orthopedic Oncology, Peking University Cancer Hospital & Institute, Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education), Beijing, China
| | - Lingmin Zhan
- Shanxi key lab for modernization of TCVM, College of Basic Sciences, Shanxi Agricultural University, Jinzhong, Shanxi, China
| | - Haoran Zhang
- Shanxi key lab for modernization of TCVM, College of Basic Sciences, Shanxi Agricultural University, Jinzhong, Shanxi, China
| | - Yuanyuan Zhang
- Shanxi key lab for modernization of TCVM, College of Basic Sciences, Shanxi Agricultural University, Jinzhong, Shanxi, China
| | - Wuxia Zhang
- Shanxi key lab for modernization of TCVM, College of Basic Sciences, Shanxi Agricultural University, Jinzhong, Shanxi, China
| | - Yingdong Wang
- Shanxi key lab for modernization of TCVM, College of Basic Sciences, Shanxi Agricultural University, Jinzhong, Shanxi, China
| | - Jinzhong Zhao
- Shanxi key lab for modernization of TCVM, College of Basic Sciences, Shanxi Agricultural University, Jinzhong, Shanxi, China
| |
Collapse
|
27
|
Magnano CS, Gitter A. Graph algorithms for predicting subcellular localization at the pathway level. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2023; 28:145-156. [PMID: 36540972 PMCID: PMC9817068 DOI: 10.1142/9789811270611_0014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Protein subcellular localization is an important factor in normal cellular processes and disease. While many protein localization resources treat it as static, protein localization is dynamic and heavily influenced by biological context. Biological pathways are graphs that represent a specific biological context and can be inferred from large-scale data. We develop graph algorithms to predict the localization of all interactions in a biological pathway as an edge-labeling task. We compare a variety of models including graph neural networks, probabilistic graphical models, and discriminative classifiers for predicting localization annotations from curated pathway databases. We also perform a case study where we construct biological pathways and predict localizations of human fibroblasts undergoing viral infection. Pathway localization prediction is a promising approach for integrating publicly available localization data into the analysis of large-scale biological data.
Collapse
Affiliation(s)
- Chris S Magnano
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI, USA
- Morgridge Institute for Research, Madison, WI, USA
- Center for Computational Biomedicine, Harvard Medical School, Boston, MA, USA
| | - Anthony Gitter
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI, USA
- Morgridge Institute for Research, Madison, WI, USA
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA
| |
Collapse
|
28
|
Sun H, Ren Y, Zhou X, Chen Q, Liu Y, Zhu C, Ruan Y, Ruan H, Tong H, Ying S, Lin P. DUSP1 Signaling Pathway Regulates Cytarabine Sensitivity in Acute Myeloid Leukemia. Technol Cancer Res Treat 2023; 22:15330338231207765. [PMID: 37872685 PMCID: PMC10594969 DOI: 10.1177/15330338231207765] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Revised: 07/30/2023] [Accepted: 08/23/2023] [Indexed: 10/25/2023] Open
Abstract
Objectives: Dual specificity phosphatase 1 (DUSP1) is high-expressed in various cancers and plays an important role in the cellular response to agents that damage DNA. We aimed to investigate the expressions and mechanisms of DUSP1 signaling pathway regulating cytarabine (Ara-C) resistance in acute myeloid leukemia (AML). Methods: Immunohistochemistry was performed on bone marrow biopsy specimens from AML and controls to explore the expression of DUSP1. Western blot and Q-PCR were used to detect the protein and mRNA expression levels. MTT assay was used to detect the proliferation of cells. Cell apoptosis was detected by flow cytometry. The immune protein-protein interaction (PPI) network of DUSP1 was analyzed in the platform of Pathway Commons, and immune infiltration analysis was used to study the immune microenvironment of AML. Results: We found that the expression levels of DUSP1 in AML patients exceeded that in controls. Survival analysis in public datasets showed that AML patients with higher levels of DUSP1 had poor clinical outcomes. Further public data analysis indicated that DUSP1 was overexpressed in NRAS mutated AML. DUSP1 knockdown by siRNA could sensitize AML cells to Ara-C treatments. The phosphorylation level of mitogen-activated protein kinase (MAPK) pathway was significantly elevated in DUSP1 down-regulated NRAS G13D mutated AML cells. The PPI analysis showed DUSP1 correlated with immune gene CREB1 and CXCL8 in NRAS mutated AML. We also revealed a correlation between tumor-infiltrating immune cells in RAS mutated AML microenvironment. Conclusion: Our findings suggest that DUSP1 signaling pathways may regulate Ara-C sensitivity in AML.
Collapse
Affiliation(s)
- Huali Sun
- Department of Radiotherapy, Taizhou Central Hospital (Taizhou University Hospital), Taizhou, Zhejiang, China
| | - Yanling Ren
- Myelodysplastic Syndrome Center, Department of Hematology, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, China
| | - Xinping Zhou
- Myelodysplastic Syndrome Center, Department of Hematology, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, China
| | - Qi Chen
- Precision Medicine Center, Taizhou Central Hospital (Taizhou University Hospital), Taizhou, Zhejiang, China
| | - Yanmei Liu
- Department of Radiotherapy, Taizhou Central Hospital (Taizhou University Hospital), Taizhou, Zhejiang, China
| | - Chumeng Zhu
- Precision Medicine Center, Taizhou Central Hospital (Taizhou University Hospital), Taizhou, Zhejiang, China
| | - Yanyun Ruan
- Precision Medicine Center, Taizhou Central Hospital (Taizhou University Hospital), Taizhou, Zhejiang, China
| | - Hongli Ruan
- Department of Emergency Medicine, Taizhou Central Hospital (Taizhou University Hospital), Taizhou, Zhejiang, China
| | - Hongyan Tong
- Myelodysplastic Syndrome Center, Department of Hematology, The First Affiliated Hospital, College of Medicine, Zhejiang University, Hangzhou, China
| | - Shenpeng Ying
- Department of Radiotherapy, Taizhou Central Hospital (Taizhou University Hospital), Taizhou, Zhejiang, China
| | - Peipei Lin
- Department of Radiotherapy, Taizhou Central Hospital (Taizhou University Hospital), Taizhou, Zhejiang, China
| |
Collapse
|
29
|
Senra D, Guisoni N, Diambra L. Cell annotation using scRNA-seq data: A protein-protein interaction network approach. MethodsX 2023; 10:102179. [PMID: 37128282 PMCID: PMC10148184 DOI: 10.1016/j.mex.2023.102179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Accepted: 04/09/2023] [Indexed: 05/03/2023] Open
Abstract
Pathway analysis is an important step in the interpretation of single cell transcriptomic data, as it provides powerful information to detect which cellular processes are active in each individual cell. We have recently developed a protein-protein interaction network-based framework to quantify pluripotency associated pathways from scRNA-seq data. On this occasion, we extend this approach to quantify the activity of a pathway associated with any biological process, or even any list of genes. A systems-level characterization of pathway activities across multiple cell types provides a broadly applicable tool for the analysis of pathways in both healthy and disease conditions. Dysregulated cellular functions are a hallmark of a wide spectrum of human disorders, including cancer and autoimmune diseases. Here, we illustrate our method by analyzing various biological processes in healthy and cancer breast samples. Using this approach we found that tumor breast cells, even when they form a single group in the UMAP space, keep diverse biological programs active in a differentiated manner within the cluster.•We implement a protein-protein interaction network-based approach to quantify the activity of different biological processes.•The methodology can be used for cell annotation in scRNA-seq studies and is freely available as R package.
Collapse
|
30
|
Wendel B, Heidenreich M, Budde M, Heilbronner M, Oraki Kohshour M, Papiol S, Falkai P, Schulze TG, Heilbronner U, Bickeböller H. Kalpra: A kernel approach for longitudinal pathway regression analysis integrating network information with an application to the longitudinal PsyCourse Study. Front Genet 2022; 13:1015885. [PMID: 36561312 PMCID: PMC9767414 DOI: 10.3389/fgene.2022.1015885] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Accepted: 11/24/2022] [Indexed: 12/12/2022] Open
Abstract
A popular approach to reduce the high dimensionality resulting from genome-wide association studies is to analyze a whole pathway in a single test for association with a phenotype. Kernel machine regression (KMR) is a highly flexible pathway analysis approach. Initially, KMR was developed to analyze a simple phenotype with just one measurement per individual. Recently, however, the investigation into the influence of genomic factors in the development of disease-related phenotypes across time (trajectories) has gained in importance. Thus, novel statistical approaches for KMR analyzing longitudinal data, i.e. several measurements at specific time points per individual are required. For longitudinal pathway analysis, we extend KMR to long-KMR using the estimation equivalence of KMR and linear mixed models. We include additional random effects to correct for the dependence structure. Moreover, within long-KMR we created a topology-based pathway analysis by combining this approach with a kernel including network information of the pathway. Most importantly, long-KMR not only allows for the investigation of the main genetic effect adjusting for time dependencies within an individual, but it also allows to test for the association of the pathway with the longitudinal course of the phenotype in the form of testing the genetic time-interaction effect. The approach is implemented as an R package, kalpra. Our simulation study demonstrates that the power of long-KMR exceeded that of another KMR method previously developed to analyze longitudinal data, while maintaining (slightly conservatively) the type I error. The network kernel improved the performance of long-KMR compared to the linear kernel. Considering different pathway densities, the power of the network kernel decreased with increasing pathway density. We applied long-KMR to cognitive data on executive function (Trail Making Test, part B) from the PsyCourse Study and 17 candidate pathways selected from Reactome. We identified seven nominally significant pathways.
Collapse
Affiliation(s)
- Bernadette Wendel
- Department of Genetic Epidemiology, University Medical Center Göttingen, Georg-August-University Göttingen, Göttingen, Germany,*Correspondence: Bernadette Wendel,
| | - Markus Heidenreich
- Department of Genetic Epidemiology, University Medical Center Göttingen, Georg-August-University Göttingen, Göttingen, Germany
| | - Monika Budde
- Institute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, Germany
| | - Maria Heilbronner
- Institute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, Germany
| | - Mojtaba Oraki Kohshour
- Institute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, Germany
| | - Sergi Papiol
- Institute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, Germany,Department of Psychiatry and Psychotherapy, University Hospital, LMU Munich, Munich, Germany
| | - Peter Falkai
- Department of Psychiatry and Psychotherapy, University Hospital, LMU Munich, Munich, Germany
| | - Thomas G. Schulze
- Institute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, Germany,Department of Psychiatry and Behavioral Sciences, SUNY Upstate Medical University, Syracuse, NY, United States,Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD, United States
| | - Urs Heilbronner
- Institute of Psychiatric Phenomics and Genomics (IPPG), University Hospital, LMU Munich, Munich, Germany
| | - Heike Bickeböller
- Department of Genetic Epidemiology, University Medical Center Göttingen, Georg-August-University Göttingen, Göttingen, Germany
| |
Collapse
|
31
|
Wu J, Liu H, Zhao X, Hong H, Werner J. Editorial: Cell signaling status alteration in development and disease. Front Cell Dev Biol 2022; 10:1068887. [PMID: 36531965 PMCID: PMC9752079 DOI: 10.3389/fcell.2022.1068887] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Accepted: 10/24/2022] [Indexed: 07/29/2023] Open
Affiliation(s)
- Jun Wu
- School of Life Sciences, East China Normal University, Shanghai, China
| | - Haipeng Liu
- Shanghai Pulmonary Hospital, School of Medicine, Tongji University, Shanghai, China
| | - Xiaodong Zhao
- Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai, China
| | - Huixiao Hong
- National Center for Toxicological Research, US Food and Drug Administration, Jefferson, AR, United States
| | - Johannes Werner
- Center for Data Processing, University of Tübingen, Tübingen, Germany
| |
Collapse
|
32
|
Langthasa J, Mishra S, U M, Kalal R, Bhat R. Mutations in a set of ancient matrisomal glycoprotein genes across neoplasia predispose to disruption of morphogenetic transduction. COMPUTATIONAL AND SYSTEMS ONCOLOGY 2022. [DOI: 10.1002/cso2.1042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Affiliation(s)
- Jimpi Langthasa
- Department of Molecular Reproduction Development and Genetics Indian Institute of Science Bengaluru India
| | - Satyarthi Mishra
- Centre for Nano Science and Engineering Indian Institute of Science Bengaluru India
| | - Monica U
- Department of Molecular Reproduction Development and Genetics Indian Institute of Science Bengaluru India
| | - Ronak Kalal
- Department of Zoology University College of Science, Mohanlal Sukhadia University Udaipur India
| | - Ramray Bhat
- Department of Molecular Reproduction Development and Genetics Indian Institute of Science Bengaluru India
- Centre for BioSystems Science and Engineering Indian Institute of Science Bengaluru India
| |
Collapse
|
33
|
Kashkin KN. Looking for Tumor Specific Promoters In Silico. RUSSIAN JOURNAL OF BIOORGANIC CHEMISTRY 2022. [DOI: 10.1134/s1068162022060127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Abstract—
Previously we demonstrated the tumor-specific activity of several human native and chimeric promoters. Here we have analyzed the DNA sequences of experimentally tested tumor-specific promoters for the presence of recognition matrices of transcription factors and for de novo motif discovery. CiiiDER and MEME Suite software tools were used for this purpose. A number of transcription factor matrices have been identified, which are present more often in tumor-specific promoters than in the promoters of housekeeping genes. New promoter–TF regulatory relationships have been predicted by pathway analysis. A motif of 44 bp characteristic of tumor-specific promoters but not of housekeeping gene promoters has been discovered. The search through 29 598 human promoters from the EPDnew promoter database has revealed a series of promoters with this motif, their genes being associated with unfavorable prognoses in cancer. We suppose that some of these promoters may possess a tumor specific activity. In addition, a close similarity in nucleotide motifs between the promoters of the BIRC5 and MCM2 genes has been shown. The results of the study may contribute to understanding the peculiarities of gene transcription in tumors, as well as to searching for native tumor-specific promoters or creating artificial ones for cancer gene therapy, as well as in the development of anticancer vaccines.
Collapse
|
34
|
Martins dos Santos V, Anton M, Szomolay B, Ostaszewski M, Arts I, Benfeitas R, Dominguez Del Angel V, Domínguez-Romero E, Ferk P, Fey D, Goble C, Golebiewski M, Gruden K, Heil KF, Hermjakob H, Kahlem P, Klapa MI, Koehorst J, Kolodkin A, Kutmon M, Leskošek B, Moretti S, Müller W, Pagni M, Rezen T, Rocha M, Rozman D, Šafránek D, T. Scott W, Sheriff RSM, Suarez Diez M, Van Steen K, Westerhoff HV, Wittig U, Wolstencroft K, Zupanic A, Evelo CT, Hancock JM. Systems Biology in ELIXIR: modelling in the spotlight. F1000Res 2022; 11:ELIXIR-1265. [PMID: 36742342 PMCID: PMC9871403 DOI: 10.12688/f1000research.126734.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 10/24/2022] [Indexed: 11/09/2022] Open
Abstract
In this white paper, we describe the founding of a new ELIXIR Community - the Systems Biology Community - and its proposed future contributions to both ELIXIR and the broader community of systems biologists in Europe and worldwide. The Community believes that the infrastructure aspects of systems biology - databases, (modelling) tools and standards development, as well as training and access to cloud infrastructure - are not only appropriate components of the ELIXIR infrastructure, but will prove key components of ELIXIR's future support of advanced biological applications and personalised medicine. By way of a series of meetings, the Community identified seven key areas for its future activities, reflecting both future needs and previous and current activities within ELIXIR Platforms and Communities. These are: overcoming barriers to the wider uptake of systems biology; linking new and existing data to systems biology models; interoperability of systems biology resources; further development and embedding of systems medicine; provisioning of modelling as a service; building and coordinating capacity building and training resources; and supporting industrial embedding of systems biology. A set of objectives for the Community has been identified under four main headline areas: Standardisation and Interoperability, Technology, Capacity Building and Training, and Industrial Embedding. These are grouped into short-term (3-year), mid-term (6-year) and long-term (10-year) objectives.
Collapse
Affiliation(s)
- Vitor Martins dos Santos
- Laboratory of Bioprocess Engineering, Wageningen University & Research, Wageningen, 6708 PB, The Netherlands
| | - Mihail Anton
- Department of Biology and Biological Engineering, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Chalmers University of Technology, Gothenburg, SE-41258, Sweden
| | - Barbara Szomolay
- Division of Infection and Immunity, School of Medicine, Cardiff University, Cardiff, UK
| | - Marek Ostaszewski
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Belvaux, L-4367, Luxembourg
| | - Ilja Arts
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, 6200 MD, The Netherlands
| | - Rui Benfeitas
- National Bioinformatics Infrastructure Sweden (NBIS), Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | | | | | - Polonca Ferk
- Faculty of Medicine, Institute for Biostatistics and Medical Informatics, Centre ELIXIR-SI, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | - Dirk Fey
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin, 4, Ireland
| | - Carole Goble
- Department of Computer Science, The University of Manchester, Manchester, M13 9PL, UK
| | - Martin Golebiewski
- Heidelberg Institute for Theoretical Studies - HITS, Heidelberg, 69118, Germany
| | - Kristina Gruden
- Department of Biotechnology and Systems Biology, National Institute of Biology, Ljubljana, SI-1000, Slovenia
| | | | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK
| | - Pascal Kahlem
- Scientific Network Management SL, Barcelona, 08015, Spain
| | - Maria I. Klapa
- Metabolic Engineering & Systems Biology Laboratory, Institute of Chemical Engineering Sciences, Foundation for Research & Technology - Hellas (FORTH/ICE-HT), Patras, 26504, Greece
| | - Jasper Koehorst
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, 6708WE, The Netherlands
| | - Alexey Kolodkin
- Competence Center for Methodology and Statistics; Transversal Translational Medicine, Translational Medicine Operations Hub, Luxembourg Institute of Health, Strassen, L-1445, Luxembourg
- ISBE.NL, VU University of Amsterdam, Amsterdam, The Netherlands
| | - Martina Kutmon
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, 6200 MD, The Netherlands
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6200 MD, The Netherlands
| | - Brane Leskošek
- Faculty of Medicine, Institute for Biostatistics and Medical Informatics, Centre ELIXIR-SI, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | | | - Wolfgang Müller
- Heidelberg Institute for Theoretical Studies - HITS, Heidelberg, 69118, Germany
| | - Marco Pagni
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Tadeja Rezen
- Faculty of Medicine, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | - Miguel Rocha
- Centre of Biological Engineering, University of Minho, Braga, Portugal
| | - Damjana Rozman
- Faculty of Medicine, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | - David Šafránek
- Faculty of Informatics, Masaryk University, Brno, 602 00, Czech Republic
| | - William T. Scott
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, 6708WE, The Netherlands
- UNLOCK, Wageningen University & Research, 6708 PB Wageningen, The Netherlands
| | - Rahuman S. Malik Sheriff
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK
| | - Maria Suarez Diez
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, 6708WE, The Netherlands
| | - Kristel Van Steen
- BIO3 - Laboratory for Systems Medicine, Department of Human Genetics, KU Leuven, Leuven, 3000, Belgium
- BIO3 - Systems Genetics, GIGA-R Medical Genomics, University of Liege, Liege, 4000, Belgium
| | | | - Ulrike Wittig
- Heidelberg Institute for Theoretical Studies - HITS, Heidelberg, 69118, Germany
| | - Katherine Wolstencroft
- Leiden Institute of Advanced Computer Science, Leiden University, Leiden, 2333 CA, The Netherlands
| | - Anze Zupanic
- Department of Biotechnology and Systems Biology, National Institute of Biology, Ljubljana, SI-1000, Slovenia
| | - Chris T. Evelo
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6200 MD, The Netherlands
| | - John M. Hancock
- Faculty of Medicine, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| |
Collapse
|
35
|
Martins dos Santos V, Anton M, Szomolay B, Ostaszewski M, Arts I, Benfeitas R, Dominguez Del Angel V, Domínguez-Romero E, Ferk P, Fey D, Goble C, Golebiewski M, Gruden K, Heil KF, Hermjakob H, Kahlem P, Klapa MI, Koehorst J, Kolodkin A, Kutmon M, Leskošek B, Moretti S, Müller W, Pagni M, Rezen T, Rocha M, Rozman D, Šafránek D, T. Scott W, Sheriff RSM, Suarez Diez M, Van Steen K, Westerhoff HV, Wittig U, Wolstencroft K, Zupanic A, Evelo CT, Hancock JM. Systems Biology in ELIXIR: modelling in the spotlight. F1000Res 2022; 11:ELIXIR-1265. [PMID: 36742342 PMCID: PMC9871403 DOI: 10.12688/f1000research.126734.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 05/20/2024] [Indexed: 06/05/2024] Open
Abstract
In this white paper, we describe the founding of a new ELIXIR Community - the Systems Biology Community - and its proposed future contributions to both ELIXIR and the broader community of systems biologists in Europe and worldwide. The Community believes that the infrastructure aspects of systems biology - databases, (modelling) tools and standards development, as well as training and access to cloud infrastructure - are not only appropriate components of the ELIXIR infrastructure, but will prove key components of ELIXIR's future support of advanced biological applications and personalised medicine. By way of a series of meetings, the Community identified seven key areas for its future activities, reflecting both future needs and previous and current activities within ELIXIR Platforms and Communities. These are: overcoming barriers to the wider uptake of systems biology; linking new and existing data to systems biology models; interoperability of systems biology resources; further development and embedding of systems medicine; provisioning of modelling as a service; building and coordinating capacity building and training resources; and supporting industrial embedding of systems biology. A set of objectives for the Community has been identified under four main headline areas: Standardisation and Interoperability, Technology, Capacity Building and Training, and Industrial Embedding. These are grouped into short-term (3-year), mid-term (6-year) and long-term (10-year) objectives.
Collapse
Affiliation(s)
- Vitor Martins dos Santos
- Laboratory of Bioprocess Engineering, Wageningen University & Research, Wageningen, 6708 PB, The Netherlands
| | - Mihail Anton
- Department of Biology and Biological Engineering, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Chalmers University of Technology, Gothenburg, SE-41258, Sweden
| | - Barbara Szomolay
- Division of Infection and Immunity, School of Medicine, Cardiff University, Cardiff, UK
| | - Marek Ostaszewski
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Belvaux, L-4367, Luxembourg
| | - Ilja Arts
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, 6200 MD, The Netherlands
| | - Rui Benfeitas
- National Bioinformatics Infrastructure Sweden (NBIS), Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | | | | | - Polonca Ferk
- Faculty of Medicine, Institute for Biostatistics and Medical Informatics, Centre ELIXIR-SI, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | - Dirk Fey
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin, 4, Ireland
| | - Carole Goble
- Department of Computer Science, The University of Manchester, Manchester, M13 9PL, UK
| | - Martin Golebiewski
- Heidelberg Institute for Theoretical Studies - HITS, Heidelberg, 69118, Germany
| | - Kristina Gruden
- Department of Biotechnology and Systems Biology, National Institute of Biology, Ljubljana, SI-1000, Slovenia
| | | | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK
| | - Pascal Kahlem
- Scientific Network Management SL, Barcelona, 08015, Spain
| | - Maria I. Klapa
- Metabolic Engineering & Systems Biology Laboratory, Institute of Chemical Engineering Sciences, Foundation for Research & Technology - Hellas (FORTH/ICE-HT), Patras, 26504, Greece
| | - Jasper Koehorst
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, 6708WE, The Netherlands
| | - Alexey Kolodkin
- Competence Center for Methodology and Statistics; Transversal Translational Medicine, Translational Medicine Operations Hub, Luxembourg Institute of Health, Strassen, L-1445, Luxembourg
- ISBE.NL, VU University of Amsterdam, Amsterdam, The Netherlands
| | - Martina Kutmon
- Maastricht Centre for Systems Biology (MaCSBio), Maastricht University, Maastricht, 6200 MD, The Netherlands
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6200 MD, The Netherlands
| | - Brane Leskošek
- Faculty of Medicine, Institute for Biostatistics and Medical Informatics, Centre ELIXIR-SI, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | | | - Wolfgang Müller
- Heidelberg Institute for Theoretical Studies - HITS, Heidelberg, 69118, Germany
| | - Marco Pagni
- SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Tadeja Rezen
- Faculty of Medicine, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | - Miguel Rocha
- Centre of Biological Engineering, University of Minho, Braga, Portugal
| | - Damjana Rozman
- Faculty of Medicine, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| | - David Šafránek
- Faculty of Informatics, Masaryk University, Brno, 602 00, Czech Republic
| | - William T. Scott
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, 6708WE, The Netherlands
- UNLOCK, Wageningen University & Research, 6708 PB Wageningen, The Netherlands
| | - Rahuman S. Malik Sheriff
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, UK
| | - Maria Suarez Diez
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, 6708WE, The Netherlands
| | - Kristel Van Steen
- BIO3 - Laboratory for Systems Medicine, Department of Human Genetics, KU Leuven, Leuven, 3000, Belgium
- BIO3 - Systems Genetics, GIGA-R Medical Genomics, University of Liege, Liege, 4000, Belgium
| | | | - Ulrike Wittig
- Heidelberg Institute for Theoretical Studies - HITS, Heidelberg, 69118, Germany
| | - Katherine Wolstencroft
- Leiden Institute of Advanced Computer Science, Leiden University, Leiden, 2333 CA, The Netherlands
| | - Anze Zupanic
- Department of Biotechnology and Systems Biology, National Institute of Biology, Ljubljana, SI-1000, Slovenia
| | - Chris T. Evelo
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, 6200 MD, The Netherlands
| | - John M. Hancock
- Faculty of Medicine, University of Ljubljana, Ljubljana, SI-1000, Slovenia
| |
Collapse
|
36
|
Banimfreg BH, Shamayleh A, Alshraideh H. Survey for Computer-Aided Tools and Databases in Metabolomics. Metabolites 2022; 12:metabo12101002. [PMID: 36295904 PMCID: PMC9610953 DOI: 10.3390/metabo12101002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2022] [Revised: 10/08/2022] [Accepted: 10/12/2022] [Indexed: 11/14/2022] Open
Abstract
Metabolomics has advanced from innovation and functional genomics tools and is currently a basis in the big data-led precision medicine era. Metabolomics is promising in the pharmaceutical field and clinical research. However, due to the complexity and high throughput data generated from such experiments, data mining and analysis are significant challenges for researchers in the field. Therefore, several efforts were made to develop a complete workflow that helps researchers analyze data. This paper introduces a review of the state-of-the-art computer-aided tools and databases in metabolomics established in recent years. The paper provides computational tools and resources based on functionality and accessibility and provides hyperlinks to web pages to download or use. This review aims to present the latest computer-aided tools, databases, and resources to the metabolomics community in one place.
Collapse
|
37
|
Is IIIG9 a New Protein with Exclusive Ciliary Function? Analysis of Its Potential Role in Cancer and Other Pathologies. Cells 2022; 11:cells11203327. [PMID: 36291193 PMCID: PMC9600092 DOI: 10.3390/cells11203327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Revised: 09/23/2022] [Accepted: 10/03/2022] [Indexed: 11/16/2022] Open
Abstract
The identification of new proteins that regulate the function of one of the main cellular phosphatases, protein phosphatase 1 (PP1), is essential to find possible pharmacological targets to alter phosphatase function in various cellular processes, including the initiation and development of multiple diseases. IIIG9 is a regulatory subunit of PP1 initially identified in highly polarized ciliated cells. In addition to its ciliary location in ependymal cells, we recently showed that IIIG9 has extraciliary functions that regulate the integrity of adherens junctions. In this review, we perform a detailed analysis of the expression, localization, and function of IIIG9 in adult and developing normal brains. In addition, we provide a 3D model of IIIG9 protein structure for the first time, verifying that the classic structural and conformational characteristics of the PP1 regulatory subunits are maintained. Our review is especially focused on finding evidence linking IIIG9 dysfunction with the course of some pathologies, such as ciliopathies, drug dependence, diseases based on neurological development, and the development of specific high-malignancy and -frequency brain tumors in the pediatric population. Finally, we propose that IIIG9 is a relevant regulator of PP1 function in physiological and pathological processes in the CNS.
Collapse
|
38
|
Jia P, Hu R, Yan F, Dai Y, Zhao Z. scGWAS: landscape of trait-cell type associations by integrating single-cell transcriptomics-wide and genome-wide association studies. Genome Biol 2022; 23:220. [PMID: 36253801 PMCID: PMC9575201 DOI: 10.1186/s13059-022-02785-w] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Accepted: 10/05/2022] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND The rapid accumulation of single-cell RNA sequencing (scRNA-seq) data presents unique opportunities to decode the genetically mediated cell-type specificity in complex diseases. Here, we develop a new method, scGWAS, which effectively leverages scRNA-seq data to achieve two goals: (1) to infer the cell types in which the disease-associated genes manifest and (2) to construct cellular modules which imply disease-specific activation of different processes. RESULTS scGWAS only utilizes the average gene expression for each cell type followed by virtual search processes to construct the null distributions of module scores, making it scalable to large scRNA-seq datasets. We demonstrated scGWAS in 40 genome-wide association studies (GWAS) datasets (average sample size N ≈ 154,000) using 18 scRNA-seq datasets from nine major human/mouse tissues (totaling 1.08 million cells) and identified 2533 trait and cell-type associations, each with significant modules for further investigation. The module genes were validated using disease or clinically annotated references from ClinVar, OMIM, and pLI variants. CONCLUSIONS We showed that the trait-cell type associations identified by scGWAS, while generally constrained to trait-tissue associations, could recapitulate many well-studied relationships and also reveal novel relationships, providing insights into the unsolved trait-tissue associations. Moreover, in each specific cell type, the associations with different traits were often mediated by different sets of risk genes, implying disease-specific activation of driving processes. In summary, scGWAS is a powerful tool for exploring the genetic basis of complex diseases at the cell type level using single-cell expression data.
Collapse
Affiliation(s)
- Peilin Jia
- grid.267308.80000 0000 9206 2401Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030 USA
| | - Ruifeng Hu
- grid.267308.80000 0000 9206 2401Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030 USA
| | - Fangfang Yan
- grid.267308.80000 0000 9206 2401Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030 USA
| | - Yulin Dai
- grid.267308.80000 0000 9206 2401Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030 USA
| | - Zhongming Zhao
- grid.267308.80000 0000 9206 2401Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030 USA ,grid.267308.80000 0000 9206 2401Human Genetics Center, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX 77030 USA ,grid.240145.60000 0001 2291 4776MD Anderson Cancer Center UTHealth Graduate School of Biomedical Sciences, Houston, TX 77030 USA
| |
Collapse
|
39
|
Liu H, Yuan M, Mitra R, Zhou X, Long M, Lei W, Zhou S, Huang YE, Hou F, Eischen CM, Jiang W. CTpathway: a CrossTalk-based pathway enrichment analysis method for cancer research. Genome Med 2022; 14:118. [PMID: 36229842 PMCID: PMC9563764 DOI: 10.1186/s13073-022-01119-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Accepted: 09/26/2022] [Indexed: 11/22/2022] Open
Abstract
BACKGROUND Pathway enrichment analysis (PEA) is a common method for exploring functions of hundreds of genes and identifying disease-risk pathways. Moreover, different pathways exert their functions through crosstalk. However, existing PEA methods do not sufficiently integrate essential pathway features, including pathway crosstalk, molecular interactions, and network topologies, resulting in many risk pathways that remain uninvestigated. METHODS To overcome these limitations, we develop a new crosstalk-based PEA method, CTpathway, based on a global pathway crosstalk map (GPCM) with >440,000 edges by combing pathways from eight resources, transcription factor-gene regulations, and large-scale protein-protein interactions. Integrating gene differential expression and crosstalk effects in GPCM, we assign a risk score to genes in the GPCM and identify risk pathways enriched with the risk genes. RESULTS Analysis of >8300 expression profiles covering ten cancer tissues and blood samples indicates that CTpathway outperforms the current state-of-the-art methods in identifying risk pathways with higher accuracy, reproducibility, and speed. CTpathway recapitulates known risk pathways and exclusively identifies several previously unreported critical pathways for individual cancer types. CTpathway also outperforms other methods in identifying risk pathways across all cancer stages, including early-stage cancer with a small number of differentially expressed genes. Moreover, the robust design of CTpathway enables researchers to analyze both bulk and single-cell RNA-seq profiles to predict both cancer tissue and cell type-specific risk pathways with higher accuracy. CONCLUSIONS Collectively, CTpathway is a fast, accurate, and stable pathway enrichment analysis method for cancer research that can be used to identify cancer risk pathways. The CTpathway interactive web server can be accessed here http://www.jianglab.cn/CTpathway/ . The stand-alone program can be accessed here https://github.com/Bioccjw/CTpathway .
Collapse
Affiliation(s)
- Haizhou Liu
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
| | - Mengqin Yuan
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
| | - Ramkrishna Mitra
- Department of Pharmacology, Physiology, and Cancer Biology, Sidney Kimmel Cancer Center, Thomas Jefferson University, 233 South 10th St., Philadelphia, PA, 19107, USA
| | - Xu Zhou
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
| | - Min Long
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
| | - Wanyue Lei
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
| | - Shunheng Zhou
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
| | - Yu-E Huang
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
| | - Fei Hou
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China
| | - Christine M Eischen
- Department of Pharmacology, Physiology, and Cancer Biology, Sidney Kimmel Cancer Center, Thomas Jefferson University, 233 South 10th St., Philadelphia, PA, 19107, USA.
| | - Wei Jiang
- Department of Biomedical Engineering, Nanjing University of Aeronautics and Astronautics, No. 29, Jiangjun Avenue, Nanjing, 211106, Jiangsu Province, China.
| |
Collapse
|
40
|
Dora D, Dora T, Szegvari G, Gerdán C, Lohinai Z. EZCancerTarget: an open-access drug repurposing and data-collection tool to enhance target validation and optimize international research efforts against highly progressive cancers. BioData Min 2022; 15:25. [PMID: 36183137 PMCID: PMC9526900 DOI: 10.1186/s13040-022-00307-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Accepted: 09/18/2022] [Indexed: 11/22/2022] Open
Abstract
The expanding body of potential therapeutic targets requires easily accessible, structured, and transparent real-time interpretation of molecular data. Open-access genomic, proteomic and drug-repurposing databases transformed the landscape of cancer research, but most of them are difficult and time-consuming for casual users. Furthermore, to conduct systematic searches and data retrieval on multiple targets, researchers need the help of an expert bioinformatician, who is not always readily available for smaller research teams. We invite research teams to join and aim to enhance the cooperative work of more experienced groups to harmonize international efforts to overcome devastating malignancies. Here, we integrate available fundamental data and present a novel, open access, data-aggregating, drug repurposing platform, deriving our searches from the entries of Clue.io. We show how we integrated our previous expertise in small-cell lung cancer (SCLC) to initiate a new platform to overcome highly progressive cancers such as triple-negative breast and pancreatic cancer with data-aggregating approaches. Through the front end, the current content of the platform can be further expanded or replaced and users can create their drug-target list to select the clinically most relevant targets for further functional validation assays or drug trials. EZCancerTarget integrates searches from publicly available databases, such as PubChem, DrugBank, PubMed, and EMA, citing up-to-date and relevant literature of every target. Moreover, information on compounds is complemented with biological background information on eligible targets using entities like UniProt, String, and GeneCards, presenting relevant pathways, molecular- and biological function and subcellular localizations of these molecules. Cancer drug discovery requires a convergence of complex, often disparate fields. We present a simple, transparent, and user-friendly drug repurposing software to facilitate the efforts of research groups in the field of cancer research.
Collapse
Affiliation(s)
- David Dora
- Department of Anatomy, Histology, and Embryology, Semmelweis University, Tuzolto st. 58, Budapest, 1094, Hungary.
| | - Timea Dora
- Department of Management and Business Economics, Budapest University of Technology and Economics, Budapest, Hungary
| | - Gabor Szegvari
- Translational Medicine Institute, Semmelweis University, Budapest, Hungary
| | - Csongor Gerdán
- National Korányi Institute of Pulmonology, Piheno ut 1., 1121, Budapest, Hungary.,Institute of Enzymology, Research Centre for Natural Sciences, Budapest, Hungary
| | - Zoltan Lohinai
- Translational Medicine Institute, Semmelweis University, Budapest, Hungary. .,National Korányi Institute of Pulmonology, Piheno ut 1., 1121, Budapest, Hungary.
| |
Collapse
|
41
|
Chen B, Li P, Liu M, Liu K, Zou M, Geng Y, Zhuang S, Xu H, Wang L, Chen T, Li Y, Zhao Z, Qi L, Gu Y. A genetic map of the chromatin regulators to drug response in cancer cells. J Transl Med 2022; 20:438. [PMID: 36180906 PMCID: PMC9523919 DOI: 10.1186/s12967-022-03651-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2022] [Accepted: 09/18/2022] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Diverse drug vulnerabilities owing to the Chromatin regulators (CRs) genetic interaction across various cancers, but the identification of CRs genetic interaction remains challenging. METHODS In order to provide a global view of the CRs genetic interaction in cancer cells, we developed a method to identify potential drug response-related CRs genetic interactions for specific cancer types by integrating the screen of CRISPR-Cas9 and pharmacogenomic response datasets. RESULTS Totally, 625 drug response-related CRs synthetic lethality (CSL) interactions and 288 CRs synthetic viability (CSV) interactions were detected. Systematically network analysis presented CRs genetic interactions have biological function relationship. Furthermore, we validated CRs genetic interactions induce multiple omics deregulation in The Cancer Genome Atlas. We revealed the colon adenocarcinoma patients (COAD) with mutations of a CRs set (EP300, MSH6, NSD2 and TRRAP) mediate a better survival with low expression of MAP2 and could benefit from taxnes. While the COAD patients carrying at least one of the CSV interactions in Vorinostat CSV module confer a poor prognosis and may be resistant to Vorinostat treatment. CONCLUSIONS The CRs genetic interaction map provides a rich resource to investigate cancer-associated CRs genetic interaction and proposes a powerful strategy of biomarker discovery to guide the rational use of agents in cancer therapy.
Collapse
Affiliation(s)
- Bo Chen
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Pengfei Li
- Department of Radiology, Harbin Medical University Cancer Hospital, Harbin, China
| | - Mingyue Liu
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Kaidong Liu
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Min Zou
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Yiding Geng
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Shuping Zhuang
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Huanhuan Xu
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Linzhu Wang
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Tingting Chen
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Yawei Li
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Zhangxiang Zhao
- The Sino-Russian Medical Research Center of Jinan University, The Institute of Chronic Disease of Jinan University, The First Affiliated Hospital of Jinan University, Guangzhou, China
| | - Lishuang Qi
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China.
| | - Yunyan Gu
- Department of Systems Biology, College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China.
| |
Collapse
|
42
|
Zhang F, Luna A, Tan T, Chen Y, Sander C, Guo T. COVIDpro: Database for mining protein dysregulation in patients with COVID-19. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2022:2022.09.27.509819. [PMID: 36203550 PMCID: PMC9536031 DOI: 10.1101/2022.09.27.509819] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Background The ongoing pandemic of the coronavirus disease 2019 (COVID-19) caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) still has limited treatment options partially due to our incomplete understanding of the molecular dysregulations of the COVID-19 patients. We aimed to generate a repository and data analysis tools to examine the modulated proteins underlying COVID-19 patients for the discovery of potential therapeutic targets and diagnostic biomarkers. Methods We built a web server containing proteomic expression data from COVID-19 patients with a toolset for user-friendly data analysis and visualization. The web resource covers expert-curated proteomic data from COVID-19 patients published before May 2022. The data were collected from ProteomeXchange and from select publications via PubMed searches and aggregated into a comprehensive dataset. Protein expression by disease subgroups across projects was compared by examining differentially expressed proteins. We also visualize differentially expressed pathways and proteins. Moreover, circulating proteins that differentiated severe cases were nominated as predictive biomarkers. Findings We built and maintain a web server COVIDpro ( https://www.guomics.com/covidPro/ ) containing proteomics data generated by 41 original studies from 32 hospitals worldwide, with data from 3077 patients covering 19 types of clinical specimens, the majority from plasma and sera. 53 protein expression matrices were collected, for a total of 5434 samples and 14,403 unique proteins. Our analyses showed that the lipopolysaccharide-binding protein, as identified in the majority of the studies, was highly expressed in the blood samples of patients with severe disease. A panel of significantly dysregulated proteins was identified to separate patients with severe disease from non-severe disease. Classification of severe disease based on these proteomic signatures on five test sets reached a mean AUC of 0.87 and ACC of 0.80. Interpretation COVIDpro is an online database with an integrated analysis toolkit. It is a unique and valuable resource for testing hypotheses and identifying proteins or pathways that could be targeted by new treatments of COVID-19 patients. Funding National Key R&D Program of China: Key PDPM technologies (2021YFA1301602, 2021YFA1301601, 2021YFA1301603), Zhejiang Provincial Natural Science Foundation for Distinguished Young Scholars (LR19C050001), Hangzhou Agriculture and Society Advancement Program (20190101A04), National Natural Science Foundation of China (81972492) and National Science Fund for Young Scholars (21904107), National Resource for Network Biology (NRNB) from the National Institute of General Medical Sciences (NIGMS-P41 GM103504). Research in context Evidence before this study: Although an increasing number of therapies against COVID-19 are being developed, they are still insufficient, especially with the rise of new variants of concern. This is partially due to our incomplete understanding of the disease’s mechanisms. As data have been collected worldwide, several questions are now worth addressing via meta-analyses. Most COVID-19 drugs function by targeting or affecting proteins. Effectiveness and resistance to therapeutics can be effectively assessed via protein measurements. Empowered by mass spectrometry-based proteomics, protein expression has been characterized in a variety of patient specimens, including body fluids (e.g., serum, plasma, urea) and tissue (i.e., formalin-fixed and paraffin-embedded (FFPE)). We expert-curated proteomic expression data from COVID-19 patients published before May 2022, from the largest proteomic data repository ProteomeXhange as well as from literature search engines. Using this resource, a COVID-19 proteome meta-analysis could provide useful insights into the mechanisms of the disease and identify new potential drug targets.Added value of this study: We integrated many published datasets from patients with COVID-19 from 11 nations, with over 3000 patients and more than 5434 proteome measurements. We collected these datasets in an online database, and generated a toolbox to easily explore, analyze, and visualize the data. Next, we used the database and its associated toolbox to identify new proteins of diagnostic and therapeutic value for COVID-19 treatment. In particular, we identified a set of significantly dysregulated proteins for distinguishing severe from non-severe patients using serum samples.Implications of all the available evidence: COVIDpro will support the navigation and analysis of patterns of dysregulated proteins in various COVID-19 clinical specimens for identification and verification of protein biomarkers and potential therapeutic targets.
Collapse
|
43
|
Königs C, Friedrichs M, Dietrich T. The heterogeneous pharmacological medical biochemical network PharMeBINet. Sci Data 2022; 9:393. [PMID: 35821017 PMCID: PMC9276653 DOI: 10.1038/s41597-022-01510-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Accepted: 06/22/2022] [Indexed: 12/04/2022] Open
Abstract
Heterogeneous biomedical pharmacological databases are important for multiple fields in bioinformatics. Hetionet is a freely available database combining diverse entities and relationships from 29 public resources. Therefore, it is used as the basis for this project. 19 additional pharmacological medical and biological databases such as CTD, DrugBank, and ClinVar are parsed and integrated into Neo4j. Afterwards, the information is merged into the Hetionet structure. Different mapping methods are used such as external identification systems or name mapping. The resulting open-source Neo4j database PharMeBINet has 2,869,407 different nodes with 66 labels and 15,883,653 relationships with 208 edge types. It is a heterogeneous database containing interconnected information on ADRs, diseases, drugs, genes, gene variations, proteins, and more. Relationships between these entities represent drug-drug interactions or drug-causes-ADR relations, to name a few. It has much potential for developing further data analyses including machine learning applications. A web application for accessing the database is free to use for everyone and available at https://pharmebi.net. Additionally, the database is deposited on Zenodo at 10.5281/zenodo.6578218. Measurement(s) | data integration objective | Technology Type(s) | database creation objective |
Collapse
Affiliation(s)
- Cassandra Königs
- Bielefeld University, Bioinformatics/Medical Informatics Department, Bielefeld, 33615, Germany.
| | - Marcel Friedrichs
- Bielefeld University, Bioinformatics/Medical Informatics Department, Bielefeld, 33615, Germany
| | - Theresa Dietrich
- Bielefeld University, Bioinformatics/Medical Informatics Department, Bielefeld, 33615, Germany
| |
Collapse
|
44
|
A hexa-species transcriptome atlas of mammalian embryogenesis delineates metabolic regulation across three different implantation modes. Nat Commun 2022; 13:3407. [PMID: 35710749 PMCID: PMC9203550 DOI: 10.1038/s41467-022-30194-x] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Accepted: 04/21/2022] [Indexed: 12/16/2022] Open
Abstract
Mammalian embryogenesis relies on glycolysis and oxidative phosphorylation to balance the generation of biomass with energy production. However, the dynamics of metabolic regulation in the postimplantation embryo in vivo have remained elusive due to the inaccessibility of the implanted conceptus for biochemical studies. To address this issue, we compiled single-cell embryo profiling data in six mammalian species and determined their metabolic dynamics through glycolysis and oxidative phosphorylation associated gene expression. Strikingly, we identify a conserved switch from bivalent respiration in the late blastocyst towards a glycolytic metabolism in early gastrulation stages across species, which is independent of embryo implantation. Extraembryonic lineages followed the dynamics of the embryonic lineage, except visceral endoderm. Finally, we demonstrate that in vitro primate embryo culture substantially impacts metabolic gene regulation by comparison to in vivo samples. Our work reveals a conserved metabolic programme despite different implantation modes and highlights the need to optimise postimplantation embryo culture protocols.
Collapse
|
45
|
Petrov I, Alexeyenko A. Individualized discovery of rare cancer drivers in global network context. eLife 2022; 11:74010. [PMID: 35593700 PMCID: PMC9159755 DOI: 10.7554/elife.74010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Accepted: 05/20/2022] [Indexed: 11/13/2022] Open
Abstract
Late advances in genome sequencing expanded the space of known cancer driver genes several-fold. However, most of this surge was based on computational analysis of somatic mutation frequencies and/or their impact on the protein function. On the contrary, experimental research necessarily accounted for functional context of mutations interacting with other genes and conferring cancer phenotypes. Eventually, just such results become ‘hard currency’ of cancer biology. The new method, NEAdriver employs knowledge accumulated thus far in the form of global interaction network and functionally annotated pathways in order to recover known and predict novel driver genes. The driver discovery was individualized by accounting for mutations’ co-occurrence in each tumour genome – as an alternative to summarizing information over the whole cancer patient cohorts. For each somatic genome change, probabilistic estimates from two lanes of network analysis were combined into joint likelihoods of being a driver. Thus, ability to detect previously unnoticed candidate driver events emerged from combining individual genomic context with network perspective. The procedure was applied to 10 largest cancer cohorts followed by evaluating error rates against previous cancer gene sets. The discovered driver combinations were shown to be informative on cancer outcome. This revealed driver genes with individually sparse mutation patterns that would not be detectable by other computational methods and related to cancer biology domains poorly covered by previous analyses. In particular, recurrent mutations of collagen, laminin, and integrin genes were observed in the adenocarcinoma and glioblastoma cancers. Considering constellation patterns of candidate drivers in individual cancer genomes opens a novel avenue for personalized cancer medicine.
Collapse
Affiliation(s)
- Iurii Petrov
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Stockholm, Sweden.,Science for Life Laboratory, Solna, Sweden
| | - Andrey Alexeyenko
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Stockholm, Sweden.,Science for Life Laboratory, Solna, Sweden.,Evi-networks, enskild konsultföretag, Huddinge, Sweden
| |
Collapse
|
46
|
Lee HC, Chang CY, Huang YC, Wu KL, Chiang HH, Chang YY, Liu LX, Hung JY, Hsu YL, Wu YY, Tsai YM. Downregulated ADAMTS1 Incorporating A2M Contributes to Tumorigenesis and Alters Tumor Immune Microenvironment in Lung Adenocarcinoma. BIOLOGY 2022; 11:biology11050760. [PMID: 35625488 PMCID: PMC9139094 DOI: 10.3390/biology11050760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Revised: 05/03/2022] [Accepted: 05/11/2022] [Indexed: 11/23/2022]
Abstract
Simple Summary Lung cancer is the most dreadful cancer type and has the worst cancer-related clinical outcomes. This study used specimens from the in-house lung cancer cohort and public cohort to verify the roles of downregulated ADAMTS1, a protease remodeling extracellular matrix, to facilitate cancer promotion and progress. Based on the clinical specimens, cell and animal study with the aid of the public databases, we concluded that downregulated expression of ADAMTS1 might promote tumor progression and metastasis and modify the tumor microenvironment in lung cancer. Further investigation would be required for its application in treating lung cancer. Abstract Lung adenocarcinoma (LUAD) still holds the most dreadful clinical outcomes worldwide. Despite advanced treatment strategies, there are still some unmet needs. Next-generation sequencing of large-scale cancer genomics discovery projects combined with bioinformatics provides the opportunity to take a step forward in meeting clinical conditions. Based on in-house and The Cancer Genome Atlas (TCGA) cohorts, the results showed decreased levels of ADAMTS1 conferred poor survival compared with normal parts. Gene set enrichment analyses (GSEA) indicated the negative correlation between ADAMTS1 and the potential roles of epithelial–mesenchymal transition (EMT), metastasis, and poor prognosis in LUAD patients. With the knockdown of ADAMTS1, A549 lung cancer cells exhibited more aggressive behaviors such as EMT and increased migration, resulting in cancer metastasis in a mouse model. The pathway interaction network disclosed the linkage of downregulated α2-macroglobulin (A2M), which regulates EMT and metastasis. Furthermore, immune components analysis indicated a positive relationship between ADAMTS1 and the infiltrating levels of multiple immune cells, especially anticancer CD4+ T cells in LUAD. Notably, ADAMTS1 expression was also inversely correlated with the accumulation of immunosuppressive myeloid-derived suppressor cells and regulatory T cells, implying the downregulated ADAMTS1 mediated immune adjustment to fit the tumor survival disadvantages in LUAD patients. In conclusion, our study indicates that ADAMTS1 interacts with A2M in regulating EMT and metastasis in LUAD. Additionally, ADAMTS1 contributes to poor prognosis and immune infiltration in LUAD patients
Collapse
Affiliation(s)
- Hsiao-Chen Lee
- Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan; (H.-C.L.); (C.-Y.C.); (Y.-C.H.); (K.-L.W.); (L.-X.L.); (J.-Y.H.); (Y.-L.H.)
- Division of Plastic Surgery, Department of Surgery, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung 807, Taiwan
| | - Chao-Yuan Chang
- Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan; (H.-C.L.); (C.-Y.C.); (Y.-C.H.); (K.-L.W.); (L.-X.L.); (J.-Y.H.); (Y.-L.H.)
- Department of Anatomy, Kaohsiung Medical University, Kaohsiung 807, Taiwan
| | - Yung-Chi Huang
- Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan; (H.-C.L.); (C.-Y.C.); (Y.-C.H.); (K.-L.W.); (L.-X.L.); (J.-Y.H.); (Y.-L.H.)
| | - Kuan-Li Wu
- Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan; (H.-C.L.); (C.-Y.C.); (Y.-C.H.); (K.-L.W.); (L.-X.L.); (J.-Y.H.); (Y.-L.H.)
- Division of Pulmonary and Critical Care Medicine, Kaohsiung Medical University Hospital, Kaohsiung 807, Taiwan;
| | - Hung-Hsing Chiang
- Division of Thoracic Surgery, Department of Surgery, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung 807, Taiwan;
| | - Yung-Yun Chang
- Division of Pulmonary and Critical Care Medicine, Kaohsiung Medical University Hospital, Kaohsiung 807, Taiwan;
- Division of General Medicine, Kaohsiung Medical University Hospital, Kaohsiung 807, Taiwan
| | - Lian-Xiu Liu
- Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan; (H.-C.L.); (C.-Y.C.); (Y.-C.H.); (K.-L.W.); (L.-X.L.); (J.-Y.H.); (Y.-L.H.)
| | - Jen-Yu Hung
- Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan; (H.-C.L.); (C.-Y.C.); (Y.-C.H.); (K.-L.W.); (L.-X.L.); (J.-Y.H.); (Y.-L.H.)
- Division of Pulmonary and Critical Care Medicine, Kaohsiung Medical University Hospital, Kaohsiung 807, Taiwan;
- Department of Internal Medicine, Kaohsiung Municipal Ta-Tung Hospital, Kaohsiung 807, Taiwan
| | - Ya-Ling Hsu
- Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan; (H.-C.L.); (C.-Y.C.); (Y.-C.H.); (K.-L.W.); (L.-X.L.); (J.-Y.H.); (Y.-L.H.)
- Department of Medical Research, Kaohsiung Medical University Hospital, Kaohsiung 807, Taiwan
- Drug Development and Value Creation Research Center, Kaohsiung Medical University, Kaohsiung 807, Taiwan
| | - Yu-Yuan Wu
- School of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan;
| | - Ying-Ming Tsai
- Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan; (H.-C.L.); (C.-Y.C.); (Y.-C.H.); (K.-L.W.); (L.-X.L.); (J.-Y.H.); (Y.-L.H.)
- Division of Pulmonary and Critical Care Medicine, Kaohsiung Medical University Hospital, Kaohsiung 807, Taiwan;
- Drug Development and Value Creation Research Center, Kaohsiung Medical University, Kaohsiung 807, Taiwan
- School of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 807, Taiwan;
- Correspondence:
| |
Collapse
|
47
|
Gyori BM, Hoyt CT. PyBioPAX: biological pathway exchange in Python. JOURNAL OF OPEN SOURCE SOFTWARE 2022; 7:4136. [PMID: 36071952 PMCID: PMC9447860 DOI: 10.21105/joss.04136] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
|
48
|
A hybrid approach unveils drug repurposing candidates targeting an Alzheimer pathophysiology mechanism. PATTERNS (NEW YORK, N.Y.) 2022; 3:100433. [PMID: 35510183 PMCID: PMC9058900 DOI: 10.1016/j.patter.2021.100433] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 08/30/2021] [Accepted: 12/23/2021] [Indexed: 01/04/2023]
Abstract
The high number of failed pre-clinical and clinical studies for compounds targeting Alzheimer disease (AD) has demonstrated that there is a need to reassess existing strategies. Here, we pursue a holistic, mechanism-centric drug repurposing approach combining computational analytics and experimental screening data. Based on this integrative workflow, we identified 77 druggable modifiers of tau phosphorylation (pTau). One of the upstream modulators of pTau, HDAC6, was screened with 5,632 drugs in a tau-specific assay, resulting in the identification of 20 repurposing candidates. Four compounds and their known targets were found to have a link to AD-specific genes. Our approach can be applied to a variety of AD-associated pathophysiological mechanisms to identify more repurposing candidates. Drug-repurposing approach that combines in silico analyses and in vitro screenings A drug- and mechanism-oriented model, the Human Brain Pharmacome (HBP) was created The HBP was used to mine data related to drugs and targets to generate a hypothesis Experimental evidence validated predicted drug-target combinations
Owing to current setbacks in the discovery and development of novel treatments tackling Alzheimer disease (AD), a re-evaluation of research and development (R&D) strategies is underway. Here, we present a holistic pharmacological approach that combines drug-target information with knowledge graphs that represent essential pathophysiology mechanisms. The resulting Human Brain Pharmacome (HBP) embeds hundreds of relevant drug-target interactions in the context of disease mechanisms governing AD. We demonstrate how such a tool can be used to aid AD research by identifying already-approved drugs that have the potential to treat the disease, thereby bypassing the expensive and time-consuming task of researching and developing a new drug. In our study, we identified new drug-target combinations and provided mechanistic explanations that help to improve our understanding of AD pathology and support future development of effective therapeutic strategies.
Collapse
|
49
|
Kankanige D, Liyanage L, O'Connor MD. Application of Transcriptomics for Predicting Protein Interaction Networks, Drug Targets and Drug Candidates. FRONTIERS IN MEDICAL TECHNOLOGY 2022; 4:693148. [PMID: 35356062 PMCID: PMC8959405 DOI: 10.3389/fmedt.2022.693148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Accepted: 01/05/2022] [Indexed: 11/15/2022] Open
Abstract
Protein interaction pathways and networks are critically-required for a vast range of biological processes. Improved discovery of candidate druggable proteins within specific cell, tissue and disease contexts will aid development of new treatments. Predicting protein interaction networks from gene expression data can provide valuable insights into normal and disease biology. For example, the resulting protein networks can be used to identify potentially druggable targets and drug candidates for testing in cell and animal disease models. The advent of whole-transcriptome expression profiling techniques—that catalogue protein-coding genes expressed within cells and tissues—has enabled development of individual algorithms for particular tasks. For example,: (i) gene ontology algorithms that predict gene/protein subsets involved in related cell processes; (ii) algorithms that predict intracellular protein interaction pathways; and (iii) algorithms that correlate druggable protein targets with known drugs and/or drug candidates. This review examines approaches, advantages and disadvantages of existing gene expression, gene ontology, and protein network prediction algorithms. Using this framework, we examine current efforts to combine these algorithms into pipelines to enable identification of druggable targets, and associated known drugs, using gene expression datasets. In doing so, new opportunities are identified for development of powerful algorithm pipelines, suitable for wide use by non-bioinformaticians, that can predict protein interaction networks, druggable proteins, and related drugs from user gene expression datasets.
Collapse
Affiliation(s)
- Dulshani Kankanige
- School of Computer, Data and Mathematical Sciences, Western Sydney University, Campbelltown, NSW, Australia
| | - Liwan Liyanage
- School of Computer, Data and Mathematical Sciences, Western Sydney University, Campbelltown, NSW, Australia
| | - Michael D. O'Connor
- Translational Health Research Institute, Western Sydney University, Campbelltown, NSW, Australia
- School of Medicine, Western Sydney University, Campbelltown, NSW, Australia
- *Correspondence: Michael D. O'Connor
| |
Collapse
|
50
|
Maity AK, Stone TC, Ward V, Webster AP, Yang Z, Hogan A, McBain H, Duku M, Ho KMA, Wolfson P, Graham DG, Beck S, Teschendorff AE, Lovat LB. Novel epigenetic network biomarkers for early detection of esophageal cancer. Clin Epigenetics 2022; 14:23. [PMID: 35164838 PMCID: PMC8845366 DOI: 10.1186/s13148-022-01243-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Accepted: 02/04/2022] [Indexed: 02/01/2023] Open
Abstract
BACKGROUND Early detection of esophageal cancer is critical to improve survival. Whilst studies have identified biomarkers, their interpretation and validity is often confounded by cell-type heterogeneity. RESULTS Here we applied systems-epigenomic and cell-type deconvolution algorithms to a discovery set encompassing RNA-Seq and DNA methylation data from esophageal adenocarcinoma (EAC) patients and matched normal-adjacent tissue, in order to identify robust biomarkers, free from the confounding effect posed by cell-type heterogeneity. We identify 12 gene-modules that are epigenetically deregulated in EAC, and are able to validate all 12 modules in 4 independent EAC cohorts. We demonstrate that the epigenetic deregulation is present in the epithelial compartment of EAC-tissue. Using single-cell RNA-Seq data we show that one of these modules, a proto-cadherin module centered around CTNND2, is inactivated in Barrett's Esophagus, a precursor lesion to EAC. By measuring DNA methylation in saliva from EAC cases and controls, we identify a chemokine module centered around CCL20, whose methylation patterns in saliva correlate with EAC status. CONCLUSIONS Given our observations that a CCL20 chemokine network is overactivated in EAC tissue and saliva from EAC patients, and that in independent studies CCL20 has been found to be overactivated in EAC tissue infected with the bacterium F. nucleatum, a bacterium that normally inhabits the oral cavity, our results highlight the possibility of using DNAm measurements in saliva as a proxy for changes occurring in the esophageal epithelium. Both the CTNND2/CCL20 modules represent novel promising network biomarkers for EAC that merit further investigation.
Collapse
Affiliation(s)
- Alok K Maity
- CAS Key Lab of Computational Biology, Shanghai Institute for Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China
| | - Timothy C Stone
- Division of Surgery and Interventional Science, University College London, Gower Street, London, WC1E 6BT, UK
| | - Vanessa Ward
- Division of Surgery and Interventional Science, University College London, Gower Street, London, WC1E 6BT, UK
| | - Amy P Webster
- University of Exeter Medical School, University of Exeter, Exeter, UK
| | - Zhen Yang
- Key Laboratory of Medical Epigenetics and Metabolism, Institutes of Biomedical Sciences, Fudan University, Shanghai, 200032, China
| | - Aine Hogan
- Division of Surgery and Interventional Science, University College London, Gower Street, London, WC1E 6BT, UK
| | - Hazel McBain
- Division of Surgery and Interventional Science, University College London, Gower Street, London, WC1E 6BT, UK
| | - Margaraet Duku
- Division of Surgery and Interventional Science, University College London, Gower Street, London, WC1E 6BT, UK
| | - Kai Man Alexander Ho
- Division of Surgery and Interventional Science, University College London, Gower Street, London, WC1E 6BT, UK
| | - Paul Wolfson
- Division of Surgery and Interventional Science, University College London, Gower Street, London, WC1E 6BT, UK
| | - David G Graham
- Division of Surgery and Interventional Science, University College London, Gower Street, London, WC1E 6BT, UK.,Division of GI Services, University College London Hospitals NHS Foundation Trust, 235 Euston Road, London, NW1 2BU, UK
| | | | - Stephan Beck
- UCL Cancer Institute, University College London, Gower Street, London, WC1E 6BT, UK
| | - Andrew E Teschendorff
- CAS Key Lab of Computational Biology, Shanghai Institute for Nutrition and Health, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China.
| | - Laurence B Lovat
- Division of Surgery and Interventional Science, University College London, Gower Street, London, WC1E 6BT, UK. .,Division of GI Services, University College London Hospitals NHS Foundation Trust, 235 Euston Road, London, NW1 2BU, UK.
| |
Collapse
|