1
|
Dall’Olio D, Magnani F, Casadei F, Matteuzzi T, Curti N, Merlotti A, Simonetti G, Della Porta MG, Remondini D, Tarozzi M, Castellani G. Emerging Signatures of Hematological Malignancies from Gene Expression and Transcription Factor-Gene Regulations. Int J Mol Sci 2024; 25:13588. [PMID: 39769352 PMCID: PMC11678896 DOI: 10.3390/ijms252413588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2024] [Revised: 12/12/2024] [Accepted: 12/13/2024] [Indexed: 01/11/2025] Open
Abstract
Hematological malignancies are a diverse group of cancers developing in the peripheral blood, the bone marrow or the lymphatic system. Due to their heterogeneity, the identification of novel and advanced molecular signatures is essential for enhancing their characterization and facilitate its translation to new pharmaceutical solutions and eventually to clinical applications. In this study, we collected publicly available microarray data for more than five thousand subjects, across thirteen hematological malignancies. Using PANDA to estimate gene regulatory networks (GRNs), we performed hierarchical clustering and network analysis to explore transcription factor (TF) interactions and their implications on biological pathways. Our findings reveal distinct clustering patterns among leukemias and lymphomas, with notable differences in gene and TF expression profiles. Gene Set Enrichment Analysis (GSEA) identified 57 significantly enriched KEGG pathways, highlighting both common and unique biological processes across HMs. We also identified potential drug targets within these pathways, emphasizing the role of TFs such as CEBPB and NFE2L1 in disease pathophysiology. Our comprehensive analysis enhances the understanding of the molecular landscape of HMs and suggests new avenues for targeted therapeutic strategies. These findings also motivate the adoption of regulatory networks, combined with modern biotechnological possibilities, for insightful pan-cancer exploratory studies.
Collapse
Affiliation(s)
- Daniele Dall’Olio
- Department of Medical and Surgical Sciences, University of Bologna, 40138 Bologna, Italy
| | - Federico Magnani
- Department of Medical and Surgical Sciences, University of Bologna, 40138 Bologna, Italy
| | - Francesco Casadei
- IRCCS Istituto delle Scienze Neurologiche di Bologna, 40139 Bologna, Italy
| | - Tommaso Matteuzzi
- Department of Physics and Astronomy, University of Firenze, 50019 Sesto Fiorentino, Italy
| | - Nico Curti
- Department of Physics and Astronomy, University of Bologna, 40127 Bologna, Italy
| | - Alessandra Merlotti
- Department of Physics and Astronomy, University of Bologna, 40127 Bologna, Italy
| | - Giorgia Simonetti
- Biosciences Laboratory, IRCCS Istituto Romagnolo per lo Studio dei Tumori (IRST) “Dino Amadori”, 47014 Meldola, Italy
| | - Matteo Giovanni Della Porta
- Comprehensive Cancer Center, IRCCS Humanitas Clinical and Research Center and Department of Biomedical Sciences, Humanitas University, 20089 Milan, Italy
| | - Daniel Remondini
- Department of Physics and Astronomy, University of Bologna, 40127 Bologna, Italy
| | - Martina Tarozzi
- Department of Medical and Surgical Sciences, University of Bologna, 40138 Bologna, Italy
- IRCCS Azienda Ospedaliero-Universitaria di Bologna, 40138 Bologna, Italy
| | - Gastone Castellani
- Department of Medical and Surgical Sciences, University of Bologna, 40138 Bologna, Italy
- IRCCS Azienda Ospedaliero-Universitaria di Bologna, 40138 Bologna, Italy
| |
Collapse
|
2
|
Peng D, Cahan P. OneSC: a computational platform for recapitulating cell state transitions. Bioinformatics 2024; 40:btae703. [PMID: 39570626 PMCID: PMC11630913 DOI: 10.1093/bioinformatics/btae703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 11/13/2024] [Accepted: 11/19/2024] [Indexed: 11/22/2024] Open
Abstract
MOTIVATION Computational modeling of cell state transitions has been a great interest of many in the field of developmental biology, cancer biology, and cell fate engineering because it enables performing perturbation experiments in silico more rapidly and cheaply than could be achieved in a lab. Recent advancements in single-cell RNA-sequencing (scRNA-seq) allow the capture of high-resolution snapshots of cell states as they transition along temporal trajectories. Using these high-throughput datasets, we can train computational models to generate in silico "synthetic" cells that faithfully mimic the temporal trajectories. RESULTS Here we present OneSC, a platform that can simulate cell state transitions using systems of stochastic differential equations govern by a regulatory network of core transcription factors (TFs). Different from many current network inference methods, OneSC prioritizes on generating Boolean network that produces faithful cell state transitions and terminal cell states that mimic real biological systems. Applying OneSC to real data, we inferred a core TF network using a mouse myeloid progenitor scRNA-seq dataset and showed that the dynamical simulations of that network generate synthetic single-cell expression profiles that faithfully recapitulate the four myeloid differentiation trajectories going into differentiated cell states (erythrocytes, megakaryocytes, granulocytes, and monocytes). Finally, through the in silico perturbations of the mouse myeloid progenitor core network, we showed that OneSC can accurately predict cell fate decision biases of TF perturbations that closely match with previous experimental observations. AVAILABILITY AND IMPLEMENTATION OneSC is implemented as a Python package on GitHub (https://github.com/CahanLab/oneSC) and on Zenodo (https://zenodo.org/records/14052421).
Collapse
Affiliation(s)
- Da Peng
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21205, United States
| | - Patrick Cahan
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD 21205, United States
- Institute for Cell Engineering, Johns Hopkins University, Baltimore, MD 21205, United States
- Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD 21205, United States
| |
Collapse
|
3
|
Peng D, Cahan P. OneSC: A computational platform for recapitulating cell state transitions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.31.596831. [PMID: 38895453 PMCID: PMC11185539 DOI: 10.1101/2024.05.31.596831] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
Computational modelling of cell state transitions has been a great interest of many in the field of developmental biology, cancer biology and cell fate engineering because it enables performing perturbation experiments in silico more rapidly and cheaply than could be achieved in a wet lab. Recent advancements in single-cell RNA sequencing (scRNA-seq) allow the capture of high-resolution snapshots of cell states as they transition along temporal trajectories. Using these high-throughput datasets, we can train computational models to generate in silico 'synthetic' cells that faithfully mimic the temporal trajectories. Here we present OneSC, a platform that can simulate synthetic cells across developmental trajectories using systems of stochastic differential equations govern by a core transcription factors (TFs) regulatory network. Different from the current network inference methods, OneSC prioritizes on generating Boolean network that produces faithful cell state transitions and steady cell states that mimic real biological systems. Applying OneSC to real data, we inferred a core TF network using a mouse myeloid progenitor scRNA-seq dataset and showed that the dynamical simulations of that network generate synthetic single-cell expression profiles that faithfully recapitulate the four myeloid differentiation trajectories going into differentiated cell states (erythrocytes, megakaryocytes, granulocytes and monocytes). Finally, through the in-silico perturbations of the mouse myeloid progenitor core network, we showed that OneSC can accurately predict cell fate decision biases of TF perturbations that closely match with previous experimental observations.
Collapse
Affiliation(s)
- Da Peng
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland, 21205, USA
| | - Patrick Cahan
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, Maryland, 21205, USA
- Institute for Cell Engineering, Johns Hopkins University, Baltimore, Maryland, 21205, USA
- Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, Maryland, 21205, USA
| |
Collapse
|
4
|
Triantafyllidis CP, Barberis A, Hartley F, Cuervo AM, Gjerga E, Charlton P, van Bijsterveldt L, Rodriguez JS, Buffa FM. A machine learning and directed network optimization approach to uncover TP53 regulatory patterns. iScience 2023; 26:108291. [PMID: 38047081 PMCID: PMC10692668 DOI: 10.1016/j.isci.2023.108291] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Revised: 07/21/2023] [Accepted: 10/18/2023] [Indexed: 12/05/2023] Open
Abstract
TP53, the Guardian of the Genome, is the most frequently mutated gene in human cancers and the functional characterization of its regulation is fundamental. To address this we employ two strategies: machine learning to predict the mutation status of TP53 from transcriptomic data, and directed regulatory networks to reconstruct the effect of mutations on the transcipt levels of TP53 targets. Using data from established databases (Cancer Cell Line Encyclopedia, The Cancer Genome Atlas), machine learning could predict the mutation status, but not resolve different mutations. On the contrary, directed network optimization allowed to infer the TP53 regulatory profile across: (1) mutations, (2) irradiation in lung cancer, and (3) hypoxia in breast cancer, and we could observe differential regulatory profiles dictated by (1) mutation type, (2) deleterious consequences of the mutation, (3) known hotspots, (4) protein changes, (5) stress condition (irradiation/hypoxia). This is an important first step toward using regulatory networks for the characterization of the functional consequences of mutations, and could be extended to other perturbations, with implications for drug design and precision medicine.
Collapse
Affiliation(s)
- Charalampos P. Triantafyllidis
- Department of Oncology, Medical Sciences Division, University of Oxford, Oxford, UK
- Department of Epidemiology & Biostatistics, School of Public Health, Imperial College London, London, UK
| | - Alessandro Barberis
- Department of Oncology, Medical Sciences Division, University of Oxford, Oxford, UK
- Nuffield Department of Surgical Sciences, University of Oxford, Oxford, UK
| | - Fiona Hartley
- Department of Oncology, Medical Sciences Division, University of Oxford, Oxford, UK
| | - Ana Miar Cuervo
- Department of Oncology, Medical Sciences Division, University of Oxford, Oxford, UK
| | - Enio Gjerga
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Heidelberg, Germany
| | - Philip Charlton
- Department of Oncology, Medical Sciences Division, University of Oxford, Oxford, UK
| | | | - Julio Saez Rodriguez
- Heidelberg University, Faculty of Medicine, Heidelberg University Hospital, Institute for Computational Biomedicine, Heidelberg, Germany
| | - Francesca M. Buffa
- Department of Oncology, Medical Sciences Division, University of Oxford, Oxford, UK
- Department of Computing Sciences, BIDSA, Bocconi University, Milan, Italy
| |
Collapse
|
5
|
Hsiao YC, Dutta A. Nonlinear control designs and their application to cancer differentiation therapy. Math Biosci 2023; 366:109105. [PMID: 37944795 DOI: 10.1016/j.mbs.2023.109105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2023] [Revised: 10/29/2023] [Accepted: 10/31/2023] [Indexed: 11/12/2023]
Abstract
We designed three new controllers: a sigmoid-based controller, a polynomial dynamic inversion-based controller, and a proportional-integral-derivative (PID) impulsive controller for cancer differentiation therapy. We compared these three controllers to existing control strategies to show the improvement in performance and compare their robustness. The sigmoid-based controller adds a sigmoid term associated with the error of the controlled state and a selected observed state. The sigmoid term is multiplied by a control gain, thereby decreasing the control effort for state transition. The polynomial dynamic inversion-based controller adds a cubic error term in the error dynamic aiming to achieve a shorter convergence time to the desired value of the controlled state. The PID impulsive controller considers the accumulated controlled state error and the rate of change of the controlled state error, thereby forcing the controlled state to converge to the desired value and alleviating the damping effect in the steady state. For the considered cancer network, the 3 new cancer control strategies exhibit superior and robust performance. The PID impulsive controller has a significant improvement in robustness compared to the impulsive controller and has greater potential for cancer differentiation therapy.
Collapse
Affiliation(s)
- Yen-Che Hsiao
- Department of Electrical and Computer Engineering, University of Connecticut, Storrs, 06269, CT, USA.
| | - Abhishek Dutta
- Department of Electrical and Computer Engineering, University of Connecticut, Storrs, 06269, CT, USA
| |
Collapse
|
6
|
Abe K, Shimamura T. UNMF: a unified nonnegative matrix factorization for multi-dimensional omics data. Brief Bioinform 2023; 24:bbad253. [PMID: 37478378 PMCID: PMC10516365 DOI: 10.1093/bib/bbad253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 04/26/2023] [Accepted: 06/16/2023] [Indexed: 07/23/2023] Open
Abstract
Factor analysis, ranging from principal component analysis to nonnegative matrix factorization, represents a foremost approach in analyzing multi-dimensional data to extract valuable patterns, and is increasingly being applied in the context of multi-dimensional omics datasets represented in tensor form. However, traditional analytical methods are heavily dependent on the format and structure of the data itself, and if these change even slightly, the analyst must change their data analysis strategy and techniques and spend a considerable amount of time on data preprocessing. Additionally, many traditional methods cannot be applied as-is in the presence of missing values in the data. We present a new statistical framework, unified nonnegative matrix factorization (UNMF), for finding informative patterns in messy biological data sets. UNMF is designed for tidy data format and structure, making data analysis easier and simplifying the development of data analysis tools. UNMF can handle a wide range of data structures and formats, and works seamlessly with tensor data including missing observations and repeated measurements. The usefulness of UNMF is demonstrated through its application to several multi-dimensional omics data, offering user-friendly and unified features for analysis and integration. Its application holds great potential for the life science community. UNMF is implemented with R and is available from GitHub (https://github.com/abikoushi/moltenNMF).
Collapse
Affiliation(s)
- Ko Abe
- Division of Systems Biology, Nagoya University Graduate School of Medicine, Showa-ku, 466-8550, Nagoya, Japan
| | - Teppei Shimamura
- Division of Systems Biology, Nagoya University Graduate School of Medicine, Showa-ku, 466-8550, Nagoya, Japan
- Department of Computational and Systems Biology, Medical Research Institute, Tokyo Medical and Dental University, Bunkyo-ku, 113-8510, Tokyo, Japan
| |
Collapse
|
7
|
Systems Drug Design for Muscle Invasive Bladder Cancer and Advanced Bladder Cancer by Genome-Wide Microarray Data and Deep Learning Method with Drug Design Specifications. Int J Mol Sci 2022; 23:ijms232213869. [PMID: 36430344 PMCID: PMC9692470 DOI: 10.3390/ijms232213869] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Revised: 11/05/2022] [Accepted: 11/07/2022] [Indexed: 11/12/2022] Open
Abstract
Bladder cancer is the 10th most common cancer worldwide. Due to the lack of understanding of the oncogenic mechanisms between muscle-invasive bladder cancer (MIBC) and advanced bladder cancer (ABC) and the limitations of current treatments, novel therapeutic approaches are urgently needed. In this study, we utilized the systems biology method via genome-wide microarray data to explore the oncogenic mechanisms of MIBC and ABC to identify their respective drug targets for systems drug discovery. First, we constructed the candidate genome-wide genetic and epigenetic networks (GWGEN) through big data mining. Second, we applied the system identification and system order detection method to delete false positives in candidate GWGENs to obtain the real GWGENs of MIBC and ABC from their genome-wide microarray data. Third, we extracted the core GWGENs from the real GWGENs by selecting the significant proteins, genes and epigenetics via the principal network projection (PNP) method. Finally, we obtained the core signaling pathways from the corresponding core GWGEN through the annotations of the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway to investigate the carcinogenic mechanisms of MIBC and ABC. Based on the carcinogenic mechanisms, we selected the significant drug targets NFKB1, LEF1 and MYC for MIBC, and LEF1, MYC, NOTCH1 and FOXO1 for ABC. To design molecular drug combinations for MIBC and ABC, we employed a deep neural network (DNN)-based drug-target interaction (DTI) model with drug specifications. The DNN-based DTI model was trained by drug-target interaction databases to predict the candidate drugs for MIBC and ABC, respectively. Subsequently, the drug design specifications based on regulation ability, sensitivity and toxicity were employed as filter criteria for screening the potential drug combinations of Embelin and Obatoclax for MIBC, and Obatoclax, Entinostat and Imiquimod for ABC from their candidate drugs. In conclusion, we not only investigated the oncogenic mechanisms of MIBC and ABC, but also provided promising therapeutic options for MIBC and ABC, respectively.
Collapse
|
8
|
Seçilmiş D, Hillerton T, Tjärnberg A, Nelander S, Nordling TEM, Sonnhammer ELL. Knowledge of the perturbation design is essential for accurate gene regulatory network inference. Sci Rep 2022; 12:16531. [PMID: 36192495 PMCID: PMC9529923 DOI: 10.1038/s41598-022-19005-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 08/23/2022] [Indexed: 11/08/2022] Open
Abstract
The gene regulatory network (GRN) of a cell executes genetic programs in response to environmental and internal cues. Two distinct classes of methods are used to infer regulatory interactions from gene expression: those that only use observed changes in gene expression, and those that use both the observed changes and the perturbation design, i.e. the targets used to cause the changes in gene expression. Considering that the GRN by definition converts input cues to changes in gene expression, it may be conjectured that the latter methods would yield more accurate inferences but this has not previously been investigated. To address this question, we evaluated a number of popular GRN inference methods that either use the perturbation design or not. For the evaluation we used targeted perturbation knockdown gene expression datasets with varying noise levels generated by two different packages, GeneNetWeaver and GeneSpider. The accuracy was evaluated on each dataset using a variety of measures. The results show that on all datasets, methods using the perturbation design matrix consistently and significantly outperform methods not using it. This was also found to be the case on a smaller experimental dataset from E. coli. Targeted gene perturbations combined with inference methods that use the perturbation design are indispensable for accurate GRN inference.
Collapse
Affiliation(s)
- Deniz Seçilmiş
- Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, Box 1031, 17121, Solna, Sweden
| | - Thomas Hillerton
- Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, Box 1031, 17121, Solna, Sweden
| | - Andreas Tjärnberg
- Center for Developmental Genetics, New York University, New York, USA
| | - Sven Nelander
- Department of Immunology, Genetics and Pathology and Science for Life Laboratory, Uppsala University, 75185, Uppsala, Sweden
| | - Torbjörn E M Nordling
- Department of Mechanical Engineering, National Cheng Kung University, Tainan, 701, Taiwan, ROC
- Department of Applied Physics and Electronics, Umeå University, 90187, Umeå, Sweden
| | - Erik L L Sonnhammer
- Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, Box 1031, 17121, Solna, Sweden.
| |
Collapse
|
9
|
Lin YC, Chen BS. Identifying Drug Targets of Oral Squamous Cell Carcinoma through a Systems Biology Method and Genome-Wide Microarray Data for Drug Discovery by Deep Learning and Drug Design Specifications. Int J Mol Sci 2022; 23:ijms231810409. [PMID: 36142321 PMCID: PMC9499358 DOI: 10.3390/ijms231810409] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Revised: 09/06/2022] [Accepted: 09/07/2022] [Indexed: 11/22/2022] Open
Abstract
In this study, we provide a systems biology method to investigate the carcinogenic mechanism of oral squamous cell carcinoma (OSCC) in order to identify some important biomarkers as drug targets. Further, a systematic drug discovery method with a deep neural network (DNN)-based drug–target interaction (DTI) model and drug design specifications is proposed to design a potential multiple-molecule drug for the medical treatment of OSCC before clinical trials. First, we use big database mining to construct the candidate genome-wide genetic and epigenetic network (GWGEN) including a protein–protein interaction network (PPIN) and a gene regulatory network (GRN) for OSCC and non-OSCC. In the next step, real GWGENs are identified for OSCC and non-OSCC by system identification and system order detection methods based on the OSCC and non-OSCC microarray data, respectively. Then, the principal network projection (PNP) method was used to extract core GWGENs of OSCC and non-OSCC from real GWGENs of OSCC and non-OSCC, respectively. Afterward, core signaling pathways were constructed through the annotation of KEGG pathways, and then the carcinogenic mechanism of OSCC was investigated by comparing the core signal pathways and their downstream abnormal cellular functions of OSCC and non-OSCC. Consequently, HES1, TCF, NF-κB and SP1 are identified as significant biomarkers of OSCC. In order to discover multiple molecular drugs for these significant biomarkers (drug targets) of the carcinogenic mechanism of OSCC, we trained a DNN-based drug–target interaction (DTI) model by DTI databases to predict candidate drugs for these significant biomarkers. Finally, drug design specifications such as adequate drug regulation ability, low toxicity and high sensitivity are employed to filter out the appropriate molecular drugs metformin, gefitinib and gallic-acid to combine as a potential multiple-molecule drug for the therapeutic treatment of OSCC.
Collapse
|
10
|
Seçilmiş D, Nelander S, Sonnhammer ELL. Optimal Sparsity Selection Based on an Information Criterion for Accurate Gene Regulatory Network Inference. Front Genet 2022; 13:855770. [PMID: 35923701 PMCID: PMC9340570 DOI: 10.3389/fgene.2022.855770] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2022] [Accepted: 05/30/2022] [Indexed: 11/25/2022] Open
Abstract
Accurate inference of gene regulatory networks (GRNs) is important to unravel unknown regulatory mechanisms and processes, which can lead to the identification of treatment targets for genetic diseases. A variety of GRN inference methods have been proposed that, under suitable data conditions, perform well in benchmarks that consider the entire spectrum of false-positives and -negatives. However, it is very challenging to predict which single network sparsity gives the most accurate GRN. Lacking criteria for sparsity selection, a simplistic solution is to pick the GRN that has a certain number of links per gene, which is guessed to be reasonable. However, this does not guarantee finding the GRN that has the correct sparsity or is the most accurate one. In this study, we provide a general approach for identifying the most accurate and sparsity-wise relevant GRN within the entire space of possible GRNs. The algorithm, called SPA, applies a “GRN information criterion” (GRNIC) that is inspired by two commonly used model selection criteria, Akaike and Bayesian Information Criterion (AIC and BIC) but adapted to GRN inference. The results show that the approach can, in most cases, find the GRN whose sparsity is close to the true sparsity and close to as accurate as possible with the given GRN inference method and data. The datasets and source code can be found at https://bitbucket.org/sonnhammergrni/spa/.
Collapse
Affiliation(s)
- Deniz Seçilmiş
- Department of Biochemistry and Biophysics, Science for Life Laboratory, Stockholm University, Solna, Sweden
| | - Sven Nelander
- Science for Life Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
| | - Erik L. L. Sonnhammer
- Department of Biochemistry and Biophysics, Science for Life Laboratory, Stockholm University, Solna, Sweden
- *Correspondence: Erik L. L. Sonnhammer,
| |
Collapse
|
11
|
Seçilmiş D, Hillerton T, Sonnhammer ELL. GRNbenchmark - a web server for benchmarking directed gene regulatory network inference methods. Nucleic Acids Res 2022; 50:W398-W404. [PMID: 35609981 PMCID: PMC9252735 DOI: 10.1093/nar/gkac377] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2022] [Revised: 04/20/2022] [Accepted: 05/19/2022] [Indexed: 11/30/2022] Open
Abstract
Accurate inference of gene regulatory networks (GRN) is an essential component of systems biology, and there is a constant development of new inference methods. The most common approach to assess accuracy for publications is to benchmark the new method against a selection of existing algorithms. This often leads to a very limited comparison, potentially biasing the results, which may stem from tuning the benchmark's properties or incorrect application of other methods. These issues can be avoided by a web server with a broad range of data properties and inference algorithms, that makes it easy to perform comprehensive benchmarking of new methods, and provides a more objective assessment. Here we present https://GRNbenchmark.org/ - a new web server for benchmarking GRN inference methods, which provides the user with a set of benchmarks with several datasets, each spanning a range of properties including multiple noise levels. As soon as the web server has performed the benchmarking, the accuracy results are made privately available to the user via interactive summary plots and underlying curves. The user can then download these results for any purpose, and decide whether or not to make them public to share with the community.
Collapse
Affiliation(s)
- Deniz Seçilmiş
- Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, Box 1031, 17121 Solna, Sweden
| | - Thomas Hillerton
- Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, Box 1031, 17121 Solna, Sweden
| | - Erik L L Sonnhammer
- Department of Biochemistry and Biophysics, Stockholm University, Science for Life Laboratory, Box 1031, 17121 Solna, Sweden
| |
Collapse
|
12
|
Repurposing Multiple-Molecule Drugs for COVID-19-Associated Acute Respiratory Distress Syndrome and Non-Viral Acute Respiratory Distress Syndrome via a Systems Biology Approach and a DNN-DTI Model Based on Five Drug Design Specifications. Int J Mol Sci 2022; 23:ijms23073649. [PMID: 35409008 PMCID: PMC8998971 DOI: 10.3390/ijms23073649] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 03/22/2022] [Accepted: 03/23/2022] [Indexed: 02/04/2023] Open
Abstract
The coronavirus disease 2019 (COVID-19) epidemic is currently raging around the world at a rapid speed. Among COVID-19 patients, SARS-CoV-2-associated acute respiratory distress syndrome (ARDS) is the main contribution to the high ratio of morbidity and mortality. However, clinical manifestations between SARS-CoV-2-associated ARDS and non-SARS-CoV-2-associated ARDS are quite common, and their therapeutic treatments are limited because the intricated pathophysiology having been not fully understood. In this study, to investigate the pathogenic mechanism of SARS-CoV-2-associated ARDS and non-SARS-CoV-2-associated ARDS, first, we constructed a candidate host-pathogen interspecies genome-wide genetic and epigenetic network (HPI-GWGEN) via database mining. With the help of host-pathogen RNA sequencing (RNA-Seq) data, real HPI-GWGEN of COVID-19-associated ARDS and non-viral ARDS were obtained by system modeling, system identification, and Akaike information criterion (AIC) model order selection method to delete the false positives in candidate HPI-GWGEN. For the convenience of mitigation, the principal network projection (PNP) approach is utilized to extract core HPI-GWGEN, and then the corresponding core signaling pathways of COVID-19-associated ARDS and non-viral ARDS are annotated via their core HPI-GWGEN by KEGG pathways. In order to design multiple-molecule drugs of COVID-19-associated ARDS and non-viral ARDS, we identified essential biomarkers as drug targets of pathogenesis by comparing the core signal pathways between COVID-19-associated ARDS and non-viral ARDS. The deep neural network of the drug–target interaction (DNN-DTI) model could be trained by drug–target interaction databases in advance to predict candidate drugs for the identified biomarkers. We further narrowed down these predicted drug candidates to repurpose potential multiple-molecule drugs by the filters of drug design specifications, including regulation ability, sensitivity, excretion, toxicity, and drug-likeness. Taken together, we not only enlighten the etiologic mechanisms under COVID-19-associated ARDS and non-viral ARDS but also provide novel therapeutic options for COVID-19-associated ARDS and non-viral ARDS.
Collapse
|
13
|
Network Biology and Artificial Intelligence Drive the Understanding of the Multidrug Resistance Phenotype in Cancer. Drug Resist Updat 2022; 60:100811. [DOI: 10.1016/j.drup.2022.100811] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2021] [Revised: 01/22/2022] [Accepted: 01/24/2022] [Indexed: 02/07/2023]
|