1
|
Muldoon JJ, Yu JS, Fassia MK, Bagheri N. Network inference performance complexity: a consequence of topological, experimental and algorithmic determinants. Bioinformatics 2019; 35:3421-3432. [PMID: 30932143 PMCID: PMC6748731 DOI: 10.1093/bioinformatics/btz105] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2018] [Revised: 01/24/2019] [Accepted: 02/11/2019] [Indexed: 12/21/2022] Open
Abstract
MOTIVATION Network inference algorithms aim to uncover key regulatory interactions governing cellular decision-making, disease progression and therapeutic interventions. Having an accurate blueprint of this regulation is essential for understanding and controlling cell behavior. However, the utility and impact of these approaches are limited because the ways in which various factors shape inference outcomes remain largely unknown. RESULTS We identify and systematically evaluate determinants of performance-including network properties, experimental design choices and data processing-by developing new metrics that quantify confidence across algorithms in comparable terms. We conducted a multifactorial analysis that demonstrates how stimulus target, regulatory kinetics, induction and resolution dynamics, and noise differentially impact widely used algorithms in significant and previously unrecognized ways. The results show how even if high-quality data are paired with high-performing algorithms, inferred models are sometimes susceptible to giving misleading conclusions. Lastly, we validate these findings and the utility of the confidence metrics using realistic in silico gene regulatory networks. This new characterization approach provides a way to more rigorously interpret how algorithms infer regulation from biological datasets. AVAILABILITY AND IMPLEMENTATION Code is available at http://github.com/bagherilab/networkinference/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Joseph J Muldoon
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA
- Interdisciplinary Biological Sciences Program, Northwestern University, Evanston, IL, USA
| | - Jessica S Yu
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA
| | - Mohammad-Kasim Fassia
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA
- Department of Biomedical Engineering, Northwestern University, Evanston, IL, USA
| | - Neda Bagheri
- Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL, USA
- Interdisciplinary Biological Sciences Program, Northwestern University, Evanston, IL, USA
- Center for Synthetic Biology, Northwestern University, Evanston, IL, USA
- Chemistry of Life Processes Institute, Northwestern University, Evanston, IL, USA
- Northwestern Institute on Complex Systems, Northwestern University, Evanston, IL, USA
| |
Collapse
|
2
|
Köksal AS, Beck K, Cronin DR, McKenna A, Camp ND, Srivastava S, MacGilvray ME, Bodík R, Wolf-Yadlin A, Fraenkel E, Fisher J, Gitter A. Synthesizing Signaling Pathways from Temporal Phosphoproteomic Data. Cell Rep 2018; 24:3607-3618. [PMID: 30257219 PMCID: PMC6295338 DOI: 10.1016/j.celrep.2018.08.085] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Revised: 04/16/2018] [Accepted: 08/29/2018] [Indexed: 12/25/2022] Open
Abstract
We present a method for automatically discovering signaling pathways from time-resolved phosphoproteomic data. The Temporal Pathway Synthesizer (TPS) algorithm uses constraint-solving techniques first developed in the context of formal verification to explore paths in an interaction network. It systematically eliminates all candidate structures for a signaling pathway where a protein is activated or inactivated before its upstream regulators. The algorithm can model more than one hundred thousand dynamic phosphosites and can discover pathway members that are not differentially phosphorylated. By analyzing temporal data, TPS defines signaling cascades without needing to experimentally perturb individual proteins. It recovers known pathways and proposes pathway connections when applied to the human epidermal growth factor and yeast osmotic stress responses. Independent kinase mutant studies validate predicted substrates in the TPS osmotic stress pathway.
Collapse
Affiliation(s)
- Ali Sinan Köksal
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA, USA
| | - Kirsten Beck
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Dylan R Cronin
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA; Department of Biological Sciences, Bowling Green State University, Bowling Green, OH, USA
| | - Aaron McKenna
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Nathan D Camp
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Saurabh Srivastava
- Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA, USA
| | | | - Rastislav Bodík
- Paul G. Allen Center for Computer Science and Engineering, University of Washington, Seattle, WA, USA
| | | | - Ernest Fraenkel
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Jasmin Fisher
- Microsoft Research, Cambridge, UK; Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Anthony Gitter
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA; Morgridge Institute for Research, Madison, WI, USA.
| |
Collapse
|
3
|
Santra T, Rukhlenko O, Zhernovkov V, Kholodenko BN. Reconstructing static and dynamic models of signaling pathways using Modular Response Analysis. ACTA ACUST UNITED AC 2018. [DOI: 10.1016/j.coisb.2018.02.003] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
4
|
Nguyen P, Braun R. Semi-supervised network inference using simulated gene expression dynamics. Bioinformatics 2018; 34:1148-1156. [PMID: 29186340 DOI: 10.1093/bioinformatics/btx748] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2017] [Accepted: 11/23/2017] [Indexed: 01/21/2023] Open
Abstract
Motivation Inferring the structure of gene regulatory networks from high-throughput datasets remains an important and unsolved problem. Current methods are hampered by problems such as noise, low sample size, and incomplete characterizations of regulatory dynamics, leading to networks with missing and anomalous links. Integration of prior network information (e.g. from pathway databases) has the potential to improve reconstructions. Results We developed a semi-supervised network reconstruction algorithm that enables the synthesis of information from partially known networks with time course gene expression data. We adapted partial least square-variable importance in projection (VIP) for time course data and used reference networks to simulate expression data from which null distributions of VIP scores are generated and used to estimate edge probabilities for input expression data. By using simulated dynamics to generate reference distributions, this approach incorporates previously known regulatory relationships and links the network to the dynamics to form a semi-supervised approach that discovers novel and anomalous connections. We applied this approach to data from a sleep deprivation study with KEGG pathways treated as prior networks, as well as to synthetic data from several DREAM challenges, and find that it is able to recover many of the true edges and identify errors in these networks, suggesting its ability to derive posterior networks that accurately reflect gene expression dynamics. Availability and implementation R code is available at https://github.com/pn51/postPLSR. Contact rbraun@northwestern.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Phan Nguyen
- Department of Engineering Sciences and Applied Mathematics, Northwestern University, Evanston, IL 60208, USA
| | - Rosemary Braun
- Department of Engineering Sciences and Applied Mathematics, Northwestern University, Evanston, IL 60208, USA.,Biostatistics Division, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, USA
| |
Collapse
|
5
|
Windowed Granger causal inference strategy improves discovery of gene regulatory networks. Proc Natl Acad Sci U S A 2018; 115:2252-2257. [PMID: 29440433 DOI: 10.1073/pnas.1710936115] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Accurate inference of regulatory networks from experimental data facilitates the rapid characterization and understanding of biological systems. High-throughput technologies can provide a wealth of time-series data to better interrogate the complex regulatory dynamics inherent to organisms, but many network inference strategies do not effectively use temporal information. We address this limitation by introducing Sliding Window Inference for Network Generation (SWING), a generalized framework that incorporates multivariate Granger causality to infer network structure from time-series data. SWING moves beyond existing Granger methods by generating windowed models that simultaneously evaluate multiple upstream regulators at several potential time delays. We demonstrate that SWING elucidates network structure with greater accuracy in both in silico and experimentally validated in vitro systems. We estimate the apparent time delays present in each system and demonstrate that SWING infers time-delayed, gene-gene interactions that are distinct from baseline methods. By providing a temporal framework to infer the underlying directed network topology, SWING generates testable hypotheses for gene-gene influences.
Collapse
|
6
|
Ciaccio MF, Jones RB. Microwestern Arrays for Systems-Level Analysis of SH2 Domain-Containing Proteins. Methods Mol Biol 2017; 1555:453-473. [PMID: 28092050 DOI: 10.1007/978-1-4939-6762-9_27] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
The Microwestern Array (MWA) method combines the scalability and miniaturization afforded by the Reverse Phase Lysate Array (RPLA) approach with the electrophoretic separation characteristic of the Western blot. This technology emulates the creation of an array of small Western blots on a single sheet of nitrocellulose allowing for the sensitive and quantitative measurement of hundreds of proteins from hundreds of cell lysates with minimal cost and maximal accuracy, precision, and reproducibility. The MWA is a versatile technology that can be easily configured for purposes such as antibody screening, cell signaling network inference, protein modification/phenotype regression analysis, and genomic/proteomic relationships. Accordingly, configurations for the MWA can be optimized for maximal numbers of proteins analyzed from small numbers of cell lysates, for small numbers of antibodies against large numbers of cell lysates, or for maximal resolution of protein size achieved by increased electrophoretic separation distance. For example, on a single gel, 6 samples can be printed 96 times if a few samples need to be assayed with a large number of antibodies. Alternatively, up to 100 samples can be assayed with four antibodies on a single gel. Intermediate configurations are also discussed.The efficiency of the MWA is orders of magnitude greater in reagents, labor, and time required per data point relative to the standard Western blotting method and orders of magnitude more sensitive than standard mass spectrometry methods. The MWA is therefore a very attractive approach for capturing global changes in protein abundances and modifications including tyrosine phosphorylation and SH2 domain binding sites.
Collapse
Affiliation(s)
- Mark F Ciaccio
- Independent Biotechnology Professional, Chicago, IL, 60626, USA.
- The Ben May Department of Cancer Research, The Institute for Genomics and Systems Biology, University of Chicago, Chicago, IL, USA.
| | - Richard B Jones
- Independent Biotechnology Professional, Moscow, ID, 83843, USA
- The Ben May Department of Cancer Research, The Institute for Genomics and Systems Biology, University of Chicago, Chicago, IL, USA
| |
Collapse
|
7
|
Casado P, Hijazi M, Britton D, Cutillas PR. Impact of phosphoproteomics in the translation of kinase-targeted therapies. Proteomics 2016; 17. [DOI: 10.1002/pmic.201600235] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2016] [Revised: 09/29/2016] [Accepted: 10/20/2016] [Indexed: 12/29/2022]
Affiliation(s)
- Pedro Casado
- Cell Signalling and Proteomics Group; Centre for Haemato-Oncology; Barts Cancer Institute; Queen Mary University of London; UK
| | - Maruan Hijazi
- Cell Signalling and Proteomics Group; Centre for Haemato-Oncology; Barts Cancer Institute; Queen Mary University of London; UK
| | - David Britton
- Cell Signalling and Proteomics Group; Centre for Haemato-Oncology; Barts Cancer Institute; Queen Mary University of London; UK
| | - Pedro R. Cutillas
- Cell Signalling and Proteomics Group; Centre for Haemato-Oncology; Barts Cancer Institute; Queen Mary University of London; UK
| |
Collapse
|
8
|
Halasz M, Kholodenko BN, Kolch W, Santra T. Integrating network reconstruction with mechanistic modeling to predict cancer therapies. Sci Signal 2016; 9:ra114. [PMID: 27879396 DOI: 10.1126/scisignal.aae0535] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Signal transduction networks are often rewired in cancer cells. Identifying these alterations will enable more effective cancer treatment. We developed a computational framework that can identify, reconstruct, and mechanistically model these rewired networks from noisy and incomplete perturbation response data and then predict potential targets for intervention. As a proof of principle, we analyzed a perturbation data set targeting epidermal growth factor receptor (EGFR) and insulin-like growth factor 1 receptor (IGF1R) pathways in a panel of colorectal cancer cells. Our computational approach predicted cell line-specific network rewiring. In particular, feedback inhibition of insulin receptor substrate 1 (IRS1) by the kinase p70S6K was predicted to confer resistance to EGFR inhibition, suggesting that disrupting this feedback may restore sensitivity to EGFR inhibitors in colorectal cancer cells. We experimentally validated this prediction with colorectal cancer cell lines in culture and in a zebrafish (Danio rerio) xenograft model.
Collapse
Affiliation(s)
- Melinda Halasz
- Systems Biology Ireland, University College Dublin, Belfield, Dublin 4, Ireland. .,School of Medicine, University College Dublin, Belfield, Dublin 4, Ireland
| | - Boris N Kholodenko
- Systems Biology Ireland, University College Dublin, Belfield, Dublin 4, Ireland.,School of Medicine, University College Dublin, Belfield, Dublin 4, Ireland.,Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Belfield, Dublin 4, Ireland
| | - Walter Kolch
- Systems Biology Ireland, University College Dublin, Belfield, Dublin 4, Ireland. .,School of Medicine, University College Dublin, Belfield, Dublin 4, Ireland.,Conway Institute of Biomolecular and Biomedical Research, University College Dublin, Belfield, Dublin 4, Ireland
| | - Tapesh Santra
- Systems Biology Ireland, University College Dublin, Belfield, Dublin 4, Ireland.
| |
Collapse
|