1
|
Pandey AK, Loscalzo J. Network medicine: an approach to complex kidney disease phenotypes. Nat Rev Nephrol 2023:10.1038/s41581-023-00705-0. [PMID: 37041415 DOI: 10.1038/s41581-023-00705-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/13/2023] [Indexed: 04/13/2023]
Abstract
Scientific reductionism has been the basis of disease classification and understanding for more than a century. However, the reductionist approach of characterizing diseases from a limited set of clinical observations and laboratory evaluations has proven insufficient in the face of an exponential growth in data generated from transcriptomics, proteomics, metabolomics and deep phenotyping. A new systematic method is necessary to organize these datasets and build new definitions of what constitutes a disease that incorporates both biological and environmental factors to more precisely describe the ever-growing complexity of phenotypes and their underlying molecular determinants. Network medicine provides such a conceptual framework to bridge these vast quantities of data while providing an individualized understanding of disease. The modern application of network medicine principles is yielding new insights into the pathobiology of chronic kidney diseases and renovascular disorders by expanding the understanding of pathogenic mediators, novel biomarkers and new options for renal therapeutics. These efforts affirm network medicine as a robust paradigm for elucidating new advances in the diagnosis and treatment of kidney disorders.
Collapse
Affiliation(s)
- Arvind K Pandey
- Division of Cardiovascular Medicine, Department of Medicine, Brigham and Women's Hospital, and Harvard Medical School, Boston, MA, USA
| | - Joseph Loscalzo
- Division of Cardiovascular Medicine, Department of Medicine, Brigham and Women's Hospital, and Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
2
|
Kotiang S, Eslami A. Boolean factor graph model for biological systems: the yeast cell-cycle network. BMC Bioinformatics 2021; 22:442. [PMID: 34535069 PMCID: PMC8447535 DOI: 10.1186/s12859-021-04361-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2021] [Accepted: 08/28/2021] [Indexed: 11/14/2022] Open
Abstract
Background The desire to understand genomic functions and the behavior of complex gene regulatory networks has recently been a major research focus in systems biology. As a result, a plethora of computational and modeling tools have been proposed to identify and infer interactions among biological entities. Here, we consider the general question of the effect of perturbation on the global dynamical network behavior as well as error propagation in biological networks to incite research pertaining to intervention strategies. Results This paper introduces a computational framework that combines the formulation of Boolean networks and factor graphs to explore the global dynamical features of biological systems. A message-passing algorithm is proposed for this formalism to evolve network states as messages in the graph. In addition, the mathematical formulation allows us to describe the dynamics and behavior of error propagation in gene regulatory networks by conducting a density evolution (DE) analysis. The model is applied to assess the network state progression and the impact of gene deletion in the budding yeast cell cycle. Simulation results show that our model predictions match published experimental data. Also, our findings reveal that the sample yeast cell-cycle network is not only robust but also consistent with real high-throughput expression data. Finally, our DE analysis serves as a tool to find the optimal values of network parameters for resilience against perturbations, especially in the inference of genetic graphs. Conclusion Our computational framework provides a useful graphical model and analytical tools to study biological networks. It can be a powerful tool to predict the consequences of gene deletions before conducting wet bench experiments because it proves to be a quick route to predicting biologically relevant dynamic properties without tunable kinetic parameters. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04361-8.
Collapse
Affiliation(s)
- Stephen Kotiang
- Department of Electrical Engineering and Computer Science, Wichita State University, Wichita, KS, 67260, USA
| | - Ali Eslami
- Department of Electrical Engineering and Computer Science, Wichita State University, Wichita, KS, 67260, USA.
| |
Collapse
|
3
|
Silverman EK, Schmidt HHHW, Anastasiadou E, Altucci L, Angelini M, Badimon L, Balligand JL, Benincasa G, Capasso G, Conte F, Di Costanzo A, Farina L, Fiscon G, Gatto L, Gentili M, Loscalzo J, Marchese C, Napoli C, Paci P, Petti M, Quackenbush J, Tieri P, Viggiano D, Vilahur G, Glass K, Baumbach J. Molecular networks in Network Medicine: Development and applications. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2020; 12:e1489. [PMID: 32307915 DOI: 10.1002/wsbm.1489] [Citation(s) in RCA: 84] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2019] [Revised: 02/29/2020] [Accepted: 03/20/2020] [Indexed: 12/14/2022]
Abstract
Network Medicine applies network science approaches to investigate disease pathogenesis. Many different analytical methods have been used to infer relevant molecular networks, including protein-protein interaction networks, correlation-based networks, gene regulatory networks, and Bayesian networks. Network Medicine applies these integrated approaches to Omics Big Data (including genetics, epigenetics, transcriptomics, metabolomics, and proteomics) using computational biology tools and, thereby, has the potential to provide improvements in the diagnosis, prognosis, and treatment of complex diseases. We discuss briefly the types of molecular data that are used in molecular network analyses, survey the analytical methods for inferring molecular networks, and review efforts to validate and visualize molecular networks. Successful applications of molecular network analysis have been reported in pulmonary arterial hypertension, coronary heart disease, diabetes mellitus, chronic lung diseases, and drug development. Important knowledge gaps in Network Medicine include incompleteness of the molecular interactome, challenges in identifying key genes within genetic association regions, and limited applications to human diseases. This article is categorized under: Models of Systems Properties and Processes > Mechanistic Models Translational, Genomic, and Systems Medicine > Translational Medicine Analytical and Computational Methods > Analytical Methods Analytical and Computational Methods > Computational Methods.
Collapse
Affiliation(s)
- Edwin K Silverman
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA
| | - Harald H H W Schmidt
- Department of Pharmacology and Personalized Medicine, School of Mental Health and Neuroscience, Faculty of Health, Medicine and Life Science, Maastricht University, Maastricht, The Netherlands
| | - Eleni Anastasiadou
- Department of Experimental Medicine, Sapienza University of Rome, Rome, Italy
| | - Lucia Altucci
- Department of Precision Medicine, University of Campania 'Luigi Vanvitelli', Naples, Italy
| | - Marco Angelini
- Department of Computer, Control and Management Engineering, Sapienza University of Rome, Rome, Italy
| | - Lina Badimon
- Cardiovascular Program-ICCC, IR-Hospital de la Santa Creu i Sant Pau, CiberCV, IIB-Sant Pau, Autonomous University of Barcelona, Barcelona, Spain
| | - Jean-Luc Balligand
- Pole of Pharmacology and Therapeutics (FATH), Institute for Clinical and Experimental Research (IREC), UCLouvain, Brussels, Belgium
| | - Giuditta Benincasa
- Department of Advanced Clinical and Surgical Sciences, University of Campania "Luigi Vanvitelli", Naples, Italy
| | - Giovambattista Capasso
- Department of Translational Medical Sciences, University of Campania "L. Vanvitelli", Naples, Italy.,BIOGEM, Ariano Irpino, Italy
| | - Federica Conte
- Institute for Systems Analysis and Computer Science "Antonio Ruberti", National Research Council, Rome, Italy
| | - Antonella Di Costanzo
- Department of Precision Medicine, University of Campania 'Luigi Vanvitelli', Naples, Italy
| | - Lorenzo Farina
- Department of Computer, Control and Management Engineering, Sapienza University of Rome, Rome, Italy
| | - Giulia Fiscon
- Institute for Systems Analysis and Computer Science "Antonio Ruberti", National Research Council, Rome, Italy
| | - Laurent Gatto
- de Duve Institute, Brussels, Belgium.,Institute for Experimental and Clinical Research (IREC), UCLouvain, Brussels, Belgium
| | - Michele Gentili
- Department of Computer, Control and Management Engineering, Sapienza University of Rome, Rome, Italy
| | - Joseph Loscalzo
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA.,Division of Cardiovascular Medicine, Brigham and Women's Hospital, Boston, Massachusetts, USA
| | - Cinzia Marchese
- Department of Experimental Medicine, Sapienza University of Rome, Rome, Italy
| | - Claudio Napoli
- Department of Advanced Clinical and Surgical Sciences, University of Campania "Luigi Vanvitelli", Naples, Italy
| | - Paola Paci
- Department of Computer, Control and Management Engineering, Sapienza University of Rome, Rome, Italy
| | - Manuela Petti
- Department of Computer, Control and Management Engineering, Sapienza University of Rome, Rome, Italy
| | - John Quackenbush
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA.,Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Paolo Tieri
- CNR National Research Council of Italy, IAC Institute for Applied Computing, Rome, Italy
| | - Davide Viggiano
- BIOGEM, Ariano Irpino, Italy.,Department of Medicine and Health Sciences, University of Molise, Campobasso, Italy
| | - Gemma Vilahur
- Cardiovascular Program-ICCC, IR-Hospital de la Santa Creu i Sant Pau, CiberCV, IIB-Sant Pau, Autonomous University of Barcelona, Barcelona, Spain
| | - Kimberly Glass
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts, USA.,Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA
| | - Jan Baumbach
- Department of Experimental Bioinformatics, TUM School of Life Sciences Weihenstephan, Technical University of Munich, Maximus-von-Imhof-Forum 3, Freising, Germany.,Institute of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark
| |
Collapse
|
4
|
Orak NH. A Hybrid Bayesian Network Framework for Risk Assessment of Arsenic Exposure and Adverse Reproductive Outcomes. ECOTOXICOLOGY AND ENVIRONMENTAL SAFETY 2020; 192:110270. [PMID: 32036100 DOI: 10.1016/j.ecoenv.2020.110270] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Revised: 01/27/2020] [Accepted: 01/30/2020] [Indexed: 06/10/2023]
Abstract
Arsenic contamination of drinking water affects more than 137 million people and has been linked to several adverse health effects. The traditional toxicological approach, "dose-response" graphs, are limited in their ability to unveil the relationships between potential risk factors of arsenic exposure for adverse human health outcomes, which are critically important to understanding the risk at low exposure levels of arsenic. Therefore, to provide insight on the potential interactions of different variables of the arsenic exposure network, this study characterizes the risk factors by developing a hybrid Bayesian Belief Network (BBN) model for health risk assessment. The results show that the low inorganic arsenic concentration increases the risk of low birth weight even for low gestational age scenarios. While increasing the mother's age does not increase the low birthweight risk, it affects the distribution between other categories of baby weight. For low MMA% (<4%) in the human body, increasing gestational age decreases the risk of having low birthweight. The proposed BBN model provides 82% sensitivity and 72% specificity in average for different states of birthweight.
Collapse
Affiliation(s)
- Nur H Orak
- Duzce University, Department of Environmental Engineering, Duzce, Turkey.
| |
Collapse
|
5
|
Kotiang S, Eslami A. A probabilistic graphical model for system-wide analysis of gene regulatory networks. Bioinformatics 2020; 36:3192-3199. [DOI: 10.1093/bioinformatics/btaa122] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2019] [Revised: 01/15/2020] [Accepted: 02/18/2020] [Indexed: 01/28/2023] Open
Abstract
Abstract
Motivation
The inference of gene regulatory networks (GRNs) from DNA microarray measurements forms a core element of systems biology-based phenotyping. In the recent past, numerous computational methodologies have been formalized to enable the deduction of reliable and testable predictions in today’s biology. However, little focus has been aimed at quantifying how well existing state-of-the-art GRNs correspond to measured gene-expression profiles.
Results
Here, we present a computational framework that combines the formulation of probabilistic graphical modeling, standard statistical estimation, and integration of high-throughput biological data to explore the global behavior of biological systems and the global consistency between experimentally verified GRNs and corresponding large microarray compendium data. The model is represented as a probabilistic bipartite graph, which can handle highly complex network systems and accommodates partial measurements of diverse biological entities, e.g. messengerRNAs, proteins, metabolites and various stimulators participating in regulatory networks. This method was tested on microarray expression data from the M3D database, corresponding to sub-networks on one of the best researched model organisms, Escherichia coli. Results show a surprisingly high correlation between the observed states and the inferred system’s behavior under various experimental conditions.
Availability and implementation
Processed data and software implementation using Matlab are freely available at https://github.com/kotiang54/PgmGRNs. Full dataset available from the M3D database.
Collapse
Affiliation(s)
- Stephen Kotiang
- Department of Electrical Engineering and Computer Science, Wichita State University, Wichita, KS 67260, USA
| | - Ali Eslami
- Department of Electrical Engineering and Computer Science, Wichita State University, Wichita, KS 67260, USA
| |
Collapse
|
6
|
Bruner A, Sharan R. A Robustness Analysis of Dynamic Boolean Models of Cellular Circuits. J Comput Biol 2020; 27:133-143. [PMID: 31770006 DOI: 10.1089/cmb.2019.0290] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
With ever growing amounts of omics data, the next challenge in biological research is the interpretation of these data to gain mechanistic insights about cellular function. Dynamic models of cellular circuits that capture the activity levels of proteins and other molecules over time offer great expressive power by allowing the simulation of the effects of specific internal or external perturbations on the workings of the cell. However, the study of such models is at its infancy and no large-scale analysis of the robustness of real models to changing conditions has been conducted to date. Here we provide a computational framework to study the robustness of such models using a combination of stochastic simulations and integer linear programming techniques. We apply our framework to a large collection of cellular circuits and benchmark the results against randomized models. We find that the steady states of real circuits tend to be more robust in multiple aspects compared with their randomized counterparts.
Collapse
Affiliation(s)
- Ariel Bruner
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel
| | - Roded Sharan
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
7
|
Orak NH, Small MJ, Druzdzel MJ. Bayesian network-based framework for exposure-response study design and interpretation. Environ Health 2019; 18:23. [PMID: 30902096 PMCID: PMC6431017 DOI: 10.1186/s12940-019-0461-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2018] [Accepted: 03/04/2019] [Indexed: 05/08/2023]
Abstract
Conventional environmental-health risk-assessment methods are often limited in their ability to account for uncertainty in contaminant exposure, chemical toxicity and resulting human health risk. Exposure levels and toxicity are both subject to significant measurement errors, and many predicted risks are well below those distinguishable from background incident rates in target populations. To address these issues methods are needed to characterize uncertainties in observations and inferences, including the ability to interpret the influence of improved measurements and larger datasets. Here we develop a Bayesian network (BN) model to quantify the joint effects of measurement errors and different sample sizes on an illustrative exposure-response system. Categorical variables are included in the network to describe measurement accuracies, actual and measured exposures, actual and measured response, and the true strength of the exposure-response relationship. Network scenarios are developed by fixing combinations of the exposure-response strength of relationship (none, medium or strong) and the accuracy of exposure and response measurements (low, high, perfect). Multiple cases are simulated for each scenario, corresponding to a synthetic exposure response study sampled from the known scenario population. A learn-from-cases algorithm is then used to assimilate the synthetic observations into an uninformed prior network, yielding updated probabilities for the strength of relationship. Ten replicate studies are simulated for each scenario and sample size, and results are presented for individual trials and their mean prediction. The model as parameterized yields little-to-no convergence when low accuracy measurements are used, though progressively faster convergence when employing high accuracy or perfect measurements. The inferences from the model are particularly efficient when the true strength of relationship is none or strong with smaller sample sizes. The tool developed in this study can help in the screening and design of exposure-response studies to better anticipate where such outcomes can occur under different levels of measurement error. It may also serve to inform methods of analysis for other network models that consider multiple streams of evidence from multiple studies of cumulative exposure and effects.
Collapse
Affiliation(s)
- Nur H Orak
- Department of Civil and Environmental Engineering, Carnegie Mellon University, Pittsburgh, PA, USA.
- Department of Environmental Engineering, Duzce University, Duzce, Turkey.
| | - Mitchell J Small
- Department of Civil and Environmental Engineering, Carnegie Mellon University, Pittsburgh, PA, USA
- Department of Engineering and Public Policy, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Marek J Druzdzel
- School of Computing and Information Sciences, University of Pittsburgh, Pittsburgh, PA, USA
- Faculty of Computer Science, Bialystok University of Technology, Białystok, Poland
| |
Collapse
|
8
|
Wu F, Kang X, Wang M, Haider W, Price WB, Hajek B, Hanzawa Y. Transcriptome-Enabled Network Inference Revealed the GmCOL1 Feed-Forward Loop and Its Roles in Photoperiodic Flowering of Soybean. FRONTIERS IN PLANT SCIENCE 2019; 10:1221. [PMID: 31787988 PMCID: PMC6856076 DOI: 10.3389/fpls.2019.01221] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Accepted: 09/04/2019] [Indexed: 05/13/2023]
Abstract
Photoperiodic flowering, a plant response to seasonal photoperiod changes in the control of reproductive transition, is an important agronomic trait that has been a central target of crop domestication and modern breeding programs. However, our understanding about the molecular mechanisms of photoperiodic flowering regulation in crop species is lagging behind. To better understand the regulatory gene networks controlling photoperiodic flowering of soybeans, we elucidated global gene expression patterns under different photoperiod regimes using the near isogenic lines (NILs) of maturity loci (E loci). Transcriptome signatures identified the unique roles of the E loci in photoperiodic flowering and a set of genes controlled by these loci. To elucidate the regulatory gene networks underlying photoperiodic flowering regulation, we developed the network inference algorithmic package CausNet that integrates sparse linear regression and Granger causality heuristics, with Gaussian approximation of bootstrapping to provide reliability scores for predicted regulatory interactions. Using the transcriptome data, CausNet inferred regulatory interactions among soybean flowering genes. Published reports in the literature provided empirical verification for several of CausNet's inferred regulatory interactions. We further confirmed the inferred regulatory roles of the flowering suppressors GmCOL1a and GmCOL1b using GmCOL1 RNAi transgenic soybean plants. Combinations of the alleles of GmCOL1 and the major maturity locus E1 demonstrated positive interaction between these genes, leading to enhanced suppression of flowering transition. Our work provides novel insights and testable hypotheses in the complex molecular mechanisms of photoperiodic flowering control in soybean and lays a framework for de novo prediction of biological networks controlling important agronomic traits in crops.
Collapse
Affiliation(s)
- Faqiang Wu
- Department of Biology, California State University, Northridge, CA, United States
| | - Xiaohan Kang
- Department of Electrical Computer Engineering, University of Illinois at Urbana-Champaign, Champaign, IL, United States
| | - Minglei Wang
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Champaign, IL, United States
| | - Waseem Haider
- Department of Biosciences, COMSATS University Islamabad, Pakistan
| | - William B. Price
- Department of Electrical Computer Engineering, University of Illinois at Urbana-Champaign, Champaign, IL, United States
| | - Bruce Hajek
- Department of Crop Sciences, University of Illinois at Urbana-Champaign, Champaign, IL, United States
| | - Yoshie Hanzawa
- Department of Biology, California State University, Northridge, CA, United States
- *Correspondence: Yoshie Hanzawa,
| |
Collapse
|
9
|
Szczurek E, Beerenwinkel N. Linear effects models of signaling pathways from combinatorial perturbation data. Bioinformatics 2017; 32:i297-i305. [PMID: 27307630 PMCID: PMC4908352 DOI: 10.1093/bioinformatics/btw268] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Motivation: Perturbations constitute the central means to study signaling pathways. Interrupting components of the pathway and analyzing observed effects of those interruptions can give insight into unknown connections within the signaling pathway itself, as well as the link from the pathway to the effects. Different pathway components may have different individual contributions to the measured perturbation effects, such as gene expression changes. Those effects will be observed in combination when the pathway components are perturbed. Extant approaches focus either on the reconstruction of pathway structure or on resolving how the pathway components control the downstream effects. Results: Here, we propose a linear effects model, which can be applied to solve both these problems from combinatorial perturbation data. We use simulated data to demonstrate the accuracy of learning the pathway structure as well as estimation of the individual contributions of pathway components to the perturbation effects. The practical utility of our approach is illustrated by an application to perturbations of the mitogen-activated protein kinase pathway in Saccharomyces cerevisiae. Availability and Implementation: lem is available as a R package at http://www.mimuw.edu.pl/∼szczurek/lem. Contact:szczurek@mimuw.edu.pl; niko.beerenwinkel@bsse.ethz.ch Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ewa Szczurek
- Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, Warsaw, Poland
| | - Niko Beerenwinkel
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland SIB Swiss Institute of Bioinformatics
| |
Collapse
|
10
|
Expectation propagation for large scale Bayesian inference of non-linear molecular networks from perturbation data. PLoS One 2017; 12:e0171240. [PMID: 28166542 PMCID: PMC5293552 DOI: 10.1371/journal.pone.0171240] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2016] [Accepted: 01/17/2017] [Indexed: 11/19/2022] Open
Abstract
Inferring the structure of molecular networks from time series protein or gene expression data provides valuable information about the complex biological processes of the cell. Causal network structure inference has been approached using different methods in the past. Most causal network inference techniques, such as Dynamic Bayesian Networks and ordinary differential equations, are limited by their computational complexity and thus make large scale inference infeasible. This is specifically true if a Bayesian framework is applied in order to deal with the unavoidable uncertainty about the correct model. We devise a novel Bayesian network reverse engineering approach using ordinary differential equations with the ability to include non-linearity. Besides modeling arbitrary, possibly combinatorial and time dependent perturbations with unknown targets, one of our main contributions is the use of Expectation Propagation, an algorithm for approximate Bayesian inference over large scale network structures in short computation time. We further explore the possibility of integrating prior knowledge into network inference. We evaluate the proposed model on DREAM4 and DREAM8 data and find it competitive against several state-of-the-art existing network inference methods.
Collapse
|
11
|
Chasman D, Fotuhi Siahpirani A, Roy S. Network-based approaches for analysis of complex biological systems. Curr Opin Biotechnol 2016; 39:157-166. [PMID: 27115495 DOI: 10.1016/j.copbio.2016.04.007] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2015] [Revised: 04/04/2016] [Accepted: 04/05/2016] [Indexed: 12/22/2022]
Abstract
Cells function and respond to changes in their environment by the coordinated activity of their molecular components, including mRNAs, proteins and metabolites. At the heart of proper cellular function are molecular networks connecting these components to process extra-cellular environmental signals and drive dynamic, context-specific cellular responses. Network-based computational approaches aim to systematically integrate measurements from high-throughput experiments to gain a global understanding of cellular function under changing environmental conditions. We provide an overview of recent methodological developments toward solving two major computational problems within this field in the past two years (2013-2015): network reconstruction and network-based interpretation. Looking forward, we envision development of methods that can predict phenotypes with high accuracy as well as provide biologically plausible mechanistic hypotheses.
Collapse
Affiliation(s)
- Deborah Chasman
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, United States
| | - Alireza Fotuhi Siahpirani
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI 53706, United States; Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, United States; Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53792, United States
| | - Sushmita Roy
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, United States; Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53792, United States; Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI 53706, United States.
| |
Collapse
|
12
|
Wilentzik R, Gat-Viks I. A statistical framework for revealing signaling pathways perturbed by DNA variants. Nucleic Acids Res 2015; 43:e74. [PMID: 25765646 PMCID: PMC4477643 DOI: 10.1093/nar/gkv203] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2014] [Accepted: 02/28/2015] [Indexed: 12/03/2022] Open
Abstract
Much of the inter-individual variation in gene expression is triggered via perturbations of signaling networks by DNA variants. We present a novel probabilistic approach for identifying the particular pathways by which DNA variants perturb the signaling network. Our procedure, called PINE, relies on a systematic integration of established biological knowledge of signaling networks with data on transcriptional responses to various experimental conditions. Unlike previous approaches, PINE provides statistical aspects that are critical for prioritizing hypotheses for followup experiments. Using simulated data, we show that higher accuracy is attained with PINE than with existing methods. We used PINE to analyze transcriptional responses of immune dendritic cells to several pathogenic stimulations. PINE identified statistically significant genetic perturbations in the pathogen-sensing signaling network, suggesting previously uncharacterized regulatory mechanisms for functional DNA variants.
Collapse
Affiliation(s)
- Roni Wilentzik
- Department of Cell Research and Immunology, The George S. Wise Faculty of Life Sciences, Tel Aviv University, 6997801 Tel Aviv, Israel
| | - Irit Gat-Viks
- Department of Cell Research and Immunology, The George S. Wise Faculty of Life Sciences, Tel Aviv University, 6997801 Tel Aviv, Israel
| |
Collapse
|
13
|
Chasman D, Ho YH, Berry DB, Nemec CM, MacGilvray ME, Hose J, Merrill AE, Lee MV, Will JL, Coon JJ, Ansari AZ, Craven M, Gasch AP. Pathway connectivity and signaling coordination in the yeast stress-activated signaling network. Mol Syst Biol 2014; 10:759. [PMID: 25411400 PMCID: PMC4299600 DOI: 10.15252/msb.20145120] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Stressed cells coordinate a multi-faceted response spanning many levels of physiology. Yet
knowledge of the complete stress-activated regulatory network as well as design principles for
signal integration remains incomplete. We developed an experimental and computational approach to
integrate available protein interaction data with gene fitness contributions, mutant transcriptome
profiles, and phospho-proteome changes in cells responding to salt stress, to infer the
salt-responsive signaling network in yeast. The inferred subnetwork presented many novel predictions
by implicating new regulators, uncovering unrecognized crosstalk between known pathways, and
pointing to previously unknown ‘hubs’ of signal integration. We exploited these
predictions to show that Cdc14 phosphatase is a central hub in the network and that modification of
RNA polymerase II coordinates induction of stress-defense genes with reduction of growth-related
transcripts. We find that the orthologous human network is enriched for cancer-causing genes,
underscoring the importance of the subnetwork's predictions in understanding stress
biology.
Collapse
Affiliation(s)
- Deborah Chasman
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI, USA
| | - Yi-Hsuan Ho
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, WI, USA
| | - David B Berry
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, WI, USA
| | - Corey M Nemec
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI, USA
| | | | - James Hose
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, WI, USA
| | - Anna E Merrill
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - M Violet Lee
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Jessica L Will
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, WI, USA
| | - Joshua J Coon
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA Genome Center of Wisconsin, University of Wisconsin-Madison, Madison, WI, USA Department of Biological Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - Aseem Z Ansari
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI, USA Genome Center of Wisconsin, University of Wisconsin-Madison, Madison, WI, USA
| | - Mark Craven
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI, USA Genome Center of Wisconsin, University of Wisconsin-Madison, Madison, WI, USA Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, USA
| | - Audrey P Gasch
- Laboratory of Genetics, University of Wisconsin-Madison, Madison, WI, USA Genome Center of Wisconsin, University of Wisconsin-Madison, Madison, WI, USA
| |
Collapse
|
14
|
Guo NL, Wan YW. Network-based identification of biomarkers coexpressed with multiple pathways. Cancer Inform 2014; 13:37-47. [PMID: 25392692 PMCID: PMC4218687 DOI: 10.4137/cin.s14054] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2014] [Revised: 06/25/2014] [Accepted: 06/29/2014] [Indexed: 02/07/2023] Open
Abstract
Unraveling complex molecular interactions and networks and incorporating clinical information in modeling will present a paradigm shift in molecular medicine. Embedding biological relevance via modeling molecular networks and pathways has become increasingly important for biomarker identification in cancer susceptibility and metastasis studies. Here, we give a comprehensive overview of computational methods used for biomarker identification, and provide a performance comparison of several network models used in studies of cancer susceptibility, disease progression, and prognostication. Specifically, we evaluated implication networks, Boolean networks, Bayesian networks, and Pearson’s correlation networks in constructing gene coexpression networks for identifying lung cancer diagnostic and prognostic biomarkers. The results show that implication networks, implemented in Genet package, identified sets of biomarkers that generated an accurate prediction of lung cancer risk and metastases; meanwhile, implication networks revealed more biologically relevant molecular interactions than Boolean networks, Bayesian networks, and Pearson’s correlation networks when evaluated with MSigDB database.
Collapse
Affiliation(s)
- Nancy Lan Guo
- Mary Babb Randolph Cancer Center/School of Public Health, West Virginia University, Morgantown, WV, USA
| | - Ying-Wooi Wan
- Mary Babb Randolph Cancer Center/School of Public Health, West Virginia University, Morgantown, WV, USA
| |
Collapse
|
15
|
Gibbs DL, Gralinski L, Baric RS, McWeeney SK. Multi-omic network signatures of disease. Front Genet 2014; 4:309. [PMID: 24432028 PMCID: PMC3882664 DOI: 10.3389/fgene.2013.00309] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2013] [Accepted: 12/19/2013] [Indexed: 12/12/2022] Open
Abstract
To better understand dynamic disease processes, integrated multi-omic methods are needed, yet comparing different types of omic data remains difficult. Integrative solutions benefit experimenters by eliminating potential biases that come with single omic analysis. We have developed the methods needed to explore whether a relationship exists between co-expression network models built from transcriptomic and proteomic data types, and whether this relationship can be used to improve the disease signature discovery process. A naïve, correlation based method is utilized for comparison. Using publicly available infectious disease time series data, we analyzed the related co-expression structure of the transcriptome and proteome in response to SARS-CoV infection in mice. Transcript and peptide expression data was filtered using quality scores and subset by taking the intersection on mapped Entrez IDs. Using this data set, independent co-expression networks were built. The networks were integrated by constructing a bipartite module graph based on module member overlap, module summary correlation, and correlation to phenotypes of interest. Compared to the module level results, the naïve approach is hindered by a lack of correlation across data types, less significant enrichment results, and little functional overlap across data types. Our module graph approach avoids these problems, resulting in an integrated omic signature of disease progression, which allows prioritization across data types for down-stream experiment planning. Integrated modules exhibited related functional enrichments and could suggest novel interactions in response to infection. These disease and platform-independent methods can be used to realize the full potential of multi-omic network signatures. The data (experiment SM001) are publically available through the NIAID Systems Virology (https://www.systemsvirology.org) and PNNL (http://omics.pnl.gov) web portals. Phenotype data is found in the supplementary information. The ProCoNA package is available as part of Bioconductor 2.13.
Collapse
Affiliation(s)
- David L Gibbs
- McWeeney Lab, Division of Bioinformatics and Computational Biology, Oregon Health & Science University Portland, OR, USA
| | - Lisa Gralinski
- Baric Lab, Department of Microbiology and Immunology, University of North Carolina at Chapel Hill Chapel Hill, NC, USA
| | - Ralph S Baric
- Baric Lab, Department of Microbiology and Immunology, University of North Carolina at Chapel Hill Chapel Hill, NC, USA
| | - Shannon K McWeeney
- McWeeney Lab, Division of Bioinformatics and Computational Biology, Oregon Health & Science University Portland, OR, USA ; McWeeney Lab, OHSU Knight Cancer Institute, Oregon Health & Science University Portland, OR, USA
| |
Collapse
|
16
|
|
17
|
Karlebach G. Inferring Boolean network states from partial information. EURASIP JOURNAL ON BIOINFORMATICS & SYSTEMS BIOLOGY 2013; 2013:11. [PMID: 24006954 PMCID: PMC3850440 DOI: 10.1186/1687-4153-2013-11] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/30/2013] [Accepted: 08/26/2013] [Indexed: 01/14/2023]
Abstract
Networks of molecular interactions regulate key processes in living cells. Therefore, understanding their functionality is a high priority in advancing biological knowledge. Boolean networks are often used to describe cellular networks mathematically and are fitted to experimental datasets. The fitting often results in ambiguities since the interpretation of the measurements is not straightforward and since the data contain noise. In order to facilitate a more reliable mapping between datasets and Boolean networks, we develop an algorithm that infers network trajectories from a dataset distorted by noise. We analyze our algorithm theoretically and demonstrate its accuracy using simulation and microarray expression data.
Collapse
Affiliation(s)
- Guy Karlebach
- German Cancer Research Institute (DKFZ), Im Neuenheimer Feld 280, Heidelberg 69121, Germany.
| |
Collapse
|
18
|
Failmezger H, Praveen P, Tresch A, Fröhlich H. Learning gene network structure from time laps cell imaging in RNAi Knock downs. Bioinformatics 2013; 29:1534-40. [PMID: 23595660 DOI: 10.1093/bioinformatics/btt179] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION As RNA interference is becoming a standard method for targeted gene perturbation, computational approaches to reverse engineer parts of biological networks based on measurable effects of RNAi become increasingly relevant. The vast majority of these methods use gene expression data, but little attention has been paid so far to other data types. RESULTS Here we present a method, which can infer gene networks from high-dimensional phenotypic perturbation effects on single cells recorded by time-lapse microscopy. We use data from the Mitocheck project to extract multiple shape, intensity and texture features at each frame. Features from different cells and movies are then aligned along the cell cycle time. Subsequently we use Dynamic Nested Effects Models (dynoNEMs) to estimate parts of the network structure between perturbed genes via a Markov Chain Monte Carlo approach. Our simulation results indicate a high reconstruction quality of this method. A reconstruction based on 22 gene knock downs yielded a network, where all edges could be explained via the biological literature. AVAILABILITY The implementation of dynoNEMs is part of the Bioconductor R-package nem.
Collapse
Affiliation(s)
- Henrik Failmezger
- Computational Biology and Regulatory Networks, Max-Planck Institute for Plant Breeding Research, Carl-von-Linne-Weg 10, 50829 Cologne, Germany
| | | | | | | |
Collapse
|
19
|
Experimental and computational methods for the analysis and modeling of signaling networks. N Biotechnol 2013; 30:327-32. [DOI: 10.1016/j.nbt.2012.11.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2012] [Accepted: 11/05/2012] [Indexed: 01/30/2023]
|
20
|
Abstract
Since the first emergence of protein-protein interaction networks more than a decade ago, they have been viewed as static scaffolds of the signaling-regulatory events taking place in cells, and their analysis has been mainly confined to topological aspects. Recently, functional models of these networks have been suggested, ranging from Boolean to constraint-based methods. However, learning such models from large-scale data remains a formidable task, and most modeling approaches rely on extensive human curation. Here we provide a generic approach to learning Boolean models automatically from data. We apply our approach to growth and inflammatory signaling systems in humans and show how the learning phase can improve the fit of the model to experimental data, remove spurious interactions, and lead to better understanding of the system at hand.
Collapse
Affiliation(s)
- Roded Sharan
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel.
| | | |
Collapse
|
21
|
Qin T, Tsoi LC, Sims KJ, Lu X, Zheng WJ. Signaling network prediction by the Ontology Fingerprint enhanced Bayesian network. BMC SYSTEMS BIOLOGY 2012; 6 Suppl 3:S3. [PMID: 23282239 PMCID: PMC3524013 DOI: 10.1186/1752-0509-6-s3-s3] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
BACKGROUND Despite large amounts of available genomic and proteomic data, predicting the structure and response of signaling networks is still a significant challenge. While statistical method such as Bayesian network has been explored to meet this challenge, employing existing biological knowledge for network prediction is difficult. The objective of this study is to develop a novel approach that integrates prior biological knowledge in the form of the Ontology Fingerprint to infer cell-type-specific signaling networks via data-driven Bayesian network learning; and to further use the trained model to predict cellular responses. RESULTS We applied our novel approach to address the Predictive Signaling Network Modeling challenge of the fourth (2009) Dialog for Reverse Engineering Assessment's and Methods (DREAM4) competition. The challenge results showed that our method accurately captured signal transduction of a network of protein kinases and phosphoproteins in that the predicted protein phosphorylation levels under all experimental conditions were highly correlated (R2 = 0.93) with the observed results. Based on the evaluation of the DREAM4 organizer, our team was ranked as one of the top five best performers in predicting network structure and protein phosphorylation activity under test conditions. CONCLUSIONS Bayesian network can be used to simulate the propagation of signals in cellular systems. Incorporating the Ontology Fingerprint as prior biological knowledge allows us to efficiently infer concise signaling network structure and to accurately predict cellular responses.
Collapse
Affiliation(s)
- Tingting Qin
- Bioinformatics Graduate Program, Medical University of South Carolina, Charleston, SC 29425, USA
| | - Lam C Tsoi
- Bioinformatics Graduate Program, Medical University of South Carolina, Charleston, SC 29425, USA
| | - Kellie J Sims
- Department of Biochemistry and Molecular Biology, Medical University of South Carolina, Charleston, SC 29425, USA
| | - Xinghua Lu
- Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15232, USA
| | - W Jim Zheng
- Department of Biochemistry and Molecular Biology, Medical University of South Carolina, Charleston, SC 29425, USA
| |
Collapse
|
22
|
Affiliation(s)
- Nancy Lan Guo
- Mary Babb Randolph Cancer Center/Department of Community Medicine, School of Medicine, West Virginia University, Morgantown, WV 26506-9300
| |
Collapse
|
23
|
Graudenzi A, Serra R, Villani M, Damiani C, Colacci A, Kauffman SA. Dynamical Properties of a Boolean Model of Gene Regulatory Network with Memory. J Comput Biol 2011; 18:1291-303. [DOI: 10.1089/cmb.2010.0069] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022] Open
Affiliation(s)
- Alex Graudenzi
- European Centre for Living Technology (ECLT), University Cá Foscari of Venice, Venice, Italy
- Department of Social, Cognitive and Quantitative Sciences, University of Modena and Reggio Emilia, Reggio Emilia, Italy
| | - Roberto Serra
- European Centre for Living Technology (ECLT), University Cá Foscari of Venice, Venice, Italy
- Department of Social, Cognitive and Quantitative Sciences, University of Modena and Reggio Emilia, Reggio Emilia, Italy
| | - Marco Villani
- European Centre for Living Technology (ECLT), University Cá Foscari of Venice, Venice, Italy
- Department of Social, Cognitive and Quantitative Sciences, University of Modena and Reggio Emilia, Reggio Emilia, Italy
| | - Chiara Damiani
- Department of Social, Cognitive and Quantitative Sciences, University of Modena and Reggio Emilia, Reggio Emilia, Italy
| | - Annamaria Colacci
- Excellence Environmental Carcinogenesis, Environmental Protection and Health Prevention Agency, Emilia-Romagna, Bologna, Italy
| | - Stuart A. Kauffman
- UVM's Complex Systems Center, University of Vermont, Burlington, Vermont
| |
Collapse
|
24
|
Ho JWK, Charleston MA. Network modelling of gene regulation. Biophys Rev 2010; 3:1-13. [PMID: 28510232 DOI: 10.1007/s12551-010-0041-4] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2010] [Accepted: 11/04/2010] [Indexed: 11/28/2022] Open
Abstract
Gene regulatory network (GRN) modelling has gained increasing attention in the past decade. Many computational modelling techniques have been proposed to facilitate the inference and analysis of GRN. However, there is often confusion about the aim of GRN modelling, and how a gene network model can be fully utilised as a tool for systems biology. The aim of the present article is to provide an overview of this rapidly expanding subject. In particular, we review some fundamental concepts of systems biology and discuss the role of network modelling in understanding complex biological systems. Several commonly used network modelling paradigms are surveyed with emphasis on their practical use in systems biology research.
Collapse
Affiliation(s)
- Joshua W K Ho
- School of Information Technologies, The University of Sydney, Sydney, NSW, 2006, Australia.
| | - Michael A Charleston
- School of Information Technologies, The University of Sydney, Sydney, NSW, 2006, Australia.,Centre for Mathematical Biology, The University of Sydney, Sydney, NSW, 2006, Australia
| |
Collapse
|
25
|
Fröhlich H, Praveen P, Tresch A. Fast and efficient dynamic nested effects models. Bioinformatics 2010; 27:238-44. [DOI: 10.1093/bioinformatics/btq631] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
26
|
Vaske CJ, Benz SC, Sanborn JZ, Earl D, Szeto C, Zhu J, Haussler D, Stuart JM. Inference of patient-specific pathway activities from multi-dimensional cancer genomics data using PARADIGM. ACTA ACUST UNITED AC 2010; 26:i237-45. [PMID: 20529912 PMCID: PMC2881367 DOI: 10.1093/bioinformatics/btq182] [Citation(s) in RCA: 525] [Impact Index Per Article: 37.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Motivation: High-throughput data is providing a comprehensive view of the molecular changes in cancer tissues. New technologies allow for the simultaneous genome-wide assay of the state of genome copy number variation, gene expression, DNA methylation and epigenetics of tumor samples and cancer cell lines. Analyses of current data sets find that genetic alterations between patients can differ but often involve common pathways. It is therefore critical to identify relevant pathways involved in cancer progression and detect how they are altered in different patients. Results: We present a novel method for inferring patient-specific genetic activities incorporating curated pathway interactions among genes. A gene is modeled by a factor graph as a set of interconnected variables encoding the expression and known activity of a gene and its products, allowing the incorporation of many types of omic data as evidence. The method predicts the degree to which a pathway's activities (e.g. internal gene states, interactions or high-level ‘outputs’) are altered in the patient using probabilistic inference. Compared with a competing pathway activity inference approach called SPIA, our method identifies altered activities in cancer-related pathways with fewer false-positives in both a glioblastoma multiform (GBM) and a breast cancer dataset. PARADIGM identified consistent pathway-level activities for subsets of the GBM patients that are overlooked when genes are considered in isolation. Further, grouping GBM patients based on their significant pathway perturbations divides them into clinically-relevant subgroups having significantly different survival outcomes. These findings suggest that therapeutics might be chosen that target genes at critical points in the commonly perturbed pathway(s) of a group of patients. Availability:Source code available at http://sbenz.github.com/Paradigm Contact:jstuart@soe.ucsc.edu Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
|
27
|
|
28
|
Karlebach G, Shamir R. Minimally perturbing a gene regulatory network to avoid a disease phenotype: the glioma network as a test case. BMC SYSTEMS BIOLOGY 2010; 4:15. [PMID: 20184733 PMCID: PMC2851584 DOI: 10.1186/1752-0509-4-15] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/03/2009] [Accepted: 02/25/2010] [Indexed: 11/10/2022]
Abstract
BACKGROUND Mathematical modeling of biological networks is an essential part of Systems Biology. Developing and using such models in order to understand gene regulatory networks is a major challenge. RESULTS We present an algorithm that determines the smallest perturbations required for manipulating the dynamics of a network formulated as a Petri net, in order to cause or avoid a specified phenotype. By modifying McMillan's unfolding algorithm, we handle partial knowledge and reduce computation cost. The methodology is demonstrated on a glioma network. Out of the single gene perturbations, activation of glutathione S-transferase P (GSTP1) gene was by far the most effective in blocking the cancer phenotype. Among pairs of perturbations, NFkB and TGF-beta had the largest joint effect, in accordance with their role in the EMT process. CONCLUSION Our method allows perturbation analysis of regulatory networks and can overcome incomplete information. It can help in identifying drug targets and in prioritizing perturbation experiments.
Collapse
Affiliation(s)
- Guy Karlebach
- Tel-Aviv University, Haim Levanon St,, 69978, Tel-Aviv, Israel.
| | | |
Collapse
|
29
|
Lisboa P, Vellido A, Tagliaferri R, Napolitano F, Ceccarelli M, Martin-Guerrero J, Biganzoli E. Data Mining in Cancer Research [Application Notes. IEEE COMPUT INTELL M 2010. [DOI: 10.1109/mci.2009.935311] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
30
|
Morganella S, Zoppoli P, Ceccarelli M. IRIS: a method for reverse engineering of regulatory relations in gene networks. BMC Bioinformatics 2009; 10:444. [PMID: 20030818 PMCID: PMC2813854 DOI: 10.1186/1471-2105-10-444] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2008] [Accepted: 12/23/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The ultimate aim of systems biology is to understand and describe how molecular components interact to manifest collective behaviour that is the sum of the single parts. Building a network of molecular interactions is the basic step in modelling a complex entity such as the cell. Even if gene-gene interactions only partially describe real networks because of post-transcriptional modifications and protein regulation, using microarray technology it is possible to combine measurements for thousands of genes into a single analysis step that provides a picture of the cell's gene expression. Several databases provide information about known molecular interactions and various methods have been developed to infer gene networks from expression data. However, network topology alone is not enough to perform simulations and predictions of how a molecular system will respond to perturbations. Rules for interactions among the single parts are needed for a complete definition of the network behaviour. Another interesting question is how to integrate information carried by the network topology, which can be derived from the literature, with large-scale experimental data. RESULTS Here we propose an algorithm, called inference of regulatory interaction schema (IRIS), that uses an iterative approach to map gene expression profile values (both steady-state and time-course) into discrete states and a simple probabilistic method to infer the regulatory functions of the network. These interaction rules are integrated into a factor graph model. We test IRIS on two synthetic networks to determine its accuracy and compare it to other methods. We also apply IRIS to gene expression microarray data for the Saccharomyces cerevisiae cell cycle and for human B-cells and compare the results to literature findings. CONCLUSIONS IRIS is a rapid and efficient tool for the inference of regulatory relations in gene networks. A topological description of the network and a matrix of gene expression profiles are required as input to the algorithm. IRIS maps gene expression data onto discrete values and then computes regulatory functions as conditional probability tables. The suitability of the method is demonstrated for synthetic data and microarray data. The resulting network can also be embedded in a factor graph model.
Collapse
Affiliation(s)
- Sandro Morganella
- Department of Biological and Environmental Sciences, University of Sannio, Benevento, Italy.
| | | | | |
Collapse
|
31
|
Fröhlich H, Sahin O, Arlt D, Bender C, Beissbarth T. Deterministic Effects Propagation Networks for reconstructing protein signaling networks from multiple interventions. BMC Bioinformatics 2009; 10:322. [PMID: 19814779 PMCID: PMC2770070 DOI: 10.1186/1471-2105-10-322] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2009] [Accepted: 10/08/2009] [Indexed: 11/10/2022] Open
Abstract
Background Modern gene perturbation techniques, like RNA interference (RNAi), enable us to study effects of targeted interventions in cells efficiently. In combination with mRNA or protein expression data this allows to gain insights into the behavior of complex biological systems. Results In this paper, we propose Deterministic Effects Propagation Networks (DEPNs) as a special Bayesian Network approach to reverse engineer signaling networks from a combination of protein expression and perturbation data. DEPNs allow to reconstruct protein networks based on combinatorial intervention effects, which are monitored via changes of the protein expression or activation over one or a few time points. Our implementation of DEPNs allows for latent network nodes (i.e. proteins without measurements) and has a built in mechanism to impute missing data. The robustness of our approach was tested on simulated data. We applied DEPNs to reconstruct the ERBB signaling network in de novo trastuzumab resistant human breast cancer cells, where protein expression was monitored on Reverse Phase Protein Arrays (RPPAs) after knockdown of network proteins using RNAi. Conclusion DEPNs offer a robust, efficient and simple approach to infer protein signaling networks from multiple interventions. The method as well as the data have been made part of the latest version of the R package "nem" available as a supplement to this paper and via the Bioconductor repository.
Collapse
Affiliation(s)
- Holger Fröhlich
- German Cancer Research Center, Molecular Genome Analysis, Im Neuenheimer Feld 280, 69120 Heidelberg, Germany.
| | | | | | | | | |
Collapse
|
32
|
van Hijum SAFT, Medema MH, Kuipers OP. Mechanisms and evolution of control logic in prokaryotic transcriptional regulation. Microbiol Mol Biol Rev 2009; 73:481-509, Table of Contents. [PMID: 19721087 PMCID: PMC2738135 DOI: 10.1128/mmbr.00037-08] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
A major part of organismal complexity and versatility of prokaryotes resides in their ability to fine-tune gene expression to adequately respond to internal and external stimuli. Evolution has been very innovative in creating intricate mechanisms by which different regulatory signals operate and interact at promoters to drive gene expression. The regulation of target gene expression by transcription factors (TFs) is governed by control logic brought about by the interaction of regulators with TF binding sites (TFBSs) in cis-regulatory regions. A factor that in large part determines the strength of the response of a target to a given TF is motif stringency, the extent to which the TFBS fits the optimal TFBS sequence for a given TF. Advances in high-throughput technologies and computational genomics allow reconstruction of transcriptional regulatory networks in silico. To optimize the prediction of transcriptional regulatory networks, i.e., to separate direct regulation from indirect regulation, a thorough understanding of the control logic underlying the regulation of gene expression is required. This review summarizes the state of the art of the elements that determine the functionality of TFBSs by focusing on the molecular biological mechanisms and evolutionary origins of cis-regulatory regions.
Collapse
Affiliation(s)
- Sacha A F T van Hijum
- Molecular Genetics, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Kerklaan 30, 9751 NN Haren, The Netherlands.
| | | | | |
Collapse
|
33
|
Fröhlich H, Tresch A, Beissbarth T. Nested effects models for learning signaling networks from perturbation data. Biom J 2009; 51:304-23. [PMID: 19358219 DOI: 10.1002/bimj.200800185] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Targeted gene perturbations have become a major tool to gain insight into complex cellular processes. In combination with the measurement of downstream effects via DNA microarrays, this approach can be used to gain insight into signaling pathways. Nested Effects Models were first introduced by Markowetz et al. as a probabilistic method to reverse engineer signaling cascades based on the nested structure of downstream perturbation effects. The basic framework was substantially extended later on by Fröhlich et al., Markowetz et al., and Tresch and Markowetz. In this paper, we present a review of the complete methodology with a detailed comparison of so far proposed algorithms on a qualitative and quantitative level. As an application, we present results on estimating the signaling network between 13 genes in the ER-alpha pathway of human MCF-7 breast cancer cells. Comparison with the literature shows a substantial overlap.
Collapse
Affiliation(s)
- Holger Fröhlich
- German Cancer Research Center (DKFZ), Im Neuenheimer Feld 580, 69120 Heidelberg, Germany.
| | | | | |
Collapse
|
34
|
Dingel J, Milenkovic O. List-decoding methods for inferring polynomials in finite dynamical gene network models. ACTA ACUST UNITED AC 2009; 25:1686-93. [PMID: 19401400 DOI: 10.1093/bioinformatics/btp281] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION The problem of reverse engineering the dynamics of gene expression profiles is of focal importance in systems biology. Due to noise and the inherent lack of sufficiently large datasets generated via high-throughput measurements, known reconstruction frameworks based on dynamical systems models fail to provide adequate settings for network analysis. This motivates the study of new approaches that produce stochastic lists of explanations for the observed network dynamics that can be efficiently inferred from small sample sets and in the presence of errors. RESULTS We introduce a novel algebraic modeling framework, termed stochastic polynomial dynamical systems (SPDSs) that can capture the dynamics of regulatory networks based on microarray expression data. Here, we refer to dynamics of the network as the trajectories of gene expression profiles over time. The model assumes that the expression data is quantized in a manner that allows for imposing a finite field structure on the observations, and the existence of polynomial update functions for each gene in the network. The underlying reverse engineering algorithm is based on ideas borrowed from coding theory, and in particular, list-decoding methods for so called Reed-Muller codes. The list-decoding method was tested on synthetic data and on microarray expression measurements from the M(3D) database, corresponding to a subnetwork of the Escherichia coli SOS repair system, as well as on the complete transcription factor network, available at RegulonDB. The results show that SPDSs constructed via list-decoders significantly outperform other algebraic reverse engineering methods, and that they also provide good guidelines for estimating the influence of genes on the dynamics of the network. AVAILABILITY Software codes for list-decoding algorithms suitable for direct application to quantized expression data will be publicly available at the authors' web-pages.
Collapse
Affiliation(s)
- Janis Dingel
- Institute for Communications Engineering, Technische Universität München, Munich, Germany.
| | | |
Collapse
|
35
|
Aldridge BB, Saez-Rodriguez J, Muhlich JL, Sorger PK, Lauffenburger DA. Fuzzy logic analysis of kinase pathway crosstalk in TNF/EGF/insulin-induced signaling. PLoS Comput Biol 2009; 5:e1000340. [PMID: 19343194 PMCID: PMC2663056 DOI: 10.1371/journal.pcbi.1000340] [Citation(s) in RCA: 104] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2007] [Accepted: 02/24/2009] [Indexed: 01/10/2023] Open
Abstract
When modeling cell signaling networks, a balance must be struck between
mechanistic detail and ease of interpretation. In this paper we apply a fuzzy
logic framework to the analysis of a large, systematic dataset describing the
dynamics of cell signaling downstream of TNF, EGF, and insulin receptors in
human colon carcinoma cells. Simulations based on fuzzy logic recapitulate most
features of the data and generate several predictions involving pathway
crosstalk and regulation. We uncover a relationship between MK2 and ERK pathways
that might account for the previously identified pro-survival influence of MK2.
We also find unexpected inhibition of IKK following EGF treatment, possibly due
to down-regulation of autocrine signaling. More generally, fuzzy logic models
are flexible, able to incorporate qualitative and noisy data, and powerful
enough to produce quantitative predictions and new biological insights about the
operation of signaling networks. Cells use networks of interacting proteins to interpret intra-cellular state and
extra-cellular cues and to execute cell-fate decisions. Even when individual
proteins are well understood at a molecular level, the dynamics and behavior of
networks as a whole are harder to understand. However, deciphering the operation
of such networks is key to understanding disease processes and therapeutic
opportunities. As a means to study signaling networks, we have modified and
applied a fuzzy logic approach originally developed for industrial control. We
use fuzzy logic to model the responses of colon cancer cells in culture to
combinations of pro-survival and pro-death cytokines, making it possible to
interpret quantitative data in the context of abstract information drawn from
the literature. Our work establishes that fuzzy logic can be used to understand
complex signaling pathways with respect to multi-factorial activity-based
protein data and prior knowledge.
Collapse
Affiliation(s)
- Bree B. Aldridge
- Center for Cell Decision Processes, Cambridge, Massachusetts, United
States of America
- Department of Biological Engineering, MIT, Cambridge, Massachusetts,
United States of America
- Department of Systems Biology, Harvard Medical School, Boston,
Massachusetts, United States of America
| | - Julio Saez-Rodriguez
- Center for Cell Decision Processes, Cambridge, Massachusetts, United
States of America
- Department of Biological Engineering, MIT, Cambridge, Massachusetts,
United States of America
- Department of Systems Biology, Harvard Medical School, Boston,
Massachusetts, United States of America
| | - Jeremy L. Muhlich
- Center for Cell Decision Processes, Cambridge, Massachusetts, United
States of America
- Department of Systems Biology, Harvard Medical School, Boston,
Massachusetts, United States of America
| | - Peter K. Sorger
- Center for Cell Decision Processes, Cambridge, Massachusetts, United
States of America
- Department of Biological Engineering, MIT, Cambridge, Massachusetts,
United States of America
- Department of Systems Biology, Harvard Medical School, Boston,
Massachusetts, United States of America
| | - Douglas A. Lauffenburger
- Center for Cell Decision Processes, Cambridge, Massachusetts, United
States of America
- Department of Biological Engineering, MIT, Cambridge, Massachusetts,
United States of America
- Department of Systems Biology, Harvard Medical School, Boston,
Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
36
|
Huang Y, Tienda-Luna IM, Wang Y. A Survey of Statistical Models for Reverse Engineering Gene Regulatory Networks. IEEE SIGNAL PROCESSING MAGAZINE 2009; 26:76-97. [PMID: 20046885 PMCID: PMC2763329 DOI: 10.1109/msp.2008.930647] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Statistical models for reverse engineering gene regulatory networks are surveyed in this article. To provide readers with a system-level view of the modeling issues in this research, a graphical modeling framework is proposed. This framework serves as the scaffolding on which the review of different models can be systematically assembled. Based on the framework, we review many existing models for many aspects of gene regulation; the pros and cons of each model are discussed. In addition, network inference algorithms are also surveyed under the graphical modeling framework by the categories of point solutions and probabilistic solutions and the connections and differences among the algorithms are provided. This survey has the potential to elucidate the development and future of reverse engineering GRNs and bring statistical signal processing closer to the core of this research.
Collapse
Affiliation(s)
- Yufei Huang
- Department of Electrical and Computer Engineering, University of Texas at San Antonio, San Antonio, TX 78249-0669,
| | | | | |
Collapse
|
37
|
Karlebach G, Shamir R. Modelling and analysis of gene regulatory networks. Nat Rev Mol Cell Biol 2008; 9:770-80. [PMID: 18797474 DOI: 10.1038/nrm2503] [Citation(s) in RCA: 568] [Impact Index Per Article: 35.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Gene regulatory networks have an important role in every process of life, including cell differentiation, metabolism, the cell cycle and signal transduction. By understanding the dynamics of these networks we can shed light on the mechanisms of diseases that occur when these cellular processes are dysregulated. Accurate prediction of the behaviour of regulatory networks will also speed up biotechnological projects, as such predictions are quicker and cheaper than lab experiments. Computational methods, both for supporting the development of network models and for the analysis of their functionality, have already proved to be a valuable research tool.
Collapse
Affiliation(s)
- Guy Karlebach
- The Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel
| | | |
Collapse
|
38
|
Abstract
Yeast molecular and cell biology has accumulated large amounts of qualitative and quantitative data of diverse cellular processes. The results are often summarized as verbal or graphical descriptions. Moreover, a series of mathematical models has been developed that should help to interpret such data, to integrate them into a coherent picture and to allow for an understanding of the underlying processes. Dynamic modelling of regulatory processes in yeast focuses on central carbon metabolism, on a number of selected signalling pathways and on cell cycle regulation. These models can explain questions of general relevance, such as whether the dynamics of a network can be understood from the combination of in vitro kinetics of its individual reactions. They help to elucidate complicated dynamic features, such as glycolytic oscillations, effects of feedback regulation or the optimal regulation of gene expression. The availability of comprehensive qualitative information, such as protein interactions or pathway composition, and sets of quantitative data make yeast a perfect model organism. Therefore, yeast-related data are often used to develop and examine computational approaches and modelling methods.
Collapse
Affiliation(s)
- Edda Klipp
- Max Planck Institute for Molecular Genetics, Computational Systems Biology, Ihnestrasse 63-73, 14195 Berlin, Germany.
| |
Collapse
|
39
|
Ulitsky I, Gat-Viks I, Shamir R. MetaReg: a platform for modeling, analysis and visualization of biological systems using large-scale experimental data. Genome Biol 2008; 9:R1. [PMID: 18171474 PMCID: PMC2395235 DOI: 10.1186/gb-2008-9-1-r1] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2007] [Revised: 09/28/2007] [Accepted: 01/02/2008] [Indexed: 12/31/2022] Open
Abstract
MetaReg http://acgt.cs.tau.ac.il/metareg/application.html is a computational tool that models cellular networks and integrates experimental results with such models. MetaReg represents established knowledge about a biological system, available today mostly in informal form in the literature, as probabilistic network models with underlying combinatorial regulatory logic. MetaReg enables contrasting predictions with measurements, model improvements and studying what-if scenarios. By summarizing prior knowledge and providing visual and computational aids, it helps the expert explore and understand her system better.
Collapse
Affiliation(s)
- Igor Ulitsky
- School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel
| | | | | |
Collapse
|
40
|
Abstract
Common human diseases like obesity and diabetes are driven by complex networks of genes and any number of environmental factors. To understand this complexity in hopes of identifying targets and developing drugs against disease, a systematic approach is required to elucidate the genetic and environmental factors and interactions among and between these factors, and to establish how these factors induce changes in gene networks that in turn lead to disease. The explosion of large-scale, high-throughput technologies in the biological sciences has enabled researchers to take a more systems biology approach to study complex traits like disease. Genotyping of hundreds of thousands of DNA markers and profiling tens of thousands of molecular phenotypes simultaneously in thousands of individuals is now possible, and this scale of data is making it possible for the first time to reconstruct whole gene networks associated with disease. In the following sections, we review different approaches for integrating genetic expression and clinical data to infer causal relationships among gene expression traits and between expression and disease traits. We further review methods to integrate these data in a more comprehensive manner to identify common pathways shared by the causal factors driving disease, including the reconstruction of association and probabilistic causal networks. Particular attention is paid to integrating diverse information to refine these types of networks so that they are more predictive. To highlight these different approaches in practice, we step through an example on how Insig2 was identified as a causal factor for plasma cholesterol levels in mice.
Collapse
|
41
|
Abstract
In this review we give an overview of computational and statistical methods to reconstruct cellular networks. Although this area of research is vast and fast developing, we show that most currently used methods can be organized by a few key concepts. The first part of the review deals with conditional independence models including Gaussian graphical models and Bayesian networks. The second part discusses probabilistic and graph-based methods for data from experimental interventions and perturbations.
Collapse
Affiliation(s)
- Florian Markowetz
- Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany
- Princeton University, Lewis-Sigler Institute for Integrative Genomics and Dept. of Computer Science, Princeton, NJ 08544, USA
| | - Rainer Spang
- Max Planck Institute for Molecular Genetics, Ihnestrasse 63-73, 14195 Berlin, Germany
- Present affiliation: University Regensburg, Institute of Functional Genomics, Josef-Engert-Str. 9, 93053 Regensburg, Germany
| |
Collapse
|
42
|
Gat-Viks I, Shamir R. Refinement and expansion of signaling pathways: the osmotic response network in yeast. Genome Res 2007; 17:358-67. [PMID: 17267811 PMCID: PMC1800927 DOI: 10.1101/gr.5750507] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The analysis of large-scale genome-wide experiments carries the promise of dramatically broadening our understanding on biological networks. The challenge of systematic integration of experimental results with established biological knowledge on a pathway is still unanswered. Here we present a methodology that attempts to answer this challenge when investigating signaling pathways. We formalize existing qualitative knowledge as a probabilistic model that depicts known interactions between molecules (genes, proteins, etc.) as a network and known regulatory relations as logics. We present algorithms that analyze experimental results (e.g., transcription profiles) vis-à-vis the model and propose improvements to the model based on the fit to the experimental data. These algorithms refine the relations between model components, as well as expand the model to include new components that are regulated by components of the original network. Using our methodology, we have modeled together the knowledge on four established signaling pathways related to osmotic shock response in Saccharomyces cerevisiae. Using over 100 published transcription profiles, our refinement methodology revealed three cross talks in the network. The expansion procedure identified with high confidence large groups of genes that are coregulated by transcription factors from the original network via a common logic. The results reveal a novel delicate repressive effect of the HOG pathway on many transcriptional target genes and suggest an unexpected alternative functional mode of the MAP kinase Hog1. These results demonstrate that, by integrated analysis of data and of well-defined knowledge, one can generate concrete biological hypotheses about signaling cascades and their downstream regulatory programs.
Collapse
Affiliation(s)
- Irit Gat-Viks
- School of Computer Science, Tel-Aviv University, Tel-Aviv 69978, Israel.
| | | |
Collapse
|