1
|
Kaya HE, Naidoo KJ. CytoCopasi: a chemical systems biology target and drug discovery visual data analytics platform. Bioinformatics 2023; 39:btad745. [PMID: 38070155 PMCID: PMC10963058 DOI: 10.1093/bioinformatics/btad745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 11/28/2023] [Accepted: 12/07/2023] [Indexed: 12/21/2023] Open
Abstract
MOTIVATION Target discovery and drug evaluation for diseases with complex mechanisms call for a streamlined chemical systems analysis platform. Currently available tools lack the emphasis on reaction kinetics, access to relevant databases, and algorithms to visualize perturbations on a chemical scale providing quantitative details as well streamlined visual data analytics functionality. RESULTS CytoCopasi, a Maven-based application for Cytoscape that combines the chemical systems analysis features of COPASI with the visualization and database access tools of Cytoscape and its plugin applications has been developed. The diverse functionality of CytoCopasi through ab initio model construction, model construction via pathway and parameter databases KEGG and BRENDA is presented. The comparative systems biology visualization analysis toolset is illustrated through a drug competence study on the cancerous RAF/MEK/ERK pathway. AVAILABILITY AND IMPLEMENTATION The COPASI files, simulation data, native libraries, and the manual are available on https://github.com/scientificomputing/CytoCopasi.
Collapse
Affiliation(s)
- Hikmet Emre Kaya
- Department of Chemistry, Scientific Computing Research Unit, PD Hahn Building, University of Cape Town, Rondebosch 7701, South Africa
| | - Kevin J Naidoo
- Department of Chemistry, Scientific Computing Research Unit, PD Hahn Building, University of Cape Town, Rondebosch 7701, South Africa
| |
Collapse
|
2
|
Burkhart JG, Wu G, Song X, Raimondi F, McWeeney S, Wong MH, Deng Y. Biology-inspired graph neural network encodes reactome and reveals biochemical reactions of disease. PATTERNS (NEW YORK, N.Y.) 2023; 4:100758. [PMID: 37521042 PMCID: PMC10382942 DOI: 10.1016/j.patter.2023.100758] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 11/20/2022] [Accepted: 05/01/2023] [Indexed: 08/01/2023]
Abstract
Functional heterogeneity of healthy human tissues complicates interpretation of molecular studies, impeding precision therapeutic target identification and treatment. Considering this, we generated a graph neural network with Reactome-based architecture and trained it using 9,115 samples from Genotype-Tissue Expression (GTEx). Our graph neural network (GNN) achieves adjusted Rand index (ARI) = 0.7909, while a Resnet18 control model achieves ARI = 0.7781, on 370 held-out healthy human tissue samples from The Cancer Genome Atlas (TCGA), despite the Resnet18 using over 600 times the parameters. Our GNN also succeeds in separating 83 healthy skin samples from 95 lesional psoriasis samples, revealing that upregulation of 26S- and NUB1-mediated degradation of NEDD8, UBD, and their conjugates is central to the largest perturbed reaction network component in psoriasis. We show that our results are not discoverable using traditional differential expression and hypergeometric pathway enrichment analyses yet are supported by separate human multi-omics and small-molecule mouse studies, suggesting future molecular disease studies may benefit from similar GNN analytical approaches.
Collapse
Affiliation(s)
- Joshua G. Burkhart
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, Honolulu, HI 96813, USA
| | - Guanming Wu
- Division of Bioinformatics and Computational Biology, Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, OR 97239, USA
| | - Xubo Song
- Department of Computer Science and Electrical Engineering, Oregon Health & Science University, Portland, OR 97239, USA
| | | | - Shannon McWeeney
- Division of Bioinformatics and Computational Biology, Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, OR 97239, USA
| | - Melissa H. Wong
- Department of Cell, Developmental, and Cancer Biology, Oregon Health & Science University, Portland, OR 97201, USA
| | - Youping Deng
- Department of Quantitative Health Sciences, University of Hawaii John A. Burns School of Medicine, Honolulu, HI 96813, USA
| |
Collapse
|
3
|
Woods EJ, Kannan D, Sharpe DJ, Swinburne TD, Wales DJ. Analysing ill-conditioned Markov chains. PHILOSOPHICAL TRANSACTIONS. SERIES A, MATHEMATICAL, PHYSICAL, AND ENGINEERING SCIENCES 2023; 381:20220245. [PMID: 37211032 DOI: 10.1098/rsta.2022.0245] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Accepted: 11/15/2022] [Indexed: 05/23/2023]
Abstract
Discrete state Markov chains in discrete or continuous time are widely used to model phenomena in the social, physical and life sciences. In many cases, the model can feature a large state space, with extreme differences between the fastest and slowest transition timescales. Analysis of such ill-conditioned models is often intractable with finite precision linear algebra techniques. In this contribution, we propose a solution to this problem, namely partial graph transformation, to iteratively eliminate and renormalize states, producing a low-rank Markov chain from an ill-conditioned initial model. We show that the error induced by this procedure can be minimized by retaining both the renormalized nodes that represent metastable superbasins, and those through which reactive pathways concentrate, i.e. the dividing surface in the discrete state space. This procedure typically returns a much lower rank model, where trajectories can be efficiently generated with kinetic path sampling. We apply this approach to an ill-conditioned Markov chain for a model multi-community system, measuring the accuracy by direct comparison with trajectories and transition statistics. This article is part of a discussion meeting issue 'Supercomputing simulations of advanced materials'.
Collapse
Affiliation(s)
- Esmae J Woods
- Cavendish Laboratory, Department of Physics, University of Cambridge, Cambridge CB3 0HE, UK
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK
| | - Deepti Kannan
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK
| | - Daniel J Sharpe
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK
| | - Thomas D Swinburne
- CNRS, CINaM UMR, Aix-Marseille Université, 7325, Campus de Luminy, 13288 Marseille, France
| | - David J Wales
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK
| |
Collapse
|
4
|
Hernandez BS, Lubenia PVN, Johnston MD, Kim JK. A framework for deriving analytic steady states of biochemical reaction networks. PLoS Comput Biol 2023; 19:e1011039. [PMID: 37053305 PMCID: PMC10129002 DOI: 10.1371/journal.pcbi.1011039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Revised: 04/25/2023] [Accepted: 03/20/2023] [Indexed: 04/15/2023] Open
Abstract
The long-term behaviors of biochemical systems are often described by their steady states. Deriving these states directly for complex networks arising from real-world applications, however, is often challenging. Recent work has consequently focused on network-based approaches. Specifically, biochemical reaction networks are transformed into weakly reversible and deficiency zero generalized networks, which allows the derivation of their analytic steady states. Identifying this transformation, however, can be challenging for large and complex networks. In this paper, we address this difficulty by breaking the complex network into smaller independent subnetworks and then transforming the subnetworks to derive the analytic steady states of each subnetwork. We show that stitching these solutions together leads to the the analytic steady states of the original network. To facilitate this process, we develop a user-friendly and publicly available package, COMPILES (COMPutIng anaLytic stEady States). With COMPILES, we can easily test the presence of bistability of a CRISPRi toggle switch model, which was previously investigated via tremendous number of numerical simulations and within a limited range of parameters. Furthermore, COMPILES can be used to identify absolute concentration robustness (ACR), the property of a system that maintains the concentration of particular species at a steady state regardless of any initial concentrations. Specifically, our approach completely identifies all the species with and without ACR in a complex insulin model. Our method provides an effective approach to analyzing and understanding complex biochemical systems.
Collapse
Affiliation(s)
- Bryan S Hernandez
- Biomedical Mathematics Group, Pioneer Research Center for Mathematical and Computational Sciences, Institute for Basic Science, Daejeon, Republic of Korea
- Institute of Mathematics, University of the Philippines Diliman, Quezon City, Philippines
| | - Patrick Vincent N Lubenia
- Systems and Computational Biology Research Unit, Center for Natural Sciences and Environmental Research, Manila, Philippines
| | - Matthew D Johnston
- Department of Mathematics and Computer Science, Lawrence Technological University, Southfield, Michigan, United States of America
| | - Jae Kyoung Kim
- Biomedical Mathematics Group, Pioneer Research Center for Mathematical and Computational Sciences, Institute for Basic Science, Daejeon, Republic of Korea
- Department of Mathematical Sciences, KAIST, Daejeon, Republic of Korea
| |
Collapse
|
5
|
Minimal frustration underlies the usefulness of incomplete regulatory network models in biology. Proc Natl Acad Sci U S A 2023; 120:e2216109120. [PMID: 36580597 PMCID: PMC9910462 DOI: 10.1073/pnas.2216109120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Regulatory networks as large and complex as those implicated in cell-fate choice are expected to exhibit intricate, very high-dimensional dynamics. Cell-fate choice, however, is a macroscopically simple process. Additionally, regulatory network models are almost always incomplete and/or inexact, and do not incorporate all the regulators and interactions that may be involved in cell-fate regulation. In spite of these issues, regulatory network models have proven to be incredibly effective tools for understanding cell-fate choice across contexts and for making useful predictions. Here, we show that minimal frustration-a feature of biological networks across contexts but not of random networks-can compel simple, low-dimensional steady-state behavior even in large and complex networks. Moreover, the steady-state behavior of minimally frustrated networks can be recapitulated by simpler networks such as those lacking many of the nodes and edges and those that treat multiple regulators as one. The present study provides a theoretical explanation for the success of network models in biology and for the challenges in network inference.
Collapse
|
6
|
Yu H, Blair RH. Integration of probabilistic regulatory networks into constraint-based models of metabolism with applications to Alzheimer's disease. BMC Bioinformatics 2019; 20:386. [PMID: 31291905 PMCID: PMC6617954 DOI: 10.1186/s12859-019-2872-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2019] [Accepted: 05/02/2019] [Indexed: 01/08/2023] Open
Abstract
Background Mathematical models of biological networks can provide important predictions and insights into complex disease. Constraint-based models of cellular metabolism and probabilistic models of gene regulatory networks are two distinct areas that have progressed rapidly in parallel over the past decade. In principle, gene regulatory networks and metabolic networks underly the same complex phenotypes and diseases. However, systematic integration of these two model systems remains a fundamental challenge. Results In this work, we address this challenge by fusing probabilistic models of gene regulatory networks into constraint-based models of metabolism. The novel approach utilizes probabilistic reasoning in BN models of regulatory networks serves as the “glue” that enables a natural interface between the two systems. Probabilistic reasoning is used to predict and quantify system-wide effects of perturbation to the regulatory network in the form of constraints for flux variability analysis. In this setting, both regulatory and metabolic networks inherently account for uncertainty. Applications leverage constraint-based metabolic models of brain metabolism and gene regulatory networks parameterized by gene expression data from the hippocampus to investigate the role of the HIF-1 pathway in Alzheimer’s disease. Integrated models support HIF-1A as effective target to reduce the effects of hypoxia in Alzheimer’s disease. However, HIF-1A activation is far less effective in shifting metabolism when compared to brain metabolism in healthy controls. Conclusions The direct integration of probabilistic regulatory networks into constraint-based models of metabolism provides novel insights into how perturbations in the regulatory network may influence metabolic states. Predictive modeling of enzymatic activity can be facilitated using probabilistic reasoning, thereby extending the predictive capacity of the network. This framework for model integration is generalizable to other systems. Electronic supplementary material The online version of this article (10.1186/s12859-019-2872-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Han Yu
- State University of New York at Buffalo, 3435 Main Street, Buffalo, 14214, US
| | | |
Collapse
|
7
|
Tian X, Pang W, Wang Y, Guo K, Zhou Y. LatinPSO: An algorithm for simultaneously inferring structure and parameters of ordinary differential equations models. Biosystems 2019; 182:8-16. [PMID: 31167112 DOI: 10.1016/j.biosystems.2019.05.006] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2018] [Revised: 05/01/2019] [Accepted: 05/14/2019] [Indexed: 10/26/2022]
Abstract
Simultaneously inferring both the structure and parameters of Ordinary Differential Equations (ODEs) for a complex dynamic system is more practical in many systems identification problems, but it remains challenging due to the complexity of the underlying search space. In this research, we propose a novel algorithm based on Particle Swarm Optimization (PSO) and Latin Hypercube Sampling (LHS) to address the above problem. The proposed algorithm is termed LatinPSO, and it can be effectively used for inferring the structure and parameters of ODE models through time course data. To start with, the real Human Immunodeficiency Virus (HIV) model and several synthetic models are used for evaluating the performance of LatinPSO. Experimental results demonstrated that LatinPSO could find satisfactory candidate ODE models with appropriate structure and parameters.
Collapse
Affiliation(s)
- Xinliang Tian
- College of Computer Science and Technology, Jilin University, Changchun 130012, China; Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, China
| | - Wei Pang
- School of Natural and Computing Sciences, University of Aberdeen, AB24 3UE, UK
| | - Yizhang Wang
- College of Computer Science and Technology, Jilin University, Changchun 130012, China; Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, China
| | - Kaimin Guo
- College of Computer Science and Technology, Jilin University, Changchun 130012, China; Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, China
| | - You Zhou
- College of Computer Science and Technology, Jilin University, Changchun 130012, China; Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, China.
| |
Collapse
|
8
|
Tsuchiya T, Fujii M, Matsuda N, Kunida K, Uda S, Kubota H, Konishi K, Kuroda S. System identification of signaling dependent gene expression with different time-scale data. PLoS Comput Biol 2017; 13:e1005913. [PMID: 29281625 PMCID: PMC5760096 DOI: 10.1371/journal.pcbi.1005913] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Revised: 01/09/2018] [Accepted: 12/01/2017] [Indexed: 01/11/2023] Open
Abstract
Cells decode information of signaling activation at a scale of tens of minutes by downstream gene expression with a scale of hours to days, leading to cell fate decisions such as cell differentiation. However, no system identification method with such different time scales exists. Here we used compressed sensing technology and developed a system identification method using data of different time scales by recovering signals of missing time points. We measured phosphorylation of ERK and CREB, immediate early gene expression products, and mRNAs of decoder genes for neurite elongation in PC12 cell differentiation and performed system identification, revealing the input-output relationships between signaling and gene expression with sensitivity such as graded or switch-like response and with time delay and gain, representing signal transfer efficiency. We predicted and validated the identified system using pharmacological perturbation. Thus, we provide a versatile method for system identification using data with different time scales.
Collapse
Affiliation(s)
- Takaho Tsuchiya
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Tokyo, Japan
| | - Masashi Fujii
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Tokyo, Japan
- Molecular Genetics Research Laboratory, Graduate School of Science, University of Tokyo, Tokyo, Japan
| | - Naoki Matsuda
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Tokyo, Japan
| | - Katsuyuki Kunida
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Tokyo, Japan
- Laboratory of Computational Biology, Graduate School of Biological Sciences, Nara Institute of Science and Technology, Ikoma, Japan
| | - Shinsuke Uda
- Division of Integrated Omics, Research Center for Transomics Medicine, Medical Institute of Bioregulation, Kyushu University, Fukuoka, Japan
| | - Hiroyuki Kubota
- Division of Integrated Omics, Research Center for Transomics Medicine, Medical Institute of Bioregulation, Kyushu University, Fukuoka, Japan
| | - Katsumi Konishi
- Department of Computer Science, Faculty of Informatics, Kogakuin University, Tokyo, Japan
| | - Shinya Kuroda
- Department of Biological Sciences, Graduate School of Science, University of Tokyo, Tokyo, Japan
- CREST, Japan Science and Technology Corporation, Tokyo, Japan
| |
Collapse
|
9
|
A Precision Medicine Approach to Improve Cancer Rehabilitation’s Impact and Integration with Cancer Care and Optimize Patient Wellness. CURRENT PHYSICAL MEDICINE AND REHABILITATION REPORTS 2017. [DOI: 10.1007/s40141-017-0145-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
|
10
|
Cellular metabolic network analysis: discovering important reactions in Treponema pallidum. BIOMED RESEARCH INTERNATIONAL 2015; 2015:328568. [PMID: 26495292 PMCID: PMC4606156 DOI: 10.1155/2015/328568] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/13/2015] [Revised: 05/19/2015] [Accepted: 05/30/2015] [Indexed: 11/26/2022]
Abstract
T. pallidum, the syphilis-causing pathogen, performs very differently in metabolism compared with other bacterial pathogens. The desire for safe and effective vaccine of syphilis requests identification of important steps in T. pallidum's metabolism. Here, we apply Flux Balance Analysis to represent the reactions quantitatively. Thus, it is possible to cluster all reactions in T. pallidum. By calculating minimal cut sets and analyzing topological structure for the metabolic network of T. pallidum, critical reactions are identified. As a comparison, we also apply the analytical approaches to the metabolic network of H. pylori to find coregulated drug targets and unique drug targets for different microorganisms. Based on the clustering results, all reactions are further classified into various roles. Therefore, the general picture of their metabolic network is obtained and two types of reactions, both of which are involved in nucleic acid metabolism, are found to be essential for T. pallidum. It is also discovered that both hubs of reactions and the isolated reactions in purine and pyrimidine metabolisms play important roles in T. pallidum. These reactions could be potential drug targets for treating syphilis.
Collapse
|
11
|
Güell O, Massucci FA, Font-Clos F, Sagués F, Serrano MÁ. Mapping high-growth phenotypes in the flux space of microbial metabolism. J R Soc Interface 2015; 12:0543. [PMID: 26289659 DOI: 10.1098/rsif.2015.0543] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Experimental and empirical observations on cell metabolism cannot be understood as a whole without their integration into a consistent systematic framework. However, the characterization of metabolic flux phenotypes is typically reduced to the study of a single optimal state, such as maximum biomass yield that is by far the most common assumption. Here, we confront optimal growth solutions to the whole set of feasible flux phenotypes (FFPs), which provides a benchmark to assess the likelihood of optimal and high-growth states and their agreement with experimental results. In addition, FFP maps are able to uncover metabolic behaviours, such as aerobic fermentation accompanying exponential growth on sugars at nutrient excess conditions, that are unreachable using standard models based on optimality principles. The information content of the full FFP space provides us with a map to explore and evaluate metabolic behaviour and capabilities, and so it opens new avenues for biotechnological and biomedical applications.
Collapse
Affiliation(s)
- Oriol Güell
- Departament de Química Física, Universitat de Barcelona, Martí i Franquès 1, 08028 Barcelona, Spain
| | | | - Francesc Font-Clos
- Centre de Recerca Matemàtica, Edifici C, Campus Bellaterra, Bellaterra, 08193 Barcelona, Spain Departament de Matemàtiques, Universitat Autònoma de Barcelona, Edifici C, Campus Bellaterra, Bellaterra, 08193 Barcelona, Spain
| | - Francesc Sagués
- Departament de Química Física, Universitat de Barcelona, Martí i Franquès 1, 08028 Barcelona, Spain
| | - M Ángeles Serrano
- Departament de Física, Fonamental, Universitat de Barcelona, Martí i Franquès 1, 08028 Barcelona, Spain
| |
Collapse
|
12
|
Ben Yahia B, Malphettes L, Heinzle E. Macroscopic modeling of mammalian cell growth and metabolism. Appl Microbiol Biotechnol 2015; 99:7009-24. [PMID: 26198881 PMCID: PMC4536272 DOI: 10.1007/s00253-015-6743-6] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2015] [Revised: 05/28/2015] [Accepted: 05/30/2015] [Indexed: 12/24/2022]
Abstract
We review major modeling strategies and methods to understand and simulate the macroscopic behavior of mammalian cells. These strategies comprise two important steps: the first step is to identify stoichiometric relationships for the cultured cells connecting the extracellular inputs and outputs. In a second step, macroscopic kinetic models are introduced. These relationships together with bioreactor and metabolite balances provide a complete description of a system in the form of a set of differential equations. These can be used for the simulation of cell culture performance and further for optimization of production.
Collapse
Affiliation(s)
- Bassem Ben Yahia
- />Biochemical Engineering Institute, Saarland University, Campus A1.5, D-66123 Saarbruecken, Germany
- />Upstream Process Sciences Biotech Sciences, UCB Pharma S.A., Avenue de l’Industrie, B-1420, Braine l’Alleud, Belgium
| | - Laetitia Malphettes
- />Upstream Process Sciences Biotech Sciences, UCB Pharma S.A., Avenue de l’Industrie, B-1420, Braine l’Alleud, Belgium
| | - Elmar Heinzle
- />Biochemical Engineering Institute, Saarland University, Campus A1.5, D-66123 Saarbruecken, Germany
| |
Collapse
|
13
|
Eddy JA, Funk CC, Price ND. Fostering synergy between cell biology and systems biology. Trends Cell Biol 2015; 25:440-5. [PMID: 26013981 DOI: 10.1016/j.tcb.2015.04.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2014] [Revised: 04/25/2015] [Accepted: 04/28/2015] [Indexed: 01/05/2023]
Abstract
In the shared pursuit of elucidating detailed mechanisms of cell function, systems biology presents a natural complement to ongoing efforts in cell biology. Systems biology aims to characterize biological systems through integrated and quantitative modeling of cellular information. The process of model building and analysis provides value through synthesizing and cataloging information about cells and molecules, predicting mechanisms and identifying generalizable themes, generating hypotheses and guiding experimental design, and highlighting knowledge gaps and refining understanding. In turn, incorporating domain expertise and experimental data is crucial for building towards whole cell models. An iterative cycle of interaction between cell and systems biologists advances the goals of both fields and establishes a framework for mechanistic understanding of the genome-to-phenome relationship.
Collapse
Affiliation(s)
- James A Eddy
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA 98109, USA
| | - Cory C Funk
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA 98109, USA
| | - Nathan D Price
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA 98109, USA.
| |
Collapse
|
14
|
ElKalaawy N, Wassal A. Methodologies for the modeling and simulation of biochemical networks, illustrated for signal transduction pathways: a primer. Biosystems 2015; 129:1-18. [PMID: 25637875 DOI: 10.1016/j.biosystems.2015.01.008] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2014] [Revised: 01/23/2015] [Accepted: 01/23/2015] [Indexed: 01/30/2023]
Abstract
Biochemical networks depict the chemical interactions that take place among elements of living cells. They aim to elucidate how cellular behavior and functional properties of the cell emerge from the relationships between its components, i.e. molecules. Biochemical networks are largely characterized by dynamic behavior, and exhibit high degrees of complexity. Hence, the interest in such networks is growing and they have been the target of several recent modeling efforts. Signal transduction pathways (STPs) constitute a class of biochemical networks that receive, process, and respond to stimuli from the environment, as well as stimuli that are internal to the organism. An STP consists of a chain of intracellular signaling processes that ultimately result in generating different cellular responses. This primer presents the methodologies used for the modeling and simulation of biochemical networks, illustrated for STPs. These methodologies range from qualitative to quantitative, and include structural as well as dynamic analysis techniques. We describe the different methodologies, outline their underlying assumptions, and provide an assessment of their advantages and disadvantages. Moreover, publicly and/or commercially available implementations of these methodologies are listed as appropriate. In particular, this primer aims to provide a clear introduction and comprehensive coverage of biochemical modeling and simulation methodologies for the non-expert, with specific focus on relevant literature of STPs.
Collapse
Affiliation(s)
- Nesma ElKalaawy
- Department of Computer Engineering, Faculty of Engineering, Cairo University, Giza 12613, Egypt.
| | - Amr Wassal
- Department of Computer Engineering, Faculty of Engineering, Cairo University, Giza 12613, Egypt.
| |
Collapse
|
15
|
Quintero A, Ramírez J, Leal LG, López-Kleine L. Algorithm for the Construction of a Global Enzymatic Network to be Used for Gene Network Reconstruction. Curr Genomics 2014; 15:400-7. [PMID: 25435802 PMCID: PMC4245699 DOI: 10.2174/1389202915666140807004909] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2014] [Revised: 07/28/2014] [Accepted: 08/05/2014] [Indexed: 11/22/2022] Open
Abstract
Relationships between genes are best represented using networks constructed from information of different types, with metabolic information being the most valuable and widely used for genetic network reconstruction. Other types of information are usually also available, and it would be desirable to systematically include them in algorithms for network reconstruction. Here, we present an algorithm to construct a global metabolic network that uses all available enzymatic and metabolic information about the organism. We construct a global enzymatic network (GEN) with a total of 4226 nodes (EC numbers) and 42723 edges representing all known metabolic reactions. As an example we use microarray data for Arabidopsis thaliana and combine it with the metabolic network constructing a final gene interaction network for this organism with 8212 nodes (genes) and 4606,901 edges. All scripts are available to be used for any organism for which genomic data is available.
Collapse
Affiliation(s)
- Andrés Quintero
- Universidad Nacional de Colombia, Department of Biology, Master Student, Universidad Nacional de Colombia-Sede Bogotá
| | - Jorge Ramírez
- Universidad Nacional de Colombia, School of Mathematics, Associate Professor, Universidad Nacional de Colombia-Sede Medellín
| | - Luis Guillermo Leal
- Universidad Nacional de Colombia, Department of Statistics, Master Student, Universidad Nacional de Colombia-Sede Bogotá
| | - Liliana López-Kleine
- Universidad Nacional de Colombia, Department of Statistics, Associate Professor, Universidad Nacional de Colombia-Sede Bogotá, Colombia
| |
Collapse
|
16
|
Dudek RM, Chuang Y, Leonard JN. Engineered cell-based therapies: a vanguard of design-driven medicine. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2014; 844:369-91. [PMID: 25480651 DOI: 10.1007/978-1-4939-2095-2_18] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Engineered cell-based therapies are uniquely capable of performing sophisticated therapeutic functions in vivo, and this strategy is yielding promising clinical benefits for treating cancer. In this review, we discuss key opportunities and challenges for engineering customized cellular functions using cell-based therapy for cancer as a representative case study. We examine the historical development of chimeric antigen receptor (CAR) therapies as an illustration of the engineering design cycle. We also consider the potential roles that the complementary disciplines of systems biology and synthetic biology may play in realizing safe and effective treatments for a broad range of patients and diseases. In particular, we discuss how systems biology may facilitate both fundamental research and clinical translation, and we describe how the emerging field of synthetic biology is providing novel modalities for building customized cellular functions to overcome existing clinical barriers. Together, these approaches provide a powerful set of conceptual and experimental tools for transforming information into understanding, and for translating understanding into novel therapeutics to establish a new framework for design-driven medicine.
Collapse
Affiliation(s)
- Rachel M Dudek
- Northwestern University, 2145 Sheridan Road, Technological Institute, Rm. E136, Evanston, IL, 60208-3120, USA,
| | | | | |
Collapse
|
17
|
Saito TH, Uda S, Tsuchiya T, Ozaki YI, Kuroda S. Temporal decoding of MAP kinase and CREB phosphorylation by selective immediate early gene expression. PLoS One 2013; 8:e57037. [PMID: 23469182 PMCID: PMC3587639 DOI: 10.1371/journal.pone.0057037] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2012] [Accepted: 01/16/2013] [Indexed: 12/23/2022] Open
Abstract
A wide range of growth factors encode information into specific temporal patterns of MAP kinase (MAPK) and CREB phosphorylation, which are further decoded by expression of immediate early gene products (IEGs) to exert biological functions. However, the IEG decoding system remain unknown. We built a data-driven based on time courses of MAPK and CREB phosphorylation and IEG expression in response to various growth factors to identify how signal is processed. We found that IEG expression uses common decoding systems regardless of growth factors and expression of each IEG differs in upstream dependency, switch-like response, and linear temporal filters. Pulsatile ERK phosphorylation was selectively decoded by expression of EGR1 rather than c-FOS. Conjunctive NGF and PACAP stimulation was selectively decoded by synergistic JUNB expression through switch-like response to c-FOS. Thus, specific temporal patterns and combinations of MAPKs and CREB phosphorylation can be decoded by selective IEG expression via distinct temporal filters and switch-like responses. The data-driven modeling is versatile for analysis of signal processing and does not require detailed prior knowledge of pathways.
Collapse
Affiliation(s)
- Takeshi H. Saito
- Department of Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Shinsuke Uda
- Department of Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Takaho Tsuchiya
- Department of Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Yu-ichi Ozaki
- Department of Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Shinya Kuroda
- Department of Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, Bunkyo-ku, Tokyo, Japan
- CREST, Japan Science and Technology Corporation, Bunkyo-ku, Tokyo, Japan
- * E-mail:
| |
Collapse
|
18
|
Global probabilistic annotation of metabolic networks enables enzyme discovery. Nat Chem Biol 2013; 8:848-54. [PMID: 22960854 PMCID: PMC3696893 DOI: 10.1038/nchembio.1063] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2012] [Accepted: 08/07/2012] [Indexed: 11/08/2022]
Abstract
Annotation of organism-specific metabolic networks is one of the main challenges of systems biology. Importantly, due to inherent uncertainty of computational annotations, predictions of biochemical function need to be treated probabilistically. We present a global probabilistic approach to annotate genome-scale metabolic networks that integrates sequence homology and context-based correlations under a single principled framework. The developed method for Global Biochemical reconstruction Using Sampling (GLOBUS) not only provides annotation probabilities for each functional assignment, but also suggests likely alternative functions. GLOBUS is based on statistical Gibbs sampling of probable metabolic annotations and is able to make accurate functional assignments even in cases of remote sequence identity to known enzymes. We apply GLOBUS to genomes of Bacillus subtilis and Staphylococcus aureus, and validate the method predictions by experimentally demonstrating the 6-phosphogluconolactonase activity of ykgB and the role of the sps pathway for rhamnose biosynthesis in B. subtilis.
Collapse
|
19
|
|
20
|
Bebek G, Koyutürk M, Price ND, Chance MR. Network biology methods integrating biological data for translational science. Brief Bioinform 2012; 13:446-59. [PMID: 22390873 PMCID: PMC3404396 DOI: 10.1093/bib/bbr075] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2011] [Revised: 11/29/2011] [Indexed: 12/29/2022] Open
Abstract
The explosion of biomedical data, both on the genomic and proteomic side as well as clinical data, will require complex integration and analysis to provide new molecular variables to better understand the molecular basis of phenotype. Currently, much data exist in silos and is not analyzed in frameworks where all data are brought to bear in the development of biomarkers and novel functional targets. This is beginning to change. Network biology approaches, which emphasize the interactions between genes, proteins and metabolites provide a framework for data integration such that genome, proteome, metabolome and other -omics data can be jointly analyzed to understand and predict disease phenotypes. In this review, recent advances in network biology approaches and results are identified. A common theme is the potential for network analysis to provide multiplexed and functionally connected biomarkers for analyzing the molecular basis of disease, thus changing our approaches to analyzing and modeling genome- and proteome-wide data.
Collapse
|
21
|
Pathway analysis of genomic data: concepts, methods, and prospects for future development. Trends Genet 2012; 28:323-32. [PMID: 22480918 DOI: 10.1016/j.tig.2012.03.004] [Citation(s) in RCA: 215] [Impact Index Per Article: 17.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2012] [Revised: 03/02/2012] [Accepted: 03/07/2012] [Indexed: 12/31/2022]
Abstract
Genome-wide data sets are increasingly being used to identify biological pathways and networks underlying complex diseases. In particular, analyzing genomic data through sets defined by functional pathways offers the potential of greater power for discovery and natural connections to biological mechanisms. With the burgeoning availability of next-generation sequencing, this is an opportune moment to revisit strategies for pathway-based analysis of genomic data. Here, we synthesize relevant concepts and extant methodologies to guide investigators in study design and execution. We also highlight ongoing challenges and proposed solutions. As relevant analytical strategies mature, pathways and networks will be ideally placed to integrate data from diverse -omics sources to harness the extensive, rich information related to disease and treatment mechanisms.
Collapse
|
22
|
Millon T. Classifying personality disorders: an evolution-based alternative to an evidence-based approach. J Pers Disord 2011; 25:279-304. [PMID: 21699392 DOI: 10.1521/pedi.2011.25.3.279] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
The study of personality disorders, no less psychology as a wole, remains divorced from broader spheres of scientific knowledge. Development of a conceptual schema for classifying personality disorders should include the examination of research limitations and inductive inconsistences that undermine the likely achievements of the evidential approach. An alternative course of action is outlined here, one that looks to evolutionary theory rather than evidence-based methods for classification guidance.
Collapse
Affiliation(s)
- Theodore Millon
- Institute for Advanced Studies in Personology and Psychopathology.
| |
Collapse
|
23
|
Perkins EJ, Chipman JK, Edwards S, Habib T, Falciani F, Taylor R, Van Aggelen G, Vulpe C, Antczak P, Loguinov A. Reverse engineering adverse outcome pathways. ENVIRONMENTAL TOXICOLOGY AND CHEMISTRY 2011; 30:22-38. [PMID: 20963852 DOI: 10.1002/etc.374] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
The toxicological effects of many stressors are mediated through unknown, or incompletely characterized, mechanisms of action. The application of reverse engineering complex interaction networks from high dimensional omics data (gene, protein, metabolic, signaling) can be used to overcome these limitations. This approach was used to characterize adverse outcome pathways (AOPs) for chemicals that disrupt the hypothalamus-pituitary-gonadal endocrine axis in fathead minnows (FHM, Pimephales promelas). Gene expression changes in FHM ovaries in response to seven different chemicals, over different times, doses, and in vivo versus in vitro conditions, were captured in a large data set of 868 arrays. Potential AOPs of the antiandrogen flutamide were examined using two mutual information-based methods to infer gene regulatory networks and potential AOPs. Representative networks from these studies were used to predict network paths from stressor to adverse outcome as candidate AOPs. The relationship of individual chemicals to an adverse outcome can be determined by following perturbations through the network in response to chemical treatment, thus leading to the nodes associated with the adverse outcome. Identification of candidate pathways allows for formation of testable hypotheses about key biological processes, biomarkers, or alternative endpoints that can be used to monitor an AOP. Finally, the unique challenges facing the application of this approach in ecotoxicology were identified and a road map for the utilization of these tools presented.
Collapse
Affiliation(s)
- Edward J Perkins
- U.S. Army Engineering Research and Development Center, Vicksburg, Mississippi, USA.
| | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Schrattenholz A, Groebe K, Soskic V. Systems biology approaches and tools for analysis of interactomes and multi-target drugs. Methods Mol Biol 2010; 662:29-58. [PMID: 20824465 DOI: 10.1007/978-1-60761-800-3_2] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Systems biology is essentially a proteomic and epigenetic exercise because the relatively condensed information of genomes unfolds on the level of proteins. The flexibility of cellular architectures is not only mediated by a dazzling number of proteinaceous species but moreover by the kinetics of their molecular changes: The time scales of posttranslational modifications range from milliseconds to years. The genetic framework of an organism only provides the blue print of protein embodiments which are constantly shaped by external input. Indeed, posttranslational modifications of proteins represent the scope and velocity of these inputs and fulfil the requirements of integration of external spatiotemporal signal transduction inside an organism. The optimization of biochemical networks for this type of information processing and storage results in chemically extremely fine tuned molecular entities. The huge dynamic range of concentrations, the chemical diversity and the necessity of synchronisation of complex protein expression patterns pose the major challenge of systemic analysis of biological models. One further message is that many of the key reactions in living systems are essentially based on interactions of moderate affinities and moderate selectivities. This principle is responsible for the enormous flexibility and redundancy of cellular circuitries. In complex disorders such as cancer or neurodegenerative diseases, which initially appear to be rooted in relatively subtle dysfunctions of multimodal physiologic pathways, drug discovery programs based on the concept of high affinity/high specificity compounds ("one-target, one-disease"), which has been dominating the pharmaceutical industry for a long time, increasingly turn out to be unsuccessful. Despite improvements in rational drug design and high throughput screening methods, the number of novel, single-target drugs fell much behind expectations during the past decade, and the treatment of "complex diseases" remains a most pressing medical need. Currently, a change of paradigm can be observed with regard to a new interest in agents that modulate multiple targets simultaneously, essentially "dirty drugs." Targeting cellular function as a system rather than on the level of the single target, significantly increases the size of the drugable proteome and is expected to introduce novel classes of multi-target drugs with fewer adverse effects and toxicity. Multiple target approaches have recently been used to design medications against atherosclerosis, cancer, depression, psychosis and neurodegenerative diseases. A focussed approach towards "systemic" drugs will certainly require the development of novel computational and mathematical concepts for appropriate modelling of complex data. But the key is the extraction of relevant molecular information from biological systems by implementing rigid statistical procedures to differential proteomic analytics.
Collapse
|
25
|
Ma S, Funk CC, Price ND. Systems approaches to molecular cancer diagnostics. DISCOVERY MEDICINE 2010; 10:531-542. [PMID: 21189224 PMCID: PMC3155470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]
Abstract
The search for improved molecular cancer diagnostics is a challenge for which systems approaches show great promise. As is becoming increasingly clear, cancer is a perpetually-evolving, highly multi-factorial disease. With next generation sequencing providing an ever-increasing amount of high-throughput data, the need for analytical tools that can provide meaningful context is critical. Systems approaches have demonstrated an ability to separate meaningful signal from noise that arises from population heterogeneity, heterogeneity within and across tumors, and multiple sources of technical variation when sufficient sample sizes are obtained and standardized measurement technologies are used. The ability to develop clinically useful molecular cancer diagnostics will be predicated on advancements on two major fronts: 1) more comprehensive and accurate measurements of multiple endpoints, and 2) more sophisticated analytical tools that synthesize high-throughput data into meaningful reflections of cellular states. To this end, systems approaches that have integrated transcriptomic data onto biomolecular networks have shown promise in their ability to classify tumor subtypes, predict clinical progression, and inform treatment options. Ultimately, the success of systems approaches will be measured by their ability to develop molecular cancer diagnostics through distilling complex, systems-wide information into actionable information in the clinic.
Collapse
|
26
|
Jiang Z, Liu X, Peng Z, Wan Y, Ji Y, He W, Wan W, Luo J, Guo H. AHD2.0: an update version of Arabidopsis Hormone Database for plant systematic studies. Nucleic Acids Res 2010; 39:D1123-9. [PMID: 21045062 PMCID: PMC3013673 DOI: 10.1093/nar/gkq1066] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Phytohormone studies enlightened our knowledge of plant responses to various changes. To provide a systematic and comprehensive view of genes participating in plant hormonal regulation, an online accessible database Arabidopsis Hormone Database (AHD) has been developed, which is a collection of hormone related genes of the model organism Arabidopsis thaliana (AHRGs). Recently we updated our database from AHD to a new version AHD2.0 by adding several pronounced features: (i) updating our collection of AHRGs based on most recent publications as well as constructing elaborate schematic diagrams of each hormone biosynthesis and signaling pathways; (ii) adding orthologs of sequenced plants listed in OrthoMCL-DB to each AHRG in the updated database; (iii) providing predicted miRNA splicing site(s) for each AHRG; (iv) integrating genes that genetically interact with each AHRG according to literatures mining; (v) providing links to a powerful online analysis platform WebLab for the convenience of in-time bioinformatics analysis and (vi) providing links to widely used protein databases and integrating more expression profiling information that would facilitate users for a more systematic and integrative analysis related to phytohormone research.
Collapse
Affiliation(s)
- Zhiqiang Jiang
- National Laboratory of Protein Engineering and Plant Genetic Engineering, College of Life Sciences, Peking University, Beijing 100871, PR China
| | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Probabilistic integrative modeling of genome-scale metabolic and regulatory networks in Escherichia coli and Mycobacterium tuberculosis. Proc Natl Acad Sci U S A 2010; 107:17845-50. [PMID: 20876091 DOI: 10.1073/pnas.1005139107] [Citation(s) in RCA: 276] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
Prediction of metabolic changes that result from genetic or environmental perturbations has several important applications, including diagnosing metabolic disorders and discovering novel drug targets. A cardinal challenge in obtaining accurate predictions is the integration of transcriptional regulatory networks with the corresponding metabolic network. We propose a method called probabilistic regulation of metabolism (PROM) that achieves this synthesis and enables straightforward, automated, and quantitative integration of high-throughput data into constraint-based modeling, making it an ideal tool for constructing genome-scale regulatory-metabolic network models for less-studied organisms. PROM introduces probabilities to represent gene states and gene-transcription factor interactions. By using PROM, we constructed an integrated regulatory-metabolic network for the model organism, Escherichia coli, and demonstrated that our method based on automated inference is more accurate and comprehensive than the current state of the art, which is based on manual curation of literature. After validating the approach, we used PROM to build a genome-scale integrated metabolic-regulatory model for Mycobacterium tuberculosis, a critically important human pathogen. This study incorporated data from more than 1,300 microarrays, 2,000 transcription factor-target interactions regulating 3,300 metabolic reactions, and 1,905 KO phenotypes for E. coli and M. tuberculosis. PROM identified KO phenotypes with accuracies as high as 95%, and predicted growth rates quantitatively with correlation of 0.95. Importantly, PROM represents the successful integration of a top-down reconstructed, statistically inferred regulatory network with a bottom-up reconstructed, biochemically detailed metabolic network, bridging two important classes of systems biology models that are rarely combined quantitatively.
Collapse
|
28
|
Edelman LB, Eddy JA, Price ND. In silico models of cancer. WILEY INTERDISCIPLINARY REVIEWS. SYSTEMS BIOLOGY AND MEDICINE 2010; 2:438-459. [PMID: 20836040 PMCID: PMC3157287 DOI: 10.1002/wsbm.75] [Citation(s) in RCA: 72] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Cancer is a complex disease that involves multiple types of biological interactions across diverse physical, temporal, and biological scales. This complexity presents substantial challenges for the characterization of cancer biology, and motivates the study of cancer in the context of molecular, cellular, and physiological systems. Computational models of cancer are being developed to aid both biological discovery and clinical medicine. The development of these in silico models is facilitated by rapidly advancing experimental and analytical tools that generate information-rich, high-throughput biological data. Statistical models of cancer at the genomic, transcriptomic, and pathway levels have proven effective in developing diagnostic and prognostic molecular signatures, as well as in identifying perturbed pathways. Statistically inferred network models can prove useful in settings where data overfitting can be avoided, and provide an important means for biological discovery. Mechanistically based signaling and metabolic models that apply a priori knowledge of biochemical processes derived from experiments can also be reconstructed where data are available, and can provide insight and predictive ability regarding the behavior of these systems. At longer length scales, continuum and agent-based models of the tumor microenvironment and other tissue-level interactions enable modeling of cancer cell populations and tumor progression. Even though cancer has been among the most-studied human diseases using systems approaches, significant challenges remain before the enormous potential of in silico cancer biology can be fully realized.
Collapse
Affiliation(s)
- Lucas B. Edelman
- Institute for Genomic Biology, Department of Bioengineering, University of Illinois, Urbana-Champaign
| | - James A. Eddy
- Institute for Genomic Biology, Department of Bioengineering, University of Illinois, Urbana-Champaign
| | - Nathan D. Price
- Department of Chemical and Biomolecular Engineering, Institute for Genomic Biology, Center for Biophysics and Computational Biology, University of Illinois, Urbana-Champaign
| |
Collapse
|
29
|
Segrè D, Marx CJ. Introduction to focus issue: genetic interactions. CHAOS (WOODBURY, N.Y.) 2010; 20:026101. [PMID: 20590330 PMCID: PMC2909308 DOI: 10.1063/1.3456057] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2010] [Indexed: 05/29/2023]
Abstract
The perturbation of a gene in an organism's genome often causes changes in the organism's observable properties or phenotypes. It is not obvious a priori whether the simultaneous perturbation of two genes produces a phenotypic change that is easily predictable from the changes caused by individual perturbations. In fact, this is often not the case: the nonlinearity and interdependence between genetic variants in determining phenotypes, also known as epistasis, is a prevalent phenomenon in biological systems. This focus issue presents recent developments in the study of epistasis and genetic interactions, emphasizing the broad implications of this phenomenon in evolutionary biology, functional genomics, and human diseases.
Collapse
Affiliation(s)
- Daniel Segrè
- Department of Biology, Boston University, Boston, Massachusetts 02215, USA
| | | |
Collapse
|
30
|
Raj A, Mueller SG, Young K, Laxer KD, Weiner M. Network-level analysis of cortical thickness of the epileptic brain. Neuroimage 2010; 52:1302-13. [PMID: 20553893 DOI: 10.1016/j.neuroimage.2010.05.045] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2010] [Accepted: 05/16/2010] [Indexed: 11/29/2022] Open
Abstract
Temporal lobe epilepsy (TLE) characterized by an epileptogenic focus in the medial temporal lobe is the most common form of focal epilepsy. However, the seizures are not confined to the temporal lobe but can spread to other, anatomically connected brain regions where they can cause similar structural abnormalities as observed in the focus. The aim of this study was to derive whole-brain networks from volumetric data and obtain network-centric measures, which can capture cortical thinning characteristic of TLE and can be used for classifying a given MRI into TLE or normal, and to obtain additional summary statistics that relate to the extent and spread of the disease. T1-weighted whole-brain images were acquired on a 4-T magnet in 13 patients with TLE with mesial temporal lobe sclerosis (TLE-MTS), 14 patients with TLE with normal MRI (TLE-no), and 30 controls. Mean cortical thickness and curvature measurements were obtained using the FreeSurfer software. These values were used to derive a graph, or network, for each subject. The nodes of the graph are brain regions, and edges represent disease progression paths. We show how to obtain summary statistics like mean, median, and variance defined for these networks and to perform exploratory analyses like correlation and classification. Our results indicate that the proposed network approach can improve accuracy of classifying subjects into two groups (control and TLE) from 78% for non-network classifiers to 93% using the proposed approach. We also obtain network "peakiness" values using statistical measures like entropy and complexity-this appears to be a good characterizer of the disease and may have utility in surgical planning.
Collapse
Affiliation(s)
- A Raj
- Department of Radiology, Weill Cornell Medical College, New York, NY 10021, USA.
| | | | | | | | | |
Collapse
|
31
|
Pu L, Brady S. Systems biology update: cell type-specific transcriptional regulatory networks. PLANT PHYSIOLOGY 2010; 152:411-9. [PMID: 19965967 PMCID: PMC2815853 DOI: 10.1104/pp.109.148668] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2009] [Accepted: 11/30/2009] [Indexed: 05/23/2023]
|
32
|
Daigle BJ, Srinivasan BS, Flannick JA, Novak AF, Batzoglou S. Current Progress in Static and Dynamic Modeling of Biological Networks. SYSTEMS BIOLOGY FOR SIGNALING NETWORKS 2010. [DOI: 10.1007/978-1-4419-5797-9_2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
|
33
|
Edelman LB, Eddy JA, Price ND. In silico models of cancer. WILEY INTERDISCIPLINARY REVIEWS. SYSTEMS BIOLOGY AND MEDICINE 2010; 2. [PMID: 20836040 PMCID: PMC3157287 DOI: 10.1002/wsbm.75 10.1002/wsbm.75] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Cancer is a complex disease that involves multiple types of biological interactions across diverse physical, temporal, and biological scales. This complexity presents substantial challenges for the characterization of cancer biology, and motivates the study of cancer in the context of molecular, cellular, and physiological systems. Computational models of cancer are being developed to aid both biological discovery and clinical medicine. The development of these in silico models is facilitated by rapidly advancing experimental and analytical tools that generate information-rich, high-throughput biological data. Statistical models of cancer at the genomic, transcriptomic, and pathway levels have proven effective in developing diagnostic and prognostic molecular signatures, as well as in identifying perturbed pathways. Statistically inferred network models can prove useful in settings where data overfitting can be avoided, and provide an important means for biological discovery. Mechanistically based signaling and metabolic models that apply a priori knowledge of biochemical processes derived from experiments can also be reconstructed where data are available, and can provide insight and predictive ability regarding the behavior of these systems. At longer length scales, continuum and agent-based models of the tumor microenvironment and other tissue-level interactions enable modeling of cancer cell populations and tumor progression. Even though cancer has been among the most-studied human diseases using systems approaches, significant challenges remain before the enormous potential of in silico cancer biology can be fully realized.
Collapse
Affiliation(s)
- Lucas B. Edelman
- Institute for Genomic Biology, Department of Bioengineering, University of Illinois, Urbana-Champaign
| | - James A. Eddy
- Institute for Genomic Biology, Department of Bioengineering, University of Illinois, Urbana-Champaign
| | - Nathan D. Price
- Department of Chemical and Biomolecular Engineering, Institute for Genomic Biology, Center for Biophysics and Computational Biology, University of Illinois, Urbana-Champaign
| |
Collapse
|
34
|
Vilela M, Vinga S, Maia MAGM, Voit EO, Almeida JS. Identification of neutral biochemical network models from time series data. BMC SYSTEMS BIOLOGY 2009. [PMID: 19416537 DOI: 10.1186/1752‐0509‐3‐47] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
BACKGROUND The major difficulty in modeling biological systems from multivariate time series is the identification of parameter sets that endow a model with dynamical behaviors sufficiently similar to the experimental data. Directly related to this parameter estimation issue is the task of identifying the structure and regulation of ill-characterized systems. Both tasks are simplified if the mathematical model is canonical, i.e., if it is constructed according to strict guidelines. RESULTS In this report, we propose a method for the identification of admissible parameter sets of canonical S-systems from biological time series. The method is based on a Monte Carlo process that is combined with an improved version of our previous parameter optimization algorithm. The method maps the parameter space into the network space, which characterizes the connectivity among components, by creating an ensemble of decoupled S-system models that imitate the dynamical behavior of the time series with sufficient accuracy. The concept of sloppiness is revisited in the context of these S-system models with an exploration not only of different parameter sets that produce similar dynamical behaviors but also different network topologies that yield dynamical similarity. CONCLUSION The proposed parameter estimation methodology was applied to actual time series data from the glycolytic pathway of the bacterium Lactococcus lactis and led to ensembles of models with different network topologies. In parallel, the parameter optimization algorithm was applied to the same dynamical data upon imposing a pre-specified network topology derived from prior biological knowledge, and the results from both strategies were compared. The results suggest that the proposed method may serve as a powerful exploration tool for testing hypotheses and the design of new experiments.
Collapse
Affiliation(s)
- Marco Vilela
- Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Apartado, Oeiras, Portugal.
| | | | | | | | | |
Collapse
|
35
|
Vilela M, Vinga S, Maia MAGM, Voit EO, Almeida JS. Identification of neutral biochemical network models from time series data. BMC SYSTEMS BIOLOGY 2009; 3:47. [PMID: 19416537 PMCID: PMC2694766 DOI: 10.1186/1752-0509-3-47] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/18/2009] [Accepted: 05/05/2009] [Indexed: 12/02/2022]
Abstract
Background The major difficulty in modeling biological systems from multivariate time series is the identification of parameter sets that endow a model with dynamical behaviors sufficiently similar to the experimental data. Directly related to this parameter estimation issue is the task of identifying the structure and regulation of ill-characterized systems. Both tasks are simplified if the mathematical model is canonical, i.e., if it is constructed according to strict guidelines. Results In this report, we propose a method for the identification of admissible parameter sets of canonical S-systems from biological time series. The method is based on a Monte Carlo process that is combined with an improved version of our previous parameter optimization algorithm. The method maps the parameter space into the network space, which characterizes the connectivity among components, by creating an ensemble of decoupled S-system models that imitate the dynamical behavior of the time series with sufficient accuracy. The concept of sloppiness is revisited in the context of these S-system models with an exploration not only of different parameter sets that produce similar dynamical behaviors but also different network topologies that yield dynamical similarity. Conclusion The proposed parameter estimation methodology was applied to actual time series data from the glycolytic pathway of the bacterium Lactococcus lactis and led to ensembles of models with different network topologies. In parallel, the parameter optimization algorithm was applied to the same dynamical data upon imposing a pre-specified network topology derived from prior biological knowledge, and the results from both strategies were compared. The results suggest that the proposed method may serve as a powerful exploration tool for testing hypotheses and the design of new experiments.
Collapse
Affiliation(s)
- Marco Vilela
- Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Apartado, Oeiras, Portugal.
| | | | | | | | | |
Collapse
|
36
|
The role of predictive modelling in rationally re-engineering biological systems. Nat Rev Microbiol 2009; 7:297-305. [PMID: 19252506 DOI: 10.1038/nrmicro2107] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Technologies to synthesize and transplant a complete genome into a cell have opened limitless potential to redesign organisms for complex, specialized tasks. However, large-scale re-engineering of a biological circuit will require systems-level optimization that will come from a deep understanding of operational relationships among all the constituent parts of a cell. The integrated framework necessary for conducting such complex bioengineering requires the convergence of systems and synthetic biology. Here, we review the status of these rapidly developing interdisciplinary fields of biology and provide a perspective on plausible venues for their merger.
Collapse
|
37
|
Danchin A. Bacteria as computers making computers. FEMS Microbiol Rev 2009; 33:3-26. [PMID: 19016882 PMCID: PMC2704931 DOI: 10.1111/j.1574-6976.2008.00137.x] [Citation(s) in RCA: 102] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2008] [Revised: 09/20/2008] [Accepted: 09/21/2008] [Indexed: 12/13/2022] Open
Abstract
Various efforts to integrate biological knowledge into networks of interactions have produced a lively microbial systems biology. Putting molecular biology and computer sciences in perspective, we review another trend in systems biology, in which recursivity and information replace the usual concepts of differential equations, feedback and feedforward loops and the like. Noting that the processes of gene expression separate the genome from the cell machinery, we analyse the role of the separation between machine and program in computers. However, computers do not make computers. For cells to make cells requires a specific organization of the genetic program, which we investigate using available knowledge. Microbial genomes are organized into a paleome (the name emphasizes the role of the corresponding functions from the time of the origin of life), comprising a constructor and a replicator, and a cenome (emphasizing community-relevant genes), made up of genes that permit life in a particular context. The cell duplication process supposes rejuvenation of the machine and replication of the program. The paleome also possesses genes that enable information to accumulate in a ratchet-like process down the generations. The systems biology must include the dynamics of information creation in its future developments.
Collapse
Affiliation(s)
- Antoine Danchin
- Génétique des Génomes Bactériens, Institut Pasteur, Paris, France.
| |
Collapse
|
38
|
Kimura T, Noguchi Y, Shikata N, Takahashi M. Plasma amino acid analysis for diagnosis and amino acid-based metabolic networks. Curr Opin Clin Nutr Metab Care 2009; 12:49-53. [PMID: 19057187 DOI: 10.1097/mco.0b013e3283169242] [Citation(s) in RCA: 47] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
PURPOSE OF REVIEW To highlight the usefulness of amino acid profiling in clinical diagnosis and current developments in analysis revealing underlying metabolic relationships. RECENT FINDINGS Recent innovations in metabolomics and systems biology enable high throughput measurement of diverse amino acids and the subsequent data mining for various uses. Recent studies show new possibilities of using plasma amino acid analysis as biomarker discovery tools by generating diagnostic indices through systematic computation. Such studies show that amino acid-based clinical diagnostic indices for hepatic fibrosis in type C hepatitis patients can be generated. In addition, several studies show the potential of treating amino acid profile data as a metabolomic subset, which can be integrated through the analysis of correlation with different types of 'omics' data for describing metabolite-to-metabolite or metabolite-to-gene interaction networks. CONCLUSION Amino acid profiling of biological samples could be used to generate indices that could be used for clinical diagnosis and is a useful tool for understanding metabolic implications under various physiological conditions. Although further improvements in analytical methods are needed, amino acids could be useful indicators for facilitating nutritional management of specific physiological and pathological states.
Collapse
Affiliation(s)
- Takeshi Kimura
- Quality Assurance and External Scientific Affairs Department, Ajinomoto Co., Inc, Tokyo, Japan.
| | | | | | | |
Collapse
|
39
|
|
40
|
Dzeroski S, Todorovski L. Equation discovery for systems biology: finding the structure and dynamics of biological networks from time course data. Curr Opin Biotechnol 2008; 19:360-8. [PMID: 18672061 DOI: 10.1016/j.copbio.2008.07.002] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2008] [Revised: 06/30/2008] [Accepted: 07/01/2008] [Indexed: 10/21/2022]
Abstract
Reconstructing biological networks, such as metabolic and signaling networks, is at the heart of systems biology. Although many approaches exist for reconstructing network structure, few approaches recover the full dynamic behavior of a network. We survey such approaches that originate from computational scientific discovery, a subfield of machine learning. These take as input measured time course data, as well as existing domain knowledge, such as partial knowledge of the network structure. We demonstrate the use of these approaches on illustrative tasks of finding the complete dynamics of biological networks, which include examples of rediscovering known networks and their dynamics, as well as examples of proposing models for unknown networks.
Collapse
Affiliation(s)
- Saso Dzeroski
- Department of Knowledge Technologies, Jozef Stefan Institute, Jamova 39, SI-1000 Ljubljana, Slovenia.
| | | |
Collapse
|
41
|
Abstract
Complex gene regulatory networks are composed of genes, noncoding RNAs, proteins, metabolites, and signaling components. The availability of genome-wide mutagenesis libraries; large-scale transcriptome, proteome, and metabalome data sets; and new high-throughput methods that uncover protein interactions underscores the need for mathematical modeling techniques that better enable scientists to synthesize these large amounts of information and to understand the properties of these biological systems. Systems biology approaches can allow researchers to move beyond a reductionist approach and to both integrate and comprehend the interactions of multiple components within these systems. Descriptive and mathematical models for gene regulatory networks can reveal emergent properties of these plant systems. This review highlights methods that researchers are using to obtain large-scale data sets, and examples of gene regulatory networks modeled with these data. Emergent properties revealed by the use of these network models and perspectives on the future of systems biology are discussed.
Collapse
Affiliation(s)
- Terri A. Long
- Department of Biology, Duke University, Durham, North Carolina 27708
- IGSP Center for Systems Biology, Duke University, Durham, North Carolina 27708
| | - Siobhan M. Brady
- Department of Biology, Duke University, Durham, North Carolina 27708
- IGSP Center for Systems Biology, Duke University, Durham, North Carolina 27708
| | - Philip N. Benfey
- Department of Biology, Duke University, Durham, North Carolina 27708
- IGSP Center for Systems Biology, Duke University, Durham, North Carolina 27708
| |
Collapse
|
42
|
Abstract
The identification, purification and characterization of cancer stem cells (CSCs) holds tremendous promise for improving the treatment of cancer. Mounting evidence is demonstrating that only certain tumour cells (i.e. the CSCs) can give rise to tumours when injected and that these purified cell populations generate heterogeneous tumours. While the cell of origin is still not determined definitively, specific molecular markers for populations containing these CSCs have been found for leukaemia, brain cancer and breast cancer, among others. Systems approaches, particularly molecular profiling, have proven to be of great utility for cancer diagnosis and characterization. These approaches also hold significant promise for identifying distinctive properties of the CSCs, and progress is already being made.
Collapse
|