1
|
Vahlensieck C, Thiel CS, Mosimann M, Bradley T, Caldana F, Polzer J, Lauber BA, Ullrich O. Transcriptional Response in Human Jurkat T Lymphocytes to a near Physiological Hypergravity Environment and to One Common in Routine Cell Culture Protocols. Int J Mol Sci 2023; 24:ijms24021351. [PMID: 36674869 PMCID: PMC9863927 DOI: 10.3390/ijms24021351] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Revised: 01/06/2023] [Accepted: 01/06/2023] [Indexed: 01/13/2023] Open
Abstract
Cellular effects of hypergravity have been described in many studies. We investigated the transcriptional dynamics in Jurkat T cells between 20 s and 60 min of 9 g hypergravity and characterized a highly dynamic biphasic time course of gene expression response with a transition point between rapid adaptation and long-term response at approximately 7 min. Upregulated genes were shifted towards the center of the nuclei, whereby downregulated genes were shifted towards the periphery. Upregulated gene expression was mostly located on chromosomes 16-22. Protein-coding transcripts formed the majority with more than 90% of all differentially expressed genes and followed a continuous trend of downregulation, whereas retained introns demonstrated a biphasic time-course. The gene expression pattern of hypergravity response was not comparable with other stress factors such as oxidative stress, heat shock or inflammation. Furthermore, we tested a routine centrifugation protocol that is widely used to harvest cells for subsequent RNA analysis and detected a huge impact on the transcriptome compared to non-centrifuged samples, which did not return to baseline within 15 min. Thus, we recommend carefully studying the response of any cell types used for any experiments regarding the hypergravity time and levels applied during cell culture procedures and analysis.
Collapse
Affiliation(s)
- Christian Vahlensieck
- Institute of Anatomy, Faculty of Medicine, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| | - Cora Sandra Thiel
- Institute of Anatomy, Faculty of Medicine, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
- Department of Machine Design, Engineering Design and Product Development, Institute of Mechanical Engineering, Otto-von-Guericke-University Magdeburg, Universitätsplatz 2, 39106 Magdeburg, Germany
- Space Life Sciences Laboratory (SLSL), Kennedy Space Center, 505 Odyssey Way, Exploration Park, FL 32953, USA
- UZH Space Hub, Air Force Center, Air Base Dübendorf, Überlandstrasse 270, 8600 Dübendorf, Switzerland
- Correspondence: (C.S.T.); (O.U.)
| | - Meret Mosimann
- Institute of Anatomy, Faculty of Medicine, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| | - Timothy Bradley
- Institute of Anatomy, Faculty of Medicine, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| | - Fabienne Caldana
- Institute of Anatomy, Faculty of Medicine, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| | - Jennifer Polzer
- Institute of Anatomy, Faculty of Medicine, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| | - Beatrice Astrid Lauber
- Institute of Anatomy, Faculty of Medicine, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
| | - Oliver Ullrich
- Institute of Anatomy, Faculty of Medicine, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
- Department of Machine Design, Engineering Design and Product Development, Institute of Mechanical Engineering, Otto-von-Guericke-University Magdeburg, Universitätsplatz 2, 39106 Magdeburg, Germany
- Space Life Sciences Laboratory (SLSL), Kennedy Space Center, 505 Odyssey Way, Exploration Park, FL 32953, USA
- UZH Space Hub, Air Force Center, Air Base Dübendorf, Überlandstrasse 270, 8600 Dübendorf, Switzerland
- Ernst-Abbe-Hochschule (EAH) Jena, Department of Industrial Engineering, Carl-Zeiss-Promenade 2, 07745 Jena, Germany
- Zurich Center for Integrative Human Physiology (ZIHP), University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland
- Correspondence: (C.S.T.); (O.U.)
| |
Collapse
|
2
|
Weber I, Oehrn CR. NoLiTiA: An Open-Source Toolbox for Non-linear Time Series Analysis. Front Neuroinform 2022; 16:876012. [PMID: 35811996 PMCID: PMC9263366 DOI: 10.3389/fninf.2022.876012] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Accepted: 05/23/2022] [Indexed: 11/13/2022] Open
Abstract
In many scientific fields including neuroscience, climatology or physics, complex relationships can be described most parsimoniously by non-linear mechanics. Despite their relevance, many neuroscientists still apply linear estimates in order to evaluate complex interactions. This is partially due to the lack of a comprehensive compilation of non-linear methods. Available packages mostly specialize in only one aspect of non-linear time-series analysis and most often require some coding proficiency to use. Here, we introduce NoLiTiA, a free open-source MATLAB toolbox for non-linear time series analysis. In comparison to other currently available non-linear packages, NoLiTiA offers (1) an implementation of a broad range of classic and recently developed methods, (2) an implementation of newly proposed spatially and time-resolved recurrence amplitude analysis and (3) an intuitive environment accessible even to users with little coding experience due to a graphical user interface and batch-editor. The core methodology derives from three distinct fields of complex systems theory, including dynamical systems theory, recurrence quantification analysis and information theory. Besides established methodology including estimation of dynamic invariants like Lyapunov exponents and entropy-based measures, such as active information storage, we include recent developments of quantifying time-resolved aperiodic oscillations. In general, the toolbox will make non-linear methods accessible to the broad neuroscientific community engaged in time series processing.
Collapse
Affiliation(s)
- Immo Weber
- Department of Neurology, Philipps University of Marburg, Marburg, Germany
| | - Carina R. Oehrn
- Department of Neurology, Philipps University of Marburg, Marburg, Germany
- Center for Mind, Brain and Behavior, Philipps University of Marburg, Marburg, Germany
| |
Collapse
|
3
|
|
4
|
Huang H, Handel A, Song X. A Bayesian approach to estimate parameters of ordinary differential equation. Comput Stat 2020. [DOI: 10.1007/s00180-020-00962-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
5
|
Dony L, He F, Stumpf MPH. Parametric and non-parametric gradient matching for network inference: a comparison. BMC Bioinformatics 2019; 20:52. [PMID: 30683048 PMCID: PMC6346534 DOI: 10.1186/s12859-018-2590-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Accepted: 12/21/2018] [Indexed: 11/24/2022] Open
Abstract
Background Reverse engineering of gene regulatory networks from time series gene-expression data is a challenging problem, not only because of the vast sets of candidate interactions but also due to the stochastic nature of gene expression. We limit our analysis to nonlinear differential equation based inference methods. In order to avoid the computational cost of large-scale simulations, a two-step Gaussian process interpolation based gradient matching approach has been proposed to solve differential equations approximately. Results We apply a gradient matching inference approach to a large number of candidate models, including parametric differential equations or their corresponding non-parametric representations, we evaluate the network inference performance under various settings for different inference objectives. We use model averaging, based on the Bayesian Information Criterion (BIC), to combine the different inferences. The performance of different inference approaches is evaluated using area under the precision-recall curves. Conclusions We found that parametric methods can provide comparable, and often improved inference compared to non-parametric methods; the latter, however, require no kinetic information and are computationally more efficient. Electronic supplementary material The online version of this article (10.1186/s12859-018-2590-7) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Leander Dony
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London, London, SW7 2AZ, UK.,Institute of Computational Biology, Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, 85764, Germany.,Max Planck Institute of Psychiatry, Kraepelinstr. 2-10, Munich, 80804, Germany
| | - Fei He
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London, London, SW7 2AZ, UK.,School of Computing, Electronics, and Mathematics, Coventry University, Coventry, CV1 2JH, UK
| | - Michael P H Stumpf
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London, London, SW7 2AZ, UK. .,Melbourne Integrative Genomics, School of BioScience & School of Mathematics and Statistics, University of Melbourne, Parkville Melbourne, 3010, Australia.
| |
Collapse
|
6
|
Chen CK. Inference of gene networks from gene expression time series using recurrent neural networks and sparse MAP estimation. J Bioinform Comput Biol 2018; 16:1850009. [PMID: 30051742 DOI: 10.1142/s0219720018500099] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
BACKGROUND The inference of genetic regulatory networks (GRNs) provides insight into the cellular responses to signals. A class of recurrent neural networks (RNNs) capturing the dynamics of GRN has been used as a basis for inferring small-scale GRNs from gene expression time series. The Bayesian framework facilitates incorporating the hypothesis of GRN into the model estimation to improve the accuracy of GRN inference. RESULTS We present new methods for inferring small-scale GRNs based on RNNs. The weights of wires of RNN represent the strengths of gene-to-gene regulatory interactions. We use a class of automatic relevance determination (ARD) priors to enforce the sparsity in the maximum a posteriori (MAP) estimates of wire weights of RNN. A particle swarm optimization (PSO) is integrated as an optimization engine into the MAP estimation process. Likely networks of genes generated based on estimated wire weights are combined using the majority rule to determine a final estimated GRN. As an alternative, a class of <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:math> -norm ( <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>q</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:math> ) priors is used for attaining the sparse MAP estimates of wire weights of RNN. We also infer the GRN using the maximum likelihood (ML) estimates of wire weights of RNN. The RNN-based GRN inference algorithms, ARD-RNN, <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:math> -RNN, and ML-RNN are tested on simulated and experimental E. coli and yeast time series containing 6-11 genes and 7-19 data points. Published GRN inference algorithms based on regressions and mutual information networks are performed on the benchmark datasets to compare performances. CONCLUSION ARD and <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:math> -norm priors are used for the estimation of wire weights of RNN. Results of GRN inference experiments show that ARD-RNN, <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:math> -RNN have similar best accuracies on the simulated time series. The ARD-RNN is more accurate than <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mrow><mml:mi>L</mml:mi></mml:mrow><mml:mrow><mml:mi>q</mml:mi></mml:mrow></mml:msub></mml:math> -RNN, ML-RNN, and mostly more accurate than the reference algorithms on the experimental time series. The effectiveness of ARD-RNN for inferring small-scale GRNs using gene expression time series of limited length is empirically verified.
Collapse
Affiliation(s)
- Chi-Kan Chen
- Department of Applied Mathematics, National Chung Hsing University, Taiwan
| |
Collapse
|
7
|
Alim MA, Ay A, Hasan MM, Thai MT, Kahveci T. Construction of Signaling Pathways with RNAi Data and Multiple Reference Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1079-1091. [PMID: 30102599 DOI: 10.1109/tcbb.2017.2710129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Signaling networks are involved in almost all major diseases such as cancer. As a result of this, understanding how signaling networks function is vital for finding new treatments for many diseases. Using gene knockdown assays such as RNA interference (RNAi) technology, many genes involved in these networks can be identified. However, determining the interactions between these genes in the signaling networks using only experimental techniques is very challenging, as performing extensive experiments is very expensive and sometimes, even impractical. Construction of signaling networks from RNAi data using computational techniques have been proposed as an alternative way to solve this challenging problem. However, the earlier approaches are either not scalable to large scale networks, or their accuracy levels are not satisfactory. In this study, we integrate RNAi data given on a target network with multiple reference signaling networks and phylogenetic trees to construct the topology of the target signaling network. In our work, the network construction is considered as finding the minimum number of edit operations on given multiple reference networks, in which their contributions are weighted by their phylogenetic distances to the target network. The edit operations on the reference networks lead to a target network that satisfies the RNAi knockdown observations. Here, we propose two new reference-based signaling network construction methods that provide optimal results and scale well to large-scale signaling networks of hundreds of components. We compare the performance of these approaches to the state-of-the-art reference-based network construction method SiNeC on synthetic, semi-synthetic, and real datasets. Our analyses show that the proposed methods outperform SiNeC method in terms of accuracy. Furthermore, we show that our methods function well even if evolutionarily distant reference networks are used. Application of our methods to the Apoptosis and Wnt signaling pathways recovers the known protein-protein interactions and suggests additional relevant interactions that can be tested experimentally.
Collapse
|
8
|
Reconstructing Genetic Regulatory Networks Using Two-Step Algorithms with the Differential Equation Models of Neural Networks. Interdiscip Sci 2017; 10:823-835. [PMID: 28748400 DOI: 10.1007/s12539-017-0254-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2016] [Revised: 07/01/2017] [Accepted: 07/14/2017] [Indexed: 10/19/2022]
Abstract
BACKGROUND The identification of genetic regulatory networks (GRNs) provides insights into complex cellular processes. A class of recurrent neural networks (RNNs) captures the dynamics of GRN. Algorithms combining the RNN and machine learning schemes were proposed to reconstruct small-scale GRNs using gene expression time series. RESULTS We present new GRN reconstruction methods with neural networks. The RNN is extended to a class of recurrent multilayer perceptrons (RMLPs) with latent nodes. Our methods contain two steps: the edge rank assignment step and the network construction step. The former assigns ranks to all possible edges by a recursive procedure based on the estimated weights of wires of RNN/RMLP (RERNN/RERMLP), and the latter constructs a network consisting of top-ranked edges under which the optimized RNN simulates the gene expression time series. The particle swarm optimization (PSO) is applied to optimize the parameters of RNNs and RMLPs in a two-step algorithm. The proposed RERNN-RNN and RERMLP-RNN algorithms are tested on synthetic and experimental gene expression time series of small GRNs of about 10 genes. The experimental time series are from the studies of yeast cell cycle regulated genes and E. coli DNA repair genes. CONCLUSION The unstable estimation of RNN using experimental time series having limited data points can lead to fairly arbitrary predicted GRNs. Our methods incorporate RNN and RMLP into a two-step structure learning procedure. Results show that the RERMLP using the RMLP with a suitable number of latent nodes to reduce the parameter dimension often result in more accurate edge ranks than the RERNN using the regularized RNN on short simulated time series. Combining by a weighted majority voting rule the networks derived by the RERMLP-RNN using different numbers of latent nodes in step one to infer the GRN, the method performs consistently and outperforms published algorithms for GRN reconstruction on most benchmark time series. The framework of two-step algorithms can potentially incorporate with different nonlinear differential equation models to reconstruct the GRN.
Collapse
|
9
|
Xi JY, Ouyang Q. Using Sub-Network Combinations to Scale Up an Enumeration Method for Determining the Network Structures of Biological Functions. PLoS One 2016; 11:e0168214. [PMID: 27992476 PMCID: PMC5161363 DOI: 10.1371/journal.pone.0168214] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2016] [Accepted: 10/12/2016] [Indexed: 11/18/2022] Open
Abstract
Deduction of biological regulatory networks from their functions is one of the focus areas of systems biology. Among the different techniques used in this reverse-engineering task, one powerful method is to enumerate all candidate network structures to find suitable ones. However, this method is severely limited by calculation capability: due to the brute-force approach, it is infeasible for networks with large number of nodes to be studied using traditional enumeration method because of the combinatorial explosion. In this study, we propose a new reverse-engineering technique based on the enumerating method: sub-network combinations. First, a complex biological function is divided into several sub-functions. Next, the three-node-network enumerating method is applied to search for sub-networks that are able to realize each of the sub-functions. Finally, complex whole networks are constructed by enumerating all possible combinations of sub-networks. The optimal ones are selected and analyzed. To demonstrate the effectiveness of this new method, we used it to deduct the network structures of a Pavlovian-like function. The whole Pavlovian-like network was successfully constructed by combining robust sub-networks, and the results were analyzed. With sub-network combination, the complexity has been largely reduced. Our method also provides a functional modular view of biological systems.
Collapse
Affiliation(s)
- J. Y. Xi
- Center for Quantitative Biology and Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
| | - Q. Ouyang
- Center for Quantitative Biology and Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- State Key Laboratory for Artificial Microstructures and Mesoscopic Physics, School of Physics, Peking University, Beijing, China
- * E-mail:
| |
Collapse
|
10
|
McGoff KA, Guo X, Deckard A, Kelliher CM, Leman AR, Francey LJ, Hogenesch JB, Haase SB, Harer JL. The Local Edge Machine: inference of dynamic models of gene regulation. Genome Biol 2016; 17:214. [PMID: 27760556 PMCID: PMC5072315 DOI: 10.1186/s13059-016-1076-z] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2016] [Accepted: 10/03/2016] [Indexed: 12/31/2022] Open
Abstract
We present a novel approach, the Local Edge Machine, for the inference of regulatory interactions directly from time-series gene expression data. We demonstrate its performance, robustness, and scalability on in silico datasets with varying behaviors, sizes, and degrees of complexity. Moreover, we demonstrate its ability to incorporate biological prior information and make informative predictions on a well-characterized in vivo system using data from budding yeast that have been synchronized in the cell cycle. Finally, we use an atlas of transcription data in a mammalian circadian system to illustrate how the method can be used for discovery in the context of large complex networks.
Collapse
Affiliation(s)
- Kevin A McGoff
- Department of Mathematics and Statistics, UNC Charlotte, 9201 University City Blvd., Charlotte, 28269, NC, USA.
| | - Xin Guo
- Department of Applied Mathematics, The Hong Kong Polytechnic University, Hong Kong, China
| | | | | | - Adam R Leman
- Department of Biology, Duke University, Durham, NC, USA
| | - Lauren J Francey
- Department of Molecular and Cellular Physiology, University of Cincinnati, Cincinnati, OH, USA
| | - John B Hogenesch
- Department of Molecular and Cellular Physiology, University of Cincinnati, Cincinnati, OH, USA
| | | | - John L Harer
- Department of Mathematics, Duke University, Durham, NC, USA
| |
Collapse
|
11
|
Ren Y, Wang Q, Hasan MM, Ay A, Kahveci T. Identifying the topology of signaling networks from partial RNAi data. BMC SYSTEMS BIOLOGY 2016; 10 Suppl 2:53. [PMID: 27490106 PMCID: PMC4977480 DOI: 10.1186/s12918-016-0301-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Background Methods for inferring signaling networks using single gene knockdown RNAi experiments and reference networks have been proposed in recent years. These methods assume that RNAi information is available for all the genes in the signal transduction pathway, i.e., complete. This assumption does not always hold up since RNAi experiments are often incomplete and information for some genes is missing. Results In this article, we develop two methods to construct signaling networks from incomplete RNAi data with the help of a reference network. These methods infer the RNAi constraints for the missing genes such that the inferred network is closest to the reference network. We perform extensive experiments with both real and synthetic networks and demonstrate that these methods produce accurate results efficiently. Conclusions Application of our methods to Wnt signal transduction pathway has shown that our methods can be used to construct highly accurate signaling networks from experimental data in less than 100 ms. The two methods that produce accurate results efficiently show great promise of constructing real signaling networks.
Collapse
Affiliation(s)
- Yuanfang Ren
- Department of Computer & Information Science & Engineering, University of Florida, Gainesville, 32611, FL, USA.
| | - Qiyao Wang
- Department of Computer & Information Science & Engineering, University of Florida, Gainesville, 32611, FL, USA
| | - Md Mahmudul Hasan
- Department of Computer & Information Science & Engineering, University of Florida, Gainesville, 32611, FL, USA
| | - Ahmet Ay
- Department of Biology & Mathematics, Colgate University, Hamilton, 13346, NY, USA
| | - Tamer Kahveci
- Department of Computer & Information Science & Engineering, University of Florida, Gainesville, 32611, FL, USA
| |
Collapse
|
12
|
Berrones A, Jiménez E, Alcorta-García MA, Almaguer FJ, Peña B. Parameter inference of general nonlinear dynamical models of gene regulatory networks from small and noisy time series. Neurocomputing 2016. [DOI: 10.1016/j.neucom.2015.10.095] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|
13
|
Kesić S. Systems biology, emergence and antireductionism. Saudi J Biol Sci 2015; 23:584-91. [PMID: 27579007 PMCID: PMC4992115 DOI: 10.1016/j.sjbs.2015.06.015] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2015] [Revised: 06/15/2015] [Accepted: 06/19/2015] [Indexed: 12/16/2022] Open
Abstract
This study explores the conceptual history of systems biology and its impact on philosophical and scientific conceptions of reductionism, antireductionism and emergence. Development of systems biology at the beginning of 21st century transformed biological science. Systems biology is a new holistic approach or strategy how to research biological organisms, developed through three phases. The first phase was completed when molecular biology transformed into systems molecular biology. Prior to the second phase, convergence between applied general systems theory and nonlinear dynamics took place, hence allowing the formation of systems mathematical biology. The second phase happened when systems molecular biology and systems mathematical biology, together, were applied for analysis of biological data. Finally, after successful application in science, medicine and biotechnology, the process of the formation of modern systems biology was completed. Systems and molecular reductionist views on organisms were completely opposed to each other. Implications of systems and molecular biology on reductionist–antireductionist debate were quite different. The analysis of reductionism, antireductionism and emergence issues, in the era of systems biology, revealed the hierarchy between methodological, epistemological and ontological antireductionism. Primarily, methodological antireductionism followed from the systems biology. Only after, epistemological and ontological antireductionism could be supported.
Collapse
Affiliation(s)
- Srdjan Kesić
- Department of Neurophysiology, Institute for Biological Research "Siniša Stanković", University of Belgrade, Despot Stefan Blvd. 142, 11060 Belgrade, Serbia
| |
Collapse
|
14
|
Lim N, d’Alché-Buc F, Auliac C, Michailidis G. Operator-valued kernel-based vector autoregressive models for network inference. Mach Learn 2014. [DOI: 10.1007/s10994-014-5479-3] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
15
|
Abstract
Mathematical models of natural systems are abstractions of much more complicated processes. Developing informative and realistic models of such systems typically involves suitable statistical inference methods, domain expertise, and a modicum of luck. Except for cases where physical principles provide sufficient guidance, it will also be generally possible to come up with a large number of potential models that are compatible with a given natural system and any finite amount of data generated from experiments on that system. Here we develop a computational framework to systematically evaluate potentially vast sets of candidate differential equation models in light of experimental and prior knowledge about biological systems. This topological sensitivity analysis enables us to evaluate quantitatively the dependence of model inferences and predictions on the assumed model structures. Failure to consider the impact of structural uncertainty introduces biases into the analysis and potentially gives rise to misleading conclusions.
Collapse
|
16
|
Kiani NA, Kaderali L. Dynamic probabilistic threshold networks to infer signaling pathways from time-course perturbation data. BMC Bioinformatics 2014; 15:250. [PMID: 25047753 PMCID: PMC4133630 DOI: 10.1186/1471-2105-15-250] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2013] [Accepted: 07/15/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Network inference deals with the reconstruction of molecular networks from experimental data. Given N molecular species, the challenge is to find the underlying network. Due to data limitations, this typically is an ill-posed problem, and requires the integration of prior biological knowledge or strong regularization. We here focus on the situation when time-resolved measurements of a system's response after systematic perturbations are available. RESULTS We present a novel method to infer signaling networks from time-course perturbation data. We utilize dynamic Bayesian networks with probabilistic Boolean threshold functions to describe protein activation. The model posterior distribution is analyzed using evolutionary MCMC sampling and subsequent clustering, resulting in probability distributions over alternative networks. We evaluate our method on simulated data, and study its performance with respect to data set size and levels of noise. We then use our method to study EGF-mediated signaling in the ERBB pathway. CONCLUSIONS Dynamic Probabilistic Threshold Networks is a new method to infer signaling networks from time-series perturbation data. It exploits the dynamic response of a system after external perturbation for network reconstruction. On simulated data, we show that the approach outperforms current state of the art methods. On the ERBB data, our approach recovers a significant fraction of the known interactions, and predicts novel mechanisms in the ERBB pathway.
Collapse
Affiliation(s)
- Narsis A Kiani
- Technische Universität Dresden, Medical Faculty Carl Gustav Carus, Institute for Medical Informatics and Biometry, Fetscherstr, 74, 01307 Dresden, Germany.
| | | |
Collapse
|
17
|
Wang YXR, Huang H. Review on statistical methods for gene network reconstruction using expression data. J Theor Biol 2014; 362:53-61. [PMID: 24726980 DOI: 10.1016/j.jtbi.2014.03.040] [Citation(s) in RCA: 97] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2014] [Revised: 03/29/2014] [Accepted: 03/31/2014] [Indexed: 12/16/2022]
Abstract
Network modeling has proven to be a fundamental tool in analyzing the inner workings of a cell. It has revolutionized our understanding of biological processes and made significant contributions to the discovery of disease biomarkers. Much effort has been devoted to reconstruct various types of biochemical networks using functional genomic datasets generated by high-throughput technologies. This paper discusses statistical methods used to reconstruct gene regulatory networks using gene expression data. In particular, we highlight progress made and challenges yet to be met in the problems involved in estimating gene interactions, inferring causality and modeling temporal changes of regulation behaviors. As rapid advances in technologies have made available diverse, large-scale genomic data, we also survey methods of incorporating all these additional data to achieve better, more accurate inference of gene networks.
Collapse
Affiliation(s)
- Y X Rachel Wang
- Department of Statistics, University of California, Berkeley, CA 94720, USA.
| | - Haiyan Huang
- Department of Statistics, University of California, Berkeley, CA 94720, USA.
| |
Collapse
|
18
|
Molinelli EJ, Korkut A, Wang W, Miller ML, Gauthier NP, Jing X, Kaushik P, He Q, Mills G, Solit DB, Pratilas CA, Weigt M, Braunstein A, Pagnani A, Zecchina R, Sander C. Perturbation biology: inferring signaling networks in cellular systems. PLoS Comput Biol 2013; 9:e1003290. [PMID: 24367245 PMCID: PMC3868523 DOI: 10.1371/journal.pcbi.1003290] [Citation(s) in RCA: 91] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2012] [Accepted: 08/26/2013] [Indexed: 12/16/2022] Open
Abstract
We present a powerful experimental-computational technology for inferring network models that predict the response of cells to perturbations, and that may be useful in the design of combinatorial therapy against cancer. The experiments are systematic series of perturbations of cancer cell lines by targeted drugs, singly or in combination. The response to perturbation is quantified in terms of relative changes in the measured levels of proteins, phospho-proteins and cellular phenotypes such as viability. Computational network models are derived de novo, i.e., without prior knowledge of signaling pathways, and are based on simple non-linear differential equations. The prohibitively large solution space of all possible network models is explored efficiently using a probabilistic algorithm, Belief Propagation (BP), which is three orders of magnitude faster than standard Monte Carlo methods. Explicit executable models are derived for a set of perturbation experiments in SKMEL-133 melanoma cell lines, which are resistant to the therapeutically important inhibitor of RAF kinase. The resulting network models reproduce and extend known pathway biology. They empower potential discoveries of new molecular interactions and predict efficacious novel drug perturbations, such as the inhibition of PLK1, which is verified experimentally. This technology is suitable for application to larger systems in diverse areas of molecular biology. Drugs that target specific effects of signaling proteins are promising agents for treating cancer. One of the many obstacles facing optimal drug design is inadequate quantitative understanding of the coordinated interactions between signaling proteins. De novo model inference of network or pathway models refers to the algorithmic construction of mathematical predictive models from experimental data without dependence on prior knowledge. De novo inference is difficult because of the prohibitively large number of possible sets of interactions that may or may not be consistent with observations. Our new method overcomes this difficulty by adapting a method from statistical physics, called Belief Propagation, which first calculates probabilistically the most likely interactions in the vast space of all possible solutions, then derives a set of individual, highly probable solutions in the form of executable models. In this paper, we test this method on artificial data and then apply it to model signaling pathways in a BRAF-mutant melanoma cancer cell line based on a large set of rich output measurements from a systematic set of perturbation experiments using drug combinations. Our results are in agreement with established biological knowledge, predict novel interactions, and predict efficacious drug targets that are specific to the experimental cell line and potentially to related tumors. The method has the potential, with sufficient systematic perturbation data, to model, de novo and quantitatively, the effects of hundreds of proteins on cellular responses, on a scale that is currently unreachable in diverse areas of cell biology. In a disease context, the method is applicable to the computational design of novel combination drug treatments.
Collapse
Affiliation(s)
- Evan J. Molinelli
- Computational Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America
- Tri-Institutional Program for Computational Biology and Medicine, Weill Cornell Medical College, New York, New York, United States of America
| | - Anil Korkut
- Computational Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America
| | - Weiqing Wang
- Computational Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America
| | - Martin L. Miller
- Computational Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America
| | - Nicholas P. Gauthier
- Computational Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America
| | - Xiaohong Jing
- Computational Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America
| | - Poorvi Kaushik
- Computational Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America
- Tri-Institutional Program for Computational Biology and Medicine, Weill Cornell Medical College, New York, New York, United States of America
| | - Qin He
- Computational Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America
| | - Gordon Mills
- Department of Systems Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas, United States of America
| | - David B. Solit
- Program in Molecular Pharmacology, Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America
- Human Oncology and Pathogenesis Program, Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America
| | - Christine A. Pratilas
- Program in Molecular Pharmacology, Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America
- Department of Pediatrics, Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America
| | - Martin Weigt
- Laboratoire de Génomique des Microorganismes, Université Pierre et Marie Curie, Paris, France
| | - Alfredo Braunstein
- Politecnico di Torino and Human Genetics Foundation, HuGeF, Torino, Italy
| | - Andrea Pagnani
- Politecnico di Torino and Human Genetics Foundation, HuGeF, Torino, Italy
| | - Riccardo Zecchina
- Politecnico di Torino and Human Genetics Foundation, HuGeF, Torino, Italy
| | - Chris Sander
- Computational Biology Program, Memorial Sloan-Kettering Cancer Center, New York, New York, United States of America
- * E-mail:
| |
Collapse
|
19
|
Jia B, Wang X. Gene regulatory network inference by point-based Gaussian approximation filters incorporating the prior information. EURASIP JOURNAL ON BIOINFORMATICS & SYSTEMS BIOLOGY 2013; 2013:16. [PMID: 24341668 PMCID: PMC3977693 DOI: 10.1186/1687-4153-2013-16] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/25/2013] [Accepted: 11/11/2013] [Indexed: 11/17/2022]
Abstract
The extended Kalman filter (EKF) has been applied to inferring gene regulatory
networks. However, it is well known that the EKF becomes less accurate when the
system exhibits high nonlinearity. In addition, certain prior information about
the gene regulatory network exists in practice, and no systematic approach has
been developed to incorporate such prior information into the Kalman-type filter
for inferring the structure of the gene regulatory network. In this paper, an
inference framework based on point-based Gaussian approximation filters that can
exploit the prior information is developed to solve the gene regulatory network
inference problem. Different point-based Gaussian approximation filters, including
the unscented Kalman filter (UKF), the third-degree cubature Kalman filter
(CKF3), and the fifth-degree cubature Kalman filter
(CKF5) are employed. Several types of network prior information,
including the existing network structure information, sparsity assumption, and the
range constraint of parameters, are considered, and the corresponding filters
incorporating the prior information are developed. Experiments on a synthetic
network of eight genes and the yeast protein synthesis network of five genes are
carried out to demonstrate the performance of the proposed framework. The results
show that the proposed methods provide more accurate inference results than
existing methods, such as the EKF and the traditional UKF.
Collapse
Affiliation(s)
| | - Xiaodong Wang
- Department of Electrical Engineering, Columbia University, New York, NY 10027, USA.
| |
Collapse
|
20
|
Knapp B, Kaderali L. Reconstruction of cellular signal transduction networks using perturbation assays and linear programming. PLoS One 2013; 8:e69220. [PMID: 23935958 PMCID: PMC3728289 DOI: 10.1371/journal.pone.0069220] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2012] [Accepted: 06/06/2013] [Indexed: 12/23/2022] Open
Abstract
Perturbation experiments for example using RNA interference (RNAi) offer an attractive way to elucidate gene function in a high throughput fashion. The placement of hit genes in their functional context and the inference of underlying networks from such data, however, are challenging tasks. One of the problems in network inference is the exponential number of possible network topologies for a given number of genes. Here, we introduce a novel mathematical approach to address this question. We formulate network inference as a linear optimization problem, which can be solved efficiently even for large-scale systems. We use simulated data to evaluate our approach, and show improved performance in particular on larger networks over state-of-the art methods. We achieve increased sensitivity and specificity, as well as a significant reduction in computing time. Furthermore, we show superior performance on noisy data. We then apply our approach to study the intracellular signaling of human primary nave CD4+ T-cells, as well as ErbB signaling in trastuzumab resistant breast cancer cells. In both cases, our approach recovers known interactions and points to additional relevant processes. In ErbB signaling, our results predict an important role of negative and positive feedback in controlling the cell cycle progression.
Collapse
Affiliation(s)
- Bettina Knapp
- Institute for Medical Informatics and Biometry, Medical Faculty Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
- ViroQuant Research Group Modeling, BioQuant, Heidelberg University, Heidelberg, Germany
| | - Lars Kaderali
- Institute for Medical Informatics and Biometry, Medical Faculty Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
- ViroQuant Research Group Modeling, BioQuant, Heidelberg University, Heidelberg, Germany
- * E-mail:
| |
Collapse
|
21
|
Böck M, Ogishima S, Tanaka H, Kramer S, Kaderali L. Hub-centered gene network reconstruction using automatic relevance determination. PLoS One 2012; 7:e35077. [PMID: 22570688 PMCID: PMC3343044 DOI: 10.1371/journal.pone.0035077] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2011] [Accepted: 03/11/2012] [Indexed: 11/17/2022] Open
Abstract
Network inference deals with the reconstruction of biological networks from experimental data. A variety of different reverse engineering techniques are available; they differ in the underlying assumptions and mathematical models used. One common problem for all approaches stems from the complexity of the task, due to the combinatorial explosion of different network topologies for increasing network size. To handle this problem, constraints are frequently used, for example on the node degree, number of edges, or constraints on regulation functions between network components. We propose to exploit topological considerations in the inference of gene regulatory networks. Such systems are often controlled by a small number of hub genes, while most other genes have only limited influence on the network's dynamic. We model gene regulation using a Bayesian network with discrete, Boolean nodes. A hierarchical prior is employed to identify hub genes. The first layer of the prior is used to regularize weights on edges emanating from one specific node. A second prior on hyperparameters controls the magnitude of the former regularization for different nodes. The net effect is that central nodes tend to form in reconstructed networks. Network reconstruction is then performed by maximization of or sampling from the posterior distribution. We evaluate our approach on simulated and real experimental data, indicating that we can reconstruct main regulatory interactions from the data. We furthermore compare our approach to other state-of-the art methods, showing superior performance in identifying hubs. Using a large publicly available dataset of over 800 cell cycle regulated genes, we are able to identify several main hub genes. Our method may thus provide a valuable tool to identify interesting candidate genes for further study. Furthermore, the approach presented may stimulate further developments in regularization methods for network reconstruction from data.
Collapse
Affiliation(s)
- Matthias Böck
- ViroQuant Research Group Modeling, University of Heidelberg, BioQuant BQ26, Heidelberg, Germany
- Institute of Informatics I12, Technische Universität München, Garching, Germany
| | - Soichi Ogishima
- Department of Bioinformatics, Tokyo Medical and Dental University, Bunkyo-ku, Tokyo, Japan
| | - Hiroshi Tanaka
- Department of Bioinformatics, Tokyo Medical and Dental University, Bunkyo-ku, Tokyo, Japan
| | - Stefan Kramer
- Institute of Informatics I12, Technische Universität München, Garching, Germany
- Institute of Informatics, Johannes Gutenberg-Universität Mainz, Mainz, Germany
| | - Lars Kaderali
- ViroQuant Research Group Modeling, University of Heidelberg, BioQuant BQ26, Heidelberg, Germany
- Institute of Medical Informatics and Biometry, Technische Universität Dresden, Dresden, Germany
| |
Collapse
|
22
|
Poultney CS, Greenfield A, Bonneau R. Integrated inference and analysis of regulatory networks from multi-level measurements. Methods Cell Biol 2012; 110:19-56. [PMID: 22482944 PMCID: PMC5615108 DOI: 10.1016/b978-0-12-388403-9.00002-3] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Regulatory and signaling networks coordinate the enormously complex interactions and processes that control cellular processes (such as metabolism and cell division), coordinate response to the environment, and carry out multiple cell decisions (such as development and quorum sensing). Regulatory network inference is the process of inferring these networks, traditionally from microarray data but increasingly incorporating other measurement types such as proteomics, ChIP-seq, metabolomics, and mass cytometry. We discuss existing techniques for network inference. We review in detail our pipeline, which consists of an initial biclustering step, designed to estimate co-regulated groups; a network inference step, designed to select and parameterize likely regulatory models for the control of the co-regulated groups from the biclustering step; and a visualization and analysis step, designed to find and communicate key features of the network. Learning biological networks from even the most complete data sets is challenging; we argue that integrating new data types into the inference pipeline produces networks of increased accuracy, validity, and biological relevance.
Collapse
Affiliation(s)
- Christopher S Poultney
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, NY, USA
| | | | | |
Collapse
|
23
|
Herman D, Thomas CM, Stekel DJ. Global transcription regulation of RK2 plasmids: a case study in the combined use of dynamical mathematical models and statistical inference for integration of experimental data and hypothesis exploration. BMC SYSTEMS BIOLOGY 2011; 5:119. [PMID: 21801369 PMCID: PMC3199767 DOI: 10.1186/1752-0509-5-119] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/16/2011] [Accepted: 07/29/2011] [Indexed: 11/10/2022]
Abstract
Background IncP-1 plasmids are broad host range plasmids that have been found in clinical and environmental bacteria. They often carry genes for antibiotic resistance or catabolic pathways. The archetypal IncP-1 plasmid RK2 is a well-characterized biological system, with a fully sequenced and annotated genome and wide range of experimental measurements. Its central control operon, encoding two global regulators KorA and KorB, is a natural example of a negatively self-regulated operon. To increase our understanding of the regulation of this operon, we have constructed a dynamical mathematical model using Ordinary Differential Equations, and employed a Bayesian inference scheme, Markov Chain Monte Carlo (MCMC) using the Metropolis-Hastings algorithm, as a way of integrating experimental measurements and a priori knowledge. We also compared MCMC and Metabolic Control Analysis (MCA) as approaches for determining the sensitivity of model parameters. Results We identified two distinct sets of parameter values, with different biological interpretations, that fit and explain the experimental data. This allowed us to highlight the proportion of repressor protein as dimers as a key experimental measurement defining the dynamics of the system. Analysis of joint posterior distributions led to the identification of correlations between parameters for protein synthesis and partial repression by KorA or KorB dimers, indicating the necessary use of joint posteriors for correct parameter estimation. Using MCA, we demonstrated that the system is highly sensitive to the growth rate but insensitive to repressor monomerization rates in their selected value regions; the latter outcome was also confirmed by MCMC. Finally, by examining a series of different model refinements for partial repression by KorA or KorB dimers alone, we showed that a model including partial repression by KorA and KorB was most compatible with existing experimental data. Conclusions We have demonstrated that the combination of dynamical mathematical models with Bayesian inference is valuable in integrating diverse experimental data and identifying key determinants and parameters for the IncP-1 central control operon. Moreover, we have shown that Bayesian inference and MCA are complementary methods for identification of sensitive parameters. We propose that this demonstrates generic value in applying this combination of approaches to systems biology dynamical modelling.
Collapse
Affiliation(s)
- Dorota Herman
- Center for Systems Biology, School of Biosciences, University of Birmingham, Edgbaston, Birmingham B15 2TT, UK.
| | | | | |
Collapse
|
24
|
Jenkinson G, Zhong X, Goutsias J. Thermodynamically consistent Bayesian analysis of closed biochemical reaction systems. BMC Bioinformatics 2010; 11:547. [PMID: 21054868 PMCID: PMC3248051 DOI: 10.1186/1471-2105-11-547] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2010] [Accepted: 11/05/2010] [Indexed: 12/04/2022] Open
Abstract
Background Estimating the rate constants of a biochemical reaction system with known stoichiometry from noisy time series measurements of molecular concentrations is an important step for building predictive models of cellular function. Inference techniques currently available in the literature may produce rate constant values that defy necessary constraints imposed by the fundamental laws of thermodynamics. As a result, these techniques may lead to biochemical reaction systems whose concentration dynamics could not possibly occur in nature. Therefore, development of a thermodynamically consistent approach for estimating the rate constants of a biochemical reaction system is highly desirable. Results We introduce a Bayesian analysis approach for computing thermodynamically consistent estimates of the rate constants of a closed biochemical reaction system with known stoichiometry given experimental data. Our method employs an appropriately designed prior probability density function that effectively integrates fundamental biophysical and thermodynamic knowledge into the inference problem. Moreover, it takes into account experimental strategies for collecting informative observations of molecular concentrations through perturbations. The proposed method employs a maximization-expectation-maximization algorithm that provides thermodynamically feasible estimates of the rate constant values and computes appropriate measures of estimation accuracy. We demonstrate various aspects of the proposed method on synthetic data obtained by simulating a subset of a well-known model of the EGF/ERK signaling pathway, and examine its robustness under conditions that violate key assumptions. Software, coded in MATLAB®, which implements all Bayesian analysis techniques discussed in this paper, is available free of charge at http://www.cis.jhu.edu/~goutsias/CSS%20lab/software.html. Conclusions Our approach provides an attractive statistical methodology for estimating thermodynamically feasible values for the rate constants of a biochemical reaction system from noisy time series observations of molecular concentrations obtained through perturbations. The proposed technique is theoretically sound and computationally feasible, but restricted to quantitative data obtained from closed biochemical reaction systems. This necessitates development of similar techniques for estimating the rate constants of open biochemical reaction systems, which are more realistic models of cellular function.
Collapse
Affiliation(s)
- Garrett Jenkinson
- Whitaker Biomedical Engineering Institute, The Johns Hopkins University, Baltimore, MD 21218, USA
| | | | | |
Collapse
|
25
|
Wang J, Jia M, Zhu L, Yuan Z, Li P, Chang C, Luo J, Liu M, Shi T. Systematical detection of significant genes in microarray data by incorporating gene interaction relationship in biological systems. PLoS One 2010; 5:e13721. [PMID: 21060778 PMCID: PMC2966410 DOI: 10.1371/journal.pone.0013721] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2010] [Accepted: 10/05/2010] [Indexed: 02/02/2023] Open
Abstract
Many methods, including parametric, nonparametric, and Bayesian methods, have been used for detecting differentially expressed genes based on the assumption that biological systems are linear, which ignores the nonlinear characteristics of most biological systems. More importantly, those methods do not simultaneously consider means, variances, and high moments, resulting in relatively high false positive rate. To overcome the limitations, the SWang test is proposed to determine differentially expressed genes according to the equality of distributions between case and control. Our method not only latently incorporates functional relationships among genes to consider nonlinear biological system but also considers the mean, variance, skewness, and kurtosis of expression profiles simultaneously. To illustrate biological significance of high moments, we construct a nonlinear gene interaction model, demonstrating that skewness and kurtosis could contain useful information of function association among genes in microarrays. Simulations and real microarray results show that false positive rate of SWang is lower than currently popular methods (T-test, F-test, SAM, and Fold-change) with much higher statistical power. Additionally, SWang can uniquely detect significant genes in real microarray data with imperceptible differential expression but higher variety in kurtosis and skewness. Those identified genes were confirmed with previous published literature or RT-PCR experiments performed in our lab.
Collapse
Affiliation(s)
- Junwei Wang
- The Center for Bioinformatics and Computational Biology, The Institute of Biomedical Sciences, The School of Life Sciences, East China Normal University, Shanghai, China
| | - Meiwen Jia
- The Center for Bioinformatics and Computational Biology, The Institute of Biomedical Sciences, The School of Life Sciences, East China Normal University, Shanghai, China
| | - Liping Zhu
- The College of Financial and Statistics, East China Normal University, Shanghai, China
| | - Zengjin Yuan
- The Center for Bioinformatics and Computational Biology, The Institute of Biomedical Sciences, The School of Life Sciences, East China Normal University, Shanghai, China
| | - Peng Li
- The Center for Bioinformatics and Computational Biology, The Institute of Biomedical Sciences, The School of Life Sciences, East China Normal University, Shanghai, China
| | - Chang Chang
- The Center for Bioinformatics and Computational Biology, The Institute of Biomedical Sciences, The School of Life Sciences, East China Normal University, Shanghai, China
| | - Jian Luo
- The Center for Bioinformatics and Computational Biology, The Institute of Biomedical Sciences, The School of Life Sciences, East China Normal University, Shanghai, China
| | - Mingyao Liu
- The Center for Bioinformatics and Computational Biology, The Institute of Biomedical Sciences, The School of Life Sciences, East China Normal University, Shanghai, China
| | - Tieliu Shi
- The Center for Bioinformatics and Computational Biology, The Institute of Biomedical Sciences, The School of Life Sciences, East China Normal University, Shanghai, China
- * E-mail:
| |
Collapse
|
26
|
Greenfield A, Madar A, Ostrer H, Bonneau R. DREAM4: Combining genetic and dynamic information to identify biological networks and dynamical models. PLoS One 2010; 5:e13397. [PMID: 21049040 PMCID: PMC2963605 DOI: 10.1371/journal.pone.0013397] [Citation(s) in RCA: 125] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2010] [Accepted: 09/09/2010] [Indexed: 11/23/2022] Open
Abstract
Background Current technologies have lead to the availability of multiple genomic data types in sufficient quantity and quality to serve as a basis for automatic global network inference. Accordingly, there are currently a large variety of network inference methods that learn regulatory networks to varying degrees of detail. These methods have different strengths and weaknesses and thus can be complementary. However, combining different methods in a mutually reinforcing manner remains a challenge. Methodology We investigate how three scalable methods can be combined into a useful network inference pipeline. The first is a novel t-test–based method that relies on a comprehensive steady-state knock-out dataset to rank regulatory interactions. The remaining two are previously published mutual information and ordinary differential equation based methods (tlCLR and Inferelator 1.0, respectively) that use both time-series and steady-state data to rank regulatory interactions; the latter has the added advantage of also inferring dynamic models of gene regulation which can be used to predict the system's response to new perturbations. Conclusion/Significance Our t-test based method proved powerful at ranking regulatory interactions, tying for first out of methods in the DREAM4 100-gene in-silico network inference challenge. We demonstrate complementarity between this method and the two methods that take advantage of time-series data by combining the three into a pipeline whose ability to rank regulatory interactions is markedly improved compared to either method alone. Moreover, the pipeline is able to accurately predict the response of the system to new conditions (in this case new double knock-out genetic perturbations). Our evaluation of the performance of multiple methods for network inference suggests avenues for future methods development and provides simple considerations for genomic experimental design. Our code is publicly available at http://err.bio.nyu.edu/inferelator/.
Collapse
Affiliation(s)
- Alex Greenfield
- Computational Biology Program, New York University Sackler School of Medicine, New York, New York, United States of America
| | - Aviv Madar
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, New York, United States of America
| | - Harry Ostrer
- Human Genetics Program, Department of Pediatrics, New York University Langone Medical Center, New York, New York, United States of America
| | - Richard Bonneau
- Computational Biology Program, New York University Sackler School of Medicine, New York, New York, United States of America
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, New York, United States of America
- Computer Science Department, Courant Institute of Mathematical Sciences, New York University, New York, New York, United States of America
- * E-mail:
| |
Collapse
|
27
|
Andersen ME, Al-Zoughool M, Croteau M, Westphal M, Krewski D. The future of toxicity testing. JOURNAL OF TOXICOLOGY AND ENVIRONMENTAL HEALTH. PART B, CRITICAL REVIEWS 2010; 13:163-196. [PMID: 20574896 DOI: 10.1080/10937404.2010.483933] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
In 2007, the U.S. National Research Council (NRC) released a report, "Toxicity Testing in the 21st Century: A Vision and a Strategy," that proposes a paradigm shift for toxicity testing of environmental agents. The vision is based on the notion that exposure to environmental agents leads to adverse health outcomes through the perturbation of toxicity pathways that are operative in humans. Implementation of the NRC vision will involve a fundamental change in the assessment of toxicity of environmental agents, moving away from adverse health outcomes observed in experimental animals to the identification of critical perturbations of toxicity pathways. Pathway perturbations will be identified using in vitro assays and quantified for dose response using methods in computational toxicology and other recent scientific advances in basic biology. Implementation of the NRC vision will require a major research effort, not unlike that required to successfully map the human genome, extending over 10 to 20 years, involving the broad scientific community to map important toxicity pathways operative in humans. This article provides an overview of the scientific tools and technologies that will form the core of the NRC vision for toxicity testing. Of particular importance will be the development of rapidly performed in vitro screening assays using human cells and cell lines or human tissue surrogates to efficiently identify environmental agents producing critical pathway perturbations. In addition to the overview of the NRC vision, this study documents the reaction by a number of stakeholder groups since 2007, including the scientific, risk assessment, regulatory, and animal welfare communities.
Collapse
Affiliation(s)
- Melvin E Andersen
- Hamner Institutes for Health Sciences, Research Triangle Park, North Carolina, USA
| | | | | | | | | |
Collapse
|