1
|
Ajmal HB, Madden MG. Dynamic Bayesian Network Learning to Infer Sparse Models From Time Series Gene Expression Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2794-2805. [PMID: 34181549 DOI: 10.1109/tcbb.2021.3092879] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
One of the key challenges in systems biology is to derive gene regulatory networks (GRNs) from complex high-dimensional sparse data. Bayesian networks (BNs) and dynamic Bayesian networks (DBNs) have been widely applied to infer GRNs from gene expression data. GRNs are typically sparse but traditional approaches of BN structure learning to elucidate GRNs often produce many spurious (false positive) edges. We present two new BN scoring functions, which are extensions to the Bayesian Information Criterion (BIC) score, with additional penalty terms and use them in conjunction with DBN structure search methods to find a graph structure that maximises the proposed scores. Our BN scoring functions offer better solutions for inferring networks with fewer spurious edges compared to the BIC score. The proposed methods are evaluated extensively on auto regressive and DREAM4 benchmarks. We found that they significantly improve the precision of the learned graphs, relative to the BIC score. The proposed methods are also evaluated on three real time series gene expression datasets. The results demonstrate that our algorithms are able to learn sparse graphs from high-dimensional time series data. The implementation of these algorithms is open source and is available in form of an R package on GitHub at https://github.com/HamdaBinteAjmal/DBN4GRN, along with the documentation and tutorials.
Collapse
|
2
|
Ajmal HB, Madden MG. Inferring dynamic gene regulatory networks with low-order conditional independencies – an evaluation of the method. Stat Appl Genet Mol Biol 2020. [DOI: 10.1515/sagmb-2020-0051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
AbstractOver a decade ago, Lèbre (2009) proposed an inference method, G1DBN, to learn the structure of gene regulatory networks (GRNs) from high dimensional, sparse time-series gene expression data. Their approach is based on concept of low-order conditional independence graphs that they extend to dynamic Bayesian networks (DBNs). They present results to demonstrate that their method yields better structural accuracy compared to the related Lasso and Shrinkage methods, particularly where the data is sparse, that is, the number of time measurements n is much smaller than the number of genes p. This paper challenges these claims using a careful experimental analysis, to show that the GRNs reverse engineered from time-series data using the G1DBN approach are less accurate than claimed by Lèbre (2009). We also show that the Lasso method yields higher structural accuracy for graphs learned from the simulated data, compared to the G1DBN method, particularly when the data is sparse ($n{< }{< }p$). The Lasso method is also better than G1DBN at identifying the transcription factors (TFs) involved in the cell cycle of Saccharomyces cerevisiae.
Collapse
Affiliation(s)
- Hamda B. Ajmal
- School of Computer Science, National University of Ireland, Galway, Ireland
| | - Michael G. Madden
- School of Computer Science, National University of Ireland, Galway, Ireland
| |
Collapse
|
3
|
He R, Chen G, Shen X, Jiang S, Chen G. Reliability assessment of repairable closed-loop process systems under uncertainties. ISA TRANSACTIONS 2020; 104:222-232. [PMID: 32402436 DOI: 10.1016/j.isatra.2020.05.008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/29/2019] [Revised: 04/17/2020] [Accepted: 05/05/2020] [Indexed: 06/11/2023]
Abstract
System reliability assessment plays a crucial role in making maintenance decisions and reducing hazard frequencies. Although many engineering methods can effectively evaluate the process reliability, most of them are often unreasonable for closed-loop systems because of the combination of closed-loop structures, maintenance characteristics, and dynamic failure mechanisms. Also, uncertainties generally exist in the reliability assessment due to the insufficient reliability data and expert knowledge. Therefore, an integrated approach is proposed in present works to assess the dynamic reliability of repairable closed-loop systems with the consideration of uncertainties. Firstly, Bayesian inference and fuzzy theorem are developed to characterize system uncertainties and estimate lifetime parameters of components. After that, a closed-loop probabilistic reliability assessment (CPRA) method is proposed for the dynamic reliability assessment of closed-loop systems by integrating cyclic Bayesian network modeling and dynamic Bayesian network solving. Besides, a novel non-probabilistic reliability assessment (NPRA) approach based on the probabilistic method and Monte Carlo simulation is presented to make maintenance decisions for repairable systems. Finally, an application of reliability assessment for the offshore crude oil separation system is introduced to verify the proposed methods.
Collapse
Affiliation(s)
- Rui He
- Centre for Offshore Engineering and Safety Technology (COEST), China University of Petroleum (East China), No.66, Changjiang West Road, Qingdao, China
| | - Guoming Chen
- Centre for Offshore Engineering and Safety Technology (COEST), China University of Petroleum (East China), No.66, Changjiang West Road, Qingdao, China.
| | - Xiaoyu Shen
- Centre for Offshore Engineering and Safety Technology (COEST), China University of Petroleum (East China), No.66, Changjiang West Road, Qingdao, China
| | - Shengyu Jiang
- Centre for Offshore Engineering and Safety Technology (COEST), China University of Petroleum (East China), No.66, Changjiang West Road, Qingdao, China
| | - Guoxing Chen
- Centre for Offshore Engineering and Safety Technology (COEST), China University of Petroleum (East China), No.66, Changjiang West Road, Qingdao, China
| |
Collapse
|
4
|
Affiliation(s)
- Marco Scutari
- Istituto Dalle Molle di Studi sull'Intelligenza Artificiale (IDSIA) Manno Switzerland
| |
Collapse
|
5
|
Liang Y, Kelemen A. Dynamic modeling and network approaches for omics time course data: overview of computational approaches and applications. Brief Bioinform 2019; 19:1051-1068. [PMID: 28430854 DOI: 10.1093/bib/bbx036] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2016] [Indexed: 12/23/2022] Open
Abstract
Inferring networks and dynamics of genes, proteins, cells and other biological entities from high-throughput biological omics data is a central and challenging issue in computational and systems biology. This is essential for understanding the complexity of human health, disease susceptibility and pathogenesis for Predictive, Preventive, Personalized and Participatory (P4) system and precision medicine. The delineation of the possible interactions of all genes/proteins in a genome/proteome is a task for which conventional experimental techniques are ill suited. Urgently needed are rapid and inexpensive computational and statistical methods that can identify interacting candidate disease genes or drug targets out of thousands that can be further investigated or validated by experimentations. Moreover, identifying biological dynamic systems, and simultaneously estimating the important kinetic structural and functional parameters, which may not be experimentally accessible could be important directions for drug-disease-gene network studies. In this article, we present an overview and comparison of recent developments of dynamic modeling and network approaches for time-course omics data, and their applications to various biological systems, health conditions and disease statuses. Moreover, various data reduction and analytical schemes ranging from mathematical to computational to statistical methods are compared including their merits, drawbacks and limitations. The most recent software, associated web resources and other potentials for the compared methods are also presented and discussed in detail.
Collapse
Affiliation(s)
- Yulan Liang
- Department of Family and Community Health, University of Maryland, Baltimore, MD, USA
| | - Arpad Kelemen
- Department of Family and Community Health, University of Maryland, Baltimore, MD, USA
| |
Collapse
|
6
|
Dondelinger F, Mukherjee S. Statistical Network Inference for Time-Varying Molecular Data with Dynamic Bayesian Networks. Methods Mol Biol 2019; 1883:25-48. [PMID: 30547395 DOI: 10.1007/978-1-4939-8882-2_2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/24/2023]
Abstract
In this chapter, we review the problem of network inference from time-course data, focusing on a class of graphical models known as dynamic Bayesian networks (DBNs). We discuss the relationship of DBNs to models based on ordinary differential equations, and consider extensions to nonlinear time dynamics. We provide an introduction to time-varying DBN models, which allow for changes to the network structure and parameters over time. We also discuss causal perspectives on network inference, including issues around model semantics that can arise due to missing variables. We present a case study of applying time-varying DBNs to gene expression measurements over the life cycle of Drosophila melanogaster. We finish with a discussion of future perspectives, including possible applications of time-varying network inference to single-cell gene expression data.
Collapse
Affiliation(s)
| | - Sach Mukherjee
- German Center for Neurodegenerative Diseases (DZNE), Bonn, Germany
| |
Collapse
|
7
|
Abstract
Gene regulatory networks are powerful abstractions of biological systems. Since the advent of high-throughput measurement technologies in biology in the late 1990s, reconstructing the structure of such networks has been a central computational problem in systems biology. While the problem is certainly not solved in its entirety, considerable progress has been made in the last two decades, with mature tools now available. This chapter aims to provide an introduction to the basic concepts underpinning network inference tools, attempting a categorization which highlights commonalities and relative strengths. While the chapter is meant to be self-contained, the material presented should provide a useful background to the later, more specialized chapters of this book.
Collapse
Affiliation(s)
- Vân Anh Huynh-Thu
- Department of Electrical Engineering and Computer Science, University of Liège, Liège, Belgium
| | | |
Collapse
|
8
|
Abstract
We consider the problem of modeling conditional independence structures in heterogeneous data in the presence of additional subject-level covariates - termed Graphical Regression. We propose a novel specification of a conditional (in)dependence function of covariates - which allows the structure of a directed graph to vary flexibly with the covariates; imposes sparsity in both edge and covariate selection; produces both subject-specific and predictive graphs; and is computationally tractable. We provide theoretical justifications of our modeling endeavor, in terms of graphical model selection consistency. We demonstrate the performance of our method through rigorous simulation studies. We illustrate our approach in a cancer genomics-based precision medicine paradigm, where-in we explore gene regulatory networks in multiple myeloma taking prognostic clinical factors into account to obtain both population-level and subject-level gene regulatory networks.
Collapse
Affiliation(s)
- Yang Ni
- Department of Statistics and Data Sciences, The University of Texas at Austin
- Department of Statistics, Rice University
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center
| | - Francesco C Stingo
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center
- Department of Statistics, Computer Science, Applications "G. Parenti", The University of Florence
| | | |
Collapse
|
9
|
Mahdi Mahmoudi S, Wit EC. Estimating Causal Effects from Nonparanormal Observational Data. Int J Biostat 2018; 14:/j/ijb.ahead-of-print/ijb-2018-0030/ijb-2018-0030.xml. [PMID: 30173203 DOI: 10.1515/ijb-2018-0030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2017] [Accepted: 07/26/2018] [Indexed: 11/15/2022]
Abstract
One of the basic aims of science is to unravel the chain of cause and effect of particular systems. Especially for large systems, this can be a daunting task. Detailed interventional and randomized data sampling approaches can be used to resolve the causality question, but for many systems, such interventions are impossible or too costly to obtain. Recently, Maathuis et al. (2010), following ideas from Spirtes et al. (2000), introduced a framework to estimate causal effects in large scale Gaussian systems. By describing the causal network as a directed acyclic graph it is a possible to estimate a class of Markov equivalent systems that describe the underlying causal interactions consistently, even for non-Gaussian systems. In these systems, causal effects stop being linear and cannot be described any more by a single coefficient. In this paper, we derive the general functional form of a causal effect in a large subclass of non-Gaussian distributions, called the non-paranormal. We also derive a convenient approximation, which can be used effectively in estimation. We show that the estimate is consistent under certain conditions and we apply the method to an observational gene expression dataset of the Arabidopsis thaliana circadian clock system.
Collapse
Affiliation(s)
- Seyed Mahdi Mahmoudi
- Department of Statistics, Faculty of Mathematics, Statistics and Computer Science, Semnan University, Semnan, Iran
| | - Ernst C Wit
- Johann Bernoulli Institute (FWN), Rijksuniversiteit Groningen Faculteit voor Wiskunde en Natuurwetenschappen, Groningen, Netherlands
| |
Collapse
|
10
|
Vinciotti V, Augugliaro L, Abbruzzo A, Wit EC. Model selection for factorial Gaussian graphical models with an application to dynamic regulatory networks. Stat Appl Genet Mol Biol 2018; 15:193-212. [PMID: 27023322 DOI: 10.1515/sagmb-2014-0075] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Factorial Gaussian graphical Models (fGGMs) have recently been proposed for inferring dynamic gene regulatory networks from genomic high-throughput data. In the search for true regulatory relationships amongst the vast space of possible networks, these models allow the imposition of certain restrictions on the dynamic nature of these relationships, such as Markov dependencies of low order - some entries of the precision matrix are a priori zeros - or equal dependency strengths across time lags - some entries of the precision matrix are assumed to be equal. The precision matrix is then estimated by l1-penalized maximum likelihood, imposing a further constraint on the absolute value of its entries, which results in sparse networks. Selecting the optimal sparsity level is a major challenge for this type of approaches. In this paper, we evaluate the performance of a number of model selection criteria for fGGMs by means of two simulated regulatory networks from realistic biological processes. The analysis reveals a good performance of fGGMs in comparison with other methods for inferring dynamic networks and of the KLCV criterion in particular for model selection. Finally, we present an application on a high-resolution time-course microarray data from the Neisseria meningitidis bacterium, a causative agent of life-threatening infections such as meningitis. The methodology described in this paper is implemented in the R package sglasso, freely available at CRAN, http://CRAN.R-project.org/package=sglasso.
Collapse
|
11
|
Wu PPY, McMahon K, Rasheed MA, Kendrick GA, York PH, Chartrand K, Caley MJ, Mengersen K. Managing seagrass resilience under cumulative dredging affecting light: Predicting risk using dynamic Bayesian networks. J Appl Ecol 2017. [DOI: 10.1111/1365-2664.13037] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Paul Pao-Yen Wu
- Australian Research Council; Centre of Excellence in Mathematical and Statistical Frontiers; Brisbane Qld Australia
- School of Mathematical Sciences, Science and Engineering Faculty; Queensland University of Technology; Brisbane Qld Australia
| | - Kathryn McMahon
- The Western Australian Marine Science Institution; Crawley WA Australia
- School of Natural Sciences; Edith Cowan University; Joondalup WA Australia
| | - Michael A. Rasheed
- Centre for Tropical Water & Aquatic Ecosystem Research; James Cook University; Townsville Qld Australia
| | - Gary A. Kendrick
- The Western Australian Marine Science Institution; Crawley WA Australia
- UWA Oceans Institute and School of Plant Biology; University of Western Australia; Perth WA Australia
| | - Paul H. York
- Centre for Tropical Water & Aquatic Ecosystem Research; James Cook University; Townsville Qld Australia
| | - Kathryn Chartrand
- Centre for Tropical Water & Aquatic Ecosystem Research; James Cook University; Townsville Qld Australia
| | - M. Julian Caley
- Australian Research Council; Centre of Excellence in Mathematical and Statistical Frontiers; Brisbane Qld Australia
- School of Mathematical Sciences, Science and Engineering Faculty; Queensland University of Technology; Brisbane Qld Australia
| | - Kerrie Mengersen
- Australian Research Council; Centre of Excellence in Mathematical and Statistical Frontiers; Brisbane Qld Australia
- School of Mathematical Sciences, Science and Engineering Faculty; Queensland University of Technology; Brisbane Qld Australia
| |
Collapse
|
12
|
Liang Y, Kelemen A. Bayesian state space models for dynamic genetic network construction across multiple tissues. Stat Appl Genet Mol Biol 2017; 15:273-90. [PMID: 27343475 DOI: 10.1515/sagmb-2014-0055] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Construction of gene-gene interaction networks and potential pathways is a challenging and important problem in genomic research for complex diseases while estimating the dynamic changes of the temporal correlations and non-stationarity are the keys in this process. In this paper, we develop dynamic state space models with hierarchical Bayesian settings to tackle this challenge for inferring the dynamic profiles and genetic networks associated with disease treatments. We treat both the stochastic transition matrix and the observation matrix time-variant and include temporal correlation structures in the covariance matrix estimations in the multivariate Bayesian state space models. The unevenly spaced short time courses with unseen time points are treated as hidden state variables. Hierarchical Bayesian approaches with various prior and hyper-prior models with Monte Carlo Markov Chain and Gibbs sampling algorithms are used to estimate the model parameters and the hidden state variables. We apply the proposed Hierarchical Bayesian state space models to multiple tissues (liver, skeletal muscle, and kidney) Affymetrix time course data sets following corticosteroid (CS) drug administration. Both simulation and real data analysis results show that the genomic changes over time and gene-gene interaction in response to CS treatment can be well captured by the proposed models. The proposed dynamic Hierarchical Bayesian state space modeling approaches could be expanded and applied to other large scale genomic data, such as next generation sequence (NGS) combined with real time and time varying electronic health record (EHR) for more comprehensive and robust systematic and network based analysis in order to transform big biomedical data into predictions and diagnostics for precision medicine and personalized healthcare with better decision making and patient outcomes.
Collapse
|
13
|
Intosalmi J, Nousiainen K, Ahlfors H, Lähdesmäki H. Data-driven mechanistic analysis method to reveal dynamically evolving regulatory networks. Bioinformatics 2017; 32:i288-i296. [PMID: 27307629 PMCID: PMC4908358 DOI: 10.1093/bioinformatics/btw274] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Motivation: Mechanistic models based on ordinary differential equations provide powerful and accurate means to describe the dynamics of molecular machinery which orchestrates gene regulation. When combined with appropriate statistical techniques, mechanistic models can be calibrated using experimental data and, in many cases, also the model structure can be inferred from time–course measurements. However, existing mechanistic models are limited in the sense that they rely on the assumption of static network structure and cannot be applied when transient phenomena affect, or rewire, the network structure. In the context of gene regulatory network inference, network rewiring results from the net impact of possible unobserved transient phenomena such as changes in signaling pathway activities or epigenome, which are generally difficult, but important, to account for. Results: We introduce a novel method that can be used to infer dynamically evolving regulatory networks from time–course data. Our method is based on the notion that all mechanistic ordinary differential equation models can be coupled with a latent process that approximates the network structure rewiring process. We illustrate the performance of the method using simulated data and, further, we apply the method to study the regulatory interactions during T helper 17 (Th17) cell differentiation using time–course RNA sequencing data. The computational experiments with the real data show that our method is capable of capturing the experimentally verified rewiring effects of the core Th17 regulatory network. We predict Th17 lineage specific subnetworks that are activated sequentially and control the differentiation process in an overlapping manner. Availability and Implementation: An implementation of the method is available at http://research.ics.aalto.fi/csb/software/lem/. Contacts:jukka.intosalmi@aalto.fi or harri.lahdesmaki@aalto.fi
Collapse
Affiliation(s)
- Jukka Intosalmi
- Department of Computer Science, Aalto University, Aalto, FI-00076, Finland
| | - Kari Nousiainen
- Department of Computer Science, Aalto University, Aalto, FI-00076, Finland
| | - Helena Ahlfors
- Lymphocyte Signalling and Development, The Babraham Institute, Cambridgeshire CB22 3AT, UK
| | - Harri Lähdesmäki
- Department of Computer Science, Aalto University, Aalto, FI-00076, Finland
| |
Collapse
|
14
|
Liang Y, Kelemen A. Computational dynamic approaches for temporal omics data with applications to systems medicine. BioData Min 2017. [PMID: 28638442 PMCID: PMC5473988 DOI: 10.1186/s13040-017-0140-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Modeling and predicting biological dynamic systems and simultaneously estimating the kinetic structural and functional parameters are extremely important in systems and computational biology. This is key for understanding the complexity of the human health, drug response, disease susceptibility and pathogenesis for systems medicine. Temporal omics data used to measure the dynamic biological systems are essentials to discover complex biological interactions and clinical mechanism and causations. However, the delineation of the possible associations and causalities of genes, proteins, metabolites, cells and other biological entities from high throughput time course omics data is challenging for which conventional experimental techniques are not suited in the big omics era. In this paper, we present various recently developed dynamic trajectory and causal network approaches for temporal omics data, which are extremely useful for those researchers who want to start working in this challenging research area. Moreover, applications to various biological systems, health conditions and disease status, and examples that summarize the state-of-the art performances depending on different specific mining tasks are presented. We critically discuss the merits, drawbacks and limitations of the approaches, and the associated main challenges for the years ahead. The most recent computing tools and software to analyze specific problem type, associated platform resources, and other potentials for the dynamic trajectory and interaction methods are also presented and discussed in detail.
Collapse
Affiliation(s)
- Yulan Liang
- Department of Family and Community Health, University of Maryland, Baltimore, MD 21201 USA
| | - Arpad Kelemen
- Department of Organizational Systems and Adult Health, University of Maryland, Baltimore, MD 21201 USA
| |
Collapse
|
15
|
Wu PP, Julian Caley M, Kendrick GA, McMahon K, Mengersen K. Dynamic Bayesian network inferencing for non‐homogeneous complex systems. J R Stat Soc Ser C Appl Stat 2017. [DOI: 10.1111/rssc.12228] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Paul P.‐Y. Wu
- Queensland University of Technology, and Australian Research Council Centre of Excellence in Mathematical and Statistical Frontiers Brisbane Australia
| | - M. Julian Caley
- Queensland University of Technology, and Australian Research Council Centre of Excellence in Mathematical and Statistical Frontiers Brisbane Australia
| | - Gary A. Kendrick
- University of Western Australia, Crawley, and Western Australia Marine Science Institution Perth Australia
| | - Kathryn McMahon
- Edith Cowan University, Joondalup, and Western Australia Marine Science Institution Perth Australia
| | - Kerrie Mengersen
- Queensland University of Technology, and Australian Research Council Centre of Excellence in Mathematical and Statistical Frontiers Brisbane Australia
| |
Collapse
|
16
|
Grzegorczyk M. A non-homogeneous dynamic Bayesian network with a hidden Markov model dependency structure among the temporal data points. Mach Learn 2016. [DOI: 10.1007/s10994-015-5503-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
17
|
Jennen DGJ, van Leeuwen DM, Hendrickx DM, Gottschalk RWH, van Delft JHM, Kleinjans JCS. Bayesian Network Inference Enables Unbiased Phenotypic Anchoring of Transcriptomic Responses to Cigarette Smoke in Humans. Chem Res Toxicol 2015; 28:1936-48. [PMID: 26360787 DOI: 10.1021/acs.chemrestox.5b00145] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Microarray-based transcriptomic analysis has been demonstrated to hold the opportunity to study the effects of human exposure to, e.g., chemical carcinogens at the whole genome level, thus yielding broad-ranging molecular information on possible carcinogenic effects. Since genes do not operate individually but rather through concerted interactions, analyzing and visualizing networks of genes should provide important mechanistic information, especially upon connecting them to functional parameters, such as those derived from measurements of biomarkers for exposure and carcinogenic risk. Conventional methods such as hierarchical clustering and correlation analyses are frequently used to address these complex interactions but are limited as they do not provide directional causal dependence relationships. Therefore, our aim was to apply Bayesian network inference with the purpose of phenotypic anchoring of modified gene expressions. We investigated a use case on transcriptomic responses to cigarette smoking in humans, in association with plasma cotinine levels as biomarkers of exposure and aromatic DNA-adducts in blood cells as biomarkers of carcinogenic risk. Many of the genes that appear in the Bayesian networks surrounding plasma cotinine, and to a lesser extent around aromatic DNA-adducts, hold biologically relevant functions in inducing severe adverse effects of smoking. In conclusion, this study shows that Bayesian network inference enables unbiased phenotypic anchoring of transcriptomics responses. Furthermore, in all inferred Bayesian networks several dependencies are found which point to known but also to new relationships between the expression of specific genes, cigarette smoke exposure, DNA damaging-effects, and smoking-related diseases, in particular associated with apoptosis, DNA repair, and tumor suppression, as well as with autoimmunity.
Collapse
Affiliation(s)
- Danyel G J Jennen
- Department of Toxicogenomics, Maastricht University , Universiteitssingel 40, 6229 ER Maastricht, The Netherlands
| | - Danitsja M van Leeuwen
- Department of Toxicogenomics, Maastricht University , Universiteitssingel 40, 6229 ER Maastricht, The Netherlands
| | - Diana M Hendrickx
- Department of Toxicogenomics, Maastricht University , Universiteitssingel 40, 6229 ER Maastricht, The Netherlands
| | - Ralph W H Gottschalk
- Department of Toxicogenomics, Maastricht University , Universiteitssingel 40, 6229 ER Maastricht, The Netherlands
| | - Joost H M van Delft
- Department of Toxicogenomics, Maastricht University , Universiteitssingel 40, 6229 ER Maastricht, The Netherlands
| | - Jos C S Kleinjans
- Department of Toxicogenomics, Maastricht University , Universiteitssingel 40, 6229 ER Maastricht, The Netherlands
| |
Collapse
|
18
|
Network Analysis Identifies Crosstalk Interactions Governing TGF-β Signaling Dynamics during Endoderm Differentiation of Human Embryonic Stem Cells. Processes (Basel) 2015. [DOI: 10.3390/pr3020286] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
|
19
|
Maiti A, Reddy R, Mukherjee A. Structural prediction of dynamic Bayesian network with partial prior information. IEEE Trans Nanobioscience 2014; 14:95-103. [PMID: 25314704 DOI: 10.1109/tnb.2014.2361838] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The prediction of the structure of a hidden dynamic Bayesian network (DBN) from a noisy dataset is an important and challenging task. This work presents a generalized framework to infer the DBN network structure with partial prior information. In the proposed framework, the partial information about the network structure is provided in the form of prior. The proposed method makes use of the prior information regarding the presence and as well as absence of some of the edges. Using the noisy dataset and partial prior information, this method is able to infer nearly accurate structure of the network. The proposed method is validated using simulated datasets. In addition, two real biological datasets are used to infer hidden biological interaction networks.
Collapse
|
20
|
Xiong J, Zhou T. A Kalman-filter based approach to identification of time-varying gene regulatory networks. PLoS One 2013; 8:e74571. [PMID: 24116005 PMCID: PMC3792119 DOI: 10.1371/journal.pone.0074571] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2013] [Accepted: 08/04/2013] [Indexed: 11/18/2022] Open
Abstract
Motivation Conventional identification methods for gene regulatory networks (GRNs) have overwhelmingly adopted static topology models, which remains unchanged over time to represent the underlying molecular interactions of a biological system. However, GRNs are dynamic in response to physiological and environmental changes. Although there is a rich literature in modeling static or temporally invariant networks, how to systematically recover these temporally changing networks remains a major and significant pressing challenge. The purpose of this study is to suggest a two-step strategy that recovers time-varying GRNs. Results It is suggested in this paper to utilize a switching auto-regressive model to describe the dynamics of time-varying GRNs, and a two-step strategy is proposed to recover the structure of time-varying GRNs. In the first step, the change points are detected by a Kalman-filter based method. The observed time series are divided into several segments using these detection results; and each time series segment belonging to two successive demarcating change points is associated with an individual static regulatory network. In the second step, conditional network structure identification methods are used to reconstruct the topology for each time interval. This two-step strategy efficiently decouples the change point detection problem and the topology inference problem. Simulation results show that the proposed strategy can detect the change points precisely and recover each individual topology structure effectively. Moreover, computation results with the developmental data of Drosophila Melanogaster show that the proposed change point detection procedure is also able to work effectively in real world applications and the change point estimation accuracy exceeds other existing approaches, which means the suggested strategy may also be helpful in solving actual GRN reconstruction problem.
Collapse
Affiliation(s)
- Jie Xiong
- Department of Automation, Tsinghua University, Beijing, China
- * E-mail:
| | - Tong Zhou
- Department of Automation and Tsinghua National Laboratory for Information Science and Technology(TNList), Tsinghua University, Beijing, China
| |
Collapse
|
21
|
Kim Y, Han S, Choi S, Hwang D. Inference of dynamic networks using time-course data. Brief Bioinform 2013; 15:212-28. [PMID: 23698724 DOI: 10.1093/bib/bbt028] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Cells execute their functions through dynamic operations of biological networks. Dynamic networks delineate the operation of biological networks in terms of temporal changes of abundances or activities of nodes (proteins and RNAs), as well as formation of new edges and disappearance of existing edges over time. Global genomic and proteomic technologies can be used to decode dynamic networks. However, using these experimental methods, it is still challenging to identify temporal transition of nodes and edges. Thus, several computational methods for estimating dynamic topological and functional characteristics of networks have been introduced. In this review, we summarize concepts and applications of these computational methods for inferring dynamic networks and further summarize methods for estimating spatial transition of biological networks.
Collapse
Affiliation(s)
- Yongsoo Kim
- POSTECH, Pohang, 790-784, Republic of Korea. Tel.: 82-54-279-2393; Fax: 82-54-279-8409;
| | | | | | | |
Collapse
|
22
|
Abegaz F, Wit E. Sparse time series chain graphical models for reconstructing genetic networks. Biostatistics 2013; 14:586-99. [PMID: 23462022 DOI: 10.1093/biostatistics/kxt005] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We propose a sparse high-dimensional time series chain graphical model for reconstructing genetic networks from gene expression data parametrized by a precision matrix and autoregressive coefficient matrix. We consider the time steps as blocks or chains. The proposed approach explores patterns of contemporaneous and dynamic interactions by efficiently combining Gaussian graphical models and Bayesian dynamic networks. We use penalized likelihood inference with a smoothly clipped absolute deviation penalty to explore the relationships among the observed time course gene expressions. The method is illustrated on simulated data and on real data examples from Arabidopsis thaliana and mammary gland time course microarray gene expressions.
Collapse
Affiliation(s)
- Fentaw Abegaz
- Johann Bernoulli Institute of Mathematics and Computer Science, University of Groningen, Nijenborgh 9, The Netherlands.
| | | |
Collapse
|
23
|
Regularization of non-homogeneous dynamic Bayesian networks with global information-coupling based on hierarchical Bayesian models. Mach Learn 2013. [DOI: 10.1007/s10994-012-5326-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
24
|
Hill SM, Lu Y, Molina J, Heiser LM, Spellman PT, Speed TP, Gray JW, Mills GB, Mukherjee S. Bayesian inference of signaling network topology in a cancer cell line. Bioinformatics 2012; 28:2804-10. [PMID: 22923301 PMCID: PMC3476330 DOI: 10.1093/bioinformatics/bts514] [Citation(s) in RCA: 80] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2012] [Revised: 07/27/2012] [Accepted: 08/13/2012] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Protein signaling networks play a key role in cellular function, and their dysregulation is central to many diseases, including cancer. To shed light on signaling network topology in specific contexts, such as cancer, requires interrogation of multiple proteins through time and statistical approaches to make inferences regarding network structure. RESULTS In this study, we use dynamic Bayesian networks to make inferences regarding network structure and thereby generate testable hypotheses. We incorporate existing biology using informative network priors, weighted objectively by an empirical Bayes approach, and exploit a connection between variable selection and network inference to enable exact calculation of posterior probabilities of interest. The approach is computationally efficient and essentially free of user-set tuning parameters. Results on data where the true, underlying network is known place the approach favorably relative to existing approaches. We apply these methods to reverse-phase protein array time-course data from a breast cancer cell line (MDA-MB-468) to predict signaling links that we independently validate using targeted inhibition. The methods proposed offer a general approach by which to elucidate molecular networks specific to biological context, including, but not limited to, human cancers. AVAILABILITY http://mukherjeelab.nki.nl/DBN (code and data).
Collapse
Affiliation(s)
- Steven M Hill
- Department of Biochemistry, The Netherlands Cancer Institute, 1066 CX, Amsterdam, The Netherlands
| | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Aderhold A, Husmeier D, Lennon JJ, Beale CM, Smith VA. Hierarchical Bayesian models in ecology: Reconstructing species interaction networks from non-homogeneous species abundance data. ECOL INFORM 2012. [DOI: 10.1016/j.ecoinf.2012.05.002] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
26
|
Dondelinger F, Lèbre S, Husmeier D. Non-homogeneous dynamic Bayesian networks with Bayesian regularization for inferring gene regulatory networks with gradually time-varying structure. Mach Learn 2012. [DOI: 10.1007/s10994-012-5311-x] [Citation(s) in RCA: 67] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|