1
|
Spitzner DJ. Calibrated Bayes factors under flexible priors. STAT METHOD APPL-GER 2023. [DOI: 10.1007/s10260-023-00683-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2023]
|
2
|
Samanta S, Khare K, Michailidis G. A generalized likelihood-based Bayesian approach for scalable joint regression and covariance selection in high dimensions. STATISTICS AND COMPUTING 2022; 32:47. [PMID: 36713060 PMCID: PMC9881595 DOI: 10.1007/s11222-022-10102-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Accepted: 04/27/2022] [Indexed: 06/05/2023]
Abstract
The paper addresses joint sparsity selection in the regression coefficient matrix and the error precision (inverse covariance) matrix for high-dimensional multivariate regression models in the Bayesian paradigm. The selected sparsity patterns are crucial to help understand the network of relationships between the predictor and response variables, as well as the conditional relationships among the latter. While Bayesian methods have the advantage of providing natural uncertainty quantification through posterior inclusion probabilities and credible intervals, current Bayesian approaches either restrict to specific sub-classes of sparsity patterns and/or are not scalable to settings with hundreds of responses and predictors. Bayesian approaches which only focus on estimating the posterior mode are scalable, but do not generate samples from the posterior distribution for uncertainty quantification. Using a bi-convex regression based generalized likelihood and spike-and-slab priors, we develop an algorithm called Joint Regression Network Selector (JRNS) for joint regression and covariance selection which (a) can accommodate general sparsity patterns, (b) provides posterior samples for uncertainty quantification, and (c) is scalable and orders of magnitude faster than the state-of-the-art Bayesian approaches providing uncertainty quantification. We demonstrate the statistical and computational efficacy of the proposed approach on synthetic data and through the analysis of selected cancer data sets. We also establish high-dimensional posterior consistency for one of the developed algorithms.
Collapse
|
3
|
Castelletti F, Consonni G, La Rocca L. Discussion to: Bayesian graphical models for modern biological applications by Y. Ni, V. Baladandayuthapani, M. Vannucci and F.C. Stingo. STAT METHOD APPL-GER 2022. [DOI: 10.1007/s10260-021-00601-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
4
|
Castelletti F, Peluso S. Equivalence class selection of categorical graphical models. Comput Stat Data Anal 2021. [DOI: 10.1016/j.csda.2021.107304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
5
|
Ha MJ, Stingo FC, Baladandayuthapani V. Bayesian Structure Learning in Multi-layered Genomic Networks. J Am Stat Assoc 2021; 116:605-618. [PMID: 34239216 DOI: 10.1080/01621459.2020.1775611] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Integrative network modeling of data arising from multiple genomic platforms provides insight into the holistic picture of the interactive system, as well as the flow of information across many disease domains including cancer. The basic data structure consists of a sequence of hierarchically ordered datasets for each individual subject, which facilitates integration of diverse inputs, such as genomic, transcriptomic, and proteomic data. A primary analytical task in such contexts is to model the layered architecture of networks where the vertices can be naturally partitioned into ordered layers, dictated by multiple platforms, and exhibit both undirected and directed relationships. We propose a multi-layered Gaussian graphical model (mlGGM) to investigate conditional independence structures in such multi-level genomic networks in human cancers. We implement a Bayesian node-wise selection (BANS) approach based on variable selection techniques that coherently accounts for the multiple types of dependencies in mlGGM; this flexible strategy exploits edge-specific prior knowledge and selects sparse and interpretable models. Through simulated data generated under various scenarios, we demonstrate that BANS outperforms other existing multivariate regression-based methodologies. Our integrative genomic network analysis for key signaling pathways across multiple cancer types highlights commonalities and differences of p53 integrative networks and epigenetic effects of BRCA2 on p53 and its interaction with T68 phosphorylated CHK2, that may have translational utilities of finding biomarkers and therapeutic targets.
Collapse
Affiliation(s)
- Min Jin Ha
- Department of Biostatistics, The University of Texas MD Anderson Cancer Center
| | - Francesco Claudio Stingo
- Department of Statistics, Computer Science, Applications "G. Parenti", The University of Florence
| | | |
Collapse
|
6
|
Hinoveanu LC, Leisen F, Villa C. A loss‐based prior for Gaussian graphical models. AUST NZ J STAT 2021. [DOI: 10.1111/anzs.12307] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Laurenţiu Cătălin Hinoveanu
- School of Mathematics, Statistics and Actuarial Science University of Kent Sibson Building Canterbury CT2 7FSUK
| | - Fabrizio Leisen
- School of Mathematical Sciences University of Nottingham University Park Nottingham NG7 2RDUK
| | - Cristiano Villa
- School of Mathematics, Statistics and Physics Newcastle University Herschel Building Newcastle NE1 7RUUK
| |
Collapse
|
7
|
Castelletti F, La Rocca L, Peluso S, Stingo FC, Consonni G. Bayesian learning of multiple directed networks from observational data. Stat Med 2020; 39:4745-4766. [PMID: 32969059 DOI: 10.1002/sim.8751] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2019] [Revised: 06/29/2020] [Accepted: 08/25/2020] [Indexed: 11/08/2022]
Abstract
Graphical modeling represents an established methodology for identifying complex dependencies in biological networks, as exemplified in the study of co-expression, gene regulatory, and protein interaction networks. The available observations often exhibit an intrinsic heterogeneity, which impacts on the network structure through the modification of specific pathways for distinct groups, such as disease subtypes. We propose to infer the resulting multiple graphs jointly in order to benefit from potential similarities across groups; on the other hand our modeling framework is able to accommodate group idiosyncrasies. We consider directed acyclic graphs (DAGs) as network structures, and develop a Bayesian method for structural learning of multiple DAGs. We explicitly account for Markov equivalence of DAGs, and propose a suitable prior on the collection of graph spaces that induces selective borrowing strength across groups. The resulting inference allows in particular to compute the posterior probability of edge inclusion, a useful summary for representing flow directions within the network. Finally, we detail a simulation study addressing the comparative performance of our method, and present an analysis of two protein networks together with a substantive interpretation of our findings.
Collapse
Affiliation(s)
- Federico Castelletti
- Department of Statistical Sciences, Università Cattolica del Sacro Cuore, Milan, Italy
| | - Luca La Rocca
- Department of Physics, Informatics and Mathematics, Università degli Studi di Modena e Reggio Emilia, Modena, Italy
| | - Stefano Peluso
- Department of Statistical Sciences, Università Cattolica del Sacro Cuore, Milan, Italy
| | - Francesco C Stingo
- Department of Statistics, Computer Science, Applications "G. Parenti", Università degli Studi di Firenze, Florence, Italy
| | - Guido Consonni
- Department of Statistical Sciences, Università Cattolica del Sacro Cuore, Milan, Italy
| |
Collapse
|
8
|
Zhang H, Huang X, Han S, Rezwan FI, Karmaus W, Arshad H, Holloway JW. Gaussian Bayesian network comparisons with graph ordering unknown. Comput Stat Data Anal 2020; 157. [PMID: 33408431 DOI: 10.1016/j.csda.2020.107156] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Abstract
A Bayesian approach is proposed that unifies Gaussian Bayesian network constructions and comparisons between two networks (identical or differential) for data with graph ordering unknown. When sampling graph ordering, to escape from local maximums, an adjusted single queue equi-energy algorithm is applied. The conditional posterior probability mass function for network differentiation is derived and its asymptotic proposition is theoretically assessed. Simulations are used to demonstrate the approach and compare with existing methods. Based on epigenetic data at a set of DNA methylation sites (CpG sites), the proposed approach is further examined on its ability to detect network differentiations. Findings from theoretical assessment, simulations, and real data applications support the efficacy and efficiency of the proposed method for network comparisons.
Collapse
Affiliation(s)
- Hongmei Zhang
- Division of Epidemiology, Biostatistics, and Environmental Health, School of Public Health, University of Memphis, Memphis, TN, USA
| | - Xianzheng Huang
- Department of Statistics, University of South Carolina, Columbia, SC, USA
| | - Shengtong Han
- Joseph J. Zilber School of Public Health, University of Wisconsin, Milwaukee, WI, USA
| | - Faisal I Rezwan
- School of Water, Energy and Environment, Cranfield University, Cranfield, Bedfordshire, UK
| | - Wilfried Karmaus
- Division of Epidemiology, Biostatistics, and Environmental Health, School of Public Health, University of Memphis, Memphis, TN, USA
| | - Hasan Arshad
- Clinical and Experimental Sciences, Faculty of Medicine, University of Southampton, Sothampton, UK
- David Hide Asthma and Allergy Research Centre, Isle of Wight, UK
| | - John W Holloway
- Human Development and Health, Faculty of Medicine, University of Southampton, Sothampton, UK
| |
Collapse
|
9
|
Castelletti F. Bayesian Model Selection of Gaussian Directed Acyclic Graph Structures. Int Stat Rev 2020. [DOI: 10.1111/insr.12379] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Federico Castelletti
- Department of Statistical Sciences Università Cattolica del Sacro Cuore Milano Italy
| |
Collapse
|
10
|
Petrakis N, Peluso S, Fouskakis D, Consonni G. Objective methods for graphical structural learning. STAT NEERL 2020. [DOI: 10.1111/stan.12211] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Nikolaos Petrakis
- Department of Statistics and Quantitative MethodsUniversity of Milano‐Bicocca Milano Italy
| | - Stefano Peluso
- Department of Statistical SciencesUniversità Cattolica del Sacro Cuore Milano Italy
| | - Dimitris Fouskakis
- Department of MathematicsNational Technical University of Athens Athens Greece
| | - Guido Consonni
- Department of Statistical SciencesUniversità Cattolica del Sacro Cuore Milano Italy
| |
Collapse
|
11
|
Paci L, Consonni G. Structural learning of contemporaneous dependencies in graphical VAR models. Comput Stat Data Anal 2020. [DOI: 10.1016/j.csda.2019.106880] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
12
|
Cao X, Ding L, Mersha TB. Joint variable selection and network modeling for detecting eQTLs. Stat Appl Genet Mol Biol 2020; 19:/j/sagmb.ahead-of-print/sagmb-2019-0032/sagmb-2019-0032.xml. [PMID: 32078577 DOI: 10.1515/sagmb-2019-0032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
In this study, we conduct a comparison of three most recent statistical methods for joint variable selection and covariance estimation with application of detecting expression quantitative trait loci (eQTL) and gene network estimation, and introduce a new hierarchical Bayesian method to be included in the comparison. Unlike the traditional univariate regression approach in eQTL, all four methods correlate phenotypes and genotypes by multivariate regression models that incorporate the dependence information among phenotypes, and use Bayesian multiplicity adjustment to avoid multiple testing burdens raised by traditional multiple testing correction methods. We presented the performance of three methods (MSSL - Multivariate Spike and Slab Lasso, SSUR - Sparse Seemingly Unrelated Bayesian Regression, and OBFBF - Objective Bayes Fractional Bayes Factor), along with the proposed, JDAG (Joint estimation via a Gaussian Directed Acyclic Graph model) method through simulation experiments, and publicly available HapMap real data, taking asthma as an example. Compared with existing methods, JDAG identified networks with higher sensitivity and specificity under row-wise sparse settings. JDAG requires less execution in small-to-moderate dimensions, but is not currently applicable to high dimensional data. The eQTL analysis in asthma data showed a number of known gene regulations such as STARD3, IKZF3 and PGAP3, all reported in asthma studies. The code of the proposed method is freely available at GitHub (https://github.com/xuan-cao/Joint-estimation-for-eQTL).
Collapse
Affiliation(s)
- Xuan Cao
- Division of Statistics and Data Science, Department of Mathematical Sciences, University of Cincinnati, Cincinnati, OH45221,USA
| | - Lili Ding
- Division of Biostatistics and Epidemiology, Department of Pediatrics, Cincinnati Children's Hospital Medical Center, University of Cincinnati, Cincinnati, OH45229,USA
| | - Tesfaye B Mersha
- Division of Asthma Research, Department of Pediatrics, Cincinnati Children's Hospital Medical Center, University of Cincinnati, Cincinnati, OH45229,USA
| |
Collapse
|
13
|
Peluso S, Consonni G. Compatible priors for model selection of high-dimensional Gaussian DAGs. Electron J Stat 2020. [DOI: 10.1214/20-ejs1768] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
14
|
Castelletti F, Consonni G. Objective Bayes model selection of Gaussian interventional essential graphs for the identification of signaling pathways. Ann Appl Stat 2019. [DOI: 10.1214/19-aoas1275] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
15
|
Cao X, Khare K, Ghosh M. Posterior graph selection and estimation consistency for high-dimensional Bayesian DAG models. Ann Stat 2019. [DOI: 10.1214/18-aos1689] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|