Cited by Other Article(s)

Bernal V, Soancatl-Aguilar V, Bulthuis J, Guryev V, Horvatovich P, Grzegorczyk M. GeneNetTools: tests for Gaussian graphical models with shrinkage. Bioinformatics 2022;38:5049-5054. [PMID: 36179082 PMCID: PMC9665865 DOI: 10.1093/bioinformatics/btac657] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/22/2022] [Revised: 09/14/2022] [Accepted: 09/29/2022] [Indexed: 12/24/2022] Open

Abstract

MOTIVATION

Gaussian graphical models (GGMs) are network representations of random variables (as nodes) and their partial correlations (as edges). GGMs overcome the challenges of high-dimensional data analysis by using shrinkage methodologies, and they have therefore become useful for reconstructing gene regulatory networks from gene-expression profiles. However, it is often ignored that the partial correlations are 'shrunk' and cannot be compared or assessed directly. Accurate (differential) network analyses therefore need to account for the number of variables, the sample size and the shrinkage value; otherwise, the analysis and its biological interpretation would be biased. To date, there are no appropriate methods that account for these factors.
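To illustrate why shrunk partial correlations cannot be read like ordinary ones, the following minimal sketch computes partial correlations from a correlation matrix before and after shrinkage toward the identity. It is not the GeneNetTools implementation: the toy matrix, the fixed shrinkage value `lam`, and the helper names `invert`, `shrink_correlation` and `partial_correlations` are assumptions made for illustration, and the shrinkage value is set by hand rather than estimated from data.

```python
def invert(matrix):
    """Invert a small square matrix by Gauss-Jordan elimination with partial pivoting."""
    n = len(matrix)
    # Augment the matrix with the identity on the right-hand side.
    aug = [[float(v) for v in row] + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [a - f * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def shrink_correlation(corr, lam):
    """Shrink a correlation matrix toward the identity: (1 - lam) * R + lam * I."""
    n = len(corr)
    return [[(1.0 - lam) * corr[i][j] + (lam if i == j else 0.0)
             for j in range(n)] for i in range(n)]


def partial_correlations(corr):
    """Partial correlations from the inverse (precision) matrix of `corr`."""
    omega = invert(corr)
    n = len(omega)
    return [[1.0 if i == j else -omega[i][j] / (omega[i][i] * omega[j][j]) ** 0.5
             for j in range(n)] for i in range(n)]


# Hypothetical 3-variable correlation matrix (illustrative values only).
corr = [[1.0, 0.8, 0.6],
        [0.8, 1.0, 0.5],
        [0.6, 0.5, 1.0]]

unshrunk = partial_correlations(corr)
shrunk = partial_correlations(shrink_correlation(corr, lam=0.5))

print("partial correlation (0,1), lam = 0.0:", round(unshrunk[0][1], 4))
print("partial correlation (0,1), lam = 0.5:", round(shrunk[0][1], 4))
```

In this toy case the same pair of variables receives a noticeably smaller partial correlation once shrinkage is applied, so two networks estimated with different shrinkage values are not on a common scale.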

RESULTS

We derive the statistical properties of the partial correlation obtained with the Ledoit-Wolf shrinkage. Our results provide a toolbox for (differential) network analyses consisting of (i) confidence intervals, (ii) a test for zero partial correlation (null effects) and (iii) a test to compare partial correlations. Our novel (parametric) methods account for the number of variables, the sample size and the shrinkage values. Additionally, they are computationally fast, simple to implement and require only basic statistical knowledge. Our simulations show that the novel tests perform better than DiffNetFDR, a recently published alternative, in terms of the trade-off between true and false positives. The methods are demonstrated on synthetic data and two gene-expression datasets from Escherichia coli and Mus musculus.

AVAILABILITY AND IMPLEMENTATION

The R package with the methods and the R script with the analysis are available at https://github.com/V-Bernal/GeneNetTools.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.


Wang Z, Kaseb AO, Amin HM, Hassan MM, Wang W, Morris JS. Bayesian Edge Regression in Undirected Graphical Models to Characterize Interpatient Heterogeneity in Cancer. J Am Stat Assoc 2022;117:533-546. [PMID: 36090952 PMCID: PMC9454401 DOI: 10.1080/01621459.2021.2000866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/18/2019] [Revised: 07/13/2021] [Accepted: 10/24/2021] [Indexed: 10/19/2022]

Tan YT, Ou-Yang L, Jiang X, Yan H, Zhang XF. Identifying Gene Network Rewiring Based on Partial Correlation. IEEE/ACM Trans Comput Biol Bioinform 2022;19:513-521. [PMID: 32750866 DOI: 10.1109/tcbb.2020.3002906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/11/2023]

Kim B, Liu S, Kolar M. Two‐sample inference for high‐dimensional Markov networks. J R Stat Soc Series B Stat Methodol 2021. [DOI: 10.1111/rssb.12446] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Indexed: 11/28/2022]

Dai R, Kolar M. Inference for high-dimensional varying-coefficient quantile regression. Electron J Stat 2021. [DOI: 10.1214/21-ejs1919] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/19/2022]

Zhang XF, Ou-Yang L, Yang S, Hu X, Yan H. DiffNetFDR: differential network analysis with false discovery rate control. Bioinformatics 2020;35:3184-3186. [PMID: 30689728 DOI: 10.1093/bioinformatics/btz051] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 05/18/2018] [Revised: 01/10/2019] [Accepted: 01/20/2019] [Indexed: 11/13/2022] Open

Statistics in the Genomic Era. Genes (Basel) 2020;11:genes11040443. [PMID: 32325634 PMCID: PMC7230157 DOI: 10.3390/genes11040443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/14/2020] [Accepted: 04/15/2020] [Indexed: 11/29/2022] Open

Pan Y, Mai Q. Efficient computation for differential network analysis with applications to quadratic discriminant analysis. Comput Stat Data Anal 2020. [DOI: 10.1016/j.csda.2019.106884] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 01/18/2023]

Zhang Q. Testing Differential Gene Networks under Nonparanormal Graphical Models with False Discovery Rate Control. Genes (Basel) 2020;11:E167. [PMID: 32033447 PMCID: PMC7073847 DOI: 10.3390/genes11020167] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 01/20/2020] [Revised: 01/27/2020] [Accepted: 01/30/2020] [Indexed: 11/16/2022] Open

Zhang Q. Direct estimation of differential networks under high‐dimensional nonparanormal graphical models. Can J Stat 2019. [DOI: 10.1002/cjs.11526] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Indexed: 11/09/2022]

Zhang Q, Du Y. Model-free feature screening for categorical outcomes: Nonlinear effect detection and false discovery rate control. PLoS One 2019;14:e0217463. [PMID: 31150453 PMCID: PMC6544247 DOI: 10.1371/journal.pone.0217463] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/23/2018] [Accepted: 05/13/2019] [Indexed: 11/19/2022] Open

He Y, Ji J, Xie L, Zhang X, Xue F. A new insight into underlying disease mechanism through semi-parametric latent differential network model. BMC Bioinformatics 2018;19:493. [PMID: 30591011 PMCID: PMC6309076 DOI: 10.1186/s12859-018-2461-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Indexed: 01/19/2023] Open

Abstract

BACKGROUND

In genomic studies, investigating how the structure of a genetic network differs between two experimental conditions is an interesting but challenging problem, especially in the high-dimensional setting. The existing literature mostly focuses on differential network modelling for continuous data. In real applications, however, we may encounter discrete or mixed data, which motivates a unified differential network model for various data types.

RESULTS

We propose a unified latent Gaussian copula differential network model, which provides a deeper understanding of the unknown mechanism than can be obtained from the observed variables alone. Adaptive rank-based estimation approaches are proposed under the assumption that the true differential network is sparse. The adaptive estimation approaches do not require the precision matrices to be sparse and can thus allow the individual networks to contain hub nodes. Theoretical analysis shows that the proposed methods achieve the same parametric convergence rate for both the estimation of the difference of the precision matrices and differential structure recovery, which means that the extra modelling flexibility comes at almost no cost in statistical efficiency. Besides the theoretical analysis, thorough numerical simulations are conducted to compare the empirical performance of the proposed methods with that of other state-of-the-art methods. The results show that the proposed methods work well for various data types. The proposed method is then applied to gene expression data associated with lung cancer to illustrate its empirical usefulness.

CONCLUSIONS

The proposed latent variable differential network models allow for various data types and are thus more flexible; they also provide a deeper understanding of the unknown mechanism than can be obtained from the observed variables alone. Theoretical analysis, numerical simulation and a real application all demonstrate the advantages of latent differential network modelling, which is therefore highly recommended.
