Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zhao H, Duan ZH. Cancer Genetic Network Inference Using Gaussian Graphical Models. Bioinform Biol Insights 2019;13:1177932219839402. [PMID: 31007526 PMCID: PMC6456846 DOI: 10.1177/1177932219839402] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2019] [Accepted: 03/04/2019] [Indexed: 02/06/2023] Open

For:	Zhao H, Duan ZH. Cancer Genetic Network Inference Using Gaussian Graphical Models. Bioinform Biol Insights 2019;13:1177932219839402. [PMID: 31007526 PMCID: PMC6456846 DOI: 10.1177/1177932219839402] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2019] [Accepted: 03/04/2019] [Indexed: 02/06/2023] Open

Number

Cited by Other Article(s)

Martins S, Coletti R, Lopes MB. Disclosing transcriptomics network-based signatures of glioma heterogeneity using sparse methods. BioData Min 2023;16:26. [PMID: 37752578 PMCID: PMC10523751 DOI: 10.1186/s13040-023-00341-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 08/13/2023] [Indexed: 09/28/2023] Open

Buck L, Schmidt T, Feist M, Schwarzfischer P, Kube D, Oefner PJ, Zacharias HU, Altenbuchinger M, Dettmer K, Gronwald W, Spang R. Anomaly detection in mixed high-dimensional molecular data. Bioinformatics 2023;39:btad501. [PMID: 37584673 PMCID: PMC10457663 DOI: 10.1093/bioinformatics/btad501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Revised: 07/21/2023] [Accepted: 08/14/2023] [Indexed: 08/17/2023] Open

Abstract

MOTIVATION

Mixed molecular data combines continuous and categorical features of the same samples, such as OMICS profiles with genotypes, diagnoses, or patient sex. Like all high-dimensional molecular data, it is prone to incorrect values that can stem from various sources for example the technical limitations of the measurement devices, errors in the sample preparation, or contamination. Most anomaly detection algorithms identify complete samples as outliers or anomalies. However, in most cases, not all measurements of those samples are erroneous but only a few one-dimensional features within the samples are incorrect. These one-dimensional data errors are continuous measurements that are either located outside or inside the normal ranges of their features but in both cases show atypical values given all other continuous and categorical features in the sample. Additionally, categorical anomalies can occur for example when the genotype or diagnosis was submitted wrongly.

RESULTS

We introduce ADMIRE (Anomaly Detection using MIxed gRaphical modEls), a novel approach for the detection and correction of anomalies in mixed high-dimensional data. Hereby, we focus on the detection of single (one-dimensional) data errors in the categorical and continuous features of a sample. For that the joint distribution of continuous and categorical features is learned by mixed graphical models, anomalies are detected by the difference between measured and model-based estimations and are corrected using imputation. We evaluated ADMIRE in simulation and by screening for anomalies in one of our own metabolic datasets. In simulation experiments, ADMIRE outperformed the state-of-the-art methods of Local Outlier Factor, stray, and Isolation Forest.

AVAILABILITY AND IMPLEMENTATION

All data and code is available at https://github.com/spang-lab/adadmire. ADMIRE is implemented in a Python package called adadmire which can be found at https://pypi.org/project/adadmire.

Collapse

Saint-Antoine M, Singh A. Benchmarking Gene Regulatory Network Inference Methods on Simulated and Experimental Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.12.540581. [PMID: 37215029 PMCID: PMC10197678 DOI: 10.1101/2023.05.12.540581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]

Aldirawi H, Morales FG. Univariate and Multivariate Statistical Analysis of Microbiome Data: An Overview. Appl Microbiol 2023. [DOI: 10.3390/applmicrobiol3020023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/30/2023]

Vásquez AR, Márquez Urbina JU, González Farías G, Escarela G. Controlling the false discovery rate by a Latent Gaussian Copula Knockoff procedure. Comput Stat 2023. [DOI: 10.1007/s00180-023-01346-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/28/2023]

Chen Y, Zhang XF, Ou-Yang L. Inferring cancer common and specific gene networks via multi-layer joint graphical model. Comput Struct Biotechnol J 2023;21:974-990. [PMID: 36733706 PMCID: PMC9873583 DOI: 10.1016/j.csbj.2023.01.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Revised: 01/08/2023] [Accepted: 01/14/2023] [Indexed: 01/19/2023] Open

Seal S, Li Q, Basner EB, Saba LM, Kechris K. RCFGL: Rapid Condition adaptive Fused Graphical Lasso and application to modeling brain region co-expression networks. PLoS Comput Biol 2023;19:e1010758. [PMID: 36607897 PMCID: PMC9821764 DOI: 10.1371/journal.pcbi.1010758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2022] [Accepted: 11/24/2022] [Indexed: 01/07/2023] Open

Abstract

Inferring gene co-expression networks is a useful process for understanding gene regulation and pathway activity. The networks are usually undirected graphs where genes are represented as nodes and an edge represents a significant co-expression relationship. When expression data of multiple (p) genes in multiple (K) conditions (e.g., treatments, tissues, strains) are available, joint estimation of networks harnessing shared information across them can significantly increase the power of analysis. In addition, examining condition-specific patterns of co-expression can provide insights into the underlying cellular processes activated in a particular condition. Condition adaptive fused graphical lasso (CFGL) is an existing method that incorporates condition specificity in a fused graphical lasso (FGL) model for estimating multiple co-expression networks. However, with computational complexity of O(p2K log K), the current implementation of CFGL is prohibitively slow even for a moderate number of genes and can only be used for a maximum of three conditions. In this paper, we propose a faster alternative of CFGL named rapid condition adaptive fused graphical lasso (RCFGL). In RCFGL, we incorporate the condition specificity into another popular model for joint network estimation, known as fused multiple graphical lasso (FMGL). We use a more efficient algorithm in the iterative steps compared to CFGL, enabling faster computation with complexity of O(p2K) and making it easily generalizable for more than three conditions. We also present a novel screening rule to determine if the full network estimation problem can be broken down into estimation of smaller disjoint sub-networks, thereby reducing the complexity further. We demonstrate the computational advantage and superior performance of our method compared to two non-condition adaptive methods, FGL and FMGL, and one condition adaptive method, CFGL in both simulation study and real data analysis. We used RCFGL to jointly estimate the gene co-expression networks in different brain regions (conditions) using a cohort of heterogeneous stock rats. We also provide an accommodating C and Python based package that implements RCFGL.

Collapse

Elgart M, Goodman MO, Isasi C, Chen H, Morrison AC, de Vries PS, Xu H, Manichaikul AW, Guo X, Franceschini N, Psaty BM, Rich SS, Rotter JI, Lloyd-Jones DM, Fornage M, Correa A, Heard-Costa NL, Vasan RS, Hernandez R, Kaplan RC, Redline S, Sofer T. Correlations between complex human phenotypes vary by genetic background, gender, and environment. Cell Rep Med 2022;3:100844. [PMID: 36513073 PMCID: PMC9797952 DOI: 10.1016/j.xcrm.2022.100844] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2021] [Revised: 07/11/2022] [Accepted: 11/09/2022] [Indexed: 12/15/2022]

Affiliation(s)

Michael Elgart Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA; Department of Medicine, Harvard Medical School, Boston, MA, USA.
Matthew O Goodman Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA; Department of Medicine, Harvard Medical School, Boston, MA, USA
Carmen Isasi Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA
Han Chen Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA; Center for Precision Health, School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX, USA
Alanna C Morrison Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Paul S de Vries Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA
Huichun Xu Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
Ani W Manichaikul Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
Xiuqing Guo The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
Nora Franceschini Department of Epidemiology, University of North Carolina, Chapel Hill, NC, USA
Bruce M Psaty Cardiovascular Health Research Unit, Departments of Medicine, Epidemiology, and Health Services, University of Washington, Seattle, WA, USA
Stephen S Rich Center for Public Health Genomics, University of Virginia School of Medicine, Charlottesville, VA, USA
Jerome I Rotter The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
Donald M Lloyd-Jones Department of Preventive Medicine, Northwestern University, Chicago, IL, USA
Myriam Fornage Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX, USA; Brown Foundation Institute of Molecular Medicine, McGovern Medical School, University of Texas Health Science Center at Houston, Houston, TX, USA
Adolfo Correa Department of Population Health Science, University of Mississippi Medical Center, Jackson, MS, USA
Nancy L Heard-Costa Boston University and National Heart Lung and Blood Institute's Framingham Heart Study, Framingham, MA, USA; Department of Neurology, Boston University School of Medicine, Boston, MA, USA
Ramachandran S Vasan Boston University and National Heart Lung and Blood Institute's Framingham Heart Study, Framingham, MA, USA; Preventive Medicine & Epidemiology, and Cardiovascular Medicine, Medicine, Boston University School of Medicine, and Epidemiology, Boston University School of Public Health, Boston, MA, USA
Ryan Hernandez Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA
Robert C Kaplan Department of Epidemiology and Population Health, Albert Einstein College of Medicine, Bronx, NY, USA; Fred Hutchinson Cancer Research Center, Division of Public Health Sciences, Seattle, WA, USA
Susan Redline Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA; Department of Medicine, Harvard Medical School, Boston, MA, USA
Tamar Sofer Division of Sleep and Circadian Disorders, Brigham and Women's Hospital, Boston, MA, USA; Department of Medicine, Harvard Medical School, Boston, MA, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA.

Collapse

Learning complex dependency structure of gene regulatory networks from high dimensional microarray data with Gaussian Bayesian networks. Sci Rep 2022;12:18704. [PMID: 36333425 PMCID: PMC9636198 DOI: 10.1038/s41598-022-21957-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2022] [Accepted: 10/06/2022] [Indexed: 11/06/2022] Open

Leng J, Wu LY. Interaction-based transcriptome analysis via differential network inference. Brief Bioinform 2022;23:6768051. [PMID: 36274239 PMCID: PMC9677477 DOI: 10.1093/bib/bbac466] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 09/13/2022] [Accepted: 09/28/2022] [Indexed: 12/14/2022] Open

On principal graphical models with application to gene network. Comput Stat Data Anal 2022. [DOI: 10.1016/j.csda.2021.107344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Bodein A, Scott-Boyer MP, Perin O, Lê Cao KA, Droit A. Interpretation of network-based integration from multi-omics longitudinal data. Nucleic Acids Res 2021;50:e27. [PMID: 34883510 PMCID: PMC8934642 DOI: 10.1093/nar/gkab1200] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2021] [Revised: 10/19/2021] [Accepted: 11/22/2021] [Indexed: 12/26/2022] Open

Estimation of Gene Regulatory Networks from Cancer Transcriptomics Data. Processes (Basel) 2021. [DOI: 10.3390/pr9101758] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open

scLink: Inferring Sparse Gene Co-expression Networks from Single-cell Expression Data. GENOMICS PROTEOMICS & BIOINFORMATICS 2021;19:475-492. [PMID: 34252628 PMCID: PMC8896229 DOI: 10.1016/j.gpb.2020.11.006] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Revised: 10/23/2020] [Accepted: 12/26/2020] [Indexed: 11/23/2022]

Yi H, Zhang Q, Sun Y, Ma S. Assisted estimation of gene expression graphical models. Genet Epidemiol 2021;45:372-385. [PMID: 33527531 PMCID: PMC8137544 DOI: 10.1002/gepi.22377] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2020] [Revised: 12/16/2020] [Accepted: 12/31/2020] [Indexed: 02/02/2023]

Hoang T, Lee J, Kim J. Differences in Dietary Patterns Identified by the Gaussian Graphical Model in Korean Adults With and Without a Self-Reported Cancer Diagnosis. J Acad Nutr Diet 2020;121:1484-1496.e3. [PMID: 33288494 DOI: 10.1016/j.jand.2020.11.006] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2019] [Revised: 11/04/2020] [Accepted: 11/10/2020] [Indexed: 01/02/2023]

Abstract

BACKGROUND

The synergistic effect of food groups on health outcomes is better captured by examining dietary patterns (DPs) than single food groups. Regarding this issue, a Gaussian graphical model (GGM) can identify pairwise correlations between food groups and adjust for the remaining items. However, the application of GGMs in the nutritional field has not been widely investigated, especially in Korean adults.

OBJECTIVE

The aim of this study was to identify the major DPs of Korean adults by using a GGM and to examine the associations between the DP scores and prevalence of self-reported cancer.

DESIGN

This cross-sectional study used baseline data from the 2007-2019 Cancer Screenee Cohort of the National Cancer Center, Korea.

PARTICIPANTS/SETTING

In total, 10,777 Korean adults who completed a questionnaire regarding their general medical history, including clinical test results, and a validated food frequency questionnaire were included.

MAIN OUTCOME MEASURES

The main outcome measure was the prevalence of self-reported cancer at baseline.

STATISTICAL ANALYSIS

DP networks were identified using a GGM. The GGM-identified networks were scored and categorized into tertiles, and their association with the prevalence of self-reported cancer was investigated using a multivariable logistic regression model.

RESULTS

The GGM identified the following 4 DP networks: principal, oil-sweet, meat, and fruit. After adjusting for covariates, the odds of moderate and high consumption of foods in the oil-sweet DP for participants who self-reported cancer were 25% and 34% lower than those for participants who did not report a cancer diagnosis (odds ratio [OR] = 0.75, 95% confidence interval [CI] = 0.62-0.90 and OR = 0.66, 95% CI = 0.53-0.81, respectively). Additionally, the odds of meat DP consumption in the self-reported cancer group was 29% lower than in participants who did not report a cancer diagnosis (OR = 0.71 and 95% CI = 0.57-0.88). In contrast, an increase in the odds of fruit DP consumption was observed for self-reported cancer participants (OR = 1.34 and 95% CI = 1.09-1.65). Similar results were observed among the female but not the male subjects.

CONCLUSIONS

GGM is a novel method that can distinguish the direct pairwise correlation of food groups and control for the indirect effect of other foods. Future large-scale longitudinal population-based studies are needed to build on these findings in general populations.

Collapse

Wu N, Yin F, Ou-Yang L, Zhu Z, Xie W. Joint learning of multiple gene networks from single-cell gene expression data. Comput Struct Biotechnol J 2020;18:2583-2595. [PMID: 33033579 PMCID: PMC7527714 DOI: 10.1016/j.csbj.2020.09.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Revised: 08/31/2020] [Accepted: 09/01/2020] [Indexed: 11/24/2022] Open

Kim AA, Rachid Zaim S, Subbian V. Assessing reproducibility and veracity across machine learning techniques in biomedicine: A case study using TCGA data. Int J Med Inform 2020;141:104148. [DOI: 10.1016/j.ijmedinf.2020.104148] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2019] [Revised: 03/22/2020] [Accepted: 04/16/2020] [Indexed: 11/28/2022]

Jiang S, Xiao G, Koh AY, Chen Y, Yao B, Li Q, Zhan X. HARMONIES: A Hybrid Approach for Microbiome Networks Inference via Exploiting Sparsity. Front Genet 2020;11:445. [PMID: 32582274 PMCID: PMC7283552 DOI: 10.3389/fgene.2020.00445] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2019] [Accepted: 04/14/2020] [Indexed: 12/19/2022] Open

Saint-Antoine MM, Singh A. Network inference in systems biology: recent developments, challenges, and applications. Curr Opin Biotechnol 2020;63:89-98. [PMID: 31927423 PMCID: PMC7308210 DOI: 10.1016/j.copbio.2019.12.002] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2019] [Accepted: 12/03/2019] [Indexed: 12/12/2022]

Abbas-Aghababazadeh F, Mo Q, Fridley BL. Statistical genomics in rare cancer. Semin Cancer Biol 2019;61:1-10. [PMID: 31437624 DOI: 10.1016/j.semcancer.2019.08.021] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2019] [Revised: 08/14/2019] [Accepted: 08/17/2019] [Indexed: 12/26/2022]