Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Pendergrass SA, Frase A, Wallace J, Wolfe D, Katiyar N, Moore C, Ritchie MD. Genomic analyses with biofilter 2.0: knowledge driven filtering, annotation, and model development. BioData Min 2013;6:25. [PMID: 24378202 PMCID: PMC3917600 DOI: 10.1186/1756-0381-6-25] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2013] [Accepted: 12/19/2013] [Indexed: 01/01/2023] Open

For:	Pendergrass SA, Frase A, Wallace J, Wolfe D, Katiyar N, Moore C, Ritchie MD. Genomic analyses with biofilter 2.0: knowledge driven filtering, annotation, and model development. BioData Min 2013;6:25. [PMID: 24378202 PMCID: PMC3917600 DOI: 10.1186/1756-0381-6-25] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2013] [Accepted: 12/19/2013] [Indexed: 01/01/2023] Open

Number

Cited by Other Article(s)

Verma SS, Guare L, Ehsan S, Gastounioti A, Scales G, Ritchie MD, Kontos D, McCarthy AM. Genome-Wide Association Study of Breast Density among Women of African Ancestry. Cancers (Basel) 2023;15:2776. [PMID: 37345113 DOI: 10.3390/cancers15102776] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 05/03/2023] [Accepted: 05/11/2023] [Indexed: 06/23/2023] Open

Chattopadhyay A, Shih CY, Hsu YC, Juang JMJ, Chuang EY, Lu TP. CLIN_SKAT: an R package to conduct association analysis using functionally relevant variants. BMC Bioinformatics 2022;23:441. [PMID: 36274122 PMCID: PMC9590128 DOI: 10.1186/s12859-022-04987-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Accepted: 10/16/2022] [Indexed: 12/03/2022] Open

Abstract

Background

Availability of next generation sequencing data, allows low-frequency and rare variants to be studied through strategies other than the commonly used genome-wide association studies (GWAS). Rare variants are important keys towards explaining the heritability for complex diseases that remains to be explained by common variants due to their low effect sizes. However, analysis strategies struggle to keep up with the huge amount of data at disposal therefore creating a bottleneck. This study describes CLIN_SKAT, an R package, that provides users with an easily implemented analysis pipeline with the goal of (i) extracting clinically relevant variants (both rare and common), followed by (ii) gene-based association analysis by grouping the selected variants.

Results

CLIN_SKAT offers four simple functions that can be used to obtain clinically relevant variants, map them to genes or gene sets, calculate weights from global healthy populations and conduct weighted case–control analysis. CLIN_SKAT introduces improvements by adding certain pre-analysis steps and customizable features to make the SKAT results clinically more meaningful. Moreover, it offers several plot functions that can be availed towards obtaining visualizations for interpretation of the analyses results. CLIN_SKAT is available on Windows/Linux/MacOS and is operative for R version 4.0.4 or later. It can be freely downloaded from https://github.com/ShihChingYu/CLIN_SKAT, installed through devtools::install_github("ShihChingYu/CLIN_SKAT", force=T) and executed by loading the package into R using library(CLIN_SKAT). All outputs (tabular and graphical) can be downloaded in simple, publishable formats.

Conclusions

Statistical association analysis is often underpowered due to low sample sizes and high numbers of variants to be tested, limiting detection of causal ones. Therefore, retaining a subset of variants that are biologically meaningful seems to be a more effective strategy for identifying explainable associations while reducing the degrees of freedom. CLIN_SKAT offers users a one-stop R package that identifies disease risk variants with improved power via a series of tailor-made procedures that allows dimension reduction, by retaining functionally relevant variants, and incorporating ethnicity based priors. Furthermore, it also eliminates the requirement for high computational resources and bioinformatics expertise.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12859-022-04987-2.

Collapse

Kumar A, Sandhu N, Kumar P, Pruthi G, Singh J, Kaur S, Chhuneja P. Genome-wide identification and in silico analysis of NPF, NRT2, CLC and SLAC1/SLAH nitrate transporters in hexaploid wheat (Triticum aestivum). Sci Rep 2022;12:11227. [PMID: 35781289 PMCID: PMC9250930 DOI: 10.1038/s41598-022-15202-w] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2021] [Accepted: 06/20/2022] [Indexed: 11/09/2022] Open

Slim L, Chatelain C, Foucauld HD, Azencott CA. A systematic analysis of gene-gene interaction in multiple sclerosis. BMC Med Genomics 2022;15:100. [PMID: 35501860 PMCID: PMC9063218 DOI: 10.1186/s12920-022-01247-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Accepted: 03/28/2022] [Indexed: 12/03/2022] Open

Abstract

Background

For the most part, genome-wide association studies (GWAS) have only partially explained the heritability of complex diseases. One of their limitations is to assume independent contributions of individual variants to the phenotype. Many tools have therefore been developed to investigate the interactions between distant loci, or epistasis. Among them, the recently proposed EpiGWAS models the interactions between a target variant and the rest of the genome. However, applying this approach to studying interactions along all genes of a disease map is not straightforward. Here, we propose a pipeline to that effect, which we illustrate by investigating a multiple sclerosis GWAS dataset from the Wellcome Trust Case Control Consortium 2 through 19 disease maps from the MetaCore pathway database.

Results

For each disease map, we build an epistatic network by connecting the genes that are deemed to interact. These networks tend to be connected, complementary to the disease maps and contain hubs. In addition, we report 4 epistatic gene pairs involving missense variants, and 25 gene pairs with a deleterious epistatic effect mediated by eQTLs. Among these, we highlight the interaction of GLI-1 and SUFU, and of IP10 and NF-\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\kappa$$\end{document}κB, as they both match known biological interactions. The latter pair is particularly promising for therapeutic development, as both genes have known inhibitors.

Conclusions

Our study showcases the ability of EpiGWAS to uncover biologically interpretable epistatic interactions that are potentially actionable for the development of combination therapy.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12920-022-01247-3.

Collapse

Coltelli L, Allegrini G, Orlandi P, Finale C, Fontana A, Masini LC, Scalese M, Arrighi G, Barletta MT, De Maio E, Banchi M, Fini E, Guidi P, Frenzilli G, Donati S, Giovannelli S, Tanganelli L, Salvadori B, Livi L, Meattini I, Pazzagli I, Di Lieto M, Pistelli M, Casadei V, Ferro A, Cupini S, Orlandi F, Francesca D, Lorenzini G, Barellini L, Falcone A, Cosimi A, Bocci G. A pharmacogenetic interaction analysis of bevacizumab with paclitaxel in advanced breast cancer patients. NPJ Breast Cancer 2022;8:33. [PMID: 35314692 PMCID: PMC8938486 DOI: 10.1038/s41523-022-00400-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Accepted: 02/07/2022] [Indexed: 11/18/2022] Open

Abstract

To investigate pharmacogenetic interactions among VEGF-A, VEGFR-2, IL-8, HIF-1α, EPAS-1, and TSP-1 SNPs and their role on progression-free survival (PFS) in metastatic breast cancer (MBC) patients treated with bevacizumab plus first-line paclitaxel or with paclitaxel alone. Analyses were performed on germline DNA, and SNPs were investigated by real-time PCR technique. The multifactor dimensionality reduction (MDR) methodology was applied to investigate the interaction between SNPs. The present study was an explorative, ambidirectional cohort study: 307 patients from 11 Oncology Units were evaluated retrospectively from 2009 to 2016, then followed prospectively (NCT01935102). Two hundred and fifteen patients were treated with paclitaxel and bevacizumab, whereas 92 patients with paclitaxel alone. In the bevacizumab plus paclitaxel group, the MDR software provided two pharmacogenetic interaction profiles consisting of the combination between specific VEGF-A rs833061 and VEGFR-2 rs1870377 genotypes. Median PFS for favorable genetic profile was 16.8 vs. the 10.6 months of unfavorable genetic profile (p = 0.0011). Cox proportional hazards model showed an adjusted hazard ratio of 0.64 (95% CI, 0.5–0.9; p = 0.004). Median OS for the favorable genetic profile was 39.6 vs. 28 months of unfavorable genetic profile (p = 0.0103). Cox proportional hazards model revealed an adjusted hazard ratio of 0.71 (95% CI, 0.5–1.01; p = 0.058). In the 92 patients treated with paclitaxel alone, the results showed no effect of the favorable genetic profile, as compared to the unfavorable genetic profile, either on the PFS (p = 0.509) and on the OS (p = 0.732). The pharmacogenetic statistical interaction between VEGF-A rs833061 and VEGFR-2 rs1870377 genotypes may identify a population of bevacizumab-treated patients with a better PFS.

Collapse

Duroux D, Climente-González H, Azencott CA, Van Steen K. Interpretable network-guided epistasis detection. Gigascience 2022;11:6521880. [PMID: 35134928 PMCID: PMC8848319 DOI: 10.1093/gigascience/giab093] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Revised: 10/12/2021] [Accepted: 12/13/2021] [Indexed: 11/15/2022] Open

Investigation of gene-gene interactions in cardiac traits and serum fatty acid levels in the LURIC Health Study. PLoS One 2020;15:e0238304. [PMID: 32915819 PMCID: PMC7485803 DOI: 10.1371/journal.pone.0238304] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Accepted: 08/13/2020] [Indexed: 01/25/2023] Open

Abstract

Epistasis analysis elucidates the effects of gene-gene interactions (G×G) between multiple loci for complex traits. However, the large computational demands and the high multiple testing burden impede their discoveries. Here, we illustrate the utilization of two methods, main effect filtering based on individual GWAS results and biological knowledge-based modeling through Biofilter software, to reduce the number of interactions tested among single nucleotide polymorphisms (SNPs) for 15 cardiac-related traits and 14 fatty acids. We performed interaction analyses using the two filtering methods, adjusting for age, sex, body mass index (BMI), waist-hip ratio, and the first three principal components from genetic data, among 2,824 samples from the Ludwigshafen Risk and Cardiovascular (LURIC) Health Study. Using Biofilter, one interaction nearly met Bonferroni significance: an interaction between rs7735781 in XRCC4 and rs10804247 in XRCC5 was identified for venous thrombosis with a Bonferroni-adjusted likelihood ratio test (LRT) p: 0.0627. A total of 57 interactions were identified from main effect filtering for the cardiac traits G×G (10) and fatty acids G×G (47) at Bonferroni-adjusted LRT p < 0.05. For cardiac traits, the top interaction involved SNPs rs1383819 in SNTG1 and rs1493939 (138kb from 5’ of SAMD12) with Bonferroni-adjusted LRT p: 0.0228 which was significantly associated with history of arterial hypertension. For fatty acids, the top interaction between rs4839193 in KCND3 and rs10829717 in LOC107984002 with Bonferroni-adjusted LRT p: 2.28×10⁻⁵ was associated with 9-trans 12-trans octadecanoic acid, an omega-6 trans fatty acid. The model inflation factor for the interactions under different filtering methods was evaluated from the standard median and the linear regression approach. Here, we applied filtering approaches to identify numerous genetic interactions related to cardiac-related outcomes as potential targets for therapy. The approaches described offer ways to detect epistasis in the complex traits and to improve precision medicine capability.

Collapse

Basile AO, Byrska-Bishop M, Wallace J, Frase AT, Ritchie MD. Novel features and enhancements in BioBin, a tool for the biologically inspired binning and association analysis of rare variants. Bioinformatics 2018;34:527-529. [PMID: 28968757 PMCID: PMC5860358 DOI: 10.1093/bioinformatics/btx559] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2017] [Accepted: 09/13/2017] [Indexed: 11/27/2022] Open

Manduchi E, Williams SM, Chesi A, Johnson ME, Wells AD, Grant SFA, Moore JH. Leveraging epigenomics and contactomics data to investigate SNP pairs in GWAS. Hum Genet 2018;137:413-425. [PMID: 29797095 PMCID: PMC5996751 DOI: 10.1007/s00439-018-1893-0] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2018] [Accepted: 05/20/2018] [Indexed: 12/29/2022]

Ritchie MD, Van Steen K. The search for gene-gene interactions in genome-wide association studies: challenges in abundance of methods, practical considerations, and biological interpretation. ANNALS OF TRANSLATIONAL MEDICINE 2018;6:157. [PMID: 29862246 DOI: 10.21037/atm.2018.04.05] [Citation(s) in RCA: 58] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Abstract

One of the primary goals in this era of precision medicine is to understand the biology of human diseases and their treatment, such that each individual patient receives the best possible treatment for their disease based on their genetic and environmental exposures. One way to work towards achieving this goal is to identify the environmental exposures and genetic variants that are relevant to each disease in question, as well as the complex interplay between genes and environment. Genome-wide association studies (GWAS) have allowed for a greater understanding of the genetic component of many complex traits. However, these genetic effects are largely small and thus, our ability to use these GWAS finding for precision medicine is limited. As more and more GWAS have been performed, rather than focusing only on common single nucleotide polymorphisms (SNPs) and additive genetic models, many researchers have begun to explore alternative heritable components of complex traits including rare variants, structural variants, epigenetics, and genetic interactions. While genetic interactions are a plausible reality that could explain some of the heritabliy that has not yet been identified, especially when one considers the identification of genetic interactions in model organisms as well as our understanding of biological complexity, still there are significant challenges and considerations in identifying these genetic interactions. Broadly, these can be summarized in three categories: abundance of methods, practical considerations, and biological interpretation. In this review, we will discuss these important elements in the search for genetic interactions along with some potential solutions. While genetic interactions are theoretically understood to be important for complex human disease, the body of evidence is still building to support this component of the underlying genetic architecture of complex human traits. Our hope is that more sophisticated modeling approaches and more robust computational techniques will enable the community to identify these important genetic interactions and improve our ability to implement precision medicine in the future.

Collapse

Verma SS, Josyula N, Verma A, Zhang X, Veturi Y, Dewey FE, Hartzel DN, Lavage DR, Leader J, Ritchie MD, Pendergrass SA. Rare variants in drug target genes contributing to complex diseases, phenome-wide. Sci Rep 2018;8:4624. [PMID: 29545597 PMCID: PMC5854600 DOI: 10.1038/s41598-018-22834-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2017] [Accepted: 03/01/2018] [Indexed: 12/30/2022] Open

Manduchi E, Chesi A, Hall MA, Grant SFA, Moore JH. Leveraging putative enhancer-promoter interactions to investigate two-way epistasis in Type 2 Diabetes GWAS. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2018;23:548-558. [PMID: 29218913 PMCID: PMC5728670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Kim D, Li R, Lucas A, Verma SS, Dudek SM, Ritchie MD. Using knowledge-driven genomic interactions for multi-omics data analysis: metadimensional models for predicting clinical outcomes in ovarian carcinoma. J Am Med Inform Assoc 2017;24:577-587. [PMID: 28040685 DOI: 10.1093/jamia/ocw165] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2016] [Accepted: 12/02/2016] [Indexed: 02/07/2023] Open

Abstract

It is common that cancer patients have different molecular signatures even though they have similar clinical features, such as histology, due to the heterogeneity of tumors. To overcome this variability, we previously developed a new approach incorporating prior biological knowledge that identifies knowledge-driven genomic interactions associated with outcomes of interest. However, no systematic approach has been proposed to identify interaction models between pathways based on multi-omics data. Here we have proposed such a novel methodological framework, called metadimensional knowledge-driven genomic interactions (MKGIs). To test the utility of the proposed framework, we applied it to an ovarian cancer dataset including multi-omics profiles from The Cancer Genome Atlas to predict grade, stage, and survival outcome. We found that each knowledge-driven genomic interaction model, based on different genomic datasets, contains different sets of pathway features, which suggests that each genomic data type may contribute to outcomes in ovarian cancer via a different pathway. In addition, MKGI models significantly outperformed the single knowledge-driven genomic interaction model. From the MKGI models, many interactions between pathways associated with outcomes were found, including the mitogen-activated protein kinase (MAPK) signaling pathway and the gonadotropin-releasing hormone (GnRH) signaling pathway, which are known to play important roles in cancer pathogenesis. The beauty of incorporating biological knowledge into the model based on multi-omics data is the ability to improve diagnosis and prognosis and provide better interpretability. Thus, determining variability in molecular signatures based on these interactions between pathways may lead to better diagnostic/treatment strategies for better precision medicine.

Collapse

Hall MA, Wallace J, Lucas A, Kim D, Basile AO, Verma SS, McCarty CA, Brilliant MH, Peissig PL, Kitchner TE, Verma A, Pendergrass SA, Dudek SM, Moore JH, Ritchie MD. PLATO software provides analytic framework for investigating complexity beyond genome-wide association studies. Nat Commun 2017;8:1167. [PMID: 29079728 PMCID: PMC5660079 DOI: 10.1038/s41467-017-00802-2] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2016] [Accepted: 07/28/2017] [Indexed: 12/22/2022] Open

Affiliation(s)

Molly A Hall Institute for Biomedical Informatics, Departments of Genetics and Biostatistics and Epidemiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
John Wallace Biomedical and Translational Informatics Institute, Geisinger Health System, Danville, PA, 17821, USA
Anastasia Lucas Biomedical and Translational Informatics Institute, Geisinger Health System, Danville, PA, 17821, USA
Dokyoon Kim Biomedical and Translational Informatics Institute, Geisinger Health System, Danville, PA, 17821, USA
Anna O Basile Department of Biochemistry and Molecular Biology, Center for Systems Genomics, Eberly College of Science, The Pennsylvania State University, University Park, PA, 16802, USA
Shefali S Verma Biomedical and Translational Informatics Institute, Geisinger Health System, Danville, PA, 17821, USA.,Department of Biochemistry and Molecular Biology, Center for Systems Genomics, Eberly College of Science, The Pennsylvania State University, University Park, PA, 16802, USA
Cathy A McCarty Essentia Institute of Rural Health, Duluth, MN, 55805, USA
Murray H Brilliant Marshfield Clinic Research Institute, Marshfield, WI, 54449, USA
Peggy L Peissig Marshfield Clinic Research Institute, Marshfield, WI, 54449, USA
Terrie E Kitchner Marshfield Clinic Research Institute, Marshfield, WI, 54449, USA
Anurag Verma Biomedical and Translational Informatics Institute, Geisinger Health System, Danville, PA, 17821, USA.,Department of Biochemistry and Molecular Biology, Center for Systems Genomics, Eberly College of Science, The Pennsylvania State University, University Park, PA, 16802, USA
Sarah A Pendergrass Biomedical and Translational Informatics Institute, Geisinger Health System, Danville, PA, 17821, USA
Scott M Dudek Biomedical and Translational Informatics Institute, Geisinger Health System, Danville, PA, 17821, USA
Jason H Moore Institute for Biomedical Informatics, Departments of Genetics and Biostatistics and Epidemiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
Marylyn D Ritchie Biomedical and Translational Informatics Institute, Geisinger Health System, Danville, PA, 17821, USA. .,Department of Biochemistry and Molecular Biology, Center for Systems Genomics, Eberly College of Science, The Pennsylvania State University, University Park, PA, 16802, USA.

Collapse

McAllister K, Mechanic LE, Amos C, Aschard H, Blair IA, Chatterjee N, Conti D, Gauderman WJ, Hsu L, Hutter CM, Jankowska MM, Kerr J, Kraft P, Montgomery SB, Mukherjee B, Papanicolaou GJ, Patel CJ, Ritchie MD, Ritz BR, Thomas DC, Wei P, Witte JS. Current Challenges and New Opportunities for Gene-Environment Interaction Studies of Complex Diseases. Am J Epidemiol 2017;186:753-761. [PMID: 28978193 PMCID: PMC5860428 DOI: 10.1093/aje/kwx227] [Citation(s) in RCA: 106] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Revised: 03/14/2017] [Accepted: 03/16/2017] [Indexed: 12/25/2022] Open

Ritchie MD, Davis JR, Aschard H, Battle A, Conti D, Du M, Eskin E, Fallin MD, Hsu L, Kraft P, Moore JH, Pierce BL, Bien SA, Thomas DC, Wei P, Montgomery SB. Incorporation of Biological Knowledge Into the Study of Gene-Environment Interactions. Am J Epidemiol 2017;186:771-777. [PMID: 28978191 PMCID: PMC5860556 DOI: 10.1093/aje/kwx229] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2016] [Revised: 04/07/2017] [Accepted: 04/10/2017] [Indexed: 12/12/2022] Open

Holzinger ER, Verma SS, Moore CB, Hall M, De R, Gilbert-Diamond D, Lanktree MB, Pankratz N, Amuzu A, Burt A, Dale C, Dudek S, Furlong CE, Gaunt TR, Kim DS, Riess H, Sivapalaratnam S, Tragante V, van Iperen EP, Brautbar A, Carrell DS, Crosslin DR, Jarvik GP, Kuivaniemi H, Kullo IJ, Larson EB, Rasmussen-Torvik LJ, Tromp G, Baumert J, Cruickshanks KJ, Farrall M, Hingorani AD, Hovingh GK, Kleber ME, Klein BE, Klein R, Koenig W, Lange LA, Mӓrz W, North KE, Charlotte Onland-Moret N, Reiner AP, Talmud PJ, van der Schouw YT, Wilson JG, Kivimaki M, Kumari M, Moore JH, Drenos F, Asselbergs FW, Keating BJ, Ritchie MD. Discovery and replication of SNP-SNP interactions for quantitative lipid traits in over 60,000 individuals. BioData Min 2017;10:25. [PMID: 28770004 PMCID: PMC5525436 DOI: 10.1186/s13040-017-0145-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2017] [Accepted: 07/12/2017] [Indexed: 12/01/2022] Open

Affiliation(s)

Emily R. Holzinger Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institute for General Medical Sciences, National Institutes of Health, Baltimore, MD USA
Shefali S. Verma The Center for Systems Genomics, The Pennsylvania State University, University Park, State College, PA USA
Carrie B. Moore Department of Surgery, Duke University, Durham, NC USA
Molly Hall The Center for Systems Genomics, The Pennsylvania State University, University Park, State College, PA USA
Rishika De Department of Genetics, Geisel School of Medicine at Dartmouth, Hanover, NH USA
Diane Gilbert-Diamond Department of Epidemiology, Geisel School of Medicine at Dartmouth, Hanover, NH USA
Matthew B. Lanktree Department of Medicine, McMaster University, Hamilton, ON Canada
Nathan Pankratz Department of Lab Medicine and Pathology, University of Minnesota, Minneapolis, MN USA
Antoinette Amuzu London School of Hygiene and Tropical Medicine, London, UK
Amber Burt Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA USA
Caroline Dale London School of Hygiene and Tropical Medicine, London, UK
Scott Dudek The Center for Systems Genomics, The Pennsylvania State University, University Park, State College, PA USA
Clement E. Furlong Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA USA
Tom R. Gaunt MRC Integrative Epidemiology Unit, University of Bristol, Oakfield House, Oakfield Grove, Bristol, UK
Daniel Seung Kim Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA USA
Helene Riess Institute of Epidemiology II, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
Suthesh Sivapalaratnam Department of Vascular Medicine, Academic Medical Center, Amsterdam, The Netherlands
Vinicius Tragante Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, Utrecht, The Netherlands Department of Medical Genetics, Biomedical Genetics, University Medical Center Utrecht, Utrecht, The Netherlands
Erik P.A. van Iperen Durrer Center for Cardiogenetic Research, ICIN-Netherlands Heart Institute, Utrecht, The Netherlands Department of Clinical Epidemiology, Biostatistics and Bioinformatics, Academic Medical Center, Amsterdam, The Netherlands
Ariel Brautbar Department of Medical Genetics, Marshfield Clinic, Marshfield, WI USA
David S. Carrell Group Health Research Institute, Group Health Cooperative, Seattle, WA USA
David R. Crosslin Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA USA
Gail P. Jarvik Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA USA
Helena Kuivaniemi Division of Molecular Biology and Human Genetics, Department of Biomedical Sciences, Stellenbosch University, Tygerberg, South Africa
Iftikhar J. Kullo Division of Cardiovascular Diseases, Mayo Clinic, Rochester, MN USA
Eric B. Larson Group Health Research Institute, Group Health Cooperative, Seattle, WA USA
Laura J. Rasmussen-Torvik Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL USA
Gerard Tromp Division of Molecular Biology and Human Genetics, Department of Biomedical Sciences, Stellenbosch University, Tygerberg, South Africa
Jens Baumert Institute of Epidemiology II, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
Karen J. Cruickshanks Department of Population Health Sciences, Department of Ophthalmology and Visual Sciences, University of Wisconsin-Madison, Madison, WI USA
Martin Farrall Department of Cardiovascular Medicine, The Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
Aroon D. Hingorani Department of Epidemiology and Public Health, UCL Institute of Epidemiology & Health Care, University College London, London, UK
G. K. Hovingh Department of Vascular Medicine, Academic Medical Center, Amsterdam, The Netherlands
Marcus E. Kleber Vth Department of Medicine, Medical Faculty Mannheim, Heidelberg University, Heidelberg, Germany
Barbara E. Klein Department of Population Health Sciences, Department of Ophthalmology and Visual Sciences, University of Wisconsin-Madison, Madison, WI USA
Ronald Klein Department of Population Health Sciences, Department of Ophthalmology and Visual Sciences, University of Wisconsin-Madison, Madison, WI USA
Wolfgang Koenig Department of Internal Medicine II – Cardiology, University of Ulm Medical Centre, Ulm, Germany
Leslie A. Lange Department of Genetics, University of North Carolina School of Medicine at Chapel Hill, Chapel Hill, NC USA
Winfried Mӓrz Vth Department of Medicine, Medical Faculty Mannheim, Heidelberg University, Heidelberg, Germany Synlab Academy, Synlab Services GmbH, Mannheim, Germany
Kari E. North Department of Epidemiology, School of Public Health, University of North Carolina at Chapel Hill, Chapel Hill, NC USA
N. Charlotte Onland-Moret Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The Netherlands
Alex P. Reiner Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, WA USA
Philippa J. Talmud MRC Integrative Epidemiology Unit, School of Social and Community Medicine, University of Bristol, Bristol, UK
Yvonne T. van der Schouw Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht, The Netherlands
James G. Wilson Department of Physiology and Biophysics, University of Mississippi Medical Center, Jackson, MS USA
Mika Kivimaki Department of Epidemiology and Public Health, UCL Institute of Epidemiology & Health Care, University College London, London, UK
Meena Kumari Department of Epidemiology and Public Health, UCL Institute of Epidemiology & Health Care, University College London, London, UK ISER, University of Essex, Essex, UK
Jason H. Moore Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA USA
Fotios Drenos MRC Integrative Epidemiology Unit, School of Social and Community Medicine, University of Bristol, Bristol, UK Centre of Cardiovascular Genetics, Institute of Cardiovascular Science, Faculty of Population Health Sciences, University College London, London, UK
Folkert W. Asselbergs Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, Utrecht, The Netherlands Durrer Center for Cardiogenetic Research, ICIN-Netherlands Heart Institute, Utrecht, The Netherlands Centre of Cardiovascular Genetics, Institute of Cardiovascular Science, Faculty of Population Health Sciences, University College London, London, UK
Brendan J. Keating Division of Genetics, The Children’s Hospital of Philadelphia, Philadelphia, PA USA Division of Transplantation, Department of Surgery, University of Pennsylvania, Philadelphia, PA USA
Marylyn D. Ritchie Biomedical and Translational Informatics, Geisinger Clinic, Danville, PA USA

Collapse

Identifying gene-gene interactions that are highly associated with four quantitative lipid traits across multiple cohorts. Hum Genet 2016;136:165-178. [PMID: 27848076 DOI: 10.1007/s00439-016-1738-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2016] [Accepted: 10/07/2016] [Indexed: 10/20/2022]

Moore CCB, Basile AO, Wallace JR, Frase AT, Ritchie MD. A biologically informed method for detecting rare variant associations. BioData Min 2016;9:27. [PMID: 27582876 PMCID: PMC5006419 DOI: 10.1186/s13040-016-0107-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2016] [Accepted: 06/18/2016] [Indexed: 11/29/2022] Open

Phenome-Wide Association Study to Explore Relationships between Immune System Related Genetic Loci and Complex Traits and Diseases. PLoS One 2016;11:e0160573. [PMID: 27508393 PMCID: PMC4980020 DOI: 10.1371/journal.pone.0160573] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2016] [Accepted: 07/16/2016] [Indexed: 12/21/2022] Open

Butkiewicz M, Cooke Bailey JN, Frase A, Dudek S, Yaspan BL, Ritchie MD, Pendergrass SA, Haines JL. Pathway analysis by randomization incorporating structure-PARIS: an update. ACTA ACUST UNITED AC 2016;32:2361-3. [PMID: 27153576 DOI: 10.1093/bioinformatics/btw130] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2015] [Accepted: 03/03/2016] [Indexed: 01/11/2023]

Hohman TJ, Bush WS, Jiang L, Brown-Gentry KD, Torstenson ES, Dudek SM, Mukherjee S, Naj A, Kunkle BW, Ritchie MD, Martin ER, Schellenberg GD, Mayeux R, Farrer LA, Pericak-Vance MA, Haines JL, Thornton-Wells TA. Discovery of gene-gene interactions across multiple independent data sets of late onset Alzheimer disease from the Alzheimer Disease Genetics Consortium. Neurobiol Aging 2016;38:141-150. [PMID: 26827652 PMCID: PMC4735733 DOI: 10.1016/j.neurobiolaging.2015.10.031] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2015] [Revised: 10/28/2015] [Accepted: 10/28/2015] [Indexed: 12/20/2022]

Affiliation(s)

Timothy J Hohman Vanderbilt Memory & Alzheimer's Center, Department of Neurology, Vanderbilt University Medical Center, Nashville, TN, USA
William S Bush Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA
Lan Jiang Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
Kristin D Brown-Gentry Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
Eric S Torstenson Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
Scott M Dudek Vanderbilt Genetics Institute, Vanderbilt University Medical Center, Nashville, TN, USA
Shubhabrata Mukherjee Department of Medicine, University of Washington, Seattle, WA, USA
Adam Naj Department of Biostatistics and Epidemiology, University of Pennsylvania, Philadelphia, PA, USA
Brian W Kunkle Dr. John T. Macdonald Foundation Department of Human Genetics and John P. Hussman Institute for Human Genomics, Miller School of Medicine, University of Miami, Miami, FL, USA
Marylyn D Ritchie Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, USA
Eden R Martin Dr. John T. Macdonald Foundation Department of Human Genetics and John P. Hussman Institute for Human Genomics, Miller School of Medicine, University of Miami, Miami, FL, USA; Department of Public Health Sciences, Miller School of Medicine, University of Miami, Miami, FL, USA
Gerard D Schellenberg Department of Pathology and Laboratory Medicine, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
Richard Mayeux Gertrude H. Sergievsky Center, Department of Neurology and the Taub Institute for Research on Alzheimer's Disease and the Aging Brain, College of Physicians and Surgeons, Columbia University, New York, NY, USA
Lindsay A Farrer Department of Medicine (Biomedical Genetics), Boston University, Boston, MA, USA; Department of Neurology, Boston University, Boston, MA, USA; Department of Ophthalmology, Boston University, Boston, MA, USA; Department of Epidemiology, Boston University, Boston, MA, USA; Department of Biostatistics, Boston University, Boston, MA, USA
Margaret A Pericak-Vance Dr. John T. Macdonald Foundation Department of Human Genetics and John P. Hussman Institute for Human Genomics, Miller School of Medicine, University of Miami, Miami, FL, USA; Department of Neurology, Miller School of Medicine, University of Miami, Miami, FL, USA
Jonathan L Haines Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH, USA
Tricia A Thornton-Wells Vanderbilt Genetics Institute, Department of Molecular Physiology & Biophysics, Vanderbilt University Medical Center, Nashville, TN, USA.

Collapse

Basile AO, Wallace JR, Peissig P, McCarty CA, Brilliant M, Ritchie MD. KNOWLEDGE DRIVEN BINNING AND PHEWAS ANALYSIS IN MARSHFIELD PERSONALIZED MEDICINE RESEARCH PROJECT USING BIOBIN. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2016;21:249-260. [PMID: 26776191 PMCID: PMC4824557] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Abstract

Next-generation sequencing technology has presented an opportunity for rare variant discovery and association of these variants with disease. To address the challenges of rare variant analysis, multiple statistical methods have been developed for combining rare variants to increase statistical power for detecting associations. BioBin is an automated tool that expands on collapsing/binning methods by performing multi-level variant aggregation with a flexible, biologically informed binning strategy using an internal biorepository, the Library of Knowledge (LOKI). The databases within LOKI provide variant details, regional annotations and pathway interactions which can be used to generate bins of biologically-related variants, thereby increasing the power of any subsequent statistical test. In this study, we expand the framework of BioBin to incorporate statistical tests, including a dispersion-based test, SKAT, thereby providing the option of performing a unified collapsing and statistical rare variant analysis in one tool. Extensive simulation studies performed on gene-coding regions showed a Bin-KAT analysis to have greater power than BioBin-regression in all simulated conditions, including variants influencing the phenotype in the same direction, a scenario where burden tests often retain greater power. The use of Madsen- Browning variant weighting increased power in the burden analysis to that equitable with Bin-KAT; but overall Bin-KAT retained equivalent or higher power under all conditions. Bin-KAT was applied to a study of 82 pharmacogenes sequenced in the Marshfield Personalized Medicine Research Project (PMRP). We looked for association of these genes with 9 different phenotypes extracted from the electronic health record. This study demonstrates that Bin-KAT is a powerful tool for the identification of genes harboring low frequency variants for complex phenotypes.

Collapse

Butkiewicz M, Bush WS. In Silico Functional Annotation of Genomic Variation. CURRENT PROTOCOLS IN HUMAN GENETICS 2016;88:6.15.1-6.15.17. [PMID: 26724722 PMCID: PMC4722816 DOI: 10.1002/0471142905.hg0615s88] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Verma SS, Frase AT, Verma A, Pendergrass SA, Mahony S, Haas DW, Ritchie MD. PHENOME-WIDE INTERACTION STUDY (PheWIS) IN AIDS CLINICAL TRIALS GROUP DATA (ACTG). PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2016;21:57-68. [PMID: 26776173 PMCID: PMC4722952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Abstract

Association studies have shown and continue to show a substantial amount of success in identifying links between multiple single nucleotide polymorphisms (SNPs) and phenotypes. These studies are also believed to provide insights toward identification of new drug targets and therapies. Albeit of all the success, challenges still remain for applying and prioritizing these associations based on available biological knowledge. Along with single variant association analysis, genetic interactions also play an important role in uncovering the etiology and progression of complex traits. For gene-gene interaction analysis, selection of the variants to test for associations still poses a challenge in identifying epistatic interactions among the large list of variants available in high-throughput, genome-wide datasets. Therefore in this study, we propose a pipeline to identify interactions among genetic variants that are associated with multiple phenotypes by prioritizing previously published results from main effect association analysis (genome-wide and phenome-wide association analysis) based on a-priori biological knowledge in AIDS Clinical Trials Group (ACTG) data. We approached the prioritization and filtration of variants by using the results of a previously published single variant PheWAS and then utilizing biological information from the Roadmap Epigenome project. We removed variants in low functional activity regions based on chromatin states annotation and then conducted an exhaustive pairwise interaction search using linear regression analysis. We performed this analysis in two independent pre-treatment clinical trial datasets from ACTG to allow for both discovery and replication. Using a regression framework, we observed 50,798 associations that replicate at p-value 0.01 for 26 phenotypes, among which 2,176 associations for 212 unique SNPs for fasting blood glucose phenotype reach Bonferroni significance and an additional 9,970 interactions for high-density lipoprotein (HDL) phenotype and fasting blood glucose (total of 12,146 associations) reach FDR significance. We conclude that this method of prioritizing variants to look for epistatic interactions can be used extensively for generating hypotheses for genomewide and phenome-wide interaction analyses. This original Phenome-wide Interaction study (PheWIS) can be applied further to patients enrolled in randomized clinical trials to establish the relationship between patient's response to a particular drug therapy and non-linear combination of variants that might be affecting the outcome.

Collapse

KIM DOKYOON, LUCAS ANASTASIA, GLESSNER JOSEPH, VERMA SHEFALIS, BRADFORD YUKI, LI RUOWANG, FRASE ALEXT, HAKONARSON HAKON, PEISSIG PEGGY, BRILLIANT MURRAY, RITCHIE MARYLYND. BIOFILTER AS A FUNCTIONAL ANNOTATION PIPELINE FOR COMMON AND RARE COPY NUMBER BURDEN. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2016;21:357-368. [PMID: 26776200 PMCID: PMC4722964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Abstract

Recent studies on copy number variation (CNV) have suggested that an increasing burden of CNVs is associated with susceptibility or resistance to disease. A large number of genes or genomic loci contribute to complex diseases such as autism. Thus, total genomic copy number burden, as an accumulation of copy number change, is a meaningful measure of genomic instability to identify the association between global genetic effects and phenotypes of interest. However, no systematic annotation pipeline has been developed to interpret biological meaning based on the accumulation of copy number change across the genome associated with a phenotype of interest. In this study, we develop a comprehensive and systematic pipeline for annotating copy number variants into genes/genomic regions and subsequently pathways and other gene groups using Biofilter - a bioinformatics tool that aggregates over a dozen publicly available databases of prior biological knowledge. Next we conduct enrichment tests of biologically defined groupings of CNVs including genes, pathways, Gene Ontology, or protein families. We applied the proposed pipeline to a CNV dataset from the Marshfield Clinic Personalized Medicine Research Project (PMRP) in a quantitative trait phenotype derived from the electronic health record - total cholesterol. We identified several significant pathways such as toll-like receptor signaling pathway and hepatitis C pathway, gene ontologies (GOs) of nucleoside triphosphatase activity (NTPase) and response to virus, and protein families such as cell morphogenesis that are associated with the total cholesterol phenotype based on CNV profiles (permutation p-value < 0.01). Based on the copy number burden analysis, it follows that the more and larger the copy number changes, the more likely that one or more target genes that influence disease risk and phenotypic severity will be affected. Thus, our study suggests the proposed enrichment pipeline could improve the interpretability of copy number burden analysis where hundreds of loci or genes contribute toward disease susceptibility via biological knowledge groups such as pathways. This CNV annotation pipeline with Biofilter can be used for CNV data from any genotyping or sequencing platform and to explore CNV enrichment for any traits or phenotypes. Biofilter continues to be a powerful bioinformatics tool for annotating, filtering, and constructing biologically informed models for association analysis - now including copy number variants.

Collapse

De R, Verma SS, Drenos F, Holzinger ER, Holmes MV, Hall MA, Crosslin DR, Carrell DS, Hakonarson H, Jarvik G, Larson E, Pacheco JA, Rasmussen-Torvik LJ, Moore CB, Asselbergs FW, Moore JH, Ritchie MD, Keating BJ, Gilbert-Diamond D. Identifying gene-gene interactions that are highly associated with Body Mass Index using Quantitative Multifactor Dimensionality Reduction (QMDR). BioData Min 2015;8:41. [PMID: 26674805 PMCID: PMC4678717 DOI: 10.1186/s13040-015-0074-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2015] [Accepted: 12/04/2015] [Indexed: 11/22/2022] Open

Abstract

Background

Despite heritability estimates of 40–70 % for obesity, less than 2 % of its variation is explained by Body Mass Index (BMI) associated loci that have been identified so far. Epistasis, or gene-gene interactions are a plausible source to explain portions of the missing heritability of BMI.

Methods

Using genotypic data from 18,686 individuals across five study cohorts – ARIC, CARDIA, FHS, CHS, MESA – we filtered SNPs (Single Nucleotide Polymorphisms) using two parallel approaches. SNPs were filtered either on the strength of their main effects of association with BMI, or on the number of knowledge sources supporting a specific SNP-SNP interaction in the context of BMI. Filtered SNPs were specifically analyzed for interactions that are highly associated with BMI using QMDR (Quantitative Multifactor Dimensionality Reduction). QMDR is a nonparametric, genetic model-free method that detects non-linear interactions associated with a quantitative trait.

Results

We identified seven novel, epistatic models with a Bonferroni corrected p-value of association < 0.1. Prior experimental evidence helps explain the plausible biological interactions highlighted within our results and their relationship with obesity. We identified interactions between genes involved in mitochondrial dysfunction (POLG2), cholesterol metabolism (SOAT2), lipid metabolism (CYP11B2), cell adhesion (EZR), cell proliferation (MAP2K5), and insulin resistance (IGF1R). Moreover, we found an 8.8 % increase in the variance in BMI explained by these seven SNP-SNP interactions, beyond what is explained by the main effects of an index FTO SNP and the SNPs within these interactions. We also replicated one of these interactions and 58 proxy SNP-SNP models representing it in an independent dataset from the eMERGE study.

Conclusion

This study highlights a novel approach for discovering gene-gene interactions by combining methods such as QMDR with traditional statistics.

Electronic supplementary material

The online version of this article (doi:10.1186/s13040-015-0074-0) contains supplementary material, which is available to authorized users.

Collapse

Affiliation(s)

Rishika De Computational Genetics Laboratory, Department of Genetics, Geisel School of Medicine at Dartmouth, Dartmouth-Hitchcock Medical Center, 706 Rubin Building, HB7937, One Medical Center Dr, Lebanon, NH 03756 USA
Shefali S Verma Center for Systems Genomics, Department of Biochemistry and Molecular Biology, 512 Wartik Laboratory, The Pennsylvania State University, University Park, PA 16802 USA
Fotios Drenos Centre for Cardiovascular Genetics, Institute of Cardiovascular Science, Faculty of Population Health Sciences, University College London, 5 University Street, London, WC1E 6JF UK
Emily R Holzinger Center for Systems Genomics, Department of Biochemistry and Molecular Biology, 512 Wartik Laboratory, The Pennsylvania State University, University Park, PA 16802 USA
Michael V Holmes Division of Transplant Surgery, Perelman School of Medicine, University of Pennsylvania, 3400 Spruce Street, 2 Dulles Pvln, Philadelphia, PA 19104 USA
Molly A Hall Center for Systems Genomics, Department of Biochemistry and Molecular Biology, 512 Wartik Laboratory, The Pennsylvania State University, University Park, PA 16802 USA
David R Crosslin Department of Genome Sciences, University of Washington, 3720 15th Ave NE, Seattle, WA 98195-5065 USA
David S Carrell Group Health Research Institute, Metropolitan Park East, 1730 Minor Avenue, Suite 1600, Seattle, WA 98101-1448 USA
Hakon Hakonarson The Joseph Stokes Jr. Research Institute, The Children's Hospital of Philadelphia, Office 1016 Abramson Building, Room 1216E, 3615 Civic Center Blvd, Philadelphia, PA 19104 USA
Gail Jarvik Department of Genome Sciences, University of Washington, 3720 15th Ave NE, Seattle, WA 98195-5065 USA ; Division of Medical Genetics, Department of Medicine, University of Washington, Health Sciences Building, K-253B, Medical Genetics, Box 357720, Seattle, WA 98195-7720 USA
Eric Larson Group Health Research Institute, Metropolitan Park East, 1730 Minor Avenue, Suite 1600, Seattle, WA 98101-1448 USA
Jennifer A Pacheco Center for Genetic Medicine, Northwestern University Feinberg School of Medicine, 303 E. Superior Street, Lurie 7-125, Chicago, IL 60611 USA
Laura J Rasmussen-Torvik Department of Preventive Medicine, Northwestern University, Feinberg School of Medicine, 680 N Lake Shore Drive, Suite 1400, Chicago, IL 60611 USA
Carrie B Moore Center for Systems Genomics, Department of Biochemistry and Molecular Biology, 512 Wartik Laboratory, The Pennsylvania State University, University Park, PA 16802 USA ; Center for Human Genetics Research, Vanderbilt University School of Medicine, 519 Light Hall, Nashville, TN 37232 USA
Folkert W Asselbergs Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, Room E03.511, P.O. Box 85500, 3508 GA Utrecht, The Netherlands ; Institute of Cardiovascular Science, University College London, London, UK ; Durrer Center for Cardiogenetic Research, ICIN-Netherlands Heart Institute, Utrecht, The Netherlands
Jason H Moore Institute for Biomedical Informatics, The Perelman School of Medicine, University of Pennsylvania, 1418 Blockley Hall, 423 Guardian Drive, Philadelphia, PA 19104-6021 USA
Marylyn D Ritchie Center for Systems Genomics, Department of Biochemistry and Molecular Biology, 512 Wartik Laboratory, The Pennsylvania State University, University Park, PA 16802 USA
Brendan J Keating The Joseph Stokes Jr. Research Institute, The Children's Hospital of Philadelphia, Office 1016 Abramson Building, Room 1216E, 3615 Civic Center Blvd, Philadelphia, PA 19104 USA ; University Medical Center Utrecht, Utrecht, The Netherlands
Diane Gilbert-Diamond Institute for Quantitative Biomedical Sciences at Dartmouth, Hanover, NH USA ; Department of Epidemiology, Geisel School of Medicine at Dartmouth, One Medical Center Drive, 7927 Rubin Building, Lebanon, NH 03756 USA

Collapse

Niel C, Sinoquet C, Dina C, Rocheleau G. A survey about methods dedicated to epistasis detection. Front Genet 2015;6:285. [PMID: 26442103 PMCID: PMC4564769 DOI: 10.3389/fgene.2015.00285] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2015] [Accepted: 08/27/2015] [Indexed: 12/25/2022] Open

Pendergrass SA, Verma A, Okula A, Hall MA, Crawford DC, Ritchie MD. Phenome-Wide Association Studies: Embracing Complexity for Discovery. Hum Hered 2015. [PMID: 26201697 DOI: 10.1159/000381851] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open

Hall MA, Verma SS, Wallace J, Lucas A, Berg RL, Connolly J, Crawford DC, Crosslin DR, de Andrade M, Doheny KF, Haines JL, Harley JB, Jarvik GP, Kitchner T, Kuivaniemi H, Larson EB, Carrell DS, Tromp G, Vrabec TR, Pendergrass SA, McCarty CA, Ritchie MD. Biology-Driven Gene-Gene Interaction Analysis of Age-Related Cataract in the eMERGE Network. Genet Epidemiol 2015;39:376-84. [PMID: 25982363 PMCID: PMC4550090 DOI: 10.1002/gepi.21902] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2014] [Revised: 02/27/2015] [Accepted: 03/13/2015] [Indexed: 01/19/2023]

Affiliation(s)

Molly A Hall Department of Biochemistry and Molecular Biology, Center for Systems Genomics, Eberly College of Science, The Pennsylvania State University, University Park, Pennsylvania, United States of America
Shefali S Verma Department of Biochemistry and Molecular Biology, Center for Systems Genomics, Eberly College of Science, The Pennsylvania State University, University Park, Pennsylvania, United States of America
John Wallace Department of Biochemistry and Molecular Biology, Center for Systems Genomics, Eberly College of Science, The Pennsylvania State University, University Park, Pennsylvania, United States of America
Anastasia Lucas Department of Biochemistry and Molecular Biology, Center for Systems Genomics, Eberly College of Science, The Pennsylvania State University, University Park, Pennsylvania, United States of America
Richard L Berg Marshfield Clinic, Marshfield, Wisconsin, United States of America
John Connolly Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania, United States of America
Dana C Crawford Department of Epidemiology and Biostatistics, Institute for Computational Biology, Case Western Reserve University, Cleveland, Ohio, United States of America
David R Crosslin Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
Mariza de Andrade Mayo Clinic, Rochester, Minnesota, United States of America
Kimberly F Doheny Center for Inherited Disease Research, IGM, Johns Hopkins University SOM, Baltimore, Maryland, United States of America
Jonathan L Haines Department of Epidemiology and Biostatistics, Institute for Computational Biology, Case Western Reserve University, Cleveland, Ohio, United States of America
John B Harley Department of Pediatrics, Cincinnati Children's Hospital, University of Cincinnati, Cincinnati, Ohio, United States of America
Gail P Jarvik Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America.,Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, Washington, United States of America
Terrie Kitchner Marshfield Clinic, Marshfield, Wisconsin, United States of America
Helena Kuivaniemi Geisinger Health System, Danville, Pennsylvania, United States of America
Eric B Larson Group Health Research Institute, Seattle, Washington, United States of America
David S Carrell Group Health Research Institute, Seattle, Washington, United States of America
Gerard Tromp Geisinger Health System, Danville, Pennsylvania, United States of America
Tamara R Vrabec Geisinger Health System, Danville, Pennsylvania, United States of America
Sarah A Pendergrass Geisinger Health System, Danville, Pennsylvania, United States of America
Catherine A McCarty Essentia Rural Health, Duluth, Minnesota, United States of America
Marylyn D Ritchie Department of Biochemistry and Molecular Biology, Center for Systems Genomics, Eberly College of Science, The Pennsylvania State University, University Park, Pennsylvania, United States of America.,Geisinger Health System, Danville, Pennsylvania, United States of America

Collapse

HU TING, DARABOS CHRISTIAN, CRICCO MARIAE, KONG EMILY, MOORE JASONH. Genome-wide genetic interaction analysis of glaucoma using expert knowledge derived from human phenotype networks. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2015;20:207-18. [PMID: 25592582 PMCID: PMC4299930] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]

Moore CB, Verma A, Pendergrass S, Verma SS, Johnson DH, Daar ES, Gulick RM, Haubrich R, Robbins GK, Ritchie MD, Haas DW. Phenome-wide Association Study Relating Pretreatment Laboratory Parameters With Human Genetic Variants in AIDS Clinical Trials Group Protocols. Open Forum Infect Dis 2015;2:ofu113. [PMID: 25884002 PMCID: PMC4396430 DOI: 10.1093/ofid/ofu113] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2014] [Accepted: 12/02/2014] [Indexed: 01/11/2023] Open

Kim D, Li R, Dudek SM, Wallace JR, Ritchie MD. Binning somatic mutations based on biological knowledge for predicting survival: an application in renal cell carcinoma. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2015:96-107. [PMID: 25592572 PMCID: PMC4299944] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]

Hall MA, Verma A, Brown-Gentry KD, Goodloe R, Boston J, Wilson S, McClellan B, Sutcliffe C, Dilks HH, Gillani NB, Jin H, Mayo P, Allen M, Schnetz-Boutaud N, Crawford DC, Ritchie MD, Pendergrass SA. Detection of pleiotropy through a Phenome-wide association study (PheWAS) of epidemiologic data as part of the Environmental Architecture for Genes Linked to Environment (EAGLE) study. PLoS Genet 2014;10:e1004678. [PMID: 25474351 PMCID: PMC4256091 DOI: 10.1371/journal.pgen.1004678] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2014] [Accepted: 08/16/2014] [Indexed: 12/19/2022] Open

Abstract

We performed a Phenome-wide association study (PheWAS) utilizing diverse genotypic and phenotypic data existing across multiple populations in the National Health and Nutrition Examination Surveys (NHANES), conducted by the Centers for Disease Control and Prevention (CDC), and accessed by the Epidemiological Architecture for Genes Linked to Environment (EAGLE) study. We calculated comprehensive tests of association in Genetic NHANES using 80 SNPs and 1,008 phenotypes (grouped into 184 phenotype classes), stratified by race-ethnicity. Genetic NHANES includes three surveys (NHANES III, 1999-2000, and 2001-2002) and three race-ethnicities: non-Hispanic whites (n = 6,634), non-Hispanic blacks (n = 3,458), and Mexican Americans (n = 3,950). We identified 69 PheWAS associations replicating across surveys for the same SNP, phenotype-class, direction of effect, and race-ethnicity at p<0.01, allele frequency >0.01, and sample size >200. Of these 69 PheWAS associations, 39 replicated previously reported SNP-phenotype associations, 9 were related to previously reported associations, and 21 were novel associations. Fourteen results had the same direction of effect across more than one race-ethnicity: one result was novel, 11 replicated previously reported associations, and two were related to previously reported results. Thirteen SNPs showed evidence of pleiotropy. We further explored results with gene-based biological networks, contrasting the direction of effect for pleiotropic associations across phenotypes. One PheWAS result was ABCG2 missense SNP rs2231142, associated with uric acid levels in both non-Hispanic whites and Mexican Americans, protoporphyrin levels in non-Hispanic whites and Mexican Americans, and blood pressure levels in Mexican Americans. Another example was SNP rs1800588 near LIPC, significantly associated with the novel phenotypes of folate levels (Mexican Americans), vitamin E levels (non-Hispanic whites) and triglyceride levels (non-Hispanic whites), and replication for cholesterol levels. The results of this PheWAS show the utility of this approach for exposing more of the complex genetic architecture underlying multiple traits, through generating novel hypotheses for future research.

Collapse

Affiliation(s)

Molly A. Hall Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania, United States of America
Anurag Verma Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania, United States of America
Kristin D. Brown-Gentry Center for Human Genetics Research, Vanderbilt University, Nashville, Tennessee, United States of America
Robert Goodloe Center for Human Genetics Research, Vanderbilt University, Nashville, Tennessee, United States of America
Jonathan Boston Center for Human Genetics Research, Vanderbilt University, Nashville, Tennessee, United States of America
Sarah Wilson Center for Human Genetics Research, Vanderbilt University, Nashville, Tennessee, United States of America
Bob McClellan Center for Human Genetics Research, Vanderbilt University, Nashville, Tennessee, United States of America
Cara Sutcliffe Center for Human Genetics Research, Vanderbilt University, Nashville, Tennessee, United States of America
Holly H. Dilks Center for Human Genetics Research, Vanderbilt University, Nashville, Tennessee, United States of America Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, Tennessee, United States of America
Nila B. Gillani Center for Human Genetics Research, Vanderbilt University, Nashville, Tennessee, United States of America
Hailing Jin Center for Human Genetics Research, Vanderbilt University, Nashville, Tennessee, United States of America
Ping Mayo Center for Human Genetics Research, Vanderbilt University, Nashville, Tennessee, United States of America
Melissa Allen Center for Human Genetics Research, Vanderbilt University, Nashville, Tennessee, United States of America
Nathalie Schnetz-Boutaud Center for Human Genetics Research, Vanderbilt University, Nashville, Tennessee, United States of America
Dana C. Crawford Center for Human Genetics Research, Vanderbilt University, Nashville, Tennessee, United States of America Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, Tennessee, United States of America
Marylyn D. Ritchie Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania, United States of America
Sarah A. Pendergrass Center for Systems Genomics, Department of Biochemistry and Molecular Biology, The Huck Institutes of the Life Sciences, The Pennsylvania State University, University Park, Pennsylvania, United States of America

Collapse

Chhibber A, Kroetz DL, Tantisira KG, McGeachie M, Cheng C, Plenge R, Stahl E, Sadee W, Ritchie MD, Pendergrass SA. Genomic architecture of pharmacological efficacy and adverse events. Pharmacogenomics 2014;15:2025-48. [PMID: 25521360 PMCID: PMC4308414 DOI: 10.2217/pgs.14.144] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open

Kim D, Li R, Dudek SM, Frase AT, Pendergrass SA, Ritchie MD. Knowledge-driven genomic interactions: an application in ovarian cancer. BioData Min 2014;7:20. [PMID: 25214892 PMCID: PMC4161273 DOI: 10.1186/1756-0381-7-20] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2014] [Accepted: 08/28/2014] [Indexed: 12/11/2022] Open

Abstract

Background

Effective cancer clinical outcome prediction for understanding of the mechanism of various types of cancer has been pursued using molecular-based data such as gene expression profiles, an approach that has promise for providing better diagnostics and supporting further therapies. However, clinical outcome prediction based on gene expression profiles varies between independent data sets. Further, single-gene expression outcome prediction is limited for cancer evaluation since genes do not act in isolation, but rather interact with other genes in complex signaling or regulatory networks. In addition, since pathways are more likely to co-operate together, it would be desirable to incorporate expert knowledge to combine pathways in a useful and informative manner.

Methods

Thus, we propose a novel approach for identifying knowledge-driven genomic interactions and applying it to discover models associated with cancer clinical phenotypes using grammatical evolution neural networks (GENN). In order to demonstrate the utility of the proposed approach, an ovarian cancer data from the Cancer Genome Atlas (TCGA) was used for predicting clinical stage as a pilot project.

Results

We identified knowledge-driven genomic interactions associated with cancer stage from single knowledge bases such as sources of pathway-pathway interaction, but also knowledge-driven genomic interactions across different sets of knowledge bases such as pathway-protein family interactions by integrating different types of information. Notably, an integration model from different sources of biological knowledge achieved 78.82% balanced accuracy and outperformed the top models with gene expression or single knowledge-based data types alone. Furthermore, the results from the models are more interpretable because they are framed in the context of specific biological pathways or other expert knowledge.

Conclusions

The success of the pilot study we have presented herein will allow us to pursue further identification of models predictive of clinical cancer survival and recurrence. Understanding the underlying tumorigenesis and progression in ovarian cancer through the global view of interactions within/between different biological knowledge sources has the potential for providing more effective screening strategies and therapeutic targets for many types of cancer.

Collapse

Crawford DC, Crosslin DR, Tromp G, Kullo IJ, Kuivaniemi H, Hayes MG, Denny JC, Bush WS, Haines JL, Roden DM, McCarty CA, Jarvik GP, Ritchie MD. eMERGEing progress in genomics-the first seven years. Front Genet 2014;5:184. [PMID: 24987407 PMCID: PMC4060012 DOI: 10.3389/fgene.2014.00184] [Citation(s) in RCA: 67] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2014] [Accepted: 05/30/2014] [Indexed: 12/15/2022] Open

Affiliation(s)

Dana C Crawford Center for Human Genetics Research, Vanderbilt University Nashville, TN, USA ; Department of Molecular Physiology and Biophysics, Vanderbilt University Nashville, TN, USA
David R Crosslin Medical Genetics, Department of Medicine, School of Medicine, University of Washington Seattle, WA, USA ; Department of Genome Sciences, University of Washington Seattle, WA, USA
Gerard Tromp The Sigfried and Janet Weis Center for Research, Geisinger Health System Danville, PA, USA
Iftikhar J Kullo Division of Cardiovascular Diseases and the Gonda Vascular Center, Mayo Clinic Rochester, MN, USA
Helena Kuivaniemi The Sigfried and Janet Weis Center for Research, Geisinger Health System Danville, PA, USA
M Geoffrey Hayes Division of Endocrinology, Metabolism, and Molecular Medicine, Department of Medicine, Feinberg School of Medicine, Northwestern University Chicago, IL, USA
Joshua C Denny Department of Biomedical Informatics, Vanderbilt University Nashville, TN, USA ; Department of Medicine, Vanderbilt University Nashville, TN, USA
William S Bush Center for Human Genetics Research, Vanderbilt University Nashville, TN, USA ; Department of Biomedical Informatics, Vanderbilt University Nashville, TN, USA
Jonathan L Haines Department of Epidemiology and Biostatistics, Case Western Reserve University Cleveland, OH, USA ; Institute for Computational Biology, Case Western Reserve University Cleveland, OH, USA
Dan M Roden Department of Medicine, Vanderbilt University Nashville, TN, USA ; Department of Pharmacology, Vanderbilt University Nashville, TN, USA
Catherine A McCarty Essentia Institute of Rural Health Duluth, MN, USA
Gail P Jarvik Medical Genetics, Department of Medicine, School of Medicine, University of Washington Seattle, WA, USA ; Department of Genome Sciences, University of Washington Seattle, WA, USA
Marylyn D Ritchie Department of Biochemistry and Molecular Biology, Pennsylvania State University University Park, PA, USA ; Center for Systems Genomics, Pennsylvania State University University Park, PA, USA

Collapse

Sun X, Lu Q, Mukherjee S, Crane PK, Elston R, Ritchie MD. Analysis pipeline for the epistasis search - statistical versus biological filtering. Front Genet 2014;5:106. [PMID: 24817878 PMCID: PMC4012196 DOI: 10.3389/fgene.2014.00106] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2014] [Accepted: 04/10/2014] [Indexed: 12/15/2022] Open