1
|
Yang Z, Yao S, Xu Y, Zhang X, Shi Y, Wang L, Cui D. Identification of a Predictive Model for Schizophrenia Based on SNPs in a Chinese Population. Neuropsychiatr Dis Treat 2024; 20:1553-1561. [PMID: 39139656 PMCID: PMC11321330 DOI: 10.2147/ndt.s466554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Accepted: 07/26/2024] [Indexed: 08/15/2024] Open
Abstract
Background Schizophrenia is a devastating mental disease with high heritability. A growing number of susceptibility genes associated with schizophrenia, as well as their corresponding SNPs loci, have been revealed by genome-wide association studies. However, using SNPs as predictors of disease and diagnosis remains difficult. Here, we aimed to uncover susceptibility SNPs in a Chinese population and to construct a prediction model for schizophrenia. Methods A total of 210 participants, including 70 patients with schizophrenia, 70 patients with bipolar disorder, and 70 healthy controls, were enrolled in this study. We estimated 14 SNPs using published risk loci of schizophrenia, and used these SNPs to build a model for predicting schizophrenia via comparison of genotype frequencies and regression. We evaluated the efficacy of the diagnostic model in schizophrenia and control patients using ROC curves and then used the 70 patients with bipolar disorder to evaluate the model's differential diagnostic efficacy. Results 5 SNPs were selected to construct the model: rs148415900, rs71428218, rs4666990, rs112222723 and rs1716180. Correlation analysis results suggested that, compared with the risk SNP of 0, the risk SNP of 3 was associated with an increased risk of schizophrenia (OR = 13.00, 95% CI: 2.35-71.84, p = 0.003). The ROC-AUC of this prediction model for schizophrenia was 0.719 (95% CI: 0.634-0.804), with the greatest sensitivity and specificity being 60% and 80%, respectively. The ROC-AUC of the model in distinguishing between schizophrenia and bipolar disorder was 0.591 (95% CI: 0.497-0.686), with the greatest sensitivity and specificity being 60% and 55.7%, respectively. Conclusion The SNP risk score prediction model had good performance in predicting schizophrenia. To the best of our knowledge, previous studies have not applied SNP-based models to differentiate between cases of schizophrenia and other mental illnesses. It could have several potential clinical applications, including shaping disease diagnosis, treatment, and outcomes.
Collapse
Affiliation(s)
- Zhiying Yang
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, People’s Republic of China
| | - Shun Yao
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, People’s Republic of China
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Jiao Tong University School of Medicine, Shanghai, People’s Republic of China
| | - Yichong Xu
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, People’s Republic of China
| | - Xiaoqing Zhang
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, People’s Republic of China
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Jiao Tong University School of Medicine, Shanghai, People’s Republic of China
| | - Yuan Shi
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, People’s Republic of China
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Jiao Tong University School of Medicine, Shanghai, People’s Republic of China
| | - Lijun Wang
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, People’s Republic of China
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Jiao Tong University School of Medicine, Shanghai, People’s Republic of China
| | - Donghong Cui
- Shanghai Mental Health Center, Shanghai Jiao Tong University School of Medicine, Shanghai, People’s Republic of China
- Shanghai Key Laboratory of Psychotic Disorders, Shanghai Jiao Tong University School of Medicine, Shanghai, People’s Republic of China
| |
Collapse
|
2
|
Lee S, Yang HK, Lee HJ, Park DJ, Kong SH, Park SK. Systematic review of gastric cancer-associated genetic variants, gene-based meta-analysis, and gene-level functional analysis to identify candidate genes for drug development. Front Genet 2022; 13:928783. [PMID: 36081994 PMCID: PMC9446437 DOI: 10.3389/fgene.2022.928783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Accepted: 07/25/2022] [Indexed: 12/05/2022] Open
Abstract
Objective: Despite being a powerful tool to identify novel variants, genome-wide association studies (GWAS) are not sufficient to explain the biological function of variants. In this study, we aimed to elucidate at the gene level the biological mechanisms involved in gastric cancer (GC) development and to identify candidate drug target genes. Materials and methods: We conducted a systematic review for GWAS on GC following the PRISMA guidelines. Single nucleotide polymorphism (SNP)-level meta-analysis and gene-based analysis (GBA) were performed to identify SNPs and genes significantly associated with GC. Expression quantitative trait loci (eQTL), disease network, pathway enrichment, gene ontology, gene-drug, and chemical interaction analyses were conducted to elucidate the function of the genes identified by GBA. Results: A review of GWAS on GC identified 226 SNPs located in 91 genes. In the comprehensive GBA, 44 genes associated with GC were identified, among which 12 genes (THBS3, GBAP1, KRTCAP2, TRIM46, HCN3, MUC1, DAP3, EFNA1, MTX1, PRKAA1, PSCA, and ABO) were eQTL. Using disease network and pathway analyses, we identified that PRKAA, THBS3, and EFNA1 were significantly associated with the PI3K-Alt-mTOR-signaling pathway, which is involved in various oncogenic processes, and that MUC1 acts as a regulator in both the PI3K-Alt-mTOR and P53 signaling pathways. Furthermore, RPKAA1 had the highest number of interactions with drugs and chemicals. Conclusion: Our study suggests that PRKAA1, a gene in the PI3K-Alt-mTOR-signaling pathway, could be a potential target gene for drug development associated with GC in the future. Systematic Review Registration: website, identifier registration number.
Collapse
Affiliation(s)
- Sangjun Lee
- Department of Preventive Medicine, Seoul National University College of Medicine, Seoul, South Korea
- Cancer Research Institute, Seoul National University College of Medicine, Seoul, South Korea
- Department of Biomedical Sciences, Seoul National University Graduate School, Seoul, South Korea
| | - Han-Kwang Yang
- Department of Surgery and Cancer Research Institute, Seoul National University College of Medicine, Seoul, South Korea
| | - Hyuk-Joon Lee
- Department of Surgery and Cancer Research Institute, Seoul National University College of Medicine, Seoul, South Korea
| | - Do Joong Park
- Department of Surgery and Cancer Research Institute, Seoul National University College of Medicine, Seoul, South Korea
| | - Seong-Ho Kong
- Department of Surgery and Cancer Research Institute, Seoul National University College of Medicine, Seoul, South Korea
| | - Sue K. Park
- Department of Preventive Medicine, Seoul National University College of Medicine, Seoul, South Korea
- Cancer Research Institute, Seoul National University College of Medicine, Seoul, South Korea
- Integrated Major in Innovative Medical Science, Seoul National University College of Medicine, Seoul, South Korea
| |
Collapse
|
3
|
Hébert F, Causeur D, Emily M. Omnibus testing approach for gene-based gene-gene interaction. Stat Med 2022; 41:2854-2878. [PMID: 35338506 DOI: 10.1002/sim.9389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2020] [Revised: 03/03/2022] [Accepted: 03/04/2022] [Indexed: 11/07/2022]
Abstract
Genetic interaction is considered as one of the main heritable component of complex traits. With the emergence of genome-wide association studies (GWAS), a collection of statistical methods dedicated to the identification of interaction at the SNP level have been proposed. More recently, gene-based gene-gene interaction testing has emerged as an attractive alternative as they confer advantage in both statistical power and biological interpretation. Most of the gene-based interaction methods rely on a multidimensional modeling of the interaction, thus facing a lack of robustness against the huge space of interaction patterns. In this paper, we study a global testing approaches to address the issue of gene-based gene-gene interaction. Based on a logistic regression modeling framework, all SNP-SNP interaction tests are combined to produce a gene-level test for interaction. We propose an omnibus test that takes advantage of (1) the heterogeneity between existing global tests and (2) the complementarity between allele-based and genotype-based coding of SNPs. Through an extensive simulation study, it is demonstrated that the proposed omnibus test has the ability to detect with high power the most common interaction genetic models with one causal pair as well as more complex genetic models where more than one causal pair is involved. On the other hand, the flexibility of the proposed approach is shown to be robust and improves power compared to single global tests in replication studies. Furthermore, the application of our procedure to real datasets confirms the adaptability of our approach to replicate various gene-gene interactions.
Collapse
Affiliation(s)
- Florian Hébert
- Department of Statistics and Computer Science, Institut Agro, CNRS, IRMAR, Univ Rennes, F-35000, Rennes, France
| | - David Causeur
- Department of Statistics and Computer Science, Institut Agro, CNRS, IRMAR, Univ Rennes, F-35000, Rennes, France
| | - Mathieu Emily
- Department of Statistics and Computer Science, Institut Agro, CNRS, IRMAR, Univ Rennes, F-35000, Rennes, France
| |
Collapse
|
4
|
Exploring single nucleotide polymorphisms in GH, IGF-I, MC4R and DGAT1 genes as predictors for growth performance in dromedary camel using multiple linear regression analysis. Small Rumin Res 2022. [DOI: 10.1016/j.smallrumres.2022.106619] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
5
|
Crowder SL, Hoogland AI, Welniak TL, LaFranchise EA, Carpenter KM, Li D, Rotroff DM, Mariam A, Pierce CM, Extermann M, Kim RD, Tometich DB, Figueiredo JC, Muzaffar J, Bari S, Turner K, Weinstock GM, Jim HS. Metagenomics and chemotherapy-induced nausea: A roadmap for future research. Cancer 2022; 128:461-470. [PMID: 34643945 PMCID: PMC8776572 DOI: 10.1002/cncr.33892] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Revised: 07/06/2021] [Accepted: 08/13/2021] [Indexed: 02/03/2023]
Abstract
Uncontrolled chemotherapy-induced nausea and vomiting can reduce patients' quality of life and may result in premature discontinuation of chemotherapy. Although nausea and vomiting are commonly grouped together, research has shown that antiemetics are clinically effective against chemotherapy-induced vomiting (CIV) but less so against chemotherapy-induced nausea (CIN). Nausea remains a problem for up to 68% of patients who are prescribed guideline-consistent antiemetics. Despite the high prevalence of CIN, relatively little is known regarding its etiology independent of CIV. This review summarizes a metagenomics approach to the study and treatment of CIN with the goal of encouraging future research. Metagenomics focuses on genetic risk factors and encompasses both human (ie, host) and gut microbial genetic variation. Little work to date has focused on metagenomics as a putative biological mechanism of CIN. Metagenomics has the potential to be a powerful tool in advancing scientific understanding of CIN by identifying new biological pathways and intervention targets. The investigation of metagenomics in the context of well-established demographic, clinical, and patient-reported risk factors may help to identify patients at risk and facilitate the prevention and management of CIN.
Collapse
Affiliation(s)
| | | | | | | | | | - Daneng Li
- Department of Medical Oncology and Therapeutics Research, City of Hope Comprehensive Cancer Center
| | - Daniel M. Rotroff
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
| | - Arshiya Mariam
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, Cleveland, OH, USA
| | | | | | - Richard D. Kim
- Department of Hematology Oncology, Moffitt Cancer Center
| | | | | | - Jameel Muzaffar
- Department of Head and Neck-Endocrine Oncology, Moffitt Cancer Center
| | - Shahla Bari
- Department of Hematology Oncology, Moffitt Cancer Center
| | - Kea Turner
- Department of Health Outcomes and Behavior, Moffitt Cancer Center
| | | | - Heather S.L. Jim
- Department of Health Outcomes and Behavior, Moffitt Cancer Center
| |
Collapse
|
6
|
Mirza N, Stevelink R, Taweel B, Koeleman BPC, Marson AG. Using common genetic variants to find drugs for common epilepsies. Brain Commun 2021; 3:fcab287. [PMID: 34988442 PMCID: PMC8710935 DOI: 10.1093/braincomms/fcab287] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Revised: 09/17/2021] [Accepted: 10/20/2021] [Indexed: 12/18/2022] Open
Abstract
Better drugs are needed for common epilepsies. Drug repurposing offers the potential of significant savings in the time and cost of developing new treatments. In order to select the best candidate drug(s) to repurpose for a disease, it is desirable to predict the relative clinical efficacy that drugs will have against the disease. Common epilepsy can be divided into different types and syndromes. Different antiseizure medications are most effective for different types and syndromes of common epilepsy. For predictions of antiepileptic efficacy to be clinically translatable, it is essential that the predictions are specific to each form of common epilepsy, and reflect the patterns of drug efficacy observed in clinical studies and practice. These requirements are not fulfilled by previously published drug predictions for epilepsy. We developed a novel method for predicting the relative efficacy of drugs against any common epilepsy, by using its Genome-Wide Association Study summary statistics and drugs' activity data. The methodological advancement in our technique is that the drug predictions for a disease are based upon drugs' effects on the function and abundance of proteins, and the magnitude and direction of those effects, relative to the importance, degree and direction of the proteins' dysregulation in the disease. We used this method to predict the relative efficacy of all drugs, licensed for any condition, against each of the major types and syndromes of common epilepsy. Our predictions are concordant with findings from real-world experience and randomized clinical trials. Our method predicts the efficacy of existing antiseizure medications against common epilepsies; in this prediction, our method outperforms the best alternative existing method: area under receiver operating characteristic curve (mean ± standard deviation) 0.83 ± 0.03 and 0.63 ± 0.04, respectively. Importantly, our method predicts which antiseizure medications are amongst the more efficacious in clinical practice, and which antiseizure medications are amongst the less efficacious in clinical practice, for each of the main syndromes of common epilepsy, and it predicts the distinct order of efficacy of individual antiseizure medications in clinical trials of different common epilepsies. We identify promising candidate drugs for each of the major syndromes of common epilepsy. We screen five promising predicted drugs in an animal model: each exerts a significant dose-dependent effect upon seizures. Our predictions are a novel resource for selecting suitable candidate drugs that could potentially be repurposed for each of the major syndromes of common epilepsy. Our method is potentially generalizable to other complex diseases.
Collapse
Affiliation(s)
- Nasir Mirza
- Department of Pharmacology & Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 3GE, UK
| | - Remi Stevelink
- Department of Genetics, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht 3584 CX, the Netherlands; member of the ERN EpiCARE
- Department of Child Neurology, University Medical Center Utrecht Brain Center, Utrecht 3584 CX, the Netherlands
| | - Basel Taweel
- School of Medicine, University of Liverpool, Liverpool L69 3GE, UK
| | - Bobby P C Koeleman
- Department of Genetics, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht 3584 CX, the Netherlands; member of the ERN EpiCARE
| | - Anthony G Marson
- Department of Pharmacology & Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 3GE, UK
| |
Collapse
|
7
|
DNA polymorphisms of FGFBP1, leptin, κ-casein, and αs1-casein genes and their association with reproductive performance in dromedary she-camels. Theriogenology 2021; 178:18-29. [PMID: 34749032 DOI: 10.1016/j.theriogenology.2021.11.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Revised: 10/31/2021] [Accepted: 11/01/2021] [Indexed: 11/22/2022]
Abstract
In livestock, selection programs on reproductive traits are time consuming due to long generation interval and low heritability; however understanding the genetic factors that influence reproductive performance facilitates an accurate selection and genetic improvement of these quantitative traits. Therefore, the aim of this study was to identify the effect of lactation order and polymorphisms of FGFBP1, leptin, κ-casein, and αs1-casein genes on reproductive performance in dromedary she-camels. Blood samples were collected from one hundred she-camels using vacutainer tube containing EDTA as an anticoagulant for DNA extraction. PCR-DNA sequencing approach was performed to investigate the polymorphisms in fragments for coding sequence (CDS) parts of FGFBP1 (539-bp) and leptin (359-bp), 5'-flanking region of κ-casein (488-bp) and intervening non-coding fragment spanning from exon 4 to exon 6 (924-bp) of αs1-casein genes. Results revealed that there were single nucleotide polymorphisms (SNPs) in the investigated genes between enrolled she-camels except for leptin gene that elicited a monomorphic pattern. The identified SNPs allocated camels into different groups according to which animal harbored the explored SNPs. Multiple linear regression analysis (MLR) reveled that identified SNPs and parity order had significant effects on age at first calving (AFC), days open (DO), calving interval (CI), number of services/conception (NS/C) and milk yield traits. Determination coefficients (R2) value indicated also that variability in the phenotypic measurements of studied traits might be correlated with the identified SNPs in the reproduction related genes. This study highlights that parity order has a remarkable effect on reproductive performance. Single nucleotide polymorphisms in the investigated genes could be also used as proxy predictors and genetic markers that permit a marker assisted selection for reproductive performance traits in dromedary she-camels.
Collapse
|
8
|
Targeting Common Signaling Pathways for the Treatment of Stroke and Alzheimer's: a Comprehensive Review. Neurotox Res 2021; 39:1589-1612. [PMID: 34169405 DOI: 10.1007/s12640-021-00381-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 04/11/2021] [Accepted: 05/24/2021] [Indexed: 12/30/2022]
Abstract
Neurodegenerative diseases such as stroke and Alzheimer's disease (AD) are two inter-related disorders that affect the neurons in the brain and central nervous system. Alzheimer's is a disease by undefined origin and causes. Stroke and its most common type, ischemic stroke (IS), occurs due to the blockade of cerebral blood vessels. As an important feature, both of disorders are associated with irreversible damages to the brain and nervous system. In this regard, finding common signaling pathways and the same molecular origin between these two diseases may be a promising way for their solution. On the basis of literature appraisal, the most common signaling cascades implicated in the pathogenesis of AD and stroke including notch, autophagy, inflammatory, and insulin signaling pathways were reviewed. Furthermore, current therapeutic strategies including natural and synthetic pharmaceuticals aiming modulation of respective signaling factors were scrutinized to ameliorate neural deficits in AD and stroke. Taken together, digging deeper in the common connections and signal targeting can be greatly helpful in understanding and unified treating of these disorders.
Collapse
|
9
|
Deng Y, Wu S, Fan H. Genome-wide pathway-based quantitative multiple phenotypes analysis. PLoS One 2020; 15:e0240910. [PMID: 33175855 PMCID: PMC7657528 DOI: 10.1371/journal.pone.0240910] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2020] [Accepted: 10/06/2020] [Indexed: 11/18/2022] Open
Abstract
For complex diseases, genome-wide pathway association studies have become increasingly promising. Currently, however, pathway-based association analysis mainly focus on a single phenotype, which may insufficient to describe the complex diseases and physiological processes. This work proposes a combination model to evaluate the association between a pathway and multiple phenotypes and to reduce the run time based on asymptotic results. For a single phenotype, we propose a semi-supervised maximum kernel-based U-statistics (mSKU) method to assess the pathway-based association analysis. For multiple phenotypes, we propose the fisher combination function with dependent phenotypes (FC) to transform the p-values between the pathway and each marginal phenotype individually to achieve pathway-based multiple phenotypes analysis. With real data from the Alzheimer Disease Neuroimaging Initiative (ADNI) study and Human Liver Cohort (HLC) study, the FC-mSKU method allows us to specify which pathways are specific to a single phenotype or contribute to common genetic constructions of multiple phenotypes. If we only focus on single-phenotype tests, we may miss some findings for etiology studies. Through extensive simulation studies, the FC-mSKU method demonstrates its advantages compared with its counterparts.
Collapse
Affiliation(s)
- Yamin Deng
- Statistics Center, First Hospital of Shanxi Medical University, Taiyuan, China.,Division of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China
| | - Shiman Wu
- Statistics Center, First Hospital of Shanxi Medical University, Taiyuan, China
| | - Huifang Fan
- Statistics Center, First Hospital of Shanxi Medical University, Taiyuan, China
| |
Collapse
|
10
|
Ghulam A, Lei X, Guo M, Bian C. A Review of Pathway Databases and Related Methods Analysis. Curr Bioinform 2020. [DOI: 10.2174/1574893614666191018162505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Pathway analysis integrates most of the computational tools for the investigation of
high-level and complex human diseases. In the field of bioinformatics research, biological pathways
analysis is an important part of systems biology. The molecular complexities of biological
pathways are difficult to understand in human diseases, which can be explored through pathway
analysis. In this review, we describe essential information related to pathway databases and their
mechanisms, algorithms and methods. In the pathway database analysis, we present a brief introduction
on how to gain knowledge from fundamental pathway data in regard to specific human
pathways and how to use pathway databases and pathway analysis to predict diseases during an
experiment. We also provide detailed information related to computational tools that are used in
complex pathway data analysis, the roles of these tools in the bioinformatics field and how to store
the pathway data. We illustrate various methodological difficulties that are faced during pathway
analysis. The main ideas and techniques for the pathway-based examination approaches are presented.
We provide the list of pathway databases and analytical tools. This review will serve as a
helpful manual for pathway analysis databases.
Collapse
Affiliation(s)
- Ali Ghulam
- School of Computer Science, Shaanxi Normal University, Xian, China
| | - Xiujuan Lei
- School of Computer Science, Shaanxi Normal University, Xian, China
| | - Min Guo
- School of Computer Science, Shaanxi Normal University, Xian, China
| | - Chen Bian
- School of Computer Science, Shaanxi Normal University, Xian, China
| |
Collapse
|
11
|
Escala-Garcia M, Abraham J, Andrulis IL, Anton-Culver H, Arndt V, Ashworth A, Auer PL, Auvinen P, Beckmann MW, Beesley J, Behrens S, Benitez J, Bermisheva M, Blomqvist C, Blot W, Bogdanova NV, Bojesen SE, Bolla MK, Børresen-Dale AL, Brauch H, Brenner H, Brucker SY, Burwinkel B, Caldas C, Canzian F, Chang-Claude J, Chanock SJ, Chin SF, Clarke CL, Couch FJ, Cox A, Cross SS, Czene K, Daly MB, Dennis J, Devilee P, Dunn JA, Dunning AM, Dwek M, Earl HM, Eccles DM, Eliassen AH, Ellberg C, Evans DG, Fasching PA, Figueroa J, Flyger H, Gago-Dominguez M, Gapstur SM, García-Closas M, García-Sáenz JA, Gaudet MM, George A, Giles GG, Goldgar DE, González-Neira A, Grip M, Guénel P, Guo Q, Haiman CA, Håkansson N, Hamann U, Harrington PA, Hiller L, Hooning MJ, Hopper JL, Howell A, Huang CS, Huang G, Hunter DJ, Jakubowska A, John EM, Kaaks R, Kapoor PM, Keeman R, Kitahara CM, Koppert LB, Kraft P, Kristensen VN, Lambrechts D, Le Marchand L, Lejbkowicz F, Lindblom A, Lubiński J, Mannermaa A, Manoochehri M, Manoukian S, Margolin S, Martinez ME, Maurer T, Mavroudis D, Meindl A, Milne RL, Mulligan AM, Neuhausen SL, Nevanlinna H, Newman WG, Olshan AF, Olson JE, Olsson H, Orr N, Peterlongo P, Petridis C, Prentice RL, Presneau N, Punie K, Ramachandran D, Rennert G, Romero A, Sachchithananthan M, Saloustros E, Sawyer EJ, Schmutzler RK, Schwentner L, Scott C, Simard J, Sohn C, Southey MC, Swerdlow AJ, Tamimi RM, Tapper WJ, Teixeira MR, Terry MB, Thorne H, Tollenaar RAEM, Tomlinson I, Troester MA, Truong T, Turnbull C, Vachon CM, van der Kolk LE, Wang Q, Winqvist R, Wolk A, Yang XR, Ziogas A, Pharoah PDP, Hall P, Wessels LFA, Chenevix-Trench G, Bader GD, Dörk T, Easton DF, Canisius S, Schmidt MK. A network analysis to identify mediators of germline-driven differences in breast cancer prognosis. Nat Commun 2020; 11:312. [PMID: 31949161 PMCID: PMC6965101 DOI: 10.1038/s41467-019-14100-6] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Accepted: 12/17/2019] [Indexed: 11/09/2022] Open
Abstract
Identifying the underlying genetic drivers of the heritability of breast cancer prognosis remains elusive. We adapt a network-based approach to handle underpowered complex datasets to provide new insights into the potential function of germline variants in breast cancer prognosis. This network-based analysis studies ~7.3 million variants in 84,457 breast cancer patients in relation to breast cancer survival and confirms the results on 12,381 independent patients. Aggregating the prognostic effects of genetic variants across multiple genes, we identify four gene modules associated with survival in estrogen receptor (ER)-negative and one in ER-positive disease. The modules show biological enrichment for cancer-related processes such as G-alpha signaling, circadian clock, angiogenesis, and Rho-GTPases in apoptosis.
Collapse
Affiliation(s)
- Maria Escala-Garcia
- Division of Molecular Pathology, The Netherlands Cancer Institute - Antoni van Leeuwenhoek Hospital, Amsterdam, The Netherlands
| | - Jean Abraham
- Department of Oncology, Centre for Cancer Genetic Epidemiology, University of Cambridge, Cambridge, UK
- Cambridge Experimental Cancer Medicine Centre, Cambridge, UK
- Cambridge Breast Unit and NIHR Cambridge Biomedical Research Centre, University of Cambridge NHS Foundation Hospitals, Cambridge, UK
| | - Irene L Andrulis
- Fred A. Litwin Center for Cancer Genetics, Lunenfeld-Tanenbaum Research Institute of Mount Sinai Hospital, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Canada
| | - Hoda Anton-Culver
- Department of Epidemiology, Genetic Epidemiology Research Institute, University of California Irvine, Irvine, CA, USA
| | - Volker Arndt
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Alan Ashworth
- UCSF Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, San Francisco, CA, USA
| | - Paul L Auer
- Cancer Prevention Program, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
- Zilber School of Public Health, University of Wisconsin-Milwaukee, Milwaukee, WI, USA
| | - Päivi Auvinen
- Cancer Center, Kuopio University Hospital, Kuopio, Finland
- Institute of Clinical Medicine, Oncology, University of Eastern Finland, Kuopio, Finland
- Translational Cancer Research Area, University of Eastern Finland, Kuopio, Finland
| | - Matthias W Beckmann
- Department of Gynecology and Obstetrics, Comprehensive Cancer Center ER-EMN, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nuremberg, Erlangen, Germany
| | - Jonathan Beesley
- Department of Genetics and Computational Biology, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
| | - Sabine Behrens
- Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Javier Benitez
- Human Cancer Genetics Programme, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
- Biomedical Network on Rare Diseases (CIBERER), Madrid, Spain
| | - Marina Bermisheva
- Institute of Biochemistry and Genetics, Ufa Scientific Center of Russian Academy of Sciences, Ufa, Russia
| | - Carl Blomqvist
- Department of Oncology, Helsinki University Hospital, University of Helsinki, Helsinki, Finland
- Department of Oncology, Örebro University Hospital, Örebro, Sweden
| | - William Blot
- Division of Epidemiology, Department of Medicine, Vanderbilt Epidemiology Center, Vanderbilt-Ingram Cancer Center, Vanderbilt University School of Medicine, Nashville, TN, USA
- International Epidemiology Institute, Rockville, MD, USA
| | - Natalia V Bogdanova
- Department of Radiation Oncology, Hannover Medical School, Hannover, Germany
- Gynaecology Research Unit, Hannover Medical School, Hannover, Germany
- N.N. Alexandrov Research Institute of Oncology and Medical Radiology, Minsk, Belarus
| | - Stig E Bojesen
- Copenhagen General Population Study, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev, Denmark
- Department of Clinical Biochemistry, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev, Denmark
- Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Manjeet K Bolla
- Department of Public Health and Primary Care, Centre for Cancer Genetic Epidemiology, University of Cambridge, Cambridge, UK
| | - Anne-Lise Børresen-Dale
- Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital-Radiumhospitalet, Oslo, Norway
- Institute of Clinical Medicine, Faculty of Medicine, University of Oslo, Oslo, Norway
| | - Hiltrud Brauch
- Dr. Margarete Fischer-Bosch-Institute of Clinical Pharmacology, Stuttgart, Germany
- iFIT-Cluster of Excellence, University of Tuebingen, Tuebingen, Germany
- German Cancer Research Center (DKFZ), German Cancer Consortium (DKTK), Heidelberg, Germany
| | - Hermann Brenner
- Division of Clinical Epidemiology and Aging Research, German Cancer Research Center (DKFZ), Heidelberg, Germany
- German Cancer Research Center (DKFZ), German Cancer Consortium (DKTK), Heidelberg, Germany
- Division of Preventive Oncology, German Cancer Research Center (DKFZ) and National Center for Tumor Diseases (NCT), Heidelberg, Germany
| | - Sara Y Brucker
- Department of Gynecology and Obstetrics, University of Tübingen, Tübingen, Germany
| | - Barbara Burwinkel
- Molecular Epidemiology Group, C080, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Molecular Biology of Breast Cancer, University Womens Clinic Heidelberg, University of Heidelberg, Heidelberg, Germany
| | - Carlos Caldas
- Cancer Research UK Cambridge Institute, Department of Oncology, Li Ka Shing Centre, University of Cambridge, Cambridge, UK
- Breast Cancer Programme, CRUK Cambridge Cancer Centre and NIHR Biomedical Research Centre, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| | - Federico Canzian
- Genomic Epidemiology Group, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Jenny Chang-Claude
- Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Cancer Epidemiology Group, University Cancer Center Hamburg (UCCH), University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Stephen J Chanock
- Division of Cancer Epidemiology and Genetics, Department of Health and Human Services, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Suet-Feung Chin
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
| | - Christine L Clarke
- Westmead Institute for Medical Research, University of Sydney, Sydney, NSW, Australia
| | - Fergus J Couch
- Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA
| | - Angela Cox
- Department of Oncology and Metabolism, Sheffield Institute for Nucleic Acids (SInFoNiA), University of Sheffield, Sheffield, UK
| | - Simon S Cross
- Academic Unit of Pathology, Department of Neuroscience, University of Sheffield, Sheffield, UK
| | - Kamila Czene
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Mary B Daly
- Department of Clinical Genetics, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Joe Dennis
- Department of Public Health and Primary Care, Centre for Cancer Genetic Epidemiology, University of Cambridge, Cambridge, UK
| | - Peter Devilee
- Department of Pathology, Leiden University Medical Center, Leiden, The Netherlands
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Janet A Dunn
- Warwick Clinical Trials Unit, University of Warwick, Coventry, UK
| | - Alison M Dunning
- Department of Oncology, Centre for Cancer Genetic Epidemiology, University of Cambridge, Cambridge, UK
| | - Miriam Dwek
- Department of Biomedical Sciences, Faculty of Science and Technology, University of Westminster, London, UK
| | - Helena M Earl
- Cambridge Breast Unit and NIHR Cambridge Biomedical Research Centre, University of Cambridge NHS Foundation Hospitals, Cambridge, UK
- Department of Oncology, University of Cambridge, Cambridge, UK
| | - Diana M Eccles
- Cancer Sciences Academic Unit, Faculty of Medicine, University of Southampton, Southampton, UK
| | - A Heather Eliassen
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Carolina Ellberg
- Department of Cancer Epidemiology, Clinical Sciences, Lund University, Lund, Sweden
| | - D Gareth Evans
- Division of Evolution and Genomic Medicine, School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, University of Manchester, Manchester, UK
- Genomic Medicine, St Mary's Hospital, Manchester Centre for Genomic Medicine, Manchester University Hospitals NHS Foundation Trust, Manchester Academic Health Science Centre, Manchester, UK
- NIHR Manchester Biomedical Research Centre, Manchester Academic Health Science Centre, Manchester University NHS Foundation Trust, Manchester, UK
| | - Peter A Fasching
- Department of Gynecology and Obstetrics, Comprehensive Cancer Center ER-EMN, University Hospital Erlangen, Friedrich-Alexander-University Erlangen-Nuremberg, Erlangen, Germany
- Division of Hematology and Oncology, Department of Medicine, David Geffen School of Medicine, University of California at Los Angeles, Los Angeles, CA, USA
| | - Jonine Figueroa
- Division of Cancer Epidemiology and Genetics, Department of Health and Human Services, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
- Usher Institute of Population Health Sciences and Informatics, The University of Edinburgh Medical School, Edinburgh, UK
- Cancer Research UK Edinburgh Centre, Edinburgh, UK
| | - Henrik Flyger
- Department of Breast Surgery, Herlev and Gentofte Hospital, Copenhagen University Hospital, Herlev, Denmark
| | - Manuela Gago-Dominguez
- Genomic Medicine Group, Galician Foundation of Genomic Medicine, Instituto de Investigación Sanitaria de Santiago de Compostela (IDIS), Complejo Hospitalario Universitario de Santiago, SERGAS, Santiago de Compostela, Spain
- Moores Cancer Center, University of California San Diego, La Jolla, CA, USA
| | - Susan M Gapstur
- Epidemiology Research Program, American Cancer Society, Atlanta, GA, USA
| | - Montserrat García-Closas
- Division of Cancer Epidemiology and Genetics, Department of Health and Human Services, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK
| | - José A García-Sáenz
- Medical Oncology Department, Hospital Clínico San Carlos, Instituto de Investigación Sanitaria San Carlos (IdISSC), Centro Investigación Biomédica en Red de Cáncer (CIBERONC), Madrid, Spain
| | - Mia M Gaudet
- Epidemiology Research Program, American Cancer Society, Atlanta, GA, USA
| | - Angela George
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, UK
| | - Graham G Giles
- Cancer Epidemiology Division, Cancer Council Victoria, Melbourne, VIC, Australia
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, VIC, Australia
- Department of Epidemiology and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| | - David E Goldgar
- Department of Dermatology, Huntsman Cancer Institute, University of Utah School of Medicine, Salt Lake City, UT, USA
| | - Anna González-Neira
- Human Cancer Genetics Programme, Spanish National Cancer Research Centre (CNIO), Madrid, Spain
| | - Mervi Grip
- Department of Surgery, Oulu University Hospital, University of Oulu, Oulu, Finland
| | - Pascal Guénel
- Cancer & Environment Group, Center for Research in Epidemiology and Population Health (CESP), University Paris-Saclay, INSERM, University Paris-Sud, Villejuif, France
| | - Qi Guo
- Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Christopher A Haiman
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Niclas Håkansson
- Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Ute Hamann
- Molecular Genetics of Breast Cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Patricia A Harrington
- Department of Oncology, Centre for Cancer Genetic Epidemiology, University of Cambridge, Cambridge, UK
| | - Louise Hiller
- Warwick Clinical Trials Unit, University of Warwick, Coventry, UK
| | - Maartje J Hooning
- Department of Medical Oncology, Family Cancer Clinic, Erasmus MC Cancer Institute, Rotterdam, The Netherlands
| | - John L Hopper
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, VIC, Australia
| | - Anthony Howell
- Division of Cancer Sciences, University of Manchester, Manchester, UK
| | - Chiun-Sheng Huang
- Department of Surgery, National Taiwan University Hospital and National Taiwan University College of Medicine, Taipei, Taiwan
| | - Guanmengqian Huang
- Molecular Genetics of Breast Cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - David J Hunter
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Anna Jakubowska
- Department of Genetics and Pathology, Pomeranian Medical University, Szczecin, Poland
- Independent Laboratory of Molecular Biology and Genetic Diagnostics, Pomeranian Medical University, Szczecin, Poland
| | - Esther M John
- Division of Oncology, Department of Medicine, Stanford Cancer Institute, Stanford University School of Medicine, Stanford, CA, USA
| | - Rudolf Kaaks
- Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Pooja Middha Kapoor
- Division of Cancer Epidemiology, German Cancer Research Center (DKFZ), Heidelberg, Germany
- Faculty of Medicine, University of Heidelberg, Heidelberg, Germany
| | - Renske Keeman
- Division of Molecular Pathology, The Netherlands Cancer Institute - Antoni van Leeuwenhoek Hospital, Amsterdam, The Netherlands
| | - Cari M Kitahara
- Radiation Epidemiology Branch, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Linetta B Koppert
- Department of Surgical Oncology, Family Cancer Clinic, Erasmus MC Cancer Institute, Rotterdam, The Netherlands
| | - Peter Kraft
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Vessela N Kristensen
- Department of Cancer Genetics, Institute for Cancer Research, Oslo University Hospital-Radiumhospitalet, Oslo, Norway
- Institute of Clinical Medicine, Faculty of Medicine, University of Oslo, Oslo, Norway
| | - Diether Lambrechts
- VIB, VIB Center for Cancer Biology, Leuven, Belgium
- Laboratory for Translational Genetics, Department of Human Genetics, University of Leuven, Leuven, Belgium
| | - Loic Le Marchand
- Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI, USA
| | - Flavio Lejbkowicz
- Carmel Medical Center and Technion Faculty of Medicine, Clalit National Cancer Control Center, Haifa, Israel
| | - Annika Lindblom
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
- Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Jan Lubiński
- Department of Genetics and Pathology, Pomeranian Medical University, Szczecin, Poland
| | - Arto Mannermaa
- Translational Cancer Research Area, University of Eastern Finland, Kuopio, Finland
- Institute of Clinical Medicine, Pathology and Forensic Medicine, University of Eastern Finland, Kuopio, Finland
- Department of Clinical Pathology, Imaging Center, Kuopio University Hospital, Kuopio, Finland
| | - Mehdi Manoochehri
- Molecular Genetics of Breast Cancer, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Siranoush Manoukian
- Unit of Medical Genetics, Department of Medical Oncology and Hematology, Fondazione IRCCS Istituto Nazionale dei Tumori di Milano (INT), Milan, Italy
| | - Sara Margolin
- Department of Oncology, Sšdersjukhuset, Stockholm, Sweden
- Department of Clinical Science and Education, Sšdersjukhuset, Karolinska Institutet, Stockholm, Sweden
| | - Maria Elena Martinez
- Moores Cancer Center, University of California San Diego, La Jolla, CA, USA
- Department of Family Medicine and Public Health, University of California San Diego, La Jolla, CA, USA
| | - Tabea Maurer
- Cancer Epidemiology Group, University Cancer Center Hamburg (UCCH), University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Dimitrios Mavroudis
- Department of Medical Oncology, University Hospital of Heraklion, Heraklion, Greece
| | - Alfons Meindl
- Department of Gynecology and Obstetrics, Ludwig Maximilian University of Munich, Munich, Germany
| | - Roger L Milne
- Cancer Epidemiology Division, Cancer Council Victoria, Melbourne, VIC, Australia
- Centre for Epidemiology and Biostatistics, Melbourne School of Population and Global Health, The University of Melbourne, Melbourne, VIC, Australia
- Precision Medicine, School of Clinical Sciences at Monash Health, Monash University, Clayton, VIC, Australia
| | - Anna Marie Mulligan
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada
- Laboratory Medicine Program, University Health Network, Toronto, ON, Canada
| | - Susan L Neuhausen
- Department of Population Sciences, Beckman Research Institute of City of Hope, Duarte, CA, USA
| | - Heli Nevanlinna
- Department of Obstetrics and Gynecology, Helsinki University Hospital, University of Helsinki, Helsinki, Finland
| | - William G Newman
- Division of Evolution and Genomic Medicine, School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, University of Manchester, Manchester, UK
- Genomic Medicine, St Mary's Hospital, Manchester Centre for Genomic Medicine, Manchester University Hospitals NHS Foundation Trust, Manchester Academic Health Science Centre, Manchester, UK
| | - Andrew F Olshan
- Department of Epidemiology, Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Janet E Olson
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Håkan Olsson
- Department of Cancer Epidemiology, Clinical Sciences, Lund University, Lund, Sweden
| | - Nick Orr
- Centre for Cancer Research and Cell Biology, Queen's University Belfast, Belfast, Ireland, UK
| | - Paolo Peterlongo
- Genome Diagnostics Program, IFOM - the FIRC (Italian Foundation for Cancer Research) Institute of Molecular Oncology, Milan, Italy
| | - Christos Petridis
- Research Oncology, Guy's Hospital, King's College London, London, UK
| | - Ross L Prentice
- Cancer Prevention Program, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Nadege Presneau
- Department of Biomedical Sciences, Faculty of Science and Technology, University of Westminster, London, UK
| | - Kevin Punie
- Department of Oncology, Leuven Multidisciplinary Breast Center, Leuven Cancer Institute, University Hospitals Leuven, Leuven, Belgium
| | | | - Gad Rennert
- Carmel Medical Center and Technion Faculty of Medicine, Clalit National Cancer Control Center, Haifa, Israel
| | - Atocha Romero
- Medical Oncology Department, Hospital Universitario Puerta de Hierro, Madrid, Spain
| | | | | | - Elinor J Sawyer
- Research Oncology, Guy's Hospital, King's College London, London, UK
| | - Rita K Schmutzler
- Center for Hereditary Breast and Ovarian Cancer, University Hospital of Cologne, Cologne, Germany
- Center for Molecular Medicine Cologne (CMMC), University of Cologne, Cologne, Germany
| | - Lukas Schwentner
- Department of Gynaecology and Obstetrics, University Hospital Ulm, Ulm, Germany
| | - Christopher Scott
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Jacques Simard
- Genomics Center, Research Center, Centre Hospitalier Universitaire de Québec - Université Laval, Québec City, QC, Canada
| | - Christof Sohn
- National Center for Tumor Diseases, University of Heidelberg, Heidelberg, Germany
| | - Melissa C Southey
- Precision Medicine, School of Clinical Sciences at Monash Health, Monash University, Clayton, VIC, Australia
- Department of Clinical Pathology, The University of Melbourne, Melbourne, VIC, Australia
| | - Anthony J Swerdlow
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, UK
- Division of Breast Cancer Research, The Institute of Cancer Research, London, UK
| | - Rulla M Tamimi
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
- Program in Genetic Epidemiology and Statistical Genetics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | | | - Manuel R Teixeira
- Department of Genetics, Portuguese Oncology Institute, Porto, Portugal
- Biomedical Sciences Institute (ICBAS), University of Porto, Porto, Portugal
| | - Mary Beth Terry
- Department of Epidemiology, Mailman School of Public Health, Columbia University, New York, NY, USA
| | - Heather Thorne
- Peter MacCallum Cancer Center, Melbourne, VIC, Australia
- Sir Peter MacCallum Department of Oncology, The University of Melbourne, Melbourne, VIC, Australia
| | - Rob A E M Tollenaar
- Department of Surgery, Leiden University Medical Center, Leiden, The Netherlands
| | - Ian Tomlinson
- Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham, UK
- Wellcome Trust Centre for Human Genetics and Oxford NIHR Biomedical Research Centre, University of Oxford, Oxford, UK
| | - Melissa A Troester
- Department of Epidemiology, Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Thérèse Truong
- Cancer & Environment Group, Center for Research in Epidemiology and Population Health (CESP), University Paris-Saclay, INSERM, University Paris-Sud, Villejuif, France
| | - Clare Turnbull
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, UK
| | - Celine M Vachon
- Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
| | - Lizet E van der Kolk
- Family Cancer Clinic, The Netherlands Cancer Institute - Antoni van Leeuwenhoek Hospital, Amsterdam, The Netherlands
| | - Qin Wang
- Department of Public Health and Primary Care, Centre for Cancer Genetic Epidemiology, University of Cambridge, Cambridge, UK
| | - Robert Winqvist
- Biocenter Oulu, Cancer and Translational Medicine Research Unit, Laboratory of Cancer Genetics and Tumor Biology, University of Oulu, Oulu, Finland
- Laboratory of Cancer Genetics and Tumor Biology, Northern Finland Laboratory Centre Oulu, Oulu, Finland
| | - Alicja Wolk
- Institute of Environmental Medicine, Karolinska Institutet, Stockholm, Sweden
- Department of Surgical Sciences, Uppsala University, Uppsala, Sweden
| | - Xiaohong R Yang
- Division of Cancer Epidemiology and Genetics, Department of Health and Human Services, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA
| | - Argyrios Ziogas
- Department of Epidemiology, Genetic Epidemiology Research Institute, University of California Irvine, Irvine, CA, USA
| | - Paul D P Pharoah
- Department of Oncology, Centre for Cancer Genetic Epidemiology, University of Cambridge, Cambridge, UK
- Department of Public Health and Primary Care, Centre for Cancer Genetic Epidemiology, University of Cambridge, Cambridge, UK
| | - Per Hall
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
- Department of Oncology, Sšdersjukhuset, Stockholm, Sweden
| | - Lodewyk F A Wessels
- Division of Molecular Carcinogenesis, The Netherlands Cancer Institute - Antoni van Leeuwenhoek Hospital, Amsterdam, The Netherlands
- Faculty of EEMCS, Delft University of Technology, Delft, The Netherlands
| | - Georgia Chenevix-Trench
- Department of Genetics and Computational Biology, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
| | - Gary D Bader
- Department of Molecular Genetics, University of Toronto, Toronto, Canada
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada
| | - Thilo Dörk
- Gynaecology Research Unit, Hannover Medical School, Hannover, Germany
| | - Douglas F Easton
- Department of Oncology, Centre for Cancer Genetic Epidemiology, University of Cambridge, Cambridge, UK
- Department of Public Health and Primary Care, Centre for Cancer Genetic Epidemiology, University of Cambridge, Cambridge, UK
| | - Sander Canisius
- Division of Molecular Pathology, The Netherlands Cancer Institute - Antoni van Leeuwenhoek Hospital, Amsterdam, The Netherlands.
- Division of Molecular Carcinogenesis, The Netherlands Cancer Institute - Antoni van Leeuwenhoek Hospital, Amsterdam, The Netherlands.
| | - Marjanka K Schmidt
- Division of Molecular Pathology, The Netherlands Cancer Institute - Antoni van Leeuwenhoek Hospital, Amsterdam, The Netherlands.
- Division of Psychosocial Research and Epidemiology, The Netherlands Cancer Institute - Antoni van Leeuwenhoek Hospital, Amsterdam, The Netherlands.
| |
Collapse
|
12
|
Cui P, Ma X, Li H, Lang W, Hao J. Shared Biological Pathways Between Alzheimer's Disease and Ischemic Stroke. Front Neurosci 2018; 12:605. [PMID: 30245614 PMCID: PMC6137293 DOI: 10.3389/fnins.2018.00605] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2018] [Accepted: 08/10/2018] [Indexed: 12/21/2022] Open
Abstract
Alzheimer's disease (AD) and ischemic stroke (IS) are an immense socioeconomic burden worldwide. There is a possibility that shared genetic factors lead to their links at epidemiological and pathophysiological levels. Although recent genome-wide association studies (GWAS) have provided profound insights into the genetics of AD and IS, no shared genetic variants have been identified to date. This prompted us to initiate this study, which sought to identify shared pathways linking AD and IS. We took advantage of large-scale GWAS summary data of AD (17,008 AD cases and 37,154 controls) and IS (10,307 cases and 19,326 controls) to conduct pathway analyses using genetic pathways from multiple well-studied databases, including GO, KEGG, PANTHER, Reactome, and Wikipathways. Collectively, we discovered that AD and IS shared 179 GO categories (56 biological processes, 95 cellular components, and 28 molecular functions); and the following pathways: six KEGG pathways; two PANTHER pathways; four Reactome pathways; and one in Wikipathways pathway. The more fine-grained GO terms were mainly summarized into different functional categories: transcriptional and post-transcriptional regulation, synapse, endocytic membrane traffic through the endosomal system, signaling transduction, immune process, multi-organism process, protein catabolic metabolism, and cell adhesion. The shared pathways were roughly classified into three categories: immune system; cancer (NSCLC and glioma); and signal transduction pathways involving the cadherin signaling pathway, Wnt signaling pathway, G-protein signaling and downstream signaling mediated by phosphoinositides (PIPs). The majority of these common pathways linked to both AD and IS were supported by convincing evidence from the literature. In conclusion, our findings contribute to a better understanding of common biological mechanisms underlying AD and IS and serve as a guide to direct future research.
Collapse
Affiliation(s)
- Pan Cui
- Department of Neurology, Tianjin Neurological Institute, Tianjin Medical University General Hospital, Tianjin, China.,Key Laboratory of Post-neurotrauma Neuro-repair and Regeneration in Central Nervous System, Tianjin Neurological Institute, Ministry of Education and Tianjin City, Tianjin, China
| | - Xiaofeng Ma
- Department of Neurology, Tianjin Neurological Institute, Tianjin Medical University General Hospital, Tianjin, China.,Key Laboratory of Post-neurotrauma Neuro-repair and Regeneration in Central Nervous System, Tianjin Neurological Institute, Ministry of Education and Tianjin City, Tianjin, China
| | - He Li
- Department of Neurology, Tianjin Neurological Institute, Tianjin Medical University General Hospital, Tianjin, China.,Key Laboratory of Post-neurotrauma Neuro-repair and Regeneration in Central Nervous System, Tianjin Neurological Institute, Ministry of Education and Tianjin City, Tianjin, China
| | - Wenjing Lang
- Department of Neurology, Tianjin Neurological Institute, Tianjin Medical University General Hospital, Tianjin, China.,Key Laboratory of Post-neurotrauma Neuro-repair and Regeneration in Central Nervous System, Tianjin Neurological Institute, Ministry of Education and Tianjin City, Tianjin, China
| | - Junwei Hao
- Department of Neurology, Tianjin Neurological Institute, Tianjin Medical University General Hospital, Tianjin, China.,Key Laboratory of Post-neurotrauma Neuro-repair and Regeneration in Central Nervous System, Tianjin Neurological Institute, Ministry of Education and Tianjin City, Tianjin, China
| |
Collapse
|
13
|
Mousavi MJ, Jamshidi A, Chopra A, Aslani S, Akhlaghi M, Mahmoudi M. Implications of the noncoding RNAs in rheumatoid arthritis pathogenesis. J Cell Physiol 2018; 234:335-347. [DOI: 10.1002/jcp.26911] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2018] [Accepted: 06/13/2018] [Indexed: 12/26/2022]
Affiliation(s)
- Mohammad Javad Mousavi
- Rheumatology Research Center, Tehran University of Medical Sciences Tehran Iran
- Department of Hematology Faculty of Allied Medicine, Bushehr University of Medical Sciences Bushehr Iran
- Department of Immunology School of Medicine, Tehran University of Medical Sciences Tehran Iran
| | - Ahmadreza Jamshidi
- Rheumatology Research Center, Tehran University of Medical Sciences Tehran Iran
| | - Arvind Chopra
- Centre for Rheumatic Diseases Pune Maharashtra India
| | - Saeed Aslani
- Rheumatology Research Center, Tehran University of Medical Sciences Tehran Iran
| | - Massoomeh Akhlaghi
- Rheumatology Research Center, Tehran University of Medical Sciences Tehran Iran
| | - Mahdi Mahmoudi
- Rheumatology Research Center, Tehran University of Medical Sciences Tehran Iran
| |
Collapse
|
14
|
Saeed M. Locus and gene-based GWAS meta-analysis identifies new diabetic nephropathy genes. Immunogenetics 2018; 70:347-353. [PMID: 29147756 DOI: 10.1007/s00251-017-1044-0/tables/2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2017] [Accepted: 11/02/2017] [Indexed: 05/22/2023]
Abstract
Objective Assimilation of SNPs Interacting in Synchrony (OASIS) is a locus-based clustering algorithm recently described that can potentially address false positives and negatives in genome-wide association studies (GWAS) of complex disorders. Diabetic nephropathy (DN) is incompletely understood due to a paucity of genes identified despite several GWAS. OASIS was applied to three DN dbGAP GWAS datasets (4725 subjects; 1.06 million SNPs). OASIS identified 19 DN genes which were verified using single variant replication in a standard association study and gene-based analysis using GATES. CARS and FRMD3 were confirmed as DN genes, and five known diabetes-associated genes, viz. NLRP3, INPPL1, PIK3C2G, NRXN3, and TBC1D4, not previously identified using these datasets were discovered. Furthermore, three additional novel DN genes were found which replicated in two sets of analysis, viz. NTN1, EBF2, and DNAH11. Hence, composite analysis with OASIS, gene-based, and single variant association testing can be universally applied to existing GWAS datasets for the identification of new genes.
Collapse
Affiliation(s)
- Mohammad Saeed
- Department of Genomics, Arkana Laboratories, 10810 Executive Center Drive, Suite 100, Little Rock, AR, 72211, USA.
| |
Collapse
|
15
|
Saeed M. Locus and gene-based GWAS meta-analysis identifies new diabetic nephropathy genes. Immunogenetics 2017; 70:347-353. [PMID: 29147756 DOI: 10.1007/s00251-017-1044-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2017] [Accepted: 11/02/2017] [Indexed: 12/13/2022]
Abstract
Objective Assimilation of SNPs Interacting in Synchrony (OASIS) is a locus-based clustering algorithm recently described that can potentially address false positives and negatives in genome-wide association studies (GWAS) of complex disorders. Diabetic nephropathy (DN) is incompletely understood due to a paucity of genes identified despite several GWAS. OASIS was applied to three DN dbGAP GWAS datasets (4725 subjects; 1.06 million SNPs). OASIS identified 19 DN genes which were verified using single variant replication in a standard association study and gene-based analysis using GATES. CARS and FRMD3 were confirmed as DN genes, and five known diabetes-associated genes, viz. NLRP3, INPPL1, PIK3C2G, NRXN3, and TBC1D4, not previously identified using these datasets were discovered. Furthermore, three additional novel DN genes were found which replicated in two sets of analysis, viz. NTN1, EBF2, and DNAH11. Hence, composite analysis with OASIS, gene-based, and single variant association testing can be universally applied to existing GWAS datasets for the identification of new genes.
Collapse
Affiliation(s)
- Mohammad Saeed
- Department of Genomics, Arkana Laboratories, 10810 Executive Center Drive, Suite 100, Little Rock, AR, 72211, USA.
| |
Collapse
|
16
|
Abstract
Combining statistical significances (P-values) from a set of single-locus association tests in genome-wide association studies is a proof-of-principle method for identifying disease-associated genomic segments, functional genes and biological pathways. We review P-value combinations for genome-wide association studies and introduce an integrated analysis tool, Omnibus P-value Association Tests (OPATs), which provides popular analysis methods of P-value combinations. The software OPATs programmed in R and R graphical user interface features a user-friendly interface. In addition to analysis modules for data quality control and single-locus association tests, OPATs provides three types of set-based association test: window-, gene- and biopathway-based association tests. P-value combinations with or without threshold and rank truncation are provided. The significance of a set-based association test is evaluated by using resampling procedures. Performance of the set-based association tests in OPATs has been evaluated by simulation studies and real data analyses. These set-based association tests help boost the statistical power, alleviate the multiple-testing problem, reduce the impact of genetic heterogeneity, increase the replication efficiency of association tests and facilitate the interpretation of association signals by streamlining the testing procedures and integrating the genetic effects of multiple variants in genomic regions of biological relevance. In summary, P-value combinations facilitate the identification of marker sets associated with disease susceptibility and uncover missing heritability in association studies, thereby establishing a foundation for the genetic dissection of complex diseases and traits. OPATs provides an easy-to-use and statistically powerful analysis tool for P-value combinations. OPATs, examples, and user guide can be downloaded from http://www.stat.sinica.edu.tw/hsinchou/genetics/association/OPATs.htm.
Collapse
Affiliation(s)
| | - Hsin-Chou Yang
- Institute of Statistical Science, Academia Sinica
- Corresponding author: Hsin-Chou Yang, Institute of Statistical Science, Academia Sinica, No 128, Academia Road, Section 2, Nankang, Taipei 115, Taiwan. Tel.: 886-2-27835611 ext. 113; Fax: 886-2-27831523; E-mail:
| |
Collapse
|
17
|
Improving the detection of pathways in genome-wide association studies by combined effects of SNPs from Linkage Disequilibrium blocks. Sci Rep 2017; 7:3512. [PMID: 28615668 PMCID: PMC5471232 DOI: 10.1038/s41598-017-03826-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2016] [Accepted: 05/05/2017] [Indexed: 01/31/2023] Open
Abstract
Genome-wide association studies (GWAS) have successfully identified single variants associated with diseases. To increase the power of GWAS, gene-based and pathway-based tests are commonly employed to detect more risk factors. However, the gene- and pathway-based association tests may be biased towards genes or pathways containing a large number of single-nucleotide polymorphisms (SNPs) with small P-values caused by high linkage disequilibrium (LD) correlations. To address such bias, numerous pathway-based methods have been developed. Here we propose a novel method, DGAT-path, to divide all SNPs assigned to genes in each pathway into LD blocks, and to sum the chi-square statistics of LD blocks for assessing the significance of the pathway by permutation tests. The method was proven robust with the type I error rate >1.6 times lower than other methods. Meanwhile, the method displays a higher power and is not biased by the pathway size. The applications to the GWAS summary statistics for schizophrenia and breast cancer indicate that the detected top pathways contain more genes close to associated SNPs than other methods. As a result, the method identified 17 and 12 significant pathways containing 20 and 21 novel associated genes, respectively for two diseases. The method is available online by http://sparks-lab.org/server/DGAT-path.
Collapse
|
18
|
Lee S, Park Y, Kim S. MIDAS: Mining differentially activated subpaths of KEGG pathways from multi-class RNA-seq data. Methods 2017; 124:13-24. [PMID: 28579402 DOI: 10.1016/j.ymeth.2017.05.026] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2017] [Accepted: 05/30/2017] [Indexed: 11/18/2022] Open
Abstract
Pathway based analysis of high throughput transcriptome data is a widely used approach to investigate biological mechanisms. Since a pathway consists of multiple functions, the recent approach is to determine condition specific sub-pathways or subpaths. However, there are several challenges. First, few existing methods utilize explicit gene expression information from RNA-seq. More importantly, subpath activity is usually an average of statistical scores, e.g., correlations, of edges in a candidate subpath, which fails to reflect gene expression quantity information. In addition, none of existing methods can handle multiple phenotypes. To address these technical problems, we designed and implemented an algorithm, MIDAS, that determines condition specific subpaths, each of which has different activities across multiple phenotypes. MIDAS utilizes gene expression quantity information fully and the network centrality information to determine condition specific subpaths. To test performance of our tool, we used TCGA breast cancer RNA-seq gene expression profiles with five molecular subtypes. 36 differentially activate subpaths were determined. The utility of our method, MIDAS, was demonstrated in four ways. All 36 subpaths are well supported by the literature information. Subsequently, we showed that these subpaths had a good discriminant power for five cancer subtype classification and also had a prognostic power in terms of survival analysis. Finally, in a performance comparison of MIDAS to a recent subpath prediction method, PATHOME, our method identified more subpaths and much more genes that are well supported by the literature information. AVAILABILITY http://biohealth.snu.ac.kr/software/MIDAS/.
Collapse
Affiliation(s)
- Sangseon Lee
- Department of Computer Science and Engineering, Seoul National University, Seoul, Republic of Korea
| | - Youngjune Park
- Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea
| | - Sun Kim
- Department of Computer Science and Engineering, Seoul National University, Seoul, Republic of Korea; Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea; Bioinformatics Institute, Seoul National University, Seoul, Republic of Korea.
| |
Collapse
|
19
|
Rosenberger A, Sohns M, Friedrichs S, Hung RJ, Fehringer G, McLaughlin J, Amos CI, Brennan P, Risch A, Brüske I, Caporaso NE, Landi MT, Christiani DC, Wei Y, Bickeböller H. Gene-set meta-analysis of lung cancer identifies pathway related to systemic lupus erythematosus. PLoS One 2017; 12:e0173339. [PMID: 28273134 PMCID: PMC5342225 DOI: 10.1371/journal.pone.0173339] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Accepted: 02/20/2017] [Indexed: 02/03/2023] Open
Abstract
INTRODUCTION Gene-set analysis (GSA) is an approach using the results of single-marker genome-wide association studies when investigating pathways as a whole with respect to the genetic basis of a disease. METHODS We performed a meta-analysis of seven GSAs for lung cancer, applying the method META-GSA. Overall, the information taken from 11,365 cases and 22,505 controls from within the TRICL/ILCCO consortia was used to investigate a total of 234 pathways from the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. RESULTS META-GSA reveals the systemic lupus erythematosus KEGG pathway hsa05322, driven by the gene region 6p21-22, as also implicated in lung cancer (p = 0.0306). This gene region is known to be associated with squamous cell lung carcinoma. The most important genes driving the significance of this pathway belong to the genomic areas HIST1-H4L, -1BN, -2BN, -H2AK, -H4K and C2/C4A/C4B. Within these areas, the markers most significantly associated with LC are rs13194781 (located within HIST12BN) and rs1270942 (located between C2 and C4A). CONCLUSIONS We have discovered a pathway currently marked as specific to systemic lupus erythematosus as being significantly implicated in lung cancer. The gene region 6p21-22 in this pathway appears to be more extensively associated with lung cancer than previously assumed. Given wide-stretched linkage disequilibrium to the area APOM/BAG6/MSH5, there is currently simply not enough information or evidence to conclude whether the potential pleiotropy of lung cancer and systemic lupus erythematosus is spurious, biological, or mediated. Further research into this pathway and gene region will be necessary.
Collapse
Affiliation(s)
- Albert Rosenberger
- Department of Genetic Epidemiology, University Medical Center, Georg-August-University Göttingen, Göttingen, Germany
| | - Melanie Sohns
- Department of Genetic Epidemiology, University Medical Center, Georg-August-University Göttingen, Göttingen, Germany
| | - Stefanie Friedrichs
- Department of Genetic Epidemiology, University Medical Center, Georg-August-University Göttingen, Göttingen, Germany
| | - Rayjean J. Hung
- Lunenfeld-Tanenbaum Research Institute of Mount Sinai Hospital, Toronto, Canada
- Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
| | - Gord Fehringer
- Lunenfeld-Tanenbaum Research Institute of Mount Sinai Hospital, Toronto, Canada
| | | | - Christopher I. Amos
- Department of Biomedical Data Science, Geisel School of Medicine at Dartmouth, Hanover, New Hampshire, United States of America
| | - Paul Brennan
- International Agency for Research on Cancer, Lyon, France
| | - Angela Risch
- Division of Molecular Biology, University Salzburg, Salzburg, Austria
| | - Irene Brüske
- Institute of Epidemiology I, Helmholtz Center Munich, Munich, Germany
| | - Neil E. Caporaso
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, Maryland, United States of America
| | - Maria Teresa Landi
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, Maryland, United States of America
| | - David C. Christiani
- Harvard University School of Public Health, Boston, Massachusetts, United States of America
| | - Yongyue Wei
- Harvard University School of Public Health, Boston, Massachusetts, United States of America
| | - Heike Bickeböller
- Department of Genetic Epidemiology, University Medical Center, Georg-August-University Göttingen, Göttingen, Germany
| |
Collapse
|
20
|
Lee J, Jo K, Lee S, Kang J, Kim S. Prioritizing biological pathways by recognizing context in time-series gene expression data. BMC Bioinformatics 2016; 17:477. [PMID: 28155707 PMCID: PMC5259824 DOI: 10.1186/s12859-016-1335-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Background The primary goal of pathway analysis using transcriptome data is to find significantly perturbed pathways. However, pathway analysis is not always successful in identifying pathways that are truly relevant to the context under study. A major reason for this difficulty is that a single gene is involved in multiple pathways. In the KEGG pathway database, there are 146 genes, each of which is involved in more than 20 pathways. Thus activation of even a single gene will result in activation of many pathways. This complex relationship often makes the pathway analysis very difficult. While we need much more powerful pathway analysis methods, a readily available alternative way is to incorporate the literature information. Results In this study, we propose a novel approach for prioritizing pathways by combining results from both pathway analysis tools and literature information. The basic idea is as follows. Whenever there are enough articles that provide evidence on which pathways are relevant to the context, we can be assured that the pathways are indeed related to the context, which is termed as relevance in this paper. However, if there are few or no articles reported, then we should rely on the results from the pathway analysis tools, which is termed as significance in this paper. We realized this concept as an algorithm by introducing Context Score and Impact Score and then combining the two into a single score. Our method ranked truly relevant pathways significantly higher than existing pathway analysis tools in experiments with two data sets. Conclusions Our novel framework was implemented as ContextTRAP by utilizing two existing tools, TRAP and BEST. ContextTRAP will be a useful tool for the pathway based analysis of gene expression data since the user can specify the context of the biological experiment in a set of keywords. The web version of ContextTRAP is available at http://biohealth.snu.ac.kr/software/contextTRAP. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1335-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jusang Lee
- Department of Computer Science and Engineering, Seoul National University, Seoul, Republic of Korea
| | - Kyuri Jo
- Department of Computer Science and Engineering, Seoul National University, Seoul, Republic of Korea
| | - Sunwon Lee
- Department of Computer Science and Engineering, Korea University, Seoul, Republic of Korea
| | - Jaewoo Kang
- Department of Computer Science and Engineering, Korea University, Seoul, Republic of Korea
| | - Sun Kim
- Department of Computer Science and Engineering, Seoul National University, Seoul, Republic of Korea. .,Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, Republic of Korea. .,Bioinformatics Institute, Seoul National University, Seoul, Republic of Korea.
| |
Collapse
|
21
|
Yoo YJ, Sun L, Poirier JG, Paterson AD, Bull SB. Multiple linear combination (MLC) regression tests for common variants adapted to linkage disequilibrium structure. Genet Epidemiol 2016; 41:108-121. [PMID: 27885705 PMCID: PMC5245123 DOI: 10.1002/gepi.22024] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2016] [Revised: 05/25/2016] [Accepted: 09/27/2016] [Indexed: 11/21/2022]
Abstract
By jointly analyzing multiple variants within a gene, instead of one at a time, gene‐based multiple regression can improve power, robustness, and interpretation in genetic association analysis. We investigate multiple linear combination (MLC) test statistics for analysis of common variants under realistic trait models with linkage disequilibrium (LD) based on HapMap Asian haplotypes. MLC is a directional test that exploits LD structure in a gene to construct clusters of closely correlated variants recoded such that the majority of pairwise correlations are positive. It combines variant effects within the same cluster linearly, and aggregates cluster‐specific effects in a quadratic sum of squares and cross‐products, producing a test statistic with reduced degrees of freedom (df) equal to the number of clusters. By simulation studies of 1000 genes from across the genome, we demonstrate that MLC is a well‐powered and robust choice among existing methods across a broad range of gene structures. Compared to minimum P‐value, variance‐component, and principal‐component methods, the mean power of MLC is never much lower than that of other methods, and can be higher, particularly with multiple causal variants. Moreover, the variation in gene‐specific MLC test size and power across 1000 genes is less than that of other methods, suggesting it is a complementary approach for discovery in genome‐wide analysis. The cluster construction of the MLC test statistics helps reveal within‐gene LD structure, allowing interpretation of clustered variants as haplotypic effects, while multiple regression helps to distinguish direct and indirect associations.
Collapse
Affiliation(s)
- Yun Joo Yoo
- Department of Mathematics Education, Seoul National University, Seoul, South Korea.,Interdisciplinary Program in Bioinformatics, Seoul National University, Seoul, South Korea
| | - Lei Sun
- Department of Statistical Sciences, University of Toronto, Toronto, Canada.,Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, Canada
| | - Julia G Poirier
- Prosserman Centre for Health Research, Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, Canada
| | - Andrew D Paterson
- Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, Canada.,Program in Genetics and Genome Biology, Hospital for Sick Children Research Institute, Toronto, Canada
| | - Shelley B Bull
- Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, Canada.,Prosserman Centre for Health Research, Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, Canada
| |
Collapse
|
22
|
Chen S, Nunez S, Reilly MP, Foulkes AS. Bayesian variable selection for post-analytic interrogation of susceptibility loci. Biometrics 2016; 73:603-614. [PMID: 27858978 DOI: 10.1111/biom.12620] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2015] [Revised: 09/01/2016] [Accepted: 09/01/2016] [Indexed: 11/26/2022]
Abstract
Understanding the complex interplay among protein coding genes and regulatory elements requires rigorous interrogation with analytic tools designed for discerning the relative contributions of overlapping genomic regions. To this aim, we offer a novel application of Bayesian variable selection (BVS) for classifying genomic class level associations using existing large meta-analysis summary level resources. This approach is applied using the expectation maximization variable selection (EMVS) algorithm to typed and imputed SNPs across 502 protein coding genes (PCGs) and 220 long intergenic non-coding RNAs (lncRNAs) that overlap 45 known loci for coronary artery disease (CAD) using publicly available Global Lipids Gentics Consortium (GLGC) (Teslovich et al., 2010; Willer et al., 2013) meta-analysis summary statistics for low-density lipoprotein cholesterol (LDL-C). The analysis reveals 33 PCGs and three lncRNAs across 11 loci with >50% posterior probabilities for inclusion in an additive model of association. The findings are consistent with previous reports, while providing some new insight into the architecture of LDL-cholesterol to be investigated further. As genomic taxonomies continue to evolve, additional classes such as enhancer elements and splicing regions, can easily be layered into the proposed analysis framework. Moreover, application of this approach to alternative publicly available meta-analysis resources, or more generally as a post-analytic strategy to further interrogate regions that are identified through single point analysis, is straightforward. All coding examples are implemented in R version 3.2.1 and provided as supplemental material.
Collapse
Affiliation(s)
- Siying Chen
- Department of Mathematics and Statistics, Mount Holyoke College, South Hadley, Massachusetts, U.S.A
| | - Sara Nunez
- Department of Mathematics and Statistics, Mount Holyoke College, South Hadley, Massachusetts, U.S.A
| | - Muredach P Reilly
- Department of Medicine, Division of Cardiology, and the Irving Institute for Clinical and Translational Research at Columbia University, New York City, New York, U.S.A
| | - Andrea S Foulkes
- Department of Mathematics and Statistics, Mount Holyoke College, South Hadley, Massachusetts, U.S.A
| |
Collapse
|
23
|
Zhang M, Mu H, Shang Z, Kang K, Lv H, Duan L, Li J, Chen X, Teng Y, Jiang Y, Zhang R. Genome-wide pathway-based association analysis identifies risk pathways associated with Parkinson's disease. Neuroscience 2016; 340:398-410. [PMID: 27840232 DOI: 10.1016/j.neuroscience.2016.11.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2016] [Revised: 11/03/2016] [Accepted: 11/03/2016] [Indexed: 01/02/2023]
Abstract
Parkinson's disease (PD) is the second most common neurodegenerative disease. It is generally believed that it is influenced by both genetic and environmental factors, but the precise pathogenesis of PD is unknown to date. In this study, we performed a pathway analysis based on genome-wide association study (GWAS) to detect risk pathways of PD in three GWAS datasets. We first mapped all SNP markers to autosomal genes in each GWAS dataset. Then, we evaluated gene risk values using the minimum P-value of the tagSNPs. We took a pathway as a unit to identify the risk pathways based on the cumulative risks of the genes in the pathway. Finally, we combine the analysis results of the three datasets to detect the high risk pathways associated with PD. We found there were five same pathways in the three datasets. Besides, we also found there were five pathways which were shared in two datasets. Most of these pathways are associated with nervoussystem. Five pathways had been reported to be PD-related pathways in the previous literature. Our findings also implied that there was a close association between immune response and PD. Continued investigation of these pathways will further help us explain the pathogenesis of PD.
Collapse
Affiliation(s)
- Mingming Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Hongbo Mu
- College of Science, Northeast Forestry University, Harbin, China
| | - Zhenwei Shang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Kai Kang
- Department of Cardiovascular Surgery, The Second Affiliated Hospital of Harbin Medical University, China
| | - Hongchao Lv
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Lian Duan
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Jin Li
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Xinren Chen
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Yanbo Teng
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Yongshuai Jiang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China.
| | - Ruijie Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China.
| |
Collapse
|
24
|
Embracing Integrative Multiomics Approaches. Int J Genomics 2016; 2016:1715985. [PMID: 27689071 PMCID: PMC5027047 DOI: 10.1155/2016/1715985] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2016] [Accepted: 06/22/2016] [Indexed: 11/26/2022] Open
Abstract
As “-omics” data technology advances and becomes more readily accessible to address complex biological questions, increasing amount of cross “-omics” dataset is inspiring the use and development of integrative bioinformatics analysis. In the current review, we discuss multiple options for integrating data across “-omes” for a range of study designs. We discuss established methods for such analysis and point the reader to in-depth discussions for the various topics. Additionally, we discuss challenges and new directions in the area.
Collapse
|
25
|
Brodie A, Azaria JR, Ofran Y. How far from the SNP may the causative genes be? Nucleic Acids Res 2016; 44:6046-54. [PMID: 27269582 PMCID: PMC5291268 DOI: 10.1093/nar/gkw500] [Citation(s) in RCA: 109] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2015] [Revised: 05/20/2016] [Accepted: 05/22/2016] [Indexed: 02/03/2023] Open
Abstract
While GWAS identify many disease-associated SNPs, using them to decipher disease mechanisms is hindered by the difficulty in mapping SNPs to genes. Most SNPs are in non-coding regions and it is often hard to identify the genes they implicate. To explore how far the SNP may be from the affected genes we used a pathway-based approach. We found that affected genes are often up to 2 Mbps away from the associated SNP, and are not necessarily the closest genes to the SNP. Existing approaches for mapping SNPs to genes leave many SNPs unmapped to genes and reveal only 86 significant phenotype-pathway associations for all known GWAS hits combined. Using the pathway-based approach we propose here allows mapping of virtually all SNPs to genes and reveals 435 statistically significant phenotype-pathway associations. In search for mechanisms that may explain the relationships between SNPs and distant genes, we found that SNPs that are mapped to distant genes have significantly more large insertions/deletions around them than other SNPs, suggesting that these SNPs may sometimes be markers for large insertions/deletions that may affect large genomic regions.
Collapse
Affiliation(s)
- Aharon Brodie
- The Goodman faculty of life sciences, Nanotechnology building, Bar Ilan University, Ramat Gan 52900, Israel
| | - Johnathan Roy Azaria
- The Goodman faculty of life sciences, Nanotechnology building, Bar Ilan University, Ramat Gan 52900, Israel
| | - Yanay Ofran
- The Goodman faculty of life sciences, Nanotechnology building, Bar Ilan University, Ramat Gan 52900, Israel
| |
Collapse
|
26
|
Chen M, Rothman N, Ye Y, Gu J, Scheet PA, Huang M, Chang DW, Dinney CP, Silverman DT, Figueroa JD, Chanock SJ, Wu X. Pathway analysis of bladder cancer genome-wide association study identifies novel pathways involved in bladder cancer development. Genes Cancer 2016; 7:229-239. [PMID: 27738493 PMCID: PMC5059113 DOI: 10.18632/genesandcancer.113] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2016] [Accepted: 07/28/2016] [Indexed: 11/25/2022] Open
Abstract
Genome-wide association studies (GWAS) are designed to identify individual regions associated with cancer risk, but only explain a small fraction of the inherited variability. Alternative approach analyzing genetic variants within biological pathways has been proposed to discover networks of susceptibility genes with additional effects. The gene set enrichment analysis (GSEA) may complement and expand traditional GWAS analysis to identify novel genes and pathways associated with bladder cancer risk. We selected three GSEA methods: Gen-Gen, Aligator, and the SNP Ratio Test to evaluate cellular signaling pathways involved in bladder cancer susceptibility in a Texas GWAS population. The candidate genetic polymorphisms from the significant pathway selected by GSEA were validated in an independent NCI GWAS. We identified 18 novel pathways (P < 0.05) significantly associated with bladder cancer risk. Five of the most promising pathways (P ≤ 0.001 in any of the three GSEA methods) among the 18 pathways included two cell cycle pathways and neural cell adhesion molecule (NCAM), platelet-derived growth factor (PDGF), and unfolded protein response pathways. We validated the candidate polymorphisms in the NCI GWAS and found variants of RAPGEF1, SKP1, HERPUD1, CACNB2, CACNA1C, CACNA1S, COL4A2, SRC, and CACNA1C were associated with bladder cancer risk. Two CCNE1 variants, rs8102137 and rs997669, from cell cycle pathways showed the strongest associations; the CCNE1 signal at 19q12 has already been reported in previous GWAS. These findings offer additional etiologic insights highlighting the specific genes and pathways associated with bladder cancer development. GSEA may be a complementary tool to GWAS to identify additional loci of cancer susceptibility.
Collapse
Affiliation(s)
- Meng Chen
- Department of Epidemiology, The University of Texas M. D. Anderson Cancer Center, Houston, TX, USA
| | - Nathaniel Rothman
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Yuanqing Ye
- Department of Epidemiology, The University of Texas M. D. Anderson Cancer Center, Houston, TX, USA
| | - Jian Gu
- Department of Epidemiology, The University of Texas M. D. Anderson Cancer Center, Houston, TX, USA
| | - Paul A Scheet
- Department of Epidemiology, The University of Texas M. D. Anderson Cancer Center, Houston, TX, USA
| | - Maosheng Huang
- Department of Epidemiology, The University of Texas M. D. Anderson Cancer Center, Houston, TX, USA
| | - David W Chang
- Department of Epidemiology, The University of Texas M. D. Anderson Cancer Center, Houston, TX, USA
| | - Colin P Dinney
- Department of Urology, The University of Texas M. D. Anderson Cancer Center, Houston, TX, USA
| | - Debra T Silverman
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Jonine D Figueroa
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Stephen J Chanock
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Xifeng Wu
- Department of Epidemiology, The University of Texas M. D. Anderson Cancer Center, Houston, TX, USA
| |
Collapse
|
27
|
Ballantyne RL, Zhang X, Nuñez S, Xue C, Zhao W, Reed E, Salaheen D, Foulkes AS, Li M, Reilly MP. Genome-wide interrogation reveals hundreds of long intergenic noncoding RNAs that associate with cardiometabolic traits. Hum Mol Genet 2016; 25:3125-3141. [PMID: 27288454 DOI: 10.1093/hmg/ddw154] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2015] [Revised: 04/26/2016] [Accepted: 05/18/2016] [Indexed: 02/07/2023] Open
Abstract
Long intergenic noncoding RNAs (lincRNAs) play important roles in disease, but the vast majority of these transcripts remain uncharacterized. We defined a set of 54 944 human lincRNAs by drawing on four publicly available lincRNA datasets, and annotated ∼2.5 million single nucleotide polymorphisms (SNPs) from each of 15 cardiometabolic genome-wide association study datasets into these lincRNAs. We identified hundreds of lincRNAs with at least one trait-associated SNP: 898 SNPs in 343 unique lincRNAs at 5% false discovery rate, and 469 SNPs in 146 unique lincRNAs meeting Bonferroni-corrected P < 0.05. An additional 64 trait-associated lincRNAs were identified using a class-level testing strategy at Bonferroni-corrected P < 0.05. To better understand the genomic context and prioritize trait-associated lincRNAs, we examined the pattern of linkage disequilibrium between SNPs in the lincRNAs and SNPs that met genome-wide-significance in the region (±500 kb of lincRNAs). A subset of the lincRNA-trait association findings was replicated in independent Genome-wide association studies data from the Pakistan Risk of Myocardial Infarction Study study. For trait-associated lincRNAs, we also investigated synteny and conservation relative to mouse, expression patterns in five cardiometabolic-relevant tissues, and allele-specific expression in RNA sequencing data for adipose tissue and leukocytes. Finally, we revealed a functional role in human adipocytes for linc-NFE2L3-1, which is expressed in adipose and is associated with waist-hip ratio adjusted for BMI. This comprehensive profile of trait-associated lincRNAs provides novel insights into disease mechanism and serves as a launching point for interrogation of the biology of specific lincRNAs in cardiometabolic disease.
Collapse
Affiliation(s)
| | - Xuan Zhang
- Department of Medicine, Division of Cardiology, Columbia University Medical Center, New York, NY 10032, USA
| | - Sara Nuñez
- Department of Mathematics and Statistics, Mount Holyoke College, MA 01075, USA
| | - Chenyi Xue
- Department of Medicine, Division of Cardiology, Columbia University Medical Center, New York, NY 10032, USA
| | - Wei Zhao
- Division of Translational Medicine and Human Genetics
| | - Eric Reed
- Department of Mathematics and Statistics, Mount Holyoke College, MA 01075, USA
| | - Danish Salaheen
- Department of Biostatistics and Epidemiology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
| | - Andrea S Foulkes
- Department of Mathematics and Statistics, Mount Holyoke College, MA 01075, USA
| | - Mingyao Li
- Department of Biostatistics and Epidemiology, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
| | - Muredach P Reilly
- Department of Medicine, Division of Cardiology, Columbia University Medical Center, New York, NY 10032, USA
| |
Collapse
|
28
|
A Simple Test of Class-Level Genetic Association Can Reveal Novel Cardiometabolic Trait Loci. PLoS One 2016; 11:e0148218. [PMID: 26859766 PMCID: PMC4747495 DOI: 10.1371/journal.pone.0148218] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2015] [Accepted: 01/14/2016] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Characterizing the genetic determinants of complex diseases can be further augmented by incorporating knowledge of underlying structure or classifications of the genome, such as newly developed mappings of protein-coding genes, epigenetic marks, enhancer elements and non-coding RNAs. METHODS We apply a simple class-level testing framework, termed Genetic Class Association Testing (GenCAT), to identify protein-coding gene association with 14 cardiometabolic (CMD) related traits across 6 publicly available genome wide association (GWA) meta-analysis data resources. GenCAT uses SNP-level meta-analysis test statistics across all SNPs within a class of elements, as well as the size of the class and its unique correlation structure, to determine if the class is statistically meaningful. The novelty of findings is evaluated through investigation of regional signals. A subset of findings are validated using recently updated, larger meta-analysis resources. A simulation study is presented to characterize overall performance with respect to power, control of family-wise error and computational efficiency. All analysis is performed using the GenCAT package, R version 3.2.1. RESULTS We demonstrate that class-level testing complements the common first stage minP approach that involves individual SNP-level testing followed by post-hoc ascribing of statistically significant SNPs to genes and loci. GenCAT suggests 54 protein-coding genes at 41 distinct loci for the 13 CMD traits investigated in the discovery analysis, that are beyond the discoveries of minP alone. An additional application to biological pathways demonstrates flexibility in defining genetic classes. CONCLUSIONS We conclude that it would be prudent to include class-level testing as standard practice in GWA analysis. GenCAT, for example, can be used as a simple, complementary and efficient strategy for class-level testing that leverages existing data resources, requires only summary level data in the form of test statistics, and adds significant value with respect to its potential for identifying multiple novel and clinically relevant trait associations.
Collapse
|
29
|
Kulminski AM, Arbeev KG, Culminskaya I, Ukraintseva SV, Stallard E, Province MA, Yashin AI. Trade-offs in the effects of the apolipoprotein E polymorphism on risks of diseases of the heart, cancer, and neurodegenerative disorders: insights on mechanisms from the Long Life Family Study. Rejuvenation Res 2016; 18:128-35. [PMID: 25482294 DOI: 10.1089/rej.2014.1616] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The lack of evolutionary established mechanisms linking genes to age-related traits makes the problem of genetic susceptibility to health span inherently complex. One complicating factor is genetic trade-off. Here we focused on long-living participants of the Long Life Family Study (LLFS), their offspring, and spouses to: (1) Elucidate whether trade-offs in the effect of the apolipoprotein E e4 allele documented in the Framingham Heart Study (FHS) are a more general phenomenon, and (2) explore potential mechanisms generating age- and gender-specific trade-offs in the effect of the e4 allele on cancer, diseases of the heart, and neurodegenerative disorders assessed retrospectively in the LLFS populations. The e4 allele can diminish risks of cancer and diseases of the heart and confer risks of diseases of the heart in a sex-, age-, and LLFS-population-specific manner. A protective effect against cancer is seen in older long-living men and, potentially, their sons (>75 years, relative risk [RR]>75=0.48, p=0.086), which resembles our findings in the FHS. The protective effect against diseases of the heart is limited to long-living older men (RR>76=0.50, p=0.016), as well. A detrimental effect against diseases of the heart is characteristic for a normal LLFS population of male spouses and is specific for myocardial infarction (RR=3.07, p=2.1×10(-3)). These trade-offs are likely associated with two inherently different mechanisms, including disease-specific (detrimental; characteristic for a normal male population) and systemic, aging-related (protective; characteristic for older long-living men) mechanisms. The e4 allele confers risks of neurological disorders in men and women (RR=1.98, p=0.046). The results highlight the complex role of the e4 allele in genetic susceptibility to health span.
Collapse
|
30
|
Rosenberger A, Friedrichs S, Amos CI, Brennan P, Fehringer G, Heinrich J, Hung RJ, Muley T, Müller-Nurasyid M, Risch A, Bickeböller H. META-GSA: Combining Findings from Gene-Set Analyses across Several Genome-Wide Association Studies. PLoS One 2015; 10:e0140179. [PMID: 26501144 PMCID: PMC4621033 DOI: 10.1371/journal.pone.0140179] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2015] [Accepted: 09/21/2015] [Indexed: 01/31/2023] Open
Abstract
INTRODUCTION Gene-set analysis (GSA) methods are used as complementary approaches to genome-wide association studies (GWASs). The single marker association estimates of a predefined set of genes are either contrasted with those of all remaining genes or with a null non-associated background. To pool the p-values from several GSAs, it is important to take into account the concordance of the observed patterns resulting from single marker association point estimates across any given gene set. Here we propose an enhanced version of Fisher's inverse χ2-method META-GSA, however weighting each study to account for imperfect correlation between association patterns. SIMULATION AND POWER We investigated the performance of META-GSA by simulating GWASs with 500 cases and 500 controls at 100 diallelic markers in 20 different scenarios, simulating different relative risks between 1 and 1.5 in gene sets of 10 genes. Wilcoxon's rank sum test was applied as GSA for each study. We found that META-GSA has greater power to discover truly associated gene sets than simple pooling of the p-values, by e.g. 59% versus 37%, when the true relative risk for 5 of 10 genes was assume to be 1.5. Under the null hypothesis of no difference in the true association pattern between the gene set of interest and the set of remaining genes, the results of both approaches are almost uncorrelated. We recommend not relying on p-values alone when combining the results of independent GSAs. APPLICATION We applied META-GSA to pool the results of four case-control GWASs of lung cancer risk (Central European Study and Toronto/Lunenfeld-Tanenbaum Research Institute Study; German Lung Cancer Study and MD Anderson Cancer Center Study), which had already been analyzed separately with four different GSA methods (EASE; SLAT, mSUMSTAT and GenGen). This application revealed the pathway GO0015291 "transmembrane transporter activity" as significantly enriched with associated genes (GSA-method: EASE, p = 0.0315 corrected for multiple testing). Similar results were found for GO0015464 "acetylcholine receptor activity" but only when not corrected for multiple testing (all GSA-methods applied; p ≈ 0.02).
Collapse
Affiliation(s)
- Albert Rosenberger
- Department of Genetic Epidemiology, University Medical Center, Georg-August University Göttingen, Göttingen, Germany
| | - Stefanie Friedrichs
- Department of Genetic Epidemiology, University Medical Center, Georg-August University Göttingen, Göttingen, Germany
| | - Christopher I. Amos
- Geisel School of Medicine, Dartmouth College, Lebanon, NH, United States of America
| | - Paul Brennan
- International Agency for Research on Cancer (IARC), Lyon, France
| | - Gordon Fehringer
- Prosserman Centre for Health Research, Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Ontario, Canada
| | - Joachim Heinrich
- Institute of Epidemiology I, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Rayjean J. Hung
- Prosserman Centre for Health Research, Samuel Lunenfeld Research Institute, Mount Sinai Hospital, Toronto, Ontario, Canada
| | - Thomas Muley
- Translational Lung Research Center Heidelberg (TLRC-H), Member of the German Center for Lung Research (DZL), Heidelberg, Germany
- Thoraxklinik at University of Heidelberg, Heidelberg, Germany
| | - Martina Müller-Nurasyid
- Department of Medicine I, Ludwig-Maximilians-University Munich, Munich, Germany
- Institute of Medical Informatics, Biometry and Epidemiology, Chair of Genetic Epidemiology, Ludwig-Maximilians-University, Munich, Germany
- Institute of Genetic Epidemiology, Helmholtz Zentrum München—German Research Center for Environmental Health, Neuherberg, Germany
- DZHK (German Centre for Cardiovascular Research), partner site Munich Heart Alliance, Munich, Germany
| | - Angela Risch
- Translational Lung Research Center Heidelberg (TLRC-H), Member of the German Center for Lung Research (DZL), Heidelberg, Germany
- Division of Epigenomics and Cancer Risk Factors, German Cancer Research Center, Heidelberg, Germany
- Division of Molecular Biology, University Salzburg, Salzburg, Austria
| | - Heike Bickeböller
- Department of Genetic Epidemiology, University Medical Center, Georg-August University Göttingen, Göttingen, Germany
| |
Collapse
|
31
|
Mooney MA, Wilmot B. Gene set analysis: A step-by-step guide. Am J Med Genet B Neuropsychiatr Genet 2015; 168:517-27. [PMID: 26059482 PMCID: PMC4638147 DOI: 10.1002/ajmg.b.32328] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/16/2015] [Accepted: 05/20/2015] [Indexed: 12/21/2022]
Abstract
To maximize the potential of genome-wide association studies, many researchers are performing secondary analyses to identify sets of genes jointly associated with the trait of interest. Although methods for gene-set analyses (GSA), also called pathway analyses, have been around for more than a decade, the field is still evolving. There are numerous algorithms available for testing the cumulative effect of multiple SNPs, yet no real consensus in the field about the best way to perform a GSA. This paper provides an overview of the factors that can affect the results of a GSA, the lessons learned from past studies, and suggestions for how to make analysis choices that are most appropriate for different types of data. © 2015 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Michael A. Mooney
- Department of Medical Informatics & Clinical Epidemiology, Division of Bioinformatics & Computational Biology, Oregon Health & Science University, Portland, Oregon,OHSU Knight Cancer Institute, Portland, Oregon
| | - Beth Wilmot
- Department of Medical Informatics & Clinical Epidemiology, Division of Bioinformatics & Computational Biology, Oregon Health & Science University, Portland, Oregon,OHSU Knight Cancer Institute, Portland, Oregon,Oregon Clinical and Translational Research Institute, Portland, Oregon,Correspondence to: Beth Wilmot, Department of Medical Informatics & Clinical Epidemiology, Division of Bioinformatics & Computational Biology, Oregon Health & Science University, Portland, OR 97239.
| |
Collapse
|
32
|
Kertai MD, Li YJ, Li YW, Ji Y, Alexander J, Newman MF, Smith PK, Joseph D, Mathew JP, Podgoreanu MV. Genome-wide association study of perioperative myocardial infarction after coronary artery bypass surgery. BMJ Open 2015; 5:e006920. [PMID: 25948407 PMCID: PMC4431169 DOI: 10.1136/bmjopen-2014-006920] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
OBJECTIVES Identification of patient subpopulations susceptible to develop myocardial infarction (MI) or, conversely, those displaying either intrinsic cardioprotective phenotypes or highly responsive to protective interventions remain high-priority knowledge gaps. We sought to identify novel common genetic variants associated with perioperative MI in patients undergoing coronary artery bypass grafting using genome-wide association methodology. SETTING 107 secondary and tertiary cardiac surgery centres across the USA. PARTICIPANTS We conducted a stage I genome-wide association study (GWAS) in 1433 ethnically diverse patients of both genders (112 cases/1321 controls) from the Genetics of Myocardial Adverse Outcomes and Graft Failure (GeneMAGIC) study, and a stage II analysis in an expanded population of 2055 patients (225 cases/1830 controls) combined from the GeneMAGIC and Duke Perioperative Genetics and Safety Outcomes (PEGASUS) studies. Patients undergoing primary non-emergent coronary bypass grafting were included. PRIMARY AND SECONDARY OUTCOME MEASURES The primary outcome variable was perioperative MI, defined as creatine kinase MB isoenzyme (CK-MB) values ≥10× upper limit of normal during the first postoperative day, and not attributable to preoperative MI. Secondary outcomes included postoperative CK-MB as a quantitative trait, or a dichotomised phenotype based on extreme quartiles of the CK-MB distribution. RESULTS Following quality control and adjustment for clinical covariates, we identified 521 single nucleotide polymorphisms in the stage I GWAS analysis. Among these, 8 common variants in 3 genes or intergenic regions met p<10(-5) in stage II. A secondary analysis using CK-MB as a quantitative trait (minimum p=1.26×10(-3) for rs609418), or a dichotomised phenotype based on extreme CK-MB values (minimum p=7.72×10(-6) for rs4834703) supported these findings. Pathway analysis revealed that genes harbouring top-scoring variants cluster in pathways of biological relevance to extracellular matrix remodelling, endoplasmic reticulum-to-Golgi transport and inflammation. CONCLUSIONS Using a two-stage GWAS and pathway analysis, we identified and prioritised several potential susceptibility loci for perioperative MI.
Collapse
Affiliation(s)
- Miklos D Kertai
- Division of Cardiothoracic Anesthesiology, Duke University, Durham, North Carolina, USA
| | - Yi-Ju Li
- Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina, USA
- Duke Molecular Physiology Institute; Duke University, Durham, North Carolina, USA
| | - Yen-Wei Li
- Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina, USA
| | - Yunqi Ji
- Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina, USA
| | - John Alexander
- Division of Cardiology, Duke University, Durham, North Carolina, USA
- Duke Clinical Research Institute; Duke University, Durham, North Carolina, USA
| | - Mark F Newman
- Division of Cardiothoracic Anesthesiology, Duke University, Durham, North Carolina, USA
- Duke Clinical Research Institute; Duke University, Durham, North Carolina, USA
| | - Peter K Smith
- Cardiac Surgery; Duke University, Durham, North Carolina, USA
| | - Diane Joseph
- Duke Clinical Research Institute; Duke University, Durham, North Carolina, USA
| | - Joseph P Mathew
- Division of Cardiothoracic Anesthesiology, Duke University, Durham, North Carolina, USA
| | - Mihai V Podgoreanu
- Division of Cardiothoracic Anesthesiology, Duke University, Durham, North Carolina, USA
- Duke Clinical Research Institute; Duke University, Durham, North Carolina, USA
| |
Collapse
|
33
|
The oxytocin receptor gene (OXTR) is associated with autism spectrum disorder: a meta-analysis. Mol Psychiatry 2015; 20:640-6. [PMID: 25092245 DOI: 10.1038/mp.2014.77] [Citation(s) in RCA: 224] [Impact Index Per Article: 24.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/12/2013] [Revised: 05/08/2014] [Accepted: 06/17/2014] [Indexed: 12/27/2022]
Abstract
The oxytocin receptor gene (OXTR) has been studied as a risk factor for autism spectrum disorder (ASD) owing to converging evidence from multiple levels of analysis that oxytocin (OXT) has an important role in the regulation of affiliative behavior and social bonding in both nonhuman mammals and humans. Inconsistency in the effect sizes of the OXTR variants included in association studies render it unclear whether OXTR is truly associated with ASD, and, if so, which OXTR single-nucleotide polymorphisms (SNPs) are associated. Thus, a meta-analytic review of extant studies is needed to determine whether OXTR shows association with ASD, and to elucidate which specific SNPs have a significant effect on ASD. The current meta-analysis of 16 OXTR SNPs included 3941 individuals with ASD from 11 independent samples, although analyses of each individual SNP included a subset of this total. We found significant associations between ASD and the SNPs rs7632287, rs237887, rs2268491 and rs2254298. OXTR was also significantly associated with ASD in a gene-based test. The current meta-analysis is the largest and most comprehensive investigation of the association of OXTR with ASD and the findings suggest directions for future studies of the etiology of ASD.
Collapse
|
34
|
Pratt AG, Isaacs JD. Genotyping in rheumatoid arthritis: a game changer in clinical management? Expert Rev Clin Immunol 2015; 11:303-5. [PMID: 25644535 DOI: 10.1586/1744666x.2015.1008454] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Rheumatoid arthritis (RA) is a genetically complex disease of immune dysregulation characterized by painful inflammation of synovial joints. Despite advances in its management afforded by biologic drug development, efforts to improve outcomes for patients are confounded by the condition's heterogeneous pathobiology, and consequent variability in therapeutic responses. Great strides have been made in understanding the genetic epidemiology of rheumatoid arthritis since its association with the HLA locus was established in the 1980s, with over 100 additional disease-associated variants now confirmed through cumulative genome-wide association studies. Yet translation of this new knowledge for patient benefit - whether as a route to predicting disease risk, drug development or personalized medicine - has been slow. To address this, collaborating teams of interdisciplinary scientists will need to pool resources, including ever larger, well-characterized patient cohorts and sophisticated biostatistical approaches. Recent advances suggest that the fruits of these endeavors are beginning to come within reach.
Collapse
Affiliation(s)
- Arthur G Pratt
- Institute of Cellular Medicine (Musculoskeletal Research Group), Newcastle University, Newcastle upon Tyne, NE2 4HH, UK
| | | |
Collapse
|
35
|
Ren H, Zhang C, Huang C, Li N, Li M, Li Y, Deng W, Ma X, Xiang B, Wang Q, Li T. Unravelling genes and pathways implicated in working memory of schizophrenia in Han Chinese. Int J Mol Sci 2015; 16:2145-61. [PMID: 25608650 PMCID: PMC4307354 DOI: 10.3390/ijms16012145] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2014] [Revised: 11/05/2014] [Accepted: 01/12/2015] [Indexed: 02/05/2023] Open
Abstract
Working memory deficit is the core neurocognitive disorder in schizophrenia patients. To identify the factors underlying working memory deficit in schizophrenia patients and to explore the implication of possible genes in the working memory using genome-wide association study (GWAS) of schizophrenia, computerized delay-matching-to-sample (DMS) and whole genome genotyping data were obtained from 100 first-episode, treatment-naïve patients with schizophrenia and 140 healthy controls from the Mental Health Centre of the West China Hospital, Sichuan University. A composite score, delay-matching-to-sample total correct numbers (DMS-TC), was found to be significantly different between the patients and control. On associating quantitative DMS-TC with interactive variables of groups × genotype, one SNP (rs1411832), located downstream of YWHAZP5 in chromosome 10, was found to be associated with the working memory deficit in schizophrenia patients with lowest p-value (p = 2.02 × 10(-7)). ConsensusPathDB identified that genes with SNPs for which p values below the threshold of 5 × 10(-5) were significantly enriched in GO:0007155 (cell adhesion, p < 0.001). This study indicates that working memory, as an endophenotype of schizophrenia, could improve the efficacy of GWAS in schizophrenia. However, further study is required to replicate the results from our study.
Collapse
Affiliation(s)
- Hongyan Ren
- Mental Health Center, West China Hospital, Sichuan University, 28 Dian Xin Nan Road, Chengdu 610041, China.
| | - Chengcheng Zhang
- Mental Health Center, West China Hospital, Sichuan University, 28 Dian Xin Nan Road, Chengdu 610041, China.
| | - Chaohua Huang
- Mental Health Center, West China Hospital, Sichuan University, 28 Dian Xin Nan Road, Chengdu 610041, China.
| | - Na Li
- Mental Health Center, West China Hospital, Sichuan University, 28 Dian Xin Nan Road, Chengdu 610041, China.
| | - Mingli Li
- Mental Health Center, West China Hospital, Sichuan University, 28 Dian Xin Nan Road, Chengdu 610041, China.
| | - Yinfei Li
- Mental Health Center, West China Hospital, Sichuan University, 28 Dian Xin Nan Road, Chengdu 610041, China.
| | - Wei Deng
- Mental Health Center, West China Hospital, Sichuan University, 28 Dian Xin Nan Road, Chengdu 610041, China.
| | - Xiaohong Ma
- Mental Health Center, West China Hospital, Sichuan University, 28 Dian Xin Nan Road, Chengdu 610041, China.
| | - Bo Xiang
- State Key Laboratory of Biotherapy, Psychiatric Laboratory, West China Hospital, Sichuan University, 1 Ke Yuan 4 Road, Hi-Tech Developmental Zone, Chengdu 610041, China.
| | - Qiang Wang
- Mental Health Center, West China Hospital, Sichuan University, 28 Dian Xin Nan Road, Chengdu 610041, China.
| | - Tao Li
- Mental Health Center, West China Hospital, Sichuan University, 28 Dian Xin Nan Road, Chengdu 610041, China.
| |
Collapse
|
36
|
Huang T, Hu FB. Gene-environment interactions and obesity: recent developments and future directions. BMC Med Genomics 2015; 8 Suppl 1:S2. [PMID: 25951849 PMCID: PMC4315311 DOI: 10.1186/1755-8794-8-s1-s2] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Obesity, a major public health concern, is a multifactorial disease caused by both environmental and genetic factors. Although recent genome-wide association studies have identified many loci related to obesity or body mass index, the identified variants explain only a small proportion of the heritability of obesity. Better understanding of the interplay between genetic and environmental factors is the basis for developing effective personalized obesity prevention and management strategies. This article reviews recent advances in identifying gene-environment interactions related to obesity and describes epidemiological designs and newly developed statistical approaches to characterizing and discovering gene-environment interactions on obesity risk.
Collapse
|
37
|
Jin L, Zuo XY, Su WY, Zhao XL, Yuan MQ, Han LZ, Zhao X, Chen YD, Rao SQ. Pathway-based analysis tools for complex diseases: a review. GENOMICS PROTEOMICS & BIOINFORMATICS 2014; 12:210-20. [PMID: 25462153 PMCID: PMC4411419 DOI: 10.1016/j.gpb.2014.10.002] [Citation(s) in RCA: 93] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 06/21/2014] [Revised: 08/30/2014] [Accepted: 09/04/2014] [Indexed: 11/23/2022]
Abstract
Genetic studies are traditionally based on single-gene analysis. The use of these analyses can pose tremendous challenges for elucidating complicated genetic interplays involved in complex human diseases. Modern pathway-based analysis provides a technique, which allows a comprehensive understanding of the molecular mechanisms underlying complex diseases. Extensive studies utilizing the methods and applications for pathway-based analysis have significantly advanced our capacity to explore large-scale omics data, which has rapidly accumulated in biomedical fields. This article is a comprehensive review of the pathway-based analysis methods—the powerful methods with the potential to uncover the biological depths of the complex diseases. The general concepts and procedures for the pathway-based analysis methods are introduced and then, a comprehensive review of the major approaches for this analysis is presented. In addition, a list of available pathway-based analysis software and databases is provided. Finally, future directions and challenges for the methodological development and applications of pathway-based analysis techniques are discussed. This review will provide a useful guide to dissect complex diseases.
Collapse
Affiliation(s)
- Lv Jin
- Institute for Medical Systems Biology, and Department of Medical Statistics and Epidemiology, School of Public Health, Guangdong Medical College, Dongguan 523808, China
| | - Xiao-Yu Zuo
- Department of Medical Statistics and Epidemiology, School of Public Health, Sun Yat-Sen University, Guangzhou 510080, China
| | - Wei-Yang Su
- Community Health Service Management Center of Panyu District, Guangzhou 511400, China
| | - Xiao-Lei Zhao
- Institute for Medical Systems Biology, and Department of Medical Statistics and Epidemiology, School of Public Health, Guangdong Medical College, Dongguan 523808, China
| | - Man-Qiong Yuan
- Department of Statistical Sciences, School of Mathematics and Computational Science, Sun Yat-Sen University, Guangzhou 510275, China
| | - Li-Zhen Han
- Department of Medical Statistics and Epidemiology, School of Public Health, Sun Yat-Sen University, Guangzhou 510080, China
| | - Xiang Zhao
- Institute for Medical Systems Biology, and Department of Medical Statistics and Epidemiology, School of Public Health, Guangdong Medical College, Dongguan 523808, China
| | - Ye-Da Chen
- Institute for Medical Systems Biology, and Department of Medical Statistics and Epidemiology, School of Public Health, Guangdong Medical College, Dongguan 523808, China
| | - Shao-Qi Rao
- Institute for Medical Systems Biology, and Department of Medical Statistics and Epidemiology, School of Public Health, Guangdong Medical College, Dongguan 523808, China; Department of Medical Statistics and Epidemiology, School of Public Health, Sun Yat-Sen University, Guangzhou 510080, China; Department of Statistical Sciences, School of Mathematics and Computational Science, Sun Yat-Sen University, Guangzhou 510275, China.
| |
Collapse
|
38
|
Golozar A, Beaty TH, Gravitt PE, Ruczinski I, Qiao YL, Fan JH, Ding T, Tang ZZ, Etemadi A, Hu N, Hyland PL, Wang L, Wang C, Dawsey SM, Freedman ND, Abnet CC, Goldstein AM, Taylor PR. Oesophageal squamous cell carcinoma in high-risk Chinese populations: Possible role for vascular epithelial growth factor A. Eur J Cancer 2014; 50:2855-65. [PMID: 25172294 DOI: 10.1016/j.ejca.2014.07.022] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2014] [Revised: 07/24/2014] [Accepted: 07/29/2014] [Indexed: 12/25/2022]
Abstract
BACKGROUND Mechanisms involved in wound healing play some role in carcinogenesis in multiple organs, likely by creating a chronic inflammatory milieu. This study sought to assess the role of genetic markers in selected inflammation-related genes involved in wound healing (interleukin (IL)-1a, IL-1b, IL-1 Receptor type I (IL-1Ra), IL-1 Receptor type II (IL-1Rb), tumour necrosis factor (TNF)-α, tumour necrosis factor receptor superfamily member (TNFRSF)1A, nuclear factor kappa beta (NF-kB)1, NF-kB2, inducible nitric oxide synthase (iNOS), cyclooxygenase (COX)-2, hypoxia induced factor (HIF)-1α, vascular endothelial growth factor (VEGF)A and P-53) in risk to oesophageal squamous cell carcinoma (OSCC). METHODS We genotyped 125 tag single nucleotide polymorphism (SNP)s in 410 cases and 377 age and sex matched disease-free individuals from Nutritional Intervention Trial (NIT) cohort, and 546 cases and 556 controls individually matched for age, sex and neighbourhood from Shanxi case-control study, both conducted in high-risk areas of north-central China (1985-2007). Cox proportional-hazard models and conditional logistic regression models were used for SNPs analyses for NIT and Shanxi, respectively. Fisher's inverse test statistics were used to obtain gene-level significance. RESULTS Multiple SNPs were significantly associated with OSCC in both studies, however, none retained their significance after a conservative Bonferroni adjustment. Empiric p-values for tag SNPs in VEGFA in NIT were highly concentrated in the lower tail of the distribution, suggesting this gene may be influencing risk. Permutation tests confirmed the significance of this pattern. At the gene level, VEGFA yielded an empiric significance (P=0.027) in NIT. We also observed some evidence for interaction between environmental factors and some VEGFA tag SNPs. CONCLUSION Our finding adds further evidence for a potential role for markers in the VEGFA gene in the development and progression of early precancerous lesions of oesophagus.
Collapse
Affiliation(s)
- Asieh Golozar
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA; Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.
| | - Terri H Beaty
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Patti E Gravitt
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Ingo Ruczinski
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - You-Lin Qiao
- Department of Epidemiology, Cancer Institute (Hospital), Chinese Academy of Medical Sciences, Beijing, People's Republic of China
| | - Jin-Hu Fan
- Department of Epidemiology, Cancer Institute (Hospital), Chinese Academy of Medical Sciences, Beijing, People's Republic of China
| | - Ti Ding
- Shanxi Cancer Hospital, Taiyuan, Shanxi, People's Republic of China
| | - Ze-Zhong Tang
- Shanxi Cancer Hospital, Taiyuan, Shanxi, People's Republic of China
| | - Arash Etemadi
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Nan Hu
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Paula L Hyland
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Lemin Wang
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Chaoyu Wang
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Sanford M Dawsey
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Neal D Freedman
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Christian C Abnet
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Alisa M Goldstein
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Philip R Taylor
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| |
Collapse
|
39
|
Mooney MA, Nigg JT, McWeeney SK, Wilmot B. Functional and genomic context in pathway analysis of GWAS data. Trends Genet 2014; 30:390-400. [PMID: 25154796 DOI: 10.1016/j.tig.2014.07.004] [Citation(s) in RCA: 86] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2014] [Revised: 07/18/2014] [Accepted: 07/18/2014] [Indexed: 02/07/2023]
Abstract
Gene set analysis (GSA) is a promising tool for uncovering the polygenic effects associated with complex diseases. However, the available techniques reflect a wide variety of hypotheses about how genetic effects interact to contribute to disease susceptibility. The lack of consensus about the best way to perform GSA has led to confusion in the field and has made it difficult to compare results across methods. A clear understanding of the various choices made during GSA - such as how gene sets are defined, how single-nucleotide polymorphisms (SNPs) are assigned to genes, and how individual SNP-level effects are aggregated to produce gene- or pathway-level effects - will improve the interpretability and comparability of results across methods and studies. In this review we provide an overview of the various data sources used to construct gene sets and the statistical methods used to test for gene set association, as well as provide guidelines for ensuring the comparability of results.
Collapse
Affiliation(s)
- Michael A Mooney
- Division of Bioinformatics and Computational Biology, Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, OR, USA; OHSU Knight Cancer Institute, Portland, OR, USA
| | - Joel T Nigg
- Division of Psychology, Department of Psychiatry, Oregon Health & Science University, Portland, OR, USA; Department of Behavioral Neuroscience, Oregon Health & Science University, Portland, OR, USA
| | - Shannon K McWeeney
- Division of Bioinformatics and Computational Biology, Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, OR, USA; Oregon Clinical and Translational Research Institute, Portland, OR, USA; OHSU Knight Cancer Institute, Portland, OR, USA.
| | - Beth Wilmot
- Division of Bioinformatics and Computational Biology, Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, OR, USA; Oregon Clinical and Translational Research Institute, Portland, OR, USA; OHSU Knight Cancer Institute, Portland, OR, USA
| |
Collapse
|
40
|
Pathway-based association analysis of two genome-wide screening data identifies rheumatoid arthritis-related pathways. Genes Immun 2014; 15:487-94. [DOI: 10.1038/gene.2014.48] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2014] [Revised: 05/06/2014] [Accepted: 06/23/2014] [Indexed: 12/26/2022]
|
41
|
Zhao J, Zhu Y, Boerwinkle E, Xiong M. Pathway analysis with next-generation sequencing data. Eur J Hum Genet 2014; 23:507-15. [PMID: 24986826 DOI: 10.1038/ejhg.2014.121] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2013] [Revised: 03/29/2014] [Accepted: 04/26/2014] [Indexed: 12/21/2022] Open
Abstract
Although pathway analysis methods have been developed and successfully applied to association studies of common variants, the statistical methods for pathway-based association analysis of rare variants have not been well developed. Many investigators observed highly inflated false-positive rates and low power in pathway-based tests of association of rare variants. The inflated false-positive rates and low true-positive rates of the current methods are mainly due to their lack of ability to account for gametic phase disequilibrium. To overcome these serious limitations, we develop a novel statistic that is based on the smoothed functional principal component analysis (SFPCA) for pathway association tests with next-generation sequencing data. The developed statistic has the ability to capture position-level variant information and account for gametic phase disequilibrium. By intensive simulations, we demonstrate that the SFPCA-based statistic for testing pathway association with either rare or common or both rare and common variants has the correct type 1 error rates. Also the power of the SFPCA-based statistic and 22 additional existing statistics are evaluated. We found that the SFPCA-based statistic has a much higher power than other existing statistics in all the scenarios considered. To further evaluate its performance, the SFPCA-based statistic is applied to pathway analysis of exome sequencing data in the early-onset myocardial infarction (EOMI) project. We identify three pathways significantly associated with EOMI after the Bonferroni correction. In addition, our preliminary results show that the SFPCA-based statistic has much smaller P-values to identify pathway association than other existing methods.
Collapse
Affiliation(s)
- Jinying Zhao
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, USA
| | - Yun Zhu
- Department of Epidemiology, Tulane University School of Public Health and Tropical Medicine, New Orleans, LA, USA
| | - Eric Boerwinkle
- Human Genetics Center, Division of Biostatistics, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Momiao Xiong
- Human Genetics Center, Division of Biostatistics, University of Texas Health Science Center at Houston, Houston, TX, USA
| |
Collapse
|
42
|
Hicks C, Koganti T, Giri S, Tekere M, Ramani R, Sitthi-Amorn J, Vijayakumar S. Integrative genomic analysis for the discovery of biomarkers in prostate cancer. Biomark Insights 2014; 9:39-51. [PMID: 25057237 PMCID: PMC4085106 DOI: 10.4137/bmi.s13729] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2013] [Revised: 04/03/2014] [Accepted: 04/06/2014] [Indexed: 12/18/2022] Open
Abstract
Genome-wide association studies (GWAS) have achieved great success in identifying single nucleotide polymorphisms (SNPs, herein called genetic variants) and genes associated with risk of developing prostate cancer. However, GWAS do not typically link the genetic variants to the disease state or inform the broader context in which the genetic variants operate. Here, we present a novel integrative genomics approach that combines GWAS information with gene expression data to infer the causal association between gene expression and the disease and to identify the network states and biological pathways enriched for genetic variants. We identified gene regulatory networks and biological pathways enriched for genetic variants, including the prostate cancer, IGF-1, JAK2, androgen, and prolactin signaling pathways. The integration of GWAS information with gene expression data provides insights about the broader context in which genetic variants associated with an increased risk of developing prostate cancer operate.
Collapse
Affiliation(s)
- Chindo Hicks
- Cancer Institute, University of Mississippi Medical Center, Jackson, MS, USA. ; Department of Medicine, University of Mississippi Medical Center, Jackson, MS, USA. ; Department of Radiation Oncology, University of Mississippi Medical Center, Jackson, MS, USA. ; Department of Public Health Sciences, University of Lusaka, Lusaka, Zambia
| | - Tejaswi Koganti
- Cancer Institute, University of Mississippi Medical Center, Jackson, MS, USA
| | - Shankar Giri
- Cancer Institute, University of Mississippi Medical Center, Jackson, MS, USA
| | - Memory Tekere
- Department of Environmental Sciences, University of South Africa, UNISA Florida Campus, Florida, South Africa
| | - Ritika Ramani
- Cancer Institute, University of Mississippi Medical Center, Jackson, MS, USA
| | | | - Srinivasan Vijayakumar
- Department of Radiation Oncology, University of Mississippi Medical Center, Jackson, MS, USA
| |
Collapse
|
43
|
Abstract
Although many genetic factors have been successfully identified for human diseases in genome-wide association studies, genes discovered to date only account for a small proportion of overall genetic contributions to many complex traits. Association studies have difficulty in detecting the remaining true genetic variants that are either common variants with weak allelic effects, or rare variants that have strong allelic effects but are weakly associated at the population level. In this work, we applied a goodness-of-fit test for detecting sets of common and rare variants associated with quantitative or binary traits by using whole genome sequencing data. This test has been proved optimal for detecting weak and sparse signals in the literature, which fits the requirements for targeting the genetic components of missing heritability. Furthermore, this p value-combining method allows one to incorporate different data and/or research results for meta-analysis. The method was used to simultaneously analyse the whole genome sequencing and genome-wide association studies data of Genetic Analysis Workshop 18 for detecting true genetic variants. The results show that goodness-of-fit test is comparable or better than the influential sequence kernel association test in many cases.
Collapse
Affiliation(s)
- Li Yang
- Department of Mathematical Sciences, Worcester Polytechnic Institute, 100 Institute Road, Worcester, MA 01609-2280, USA
| | - Jing Xuan
- Department of Mathematical Sciences, Worcester Polytechnic Institute, 100 Institute Road, Worcester, MA 01609-2280, USA
| | - Zheyang Wu
- Department of Mathematical Sciences, Worcester Polytechnic Institute, 100 Institute Road, Worcester, MA 01609-2280, USA
| |
Collapse
|
44
|
Wu Z, Sun Y, He S, Cho J, Zhao H, Jin J. Detection boundary and Higher Criticism approach for rare and weak genetic effects. Ann Appl Stat 2014. [DOI: 10.1214/14-aoas724] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
45
|
Kulminski AM. Unraveling genetic origin of aging-related traits: evolving concepts. Rejuvenation Res 2014; 16:304-12. [PMID: 23768105 DOI: 10.1089/rej.2013.1441] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
Discovering the genetic origin of aging-related traits could greatly advance strategies aiming to extend health span. The results of genome-wide association studies (GWAS) addressing this problem are controversial, and new genetic concepts have been fostered to advance the progress in the field. A limitation of GWAS and new genetic concepts is that they do not thoroughly address specifics of aging-related traits. Integration of theoretical concepts in genetics and aging research with empirical evidence from different disciplines highlights the conceptual problems in studies of genetic origin of aging-related traits. To address these problems, novel approaches of systemic nature are required. These approaches should adopt the non-deterministic nature of linkage of genes with aging-related traits and, consequently, reinforce research strategies for improving our understanding of mechanisms shaping genetic effects on these traits. Investigation of mechanisms will help determine conditions that activate specific genetic variants or profiles and explore to what extent these conditions that shape genetic effects are conserved across human lives and generations.
Collapse
Affiliation(s)
- Alexander M Kulminski
- Center for Population Health and Aging, Duke University, Durham, North Carolina 27708, USA.
| |
Collapse
|
46
|
Lu M, Lee HS, Hadley D, Huang JZ, Qian X. Supervised categorical principal component analysis for genome-wide association analyses. BMC Genomics 2014; 15 Suppl 1:S10. [PMID: 24564304 PMCID: PMC4046680 DOI: 10.1186/1471-2164-15-s1-s10] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/22/2023] Open
Abstract
In order to have a better understanding of unexplained heritability for complex diseases in conventional Genome-Wide Association Studies (GWAS), aggregated association analyses based on predefined functional regions, such as genes and pathways, become popular recently as they enable evaluating joint effect of multiple Single-Nucleotide Polymorphisms (SNPs), which helps increase the detection power, especially when investigating genetic variants with weak individual effects. In this paper, we focus on aggregated analysis methods based on the idea of Principal Component Analysis (PCA). The past approaches using PCA mostly make some inherent genotype data and/or risk effect model assumptions, which may hinder the accurate detection of potential disease SNPs that influence disease phenotypes. In this paper, we derive a general Supervised Categorical Principal Component Analysis (SCPCA), which explicitly models categorical SNP data without imposing any risk effect model assumption. We have evaluated the efficacy of SCPCA with the comparison to a traditional Supervised PCA (SPCA) and a previously developed Supervised Logistic Principal Component Analysis (SLPCA) based on both the simulated genotype data by HAPGEN2 and the genotype data of Crohn's Disease (CD) from Wellcome Trust Case Control Consortium (WTCCC). Our preliminary results have demonstrated the superiority of SCPCA over both SPCA and SLPCA due to its modeling explicitly designed for categorical SNP data as well as its flexibility on the risk effect model assumption.
Collapse
|
47
|
Simonson MA, McQueen MB, Keller MC. Whole-genome pathway analysis on 132,497 individuals identifies novel gene-sets associated with body mass index. PLoS One 2014; 9:e78546. [PMID: 24497910 PMCID: PMC3908858 DOI: 10.1371/journal.pone.0078546] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2013] [Accepted: 09/14/2013] [Indexed: 01/28/2023] Open
Abstract
Whole genome pathway analysis is a powerful tool for the exploration of the combined effects of gene-sets within biological pathways. This study applied Interval Based Enrichment Analysis (INRICH) to perform whole-genome pathway analysis of body-mass index (BMI). We used a discovery set composed of summary statistics from a meta-analysis of 123,865 subjects performed by the GIANT Consortium, and an independent sample of 8,632 subjects to assess replication of significant pathways. We examined SNPs within nominally significant pathways using linear mixed models to estimate their contribution to overall BMI heritability. Six pathways replicated as having significant enrichment for association after correcting for multiple testing, including the previously unknown relationships between BMI and the Reactome regulation of ornithine decarboxylase pathway, the KEGG lysosome pathway, and the Reactome stabilization of P53 pathway. Two non-overlapping sets of genes emerged from the six significant pathways. The clustering of shared genes based on previously identified protein-protein interactions listed in PubMed and OMIM supported the relatively independent biological effects of these two gene-sets. We estimate that the SNPs located in examined pathways explain ∼20% of the heritability for BMI that is tagged by common SNPs (3.35% of the 16.93% total).
Collapse
Affiliation(s)
- Matthew A. Simonson
- Department of Psychology and Neuroscience, Institute for Behavioral Genetics, University of Colorado, Boulder, Colorado, United States of America
- Mayo Clinic, Department of Health Sciences, Division of Biomedical Statistics and Informatics, Rochester, Minnesota, United States of America
- * E-mail:
| | - Matthew B. McQueen
- Department of Integrative Physiology, Institute for Behavioral Genetics, University of Colorado, Boulder, Colorado, United States of America
| | - Matthew C. Keller
- Department of Psychology and Neuroscience, Institute for Behavioral Genetics, University of Colorado, Boulder, Colorado, United States of America
| |
Collapse
|
48
|
Incorporating prior knowledge to increase the power of genome-wide association studies. Methods Mol Biol 2014; 1019:519-41. [PMID: 23756909 DOI: 10.1007/978-1-62703-447-0_25] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
Abstract
Typical methods of analyzing genome-wide single nucleotide variant (SNV) data in cases and controls involve testing each variant's genotypes separately for phenotype association, and then using a substantial multiple-testing penalty to minimize the rate of false positives. This approach, however, can result in low power for modestly associated SNVs. Furthermore, simply looking at the most associated SNVs may not directly yield biological insights about disease etiology. SNVset methods attempt to address both limitations of the traditional approach by testing biologically meaningful sets of SNVs (e.g., genes or pathways). The number of tests run in a SNVset analysis is typically much lower (hundreds or thousands instead of millions) than in a traditional analysis, so the false-positive rate is lower. Additionally, by testing SNVsets that are biologically meaningful finding a significant set may more quickly yield insights into disease etiology.In this chapter we summarize the short history of SNVset testing and provide an overview of the many recently proposed methods. Furthermore, we provide detailed step-by-step instructions on how to perform a SNVset analysis, including a substantial number of practical tips and questions that researchers should consider before undertaking a SNVset analysis. Lastly, we describe a companion R package (snvset) that implements recently proposed SNVset methods. While SNVset testing is a new approach, with many new methods still being developed and many open questions, the promise of the approach is worth serious consideration when considering analytic methods for GWAS.
Collapse
|
49
|
Liu G, Jiang Y, Chen X, Zhang R, Ma G, Feng R, Zhang L, Liao M, Miao Y, Chen Z, Zeng R, Li K. Measles contributes to rheumatoid arthritis: evidence from pathway and network analyses of genome-wide association studies. PLoS One 2013; 8:e75951. [PMID: 24204584 PMCID: PMC3799991 DOI: 10.1371/journal.pone.0075951] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2013] [Accepted: 08/19/2013] [Indexed: 12/27/2022] Open
Abstract
Growing evidence from epidemiological studies indicates the association between rheumatoid arthritis (RA) and measles. However, the exact mechanism for this association is still unclear now. We consider that the strong association between both diseases may be caused by shared genetic pathways. We performed a pathway analysis of large-scale RA genome-wide association studies (GWAS) dataset with 5,539 cases and 20,169 controls of European descent. Meanwhile, we evaluated our findings using previously identified RA loci, protein-protein interaction network and previous results from pathway analysis of RA and other autoimmune diseases GWAS. We confirmed four pathways including Cytokine-cytokine receptor interaction, Jak-STAT signaling, T cell receptor signaling and Cell adhesion molecules. Meanwhile, we highlighted for the first time the involvement of Measles and Intestinal immune network for IgA production pathways in RA. Our results may explain the strong association between RA and measles, which may be caused by the shared genetic pathway. We believe that our results will be helpful for future genetic studies in RA pathogenesis and may significantly assist in the development of therapeutic strategies.
Collapse
Affiliation(s)
- Guiyou Liu
- Institute of Neurology, Guangdong Medical College, Zhanjiang, China
- Genome Analysis Laboratory, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, China
| | - Yongshuai Jiang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Xiaoguang Chen
- Key Laboratory of Sweetpotato Biology and Genetic Breeding, Sweetpotato Research Institute, Chinese Academy of Agricultural Sciences, Xuzhou, Jiangsu, China
| | - Ruijie Zhang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Guoda Ma
- Institute of Neurology, Guangdong Medical College, Zhanjiang, China
| | - Rennan Feng
- Department of Nutrition and Food Hygiene, School of Public Health, Harbin Medical University, Harbin, China
| | - Liangcai Zhang
- Department of Bioinformatics and Computational Biology, Division of Quantitative Sciences, The University of Texas M.D. Anderson Cancer Center, Houston, Texas, United States of America
| | - Mingzhi Liao
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Yingbo Miao
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Zugen Chen
- Department of Human Genetics, University of California Los Angeles, Los Angeles, California, United States of America
| | - Rong Zeng
- Department of Orthopedic Surgery, Affiliated Hospital of Guangdong Medical College, Zhanjiang, China
- * E-mail: (RZ); (KL)
| | - Keshen Li
- Institute of Neurology, Guangdong Medical College, Zhanjiang, China
- * E-mail: (RZ); (KL)
| |
Collapse
|
50
|
Carbonetto P, Stephens M. Integrated enrichment analysis of variants and pathways in genome-wide association studies indicates central role for IL-2 signaling genes in type 1 diabetes, and cytokine signaling genes in Crohn's disease. PLoS Genet 2013; 9:e1003770. [PMID: 24098138 PMCID: PMC3789883 DOI: 10.1371/journal.pgen.1003770] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2012] [Accepted: 07/22/2013] [Indexed: 12/17/2022] Open
Abstract
Pathway analyses of genome-wide association studies aggregate information over sets of related genes, such as genes in common pathways, to identify gene sets that are enriched for variants associated with disease. We develop a model-based approach to pathway analysis, and apply this approach to data from the Wellcome Trust Case Control Consortium (WTCCC) studies. Our method offers several benefits over existing approaches. First, our method not only interrogates pathways for enrichment of disease associations, but also estimates the level of enrichment, which yields a coherent way to promote variants in enriched pathways, enhancing discovery of genes underlying disease. Second, our approach allows for multiple enriched pathways, a feature that leads to novel findings in two diseases where the major histocompatibility complex (MHC) is a major determinant of disease susceptibility. Third, by modeling disease as the combined effect of multiple markers, our method automatically accounts for linkage disequilibrium among variants. Interrogation of pathways from eight pathway databases yields strong support for enriched pathways, indicating links between Crohn's disease (CD) and cytokine-driven networks that modulate immune responses; between rheumatoid arthritis (RA) and "Measles" pathway genes involved in immune responses triggered by measles infection; and between type 1 diabetes (T1D) and IL2-mediated signaling genes. Prioritizing variants in these enriched pathways yields many additional putative disease associations compared to analyses without enrichment. For CD and RA, 7 of 8 additional non-MHC associations are corroborated by other studies, providing validation for our approach. For T1D, prioritization of IL-2 signaling genes yields strong evidence for 7 additional non-MHC candidate disease loci, as well as suggestive evidence for several more. Of the 7 strongest associations, 4 are validated by other studies, and 3 (near IL-2 signaling genes RAF1, MAPK14, and FYN) constitute novel putative T1D loci for further study.
Collapse
Affiliation(s)
- Peter Carbonetto
- Dept. of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
| | - Matthew Stephens
- Dept. of Human Genetics, University of Chicago, Chicago, Illinois, United States of America
- Dept. of Statistics, University of Chicago, Chicago, Illinois, United States of America
| |
Collapse
|