1
|
Lauby SC, Lapp HE, Salazar M, Semyrenko S, Chauhan D, Margolis AE, Champagne FA. Postnatal maternal care moderates the effects of prenatal bisphenol exposure on offspring neurodevelopmental, behavioral, and transcriptomic outcomes. PLoS One 2024; 19:e0305256. [PMID: 38861567 PMCID: PMC11166292 DOI: 10.1371/journal.pone.0305256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Accepted: 05/28/2024] [Indexed: 06/13/2024] Open
Abstract
Bisphenols (BP), including BPA and "BPA-free" structural analogs, are commonly used plasticizers that are present in many plastics and are known endocrine disrupting chemicals. Prenatal exposure to BPA has been associated with negative neurodevelopmental and behavioral outcomes in children and in rodent models. Prenatal BPA exposure has also been shown to impair postnatal maternal care provisioning, which can also affect offspring neurodevelopment and behavior. However, there is limited knowledge regarding the biological effects of prenatal exposure to bisphenols other than BPA and the interplay between prenatal bisphenol exposure and postnatal maternal care on adult behavior. The purpose of the current study was to determine the interactive impact of prenatal bisphenol exposure and postnatal maternal care on neurodevelopment and behavior in rats. Our findings suggest that the effects of prenatal bisphenol exposure on eye-opening, adult attentional set shifting and anxiety-like behavior in the open field are dependent on maternal care in the first five days of life. Interestingly, maternal care might also attenuate the effects of prenatal bisphenol exposure on eye opening and adult attentional set shifting. Finally, transcriptomic profiles in male and female medial prefrontal cortex and amygdala suggest that the interactive effects of prenatal bisphenol exposure and postnatal maternal care converge on estrogen receptor signaling and are involved in biological processes related to gene expression and protein translation and synthesis. Overall, these findings indicate that postnatal maternal care plays a critical role in the expression of the effects of prenatal bisphenol exposure on neurodevelopment and adult behavior. Understanding the underlying biological mechanisms involved might allow us to identify potential avenues to mitigate the adverse effects of prenatal bisphenol exposure and improve health and well-being in human populations.
Collapse
Affiliation(s)
- Samantha C. Lauby
- Department of Psychology, College of Liberal Arts, University of Texas at Austin, Austin, Texas, United States of America
- Center for Molecular Carcinogenesis and Toxicology, University of Texas at Austin, Austin, Texas, United States of America
| | - Hannah E. Lapp
- Department of Psychology, College of Liberal Arts, University of Texas at Austin, Austin, Texas, United States of America
| | - Melissa Salazar
- Department of Psychology, College of Liberal Arts, University of Texas at Austin, Austin, Texas, United States of America
| | - Sofiia Semyrenko
- Department of Psychology, College of Liberal Arts, University of Texas at Austin, Austin, Texas, United States of America
| | - Danyal Chauhan
- Department of Psychology, College of Liberal Arts, University of Texas at Austin, Austin, Texas, United States of America
| | - Amy E. Margolis
- Department of Psychiatry, Columbia University Irving Medical Center, New York City, New York, United States of America
| | - Frances A. Champagne
- Department of Psychology, College of Liberal Arts, University of Texas at Austin, Austin, Texas, United States of America
- Center for Molecular Carcinogenesis and Toxicology, University of Texas at Austin, Austin, Texas, United States of America
| |
Collapse
|
2
|
Liu CM, Chen WS, Chang SL, Hsieh YC, Hsu YH, Chang HX, Lin YJ, Lo LW, Hu YF, Chung FP, Chao TF, Tuan TC, Liao JN, Lin CY, Chang TY, Kuo L, Wu CI, Wu MH, Chen CK, Chang YY, Shiu YC, Lu HHS, Chen SA. Use of artificial intelligence and I-Score for prediction of recurrence before catheter ablation of atrial fibrillation. Int J Cardiol 2024; 402:131851. [PMID: 38360099 DOI: 10.1016/j.ijcard.2024.131851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 01/14/2024] [Accepted: 02/10/2024] [Indexed: 02/17/2024]
Abstract
BACKGROUND Based solely on pre-ablation characteristics, previous risk scores have demonstrated variable predictive performance. This study aimed to predict the recurrence of AF after catheter ablation by using artificial intelligence (AI)-enabled pre-ablation computed tomography (PVCT) images and pre-ablation clinical data. METHODS A total of 638 drug-refractory paroxysmal atrial fibrillation (AF) patients undergone ablation were recruited. For model training, we used left atria (LA) acquired from pre-ablation PVCT slices (126,288 images). A total of 29 clinical variables were collected before ablation, including baseline characteristics, medical histories, laboratory results, transthoracic echocardiographic parameters, and 3D reconstructed LA volumes. The I-Score was applied to select variables for model training. For the prediction of one-year AF recurrence, PVCT deep-learning and clinical variable machine-learning models were developed. We then applied machine learning to ensemble the PVCT and clinical variable models. RESULTS The PVCT model achieved an AUC of 0.63 in the test set. Various combinations of clinical variables selected by I-Score can yield an AUC of 0.72, which is significantly better than all variables or features selected by nonparametric statistics (AUCs of 0.66 to 0.69). The ensemble model (PVCT images and clinical variables) significantly improved predictive performance up to an AUC of 0.76 (sensitivity of 86.7% and specificity of 51.0%). CONCLUSIONS Before ablation, AI-enabled PVCT combined with I-Score features was applicable in predicting recurrence in paroxysmal AF patients. Based on all possible predictors, the I-Score is capable of identifying the most influential combination.
Collapse
Affiliation(s)
- Chih-Min Liu
- Heart Rhythm Center, Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taiwan; Cardiovascular Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan.
| | - Wei-Shiang Chen
- Institute of Statistics, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Shih-Lin Chang
- Heart Rhythm Center, Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taiwan; Cardiovascular Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Yu-Cheng Hsieh
- Cardiovascular Center, Taichung Veterans General Hospital, Taichung, Taiwan
| | - Yuan-Heng Hsu
- Institute of Statistics, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Hao-Xiang Chang
- Institute of Statistics, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
| | - Yenn-Jiang Lin
- Heart Rhythm Center, Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taiwan; Cardiovascular Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Li-Wei Lo
- Heart Rhythm Center, Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taiwan; Cardiovascular Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Yu-Feng Hu
- Heart Rhythm Center, Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taiwan; Cardiovascular Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Fa-Po Chung
- Heart Rhythm Center, Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taiwan; Cardiovascular Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Tze-Fan Chao
- Heart Rhythm Center, Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taiwan; Cardiovascular Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Ta-Chuan Tuan
- Heart Rhythm Center, Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taiwan; Cardiovascular Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Jo-Nan Liao
- Heart Rhythm Center, Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taiwan; Cardiovascular Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Chin-Yu Lin
- Heart Rhythm Center, Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taiwan; Cardiovascular Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Ting-Yung Chang
- Heart Rhythm Center, Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taiwan; Cardiovascular Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Ling Kuo
- Heart Rhythm Center, Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taiwan; Cardiovascular Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Cheng-I Wu
- Heart Rhythm Center, Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taiwan; Cardiovascular Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan
| | - Mei-Han Wu
- Department of Medical Imaging, Diagnostic Radiology, Cheng Hsin General Hospital, Taipei, Taiwan
| | - Chun-Ku Chen
- Department of Radiology, Taipei Veterans General Hospital, Taipei, Taiwan
| | - Ying-Yueh Chang
- Department of Radiology, Taipei Veterans General Hospital, Taipei, Taiwan
| | - Yang-Che Shiu
- Heart Rhythm Center, Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taiwan
| | - Henry Horng-Shing Lu
- Institute of Statistics, National Yang Ming Chiao Tung University, Hsinchu, Taiwan; Department of Statistics and Data Science, Cornell University, Ithaca, New York, USA.
| | - Shih-Ann Chen
- Heart Rhythm Center, Division of Cardiology, Department of Medicine, Taipei Veterans General Hospital, Taiwan; Cardiovascular Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan; Cardiovascular Center, Taichung Veterans General Hospital, Taichung, Taiwan; National Chung Hsing University, Taichung, Taiwan
| |
Collapse
|
3
|
Lauby SC, Lapp HE, Salazar M, Semyrenko S, Chauhan D, Margolis AE, Champagne FA. Postnatal maternal care moderates the effects of prenatal bisphenol exposure on offspring neurodevelopmental, behavioral, and transcriptomic outcomes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.19.558481. [PMID: 37786706 PMCID: PMC10541647 DOI: 10.1101/2023.09.19.558481] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/04/2023]
Abstract
Bisphenols (BPs), including BPA and "BPA-free" structural analogs, are commonly used plasticizers that are present in many plastics and are known endocrine disrupting chemicals. Prenatal exposure to BPA has been associated with negative neurodevelopmental and behavioral outcomes in children and rodent models. Prenatal BPA exposure has also been shown to impair postnatal maternal care provisioning, which can also affect offspring neurodevelopment and behavior. However, there is limited knowledge regarding the biological effects of prenatal exposure to bisphenols other than BPA and the interplay between prenatal BP exposure and postnatal maternal care on adult behavior. The purpose of the current study was to determine the interactive impact of prenatal BP exposure and postnatal maternal care on neurodevelopment and behavior. Our findings suggest that the effects of prenatal BP exposure on eye-opening, adult attentional set shifting and anxiety-like behavior in the open field are dependent on maternal care in the first five days of life. Interestingly, maternal care might also attenuate the effects of prenatal BP exposure on eye opening and adult attentional set shifting. Finally, transcriptomic profiles in male and female medial prefrontal cortex and amygdala suggest that the interactive effects of prenatal BP exposure and postnatal maternal care converge on estrogen receptor signaling and are involved in biological processes related to gene expression and protein translation and synthesis. Overall, these findings indicate that postnatal maternal care plays a critical role in the expression of the effects of prenatal BP exposure on neurodevelopment and adult behavior. Understanding the underlying biological mechanisms involved might allow us to identify potential avenues to mitigate the adverse effects of prenatal BP exposure and improve health and well-being in human populations.
Collapse
Affiliation(s)
- Samantha C Lauby
- Department of Psychology, College of Liberal Arts, University of Texas at Austin
- Center for Molecular Carcinogenesis and Toxicology, University of Texas at Austin
| | - Hannah E Lapp
- Department of Psychology, College of Liberal Arts, University of Texas at Austin
| | - Melissa Salazar
- Department of Psychology, College of Liberal Arts, University of Texas at Austin
| | - Sofiia Semyrenko
- Department of Psychology, College of Liberal Arts, University of Texas at Austin
| | - Danyal Chauhan
- Department of Psychology, College of Liberal Arts, University of Texas at Austin
| | - Amy E Margolis
- Department of Psychiatry, Columbia University Irving Medical Center
| | - Frances A Champagne
- Department of Psychology, College of Liberal Arts, University of Texas at Austin
- Center for Molecular Carcinogenesis and Toxicology, University of Texas at Austin
| |
Collapse
|
4
|
Okazaki A, Horpaopan S, Zhang Q, Randesi M, Ott J. Genotype Pattern Mining for Pairs of Interacting Variants Underlying Digenic Traits. Genes (Basel) 2021; 12:1160. [PMID: 34440333 PMCID: PMC8391494 DOI: 10.3390/genes12081160] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Revised: 07/23/2021] [Accepted: 07/27/2021] [Indexed: 12/15/2022] Open
Abstract
Some genetic diseases ("digenic traits") are due to the interaction between two DNA variants, which presumably reflects biochemical interactions. For example, certain forms of Retinitis Pigmentosa, a type of blindness, occur in the presence of two mutant variants, one each in the ROM1 and RDS genes, while the occurrence of only one such variant results in a normal phenotype. Detecting variant pairs underlying digenic traits by standard genetic methods is difficult and is downright impossible when individual variants alone have minimal effects. Frequent pattern mining (FPM) methods are known to detect patterns of items. We make use of FPM approaches to find pairs of genotypes (from different variants) that can discriminate between cases and controls. Our method is based on genotype patterns of length two, and permutation testing allows assigning p-values to genotype patterns, where the null hypothesis refers to equal pattern frequencies in cases and controls. We compare different interaction search approaches and their properties on the basis of published datasets. Our implementation of FPM to case-control studies is freely available.
Collapse
Affiliation(s)
- Atsuko Okazaki
- Department of Diagnostics and Therapeutics of Intractable Diseases, Juntendo University, Bunkyo-ku, Tokyo 113-8421, Japan;
- Laboratory of Statistical Genetics, Rockefeller University, New York, NY 10065, USA
| | - Sukanya Horpaopan
- Department of Anatomy, Faculty of Medical Science, Naresuan University, Phitsanulok 65000, Thailand;
| | - Qingrun Zhang
- Department of Mathematics and Statistics, University of Calgary, Calgary, AB T2N 1N4, Canada;
| | - Matthew Randesi
- Laboratory of the Biology of Addictive Diseases, Rockefeller University, New York, NY 10065, USA;
| | - Jurg Ott
- Laboratory of Statistical Genetics, Rockefeller University, New York, NY 10065, USA
| |
Collapse
|
5
|
Kothari C, Diorio C, Durocher F. Gene signatures of breast cancer development and the potential for novel targeted treatments. Pharmacogenomics 2021; 21:157-161. [PMID: 31967517 DOI: 10.2217/pgs-2019-0158] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Affiliation(s)
- Charu Kothari
- Département de Médecine Moléculaire, Faculté de Médecine, Université Laval, Québec, QC G1V 0A6, Canada.,Centre de Recherche sur le Cancer, Centre de Recherche du CHU de Québec-Université Laval, Québec, QC G1V 0A6, Canada
| | - Caroline Diorio
- Centre de Recherche sur le Cancer, Centre de Recherche du CHU de Québec-Université Laval, Québec, QC G1V 0A6, Canada.,Département de Médecine Sociale et Préventive, Faculté de Médecine, Université Laval, Québec, QC G1V 0A6, Canada
| | - Francine Durocher
- Département de Médecine Moléculaire, Faculté de Médecine, Université Laval, Québec, QC G1V 0A6, Canada.,Centre de Recherche sur le Cancer, Centre de Recherche du CHU de Québec-Université Laval, Québec, QC G1V 0A6, Canada
| |
Collapse
|
6
|
Yang YX, Wei L, Zhang YJ, Hayano T, Piñeiro Pereda MDP, Nakaoka H, Li Q, Barragán Mallofret I, Lu YZ, Tamagnone L, Inoue I, Li X, Luo JY, Zheng K, You H. Long non-coding RNA p10247, high expressed in breast cancer (lncRNA-BCHE), is correlated with metastasis. Clin Exp Metastasis 2018; 35:109-121. [DOI: 10.1007/s10585-018-9901-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2017] [Accepted: 05/11/2018] [Indexed: 10/14/2022]
|
7
|
Polymorphisms of miR-196a2 (rs11614913) and miR-605 (rs2043556) confer susceptibility to gastric cancer. GENE REPORTS 2017. [DOI: 10.1016/j.genrep.2017.04.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
|
8
|
Lo A, Agne M, Auerbach J, Fan R, Lo SH, Wang P, Zheng T. Network-guided interaction mining for the blood pressure phenotype of unrelated individuals in genetic analysis workshop 19. BMC Proc 2016; 10:333-336. [PMID: 27980658 PMCID: PMC5133535 DOI: 10.1186/s12919-016-0052-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Interactions between genes are an important part of the genetic architecture of complex diseases. In this paper, we use literature-guided individual genes known to be associated with type 2 diabetes (referred to as "seed genes") to create a larger list of genes that share implied or direct networks with these seed genes. This larger list of genes are known to interact with each other, but whether they interact in ways to influence hypertension in individuals presents an interesting question. Using Genetic Analysis Workshop data on individuals with diabetes, for which only case-control labels of hypertension are known, we offer a foray into identification of diabetes-related gene interactions that are associated with hypertension. We use the approach of Lo et al. (Proc Natl Acad Sci U S A 105: 12387-12392, 2008), which creates a score to identify pairwise significant gene associations. We find that the genes GCK and PAX4, formerly known to be found within similar coexpression and pathway networks but without specific direct interactions, do, in fact, show significant joint interaction effects for hypertension.
Collapse
Affiliation(s)
- Adeline Lo
- Department of Politics, Princeton University, 130 Corwin Hall, Princeton, NJ 08544-1012 USA
| | - Michael Agne
- Department of Statistics, Columbia University, New York, 10027 USA
| | | | - Rachel Fan
- Department of Statistics, Columbia University, New York, 10027 USA
| | - Shaw-Hwa Lo
- Department of Statistics, Columbia University, New York, 10027 USA
| | - Pei Wang
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, 10029 USA
| | - Tian Zheng
- Department of Statistics, Columbia University, New York, 10027 USA
| |
Collapse
|
9
|
Wang MH, Sun R, Guo J, Weng H, Lee J, Hu I, Sham PC, Zee BCY. A fast and powerful W-test for pairwise epistasis testing. Nucleic Acids Res 2016; 44:e115. [PMID: 27112568 PMCID: PMC4937324 DOI: 10.1093/nar/gkw347] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2015] [Revised: 04/14/2016] [Accepted: 04/15/2016] [Indexed: 01/08/2023] Open
Abstract
Epistasis plays an essential role in the development of complex diseases. Interaction methods face common challenge of seeking a balance between persistent power, model complexity, computation efficiency, and validity of identified bio-markers. We introduce a novel W-test to identify pairwise epistasis effect, which measures the distributional difference between cases and controls through a combined log odds ratio. The test is model-free, fast, and inherits a Chi-squared distribution with data adaptive degrees of freedom. No permutation is needed to obtain the P-values. Simulation studies demonstrated that the W-test is more powerful in low frequency variants environment than alternative methods, which are the Chi-squared test, logistic regression and multifactor-dimensionality reduction (MDR). In two independent real bipolar disorder genome-wide associations (GWAS) datasets, the W-test identified significant interactions pairs that can be replicated, including SLIT3-CENPN, SLIT3-TMEM132D, CNTNAP2-NDST4 and CNTCAP2-RTN4R The genes in the pairs play central roles in neurotransmission and synapse formation. A majority of the identified loci are undiscoverable by main effect and are low frequency variants. The proposed method offers a powerful alternative tool for mapping the genetic puzzle underlying complex disorders.
Collapse
Affiliation(s)
- Maggie Haitian Wang
- Division of Biostatistics and Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, the Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China CUHK Shenzhen Research Institute, Shenzhen, China
| | - Rui Sun
- Division of Biostatistics and Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, the Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China CUHK Shenzhen Research Institute, Shenzhen, China
| | - Junfeng Guo
- The Australian National University, Canberra, Australia
| | - Haoyi Weng
- Division of Biostatistics and Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, the Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China CUHK Shenzhen Research Institute, Shenzhen, China
| | - Jack Lee
- Division of Biostatistics and Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, the Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China
| | - Inchi Hu
- ISOM Department and Biomedical Engineering Division, the Hong Kong University of Science and Technology, Kowloon, Hong Kong SAR, China
| | - Pak Chung Sham
- Department of Psychiatry; Centre for Genomic Sciences, the University of Hong Kong, Pok Fu Lam, Hong Kong SAR, China
| | - Benny Chung-Ying Zee
- Division of Biostatistics and Centre for Clinical Research and Biostatistics, JC School of Public Health and Primary Care, the Chinese University of Hong Kong, Shatin, N.T., Hong Kong SAR, China CUHK Shenzhen Research Institute, Shenzhen, China
| |
Collapse
|
10
|
Abstract
BACKGROUND New technologies for acquisition of genomic data, while offering unprecedented opportunities for genetic discovery, also impose severe burdens of interpretation and penalties for multiple testing. METHODS The Pathway-based Analyses Group of the Genetic Analysis Workshop 19 (GAW19) sought reduction of multiple-testing burden through various approaches to aggregation of highdimensional data in pathways informed by prior biological knowledge. RESULTS Experimental methods testedincluded the use of "synthetic pathways" (random sets of genes) to estimate power and false-positive error rate of methods applied to simulated data; data reduction via independent components analysis, single-nucleotide polymorphism (SNP)-SNP interaction, and use of gene sets to estimate genetic similarity; and general assessment of the efficacy of prior biological knowledge to reduce the dimensionality of complex genomic data. CONCLUSIONS The work of this group explored several promising approaches to managing high-dimensional data, with the caveat that these methods are necessarily constrained by the quality of external bioinformatic annotation.
Collapse
Affiliation(s)
- Jack W Kent
- Department of Genetics, Texas Biomedical Research Institute, PO Box 760549, San Antonio, TX, 78245-0549, USA.
| |
Collapse
|
11
|
Abstract
Thus far, genome-wide association studies (GWAS) have been disappointing in the inability of investigators to use the results of identified, statistically significant variants in complex diseases to make predictions useful for personalized medicine. Why are significant variables not leading to good prediction of outcomes? We point out that this problem is prevalent in simple as well as complex data, in the sciences as well as the social sciences. We offer a brief explanation and some statistical insights on why higher significance cannot automatically imply stronger predictivity and illustrate through simulations and a real breast cancer example. We also demonstrate that highly predictive variables do not necessarily appear as highly significant, thus evading the researcher using significance-based methods. We point out that what makes variables good for prediction versus significance depends on different properties of the underlying distributions. If prediction is the goal, we must lay aside significance as the only selection standard. We suggest that progress in prediction requires efforts toward a new research agenda of searching for a novel criterion to retrieve highly predictive variables rather than highly significant variables. We offer an alternative approach that was not designed for significance, the partition retention method, which was very effective predicting on a long-studied breast cancer data set, by reducing the classification error rate from 30% to 8%.
Collapse
|
12
|
Yuan Z, Zhang X, Li F, Zhao J, Xue F. Comparing partial least square approaches in a gene- or region-based association study for multiple quantitative phenotypes. Hum Biol 2014; 86:51-8. [PMID: 25401986 DOI: 10.3378/027.086.0106] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/31/2013] [Indexed: 11/05/2022]
Abstract
On thinking quantitatively of complex diseases, there are at least three statistical strategies for association studies: one single-nucleotide polymorphism (SNP) on a single trait, gene or region (with multiple SNPs) on a single trait, and gene or region on multiple traits. The third approach is the most general in dissecting genetic mechanisms underlying complex diseases underpinning multiple quantitative traits. Gene or region association methods based on partial least square (PLS) approaches have been shown to have apparent power advantage. However, few approaches have been developed for multiple quantitative phenotypes or traits underlying a condition or disease, and the performance of various PLS approaches used in association studies for multiple quantitative traits have not been assessed. Here we exploit association between multiple SNPs and multiple phenotypes or traits, from a regression perspective, through exhaustive scan statistics (sliding window) using PLS and sparse PLS regressions. Simulations were conducted to assess the performance of the proposed scan statistics and compare them with existing methods. The proposed methods were applied to 12 regions of genome-wide association study data from the European Prospective Investigation of Cancer-Norfolk study.
Collapse
Affiliation(s)
- Zhongshang Yuan
- Department of Epidemiology and Biostatistics, School of Public Health, Shandong University, Shandong, China
| | - Xiaoshuai Zhang
- Department of Epidemiology and Biostatistics, School of Public Health, Shandong University, Shandong, China
| | - Fangyu Li
- Department of Epidemiology and Biostatistics, School of Public Health, Shandong University, Shandong, China
| | - Jinghua Zhao
- MRC Epidemiology Unit, Institute of Metabolic Science, University of Cambridge, Addenbrooke's Hospital, Cambridge, UK
| | - Fuzhong Xue
- Department of Epidemiology and Biostatistics, School of Public Health, Shandong University, Shandong, China
| |
Collapse
|
13
|
Fan R, Lo SH. A robust model-free approach for rare variants association studies incorporating gene-gene and gene-environmental interactions. PLoS One 2013; 8:e83057. [PMID: 24358248 PMCID: PMC3866272 DOI: 10.1371/journal.pone.0083057] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2013] [Accepted: 10/30/2013] [Indexed: 11/19/2022] Open
Abstract
Recently more and more evidence suggest that rare variants with much lower minor allele frequencies play significant roles in disease etiology. Advances in next-generation sequencing technologies will lead to many more rare variants association studies. Several statistical methods have been proposed to assess the effect of rare variants by aggregating information from multiple loci across a genetic region and testing the association between the phenotype and aggregated genotype. One limitation of existing methods is that they only look into the marginal effects of rare variants but do not systematically take into account effects due to interactions among rare variants and between rare variants and environmental factors. In this article, we propose the summation of partition approach (SPA), a robust model-free method that is designed specifically for detecting both marginal effects and effects due to gene-gene (G×G) and gene-environmental (G×E) interactions for rare variants association studies. SPA has three advantages. First, it accounts for the interaction information and gains considerable power in the presence of unknown and complicated G×G or G×E interactions. Secondly, it does not sacrifice the marginal detection power; in the situation when rare variants only have marginal effects it is comparable with the most competitive method in current literature. Thirdly, it is easy to extend and can incorporate more complex interactions; other practitioners and scientists can tailor the procedure to fit their own study friendly. Our simulation studies show that SPA is considerably more powerful than many existing methods in the presence of G×G and G×E interactions.
Collapse
Affiliation(s)
- Ruixue Fan
- Department of Statistics, Columbia University, New York, New York, United States of America
| | - Shaw-Hwa Lo
- Department of Statistics, Columbia University, New York, New York, United States of America
- * E-mail: (SHL)
| |
Collapse
|
14
|
Sapkota Y, Mackey JR, Lai R, Franco-Villalobos C, Lupichuk S, Robson PJ, Kopciuk K, Cass CE, Yasui Y, Damaraju S. Assessing SNP-SNP interactions among DNA repair, modification and metabolism related pathway genes in breast cancer susceptibility. PLoS One 2013; 8:e64896. [PMID: 23755158 PMCID: PMC3670937 DOI: 10.1371/journal.pone.0064896] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2013] [Accepted: 04/19/2013] [Indexed: 01/23/2023] Open
Abstract
Genome-wide association studies (GWASs) have identified low-penetrance common variants (i.e., single nucleotide polymorphisms, SNPs) associated with breast cancer susceptibility. Although GWASs are primarily focused on single-locus effects, gene-gene interactions (i.e., epistasis) are also assumed to contribute to the genetic risks for complex diseases including breast cancer. While it has been hypothesized that moderately ranked (P value based) weak single-locus effects in GWASs could potentially harbor valuable information for evaluating epistasis, we lack systematic efforts to investigate SNPs showing consistent associations with weak statistical significance across independent discovery and replication stages. The objectives of this study were i) to select SNPs showing single-locus effects with weak statistical significance for breast cancer in a GWAS and/or candidate-gene studies; ii) to replicate these SNPs in an independent set of breast cancer cases and controls; and iii) to explore their potential SNP-SNP interactions contributing to breast cancer susceptibility. A total of 17 SNPs related to DNA repair, modification and metabolism pathway genes were selected since these pathways offer a priori knowledge for potential epistatic interactions and an overall role in breast carcinogenesis. The study design included predominantly Caucasian women (2,795 cases and 4,505 controls) from Alberta, Canada. We observed two two-way SNP-SNP interactions (APEX1-rs1130409 and RPAP1-rs2297381; MLH1-rs1799977 and MDM2-rs769412) in logistic regression that conferred elevated risks for breast cancer (Pinteraction<7.3×10−3). Logic regression identified an interaction involving four SNPs (MBD2-rs4041245, MLH1-rs1799977, MDM2-rs769412, BRCA2-rs1799943) (Ppermutation = 2.4×10−3). SNPs involved in SNP-SNP interactions also showed single-locus effects with weak statistical significance, while BRCA2-rs1799943 showed stronger statistical significance (Pcorrelation/trend = 3.2×10−4) than the others. These single-locus effects were independent of body mass index. Our results provide a framework for evaluating SNPs showing statistically weak but reproducible single-locus effects for epistatic effects contributing to disease susceptibility.
Collapse
Affiliation(s)
- Yadav Sapkota
- Cross Cancer Institute, Alberta Health Services, Edmonton, Alberta, Canada
- Department of Laboratory Medicine and Pathology, University of Alberta, Edmonton, Alberta, Canada
| | - John R. Mackey
- Cross Cancer Institute, Alberta Health Services, Edmonton, Alberta, Canada
- Department of Oncology, University of Alberta, Edmonton, Alberta, Canada
| | - Raymond Lai
- Department of Laboratory Medicine and Pathology, University of Alberta, Edmonton, Alberta, Canada
| | | | - Sasha Lupichuk
- Department of Oncology, University of Calgary, Calgary, Alberta, Canada
- Tom Baker Cancer Centre, Alberta Health Services, Calgary, Alberta, Canada
| | - Paula J. Robson
- Department of Agricultural, Food and Nutritional Sciences, University of Alberta, Edmonton, Alberta, Canada
- Alberta Health Services – Cancer Care, Edmonton, Alberta, Canada
| | - Karen Kopciuk
- Department of Oncology, University of Calgary, Calgary, Alberta, Canada
- Department of Mathematics and Statistics, University of Calgary, Calgary, Alberta, Canada
| | - Carol E. Cass
- Cross Cancer Institute, Alberta Health Services, Edmonton, Alberta, Canada
- Department of Oncology, University of Alberta, Edmonton, Alberta, Canada
| | - Yutaka Yasui
- School of Public Health, University of Alberta, Edmonton, Alberta, Canada
| | - Sambasivarao Damaraju
- Cross Cancer Institute, Alberta Health Services, Edmonton, Alberta, Canada
- Department of Laboratory Medicine and Pathology, University of Alberta, Edmonton, Alberta, Canada
- * E-mail:
| |
Collapse
|
15
|
SNP set association analysis for genome-wide association studies. PLoS One 2013; 8:e62495. [PMID: 23658731 PMCID: PMC3643925 DOI: 10.1371/journal.pone.0062495] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2012] [Accepted: 03/22/2013] [Indexed: 11/29/2022] Open
Abstract
Genome-wide association study (GWAS) is a promising approach for identifying common genetic variants of the diseases on the basis of millions of single nucleotide polymorphisms (SNPs). In order to avoid low power caused by overmuch correction for multiple comparisons in single locus association study, some methods have been proposed by grouping SNPs together into a SNP set based on genomic features, then testing the joint effect of the SNP set. We compare the performances of principal component analysis (PCA), supervised principal component analysis (SPCA), kernel principal component analysis (KPCA), and sliced inverse regression (SIR). Simulated SNP sets are generated under scenarios of 0, 1 and ≥2 causal SNPs model. Our simulation results show that all of these methods can control the type I error at the nominal significance level. SPCA is always more powerful than the other methods at different settings of linkage disequilibrium structures and minor allele frequency of the simulated datasets. We also apply these four methods to a real GWAS of non-small cell lung cancer (NSCLC) in Han Chinese population
Collapse
|
16
|
A two-stage association study identifies methyl-CpG-binding domain protein 2 gene polymorphisms as candidates for breast cancer susceptibility. Eur J Hum Genet 2012; 20:682-9. [PMID: 22258532 PMCID: PMC3355265 DOI: 10.1038/ejhg.2011.273] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
Abstract
Genome-wide association studies for breast cancer have identified over 40 single-nucleotide polymorphisms (SNPs), a subset of which remains statistically significant after genome-wide correction. Improved strategies for mining of genome-wide association data have been suggested to address heritable component of genetic risk in breast cancer. In this study, we attempted a two-stage association design using markers from a genome-wide study (stage 1, Affymetrix Human SNP 6.0 array, cases=302, controls=321). We restricted our analysis to DNA repair/modifications/metabolism pathway related gene polymorphisms for their obvious role in carcinogenesis in general and for their known protein–protein interactions vis-à-vis, potential epistatic effects. We selected 22 SNPs based on linkage disequilibrium patterns and high statistical significance. Genotyping assays in an independent replication study of 1178 cases and 1314 controls were attempted using Sequenom iPLEX Gold platform (stage 2). Six SNPs (rs8094493, rs4041245, rs7614, rs13250873, rs1556459 and rs2297381) showed consistent and statistically significant associations with breast cancer risk in both stages, with allelic odds ratios (and P-values) of 0.85 (0.0021), 0.86 (0.0026), 0.86 (0.0041), 1.17 (0.0043), 1.20 (0.0103) and 1.13 (0.0154), respectively, in combined analysis (N=3115). Of these, three polymorphisms were located in methyl-CpG-binding domain protein 2 gene regions and were in strong linkage disequilibrium. The remaining three SNPs were in proximity to RAD21 homolog (S. pombe), O-6-methylguanine-DNA methyltransferase and RNA polymerase II-associated protein 1. The identified markers may be relevant to breast cancer susceptibility in populations if these findings are confirmed in independent cohorts.
Collapse
|
17
|
Stepwise Paring down Variation for Identifying Influential Multi-factor Interactions Related to a Continuous Response Variable. STATISTICS IN BIOSCIENCES 2011. [DOI: 10.1007/s12561-011-9045-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
|
18
|
Gao Q, He Y, Yuan Z, Zhao J, Zhang B, Xue F. Gene- or region-based association study via kernel principal component analysis. BMC Genet 2011; 12:75. [PMID: 21871061 PMCID: PMC3176196 DOI: 10.1186/1471-2156-12-75] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2011] [Accepted: 08/26/2011] [Indexed: 11/12/2022] Open
Abstract
Background In genetic association study, especially in GWAS, gene- or region-based methods have been more popular to detect the association between multiple SNPs and diseases (or traits). Kernel principal component analysis combined with logistic regression test (KPCA-LRT) has been successfully used in classifying gene expression data. Nevertheless, the purpose of association study is to detect the correlation between genetic variations and disease rather than to classify the sample, and the genomic data is categorical rather than numerical. Recently, although the kernel-based logistic regression model in association study has been proposed by projecting the nonlinear original SNPs data into a linear feature space, it is still impacted by multicolinearity between the projections, which may lead to loss of power. We, therefore, proposed a KPCA-LRT model to avoid the multicolinearity. Results Simulation results showed that KPCA-LRT was always more powerful than principal component analysis combined with logistic regression test (PCA-LRT) at different sample sizes, different significant levels and different relative risks, especially at the genewide level (1E-5) and lower relative risks (RR = 1.2, 1.3). Application to the four gene regions of rheumatoid arthritis (RA) data from Genetic Analysis Workshop16 (GAW16) indicated that KPCA-LRT had better performance than single-locus test and PCA-LRT. Conclusions KPCA-LRT is a valid and powerful gene- or region-based method for the analysis of GWAS data set, especially under lower relative risks and lower significant levels.
Collapse
Affiliation(s)
- Qingsong Gao
- Department of Epidemiology and Health Statistics, School of Public Health, Shandong University, Jinan 250012, China
| | | | | | | | | | | |
Collapse
|
19
|
Huang CH, Cong L, Xie J, Qiao B, Lo SH, Zheng T. Rheumatoid arthritis-associated gene-gene interaction network for rheumatoid arthritis candidate genes. BMC Proc 2009; 3 Suppl 7:S75. [PMID: 20018070 PMCID: PMC2795977 DOI: 10.1186/1753-6561-3-s7-s75] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Rheumatoid arthritis (RA, MIM 180300) is a chronic and complex autoimmune disease. Using the North American Rheumatoid Arthritis Consortium (NARAC) data set provided in Genetic Analysis Workshop 16 (GAW16), we used the genotype-trait distortion (GTD) scores and proposed analysis procedures to capture the gene-gene interaction effects of multiple susceptibility gene regions on RA. In this paper, we focused on 27 RA candidate gene regions (531 SNPs) based on a literature search. Statistical significance was evaluated using 1000 permutations. HLADRB1 was found to have strong marginal association with RA. We identified 14 significant interactions (p < 0.01), which were aggregated into an association network among 12 selected candidate genes PADI4, FCGR3, TNFRSF1B, ITGAV, BTLA, SLC22A4, IL3, VEGF, TNF, NFKBIL1, TRAF1-C5, and MIF. Based on our and other contributors' findings during the GAW16 conference, we further studied 24 candidate regions with 336 SNPs. We found 23 significant interactions (p-value < 0.01), nine interactions in addition to our initial findings, and the association network was extended to include candidate genes HLA-A, HLA-B, HLA-C, CTLA4, and IL6. As we will discuss in this paper, the reported possible interactions between genes may suggest potential biological activities of RA.
Collapse
Affiliation(s)
- Chien-Hsun Huang
- Department of Statistics, Columbia University, 1255 Amsterdam Avenue, 10th Floor, MC44690, New York, New York 10027, USA.
| | | | | | | | | | | |
Collapse
|
20
|
Qiao B, Huang CH, Cong L, Xie J, Lo SH, Zheng T. Genome-wide gene-based analysis of rheumatoid arthritis-associated interaction with PTPN22 and HLA-DRB1. BMC Proc 2009; 3 Suppl 7:S132. [PMID: 20017999 PMCID: PMC2795906 DOI: 10.1186/1753-6561-3-s7-s132] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
The genes PTPN22 and HLA-DRB1 have been found by a number of studies to confer an increased risk for rheumatoid arthritis (RA), which indicates that both genes play an important role in RA etiology. It is believed that they not only have strong association with RA individually, but also interact with other related genes that have not been found to have predisposing RA mutations. In this paper, we conduct genome-wide searches for RA-associated gene-gene interactions that involve PTPN22 or HLA-DRB1 using the Genetic Analysis Workshop 16 Problem 1 data from the North American Rheumatoid Arthritis Consortium. MGC13017, HSPCAL3, MIA, PTPNS1L, and IGLVI-70, which showed association with RA in previous studies, have been confirmed in our analysis.
Collapse
Affiliation(s)
- Bo Qiao
- Department of Statistics, Columbia University, 1255 Amsterdam Avenue, 10th Floor, MC4690, New York, New York 10027, USA.
| | | | | | | | | | | |
Collapse
|
21
|
Chernoff H, Lo SH, Zheng T. Discovering influential variables: A method of partitions. Ann Appl Stat 2009. [DOI: 10.1214/09-aoas265] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
22
|
Kayano M, Takigawa I, Shiga M, Tsuda K, Mamitsuka H. Efficiently finding genome-wide three-way gene interactions from transcript- and genotype-data. ACTA ACUST UNITED AC 2009; 25:2735-43. [PMID: 19736252 PMCID: PMC2781753 DOI: 10.1093/bioinformatics/btp531] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Motivation: We address the issue of finding a three-way gene interaction, i.e. two interacting genes in expression under the genotypes of another gene, given a dataset in which expressions and genotypes are measured at once for each individual. This issue can be a general, switching mechanism in expression of two genes, being controlled by categories of another gene, and finding this type of interaction can be a key to elucidating complex biological systems. The most suitable method for this issue is likelihood ratio test using logistic regressions, which we call interaction test, but a serious problem of this test is computational intractability at a genome-wide level. Results: We developed a fast method for this issue which improves the speed of interaction test by around 10 times for any size of datasets, keeping highly interacting genes with an accuracy of ∼85%. We applied our method to ∼3 × 108 three-way combinations generated from a dataset on human brain samples and detected three-way gene interactions with small P-values. To check the reliability of our results, we first conducted permutations by which we can show that the obtained P-values are significantly smaller than those obtained from permuted null examples. We then used GEO (Gene Expression Omnibus) to generate gene expression datasets with binary classes to confirm the detected three-way interactions by using these datasets and interaction tests. The result showed us some datasets with significantly small P-values, strongly supporting the reliability of the detected three-way interactions. Availability: Software is available from http://www.bic.kyoto-u.ac.jp/pathway/kayano/bioinfo_three-way.html Contact:kayano@kuicr.kyoto-u.ac.jp Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mitsunori Kayano
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji 611-0011, Japan.
| | | | | | | | | |
Collapse
|