1
|
Schuurmans IK, Mulder RH, Baltramonaityte V, Lahtinen A, Qiuyu F, Rothmann LM, Staginnus M, Tuulari J, Burt SA, Buss C, Craig JM, Donald KA, Felix JF, Freeman TP, Grassi-Oliveira R, Huels A, Hyde LW, Jones SA, Karlsson H, Karlsson L, Koen N, Lawn W, Mitchell C, Monk CS, Mooney MA, Muetzel R, Nigg JT, Belangero SIN, Notterman D, O'Connor T, O'Donnell KJ, Pan PM, Paunio T, Ryabinin P, Saffery R, Salum GA, Seal M, Silk TJ, Stein DJ, Zar H, Walton E, Cecil CAM. Consortium Profile: The Methylation, Imaging and NeuroDevelopment (MIND) Consortium. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.06.23.24309353. [PMID: 38978656 PMCID: PMC11230303 DOI: 10.1101/2024.06.23.24309353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Epigenetic processes, such as DNA methylation, show potential as biological markers and mechanisms underlying gene-environment interplay in the prediction of mental health and other brain-based phenotypes. However, little is known about how peripheral epigenetic patterns relate to individual differences in the brain itself. An increasingly popular approach to address this is by combining epigenetic and neuroimaging data; yet, research in this area is almost entirely comprised of cross-sectional studies in adults. To bridge this gap, we established the Methylation, Imaging and NeuroDevelopment (MIND) Consortium, which aims to bring a developmental focus to the emerging field of Neuroimaging Epigenetics by (i) promoting collaborative, adequately powered developmental research via multi-cohort analyses; (ii) increasing scientific rigor through the establishment of shared pipelines and open science practices; and (iii) advancing our understanding of DNA methylation-brain dynamics at different developmental periods (from birth to emerging adulthood), by leveraging data from prospective, longitudinal pediatric studies. MIND currently integrates 15 cohorts worldwide, comprising (repeated) measures of DNA methylation in peripheral tissues (blood, buccal cells, and saliva) and neuroimaging by magnetic resonance imaging across up to five time points over a period of up to 21 years (Npooled DNAm = 11,299; Npooled neuroimaging = 10,133; Npooled combined = 4,914). By triangulating associations across multiple developmental time points and study types, we hope to generate new insights into the dynamic relationships between peripheral DNA methylation and the brain, and how these ultimately relate to neurodevelopmental and psychiatric phenotypes.
Collapse
|
2
|
Shorey-Kendrick LE, Davis B, Gao L, Park B, Vu A, Morris CD, Breton CV, Fry R, Garcia E, Schmidt RJ, O’Shea TM, Tepper RS, McEvoy CT, Spindel ER. Development and Validation of a Novel Placental DNA Methylation Biomarker of Maternal Smoking during Pregnancy in the ECHO Program. ENVIRONMENTAL HEALTH PERSPECTIVES 2024; 132:67005. [PMID: 38885141 PMCID: PMC11218700 DOI: 10.1289/ehp13838] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Revised: 02/27/2024] [Accepted: 05/22/2024] [Indexed: 06/20/2024]
Abstract
BACKGROUND Maternal cigarette smoking during pregnancy (MSDP) is associated with numerous adverse health outcomes in infants and children with potential lifelong consequences. Negative effects of MSDP on placental DNA methylation (DNAm), placental structure, and function are well established. OBJECTIVE Our aim was to develop biomarkers of MSDP using DNAm measured in placentas (N = 96 ), collected as part of the Vitamin C to Decrease the Effects of Smoking in Pregnancy on Infant Lung Function double-blind, placebo-controlled randomized clinical trial conducted between 2012 and 2016. We also aimed to develop a digital polymerase chain reaction (PCR) assay for the top ranking cytosine-guanine dinucleotide (CpG) so that large numbers of samples can be screened for exposure at low cost. METHODS We compared the ability of four machine learning methods [logistic least absolute shrinkage and selection operator (LASSO) regression, logistic elastic net regression, random forest, and gradient boosting machine] to classify MSDP based on placental DNAm signatures. We developed separate models using the complete EPIC array dataset and on the subset of probes also found on the 450K array so that models exist for both platforms. For comparison, we developed a model using CpGs previously associated with MSDP in placenta. For each final model, we used model coefficients and normalized beta values to calculate placental smoking index (PSI) scores for each sample. Final models were validated in two external datasets: the Extremely Low Gestational Age Newborn observational study, N = 426 ; and the Rhode Island Children's Health Study, N = 237 . RESULTS Logistic LASSO regression demonstrated the highest performance in cross-validation testing with the lowest number of input CpGs. Accuracy was greatest in external datasets when using models developed for the same platform. PSI scores in smokers only (n = 72 ) were moderately correlated with maternal plasma cotinine levels. One CpG (cg27402634), with the largest coefficient in two models, was measured accurately by digital PCR compared with measurement by EPIC array (R 2 = 0.98 ). DISCUSSION To our knowledge, we have developed the first placental DNAm-based biomarkers of MSDP with broad utility to studies of prenatal disease origins. https://doi.org/10.1289/EHP13838.
Collapse
Affiliation(s)
- Lyndsey E. Shorey-Kendrick
- Division of Neuroscience, Oregon National Primate Research Center, Oregon Health & Science University, Beaverton, Oregon, USA
| | - Brett Davis
- Department of Medicine, Knight Cardiovascular Institute, Oregon Health & Science University, Portland, Oregon, USA
| | - Lina Gao
- Biostatistics Shared Resources, Knight Cancer Institute, Oregon Health & Science University, Portland, Oregon, USA
- Bioinformatics & Biostatistics Core, Oregon National Primate Research Center, Oregon Health & Science University, Portland, Oregon, USA
| | - Byung Park
- Biostatistics Shared Resources, Knight Cancer Institute, Oregon Health & Science University, Portland, Oregon, USA
- Bioinformatics & Biostatistics Core, Oregon National Primate Research Center, Oregon Health & Science University, Portland, Oregon, USA
- Oregon Health & Science University–Portland State University School of Public Health, Portland, Oregon, USA
| | - Annette Vu
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, USA
| | - Cynthia D. Morris
- Department of Medical Informatics and Clinical Epidemiology, Oregon Health & Science University, Portland, Oregon, USA
- Oregon Clinical and Translational Research Institute, Oregon Health & Science University, Portland, Oregon, USA
| | - Carrie V. Breton
- Department of Population and Public Health Sciences, University of Southern California, Los Angeles, California, USA
| | - Rebecca Fry
- Department of Environmental Sciences and Engineering, UNC Gillings School of Public Health, Chapel Hill, North Carolina, USA
| | - Erika Garcia
- Department of Population and Public Health Sciences, University of Southern California, Los Angeles, California, USA
| | - Rebecca J. Schmidt
- Department of Public Health Sciences, School of Medicine, University of California, Davis, Davis, California, USA
- MIND Institute, School of Medicine, University of California Davis, Davis, California, USA
| | - T. Michael O’Shea
- Department of Pediatrics, University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA
| | - Robert S. Tepper
- Department of Pediatrics, Herman B Wells Center for Pediatric Research, Indiana University School of Medicine, Indianapolis, Indiana, USA
| | - Cindy T. McEvoy
- Department of Pediatrics, Oregon Health & Science University, Portland, Oregon, USA
| | - Eliot R. Spindel
- Division of Neuroscience, Oregon National Primate Research Center, Oregon Health & Science University, Beaverton, Oregon, USA
| | | |
Collapse
|
3
|
Aguilar-Lacasaña S, Fontes Marques I, de Castro M, Dadvand P, Escribà X, Fossati S, González JR, Nieuwenhuijsen M, Alfano R, Annesi-Maesano I, Brescianini S, Burrows K, Calas L, Elhakeem A, Heude B, Hough A, Isaevska E, W V Jaddoe V, Lawlor DA, Monaghan G, Nawrot T, Plusquin M, Richiardi L, Watmuff A, Yang TC, Vrijheid M, F Felix J, Bustamante M. Green space exposure and blood DNA methylation at birth and in childhood - A multi-cohort study. ENVIRONMENT INTERNATIONAL 2024; 188:108684. [PMID: 38776651 DOI: 10.1016/j.envint.2024.108684] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 03/21/2024] [Accepted: 04/21/2024] [Indexed: 05/25/2024]
Abstract
Green space exposure has been associated with improved mental, physical and general health. However, the underlying biological mechanisms remain largely unknown. The aim of this study was to investigate the association between green space exposure and cord and child blood DNA methylation. Data from eight European birth cohorts with a total of 2,988 newborns and 1,849 children were used. Two indicators of residential green space exposure were assessed: (i) surrounding greenness (satellite-based Normalized Difference Vegetation Index (NDVI) in buffers of 100 m and 300 m) and (ii) proximity to green space (having a green space ≥ 5,000 m2 within a distance of 300 m). For these indicators we assessed two exposure windows: (i) pregnancy, and (ii) the period from pregnancy to child blood DNA methylation assessment, named as cumulative exposure. DNA methylation was measured with the Illumina 450K or EPIC arrays. To identify differentially methylated positions (DMPs) we fitted robust linear regression models between pregnancy green space exposure and cord blood DNA methylation and between cumulative green space exposure and child blood DNA methylation. Two sensitivity analyses were conducted: (i) without adjusting for cellular composition, and (ii) adjusting for air pollution. Cohort results were combined through fixed-effect inverse variance weighted meta-analyses. Differentially methylated regions (DMRs) were identified from meta-analysed results using the Enmix-combp and DMRcate methods. There was no statistical evidence of pregnancy or cumulative exposures associating with any DMP (False Discovery Rate, FDR, p-value < 0.05). However, surrounding greenness exposure was inversely associated with four DMRs (three in cord blood and one in child blood) annotated to ADAMTS2, KCNQ1DN, SLC6A12 and SDK1 genes. Results did not change substantially in the sensitivity analyses. Overall, we found little evidence of the association between green space exposure and blood DNA methylation. Although we identified associations between surrounding greenness exposure with four DMRs, these findings require replication.
Collapse
Affiliation(s)
- Sofia Aguilar-Lacasaña
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública, Spain; Universitat de Barcelona, Barcelona, Spain.
| | - Irene Fontes Marques
- Generation R Study Group, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands; Department of Pediatrics, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands
| | - Montserrat de Castro
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública, Spain
| | - Payam Dadvand
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública, Spain
| | - Xavier Escribà
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública, Spain
| | - Serena Fossati
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública, Spain
| | - Juan R González
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública, Spain
| | - Mark Nieuwenhuijsen
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública, Spain
| | - Rossella Alfano
- Centre for Environmental Sciences, Hasselt University, Agoralaan Building D, 3590, Diepenbeek, Belgium
| | - Isabella Annesi-Maesano
- Desbrest Institute of Epidemiology and Public Health (IDESP), Montpellier University and Inserm, Montpellier, Service des Maladies Allergiques et Respiratoires, CHU, Montpellier, France
| | - Sonia Brescianini
- Centre for Behavioural Science and Mental Health, Istituto Superiore di Sanità, Rome, Italy
| | - Kimberley Burrows
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Lucinda Calas
- Université Paris Cité and Université Sorbonne Paris Nord, Inserm, INRAE, Center for Research in Epidemiology and StatisticS (CRESS), F-75004 Paris, France
| | - Ahmed Elhakeem
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Barbara Heude
- Université Paris Cité and Université Sorbonne Paris Nord, Inserm, INRAE, Center for Research in Epidemiology and StatisticS (CRESS), F-75004 Paris, France
| | - Amy Hough
- Born in Bradford, Wolfson Centre for Applied Health Research, Bradford Royal Infirmary, Bradford, UK
| | - Elena Isaevska
- Department of Pediatrics, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands; Department of Medical Sciences, University of Turin, CPO-Piemonte, Turin, Italy
| | - Vincent W V Jaddoe
- Generation R Study Group, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands; Department of Pediatrics, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands
| | - Deborah A Lawlor
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK; Medical Research Council Integrative Epidemiology Unit at the University of Bristol, Bristol, UK
| | - Genevieve Monaghan
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Tim Nawrot
- Centre for Environmental Sciences, Hasselt University, Agoralaan Building D, 3590, Diepenbeek, Belgium; Department of Public Health, Leuven University (KU Leuven), Leuven, Belgium
| | - Michelle Plusquin
- Centre for Environmental Sciences, Hasselt University, Agoralaan Building D, 3590, Diepenbeek, Belgium
| | - Lorenzo Richiardi
- Department of Medical Sciences, University of Turin, CPO-Piemonte, Turin, Italy
| | - Aidan Watmuff
- Born in Bradford, Wolfson Centre for Applied Health Research, Bradford Royal Infirmary, Bradford, UK
| | - Tiffany C Yang
- Bradford Institute for Health Research, Bradford Teaching Hospitals NHS Foundation Trust, UK
| | - Martine Vrijheid
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública, Spain
| | - Janine F Felix
- Generation R Study Group, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands; Department of Pediatrics, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands
| | - Mariona Bustamante
- ISGlobal, Barcelona, Spain; Universitat Pompeu Fabra (UPF), Barcelona, Spain; CIBER Epidemiología y Salud Pública, Spain
| |
Collapse
|
4
|
Stark L, Kasajima A, Stögbauer F, Schmidl B, Rinecker J, Holzmann K, Färber S, Pfarr N, Steiger K, Wollenberg B, Ruland J, Winter C, Wirth M. Head and neck cancer of unknown primary: unveiling primary tumor sites through machine learning on DNA methylation profiles. Clin Epigenetics 2024; 16:47. [PMID: 38528631 PMCID: PMC10964705 DOI: 10.1186/s13148-024-01657-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Accepted: 03/13/2024] [Indexed: 03/27/2024] Open
Abstract
BACKGROUND The unknown tissue of origin in head and neck cancer of unknown primary (hnCUP) leads to invasive diagnostic procedures and unspecific and potentially inefficient treatment options for patients. The most common histologic subtype, squamous cell carcinoma, can stem from various tumor primary sites, including the oral cavity, oropharynx, larynx, head and neck skin, lungs, and esophagus. DNA methylation profiles are highly tissue-specific and have been successfully used to classify tissue origin. We therefore developed a support vector machine (SVM) classifier trained with publicly available DNA methylation profiles of commonly cervically metastasizing squamous cell carcinomas (n = 1103) in order to identify the primary tissue of origin of our own cohort of squamous cell hnCUP patient's samples (n = 28). Methylation analysis was performed with Infinium MethylationEPIC v1.0 BeadChip by Illumina. RESULTS The SVM algorithm achieved the highest overall accuracy of tested classifiers, with 87%. Squamous cell hnCUP samples on DNA methylation level resembled squamous cell carcinomas commonly metastasizing into cervical lymph nodes. The most frequently predicted cancer localization was the oral cavity in 11 cases (39%), followed by the oropharynx and larynx (both 7, 25%), skin (2, 7%), and esophagus (1, 4%). These frequencies concord with the expected distribution of lymph node metastases in epidemiological studies. CONCLUSIONS On DNA methylation level, hnCUP is comparable to primary tumor tissue cancer types that commonly metastasize to cervical lymph nodes. Our SVM-based classifier can accurately predict these cancers' tissues of origin and could significantly reduce the invasiveness of hnCUP diagnostics and enable a more precise therapy after clinical validation.
Collapse
Affiliation(s)
- Leonhard Stark
- Department of Otolaryngology, Head and Neck Surgery, School of Medicine and Health, Technical University of Munich, Munich, Germany.
- Institute of Clinical Chemistry and Pathobiochemistry, School of Medicine and Health, Technical University of Munich, Munich, Germany.
| | - Atsuko Kasajima
- Institute of Pathology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Fabian Stögbauer
- Institute of Pathology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Benedikt Schmidl
- Department of Otolaryngology, Head and Neck Surgery, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Jakob Rinecker
- Department of Otolaryngology, Head and Neck Surgery, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Katharina Holzmann
- Department of Otolaryngology, Head and Neck Surgery, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Sarah Färber
- Department of Otolaryngology, Head and Neck Surgery, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Nicole Pfarr
- Institute of Pathology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Katja Steiger
- Institute of Pathology, School of Medicine and Health, Technical University of Munich, Munich, Germany
| | - Barbara Wollenberg
- Department of Otolaryngology, Head and Neck Surgery, School of Medicine and Health, Technical University of Munich, Munich, Germany
- Partner Site Munich and German Cancer Research Center (DKFZ), German Cancer Consortium (DKTK), Heidelberg, Germany
| | - Jürgen Ruland
- Institute of Clinical Chemistry and Pathobiochemistry, School of Medicine and Health, Technical University of Munich, Munich, Germany
- Center for Translational Cancer Research, TranslaTUM, Technical University of Munich, Munich, Germany
- Partner Site Munich and German Cancer Research Center (DKFZ), German Cancer Consortium (DKTK), Heidelberg, Germany
| | - Christof Winter
- Institute of Clinical Chemistry and Pathobiochemistry, School of Medicine and Health, Technical University of Munich, Munich, Germany
- Center for Translational Cancer Research, TranslaTUM, Technical University of Munich, Munich, Germany
- Partner Site Munich and German Cancer Research Center (DKFZ), German Cancer Consortium (DKTK), Heidelberg, Germany
| | - Markus Wirth
- Department of Otolaryngology, Head and Neck Surgery, School of Medicine and Health, Technical University of Munich, Munich, Germany
- Partner Site Munich and German Cancer Research Center (DKFZ), German Cancer Consortium (DKTK), Heidelberg, Germany
| |
Collapse
|
5
|
Giuili E, Grolaux R, Macedo CZNM, Desmyter L, Pichon B, Neuens S, Vilain C, Olsen C, Van Dooren S, Smits G, Defrance M. Comprehensive evaluation of the implementation of episignatures for diagnosis of neurodevelopmental disorders (NDDs). Hum Genet 2023; 142:1721-1735. [PMID: 37889307 PMCID: PMC10676303 DOI: 10.1007/s00439-023-02609-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Accepted: 10/10/2023] [Indexed: 10/28/2023]
Abstract
Episignatures are popular tools for the diagnosis of rare neurodevelopmental disorders. They are commonly based on a set of differentially methylated CpGs used in combination with a support vector machine model. DNA methylation (DNAm) data often include missing values due to changes in data generation technology and batch effects. While many normalization methods exist for DNAm data, their impact on episignature performance have never been assessed. In addition, technologies to quantify DNAm evolve quickly and this may lead to poor transposition of existing episignatures generated on deprecated array versions to new ones. Indeed, probe removal between array versions, technologies or during preprocessing leads to missing values. Thus, the effect of missing data on episignature performance must also be carefully evaluated and addressed through imputation or an innovative approach to episignatures design. In this paper, we used data from patients suffering from Kabuki and Sotos syndrome to evaluate the influence of normalization methods, classification models and missing data on the prediction performances of two existing episignatures. We compare how six popular normalization methods for methylarray data affect episignature classification performances in Kabuki and Sotos syndromes and provide best practice suggestions when building new episignatures. In this setting, we show that Illumina, Noob or Funnorm normalization methods achieved higher classification performances on the testing sets compared to Quantile, Raw and Swan normalization methods. We further show that penalized logistic regression and support vector machines perform best in the classification of Kabuki and Sotos syndrome patients. Then, we describe a new paradigm to build episignatures based on the detection of differentially methylated regions (DMRs) and evaluate their performance compared to classical differentially methylated cytosines (DMCs)-based episignatures in the presence of missing data. We show that the performance of classical DMC-based episignatures suffers from the presence of missing data more than the DMR-based approach. We present a comprehensive evaluation of how the normalization of DNA methylation data affects episignature performance, using three popular classification models. We further evaluate how missing data affect those models' predictions. Finally, we propose a novel methodology to develop episignatures based on differentially methylated regions identification and show how this method slightly outperforms classical episignatures in the presence of missing data.
Collapse
Affiliation(s)
- Edoardo Giuili
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, Brussels, Belgium
| | - Robin Grolaux
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, Brussels, Belgium
| | - Catarina Z N M Macedo
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, Brussels, Belgium
| | - Laurence Desmyter
- Center for Human Genetics, Hôpital Erasme, Hôpital Universitaire de Bruxelles, Université Libre de Bruxelles, Brussels, Belgium
| | - Bruno Pichon
- Center for Human Genetics, Hôpital Erasme, Hôpital Universitaire de Bruxelles, Université Libre de Bruxelles, Brussels, Belgium
| | - Sebastian Neuens
- Center for Human Genetics, Hôpital Erasme, Hôpital Universitaire de Bruxelles, Université Libre de Bruxelles, Brussels, Belgium
- Department of Genetics, Hôpital Universitaire Des Enfants Reine Fabiola, Hôpital Universitaire de Bruxelles, Université Libre de Bruxelles, Brussels, Belgium
| | - Catheline Vilain
- Center for Human Genetics, Hôpital Erasme, Hôpital Universitaire de Bruxelles, Université Libre de Bruxelles, Brussels, Belgium
- Department of Genetics, Hôpital Universitaire Des Enfants Reine Fabiola, Hôpital Universitaire de Bruxelles, Université Libre de Bruxelles, Brussels, Belgium
| | - Catharina Olsen
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, Brussels, Belgium
- Clinical Sciences, Research Group Reproduction and Genetics, Brussels Interuniversity Genomics High Throughput Core (BRIGHTcore), Vrije Universiteit Brussel (VUB), Universitair Ziekenhuis Brussel (UZ Brussel), Brussels, Belgium
- Clinical Sciences, Research Group Reproduction and Genetics, Centre for Medical Genetics, Vrije Universiteit Brussel (VUB), Universitair Ziekenhuis Brussel (UZ Brussel), Brussels, Belgium
| | - Sonia Van Dooren
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, Brussels, Belgium
- Clinical Sciences, Research Group Reproduction and Genetics, Brussels Interuniversity Genomics High Throughput Core (BRIGHTcore), Vrije Universiteit Brussel (VUB), Universitair Ziekenhuis Brussel (UZ Brussel), Brussels, Belgium
- Clinical Sciences, Research Group Reproduction and Genetics, Centre for Medical Genetics, Vrije Universiteit Brussel (VUB), Universitair Ziekenhuis Brussel (UZ Brussel), Brussels, Belgium
| | - Guillaume Smits
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, Brussels, Belgium
- Center for Human Genetics, Hôpital Erasme, Hôpital Universitaire de Bruxelles, Université Libre de Bruxelles, Brussels, Belgium
- Department of Genetics, Hôpital Universitaire Des Enfants Reine Fabiola, Hôpital Universitaire de Bruxelles, Université Libre de Bruxelles, Brussels, Belgium
| | - Matthieu Defrance
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, Brussels, Belgium.
| |
Collapse
|
6
|
Bulka CM, Everson TM, Burt AA, Marsit CJ, Karagas MR, Boyle KE, Niemiec S, Kechris K, Davidson EJ, Yang IV, Feinberg JI, Volk HE, Ladd-Acosta C, Breton CV, O’Shea TM, Fry RC. Sex-based differences in placental DNA methylation profiles related to gestational age: an NIH ECHO meta-analysis. Epigenetics 2023; 18:2179726. [PMID: 36840948 PMCID: PMC9980626 DOI: 10.1080/15592294.2023.2179726] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 12/08/2022] [Accepted: 12/21/2022] [Indexed: 02/26/2023] Open
Abstract
The placenta undergoes many changes throughout gestation to support the evolving needs of the foetus. There is also a growing appreciation that male and female foetuses develop differently in utero, with unique epigenetic changes in placental tissue. Here, we report meta-analysed sex-specific associations between gestational age and placental DNA methylation from four cohorts in the National Institutes of Health (NIH) Environmental influences on Child Health Outcomes (ECHO) Programme (355 females/419 males, gestational ages 23-42 weeks). We identified 407 cytosine-guanine dinucleotides (CpGs) in females and 794 in males where placental methylation levels were associated with gestational age. After cell-type adjustment, 55 CpGs in females and 826 in males were significant. These were enriched for biological processes critical to the immune system in females and transmembrane transport in males. Our findings are distinct between the sexes: in females, associations with gestational age are largely explained by differences in placental cellular composition, whereas in males, gestational age is directly associated with numerous alterations in methylation levels.
Collapse
Affiliation(s)
- Catherine M. Bulka
- Department of Environmental Sciences and Engineering, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- College of Public Health, University of South Florida, Tampa, FL, USA
| | - Todd M. Everson
- Gangarosa Department of Environmental Health, Emory University Rollins School of Public Health, Atlanta, GA, USA
| | - Amber A. Burt
- Gangarosa Department of Environmental Health, Emory University Rollins School of Public Health, Atlanta, GA, USA
| | - Carmen J. Marsit
- Gangarosa Department of Environmental Health, Emory University Rollins School of Public Health, Atlanta, GA, USA
| | - Margaret R. Karagas
- Department of Epidemiology, Geisel School of Medicine at Dartmouth, Hanover, NH, USA
| | - Kristen E. Boyle
- Section of Nutrition, Department of Pediatrics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
- Colorado School of Public Health, The Lifecourse Epidemiology of Adiposity and Diabetes (LEAD) Center, Aurora, CO, USA
| | - Sierra Niemiec
- Colorado School of Public Health, The Lifecourse Epidemiology of Adiposity and Diabetes (LEAD) Center, Aurora, CO, USA
| | - Katerina Kechris
- Colorado School of Public Health, The Lifecourse Epidemiology of Adiposity and Diabetes (LEAD) Center, Aurora, CO, USA
- Department of Biostatistics & Informatics, Colorado School of Public Health, Aurora, CO, USA
| | | | - Ivana V. Yang
- Colorado School of Public Health, The Lifecourse Epidemiology of Adiposity and Diabetes (LEAD) Center, Aurora, CO, USA
- Department of Medicine, University of Colorado School of Medicine, Aurora, CO, USA
| | - Jason I. Feinberg
- Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, ML, USA
| | - Heather E. Volk
- Department of Mental Health, Johns Hopkins Bloomberg School of Public Health, Baltimore, ML, USA
| | - Christine Ladd-Acosta
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, ML, USA
| | - Carrie V. Breton
- Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA, USA
| | - T. Michael O’Shea
- Department of Pediatrics, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Rebecca C. Fry
- Department of Environmental Sciences and Engineering, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Institute for Environmental Health Solutions, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Curriculum in Toxicology and Environmental Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| |
Collapse
|
7
|
Augustine J, Jereesh AS. Identification of gene-level methylation for disease prediction. Interdiscip Sci 2023; 15:678-695. [PMID: 37603212 DOI: 10.1007/s12539-023-00584-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Revised: 07/30/2023] [Accepted: 08/01/2023] [Indexed: 08/22/2023]
Abstract
DNA methylation is an epigenetic alteration that plays a fundamental part in governing gene regulatory processes. The DNA methylation mechanism affixes methyl groups to distinct cytosine residues, influencing chromatin architectures. Multiple studies have demonstrated that DNA methylation's regulatory effect on genes is linked to the beginning and progression of several disorders. Researchers have recently uncovered thousands of phenotype-related methylation sites through the epigenome-wide association study (EWAS). However, combining the methylation levels of several sites within a gene and determining the gene-level DNA methylation remains challenging. In this study, we proposed the supervised UMAP Assisted Gene-level Methylation method (sUAGM) for disease prediction based on supervised UMAP (Uniform Manifold Approximation and Projection), a manifold learning-based method for reducing dimensionality. The methylation values at the gene level generated using the proposed method are evaluated by employing various feature selection and classification algorithms on three distinct DNA methylation datasets derived from blood samples. The performance has been assessed employing classification accuracy, F-1 score, Mathews Correlation Coefficient (MCC), Kappa, Classification Success Index (CSI) and Jaccard Index. The Support Vector Machine with the linear kernel (SVML) classifier with Recursive Feature Elimination (RFE) performs best across all three datasets. From comparative analysis, our method outperformed existing gene-level and site-level approaches by achieving 100% accuracy and F1-score with fewer genes. The functional analysis of the top 28 genes selected from the Parkinson's disease dataset revealed a significant association with the disease.
Collapse
Affiliation(s)
- Jisha Augustine
- Bioinformatics Lab, Department of Computer Science, Cochin University of Science and Technology, Cochin, Kerala, 682022, India.
| | - A S Jereesh
- Bioinformatics Lab, Department of Computer Science, Cochin University of Science and Technology, Cochin, Kerala, 682022, India
| |
Collapse
|
8
|
Van Asselt AJ, Beck JJ, Finnicum CT, Johnson BN, Kallsen N, Hottenga JJ, de Geus EJC, Boomsma DI, Ehli EA, van Dongen J. Genome-Wide DNA Methylation Profiles in Whole-Blood and Buccal Samples-Cross-Sectional, Longitudinal, and across Platforms. Int J Mol Sci 2023; 24:14640. [PMID: 37834090 PMCID: PMC10572275 DOI: 10.3390/ijms241914640] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 09/22/2023] [Accepted: 09/24/2023] [Indexed: 10/15/2023] Open
Abstract
The field of DNA methylation research is rapidly evolving, focusing on disease and phenotype changes over time using methylation measurements from diverse tissue sources and multiple array platforms. Consequently, identifying the extent of longitudinal, inter-tissue, and inter-platform variation in DNA methylation is crucial for future advancement. DNA methylation was measured in 375 individuals, with 197 of those having 2 blood sample measurements ~10 years apart. Whole-blood samples were measured on Illumina Infinium 450K and EPIC methylation arrays, and buccal samples from a subset of 58 participants were measured on EPIC array. The data were analyzed with the aims to examine the correlation between methylation levels in longitudinal blood samples in 197 individuals, examine the correlation between methylation levels in the blood and buccal samples in 58 individuals, and examine the correlation between blood methylation profiles assessed on the EPIC and 450K arrays in 83 individuals. We identified 136,833, 7674, and 96,891 CpGs significantly and strongly correlated (>0.50) longitudinally, across blood and buccal samples as well as array platforms, respectively. A total of 3674 of these CpGs were shared across all three sets. Analysis of these shared CpGs identified previously found associations with aging, ancestry, and 7016 mQTLs as well.
Collapse
Affiliation(s)
- Austin J. Van Asselt
- Avera McKennan Hospital, University Health Center, Sioux Falls, SD 57105, USA; (A.J.V.A.)
- Department of Biological Psychology, Vrije Universiteit, 1081 HV Amsterdam, The Netherlands
| | - Jeffrey J. Beck
- Avera McKennan Hospital, University Health Center, Sioux Falls, SD 57105, USA; (A.J.V.A.)
| | - Casey T. Finnicum
- Avera McKennan Hospital, University Health Center, Sioux Falls, SD 57105, USA; (A.J.V.A.)
| | - Brandon N. Johnson
- Avera McKennan Hospital, University Health Center, Sioux Falls, SD 57105, USA; (A.J.V.A.)
| | - Noah Kallsen
- Avera McKennan Hospital, University Health Center, Sioux Falls, SD 57105, USA; (A.J.V.A.)
| | - Jouke Jan Hottenga
- Department of Biological Psychology, Vrije Universiteit, 1081 HV Amsterdam, The Netherlands
| | - Eco J. C. de Geus
- Department of Biological Psychology, Vrije Universiteit, 1081 HV Amsterdam, The Netherlands
| | | | - Dorret I. Boomsma
- Department of Biological Psychology, Vrije Universiteit, 1081 HV Amsterdam, The Netherlands
| | - Erik A. Ehli
- Avera McKennan Hospital, University Health Center, Sioux Falls, SD 57105, USA; (A.J.V.A.)
| | - Jenny van Dongen
- Department of Biological Psychology, Vrije Universiteit, 1081 HV Amsterdam, The Netherlands
| |
Collapse
|
9
|
Fernandez-Jimenez N, Fore R, Cilleros-Portet A, Lepeule J, Perron P, Kvist T, Tian FY, Lesseur C, Binder AM, Lozano M, Martorell-Marugán J, Loke YJ, Bakulski KM, Zhu Y, Forhan A, Sammallahti S, Everson TM, Chen J, Michels KB, Belmonte T, Carmona-Sáez P, Halliday J, Daniele Fallin M, LaSalle JM, Tost J, Czamara D, Fernández MF, Gómez-Martín A, Craig JM, Gonzalez-Alzaga B, Schmidt RJ, Dou JF, Muggli E, Lacasaña M, Vrijheid M, Marsit CJ, Karagas MR, Räikkönen K, Bouchard L, Heude B, Santa-Marina L, Bustamante M, Hivert MF, Bilbao JR. A meta-analysis of pre-pregnancy maternal body mass index and placental DNA methylation identifies 27 CpG sites with implications for mother-child health. Commun Biol 2022; 5:1313. [PMID: 36446949 PMCID: PMC9709064 DOI: 10.1038/s42003-022-04267-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Accepted: 11/16/2022] [Indexed: 12/05/2022] Open
Abstract
Higher maternal pre-pregnancy body mass index (ppBMI) is associated with increased neonatal morbidity, as well as with pregnancy complications and metabolic outcomes in offspring later in life. The placenta is a key organ in fetal development and has been proposed to act as a mediator between the mother and different health outcomes in children. The overall aim of the present work is to investigate the association of ppBMI with epigenome-wide placental DNA methylation (DNAm) in 10 studies from the PACE consortium, amounting to 2631 mother-child pairs. We identify 27 CpG sites at which we observe placental DNAm variations of up to 2.0% per 10 ppBMI-unit. The CpGs that are differentially methylated in placenta do not overlap with CpGs identified in previous studies in cord blood DNAm related to ppBMI. Many of the identified CpGs are located in open sea regions, are often close to obesity-related genes such as GPX1 and LGR4 and altogether, are enriched in cancer and oxidative stress pathways. Our findings suggest that placental DNAm could be one of the mechanisms by which maternal obesity is associated with metabolic health outcomes in newborns and children, although further studies will be needed in order to corroborate these findings.
Collapse
Affiliation(s)
- Nora Fernandez-Jimenez
- Department of Genetics, Physical Anthropology and Animal Physiology, University of the Basque Country (UPV/EHU) and Biocruces-Bizkaia Health Research Institute, Leioa, Basque Country, Spain
| | - Ruby Fore
- Department of Population Medicine, Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, MA, USA
| | - Ariadna Cilleros-Portet
- Department of Genetics, Physical Anthropology and Animal Physiology, University of the Basque Country (UPV/EHU) and Biocruces-Bizkaia Health Research Institute, Leioa, Basque Country, Spain
| | - Johanna Lepeule
- University Grenoble Alpes, Inserm, CNRS, Team of Environmental Epidemiology Applied to Reproduction and Respiratory Health, IAB, Grenoble, France
| | - Patrice Perron
- Centre de Recherche du Centre Hospitalier Universitaire de Sherbrooke (CHUS), Sherbrooke, QC, Canada
| | - Tuomas Kvist
- Department of Psychology and Logopedics, University of Helsinki, Helsinki, Finland
| | - Fu-Ying Tian
- Gangarosa Department of Environmental Health, Rollins School of Public Health at Emory University, Atlanta, GA, USA
| | - Corina Lesseur
- Department of Environmental Medicine and Public Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Alexandra M Binder
- Cancer Epidemiology Program, University of Hawaii Cancer Center, Honolulu, HI, USA
- Department of Epidemiology, Fielding School of Public Health, University of California, Los Angeles, CA, USA
| | - Manuel Lozano
- Epidemiology and Environmental Health Joint Research Unit, FISABIO-Universitat Jaume I-Universitat de València, Valencia, Spain
- Preventive Medicine and Public Health, Food Sciences, Toxicology and Forensic Medicine Department, Universitat de València, Valencia, Spain
| | - Jordi Martorell-Marugán
- Department of Statistics and Operations Research, University of Granada, Granada, Spain
- Bioinformatics Unit. GENYO, Centre for Genomics and Oncological Research: Pfizer, University of Granada, Andalusian Regional Government, PTS Granada, Granada, Spain
| | - Yuk J Loke
- Murdoch Children's Research Institute, Parkville, VIC, Australia
- Department of Paediatrics, University of Melbourne, Parkville, VIC, Australia
| | - Kelly M Bakulski
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Yihui Zhu
- Department of Medical Microbiology and Immunology, MIND Institute, Genome Center, University of California, Davis, CA, USA
| | - Anne Forhan
- Université de Paris, Centre for Research in Epidemiology and Statistics (CRESS), INSERM, INRAE, Paris, France
| | - Sara Sammallahti
- Department of Child and Adolescent Psychiatry and Psychology, Erasmus MC, Rotterdam, The Netherlands
| | - Todd M Everson
- Gangarosa Department of Environmental Health, Rollins School of Public Health at Emory University, Atlanta, GA, USA
- Department of Epidemiology, Rollins School of Public health at Emory University, Atlanta, GA, USA
| | - Jia Chen
- Department of Environmental Medicine and Public Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Karin B Michels
- Department of Epidemiology, Fielding School of Public Health, University of California, Los Angeles, CA, USA
- Institute for Prevention and Cancer Epidemiology, Faculty of Medicine and Medical Center, University of Freiburg, Freiburg, Germany
| | - Thalia Belmonte
- Health Research Institute of Asturias, ISPA and Biomedical Research and Innovation Institute of Cadiz (INiBICA), Research Unit, Puerta del Mar University Hospital, Cadiz, Spain
| | - Pedro Carmona-Sáez
- Department of Statistics and Operations Research, University of Granada, Granada, Spain
- Bioinformatics Unit. GENYO, Centre for Genomics and Oncological Research: Pfizer, University of Granada, Andalusian Regional Government, PTS Granada, Granada, Spain
| | - Jane Halliday
- Murdoch Children's Research Institute, Parkville, VIC, Australia
- Department of Paediatrics, University of Melbourne, Parkville, VIC, Australia
| | - M Daniele Fallin
- Wendy Klag Center for Autism and Developmental Disabilities, Bloomberg School of Public Health, Johns Hopkins University, Baltimore, MD, USA
| | - Janine M LaSalle
- Department of Medical Microbiology and Immunology, MIND Institute, Genome Center, University of California, Davis, CA, USA
| | - Jorg Tost
- Laboratory for Epigenetics & Environment, Centre National de Recherche en Génomique Humaine, CEA-Institut de Biologie François Jacob, Evry, France
| | - Darina Czamara
- Max-Planck-Institute of Psychiatry, Department of Translational Research in Psychiatry, Munich, Germany
| | - Mariana F Fernández
- University of Granada, Center for Biomedical Research (CIBM), Granada, Spain
- Instituto de Investigación Biosanitaria ibs.GRANADA, Granada, Spain
- CIBER of Epidemiology and Public Health (CIBERESP), Madrid, Spain
| | - Antonio Gómez-Martín
- Instituto de Investigación Biosanitaria ibs.GRANADA, Granada, Spain
- Andalusian School of Public Health (EASP), Granada, Spain
| | - Jeffrey M Craig
- Murdoch Children's Research Institute, Parkville, VIC, Australia
- Deakin University, IMPACT - the Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Geelong, Australia
| | - Beatriz Gonzalez-Alzaga
- Instituto de Investigación Biosanitaria ibs.GRANADA, Granada, Spain
- Andalusian School of Public Health (EASP), Granada, Spain
| | - Rebecca J Schmidt
- Department of Public Health Sciences and the MIND Institute, University of California Davis School of Medicine, Davis, CA, USA
| | - John F Dou
- Department of Epidemiology, School of Public Health, University of Michigan, Ann Arbor, MI, USA
| | - Evelyne Muggli
- Murdoch Children's Research Institute, Parkville, VIC, Australia
- Department of Paediatrics, University of Melbourne, Parkville, VIC, Australia
| | - Marina Lacasaña
- Instituto de Investigación Biosanitaria ibs.GRANADA, Granada, Spain
- CIBER of Epidemiology and Public Health (CIBERESP), Madrid, Spain
- Andalusian School of Public Health (EASP), Granada, Spain
| | - Martine Vrijheid
- CIBER of Epidemiology and Public Health (CIBERESP), Madrid, Spain
- ISGlobal, Barcelona Institute for Global Health, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Carmen J Marsit
- Gangarosa Department of Environmental Health, Rollins School of Public Health at Emory University, Atlanta, GA, USA
- Department of Epidemiology, Rollins School of Public health at Emory University, Atlanta, GA, USA
| | - Margaret R Karagas
- Department of Biochemistry and Functional Genomics, Universite de Sherbrooke, Sherbrooke, QC, Canada
| | - Katri Räikkönen
- Department of Psychology and Logopedics, University of Helsinki, Helsinki, Finland
| | - Luigi Bouchard
- Department of Biochemistry and Functional Genomics, Universite de Sherbrooke, Sherbrooke, QC, Canada
- Department of Laboratory Medicine, CIUSSS du Saguenay-Lac-St-Jean - Hôpital Universitaire de Chicoutimi, Chicoutimi, QC, Canada
| | - Barbara Heude
- Université de Paris, Centre for Research in Epidemiology and Statistics (CRESS), INSERM, INRAE, Paris, France
| | - Loreto Santa-Marina
- CIBER of Epidemiology and Public Health (CIBERESP), Madrid, Spain
- Biodonostia, Epidemiology and Public Health Area, Environmental Epidemiology and Child Development Group, 20014, San Sebastian, Basque Country, Spain
- Health Department of Basque Government, Sub-directorate of Public Health of Gipuzkoa, San Sebastian, Basque Country, Spain
| | - Mariona Bustamante
- CIBER of Epidemiology and Public Health (CIBERESP), Madrid, Spain
- ISGlobal, Barcelona Institute for Global Health, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Marie-France Hivert
- Department of Population Medicine, Harvard Medical School and Harvard Pilgrim Health Care Institute, Boston, MA, USA
- Centre de Recherche du Centre Hospitalier Universitaire de Sherbrooke (CHUS), Sherbrooke, QC, Canada
- Diabetes Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Jose Ramon Bilbao
- Department of Genetics, Physical Anthropology and Animal Physiology, University of the Basque Country (UPV/EHU) and Biocruces-Bizkaia Health Research Institute, Leioa, Basque Country, Spain.
- CIBER of diabetes and associated metabolic disorders (CIBERDEM), Madrid, Spain.
| |
Collapse
|
10
|
Olstad EW, Nordeng HME, Sandve GK, Lyle R, Gervin K. Low reliability of DNA methylation across Illumina Infinium platforms in cord blood: implications for replication studies and meta-analyses of prenatal exposures. Clin Epigenetics 2022; 14:80. [PMID: 35765087 PMCID: PMC9238140 DOI: 10.1186/s13148-022-01299-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2022] [Accepted: 06/12/2022] [Indexed: 11/20/2022] Open
Abstract
Background There is an increasing interest in the role of epigenetics in epidemiology, but the emerging research field faces several critical biological and technical challenges. In particular, recent studies have shown poor correlation of measured DNA methylation (DNAm) levels within and across Illumina Infinium platforms in various tissues. In this study, we have investigated concordance between 450 k and EPIC Infinium platforms in cord blood. We could not replicate our previous findings on the association of prenatal paracetamol exposure with cord blood DNAm, which prompted an investigation of cross-platform DNAm differences. Results This study is based on two DNAm data sets from cord blood samples selected from the Norwegian Mother, Father and Child Cohort Study (MoBa). DNAm of one data set was measured using the 450 k platform and the other data set was measured using the EPIC platform. Initial analyses of the EPIC data could not replicate any of our previous significant findings in the 450 k data on associations between prenatal paracetamol exposure and cord blood DNAm. A subset of the samples (n = 17) was included in both data sets, which enabled analyses of technical sources potentially contributing to the negative replication. Analyses of these 17 samples with repeated measurements revealed high per-sample correlations (\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\stackrel{\mathrm{-}}{\text{R}}\hspace{0.17em}\approx$$\end{document}R-≈ 0.99), but low per-CpG correlations (\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$\stackrel{\mathrm{-}}{\text{R}}$$\end{document}R- ≈ 0.24) between the platforms. 1.7% of the CpGs exhibited a mean DNAm difference across platforms > 0.1. Furthermore, only 26.7% of the CpGs exhibited a moderate or better cross-platform reliability (intra-class correlation coefficient ≥ 0.5). Conclusion The observations of low cross-platform probe correlation and reliability corroborate previous reports in other tissues. Our study cannot determine the origin of the differences between platforms. Nevertheless, it emulates the setting in studies using data from multiple Infinium platforms, often analysed several years apart. Therefore, the findings may have important implications for future epigenome-wide association studies (EWASs), in replication, meta-analyses and longitudinal studies. Cognisance and transparency of the challenges related to cross-platform studies may enhance the interpretation, replicability and validity of EWAS results both in cord blood and other tissues, ultimately improving the clinical relevance of epigenetic epidemiology. Supplementary Information The online version contains supplementary material available at 10.1186/s13148-022-01299-3.
Collapse
|
11
|
Vanderlinden LA, Johnson RK, Carry PM, Dong F, DeMeo DL, Yang IV, Norris JM, Kechris K. An effective processing pipeline for harmonizing DNA methylation data from Illumina's 450K and EPIC platforms for epidemiological studies. BMC Res Notes 2021; 14:352. [PMID: 34496950 PMCID: PMC8424820 DOI: 10.1186/s13104-021-05741-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 08/16/2021] [Indexed: 02/06/2023] Open
Abstract
OBJECTIVE Illumina BeadChip arrays are commonly used to generate DNA methylation data for large epidemiological studies. Updates in technology over time create challenges for data harmonization within and between studies, many of which obtained data from the older 450K and newer EPIC platforms. The pre-processing pipeline for DNA methylation is not trivial, and influences the downstream analyses. Incorporating different platforms adds a new level of technical variability that has not yet been taken into account by recommended pipelines. Our study evaluated the performance of various tools on different versions of platform data harmonization at each step of pre-processing pipeline, including quality control (QC), normalization, batch effect adjustment, and genomic inflation. We illustrate our novel approach using 450K and EPIC data from the Diabetes Autoimmunity Study in the Young (DAISY) prospective cohort. RESULTS We found normalization and probe filtering had the biggest effect on data harmonization. Employing a meta-analysis was an effective and easily executable method for accounting for platform variability. Correcting for genomic inflation also helped with harmonization. We present guidelines for studies seeking to harmonize data from the 450K and EPIC platforms, which includes the use of technical replicates for evaluating numerous pre-processing steps, and employing a meta-analysis.
Collapse
Affiliation(s)
- Lauren A Vanderlinden
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Randi K Johnson
- Department of Epidemiology, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Patrick M Carry
- Department of Epidemiology, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Fran Dong
- Department of Epidemiology, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Dawn L DeMeo
- Channing Division of Network Medicine, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Ivana V Yang
- School of Medicine, University of Colorado, Anschutz Medical Campus, Aurora, CO, USA
| | - Jill M Norris
- Department of Epidemiology, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Katerina Kechris
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.
| |
Collapse
|
12
|
Gao X, Liu S, Song H, Feng X, Duan M, Huang L, Zhou F. AgeGuess, a Methylomic Prediction Model for Human Ages. Front Bioeng Biotechnol 2020; 8:80. [PMID: 32211384 PMCID: PMC7075810 DOI: 10.3389/fbioe.2020.00080] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2019] [Accepted: 01/29/2020] [Indexed: 12/15/2022] Open
Abstract
Aging was a biological process under regulations from both inherited genetic factors and various molecular modifications within cells during the lifespan. Multiple studies demonstrated that the chronological age may be accurately predicted using the methylomic data. This study proposed a three-step feature selection algorithm AgeGuess for the age regression problem. AgeGuess selected 107 methylomic features as the gender-independent age biomarkers and the Support Vector Regressor (SVR) model using these biomarkers achieved 2.0267 in the mean absolute deviation (MAD) compared with the real chronological ages. Another regression algorithm Ridge achieved a slightly better MAD 1.9859 using the same biomarkers. The gender-independent age prediction models may be further improved by establishing two gender-specific models. And it's interesting to observe that there were only two methylation biomarkers shared by the two gender-specific biomarker sets and these two biomarkers were within the two known age-associated biomarker genes CALB1 and KLF14.
Collapse
Affiliation(s)
- Xiaoqian Gao
- BioKnow Health Informatics Laboratory Key Laboratory of Symbolic Computation and Knowledge Engineering, College of Computer Science and Technology, Ministry of Education, Jilin University, Changchun, China
| | - Shuai Liu
- BioKnow Health Informatics Laboratory Key Laboratory of Symbolic Computation and Knowledge Engineering, College of Computer Science and Technology, Ministry of Education, Jilin University, Changchun, China
| | - Haoqiu Song
- BioKnow Health Informatics Laboratory Key Laboratory of Symbolic Computation and Knowledge Engineering, College of Computer Science and Technology, Ministry of Education, Jilin University, Changchun, China.,College of Computer Science, Hubei University of Technology, Wuhan, China
| | - Xin Feng
- BioKnow Health Informatics Laboratory Key Laboratory of Symbolic Computation and Knowledge Engineering, College of Computer Science and Technology, Ministry of Education, Jilin University, Changchun, China
| | - Meiyu Duan
- BioKnow Health Informatics Laboratory Key Laboratory of Symbolic Computation and Knowledge Engineering, College of Computer Science and Technology, Ministry of Education, Jilin University, Changchun, China
| | - Lan Huang
- Key Laboratory of Symbolic Computation and Knowledge Engineering, College of Computer Science and Technology, Ministry of Education, Jilin University, Changchun, China
| | - Fengfeng Zhou
- BioKnow Health Informatics Laboratory Key Laboratory of Symbolic Computation and Knowledge Engineering, College of Computer Science and Technology, Ministry of Education, Jilin University, Changchun, China
| |
Collapse
|
13
|
Cheung K, Burgers MJ, Young DA, Cockell S, Reynard LN. Correlation of Infinium HumanMethylation450K and MethylationEPIC BeadChip arrays in cartilage. Epigenetics 2019; 15:594-603. [PMID: 31833794 PMCID: PMC7574380 DOI: 10.1080/15592294.2019.1700003] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
DNA methylation of CpG sites is commonly measured using Illumina Infinium BeadChip platforms. The Infinium MethylationEPIC array has replaced the Infinium Methylation450K array. The two arrays use the same technology, with the EPIC array assaying almost double the number of sites than the 450K array. In this study, we compare DNA methylation values of shared CpGs of the same human cartilage samples assayed using both platforms. DNA methylation was measured in 21 human cartilage samples using the both 450K and EPIC arrays. Additional matched 450K and EPIC data in whole tumour and whole blood were downloaded from GEO GSE92580 and GSE86833, respectively. Data were processed using the Bioconductor package Minfi. DNA methylation of six CpG sites was validated for the same 21 cartilage samples by pyrosequencing. In cartilage samples, overall sample correlations of methylation values between arrays were high (Pearson’s r > 0.96). However, 50.5% of CpG sites showed poor correlation (r < 0.2) between arrays. Sites with limited variance and with either very high or very low methylation levels in cartilage exhibited lower correlation values, corroborating prior studies in whole blood. Bisulphite pyrosequencing did not highlight one array as generating more accurate methylation values. For a specific CpG site, the array methylation correlation coefficient differed between cartilage, tumour, and whole blood, reflecting the difference in methylation variance between cell types. Researchers should be cautious when analysing methylation of CpG sites that show low methylation variance within the cell type of interest, regardless of the method used to assay methylation.
Collapse
Affiliation(s)
- Kathleen Cheung
- Skeletal Research Group, Institute of Genetic Medicine, Newcastle University, Central Parkway , Newcastle upon Tyne, UK.,Bioinformatics Support Unit, Faculty of Medical Sciences, Newcastle University , Newcastle upon Tyne, UK
| | - Marjolein J Burgers
- Skeletal Research Group, Institute of Genetic Medicine, Newcastle University, Central Parkway , Newcastle upon Tyne, UK
| | - David A Young
- Skeletal Research Group, Institute of Genetic Medicine, Newcastle University, Central Parkway , Newcastle upon Tyne, UK
| | - Simon Cockell
- Bioinformatics Support Unit, Faculty of Medical Sciences, Newcastle University , Newcastle upon Tyne, UK
| | - Louise N Reynard
- Skeletal Research Group, Institute of Genetic Medicine, Newcastle University, Central Parkway , Newcastle upon Tyne, UK
| |
Collapse
|