1
|
African ancestry-derived APOL1 risk genotypes show proximal epigenetic associations. BMC Genomics 2024; 25:452. [PMID: 38714935 PMCID: PMC11077761 DOI: 10.1186/s12864-024-10226-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/28/2023] [Accepted: 03/14/2024] [Indexed: 05/12/2024] Open
Abstract
Apolipoprotein L1 (APOL1) coding variants, termed G1 and G2, are established genetic risk factors for a growing spectrum of diseases, including kidney disease, in individuals of African ancestry. Evidence suggests that the risk variants, which show a recessive mode of inheritance, lead to toxic gain-of-function changes of the APOL1 protein. Disease occurrence and presentation vary, likely due to modifiers or second hits. To understand the role of the epigenetic landscape in relation to APOL1 risk variants, we performed methylation quantitative trait locus (meQTL) analysis to identify differentially methylated CpGs influenced by APOL1 risk variants in 611 African American individuals. We identified five CpGs that were significantly associated with APOL1 risk alleles in discovery and replication studies, and one CpG-APOL1 association was independent of other genomic variants. Our study highlights proximal DNA methylation alterations that may help explain the variable disease risk and clinical manifestation of APOL1 variants.
|
2
|
Epigenome-wide association study of lung cancer among never smokers in two prospective cohorts in Shanghai, China. Thorax 2024:thorax-2023-220352. [PMID: 38702190 DOI: 10.1136/thorax-2023-220352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/19/2023] [Accepted: 02/17/2024] [Indexed: 05/06/2024]
Abstract
BACKGROUND The aetiology of lung cancer among individuals who never smoked remains elusive, despite 15% of lung cancer cases in men and 53% in women worldwide being unrelated to smoking. Epigenetic alterations, particularly DNA methylation (DNAm) changes, have emerged as potential drivers. Yet, few prospective epigenome-wide association studies (EWAS), primarily focusing on peripheral blood DNAm with limited representation of never smokers, have been conducted. METHODS We conducted a nested case-control study of 80 never-smoking incident lung cancer cases and 83 never-smoking controls within the Shanghai Women's Health Study and Shanghai Men's Health Study. DNAm was measured in prediagnostic oral rinse samples using the Illumina MethylationEPIC array. Initially, we conducted an EWAS to identify differentially methylated positions (DMPs) associated with lung cancer in the discovery sample of 101 subjects. The top 50 DMPs were further evaluated in a replication sample of 62 subjects, and results were pooled using fixed-effect meta-analysis. RESULTS Our study identified three DMPs significantly associated with lung cancer at the epigenome-wide significance level of p<8.22×10⁻⁸. These DMPs were identified as cg09198866 (MYH9; TXN2), cg01411366 (SLC9A10) and cg12787323. Furthermore, examination of the top 1000 DMPs indicated significant enrichment in epithelial regulatory regions and their involvement in small GTPase-mediated signal transduction pathways. Additionally, GrimAge acceleration was identified as a risk factor for lung cancer (OR=1.19 per year; 95% CI 1.06 to 1.34). CONCLUSIONS While replication in a larger sample size is necessary, our findings suggest that DNAm patterns in prediagnostic oral rinse samples could provide novel insights into the underlying mechanisms of lung cancer in never smokers.
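The pooling step described in the METHODS above (discovery and replication results combined by fixed-effect meta-analysis) can be illustrated with a minimal sketch. It assumes the standard inverse-variance weighting scheme, which the abstract does not spell out; the function name `fixed_effect_meta` and the effect sizes are hypothetical and are not taken from the study.

```python
import math

def fixed_effect_meta(estimates, std_errors):
    """Pool per-sample effect estimates using inverse-variance weights.

    Assumption: this is the textbook fixed-effect estimator, shown only
    as an illustration; the study's exact software and settings are unknown.
    """
    weights = [1.0 / se ** 2 for se in std_errors]
    total_weight = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, estimates)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = pooled / pooled_se
    # Two-sided p-value under a standard normal reference distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled, pooled_se, p_value

# Hypothetical effect estimates and standard errors for a single CpG
# in the discovery and replication samples (made-up numbers).
pooled, pooled_se, p_value = fixed_effect_meta([0.30, 0.20], [0.10, 0.10])
print(pooled, pooled_se, p_value)
```

With equal standard errors the pooled estimate is the simple average of the two inputs, and the pooled standard error shrinks by a factor of √2, which is why a signal that is borderline in each sample can become significant after pooling.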
|
3
|
Analysis of blood methylation quantitative trait loci in East Asians reveals ancestry-specific impacts on complex traits. Nat Genet 2024; 56:846-860. [PMID: 38641644 DOI: 10.1038/s41588-023-01494-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/17/2021] [Accepted: 08/02/2023] [Indexed: 04/21/2024]
Abstract
Methylation quantitative trait loci (mQTLs) are essential for understanding the role of DNA methylation changes in genetic predisposition, yet they have not been fully characterized in East Asians (EAs). Here we identified mQTLs in whole blood from 3,523 Chinese individuals and replicated them in an additional 1,858 Chinese individuals from two cohorts. Over 9% of mQTLs displayed specificity to EAs, facilitating the fine-mapping of EA-specific genetic associations, as shown for variants associated with height. Trans-mQTL hotspots revealed biological pathways contributing to EA-specific genetic associations, including an ERG-mediated 233 trans-mCpG network, implicated in hematopoietic cell differentiation, which likely reflects binding efficiency modulation of the ERG protein complex. More than 90% of mQTLs were shared between different blood cell lineages, with a smaller fraction of lineage-specific mQTLs displaying preferential hypomethylation in the respective lineages. Our study provides new insights into the mQTL landscape across genetic ancestries and their downstream effects on cellular processes and diseases/traits.
|
4
|
Comprehensive evaluation of smoking exposures and their interactions on DNA methylation. EBioMedicine 2024; 100:104956. [PMID: 38199042 PMCID: PMC10825325 DOI: 10.1016/j.ebiom.2023.104956] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/07/2023] [Revised: 12/19/2023] [Accepted: 12/20/2023] [Indexed: 01/12/2024] Open
Abstract
BACKGROUND Smoking impacts DNA methylation, but data are lacking on smoking-related differential methylation by sex or dietary intake, recent smoking cessation (<1 year), persistence of differential methylation from in utero smoking exposure, and effects of environmental tobacco smoke (ETS). METHODS We meta-analysed data from up to 15,014 adults across 5 cohorts with DNA methylation measured in blood using Illumina's EPIC array for current smoking (2560 exposed), quit < 1 year (500 exposed), in utero (286 exposed), and ETS exposure (676 exposed). We also evaluated the interaction of current smoking with sex or diet (fibre, folate, and vitamin C). FINDINGS Using false discovery rate (FDR < 0.05), 65,857 CpGs were differentially methylated in relation to current smoking, 4025 with recent quitting, 594 with in utero exposure, and 6 with ETS. Most current smoking CpGs attenuated within a year of quitting. CpGs related to in utero exposure in adults were enriched for those previously observed in newborns. Differential methylation by current smoking at 4-71 CpGs may be modified by sex or dietary intake. Nearly half (35-50%) of differentially methylated CpGs on the 450 K array were associated with blood gene expression. Current smoking and in utero smoking CpGs implicated 3049 and 1067 druggable targets, including chemotherapy drugs. INTERPRETATION Many smoking-related methylation sites were identified with Illumina's EPIC array. Most signals revert to levels observed in never smokers within a year of cessation. Many in utero smoking CpGs persist into adulthood. Smoking-related druggable targets may provide insights into cancer treatment response and shared mechanisms across smoking-related diseases. 
FUNDING Intramural Research Program of the National Institutes of Health, Norwegian Ministry of Health and Care Services and the Ministry of Education and Research, Chief Scientist Office of the Scottish Government Health Directorates and the Scottish Funding Council, Medical Research Council UK and the Wellcome Trust.
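The FINDINGS above report CpG counts at a false discovery rate threshold (FDR < 0.05). As an illustration of how such a threshold is commonly applied, here is a minimal sketch of the Benjamini-Hochberg step-up procedure. The abstract does not state which FDR method was used, so Benjamini-Hochberg is an assumption; the function name and the p-values are hypothetical.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return a list of booleans, True where a test is declared significant.

    Assumption: Benjamini-Hochberg step-up procedure, shown as one common
    way to control the FDR; it is not confirmed as the study's method.
    """
    m = len(p_values)
    # Indices of the tests, ordered from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k such that p_(k) <= (k / m) * alpha.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            cutoff_rank = rank
    # Every test ranked at or below the cutoff is rejected.
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= cutoff_rank:
            rejected[idx] = True
    return rejected

# Hypothetical p-values for five CpGs (made-up numbers).
example = [0.001, 0.008, 0.039, 0.041, 0.6]
print(benjamini_hochberg(example))
```

Because the procedure is step-up, a p-value that misses its own rank threshold is still rejected when a larger p-value further down the ordering meets its threshold; that is why the cutoff is taken as the largest passing rank rather than the first failure.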
|
5
|
Genome-wide study investigating effector genes and polygenic prediction for kidney function in persons with ancestry from Africa and the Americas. CELL GENOMICS 2024; 4:100468. [PMID: 38190104 PMCID: PMC10794846 DOI: 10.1016/j.xgen.2023.100468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/06/2023] [Revised: 08/31/2023] [Accepted: 11/28/2023] [Indexed: 01/09/2024]
Abstract
Chronic kidney disease is a leading cause of death and disability globally and impacts individuals of African ancestry (AFR) or with ancestry in the Americas (AMS) who are under-represented in genome-wide association studies (GWASs) of kidney function. To address this bias, we conducted a large meta-analysis of GWASs of estimated glomerular filtration rate (eGFR) in 145,732 AFR and AMS individuals. We identified 41 loci at genome-wide significance (p < 5 × 10⁻⁸), of which two have not been previously reported in any ancestry group. We integrated fine-mapped loci with epigenomic and transcriptomic resources to highlight potential effector genes relevant to kidney physiology and disease, and reveal key regulatory elements and pathways involved in renal function and development. We demonstrate the varying but increased predictive power offered by a multi-ancestry polygenic score for eGFR and highlight the importance of population diversity in GWASs and multi-omics resources to enhance opportunities for clinical translation for all.
|
6
|
FORGEdb: a tool for identifying candidate functional variants and uncovering target genes and mechanisms for complex diseases. Genome Biol 2024; 25:3. [PMID: 38167104 PMCID: PMC10763681 DOI: 10.1186/s13059-023-03126-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/22/2023] [Accepted: 11/27/2023] [Indexed: 01/05/2024] Open
Abstract
The majority of disease-associated variants identified through genome-wide association studies are located outside of protein-coding regions. Prioritizing candidate regulatory variants and gene targets to identify potential biological mechanisms for further functional experiments can be challenging. To address this challenge, we developed FORGEdb ( https://forgedb.cancer.gov/ ; https://forge2.altiusinstitute.org/files/forgedb.html ; and https://doi.org/10.5281/zenodo.10067458 ), a standalone and web-based tool that integrates multiple datasets, delivering information on associated regulatory elements, transcription factor binding sites, and target genes for over 37 million variants. FORGEdb scores provide researchers with a quantitative assessment of the relative importance of each variant for targeted functional experiments.
|
7
|
|
8
|
Universal DNA methylation age across mammalian tissues. NATURE AGING 2023; 3:1144-1166. [PMID: 37563227 PMCID: PMC10501909 DOI: 10.1038/s43587-023-00462-6] [Citation(s) in RCA: 38] [Impact Index Per Article: 38.0] [Received: 09/29/2022] [Accepted: 06/21/2023] [Indexed: 08/12/2023]
Abstract
Aging, often considered a result of random cellular damage, can be accurately estimated using DNA methylation profiles, the foundation of pan-tissue epigenetic clocks. Here, we demonstrate the development of universal pan-mammalian clocks, using 11,754 methylation arrays from our Mammalian Methylation Consortium, which encompass 59 tissue types across 185 mammalian species. These predictive models estimate mammalian tissue age with high accuracy (r > 0.96). Age deviations correlate with human mortality risk, mouse somatotropic axis mutations and caloric restriction. We identified specific cytosines with methylation levels that change with age across numerous species. These sites, highly enriched in polycomb repressive complex 2-binding locations, are near genes implicated in mammalian development, cancer, obesity and longevity. Our findings offer new evidence suggesting that aging is evolutionarily conserved and intertwined with developmental processes across all mammals.
|
9
|
Abstract
Using DNA methylation profiles (n = 15,456) from 348 mammalian species, we constructed phyloepigenetic trees that bear marked similarities to traditional phylogenetic ones. Using unsupervised clustering across all samples, we identified 55 distinct cytosine modules, of which 30 are related to traits such as maximum life span, adult weight, age, sex, and human mortality risk. Maximum life span is associated with methylation levels in HOXL subclass homeobox genes and developmental processes and is potentially regulated by pluripotency transcription factors. The methylation state of some modules responds to perturbations such as caloric restriction, ablation of growth hormone receptors, consumption of high-fat diets, and expression of Yamanaka factors. This study reveals an intertwined evolution of the genome and epigenome that mediates the biological characteristics and traits of different mammalian species.
|
10
|
Predictors of postoperative complications and readmissions in laparoscopic pancreas resection: Results of a cohort 105 consecutive cases. A retrospective study. Cir Esp 2023; 101:333-340. [PMID: 35500758 DOI: 10.1016/j.cireng.2022.04.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/04/2021] [Accepted: 04/14/2022] [Indexed: 05/16/2023]
Abstract
INTRODUCTION Laparoscopic resection of the pancreas (LRP) has been implemented to a varying degree because it is technically demanding and requires a long learning curve. In the present study we analyze the risk factors for complications and hospital readmissions in a single center study of 105 consecutive LRPs. METHODS We conducted a retrospective study using a prospective database. Data were collected on age, gender, BMI, ASA score, type of surgery, histologic type, operative time, hospital stay, postoperative complications, degree of severity and hospital readmission. RESULTS The cohort included 105 patients, 63 females and 42 males with a median age and BMI of 58 (53-70) and 25.5 (22.2-27.9) respectively. Eighteen (17%) central pancreatectomies, 5 (4.8%) enucleations, 81 (77.6%) distal pancreatectomies and one total pancreatectomy were performed. Fifty-six patients (53.3%) experienced some type of complication, of which 13 (12.3%) were severe (Clavien-Dindo > IIIb) and 11 (10.5%) patients were readmitted in the first 30 days after surgery. In the univariate analysis, age, male gender, ASA score, central pancreatectomy and operative time were significantly associated with the development of complications (P < 0.05). In the multivariate analysis, male gender (OR 7.97; 95% CI, 1.08-58.88), severe complications (OR 59.40; 95% CI, 7.69-458.99), and the development of intra-abdominal collections (OR 8.97; 95% CI, 1.28-63.02) were associated with hospital readmission. CONCLUSIONS Age, male gender, ASA score, operative time and central pancreatectomy are associated with a higher incidence of complications. Male gender, severe complications and intra-abdominal collections are associated with more hospital readmissions.
|
11
|
Potentially causal associations between placental DNA methylation and schizophrenia and other neuropsychiatric disorders. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.03.07.23286905. [PMID: 36945560 PMCID: PMC10029044 DOI: 10.1101/2023.03.07.23286905] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/12/2023]
Abstract
Increasing evidence supports the role of the placenta in neurodevelopment and, potentially, in the later onset of neuropsychiatric disorders. Recently, methylation quantitative trait loci (mQTL) and interaction QTL (iQTL) maps have proven useful to understand SNP-genome wide association study (GWAS) relationships, otherwise missed by conventional expression QTLs. In this context, we propose that part of the genetic predisposition to complex neuropsychiatric disorders acts through placental DNA methylation (DNAm). We constructed the first public placental cis-mQTL database including nearly eight million mQTLs calculated in 368 fetal placenta DNA samples from the INMA project, ran cell type- and gestational age-imQTL models and combined those data with the summary statistics of the largest GWAS on 10 neuropsychiatric disorders using Summary-based Mendelian Randomization (SMR) and colocalization. Finally, we evaluated the influence of the DNAm sites identified on placental gene expression in the RICHS cohort. We found that placental cis-mQTLs are highly enriched in placenta-specific active chromatin regions, and useful to map the etiology of neuropsychiatric disorders at prenatal stages. Specifically, part of the genetic burden for schizophrenia, bipolar disorder and major depressive disorder confers risk through placental DNAm. The potential causality of several of the observed associations is reinforced by secondary association signals identified in conditional analyses, regional pleiotropic methylation signals associated with the same disorder, and cell type-imQTLs, additionally associated with the expression levels of relevant immune genes in placenta. In conclusion, the genetic risk of several neuropsychiatric disorders could operate, at least in part, through DNAm and associated gene expression in placenta.
|
12
|
Pulmonary Function and Blood DNA Methylation: A Multiancestry Epigenome-Wide Association Meta-analysis. Am J Respir Crit Care Med 2022; 206:321-336. [PMID: 35536696 PMCID: PMC9890261 DOI: 10.1164/rccm.202108-1907oc] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Indexed: 02/04/2023] Open
Abstract
Rationale: Methylation integrates factors present at birth and modifiable across the lifespan that can influence pulmonary function. Studies are limited in scope and replication. Objectives: To conduct large-scale epigenome-wide meta-analyses of blood DNA methylation and pulmonary function. Methods: Twelve cohorts analyzed associations of methylation at cytosine-phosphate-guanine probes (CpGs), using Illumina 450K or EPIC/850K arrays, with FEV1, FVC, and FEV1/FVC. We performed multiancestry epigenome-wide meta-analyses (total of 17,503 individuals; 14,761 European, 2,549 African, and 193 Hispanic/Latino ancestries) and interpreted results using integrative epigenomics. Measurements and Main Results: We identified 1,267 CpGs (1,042 genes) differentially methylated (false discovery rate, <0.025) in relation to FEV1, FVC, or FEV1/FVC, including 1,240 novel and 73 also related to chronic obstructive pulmonary disease (1,787 cases). We found 294 CpGs unique to European or African ancestry and 395 CpGs unique to never or ever smokers. The majority of significant CpGs correlated with nearby gene expression in blood. Findings were enriched in key regulatory elements for gene function, including accessible chromatin elements, in both blood and lung. Sixty-nine implicated genes are targets of investigational or approved drugs. One example novel gene highlighted by integrative epigenomic and druggable target analysis is TNFRSF4. Mendelian randomization and colocalization analyses suggest that epigenome-wide association study signals capture causal regulatory genomic loci. Conclusions: We identified numerous novel loci differentially methylated in relation to pulmonary function; few were detected in large genome-wide association studies. Integrative analyses highlight functional relevance and potential therapeutic targets. 
This comprehensive discovery of potentially modifiable, novel lung function loci expands knowledge gained from genetic studies, providing insights into lung pathogenesis.
|
13
|
Diversity in EWAS: current state, challenges, and solutions. Genome Med 2022; 14:71. [PMID: 35794667 PMCID: PMC9258042 DOI: 10.1186/s13073-022-01065-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Received: 05/19/2022] [Accepted: 05/20/2022] [Indexed: 11/23/2022] Open
Abstract
Here, we report a lack of diversity in epigenome-wide association studies (EWAS) and DNA methylation (DNAm) data, discuss current challenges, and propose solutions for EWAS and DNAm research in diverse populations. The strategies we propose include fostering community involvement, new data generation, and cost-effective approaches such as locus-specific analysis and ancestry variable region analysis.
|
14
|
Abstract
Recent work has highlighted a lack of diversity in genomic studies. However, less attention has been given to epigenomics. Here, we show that epigenomic studies are lacking in diversity and propose several solutions to address this problem.
|
15
|
Cell Type-Specific Signal Analysis in Epigenome-Wide Association Studies. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2432:57-71. [PMID: 35505207 DOI: 10.1007/978-1-0716-1994-0_5] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Indexed: 12/13/2022]
Abstract
Hundreds of epigenome-wide association studies (EWAS) have been performed, successfully identifying replicated epigenomic signals in processes such as aging and smoking. Despite this progress, it remains a major challenge in EWAS to detect both cell type-specific and cell type confounding effects impacting study results. One way to identify these effects is through eFORGE (experimentally derived Functional element Overlap analysis of ReGions from EWAS), a published tool that uses 815 datasets from large-scale mapping studies to detect enriched tissues, cell types, and genomic regions. Here, I show that eFORGE analysis can be extended to EWAS differentially variable positions (DVPs), identifying target cell types and tissues. In addition, I also show that eFORGE tissue-specific enrichment can be detected for sites below EWAS significance threshold. I develop on these and other analysis examples, extending our knowledge of eFORGE cell type- and tissue-specific enrichment results for different EWAS.
|
16
|
A pan-tissue DNA methylation atlas enables in silico decomposition of human tissue methylomes at cell-type resolution. Nat Methods 2022; 19:296-306. [PMID: 35277705 PMCID: PMC8916958 DOI: 10.1038/s41592-022-01412-7] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Received: 07/11/2021] [Accepted: 01/28/2022] [Indexed: 02/07/2023]
Abstract
Bulk-tissue DNA methylomes represent an average over many different cell types, hampering our understanding of cell-type-specific contributions to disease development. As single-cell methylomics is not scalable to large cohorts of individuals, cost-effective computational solutions are needed, yet current methods are limited to tissues such as blood. Here we leverage the high-resolution nature of tissue-specific single-cell RNA-sequencing datasets to construct a DNA methylation atlas defined for 13 solid tissue types and 40 cell types. We comprehensively validate this atlas in independent bulk and single-nucleus DNA methylation datasets. We demonstrate that it correctly predicts the cell of origin of diverse cancer types and discovers new prognostic associations in olfactory neuroblastoma and stage 2 melanoma. In brain, the atlas predicts a neuronal origin for schizophrenia, with neuron-specific differential DNA methylation enriched for corresponding genome-wide association study risk loci. In summary, the DNA methylation atlas enables the decomposition of 13 different human tissue types at a high cellular resolution, paving the way for an improved interpretation of epigenetic data. This resource presents an in silico generated DNA methylation atlas that can be used for cell-type deconvolution of human tissues.
|
17
|
A mammalian methylation array for profiling methylation levels at conserved sequences. Nat Commun 2022; 13:783. [PMID: 35145108 PMCID: PMC8831611 DOI: 10.1038/s41467-022-28355-z] [Citation(s) in RCA: 60] [Impact Index Per Article: 30.0] [Received: 11/15/2021] [Accepted: 01/20/2022] [Indexed: 12/12/2022] Open
Abstract
Infinium methylation arrays are not available for the vast majority of non-human mammals. Moreover, even if species-specific arrays were available, probe differences between them would confound cross-species comparisons. To address these challenges, we developed the mammalian methylation array, a single custom array that measures up to 36k CpGs per species that are well conserved across many mammalian species. We designed a set of probes that can tolerate specific cross-species mutations. We annotate the array in over 200 species and report CpG island status and chromatin states in select species. Calibration experiments demonstrate the high fidelity in humans, rats, and mice. The mammalian methylation array has several strengths: it applies to all mammalian species even those that have not yet been sequenced, it provides deep coverage of conserved cytosines facilitating the development of epigenetic biomarkers, and it increases the probability that biological insights gained in one species will translate to others. Methods to probe DNA methylation in the majority of non-human mammals are lacking. Here the authors developed a Mammalian Methylation Array that includes 36k well-conserved CpGs in mammals which will facilitate cross-species comparisons. They annotate the conserved CpGs in > 200 species. The array allows one to measure methylation in all mammalian species including unsequenced ones.
|
18
|
Integrative analysis of 3604 GWAS reveals multiple novel cell type-specific regulatory associations. Genome Biol 2022; 23:13. [PMID: 34996498 PMCID: PMC8742386 DOI: 10.1186/s13059-021-02560-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Received: 10/31/2020] [Accepted: 11/26/2021] [Indexed: 01/02/2023] Open
Abstract
Background Genome-wide association study (GWAS) single nucleotide polymorphisms (SNPs) are known to preferentially co-locate to active regulatory elements in tissues and cell types relevant to disease aetiology. Further characterisation of associated cell type-specific regulation can broaden our understanding of how GWAS signals may contribute to disease risk. Results To gain insight into potential functional mechanisms underlying GWAS associations, we developed FORGE2 (https://forge2.altiusinstitute.org/), which is an updated version of the FORGE web tool. FORGE2 uses an expanded atlas of cell type-specific regulatory element annotations, including DNase I hotspots, five histone mark categories and 15 hidden Markov model (HMM) chromatin states, to identify tissue- and cell type-specific signals. An analysis of 3,604 GWAS from the NHGRI-EBI GWAS catalogue yielded at least one significant disease/trait-tissue association for 2,057 GWAS, including > 400 associations specific to epigenomic marks in immune tissues and cell types, > 30 associations specific to heart tissue, and > 60 associations specific to brain tissue, highlighting the key potential of tissue- and cell type-specific regulatory elements. Importantly, we demonstrate that FORGE2 analysis can separate previously observed accessible chromatin enrichments into different chromatin states, such as enhancers or active transcription start sites, providing a greater understanding of underlying regulatory mechanisms. Interestingly, tissue-specific enrichments for repressive chromatin states and histone marks were also detected, suggesting a role for tissue-specific repressed regions in GWAS-mediated disease aetiology. Conclusion In summary, we demonstrate that FORGE2 has the potential to uncover previously unreported disease-tissue associations and identify new candidate mechanisms. FORGE2 is a transparent, user-friendly web tool for the integrative analysis of loci discovered from GWAS. 
Supplementary Information The online version contains supplementary material available at 10.1186/s13059-021-02560-3.
|
19
|
Abstract
Methylation levels at specific CpG positions in the genome have been used to develop accurate estimators of chronological age in humans, mice, and other species. Although epigenetic clocks are generally species-specific, the principles underpinning them appear to be conserved at least across the mammalian class. This is exemplified by the successful development of epigenetic clocks for mice and several other mammalian species. Here, we describe epigenetic clocks for the rhesus macaque (Macaca mulatta), the most widely used nonhuman primate in biological research. Using a custom methylation array (HorvathMammalMethylChip40), we profiled n = 281 tissue samples (blood, skin, adipose, kidney, liver, lung, muscle, and cerebral cortex). From these data, we generated five epigenetic clocks for macaques. These clocks differ with regard to applicability to different tissue types (pan-tissue, blood, skin), species (macaque only or both humans and macaques), and measure of age (chronological age versus relative age). Additionally, the age-based human-macaque clock exhibits a high age correlation (R = 0.89) with the vervet monkey (Chlorocebus sabaeus), another Old World species. Four CpGs within the KLF14 promoter were consistently altered with age in four tissues (adipose, blood, cerebral cortex, skin). Future studies will be needed to evaluate whether these epigenetic clocks predict age-related conditions in the rhesus macaque.
|
20
|
Identical twins carry a persistent epigenetic signature of early genome programming. Nat Commun 2021; 12:5618. [PMID: 34584077 PMCID: PMC8479069 DOI: 10.1038/s41467-021-25583-7] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Received: 02/27/2021] [Accepted: 07/19/2021] [Indexed: 02/08/2023] Open
Abstract
Monozygotic (MZ) twins and higher-order multiples arise when a zygote splits during pre-implantation stages of development. The mechanisms underpinning this event have remained a mystery. Because MZ twinning rarely runs in families, the leading hypothesis is that it occurs at random. Here, we show that MZ twinning is strongly associated with a stable DNA methylation signature in adult somatic tissues. This signature spans regions near telomeres and centromeres, Polycomb-repressed regions and heterochromatin, genes involved in cell-adhesion, WNT signaling, cell fate, and putative human metastable epialleles. Our study also demonstrates a never-anticipated corollary: because identical twins keep a lifelong molecular signature, we can retrospectively diagnose if a person was conceived as monozygotic twin.
|
21
|
Comparison of phenotypes and outcomes following resection of incidental versus symptomatic pancreatic neuroendocrine tumors. REVISTA ESPANOLA DE ENFERMEDADES DIGESTIVAS 2021; 114:317-322. [PMID: 34315216 DOI: 10.17235/reed.2021.8096/2021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/20/2022]
Abstract
Background/Aim Fifty to 70 percent of pancreatic neuroendocrine tumors are diagnosed incidentally. The objective of this study is to compare the phenotype and oncological outcomes of incidental versus symptomatic pancreatic neuroendocrine tumors. Methods A retrospective study was conducted identifying all incidental and symptomatic tumors resected between 2000 and 2019. Baseline characteristics, symptoms, operative variables and pathological stage were all recorded. In both groups, patterns of recurrence and overall and disease-free survival were analyzed. Results Fifty-one incidental and 45 symptomatic pancreatic tumor resections were performed. Symptomatic tumors were more frequent in women (29 vs 17; p=0.005), in younger patients (median age 50 vs 58 years; p=0.012) and were detected at a more advanced stage (p=0.027). There were no differences in location and most resections (n=49; 51%) were performed laparoscopically. There were no operative mortalities and 17 (17.7%) severe complications (≥IIIb on the Clavien-Dindo classification) were recorded with no differences between the two groups. With a median follow-up of 64.4 months (range 13.5 - 90), overall survival at 5 and 10 years was 89.7% and 72.8% for the non-incidental tumors, and 80.9% and 54.6% for the incidental tumors (p=ns). Disease-free survival in both groups (excluding M1a) was 71.2% and 47.5%, and 93.7% and 78.1%, respectively (p=ns). Conclusions Symptomatic tumors are more frequent in women and present at higher pathological stages. There were no significant differences in overall and disease-free survival between the two groups. Resection of incidental tumors ≥1.5 - 2 cm seems advisable, although each case should be assessed on an individual basis.
|
22
|
Author Correction: DNA methylation predicts age and provides insight into exceptional longevity of bats. Nat Commun 2021; 12:2652. [PMID: 33953189 PMCID: PMC8100179 DOI: 10.1038/s41467-021-23129-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Indexed: 11/09/2022] Open
|
23
|
Epigenome-wide association study of kidney function identifies trans-ethnic and ethnic-specific loci. Genome Med 2021; 13:74. [PMID: 33931109 PMCID: PMC8088054 DOI: 10.1186/s13073-021-00877-z] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Received: 07/10/2020] [Accepted: 03/24/2021] [Indexed: 11/18/2022] Open
Abstract
BACKGROUND DNA methylation (DNAm) is associated with gene regulation and estimated glomerular filtration rate (eGFR), a measure of kidney function. Decreased eGFR is more common among US Hispanics and African Americans. The causes for this are poorly understood. We aimed to identify trans-ethnic and ethnic-specific differentially methylated positions (DMPs) associated with eGFR using an agnostic, genome-wide approach. METHODS The study included up to 5428 participants from multi-ethnic studies for discovery and 8109 participants for replication. We tested the associations between whole blood DNAm and eGFR using beta values from Illumina 450K or EPIC arrays. Ethnicity-stratified analyses were performed using linear mixed models adjusting for age, sex, smoking, and study-specific and technical variables. Summary results were meta-analyzed within and across ethnicities. Findings were assessed using integrative epigenomics methods and pathway analyses. RESULTS We identified 93 DMPs associated with eGFR at an FDR of 0.05 and replicated 13 and 1 DMPs across independent samples in trans-ethnic and African American meta-analyses, respectively. The study also validated 6 previously published DMPs. Identified DMPs showed significant overlap enrichment with DNase I hypersensitive sites in kidney tissue, sites associated with the expression of proximal genes, and transcription factor motifs and pathways associated with kidney tissue and kidney development. CONCLUSIONS We uncovered trans-ethnic and ethnic-specific DMPs associated with eGFR, including DMPs enriched in regulatory elements in kidney tissue and pathways related to kidney development. These findings shed light on epigenetic mechanisms associated with kidney function, bridging the gap between population-specific eGFR-associated DNAm and tissue-specific regulatory context.
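The abstract above states only that summary results were meta-analyzed; it does not name the model. As a minimal sketch, assuming a fixed-effect inverse-variance model (an assumption, not something the abstract confirms), per-study effect estimates for a single CpG could be pooled as follows. The function name and the example inputs are hypothetical.

```python
import math


def fixed_effect_meta(estimates, std_errors):
    """Pool per-study effect estimates with inverse-variance weights.

    Assumption: a fixed-effect model; the source abstract does not say
    which meta-analysis model was used.
    Returns (pooled_estimate, pooled_standard_error, two_sided_p_value).
    """
    if len(estimates) != len(std_errors) or not estimates:
        raise ValueError("need one standard error per estimate")
    # Each study is weighted by the inverse of its sampling variance.
    weights = [1.0 / (se ** 2) for se in std_errors]
    total_weight = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, estimates)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    # Two-sided p-value from the standard normal distribution.
    z = pooled / pooled_se
    p_value = math.erfc(abs(z) / math.sqrt(2.0))
    return pooled, pooled_se, p_value


# Hypothetical effect sizes for one CpG in two cohorts.
pooled, pooled_se, p_value = fixed_effect_meta([0.10, 0.20], [0.10, 0.10])
```

With equal standard errors the pooled estimate is the plain average of the two study estimates, and the pooled standard error shrinks by a factor of the square root of the number of studies.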
|
24
|
Prostate cancer in the British Asian population: A case-control study. JOURNAL OF CLINICAL UROLOGY 2020. [DOI: 10.1177/2051415820969574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/16/2022]
Abstract
Objectives: To study demographic and disease variables at presentation, diagnosis and treatment of prostate cancer in British South Asian (SA) men and compare with their Caucasian counterparts. Methods: A retrospective review of prostate cancer cases between 2010 and 2015 identified patients of SA descent who were matched for age, residential location and year of diagnosis with two Caucasian patients. Disease characteristics, treatment and short-term outcomes were recorded and compared. Results: A total of 1274 patients were diagnosed, of which 50 (3.9%) were SA. SA patients had less screen-detected prostate cancer (12% versus 31%, p=0.02), more co-morbidity and more high-risk disease (57% versus 37%, p=0.03). Active surveillance was the preferred treatment option (32%) with 2% having surgery compared with 22% (p=0.003) from the Caucasian cohort. Logistic regression showed these disparities to be related to the different presentations. Conclusion: SA men with prostate cancer are less likely to be diagnosed following asymptomatic screening and the majority have high-risk disease at presentation. These differences lead to SA men preferring active surveillance to radical surgery. Given the increasing British Asian population, the findings demonstrate a need for research and public health interventions to better understand and address the differences noted between these groups.
|
25
|
The Fate of Ureteral Memokath Stent(s) in a High-Volume Referral Center: An Independent Long-Term Outcomes Review. J Endourol 2020; 35:180-186. [PMID: 32762263 DOI: 10.1089/end.2020.0542] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Indexed: 11/12/2022] Open
Abstract
Purpose: To independently assess upper urinary tract Memokath (MMK-051) stent outcomes in a national tertiary referral center. Materials and Methods: Two researchers, completely independent of the treating team, reviewed electronic MMK-051 stent(s) patient management records. Outcomes included time to first complication, complication severity, MMK-051 stent lifespan and change incidence, salvage therapy, further surgical intervention, and mortality. Results and Limitations: One hundred patients received 162 MMK-051 stent(s) (59% with malignant and 63% with distal ureteral obstruction [UO]) with only three lost to follow-up (FU). At 5-year mean FU, only 25 patients had complication-free original MMK-051 stents (14 alive, 11 dead). Of the remaining 75 patients, 22 had other stents, 12 had major surgery (e.g., nephrectomy), 3 became dialysis dependent, and 14 stabilized without ureteral stenting after original MMK-051 removal. Malignant obstruction patients had greater original MMK-051 stent longevity (p < 0.02), but also 20 of the 21 deaths (95%). The 72% mean 5-year stent complication rate included migration (46%), blockage (34%), nonfunctioning kidney (8%), urosepsis needing intravenous antibiotics (8%), and others (6%), including one postoperative death, one ureteral injury, and two with intractable pain. Median time to first complication was 12.5 months. Conclusions: MMK-051 stents had optimal utility in managing malignant UO and in those unfit for corrective surgery. Longer independently assessed mean 5-year outcomes review revealed much higher complication rates (72%) than previously reported. Future international metallic ureteral stent guidelines should encourage clinicians to adopt patient-centered multidisciplinary assessment and selection, with counseling plus goal-setting, and harmonized long-term protocol-based reporting, for optimized future patient safety and outcomes.
|
26
|
EPISCORE: cell type deconvolution of bulk tissue DNA methylomes from single-cell RNA-Seq data. Genome Biol 2020; 21:221. [PMID: 32883324 PMCID: PMC7650528 DOI: 10.1186/s13059-020-02126-9] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Received: 04/16/2020] [Accepted: 07/29/2020] [Indexed: 12/19/2022] Open
Abstract
Cell type heterogeneity presents a challenge to the interpretation of epigenome data, compounded by the difficulty in generating reliable single-cell DNA methylomes for large numbers of cells and samples. We present EPISCORE, a computational algorithm that performs virtual microdissection of bulk tissue DNA methylation data at single cell-type resolution for any solid tissue. EPISCORE applies a probabilistic epigenetic model of gene regulation to a single-cell RNA-seq tissue atlas to generate a tissue-specific DNA methylation reference matrix, allowing quantification of cell-type proportions and cell-type-specific differential methylation signals in bulk tissue data. We validate EPISCORE in multiple epigenome studies and tissue types.
|
27
|
eFORGE v2.0: updated analysis of cell type-specific signal in epigenomic data. Bioinformatics 2020; 35:4767-4769. [PMID: 31161210 PMCID: PMC6853678 DOI: 10.1093/bioinformatics/btz456] [Citation(s) in RCA: 68] [Impact Index Per Article: 17.0] [Received: 11/11/2018] [Revised: 04/24/2019] [Accepted: 05/29/2019] [Indexed: 12/31/2022] Open
Abstract
SUMMARY The Illumina Infinium EPIC BeadChip is a new high-throughput array for DNA methylation analysis, extending the earlier 450k array by over 400 000 new sites. Previously, a method named eFORGE was developed to provide insights into cell type-specific and cell-composition effects for 450k data. Here, we present a significantly updated and improved version of eFORGE that can analyze both EPIC and 450k array data. New features include analysis of chromatin states, transcription factor motifs and DNase I footprints, providing tools for epigenome-wide association study interpretation and epigenome editing. AVAILABILITY AND IMPLEMENTATION eFORGE v2.0 is implemented as a web tool available from https://eforge.altiusinstitute.org and https://eforge-tf.altiusinstitute.org/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
|
28
|
Betel nut chewing in high-income countries-lack of awareness and regulation. Lancet Oncol 2020; 20:181-183. [PMID: 30723038 DOI: 10.1016/s1470-2045(18)30911-2] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 11/30/2018] [Accepted: 11/30/2018] [Indexed: 01/31/2023]
|
29
|
EpiDISH web server: Epigenetic Dissection of Intra-Sample-Heterogeneity with online GUI. Bioinformatics 2019; 36:btz833. [PMID: 31710662 PMCID: PMC7703755 DOI: 10.1093/bioinformatics/btz833] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Received: 03/11/2019] [Revised: 03/11/2019] [Accepted: 11/05/2019] [Indexed: 12/22/2022] Open
Abstract
SUMMARY It is well recognized that cell-type heterogeneity hampers the interpretation of Epigenome-Wide Association Studies (EWAS). Many tools have emerged to address this issue, including several R/Bioconductor packages that infer cell-type composition. Here we present a web application for cell-type deconvolution, which offers the functionality of our EpiDISH Bioconductor/R package in a user-friendly GUI environment. Users can upload their data to infer cell-type composition and differentially methylated cytosines in individual cell-types (DMCTs) for a range of different tissues. AVAILABILITY AND IMPLEMENTATION EpiDISH web server is implemented with Shiny in R, and is freely available at https://www.biosino.org/EpiDISH/.
|
30
|
Tobacco smoking induces changes in true DNA methylation, hydroxymethylation and gene expression in bronchoalveolar lavage cells. EBioMedicine 2019; 46:290-304. [PMID: 31303497 PMCID: PMC6710853 DOI: 10.1016/j.ebiom.2019.07.006] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Received: 05/09/2019] [Revised: 06/28/2019] [Accepted: 07/02/2019] [Indexed: 12/21/2022] Open
Abstract
Background While smoking is known to be associated with the development of multiple diseases, the underlying mechanisms are still poorly understood. Tobacco smoking can modify the chemical integrity of DNA leading to changes in transcriptional activity, partly through an altered epigenetic state. We aimed to investigate the impact of smoking on lung cells collected from bronchoalveolar lavage (BAL). Methods We profiled changes in DNA methylation (5mC) and its oxidised form hydroxymethylation (5hmC) using conventional bisulphite (BS) treatment and oxidative bisulphite treatment with Illumina Infinium MethylationEPIC BeadChip, and examined gene expression by RNA-seq in healthy smokers. Findings We identified 1667 total 5mC + 5hmC, 1756 5mC and 67 5hmC differentially methylated positions (DMPs) between smokers and non-smokers (FDR-adjusted P < .05, absolute Δβ > 0.15). Both 5mC DMPs and to a lesser extent 5mC + 5hmC were predominantly hypomethylated. In contrast, almost all 5hmC DMPs were hypermethylated, supporting the hypothesis that smoking-associated oxidative stress can lead to DNA demethylation, via the established sequential oxidation of which 5hmC is the first step. While we confirmed differential methylation of previously reported smoking-associated 5mC + 5hmC CpGs using former generations of BeadChips in alveolar macrophages, the large majority of identified DMPs, 5mC + 5hmC (1639/1667), 5mC (1738/1756), and 5hmC (67/67), have not been previously reported. Most of these novel smoking-associated sites are specific to the EPIC BeadChip and, interestingly, many of them are associated with FANTOM5 enhancers. Transcriptional changes affecting 633 transcripts were consistent with DNA methylation profiles and converged on the alteration of genes involved in migration, signalling and inflammatory response of immune cells.
Interpretation Collectively, these findings suggest that tobacco smoke exposure epigenetically modifies BAL cells, possibly involving a continuous active demethylation and subsequent increased activity of inflammatory processes in the lungs. Fund The study was supported by the Swedish Research Council, the Swedish Heart-Lung Foundation, the Stockholm County Council (ALF), the King Gustav's and Queen Victoria's Freemasons' Foundation, Knut and Alice Wallenberg Foundation, Neuro Sweden, and the Swedish MS foundation.
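The DMP criteria quoted in the Findings above (FDR-adjusted P below .05 and absolute Δβ above 0.15) can be illustrated with a short sketch. The abstract does not say which FDR procedure was used; the Benjamini-Hochberg step-up adjustment below is an assumption, and the function names, probe identifiers and numbers are hypothetical.

```python
def bh_adjust(p_values):
    """Benjamini-Hochberg adjusted p-values, returned in input order.

    Assumption: the source abstract reports FDR-adjusted P values but
    does not name the adjustment procedure.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


def call_dmps(probes, fdr=0.05, min_abs_delta_beta=0.15):
    """Keep probes passing both the FDR and the effect-size threshold.

    probes: list of (probe_id, raw_p_value, delta_beta) tuples.
    """
    adjusted = bh_adjust([p for _, p, _ in probes])
    return [
        probe_id
        for (probe_id, _, delta_beta), q in zip(probes, adjusted)
        if q < fdr and abs(delta_beta) > min_abs_delta_beta
    ]


# Hypothetical probes: (id, raw p-value, smoker minus non-smoker beta).
example = [
    ("cg_a", 0.001, -0.20),
    ("cg_b", 0.010, 0.05),
    ("cg_c", 0.040, 0.30),
    ("cg_d", 0.500, 0.40),
]
selected = call_dmps(example)
```

In this toy input only `cg_a` passes both filters: `cg_b` has too small an effect, and `cg_c` and `cg_d` exceed the FDR cut-off after adjustment. Applying both thresholds together is what keeps large but noisy differences, and significant but tiny ones, out of the DMP list.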
|
31
|
|
32
|
Deciphering the genomic, epigenomic, and transcriptomic landscapes of pre-invasive lung cancer lesions. Nat Med 2019; 25:517-525. [PMID: 30664780 PMCID: PMC7614970 DOI: 10.1038/s41591-018-0323-0] [Citation(s) in RCA: 144] [Impact Index Per Article: 28.8] [Received: 09/05/2018] [Accepted: 12/05/2018] [Indexed: 01/10/2023]
Abstract
The molecular alterations that occur in cells before cancer is manifest are largely uncharted. Lung carcinoma in situ (CIS) lesions are the pre-invasive precursor to squamous cell carcinoma. Although microscopically identical, their future is in equipoise, with half progressing to invasive cancer and half regressing or remaining static. The cellular basis of this clinical observation is unknown. Here, we profile the genomic, transcriptomic, and epigenomic landscape of CIS in a unique patient cohort with longitudinally monitored pre-invasive disease. Predictive modeling identifies which lesions will progress with remarkable accuracy. We identify progression-specific methylation changes on a background of widespread heterogeneity, alongside a strong chromosomal instability signature. We observed mutations and copy number changes characteristic of cancer and chart their emergence, offering a window into early carcinogenesis. We anticipate that this new understanding of cancer precursor biology will improve early detection, reduce overtreatment, and foster preventative therapies targeting early clonal events in lung cancer.
|
33
|
Snoring and breathing pauses during sleep: interview survey of a United Kingdom population sample reveals a significant increase in the rates of sleep apnoea and obesity over the last 20 years - data from the UK sleep survey. Sleep Med 2018; 54:250-256. [PMID: 30597439 DOI: 10.1016/j.sleep.2018.08.029] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Received: 04/28/2018] [Revised: 07/06/2018] [Accepted: 08/31/2018] [Indexed: 10/28/2022]
Abstract
STUDY OBJECTIVES (1) To determine the prevalence of snoring, breathing pauses during sleep and obstructive sleep apnoea syndrome in the United Kingdom (UK) and determine the relation between these events and obesity and other sociodemographic variables. (2) To compare and integrate these data with published UK population data. METHODS A total of 664 women and 575 men aged 18-100 years who formed a representative sample of the non-institutionalised UK population participated in an online interview survey directed by a previously validated computerised system. RESULTS Overall, 38% of men and 30.4% of women report that they snore at night. Furthermore, 8.7% of men and 5.6% of women state that they stop breathing at night. Comparing our data to published data from the 1990s, this study observes a highly significant increase in the rates of reported breathing pauses during sleep (sleep apnoea) in the UK over the last 20 years (p < 0.0001). In addition, we observe a highly significant increase in the prevalence of obesity (BMI>30) in the UK population between 1994 and 2015 (p < 0.0001). Integration of our data with NHS and Public Health England data on obesity confirms this increase. CONCLUSIONS Our data demonstrate a significant increase in the rates of reported breathing pauses during sleep (sleep apnoea) and obesity in the UK over the last 20 years. Sociodemographic and behavioural changes have likely contributed to this. Moreover, our data also suggest that sleep disordered breathing (SDB) is widely underdiagnosed in the UK.
|
34
|
Genome-wide analysis of DNA methylation in buccal cells: a study of monozygotic twins and mQTLs. Epigenetics Chromatin 2018; 11:54. [PMID: 30253792 PMCID: PMC6156977 DOI: 10.1186/s13072-018-0225-x] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Received: 07/12/2018] [Accepted: 09/17/2018] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND DNA methylation arrays are widely used in epigenome-wide association studies and methylation quantitative trait locus (mQTL) studies. Here, we performed the first genome-wide analysis of monozygotic (MZ) twin correlations and mQTLs on data obtained with the Illumina MethylationEPIC BeadChip (EPIC array) and compared the performance of the EPIC array to the Illumina HumanMethylation450 BeadChip (HM450 array) for buccal-derived DNA. RESULTS Good-quality EPIC data were obtained for 102 buccal-derived DNA samples from 49 MZ twin pairs (mean age = 7.5 years, range = 1-10). Differences between MZ twins in the cellular content of buccal swabs were a major driver for differences in their DNA methylation profiles, highlighting the importance of adjusting for cellular composition in DNA methylation studies of buccal-derived DNA. After adjusting for cellular composition, the genome-wide mean correlation (r) between MZ twins was 0.21 for the EPIC array, and cis mQTL analysis in 84 twins identified 1,296,323 significant associations (FDR 5%), encompassing 33,749 methylation sites and 616,029 genetic variants. MZ twin correlations were slightly larger (p < 2.2 × 10⁻¹⁶) for novel EPIC probes (N = 383,066, mean r = 0.22) compared to probes that are also present on HM450 (N = 406,822, mean r = 0.20). In line with this observation, a larger percentage of novel EPIC probes was associated with genetic variants (novel EPIC probes with significant mQTL 4.7%, HM450 probes with mQTL 3.9%, p < 2.2 × 10⁻¹⁶). Methylation sites with a large MZ correlation and sites associated with mQTLs were most strongly enriched in epithelial cell DNase I hypersensitive sites (DHSs), enhancers, and histone mark H3K4me3. CONCLUSIONS We conclude that the contribution of familial factors to individual differences in DNA methylation and the effect of mQTLs are larger for novel EPIC probes, especially those within regulatory elements connected to active regions specific to the investigated tissue.
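The per-probe MZ twin correlation reported above can be illustrated with a small sketch. The abstract does not state which correlation estimator was used; a Pearson correlation across twin pairs is assumed here, and the function name and methylation values are hypothetical.

```python
import math


def pearson_r(x, y):
    """Pearson correlation between two equally long sequences.

    Here x holds the methylation value of one probe in twin 1 of each
    pair and y the value in the co-twin (an illustrative layout, not
    taken from the source).
    """
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need at least two paired observations")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    s_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    s_xx = sum((a - mean_x) ** 2 for a in x)
    s_yy = sum((b - mean_y) ** 2 for b in y)
    if s_xx == 0.0 or s_yy == 0.0:
        raise ValueError("correlation is undefined for constant input")
    return s_xy / math.sqrt(s_xx * s_yy)


# Hypothetical beta values for one probe in three MZ twin pairs.
twin_1 = [0.10, 0.20, 0.30]
twin_2 = [0.20, 0.40, 0.60]
r = pearson_r(twin_1, twin_2)
```

Repeating this for every probe and averaging the resulting values would give a genome-wide mean correlation of the kind the abstract reports, once the methylation values have been adjusted for cellular composition.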
|
35
|
eFORGE: A Tool for Identifying Cell Type-Specific Signal in Epigenomic Data. Cell Rep 2017; 17:2137-2150. [PMID: 27851974 PMCID: PMC5120369 DOI: 10.1016/j.celrep.2016.10.059] [Citation(s) in RCA: 81] [Impact Index Per Article: 11.6] [Received: 02/18/2016] [Revised: 08/25/2016] [Accepted: 09/30/2016] [Indexed: 12/14/2022] Open
Abstract
Epigenome-wide association studies (EWAS) provide an alternative approach for studying human disease through consideration of non-genetic variants such as altered DNA methylation. To advance the complex interpretation of EWAS, we developed eFORGE (http://eforge.cs.ucl.ac.uk/), a new standalone and web-based tool for the analysis and interpretation of EWAS data. eFORGE determines the cell type-specific regulatory component of a set of EWAS-identified differentially methylated positions. This is achieved by detecting enrichment of overlap with DNase I hypersensitive sites across 454 samples (tissues, primary cell types, and cell lines) from the ENCODE, Roadmap Epigenomics, and BLUEPRINT projects. Application of eFORGE to 20 publicly available EWAS datasets identified disease-relevant cell types for several common diseases, a stem cell-like signature in cancer, and demonstrated the ability to detect cell-composition effects for EWAS performed on heterogeneous tissues. Our approach bridges the gap between large-scale epigenomics data and EWAS-derived target selection to yield insight into disease etiology.
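The idea of detecting enrichment of overlap between differentially methylated positions and DNase I hypersensitive sites can be sketched in a few lines. This is a simplified illustration and not eFORGE's implementation: it assumes a one-sided binomial test against a background overlap rate supplied by the caller, and the function names, coordinates and rate are all hypothetical.

```python
import math


def binomial_upper_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(
        math.comb(n, i) * (p ** i) * ((1.0 - p) ** (n - i))
        for i in range(k, n + 1)
    )


def count_overlaps(positions, dhs_by_chrom):
    """Count positions falling inside any half-open [start, end) interval."""
    hits = 0
    for chrom, pos in positions:
        intervals = dhs_by_chrom.get(chrom, [])
        if any(start <= pos < end for start, end in intervals):
            hits += 1
    return hits


def overlap_enrichment_p(positions, dhs_by_chrom, background_rate):
    """One-sided p-value for observing at least this many overlaps.

    background_rate is the assumed probability that a background probe
    overlaps a hypersensitive site in the same sample (hypothetical input).
    """
    hits = count_overlaps(positions, dhs_by_chrom)
    return binomial_upper_tail(hits, len(positions), background_rate)


# Hypothetical DMP coordinates and DHS intervals for one cell type.
dmps = [("chr1", 150), ("chr1", 500), ("chr2", 75)]
dhs = {"chr1": [(100, 200)], "chr2": [(50, 100)]}
p_value = overlap_enrichment_p(dmps, dhs, background_rate=0.1)
```

Running the same test once per sample and comparing the resulting p-values is one way to rank tissues or cell types by how strongly a DMP set coincides with their open chromatin.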
|
36
|
A comparison of reference-based algorithms for correcting cell-type heterogeneity in Epigenome-Wide Association Studies. BMC Bioinformatics 2017; 18:105. [PMID: 28193155 PMCID: PMC5307731 DOI: 10.1186/s12859-017-1511-5] [Citation(s) in RCA: 228] [Impact Index Per Article: 32.6] [Received: 05/27/2016] [Accepted: 01/31/2017] [Indexed: 12/29/2022] Open
Abstract
BACKGROUND Intra-sample cellular heterogeneity presents numerous challenges to the identification of biomarkers in large Epigenome-Wide Association Studies (EWAS). While a number of reference-based deconvolution algorithms have emerged, their potential remains underexplored and a comparative evaluation of these algorithms beyond tissues such as blood is still lacking. RESULTS Here we present a novel framework for reference-based inference, which leverages cell-type specific DNAse Hypersensitive Site (DHS) information from the NIH Epigenomics Roadmap to construct an improved reference DNA methylation database. We show that this leads to a marginal but statistically significant improvement of cell-count estimates in whole blood as well as in mixtures involving epithelial cell-types. Using this framework we compare a widely used state-of-the-art reference-based algorithm (called constrained projection) to two non-constrained approaches including CIBERSORT and a method based on robust partial correlations. We conclude that the widely-used constrained projection technique may not always be optimal. Instead, we find that the method based on robust partial correlations is generally more robust across a range of different tissue types and for realistic noise levels. We call the combined algorithm which uses DHS data and robust partial correlations for inference, EpiDISH (Epigenetic Dissection of Intra-Sample Heterogeneity). Finally, we demonstrate the added value of EpiDISH in an EWAS of smoking. CONCLUSIONS Estimating cell-type fractions and subsequent inference in EWAS may benefit from the use of non-constrained reference-based cell-type deconvolution methods.
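To make the notion of reference-based deconvolution concrete, the sketch below estimates the fraction of one cell type in a bulk sample from two reference methylation profiles. It illustrates only the constrained (non-negative, sum-to-one) least-squares idea for the two-cell-type case; it is not EpiDISH, does not implement robust partial correlations or CIBERSORT, and the function name and all values are hypothetical.

```python
def estimate_fraction(bulk, ref_a, ref_b):
    """Least-squares fraction of cell type A in a two-cell-type mixture.

    Model (an illustrative assumption): bulk[i] is approximately
    w * ref_a[i] + (1 - w) * ref_b[i] for each CpG i, with 0 <= w <= 1.
    With one free parameter the objective is a convex quadratic, so
    clipping the unconstrained minimiser to [0, 1] gives the constrained
    optimum.
    """
    if not (len(bulk) == len(ref_a) == len(ref_b)) or not bulk:
        raise ValueError("profiles must be non-empty and equally long")
    numerator = sum((y - b) * (a - b) for y, a, b in zip(bulk, ref_a, ref_b))
    denominator = sum((a - b) ** 2 for a, b in zip(ref_a, ref_b))
    if denominator == 0.0:
        raise ValueError("reference profiles are identical")
    w = numerator / denominator
    return min(1.0, max(0.0, w))


# Hypothetical reference beta values at three CpGs for two cell types.
ref_a = [0.9, 0.1, 0.5]
ref_b = [0.1, 0.8, 0.5]
# A noise-free bulk profile made of 30% type A and 70% type B.
bulk = [0.3 * a + 0.7 * b for a, b in zip(ref_a, ref_b)]
fraction_a = estimate_fraction(bulk, ref_a, ref_b)
```

CpGs at which the two references agree (the third one here) contribute nothing to the estimate, which is why reference panels are built from sites that discriminate between cell types.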
|
37
|
Where next for the reproducibility agenda in computational biology? BMC SYSTEMS BIOLOGY 2016; 10:52. [PMID: 27422148 PMCID: PMC4946111 DOI: 10.1186/s12918-016-0288-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Received: 03/22/2016] [Accepted: 06/08/2016] [Indexed: 11/24/2022]
Abstract
Background The concept of reproducibility is a foundation of the scientific method. With the arrival of fast and powerful computers over the last few decades, there has been an explosion of results based on complex computational analyses and simulations. The reproducibility of these results has been addressed mainly in terms of exact replicability or numerical equivalence, ignoring the wider issue of the reproducibility of conclusions through equivalent, extended or alternative methods. Results We use case studies from our own research experience to illustrate how concepts of reproducibility might be applied in computational biology. Several fields have developed ‘minimum information’ checklists to support the full reporting of computational simulations, analyses and results, and standardised data formats and model description languages can facilitate the use of multiple systems to address the same research question. We note the importance of defining the key features of a result to be reproduced, and the expected agreement between original and subsequent results. Dynamic, updatable tools for publishing methods and results are becoming increasingly common, but sometimes come at the cost of clear communication. In general, the reproducibility of computational research is improving but would benefit from additional resources and incentives. Conclusions We conclude with a series of linked recommendations for improving reproducibility in computational biology through communication, policy, education and research practice. More reproducible research will lead to higher quality conclusions, deeper understanding and more valuable knowledge.
|
38
|
Tissue-independent and tissue-specific patterns of DNA methylation alteration in cancer. Epigenetics Chromatin 2016; 9:10. [PMID: 26958079 PMCID: PMC4782576 DOI: 10.1186/s13072-016-0058-4] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Received: 01/09/2016] [Accepted: 02/11/2016] [Indexed: 12/13/2022] Open
Abstract
Background There is growing evidence that DNA methylation alterations contribute to carcinogenesis. While cancer tissue exhibits widespread DNA methylation changes, the proportion of tissue-specific versus tissue-independent DNA methylation alterations in cancer is unclear. In addition, it is unknown which factors determine the patterns of aberrant DNA methylation in cancer. Results Using HumanMethylation450 BeadChips (450k), we here analyze genome-wide DNA methylation patterns of ten types of fetal tissue, in addition to matched normal-cancer data for corresponding tissue types, encompassing over 3000 samples. We demonstrate that the level of aberrant cancer DNA methylation in gene promoters and gene bodies is highly correlated between cancer types. We estimate that up to 60% of the DNA methylation variation in a cancer genome of a given tissue type is explained by the corresponding variation in a cancer genome of another type, implying that much of the cancer DNA methylation landscape is tissue independent. We further show that histone marks in normal cells are better predictors of aberrant cancer DNA methylation than the corresponding signals in human embryonic stem cells. We build predictors of cancer DNA methylation patterns and show that although inclusion of three histone marks (H3K4me3, H3K27me3 and H3K36me3) improves model accuracy, the bivalent marks are the most predictive. Finally, we show that chromatin accessibility of gene promoters in normal tissue dictates the promoter’s propensity to acquire aberrant DNA methylation in cancer in so far as it determines its level of DNA methylation in normal tissue. Conclusions Our data show that a considerable fraction of the aberrant cancer DNA methylation landscape results from a mechanism that is largely tissue independent. Histone marks as specified in the normal cell of origin provide highly predictive models of aberrant cancer DNA methylation and outperform those derived from the same marks in hESCs.
Electronic supplementary material The online version of this article (doi:10.1186/s13072-016-0058-4) contains supplementary material, which is available to authorized users.
|