1
|
Alsheikh AJ, Wollenhaupt S, King EA, Reeb J, Ghosh S, Stolzenburg LR, Tamim S, Lazar J, Davis JW, Jacob HJ. The landscape of GWAS validation; systematic review identifying 309 validated non-coding variants across 130 human diseases. BMC Med Genomics 2022; 15:74. [PMID: 35365203 PMCID: PMC8973751 DOI: 10.1186/s12920-022-01216-w] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Accepted: 03/17/2022] [Indexed: 02/08/2023] Open
Abstract
Background The remarkable growth of genome-wide association studies (GWAS) has created a critical need to experimentally validate the disease-associated variants, 90% of which involve non-coding variants. Methods To determine how the field is addressing this urgent need, we performed a comprehensive literature review identifying 36,676 articles. These were reduced to 1454 articles through a set of filters using natural language processing and ontology-based text-mining. This was followed by manual curation and cross-referencing against the GWAS catalog, yielding a final set of 286 articles. Results We identified 309 experimentally validated non-coding GWAS variants, regulating 252 genes across 130 human disease traits. These variants covered a variety of regulatory mechanisms. Interestingly, 70% (215/309) acted through cis-regulatory elements, with the remaining through promoters (22%, 70/309) or non-coding RNAs (8%, 24/309). Several validation approaches were utilized in these studies, including gene expression (n = 272), transcription factor binding (n = 175), reporter assays (n = 171), in vivo models (n = 104), genome editing (n = 96) and chromatin interaction (n = 33). Conclusions This review of the literature is the first to systematically evaluate the status and the landscape of experimentation being used to validate non-coding GWAS-identified variants. Our results clearly underscore the multifaceted approach needed for experimental validation, have practical implications on variant prioritization and considerations of target gene nomination. While the field has a long way to go to validate the thousands of GWAS associations, we show that progress is being made and provide exemplars of validation studies covering a wide variety of mechanisms, target genes, and disease areas. Supplementary Information The online version contains supplementary material available at 10.1186/s12920-022-01216-w.
Collapse
Affiliation(s)
- Ammar J Alsheikh
- Genomics Research Center, AbbVie Inc, North Chicago, Illinois, 60064, USA.
| | - Sabrina Wollenhaupt
- Information Research, AbbVie Deutschland GmbH & Co. KG, 67061, Knollstrasse, Ludwigshafen, Germany
| | - Emily A King
- Genomics Research Center, AbbVie Inc, North Chicago, Illinois, 60064, USA
| | - Jonas Reeb
- Information Research, AbbVie Deutschland GmbH & Co. KG, 67061, Knollstrasse, Ludwigshafen, Germany
| | - Sujana Ghosh
- Genomics Research Center, AbbVie Inc, North Chicago, Illinois, 60064, USA
| | | | - Saleh Tamim
- Genomics Research Center, AbbVie Inc, North Chicago, Illinois, 60064, USA
| | - Jozef Lazar
- Genomics Research Center, AbbVie Inc, North Chicago, Illinois, 60064, USA
| | - J Wade Davis
- Genomics Research Center, AbbVie Inc, North Chicago, Illinois, 60064, USA
| | - Howard J Jacob
- Genomics Research Center, AbbVie Inc, North Chicago, Illinois, 60064, USA
| |
Collapse
|
2
|
Huang G, Liang D, Luo L, Lan C, Luo C, Xu H, Lai J. Significance of the lncRNAs MALAT1 and ANRIL in occurrence and development of glaucoma. J Clin Lab Anal 2022; 36:e24215. [PMID: 35028972 PMCID: PMC8842314 DOI: 10.1002/jcla.24215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2021] [Revised: 12/06/2021] [Accepted: 12/14/2021] [Indexed: 11/19/2022] Open
Abstract
Background Primary open‐angle glaucoma (POAG) is the commonest form of glaucoma which is estimated to cause bilaterally blind within 11.1 million people by 2020. Therefore, the primary objectives of this study were to investigate the clinical significance of single‐nucleotide polymorphisms (SNPs) in the lncRNAs MALAT1 and ANRIL in a Chinese Han POAG cohort. Methods Three hundred and forty‐six glaucoma patients and 263 healthy controls were recruited, and totally 14 SNPs in MALAT1 and ANRIL were genotyped between the two populations. Results The MALAT1 SNPs rs619586 (A>G), rs3200401 (C>T), and rs664589 (C>G) were associated with POAG risk, and the ANRIL SNPs rs2383207 (A>G), rs564398 (A>G), rs2157719 (A>G), rs7865618 (G>A), and rs4977574 (A>G) were associated with POAG (p < 0.05). The MALAT1 haplotypes ACG and ATC, comprised rs619586, rs3200401, and rs664589, increased POAG risk, and the ANRIL haplotype AAGAA, made up of rs2383207, rs7865618, rs4977574, rs564398, and rs2157719, show a significantly increased risk of POAG. In addition, rs619586 (A>G) of MALAT1 and rs564398/rs2157719 of ANRIL were associated with a smaller vertical cup‐to‐disc ratio, while rs619586 of MALAT1 and rs2383207/rs4977574 of ANRIL were associated with higher intraocular pressure in the POAG population. Conclusion Single‐nucleotide polymorphisms and haplotypes in ANRIL and MALAT1 were associated with POAG onset in our study population, which provide more possibilities to POAG diagnosis and treatment.
Collapse
Affiliation(s)
- Guoqiang Huang
- Department of Ophthalmology, Meizhou People's Hospital, Meizhou City, China
| | - Dong Liang
- Department of Ophthalmology, Meizhou People's Hospital, Meizhou City, China
| | - Lidan Luo
- Department of Ophthalmology, Meizhou People's Hospital, Meizhou City, China
| | - Chenghong Lan
- Department of Ophthalmology, Meizhou People's Hospital, Meizhou City, China
| | - Chengfeng Luo
- Department of Ophthalmology, Meizhou People's Hospital, Meizhou City, China
| | - Hongwang Xu
- Department of Ophthalmology, Meizhou People's Hospital, Meizhou City, China
| | - Jiangfeng Lai
- Department of Ophthalmology, Meizhou People's Hospital, Meizhou City, China
| |
Collapse
|
3
|
Sordillo JE, Cardenas A, Qi C, Rifas-Shiman SL, Coull B, Luttmann-Gibson H, Schwartz J, Kloog I, Hivert MF, DeMeo DL, Baccarelli AA, Xu CJ, Gehring U, Vonk JM, Koppelman G, Oken E, Gold DR. Residential PM 2.5 exposure and the nasal methylome in children. ENVIRONMENT INTERNATIONAL 2021; 153:106505. [PMID: 33872926 PMCID: PMC8823376 DOI: 10.1016/j.envint.2021.106505] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 02/27/2021] [Accepted: 03/05/2021] [Indexed: 05/28/2023]
Abstract
RATIONALE PM2.5-induced adverse effects on respiratory health may be driven by epigenetic modifications in airway cells. The potential impact of exposure duration on epigenetic alterations in the airways is not yet known. OBJECTIVES We aimed to study associations of fine particulate matter PM2.5 exposure with DNA methylation in nasal cells. METHODS We conducted nasal epigenome-wide association analyses within 503 children from Project Viva (mean age 12.9 y), and examined various exposure durations (1-day, 1-week, 1-month, 3-months and 1-year) prior to nasal sampling. We used residential addresses to estimate average daily PM2.5 at 1 km resolution. We collected nasal swabs from the anterior nares and measured DNA methylation (DNAm) using the Illumina MethylationEPIC BeadChip. We tested 719,075 high quality autosomal CpGs using CpG-by-CpG and regional DNAm analyses controlling for multiple comparisons, and adjusted for maternal education, household smokers, child sex, race/ethnicity, BMI z-score, age, season at sample collection and cell-type heterogeneity. We further corrected for bias and genomic inflation. We tested for replication in a cohort from the Netherlands (PIAMA). RESULTS In adjusted analyses, we found 362 CpGs associated with 1-year PM2.5 (FDR < 0.05), 20 CpGs passing Bonferroni correction (P < 7.0x10-8) and 10 Differentially Methylated Regions (DMRs). In 445 PIAMA participants (mean age 16.3 years) 11 of 203 available CpGs replicated at P < 0.05. We observed differential DNAm at/near genes implicated in cell cycle, immune and inflammatory responses. There were no CpGs or regions associated with PM2.5 levels at 1-day, 1-week, or 1-month prior to sample collection, although 2 CpGs were associated with past 3-month PM2.5. CONCLUSION We observed wide-spread DNAm variability associated with average past year PM2.5 exposure but we did not detect associations with shorter-term exposure. Our results suggest that nasal DNAm marks reflect chronic air pollution exposure.
Collapse
Affiliation(s)
- Joanne E Sordillo
- Division of Chronic Disease Research Across the Lifecourse, Department of Population Medicine, Harvard Medical School, and Harvard Pilgrim Health Care Institute, Boston, MA, USA
| | - Andres Cardenas
- Division of Environmental Health Sciences, School of Public Health, University of California, Berkeley, 2121 Berkeley Way, Berkeley, CA, USA
| | - Cancan Qi
- University of Groningen, University Medical Center Groningen, Department of Pediatric Pulmonology and Pediatric Allergy, Beatrix Children's Hospital, Groningen, the Netherlands; University of Groningen, University Medical Center Groningen, Groningen Research Institute for Asthma and COPD (GRIAC), Groningen, the Netherlands
| | - Sheryl L Rifas-Shiman
- Division of Chronic Disease Research Across the Lifecourse, Department of Population Medicine, Harvard Medical School, and Harvard Pilgrim Health Care Institute, Boston, MA, USA
| | - Brent Coull
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Heike Luttmann-Gibson
- Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Joel Schwartz
- Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Itai Kloog
- Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, MA, USA
| | - Marie-France Hivert
- Division of Chronic Disease Research Across the Lifecourse, Department of Population Medicine, Harvard Medical School, and Harvard Pilgrim Health Care Institute, Boston, MA, USA
| | - Dawn L DeMeo
- Channing Division of Network Medicine and Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Andrea A Baccarelli
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, NY, NY, USA
| | - Cheng-Jian Xu
- Research Group of Bioinformatics and Computational Genomics, CiiM, Centre for individualized infection medicine, a joint venture between Hannover Medical School and the Helmholtz Centre for Infection Research, Hannover, Germany; Department of Gastroenterology, Hepatology and Endocrinology, TWINCORE, Centre for Experimental and Clinical Infection Research, a joint venture between the Hannover Medical School and the Helmholtz Centre for Infection Research, Hannover, Germany; Department of Internal Medicine, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Ulrike Gehring
- Institute for Risk Assessment Sciences, Utrecht University, Utrecht, the Netherlands
| | - Judith M Vonk
- University of Groningen, University Medical Center Groningen, Groningen Research Institute for Asthma and COPD (GRIAC), Groningen, the Netherlands; University of Groningen, University Medical Center Groningen, Department of Epidemiology, Groningen, the Netherlands
| | - Gerard Koppelman
- University of Groningen, University Medical Center Groningen, Department of Pediatric Pulmonology and Pediatric Allergy, Beatrix Children's Hospital, Groningen, the Netherlands; University of Groningen, University Medical Center Groningen, Groningen Research Institute for Asthma and COPD (GRIAC), Groningen, the Netherlands
| | - Emily Oken
- Division of Chronic Disease Research Across the Lifecourse, Department of Population Medicine, Harvard Medical School, and Harvard Pilgrim Health Care Institute, Boston, MA, USA
| | - Diane R Gold
- Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, MA, USA; Channing Division of Network Medicine and Division of Pulmonary and Critical Care Medicine, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
4
|
Apellaniz-Ruiz M, McCluggage WG, Foulkes WD. DICER1-associated embryonal rhabdomyosarcoma and adenosarcoma of the gynecologic tract: Pathology, molecular genetics, and indications for molecular testing. Genes Chromosomes Cancer 2020; 60:217-233. [PMID: 33135284 DOI: 10.1002/gcc.22913] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Accepted: 10/21/2020] [Indexed: 12/21/2022] Open
Abstract
Gynecologic sarcomas are uncommon neoplasms, the majority occurring in the uterus. Due to the diverse nature of these, the description of "new" morphological types and the rarity of some of them, pathological diagnosis and treatment is often challenging. Finding genetic alterations specific to, and frequently occurring, in a certain type can aid in the diagnosis. DICER1 is a highly conserved ribonuclease crucial in the biogenesis of microRNAs and mutations in DICER1 (either somatic or germline) have been detected in a wide range of sarcomas including genitourinary embryonal rhabdomyosarcomas (ERMS) and adenosarcomas. Importantly, DICER1-associated sarcomas share morphological features irrespective of the site of origin such that the pathologist can strongly suspect a DICER1 association. A review of the literature shows that almost all gynecologic ERMS reported (outside of the vagina) harbor DICER1 alterations, while approximately 20% of adenosarcomas also do so. These two tumor types exhibit significant morphological overlap and DICER1 tumor testing may be helpful in distinguishing between them, because a negative result makes ERMS unlikely. Given that germline pathogenic DICER1 variants are frequent in uterine (corpus and cervix) ERMS and pathogenic germline variants in this gene cause a hereditary cancer predisposition syndrome (DICER1 syndrome), patients diagnosed with these neoplasms should be referred to medical genetic services. Cooperation between pathologists and geneticists is crucial and will help in improving the diagnosis and management of these uncommon sarcomas.
Collapse
Affiliation(s)
| | - W Glenn McCluggage
- Department of Pathology, Belfast Health and Social Care Trust, Belfast, UK
| | - William D Foulkes
- Department of Human Genetics, McGill University, Montréal, Québec, Canada
| |
Collapse
|
5
|
Li Z, Sun R, Bishop TC. Genome Dashboards: Framework and Examples. Biophys J 2020; 118:2077-2085. [PMID: 32171420 PMCID: PMC7203004 DOI: 10.1016/j.bpj.2020.02.018] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Revised: 01/28/2020] [Accepted: 02/18/2020] [Indexed: 12/11/2022] Open
Abstract
Genomics is a sequence-based informatics science and a three-dimensional-structure-based material science. However, in practice, most genomics researchers utilize sequence-based informatics approaches or three-dimensional-structure-based material science techniques, not both. This division is, at least in part, the result of historical developments rather than a fundamental necessity. The underlying computational tools, experimental techniques, and theoretical models were developed independently. The primary result presented here is a framework for the unification of informatics- and physics-based data associated with DNA, nucleosomes, and chromatin. The framework is based on the mathematical representation of geometrically exact rods and the generalization of DNA basepair step parameters. Data unification enables researchers to integrate computational, experimental, and theoretical approaches for the study of chromatin biology. The framework can be implemented using model-view-controller design principles, existing genome browsers, and existing molecular visualization tools. We developed a minimal, web-based genome dashboard, G-Dash-min, and applied it to two simple examples to demonstrate the usefulness of data unification and proof of concept. Genome dashboards developed using the framework and design principles presented here are extensible and customizable and are therefore more broadly applicable than the examples presented. We expect a number of purpose-specific genome dashboards to emerge as a novel means of investigating structure-function relationships for genomes that range from basepairs to entire chromosomes and for generating, validating, and testing mechanistic hypotheses.
Collapse
Affiliation(s)
- Zilong Li
- Chemistry and Physics, Louisiana Tech University, Ruston, Louisiana
| | - Ran Sun
- Chemistry and Physics, Louisiana Tech University, Ruston, Louisiana
| | - Thomas C Bishop
- Chemistry and Physics, Louisiana Tech University, Ruston, Louisiana.
| |
Collapse
|