1
|
Long E, Patel H, Golden A, Antony M, Yin J, Funderburk K, Feng J, Song L, Hoskins JW, Amundadottir LT, Hung RJ, Amos CI, Shi J, Rothman N, Lan Q, Choi J. High-throughput characterization of functional variants highlights heterogeneity and polygenicity underlying lung cancer susceptibility. Am J Hum Genet 2024:S0002-9297(24)00184-8. [PMID: 38906146 DOI: 10.1016/j.ajhg.2024.05.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 05/23/2024] [Accepted: 05/23/2024] [Indexed: 06/23/2024] Open
Abstract
Genome-wide association studies (GWASs) have identified numerous lung cancer risk-associated loci. However, decoding molecular mechanisms of these associations is challenging since most of these genetic variants are non-protein-coding with unknown function. Here, we implemented massively parallel reporter assays (MPRAs) to simultaneously measure the allelic transcriptional activity of risk-associated variants. We tested 2,245 variants at 42 loci from 3 recent GWASs in East Asian and European populations in the context of two major lung cancer histological types and exposure to benzo(a)pyrene. This MPRA approach identified one or more variants (median 11 variants) with significant effects on transcriptional activity at 88% of GWAS loci. Multimodal integration of lung-specific epigenomic data demonstrated that 63% of the loci harbored multiple potentially functional variants in linkage disequilibrium. While 22% of the significant variants showed allelic effects in both A549 (adenocarcinoma) and H520 (squamous cell carcinoma) cell lines, a subset of the functional variants displayed a significant cell-type interaction. Transcription factor analyses nominated potential regulators of the functional variants, including those with cell-type-specific expression and those predicted to bind multiple potentially functional variants across the GWAS loci. Linking functional variants to target genes based on four complementary approaches identified candidate susceptibility genes, including those affecting lung cancer cell growth. CRISPR interference of the top functional variant at 20q13.33 validated variant-to-gene connections, including RTEL1, SOX18, and ARFRP1. Our data provide a comprehensive functional analysis of lung cancer GWAS loci and help elucidate the molecular basis of heterogeneity and polygenicity underlying lung cancer susceptibility.
Collapse
Affiliation(s)
- Erping Long
- State Key Laboratory of Respiratory Health and Multimorbidity, Institute of Basic Medical Sciences, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China.
| | - Harsh Patel
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Alyxandra Golden
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Michelle Antony
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Jinhu Yin
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Karen Funderburk
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - James Feng
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Lei Song
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Jason W Hoskins
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Laufey T Amundadottir
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Rayjean J Hung
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, Canada
| | - Christopher I Amos
- Institute for Clinical and Translational Research, Baylor College of Medicine, Houston, TX, USA
| | - Jianxin Shi
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Nathaniel Rothman
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Qing Lan
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA
| | - Jiyeon Choi
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA.
| |
Collapse
|
2
|
Hoedjes KM, Grath S, Posnien N, Ritchie MG, Schlötterer C, Abbott JK, Almudi I, Coronado-Zamora M, Durmaz Mitchell E, Flatt T, Fricke C, Glaser-Schmitt A, González J, Holman L, Kankare M, Lenhart B, Orengo DJ, Snook RR, Yılmaz VM, Yusuf L. From whole bodies to single cells: A guide to transcriptomic approaches for ecology and evolutionary biology. Mol Ecol 2024:e17382. [PMID: 38856653 DOI: 10.1111/mec.17382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 04/09/2024] [Accepted: 04/29/2024] [Indexed: 06/11/2024]
Abstract
RNA sequencing (RNAseq) methodology has experienced a burst of technological developments in the last decade, which has opened up opportunities for studying the mechanisms of adaptation to environmental factors at both the organismal and cellular level. Selecting the most suitable experimental approach for specific research questions and model systems can, however, be a challenge and researchers in ecology and evolution are commonly faced with the choice of whether to study gene expression variation in whole bodies, specific tissues, and/or single cells. A wide range of sometimes polarised opinions exists over which approach is best. Here, we highlight the advantages and disadvantages of each of these approaches to provide a guide to help researchers make informed decisions and maximise the power of their study. Using illustrative examples of various ecological and evolutionary research questions, we guide the readers through the different RNAseq approaches and help them identify the most suitable design for their own projects.
Collapse
Affiliation(s)
- Katja M Hoedjes
- Amsterdam Institute for Life and Environment, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Sonja Grath
- Division of Evolutionary Biology, LMU Munich, Planegg-Martinsried, Germany
| | - Nico Posnien
- Department of Developmental Biology, Göttingen Center for Molecular Biosciences (GZMB), University of Göttingen, Göttingen, Germany
| | - Michael G Ritchie
- Centre for Biological Diversity, University of St Andrews, St Andrews, UK
| | | | | | - Isabel Almudi
- Departament de Genètica, Microbiologia i Estadística, Universitat de Barcelona, Barcelona, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
| | | | - Esra Durmaz Mitchell
- Department of Biology, University of Fribourg, Fribourg, Switzerland
- Functional Genomics and Metabolism Research Unit, Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense, Denmark
| | - Thomas Flatt
- Department of Biology, University of Fribourg, Fribourg, Switzerland
| | - Claudia Fricke
- Institute for Zoology/Animal Ecology, Martin-Luther-University Halle-Wittenberg, Halle (Saale), Germany
| | | | - Josefa González
- Institute of Evolutionary Biology, CSIC, UPF, Barcelona, Spain
| | - Luke Holman
- School of Applied Sciences, Edinburgh Napier University, Edinburgh, UK
| | - Maaria Kankare
- Department of Biological and Environmental Science, University of Jyväskylä, Jyväskylä, Finland
| | - Benedict Lenhart
- Department of Biology, University of Virginia, Charlottesville, Virginia, USA
| | - Dorcas J Orengo
- Departament de Genètica, Microbiologia i Estadística, Universitat de Barcelona, Barcelona, Spain
- Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
| | - Rhonda R Snook
- Department of Zoology, Stockholm University, Stockholm, Sweden
| | - Vera M Yılmaz
- Division of Evolutionary Biology, LMU Munich, Planegg-Martinsried, Germany
| | - Leeban Yusuf
- Centre for Biological Diversity, University of St Andrews, St Andrews, UK
| |
Collapse
|
3
|
Chin IM, Gardell ZA, Corces MR. Decoding polygenic diseases: advances in noncoding variant prioritization and validation. Trends Cell Biol 2024; 34:465-483. [PMID: 38719704 DOI: 10.1016/j.tcb.2024.03.005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Revised: 03/12/2024] [Accepted: 03/21/2024] [Indexed: 06/09/2024]
Abstract
Genome-wide association studies (GWASs) provide a key foundation for elucidating the genetic underpinnings of common polygenic diseases. However, these studies have limitations in their ability to assign causality to particular genetic variants, especially those residing in the noncoding genome. Over the past decade, technological and methodological advances in both analytical and empirical prioritization of noncoding variants have enabled the identification of causative variants by leveraging orthogonal functional evidence at increasing scale. In this review, we present an overview of these approaches and describe how this workflow provides the groundwork necessary to move beyond associations toward genetically informed studies on the molecular and cellular mechanisms of polygenic disease.
Collapse
Affiliation(s)
- Iris M Chin
- Gladstone Institute of Neurological Disease, Gladstone Institutes, San Francisco, CA, USA; Gladstone Institute of Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA, USA; Department of Neurology, University of California San Francisco, San Francisco, CA, USA
| | - Zachary A Gardell
- Gladstone Institute of Neurological Disease, Gladstone Institutes, San Francisco, CA, USA; Gladstone Institute of Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA, USA; Department of Neurology, University of California San Francisco, San Francisco, CA, USA
| | - M Ryan Corces
- Gladstone Institute of Neurological Disease, Gladstone Institutes, San Francisco, CA, USA; Gladstone Institute of Data Science and Biotechnology, Gladstone Institutes, San Francisco, CA, USA; Department of Neurology, University of California San Francisco, San Francisco, CA, USA.
| |
Collapse
|
4
|
Deng C, Whalen S, Steyert M, Ziffra R, Przytycki PF, Inoue F, Pereira DA, Capauto D, Norton S, Vaccarino FM, Pollen AA, Nowakowski TJ, Ahituv N, Pollard KS. Massively parallel characterization of regulatory elements in the developing human cortex. Science 2024; 384:eadh0559. [PMID: 38781390 DOI: 10.1126/science.adh0559] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Accepted: 03/13/2024] [Indexed: 05/25/2024]
Abstract
Nucleotide changes in gene regulatory elements are important determinants of neuronal development and diseases. Using massively parallel reporter assays in primary human cells from mid-gestation cortex and cerebral organoids, we interrogated the cis-regulatory activity of 102,767 open chromatin regions, including thousands of sequences with cell type-specific accessibility and variants associated with brain gene regulation. In primary cells, we identified 46,802 active enhancer sequences and 164 variants that alter enhancer activity. Activity was comparable in organoids and primary cells, suggesting that organoids provide an adequate model for the developing cortex. Using deep learning we decoded the sequence basis and upstream regulators of enhancer activity. This work establishes a comprehensive catalog of functional gene regulatory elements and variants in human neuronal development.
Collapse
Affiliation(s)
- Chengyu Deng
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Sean Whalen
- Gladstone Institutes, San Francisco, CA 94158, USA
| | - Marilyn Steyert
- Department of Anatomy, University of California, San Francisco, San Francisco, CA 94143, USA
- Department of Psychiatry, University of California, San Francisco, San Francisco, CA 94143, USA
- Department of Neurological Surgery, University of California, San Francisco, San Francisco, CA 94143, USA
- Chan Zuckerberg Biohub, San Francisco, San Francisco, CA 94158, USA
- Kavli Institute for Fundamental Neuroscience, University of California, San Francisco, CA 94143, USA
- Weill Institute for Neurosciences, University of California, San Francisco, CA 94158, USA
| | - Ryan Ziffra
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
- Department of Anatomy, University of California, San Francisco, San Francisco, CA 94143, USA
- Department of Psychiatry, University of California, San Francisco, San Francisco, CA 94143, USA
| | | | - Fumitaka Inoue
- Institute for the Advanced Study of Human Biology (WPI-ASHBi), Kyoto University, Kyoto 606-8501, Japan
| | - Daniela A Pereira
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA 94158, USA
- Graduate Program of Genetics, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, Minas Gerais 31270-901, Brazil
| | - Davide Capauto
- Child Study Center, Yale University, New Haven, CT 06520, USA
| | - Scott Norton
- Child Study Center, Yale University, New Haven, CT 06520, USA
| | - Flora M Vaccarino
- Child Study Center, Yale University, New Haven, CT 06520, USA
- Department of Neuroscience, Yale University, New Haven, CT 06520, USA
| | - Alex A Pollen
- Kavli Institute for Fundamental Neuroscience, University of California, San Francisco, CA 94143, USA
- Weill Institute for Neurosciences, University of California, San Francisco, CA 94158, USA
- Department of Neurology, University of California, San Francisco, San Francisco, CA 94143, USA
- Eli and Edythe Broad Center for Regeneration Medicine and Stem Cell Research, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Tomasz J Nowakowski
- Department of Anatomy, University of California, San Francisco, San Francisco, CA 94143, USA
- Department of Psychiatry, University of California, San Francisco, San Francisco, CA 94143, USA
- Department of Neurological Surgery, University of California, San Francisco, San Francisco, CA 94143, USA
- Kavli Institute for Fundamental Neuroscience, University of California, San Francisco, CA 94143, USA
- Weill Institute for Neurosciences, University of California, San Francisco, CA 94158, USA
- Eli and Edythe Broad Center for Regeneration Medicine and Stem Cell Research, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Nadav Ahituv
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Katherine S Pollard
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA 94158, USA
- Gladstone Institutes, San Francisco, CA 94158, USA
- Chan Zuckerberg Biohub, San Francisco, San Francisco, CA 94158, USA
- Department of Epidemiology and Biostatistics, University of California, San Francisco, San Francisco, CA 94158, USA
| |
Collapse
|
5
|
Zeng B, Bendl J, Deng C, Lee D, Misir R, Reach SM, Kleopoulos SP, Auluck P, Marenco S, Lewis DA, Haroutunian V, Ahituv N, Fullard JF, Hoffman GE, Roussos P. Genetic regulation of cell type-specific chromatin accessibility shapes brain disease etiology. Science 2024; 384:eadh4265. [PMID: 38781378 DOI: 10.1126/science.adh4265] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2023] [Accepted: 12/20/2023] [Indexed: 05/25/2024]
Abstract
Nucleotide variants in cell type-specific gene regulatory elements in the human brain are risk factors for human disease. We measured chromatin accessibility in 1932 aliquots of sorted neurons and non-neurons from 616 human postmortem brains and identified 34,539 open chromatin regions with chromatin accessibility quantitative trait loci (caQTLs). Only 10.4% of caQTLs are shared between neurons and non-neurons, which supports cell type-specific genetic regulation of the brain regulome. Incorporating allele-specific chromatin accessibility improves statistical fine-mapping and refines molecular mechanisms that underlie disease risk. Using massively parallel reporter assays in induced excitatory neurons, we screened 19,893 brain QTLs and identified the functional impact of 476 regulatory variants. Combined, this comprehensive resource captures variation in the human brain regulome and provides insights into disease etiology.
Collapse
Affiliation(s)
- Biao Zeng
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Jaroslav Bendl
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Chengyu Deng
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Donghoon Lee
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Ruth Misir
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Sarah M Reach
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Steven P Kleopoulos
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Pavan Auluck
- Human Brain Collection Core, National Institute of Mental Health-Intramural Research Program, Bethesda, MD 20892, USA
| | - Stefano Marenco
- Human Brain Collection Core, National Institute of Mental Health-Intramural Research Program, Bethesda, MD 20892, USA
| | - David A Lewis
- Translational Neuroscience Program, Department of Psychiatry, University of Pittsburgh School of Medicine, Pittsburgh, PA 15213, USA
| | - Vahram Haroutunian
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Mental Illness Research, Education and Clinical Centers, James J. Peters VA Medical Center, Bronx, NY 10468, USA
| | - Nadav Ahituv
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA 94158, USA
| | - John F Fullard
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Gabriel E Hoffman
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Panos Roussos
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
- Mental Illness Research, Education and Clinical Centers, James J. Peters VA Medical Center, Bronx, NY 10468, USA
| |
Collapse
|
6
|
Rossen J, Shi H, Strober BJ, Zhang MJ, Kanai M, McCaw ZR, Liang L, Weissbrod O, Price AL. MultiSuSiE improves multi-ancestry fine-mapping in All of Us whole-genome sequencing data. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.05.13.24307291. [PMID: 38798542 PMCID: PMC11118590 DOI: 10.1101/2024.05.13.24307291] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Leveraging data from multiple ancestries can greatly improve fine-mapping power due to differences in linkage disequilibrium and allele frequencies. We propose MultiSuSiE, an extension of the sum of single effects model (SuSiE) to multiple ancestries that allows causal effect sizes to vary across ancestries based on a multivariate normal prior informed by empirical data. We evaluated MultiSuSiE via simulations and analyses of 14 quantitative traits leveraging whole-genome sequencing data in 47k African-ancestry and 94k European-ancestry individuals from All of Us. In simulations, MultiSuSiE applied to Afr47k+Eur47k was well-calibrated and attained higher power than SuSiE applied to Eur94k; interestingly, higher causal variant PIPs in Afr47k compared to Eur47k were entirely explained by differences in the extent of LD quantified by LD 4th moments. Compared to very recently proposed multi-ancestry fine-mapping methods, MultiSuSiE attained higher power and/or much lower computational costs, making the analysis of large-scale All of Us data feasible. In real trait analyses, MultiSuSiE applied to Afr47k+Eur94k identified 579 fine-mapped variants with PIP > 0.5, and MultiSuSiE applied to Afr47k+Eur47k identified 44% more fine-mapped variants with PIP > 0.5 than SuSiE applied to Eur94k. We validated MultiSuSiE results for real traits via functional enrichment of fine-mapped variants. We highlight several examples where MultiSuSiE implicates well-studied or biologically plausible fine-mapped variants that were not implicated by other methods.
Collapse
|
7
|
Siraj L, Castro RI, Dewey H, Kales S, Nguyen TTL, Kanai M, Berenzy D, Mouri K, Wang QS, McCaw ZR, Gosai SJ, Aguet F, Cui R, Vockley CM, Lareau CA, Okada Y, Gusev A, Jones TR, Lander ES, Sabeti PC, Finucane HK, Reilly SK, Ulirsch JC, Tewhey R. Functional dissection of complex and molecular trait variants at single nucleotide resolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.05.592437. [PMID: 38766054 PMCID: PMC11100724 DOI: 10.1101/2024.05.05.592437] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2024]
Abstract
Identifying the causal variants and mechanisms that drive complex traits and diseases remains a core problem in human genetics. The majority of these variants have individually weak effects and lie in non-coding gene-regulatory elements where we lack a complete understanding of how single nucleotide alterations modulate transcriptional processes to affect human phenotypes. To address this, we measured the activity of 221,412 trait-associated variants that had been statistically fine-mapped using a Massively Parallel Reporter Assay (MPRA) in 5 diverse cell-types. We show that MPRA is able to discriminate between likely causal variants and controls, identifying 12,025 regulatory variants with high precision. Although the effects of these variants largely agree with orthogonal measures of function, only 69% can plausibly be explained by the disruption of a known transcription factor (TF) binding motif. We dissect the mechanisms of 136 variants using saturation mutagenesis and assign impacted TFs for 91% of variants without a clear canonical mechanism. Finally, we provide evidence that epistasis is prevalent for variants in close proximity and identify multiple functional variants on the same haplotype at a small, but important, subset of trait-associated loci. Overall, our study provides a systematic functional characterization of likely causal common variants underlying complex and molecular human traits, enabling new insights into the regulatory grammar underlying disease risk.
Collapse
Affiliation(s)
- Layla Siraj
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Program in Biophysics, Harvard Graduate School of Arts and Sciences, Boston, MA, USA
- Harvard-Massachusetts Institute of Technology MD/PhD Program, Harvard Medical School, Boston, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | | | | | | | | | - Masahiro Kanai
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA USA
- Center for Computational and Integrative Biology, Massachusetts General Hospital, Boston, MA, USA
| | | | | | - Qingbo S. Wang
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA USA
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
- Department of Genome Informatics, Graduate School of Medicine, the University of Tokyo, Tokyo, Japan
| | | | - Sager J. Gosai
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - François Aguet
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Illumina Artificial Intelligence Laboratory, Illumina, San Diego, CA, USA
| | - Ran Cui
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA USA
- The Novo Nordisk Foundation Center for Genomic Mechanisms of Disease, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Caleb A. Lareau
- Program in Computational and Systems Biology, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Yukinori Okada
- Department of Statistical Genetics, Osaka University Graduate School of Medicine, Suita, Japan
- Department of Genome Informatics, Graduate School of Medicine, the University of Tokyo, Tokyo, Japan
- Laboratory for Systems Genetics, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | - Alexander Gusev
- Harvard Medical School and Dana-Farber Cancer Institute, Boston, MA, USA
| | - Thouis R. Jones
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- The Novo Nordisk Foundation Center for Genomic Mechanisms of Disease, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Eric S. Lander
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Biology, MIT, Cambridge, MA, USA
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Pardis C. Sabeti
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, USA
| | - Hilary K. Finucane
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA USA
| | - Steven K. Reilly
- Department of Genetics, Yale School of Medicine, New Haven, CT, USA
- Wu Tsai Institute, Yale University, New Haven, CT, USA
| | - Jacob C. Ulirsch
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA USA
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA
- Illumina Artificial Intelligence Laboratory, Illumina, San Diego, CA, USA
| | - Ryan Tewhey
- The Jackson Laboratory, Bar Harbor, ME, USA
- Graduate School of Biomedical Sciences and Engineering, University of Maine, Orono, ME, USA
- Graduate School of Biomedical Sciences, Tufts University School of Medicine, Boston, MA, USA
| |
Collapse
|
8
|
He Z, Chu B, Yang J, Gu J, Chen Z, Liu L, Morrison T, Belloy ME, Qi X, Hejazi N, Mathur M, Le Guen Y, Tang H, Hastie T, Ionita-laza I, Sabatti C, Candès E. Beyond guilty by association at scale: searching for causal variants on the basis of genome-wide summary statistics. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.28.582621. [PMID: 38464202 PMCID: PMC10925326 DOI: 10.1101/2024.02.28.582621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
Understanding the causal genetic architecture of complex phenotypes is essential for future research into disease mechanisms and potential therapies. Here, we present a novel framework for genome-wide detection of sets of variants that carry non-redundant information on the phenotypes and are therefore more likely to be causal in a biological sense. Crucially, our framework requires only summary statistics obtained from standard genome-wide marginal association testing. The described approach, implemented in open-source software, is also computationally efficient, requiring less than 15 minutes on a single CPU to perform genome-wide analysis. Through extensive genome-wide simulation studies, we show that the method can substantially outperform usual two-stage marginal association testing and fine-mapping procedures in precision and recall. In applications to a meta-analysis of ten large-scale genetic studies of Alzheimer's disease (AD), we identified 82 loci associated with AD, including 37 additional loci missed by conventional GWAS pipeline. The identified putative causal variants achieve state-of-the-art agreement with massively parallel reporter assays and CRISPR-Cas9 experiments. Additionally, we applied the method to a retrospective analysis of 67 large-scale GWAS summary statistics since 2013 for a variety of phenotypes. Results reveal the method's capacity to robustly discover additional loci for polygenic traits and pinpoint potential causal variants underpinning each locus beyond conventional GWAS pipeline, contributing to a deeper understanding of complex genetic architectures in post-GWAS analyses.
Collapse
Affiliation(s)
- Zihuai He
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA 94305, USA
- Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, CA, 94305, USA
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA
| | - Benjamin Chu
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA
| | - James Yang
- Department of Statistics, Stanford University, Stanford, CA 94305, USA
| | - Jiaqi Gu
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA 94305, USA
- Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, CA, 94305, USA
| | - Zhaomeng Chen
- Department of Statistics, Stanford University, Stanford, CA 94305, USA
| | - Linxi Liu
- Department of Statistics, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Tim Morrison
- Department of Statistics, Stanford University, Stanford, CA 94305, USA
| | - Michael E. Belloy
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA 94305, USA
| | - Xinran Qi
- Department of Neurology and Neurological Sciences, Stanford University, Stanford, CA 94305, USA
- Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, CA, 94305, USA
| | - Nima Hejazi
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | - Maya Mathur
- Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, CA, 94305, USA
- Department of Pediatrics, Stanford University, Stanford, CA 94305, USA
| | - Yann Le Guen
- Quantitative Sciences Unit, Department of Medicine, Stanford University, Stanford, CA, 94305, USA
| | - Hua Tang
- Department of Pediatrics, Stanford University, Stanford, CA 94305, USA
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Trevor Hastie
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA
- Department of Statistics, Stanford University, Stanford, CA 94305, USA
| | - Iuliana Ionita-laza
- Department of Biostatistics, Columbia University Mailman School of Public Health, New York, NY 10032, USA
| | - Chiara Sabatti
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA
- Department of Statistics, Stanford University, Stanford, CA 94305, USA
| | - Emmanuel Candès
- Department of Statistics, Stanford University, Stanford, CA 94305, USA
- Department of Mathematics, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
9
|
Georas SN, Khurana S. Update on asthma biology. J Allergy Clin Immunol 2024; 153:1215-1228. [PMID: 38341182 DOI: 10.1016/j.jaci.2024.01.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 01/17/2024] [Accepted: 01/24/2024] [Indexed: 02/12/2024]
Abstract
This is an exciting time to be conducting asthma research. The recent development of targeted asthma biologics has validated the power of basic research to discover new molecules amenable to therapeutic intervention. Advances in high-throughput sequencing are providing a wealth of "omics" data about genetic and epigenetic underpinnings of asthma, as well as about new cellular interacting networks and potential endotypes in asthma. Airway epithelial cells have emerged not only as key sensors of the outside environment but also as central drivers of dysregulated mucosal immune responses in asthma. Emerging data suggest that the airway epithelium in asthma remembers prior encounters with environmental exposures, resulting in potentially long-lasting changes in structure and metabolism that render asthmatic individuals susceptible to subsequent exposures. Here we summarize recent insights into asthma biology, focusing on studies using human cells or tissue that were published in the past 2 years. The studies are organized thematically into 6 content areas to draw connections and spur future research (on genetics and epigenetics, prenatal and early-life origins, microbiome, immune and inflammatory pathways, asthma endotypes and biomarkers, and lung structural alterations). We highlight recent studies of airway epithelial dysfunction and response to viral infections and conclude with a framework for considering how bidirectional interactions between alterations in airway structure and mucosal immunity can lead to sustained lung dysfunction in asthma.
Collapse
Affiliation(s)
- Steve N Georas
- Division of Pulmonary and Critical Care Medicine, University of Rochester Medical Center, Rochester, NY.
| | - Sandhya Khurana
- Division of Pulmonary and Critical Care Medicine, University of Rochester Medical Center, Rochester, NY
| |
Collapse
|
10
|
Kim KH, Hong EP, Lee Y, McLean ZL, Elezi E, Lee R, Kwak S, McAllister B, Massey TH, Lobanov S, Holmans P, Orth M, Ciosi M, Monckton DG, Long JD, Lucente D, Wheeler VC, MacDonald ME, Gusella JF, Lee JM. Posttranscriptional regulation of FAN1 by miR-124-3p at rs3512 underlies onset-delaying genetic modification in Huntington's disease. Proc Natl Acad Sci U S A 2024; 121:e2322924121. [PMID: 38607933 PMCID: PMC11032436 DOI: 10.1073/pnas.2322924121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2024] [Accepted: 02/06/2024] [Indexed: 04/14/2024] Open
Abstract
Many Mendelian disorders, such as Huntington's disease (HD) and spinocerebellar ataxias, arise from expansions of CAG trinucleotide repeats. Despite the clear genetic causes, additional genetic factors may influence the rate of those monogenic disorders. Notably, genome-wide association studies discovered somewhat expected modifiers, particularly mismatch repair genes involved in the CAG repeat instability, impacting age at onset of HD. Strikingly, FAN1, previously unrelated to repeat instability, produced the strongest HD modification signals. Diverse FAN1 haplotypes independently modify HD, with rare genetic variants diminishing DNA binding or nuclease activity of the FAN1 protein, hastening HD onset. However, the mechanism behind the frequent and the most significant onset-delaying FAN1 haplotype lacking missense variations has remained elusive. Here, we illustrated that a microRNA acting on 3'-UTR (untranslated region) SNP rs3512, rather than transcriptional regulation, is responsible for the significant FAN1 expression quantitative trait loci signal and allelic imbalance in FAN1 messenger ribonucleic acid (mRNA), accounting for the most significant and frequent onset-delaying modifier haplotype in HD. Specifically, miR-124-3p selectively targets the reference allele at rs3512, diminishing the stability of FAN1 mRNA harboring that allele and consequently reducing its levels. Subsequent validation analyses, including the use of antagomir and 3'-UTR reporter vectors with swapped alleles, confirmed the specificity of miR-124-3p at rs3512. Together, these findings indicate that the alternative allele at rs3512 renders the FAN1 mRNA less susceptible to miR-124-3p-mediated posttranscriptional regulation, resulting in increased FAN1 levels and a subsequent delay in HD onset by mitigating CAG repeat instability.
Collapse
Affiliation(s)
- Kyung-Hee Kim
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA02114
- Department of Neurology, Harvard Medical School, Boston, MA02115
| | - Eun Pyo Hong
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA02114
- Department of Neurology, Harvard Medical School, Boston, MA02115
| | - Yukyeong Lee
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA02114
- Department of Neurology, Harvard Medical School, Boston, MA02115
| | - Zachariah L. McLean
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA02114
- Department of Neurology, Harvard Medical School, Boston, MA02115
- Medical and Population Genetics Program, The Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA02142
| | - Emanuela Elezi
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA02114
| | | | | | - Branduff McAllister
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA02114
- Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, CardiffCF24 4HQ, United Kingdom
| | - Thomas H. Massey
- Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, CardiffCF24 4HQ, United Kingdom
| | - Sergey Lobanov
- Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, CardiffCF24 4HQ, United Kingdom
| | - Peter Holmans
- Centre for Neuropsychiatric Genetics and Genomics, Division of Psychological Medicine and Clinical Neurosciences, School of Medicine, Cardiff University, CardiffCF24 4HQ, United Kingdom
| | - Michael Orth
- University Hospital of Old Age Psychiatry and Psychotherapy, Bern University, CH-3000Bern 60, Switzerland
| | - Marc Ciosi
- School of Molecular Biosciences, College of Medical, Veterinary and Life Sciences, University of Glasgow, GlasgowG12 8QQ, United Kingdom
| | - Darren G. Monckton
- School of Molecular Biosciences, College of Medical, Veterinary and Life Sciences, University of Glasgow, GlasgowG12 8QQ, United Kingdom
| | - Jeffrey D. Long
- Department of Psychiatry, Carver College of Medicine, University of Iowa, Iowa City, IA52242
- Department of Biostatistics, College of Public Health, University of Iowa, Iowa City, IA52242
| | - Diane Lucente
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA02114
| | - Vanessa C. Wheeler
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA02114
- Department of Neurology, Harvard Medical School, Boston, MA02115
| | - Marcy E. MacDonald
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA02114
- Department of Neurology, Harvard Medical School, Boston, MA02115
- Medical and Population Genetics Program, The Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA02142
| | - James F. Gusella
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA02114
- Medical and Population Genetics Program, The Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA02142
- Department of Genetics, Blavatnik Institute, Harvard Medical School, Boston, MA02115
| | - Jong-Min Lee
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA02114
- Department of Neurology, Harvard Medical School, Boston, MA02115
- Medical and Population Genetics Program, The Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA02142
| |
Collapse
|
11
|
Choi J, Lee SM, Norwitz ER, Kim JH, Jung YM, Park CW, Jun JK, Lee D, Jin Y, Kim S, Cha B, Park JS, Kim JI. Placental expression quantitative trait loci in an East Asian population. HGG ADVANCES 2024; 5:100276. [PMID: 38310352 PMCID: PMC10883826 DOI: 10.1016/j.xhgg.2024.100276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Revised: 01/29/2024] [Accepted: 01/29/2024] [Indexed: 02/05/2024] Open
Abstract
Expression quantitative trait loci (eQTL) analysis measures the contribution of genetic variation in gene expression on complex traits. Although this methodology has been used to examine gene regulation in numerous human tissues, eQTL research in solid tissues is relatively lacking. We conducted eQTL analysis on placentas collected from an East Asian population in an effort to identify gene regulatory mechanisms in this tissue. Placentas (n = 102) were collected at the time of cesarean delivery. mRNA was extracted, sequenced with NGS, and compared with matched maternal and fetal DNA arrays performed using maternal and neonatal cord blood. Linear regression modeling was performed using tensorQTL. Fine-mapping along with epigenomic annotation was used to select putative functional variants. We identified 2,703 coding genes that contained at least one eQTL with statistical significance (false discovery rate <0.05). After fine-mapping, we found 108 previously unreported eQTL variants with posterior inclusion probability >0.1. Of these, 19% were located in genomic regions with evidence from public placental epigenome suggesting that they may be functionally relevant. For example, variant rs28379289 located in the placenta-specific regulatory region changes the binding affinity of transcription factor leading to higher expression of LGALS3, which is known to affect placental function. This study expands the knowledge base of regulatory elements within the human placenta and identifies 108 previously unreported placenta eQTL signals, which are listed in our publicly available GMI eQTL database. Further studies are needed to identify and characterize genetic regulatory mechanisms that affect placental function in normal pregnancy and placenta-related diseases.
Collapse
Affiliation(s)
- Jaeyong Choi
- Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, Korea; Genomic Medicine Institute, Medical Research Center, Seoul National University, Seoul, Korea
| | - Seung Mi Lee
- Department of Obstetrics and Gynecology, Seoul National University College of Medicine, Seoul, Korea
| | | | - Ji Hoi Kim
- Department of Obstetrics and Gynecology, Seoul National University College of Medicine, Seoul, Korea
| | - Young Mi Jung
- Department of Obstetrics and Gynecology, Seoul National University College of Medicine, Seoul, Korea
| | - Chan-Wook Park
- Department of Obstetrics and Gynecology, Seoul National University College of Medicine, Seoul, Korea
| | - Jong Kwan Jun
- Department of Obstetrics and Gynecology, Seoul National University College of Medicine, Seoul, Korea
| | - Dakyung Lee
- Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, Korea; Genomic Medicine Institute, Medical Research Center, Seoul National University, Seoul, Korea
| | - Yongjoon Jin
- Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, Korea; Genomic Medicine Institute, Medical Research Center, Seoul National University, Seoul, Korea
| | - Sookyung Kim
- Genomic Medicine Institute, Medical Research Center, Seoul National University, Seoul, Korea
| | - Bukyoung Cha
- Genomic Medicine Institute, Medical Research Center, Seoul National University, Seoul, Korea
| | - Joong Shin Park
- Department of Obstetrics and Gynecology, Seoul National University College of Medicine, Seoul, Korea.
| | - Jong-Il Kim
- Department of Biomedical Sciences, Seoul National University College of Medicine, Seoul, Korea; Genomic Medicine Institute, Medical Research Center, Seoul National University, Seoul, Korea.
| |
Collapse
|
12
|
Fu Y, Kelly JA, Gopalakrishnan J, Pelikan RC, Tessneer KL, Pasula S, Grundahl K, Murphy DA, Gaffney PM. Massively parallel reporter assay confirms regulatory potential of hQTLs and reveals important variants in lupus and other autoimmune diseases. HGG ADVANCES 2024; 5:100279. [PMID: 38389303 PMCID: PMC10943488 DOI: 10.1016/j.xhgg.2024.100279] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 02/15/2024] [Accepted: 02/18/2024] [Indexed: 02/24/2024] Open
Abstract
We designed a massively parallel reporter assay (MPRA) in an Epstein-Barr virus transformed B cell line to directly characterize the potential for histone post-translational modifications, i.e., histone quantitative trait loci (hQTLs), expression QTLs (eQTLs), and variants on systemic lupus erythematosus (SLE) and autoimmune (AI) disease risk haplotypes to modulate regulatory activity in an allele-dependent manner. Our study demonstrates that hQTLs, as a group, are more likely to modulate regulatory activity in an MPRA compared with other variant classes tested, including a set of eQTLs previously shown to interact with hQTLs and tested AI risk variants. In addition, we nominate 17 variants (including 11 previously unreported) as putative causal variants for SLE and another 14 for various other AI diseases, prioritizing these variants for future functional studies in primary and immortalized B cells. Thus, we uncover important insights into the mechanistic relationships among genotype, epigenetics, and gene expression in SLE and AI disease phenotypes.
Collapse
Affiliation(s)
- Yao Fu
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK 73104, USA
| | - Jennifer A Kelly
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK 73104, USA
| | - Jaanam Gopalakrishnan
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK 73104, USA; Neuro-Immune Regulome Unit, National Eye Institute, National Institute of Health, Bethesda, MD 20892, USA
| | - Richard C Pelikan
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK 73104, USA
| | - Kandice L Tessneer
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK 73104, USA
| | - Satish Pasula
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK 73104, USA
| | - Kiely Grundahl
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK 73104, USA
| | - David A Murphy
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK 73104, USA
| | - Patrick M Gaffney
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK 73104, USA.
| |
Collapse
|
13
|
Nizomov J, Jin W, Xia Y, Liu Y, Li Z, Chen L. MPRAVarDB: an online database and web server for exploring regulatory effects of genetic variants. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.02.587790. [PMID: 38617248 PMCID: PMC11014600 DOI: 10.1101/2024.04.02.587790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]
Abstract
Massively parallel reporter assay (MPRA) is an important technology to evaluate the impact of genetic variants on gene regulation. Here, we present MPRAVarDB, an online database and web server, for exploring regulatory effects of genetic variants. MPRAVarDB harbors 18 MPRA experiments designed to assess the regulatory effects of genetic variants associated with GWAS loci, eQTLs and various genomic features, resulting in a total of 242,818 variants tested across more than 30 cell lines and 30 human diseases or traits. MPRAVarDB empowers the query of MPRA variants by genomic region, disease and cell line or by any combination of these query terms. Notably, MPRAVarDB offers a suite of pretrained machine learning models tailored to the specific disease and cell line, facilitating the genome-wide prediction of regulatory variants. MPRAVarDB is friendly to use, and users only need a few clicks to receive query and prediction results.
Collapse
Affiliation(s)
- Javlon Nizomov
- Department of Biostatistics, University of Florida, Gainesville, FL, 32603
| | - Weijia Jin
- Department of Biostatistics, University of Florida, Gainesville, FL, 32603
| | - Yi Xia
- Department of Biostatistics, University of Florida, Gainesville, FL, 32603
| | - Yunlong Liu
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, 46202
| | - Zhigang Li
- Department of Biostatistics, University of Florida, Gainesville, FL, 32603
| | - Li Chen
- Department of Biostatistics, University of Florida, Gainesville, FL, 32603
| |
Collapse
|
14
|
Khetan S, Bulyk ML. Overlapping binding sites underlie TF genomic occupancy. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.05.583629. [PMID: 38496549 PMCID: PMC10942454 DOI: 10.1101/2024.03.05.583629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
Sequence-specific DNA binding by transcription factors (TFs) is a crucial step in gene regulation. However, current high-throughput in vitro approaches cannot reliably detect lower affinity TF-DNA interactions, which play key roles in gene regulation. Here, we developed PADIT-seq ( p rotein a ffinity to D NA by in vitro transcription and RNA seq uencing) to assay TF binding preferences to all 10-bp DNA sequences at far greater sensitivity than prior approaches. The expanded catalogs of low affinity DNA binding sites for the human TFs HOXD13 and EGR1 revealed that nucleotides flanking high affinity DNA binding sites create overlapping lower affinity sites that together modulate TF genomic occupancy in vivo . Formation of such extended recognition sequences stems from an inherent property of TF binding sites to interweave each other and expands the genomic sequence space for identifying noncoding variants that directly alter TF binding. One-Sentence Summary Overlapping DNA binding sites underlie TF genomic occupancy through their inherent propensity to interweave each other.
Collapse
|
15
|
Zhang Y, Hou G, Shen N. Non-coding DNA variants for risk in lupus. Best Pract Res Clin Rheumatol 2024:101937. [PMID: 38429183 DOI: 10.1016/j.berh.2024.101937] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Revised: 02/08/2024] [Accepted: 02/12/2024] [Indexed: 03/03/2024]
Abstract
Systemic Lupus Erythematosus (SLE) is a multifactorial autoimmune disease that arises from a dynamic interplay between genetics and environmental triggers. The advent of sophisticated genomics technology has catalyzed a shift in our understanding of disease etiology, spotlighting the pivotal role of non-coding DNA variants in SLE pathogenesis. In this review, we present a comprehensive examination of the non-coding variants associated with SLE, shedding light on their role in influencing disease risk and progression. We discuss the latest methodological advancements that have been instrumental in the identification and functional characterization of these genomic elements, with a special focus on the transformative power of CRISPR-based gene-editing technologies. Additionally, the review probes into the therapeutic opportunities that arise from modulating non-coding regions associated with SLE. Through an exploration of the complex network of non-coding DNA, this review aspires to decode the genetic puzzle of SLE and set the stage for groundbreaking gene-based therapeutic interventions and the advancement of precision medicine strategies tailored to SLE management.
Collapse
Affiliation(s)
- Yutong Zhang
- Shanghai Institute of Rheumatology, Renji Hospital, Shanghai Jiao Tong University School of Medicine (SJTUSM), Shanghai, 200001, China
| | - Guojun Hou
- Shanghai Institute of Rheumatology, Renji Hospital, Shanghai Jiao Tong University School of Medicine (SJTUSM), Shanghai, 200001, China
| | - Nan Shen
- Shanghai Institute of Rheumatology, Renji Hospital, Shanghai Jiao Tong University School of Medicine (SJTUSM), Shanghai, 200001, China.
| |
Collapse
|
16
|
Farrow SL, Gokuladhas S, Schierding W, Pudjihartono M, Perry JK, Cooper AA, O'Sullivan JM. Identification of 27 allele-specific regulatory variants in Parkinson's disease using a massively parallel reporter assay. NPJ Parkinsons Dis 2024; 10:44. [PMID: 38413607 PMCID: PMC10899198 DOI: 10.1038/s41531-024-00659-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 02/12/2024] [Indexed: 02/29/2024] Open
Abstract
Genome wide association studies (GWAS) have identified a number of genomic loci that are associated with Parkinson's disease (PD) risk. However, the majority of these variants lie in non-coding regions, and thus the mechanisms by which they influence disease development, and/or potential subtypes, remain largely elusive. To address this, we used a massively parallel reporter assay (MPRA) to screen the regulatory function of 5254 variants that have a known or putative connection to PD. We identified 138 loci with enhancer activity, of which 27 exhibited allele-specific regulatory activity in HEK293 cells. The identified regulatory variant(s) typically did not match the original tag variant within the PD associated locus, supporting the need for deeper exploration of these loci. The existence of allele specific transcriptional impacts within HEK293 cells, confirms that at least a subset of the PD associated regions mark functional gene regulatory elements. Future functional studies that confirm the putative targets of the empirically verified regulatory variants will be crucial for gaining a greater understanding of how gene regulatory network(s) modulate PD risk.
Collapse
Affiliation(s)
- Sophie L Farrow
- Liggins Institute, The University of Auckland, Auckland, New Zealand.
- The Maurice Wilkins Centre, The University of Auckland, Auckland, New Zealand.
| | | | - William Schierding
- Liggins Institute, The University of Auckland, Auckland, New Zealand
- The Maurice Wilkins Centre, The University of Auckland, Auckland, New Zealand
- Department of Ophthalmology, The University of Auckland, Auckland, New Zealand
| | | | - Jo K Perry
- Liggins Institute, The University of Auckland, Auckland, New Zealand
- The Maurice Wilkins Centre, The University of Auckland, Auckland, New Zealand
| | - Antony A Cooper
- Australian Parkinsons Mission, Garvan Institute of Medical Research, Sydney, NSW, Australia
- St Vincent's Clinical School, University of New South Wales, Sydney, NSW, Australia
| | - Justin M O'Sullivan
- Liggins Institute, The University of Auckland, Auckland, New Zealand.
- The Maurice Wilkins Centre, The University of Auckland, Auckland, New Zealand.
- Australian Parkinsons Mission, Garvan Institute of Medical Research, Sydney, NSW, Australia.
- Singapore Institute for Clinical Sciences, Agency for Science Technology and Research, Singapore, Singapore.
- MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton, United Kingdom.
| |
Collapse
|
17
|
Venema WJ, Hiddingh S, van Loosdregt J, Bowes J, Balliu B, de Boer JH, Ossewaarde-van Norel J, Thompson SD, Langefeld CD, de Ligt A, van der Veken LT, Krijger PHL, de Laat W, Kuiper JJW. A cis-regulatory element regulates ERAP2 expression through autoimmune disease risk SNPs. CELL GENOMICS 2024; 4:100460. [PMID: 38190099 PMCID: PMC10794781 DOI: 10.1016/j.xgen.2023.100460] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 10/04/2023] [Accepted: 11/09/2023] [Indexed: 01/09/2024]
Abstract
Single-nucleotide polymorphisms (SNPs) near the ERAP2 gene are associated with various autoimmune conditions, as well as protection against lethal infections. Due to high linkage disequilibrium, numerous trait-associated SNPs are correlated with ERAP2 expression; however, their functional mechanisms remain unidentified. We show by reciprocal allelic replacement that ERAP2 expression is directly controlled by the splice region variant rs2248374. However, disease-associated variants in the downstream LNPEP gene promoter are independently associated with ERAP2 expression. Allele-specific conformation capture assays revealed long-range chromatin contacts between the gene promoters of LNPEP and ERAP2 and showed that interactions were stronger in patients carrying the alleles that increase susceptibility to autoimmune diseases. Replacing the SNPs in the LNPEP promoter by reference sequences lowered ERAP2 expression. These findings show that multiple SNPs act in concert to regulate ERAP2 expression and that disease-associated variants can convert a gene promoter region into a potent enhancer of a distal gene.
Collapse
Affiliation(s)
- Wouter J Venema
- Department of Ophthalmology, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands; Center for Translational Immunology, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - Sanne Hiddingh
- Department of Ophthalmology, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands; Center for Translational Immunology, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - Jorg van Loosdregt
- Center for Translational Immunology, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - John Bowes
- Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, Manchester Academic Health Science Centre, The University of Manchester, Manchester, UK
| | - Brunilda Balliu
- Department of Computational Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Joke H de Boer
- Department of Ophthalmology, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | | | - Susan D Thompson
- Department of Pediatrics, University of Cincinnati College of Medicine, Division of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, USA
| | - Carl D Langefeld
- Department of Biostatistics and Data Science, and Center for Precision Medicine, Wake Forest University School of Medicine, Winston-Salem, NC, USA
| | - Aafke de Ligt
- Department of Ophthalmology, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands; Center for Translational Immunology, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - Lars T van der Veken
- Department of Genetics, Division Laboratories, Pharmacy and Biomedical Genetics, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands
| | - Peter H L Krijger
- Oncode Institute, Hubrecht Institute-KNAW and University Medical Center Utrecht, 3584 CT Utrecht, the Netherlands
| | - Wouter de Laat
- Oncode Institute, Hubrecht Institute-KNAW and University Medical Center Utrecht, 3584 CT Utrecht, the Netherlands
| | - Jonas J W Kuiper
- Department of Ophthalmology, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands; Center for Translational Immunology, University Medical Center Utrecht, Utrecht University, Utrecht, the Netherlands.
| |
Collapse
|
18
|
Link V, Schraiber JG, Fan C, Dinh B, Mancuso N, Chiang CWK, Edge MD. Tree-based QTL mapping with expected local genetic relatedness matrices. Am J Hum Genet 2023; 110:2077-2091. [PMID: 38065072 PMCID: PMC10716520 DOI: 10.1016/j.ajhg.2023.10.017] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2023] [Revised: 10/26/2023] [Accepted: 10/27/2023] [Indexed: 12/18/2023] Open
Abstract
Understanding the genetic basis of complex phenotypes is a central pursuit of genetics. Genome-wide association studies (GWASs) are a powerful way to find genetic loci associated with phenotypes. GWASs are widely and successfully used, but they face challenges related to the fact that variants are tested for association with a phenotype independently, whereas in reality variants at different sites are correlated because of their shared evolutionary history. One way to model this shared history is through the ancestral recombination graph (ARG), which encodes a series of local coalescent trees. Recent computational and methodological breakthroughs have made it feasible to estimate approximate ARGs from large-scale samples. Here, we explore the potential of an ARG-based approach to quantitative-trait locus (QTL) mapping, echoing existing variance-components approaches. We propose a framework that relies on the conditional expectation of a local genetic relatedness matrix (local eGRM) given the ARG. Simulations show that our method is especially beneficial for finding QTLs in the presence of allelic heterogeneity. By framing QTL mapping in terms of the estimated ARG, we can also facilitate the detection of QTLs in understudied populations. We use local eGRM to analyze two chromosomes containing known body size loci in a sample of Native Hawaiians. Our investigations can provide intuition about the benefits of using estimated ARGs in population- and statistical-genetic methods in general.
Collapse
Affiliation(s)
- Vivian Link
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Joshua G Schraiber
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Caoqi Fan
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA; Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Bryan Dinh
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA; Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Nicholas Mancuso
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA; Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Charleston W K Chiang
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA; Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Michael D Edge
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA.
| |
Collapse
|
19
|
Zhang J, Zhao H. eQTL studies: from bulk tissues to single cells. J Genet Genomics 2023; 50:925-933. [PMID: 37207929 PMCID: PMC10656365 DOI: 10.1016/j.jgg.2023.05.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Revised: 05/02/2023] [Accepted: 05/04/2023] [Indexed: 05/21/2023]
Abstract
An expression quantitative trait locus (eQTL) is a chromosomal region where genetic variants are associated with the expression levels of specific genes that can be both nearby or distant. The identifications of eQTLs for different tissues, cell types, and contexts have led to a better understanding of the dynamic regulations of gene expressions and implications of functional genes and variants for complex traits and diseases. Although most eQTL studies have been performed on data collected from bulk tissues, recent studies have demonstrated the importance of cell-type-specific and context-dependent gene regulations in biological processes and disease mechanisms. In this review, we discuss statistical methods that have been developed to enable the detection of cell-type-specific and context-dependent eQTLs from bulk tissues, purified cell types, and single cells. We also discuss the limitations of the current methods and future research opportunities.
Collapse
Affiliation(s)
- Jingfei Zhang
- Information Systems and Operations Management, Emory University, Atlanta, GA 30322, USA
| | - Hongyu Zhao
- Department of Biostatistics, Yale School of Public Health, New Haven, CT 208034, USA.
| |
Collapse
|
20
|
Amariuta T. The power paradox of detecting disease-associated and gene-expression-associated variants. Nat Genet 2023; 55:1782-1783. [PMID: 37857936 DOI: 10.1038/s41588-023-01525-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2023]
Affiliation(s)
- Tiffany Amariuta
- Halıcıoğlu Data Science Institute, University of California, San Diego, La Jolla, CA, USA.
- Department of Medicine, University of California, San Diego, La Jolla, CA, USA.
| |
Collapse
|
21
|
Xu M, Liu Q, Bi R, Li Y, Li H, Kang WB, Yan Z, Zheng Q, Sun C, Ye M, Xiang BL, Luo XJ, Li M, Zhang DF, Yao YG. Coexistence of Multiple Functional Variants and Genes Underlies Genetic Risk Locus 11p11.2 of Alzheimer's Disease. Biol Psychiatry 2023; 94:743-759. [PMID: 37290560 DOI: 10.1016/j.biopsych.2023.05.020] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 05/25/2023] [Accepted: 05/26/2023] [Indexed: 06/10/2023]
Abstract
BACKGROUND Genome-wide association studies have identified dozens of genetic risk loci for Alzheimer's disease (AD), yet the underlying causal variants and biological mechanisms remain elusive, especially for loci with complex linkage disequilibrium and regulation. METHODS To fully untangle the causal signal at a single locus, we performed a functional genomic study of 11p11.2 (the CELF1/SPI1 locus). Genome-wide association study signals at 11p11.2 were integrated with datasets of histone modification, open chromatin, and transcription factor binding to distill potentially functional variants (fVars). Their allelic regulatory activities were confirmed by allele imbalance, reporter assays, and base editing. Expressional quantitative trait loci and chromatin interaction data were incorporated to assign target genes to fVars. The relevance of these genes to AD was assessed by convergent functional genomics using bulk brain and single-cell transcriptomic, epigenomic, and proteomic datasets of patients with AD and control individuals, followed by cellular assays. RESULTS We found that 24 potential fVars, rather than a single variant, were responsible for the risk of 11p11.2. These fVars modulated transcription factor binding and regulated multiple genes by long-range chromatin interactions. Besides SPI1, convergent evidence indicated that 6 target genes (MTCH2, ACP2, NDUFS3, PSMC3, C1QTNF4, and MADD) of fVars were likely to be involved in AD development. Disruption of each gene led to cellular amyloid-β and phosphorylated tau changes, supporting the existence of multiple likely causal genes at 11p11.2. CONCLUSIONS Multiple variants and genes at 11p11.2 may contribute to AD risk. This finding provides new insights into the mechanistic and therapeutic challenges of AD.
Collapse
Affiliation(s)
- Min Xu
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province and KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Qianjin Liu
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province and KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Rui Bi
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province and KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China; National Resource Center for Non-Human Primates, National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| | - Yu Li
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province and KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Hongli Li
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province and KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Wei-Bo Kang
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province and KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Zhongjiang Yan
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province and KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Quanzhen Zheng
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province and KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Chunli Sun
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province and KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Maosen Ye
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province and KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Bo-Lin Xiang
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province and KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Xiong-Jian Luo
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province and KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| | - Ming Li
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province and KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China
| | - Deng-Feng Zhang
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province and KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China; National Resource Center for Non-Human Primates, National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China.
| | - Yong-Gang Yao
- Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province and KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, China; National Resource Center for Non-Human Primates, National Research Facility for Phenotypic & Genetic Analysis of Model Animals (Primate Facility), Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, China; Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China.
| |
Collapse
|
22
|
Mostafavi H, Spence JP, Naqvi S, Pritchard JK. Systematic differences in discovery of genetic effects on gene expression and complex traits. Nat Genet 2023; 55:1866-1875. [PMID: 37857933 DOI: 10.1038/s41588-023-01529-1] [Citation(s) in RCA: 33] [Impact Index Per Article: 33.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2022] [Accepted: 09/14/2023] [Indexed: 10/21/2023]
Abstract
Most signals in genome-wide association studies (GWAS) of complex traits implicate noncoding genetic variants with putative gene regulatory effects. However, currently identified regulatory variants, notably expression quantitative trait loci (eQTLs), explain only a small fraction of GWAS signals. Here, we show that GWAS and cis-eQTL hits are systematically different: eQTLs cluster strongly near transcription start sites, whereas GWAS hits do not. Genes near GWAS hits are enriched in key functional annotations, are under strong selective constraint and have complex regulatory landscapes across different tissue/cell types, whereas genes near eQTLs are depleted of most functional annotations, show relaxed constraint, and have simpler regulatory landscapes. We describe a model to understand these observations, including how natural selection on complex traits hinders discovery of functionally relevant eQTLs. Our results imply that GWAS and eQTL studies are systematically biased toward different types of variant, and support the use of complementary functional approaches alongside the next generation of eQTL studies.
Collapse
Affiliation(s)
| | | | - Sahin Naqvi
- Department of Genetics, Stanford University, Stanford, CA, USA
- Department of Chemical and Systems Biology, Stanford University, Stanford, CA, USA
| | - Jonathan K Pritchard
- Department of Genetics, Stanford University, Stanford, CA, USA.
- Department of Biology, Stanford University, Stanford, CA, USA.
| |
Collapse
|
23
|
Cai M, Wang Z, Xiao J, Hu X, Chen G, Yang C. XMAP: Cross-population fine-mapping by leveraging genetic diversity and accounting for confounding bias. Nat Commun 2023; 14:6870. [PMID: 37898663 PMCID: PMC10613261 DOI: 10.1038/s41467-023-42614-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Accepted: 10/17/2023] [Indexed: 10/30/2023] Open
Abstract
Fine-mapping prioritizes risk variants identified by genome-wide association studies (GWASs), serving as a critical step to uncover biological mechanisms underlying complex traits. However, several major challenges still remain for existing fine-mapping methods. First, the strong linkage disequilibrium among variants can limit the statistical power and resolution of fine-mapping. Second, it is computationally expensive to simultaneously search for multiple causal variants. Third, the confounding bias hidden in GWAS summary statistics can produce spurious signals. To address these challenges, we develop a statistical method for cross-population fine-mapping (XMAP) by leveraging genetic diversity and accounting for confounding bias. By using cross-population GWAS summary statistics from global biobanks and genomic consortia, we show that XMAP can achieve greater statistical power, better control of false positive rate, and substantially higher computational efficiency for identifying multiple causal signals, compared to existing methods. Importantly, we show that the output of XMAP can be integrated with single-cell datasets, which greatly improves the interpretation of putative causal variants in their cellular context at single-cell resolution.
Collapse
Affiliation(s)
- Mingxuan Cai
- Department of Biostatistics, City University of Hong Kong, Hong Kong SAR, China.
| | - Zhiwei Wang
- Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou, 511458, China
- Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong SAR, China
| | - Jiashun Xiao
- Shenzhen Research Institute of Big Data, Shenzhen, 518172, China
| | - Xianghong Hu
- Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou, 511458, China
- Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong SAR, China
| | - Gang Chen
- Hunan Provincial Key Lab on Bioinformatics, School of Computer Science and Engineering, Central South University, Changsha, 410083, China
- WeGene, Shenzhen Zaozhidao Technology Co., Ltd, Shenzhen, 518040, China
- Graduate Affairs, Faculty of Medicine, Chulalongkorn University, 10330, Bangkok, Thailand
| | - Can Yang
- Guangzhou HKUST Fok Ying Tung Research Institute, Guangzhou, 511458, China.
- Department of Mathematics, The Hong Kong University of Science and Technology, Hong Kong SAR, China.
| |
Collapse
|
24
|
McAfee JC, Lee S, Lee J, Bell JL, Krupa O, Davis J, Insigne K, Bond ML, Zhao N, Boyle AP, Phanstiel DH, Love MI, Stein JL, Ruzicka WB, Davila-Velderrain J, Kosuri S, Won H. Systematic investigation of allelic regulatory activity of schizophrenia-associated common variants. CELL GENOMICS 2023; 3:100404. [PMID: 37868037 PMCID: PMC10589626 DOI: 10.1016/j.xgen.2023.100404] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 02/23/2023] [Accepted: 08/21/2023] [Indexed: 10/24/2023]
Abstract
Genome-wide association studies (GWASs) have successfully identified 145 genomic regions that contribute to schizophrenia risk, but linkage disequilibrium makes it challenging to discern causal variants. We performed a massively parallel reporter assay (MPRA) on 5,173 fine-mapped schizophrenia GWAS variants in primary human neural progenitors and identified 439 variants with allelic regulatory effects (MPRA-positive variants). Transcription factor binding had modest predictive power, while fine-map posterior probability, enhancer overlap, and evolutionary conservation failed to predict MPRA-positive variants. Furthermore, 64% of MPRA-positive variants did not exhibit expressive quantitative trait loci signature, suggesting that MPRA could identify yet unexplored variants with regulatory potentials. To predict the combinatorial effect of MPRA-positive variants on gene regulation, we propose an accessibility-by-contact model that combines MPRA-measured allelic activity with neuronal chromatin architecture.
Collapse
Affiliation(s)
- Jessica C. McAfee
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Curriculum in Genetics and Molecular Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Sool Lee
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jiseok Lee
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jessica L. Bell
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Oleh Krupa
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jessica Davis
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA 90095, USA
- UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Quantitative and Computational Biology Institute, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Kimberly Insigne
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA 90095, USA
- UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Quantitative and Computational Biology Institute, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Marielle L. Bond
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Curriculum in Genetics and Molecular Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Nanxiang Zhao
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Alan P. Boyle
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
- Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Douglas H. Phanstiel
- Curriculum in Genetics and Molecular Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Curriculum in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Thurston Arthritis Research Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Department of Cell Biology and Physiology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Michael I. Love
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Jason L. Stein
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - W. Brad Ruzicka
- Laboratory for Epigenomics in Human Psychopathology, McLean Hospital, Belmont, MA 02141, USA
- Harvard Medical School, Boston, MA 02115, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | | | - Sriram Kosuri
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA 90095, USA
- UCLA-DOE Institute for Genomics and Proteomics, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Molecular Biology Institute, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Quantitative and Computational Biology Institute, University of California, Los Angeles, Los Angeles, CA 90095, USA
- Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Hyejung Won
- Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
- Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
25
|
Qi G, Strober BJ, Popp JM, Keener R, Ji H, Battle A. Single-cell allele-specific expression analysis reveals dynamic and cell-type-specific regulatory effects. Nat Commun 2023; 14:6317. [PMID: 37813843 PMCID: PMC10562474 DOI: 10.1038/s41467-023-42016-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2023] [Accepted: 09/27/2023] [Indexed: 10/11/2023] Open
Abstract
Differential allele-specific expression (ASE) is a powerful tool to study context-specific cis-regulation of gene expression. Such effects can reflect the interaction between genetic or epigenetic factors and a measured context or condition. Single-cell RNA sequencing (scRNA-seq) allows the measurement of ASE at individual-cell resolution, but there is a lack of statistical methods to analyze such data. We present Differential Allelic Expression using Single-Cell data (DAESC), a powerful method for differential ASE analysis using scRNA-seq from multiple individuals, with statistical behavior confirmed through simulation. DAESC accounts for non-independence between cells from the same individual and incorporates implicit haplotype phasing. Application to data from 105 induced pluripotent stem cell (iPSC) lines identifies 657 genes dynamically regulated during endoderm differentiation, with enrichment for changes in chromatin state. Application to a type-2 diabetes dataset identifies several differentially regulated genes between patients and controls in pancreatic endocrine cells. DAESC is a powerful method for single-cell ASE analysis and can uncover novel insights on gene regulation.
Collapse
Affiliation(s)
- Guanghao Qi
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
- Department of Biostatistics, University of Washington, Seattle, WA, 98195, USA
| | - Benjamin J Strober
- Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, 02115, USA
| | - Joshua M Popp
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Rebecca Keener
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Hongkai Ji
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, 21205, USA
| | - Alexis Battle
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA.
- Department of Computer Science, Johns Hopkins University, Baltimore, MD, 21218, USA.
- Department of Genetic Medicine, Johns Hopkins University, Baltimore, MD, 21205, USA.
| |
Collapse
|
26
|
Pun FW, Ozerov IV, Zhavoronkov A. AI-powered therapeutic target discovery. Trends Pharmacol Sci 2023; 44:561-572. [PMID: 37479540 DOI: 10.1016/j.tips.2023.06.010] [Citation(s) in RCA: 26] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 06/20/2023] [Accepted: 06/23/2023] [Indexed: 07/23/2023]
Abstract
Disease modeling and target identification are the most crucial initial steps in drug discovery, and influence the probability of success at every step of drug development. Traditional target identification is a time-consuming process that takes years to decades and usually starts in an academic setting. Given its advantages of analyzing large datasets and intricate biological networks, artificial intelligence (AI) is playing a growing role in modern drug target identification. We review recent advances in target discovery, focusing on breakthroughs in AI-driven therapeutic target exploration. We also discuss the importance of striking a balance between novelty and confidence in target selection. An increasing number of AI-identified targets are being validated through experiments and several AI-derived drugs are entering clinical trials; we highlight current limitations and potential pathways for moving forward.
Collapse
Affiliation(s)
- Frank W Pun
- Insilico Medicine Hong Kong Ltd., Hong Kong Science and Technology Park, New Territories, Hong Kong
| | - Ivan V Ozerov
- Insilico Medicine Hong Kong Ltd., Hong Kong Science and Technology Park, New Territories, Hong Kong
| | - Alex Zhavoronkov
- Insilico Medicine Hong Kong Ltd., Hong Kong Science and Technology Park, New Territories, Hong Kong; Insilico Medicine MENA, 6F IRENA Building, Abu Dhabi, United Arab Emirates; Buck Institute for Research on Aging, Novato, CA, USA.
| |
Collapse
|
27
|
Wang YN, Gan T, Qu S, Xu LL, Hu Y, Liu LJ, Shi SF, Lv JC, Tsoi LC, Patrick MT, He K, Berthier CC, Xu HJ, Zhou XJ, Zhang H. MTMR3 risk alleles enhance Toll Like Receptor 9-induced IgA immunity in IgA nephropathy. Kidney Int 2023; 104:562-576. [PMID: 37414396 DOI: 10.1016/j.kint.2023.06.018] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Revised: 05/29/2023] [Accepted: 06/15/2023] [Indexed: 07/08/2023]
Abstract
Multiple genome-wide association studies (GWASs) have reproducibly identified the MTMR3/HORMAD2/LIF/OSM locus to be associated with IgA nephropathy (IgAN). However, the causal variant(s), implicated gene(s), and altered mechanisms remain poorly understood. Here, we performed fine-mapping analyses based on GWAS datasets encompassing 2762 IgAN cases and 5803 control individuals, and identified rs4823074 as the candidate causal variant that intersects the MTMR3 promoter in B-lymphoblastoid cells. Mendelian randomization studies suggested the risk allele may modulate disease susceptibility by affecting serum IgA levels through increased MTMR3 expression. Consistently, elevated MTMR3 expression in peripheral blood mononuclear cells was observed in patients with IgAN. Further mechanistic studies in vitro demonstrated that MTMR3 increased IgA production dependent upon its phosphatidylinositol 3-phosphate binding domain. Moreover, our study provided the in vivo functional evidence that Mtmr3-/- mice exhibited defective Toll Like Receptor 9-induced IgA production, glomerular IgA deposition, as well as mesangial cell proliferation. RNA-seq and pathway analyses showed that MTMR3 deficiency resulted in an impaired intestinal immune network for IgA production. Thus, our results support the role of MTMR3 in IgAN pathogenesis by enhancing Toll Like Receptor 9-induced IgA immunity.
Collapse
Affiliation(s)
- Yan-Na Wang
- Renal Division, Department of Medicine, Peking University First Hospital, Beijing, People's Republic of China; Peking University Institute of Nephrology, Peking University, Beijing, People's Republic of China; Key Laboratory of Renal Disease, Ministry of Health of China, Beijing, People's Republic of China; Key Laboratory of Chronic Kidney Disease Prevention and Treatment (Peking University), Ministry of Education, Beijing, People's Republic of China; State Key Laboratory of Vascular Homeostasis and Remodeling, Peking University, Beijing, People's Republic of China
| | - Ting Gan
- Renal Division, Department of Medicine, Peking University First Hospital, Beijing, People's Republic of China; Peking University Institute of Nephrology, Peking University, Beijing, People's Republic of China; Key Laboratory of Renal Disease, Ministry of Health of China, Beijing, People's Republic of China; Key Laboratory of Chronic Kidney Disease Prevention and Treatment (Peking University), Ministry of Education, Beijing, People's Republic of China; State Key Laboratory of Vascular Homeostasis and Remodeling, Peking University, Beijing, People's Republic of China
| | - Shu Qu
- Renal Division, Department of Medicine, Peking University First Hospital, Beijing, People's Republic of China; Peking University Institute of Nephrology, Peking University, Beijing, People's Republic of China; Key Laboratory of Renal Disease, Ministry of Health of China, Beijing, People's Republic of China; Key Laboratory of Chronic Kidney Disease Prevention and Treatment (Peking University), Ministry of Education, Beijing, People's Republic of China; State Key Laboratory of Vascular Homeostasis and Remodeling, Peking University, Beijing, People's Republic of China
| | - Lin-Lin Xu
- Renal Division, Department of Medicine, Peking University First Hospital, Beijing, People's Republic of China; Peking University Institute of Nephrology, Peking University, Beijing, People's Republic of China; Key Laboratory of Renal Disease, Ministry of Health of China, Beijing, People's Republic of China; Key Laboratory of Chronic Kidney Disease Prevention and Treatment (Peking University), Ministry of Education, Beijing, People's Republic of China; State Key Laboratory of Vascular Homeostasis and Remodeling, Peking University, Beijing, People's Republic of China
| | - Yong Hu
- Beijing Institute of Biotechnology, Beijing, People's Republic of China
| | - Li-Jun Liu
- Renal Division, Department of Medicine, Peking University First Hospital, Beijing, People's Republic of China; Peking University Institute of Nephrology, Peking University, Beijing, People's Republic of China; Key Laboratory of Renal Disease, Ministry of Health of China, Beijing, People's Republic of China; Key Laboratory of Chronic Kidney Disease Prevention and Treatment (Peking University), Ministry of Education, Beijing, People's Republic of China; State Key Laboratory of Vascular Homeostasis and Remodeling, Peking University, Beijing, People's Republic of China
| | - Su-Fang Shi
- Renal Division, Department of Medicine, Peking University First Hospital, Beijing, People's Republic of China; Peking University Institute of Nephrology, Peking University, Beijing, People's Republic of China; Key Laboratory of Renal Disease, Ministry of Health of China, Beijing, People's Republic of China; Key Laboratory of Chronic Kidney Disease Prevention and Treatment (Peking University), Ministry of Education, Beijing, People's Republic of China; State Key Laboratory of Vascular Homeostasis and Remodeling, Peking University, Beijing, People's Republic of China
| | - Ji-Cheng Lv
- Renal Division, Department of Medicine, Peking University First Hospital, Beijing, People's Republic of China; Peking University Institute of Nephrology, Peking University, Beijing, People's Republic of China; Key Laboratory of Renal Disease, Ministry of Health of China, Beijing, People's Republic of China; Key Laboratory of Chronic Kidney Disease Prevention and Treatment (Peking University), Ministry of Education, Beijing, People's Republic of China; State Key Laboratory of Vascular Homeostasis and Remodeling, Peking University, Beijing, People's Republic of China
| | - Lam C Tsoi
- Department of Dermatology, University of Michigan Medical School, Ann Arbor, Michigan, USA; Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA
| | - Matthew T Patrick
- Department of Dermatology, University of Michigan Medical School, Ann Arbor, Michigan, USA; Department of Biostatistics, Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA
| | - Kevin He
- Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA; Kidney Epidemiology and Cost Center, School of Public Health, University of Michigan, Ann Arbor, Michigan, USA
| | - Celine C Berthier
- Division of Nephrology, Department of Internal Medicine, University of Michigan, Ann Arbor, Michigan, USA
| | - Hu-Ji Xu
- Department of Rheumatology and Immunology, Shanghai Changzheng Hospital, The Second Military Medical University, Shanghai, People's Republic of China
| | - Xu-Jie Zhou
- Renal Division, Department of Medicine, Peking University First Hospital, Beijing, People's Republic of China; Peking University Institute of Nephrology, Peking University, Beijing, People's Republic of China; Key Laboratory of Renal Disease, Ministry of Health of China, Beijing, People's Republic of China; Key Laboratory of Chronic Kidney Disease Prevention and Treatment (Peking University), Ministry of Education, Beijing, People's Republic of China; State Key Laboratory of Vascular Homeostasis and Remodeling, Peking University, Beijing, People's Republic of China.
| | - Hong Zhang
- Renal Division, Department of Medicine, Peking University First Hospital, Beijing, People's Republic of China; Peking University Institute of Nephrology, Peking University, Beijing, People's Republic of China; Key Laboratory of Renal Disease, Ministry of Health of China, Beijing, People's Republic of China; Key Laboratory of Chronic Kidney Disease Prevention and Treatment (Peking University), Ministry of Education, Beijing, People's Republic of China; State Key Laboratory of Vascular Homeostasis and Remodeling, Peking University, Beijing, People's Republic of China.
| |
Collapse
|
28
|
Fu Y, Kelly JA, Gopalakrishnan J, Pelikan RC, Tessneer KL, Pasula S, Grundahl K, Murphy DA, Gaffney PM. Massively Parallel Reporter Assay Confirms Regulatory Potential of hQTLs and Reveals Important Variants in Lupus and Other Autoimmune Diseases. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.17.553722. [PMID: 37645944 PMCID: PMC10462090 DOI: 10.1101/2023.08.17.553722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
Objective To systematically characterize the potential for histone post-translational modifications, i.e., histone quantitative trait loci (hQTLs), expression QTLs (eQTLs), and variants on systemic lupus erythematosus (SLE) and autoimmune (AI) disease risk haplotypes to modulate gene expression in an allele dependent manner. Methods We designed a massively parallel reporter assay (MPRA) containing ~32K variants and transfected it into an Epstein-Barr virus transformed B cell line generated from an SLE case. Results Our study expands our understanding of hQTLs, illustrating that epigenetic QTLs are more likely to contribute to functional mechanisms than eQTLs and other variant types, and a large proportion of hQTLs overlap transcription start sites (TSS) of noncoding RNAs. In addition, we nominate 17 variants (including 11 novel) as putative causal variants for SLE and another 14 for various other AI diseases, prioritizing these variants for future functional studies primary and immortalized B cells. Conclusion We uncover important insights into the mechanistic relationships between genotype, epigenetics, gene expression, and SLE and AI disease phenotypes.
Collapse
Affiliation(s)
- Yao Fu
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104, USA
| | - Jennifer A Kelly
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104, USA
| | - Jaanam Gopalakrishnan
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104, USA
- Neuro-Immune Regulome Unit, National Eye Institute, National Institute of Health, Bethesda, MD, 20892, USA
| | - Richard C Pelikan
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104, USA
| | - Kandice L Tessneer
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104, USA
| | - Satish Pasula
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104, USA
| | - Kiely Grundahl
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104, USA
| | - David A Murphy
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104, USA
| | - Patrick M Gaffney
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK, 73104, USA
| |
Collapse
|
29
|
Jeong R, Bulyk ML. Blood cell traits' GWAS loci colocalization with variation in PU.1 genomic occupancy prioritizes causal noncoding regulatory variants. CELL GENOMICS 2023; 3:100327. [PMID: 37492098 PMCID: PMC10363807 DOI: 10.1016/j.xgen.2023.100327] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Revised: 02/10/2023] [Accepted: 04/25/2023] [Indexed: 07/27/2023]
Abstract
Genome-wide association studies (GWASs) have uncovered numerous trait-associated loci across the human genome, most of which are located in noncoding regions, making interpretation difficult. Moreover, causal variants are hard to statistically fine-map at many loci because of widespread linkage disequilibrium. To address this challenge, we present a strategy utilizing transcription factor (TF) binding quantitative trait loci (bQTLs) for colocalization analysis to identify trait associations likely mediated by TF occupancy variation and to pinpoint likely causal variants using motif scores. We applied this approach to PU.1 bQTLs in lymphoblastoid cell lines and blood cell trait GWAS data. Colocalization analysis revealed 69 blood cell trait GWAS loci putatively driven by PU.1 occupancy variation. We nominate PU.1 motif-altering variants as the likely shared causal variants at 51 loci. Such integration of TF bQTL data with other GWAS data may reveal transcriptional regulatory mechanisms and causal noncoding variants underlying additional complex traits.
Collapse
Affiliation(s)
- Raehoon Jeong
- Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA
- Bioinformatics and Integrative Genomics Graduate Program, Harvard University, Cambridge, MA 02138, USA
| | - Martha L. Bulyk
- Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA
- Bioinformatics and Integrative Genomics Graduate Program, Harvard University, Cambridge, MA 02138, USA
- Department of Pathology, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
30
|
Oliveros W, Delfosse K, Lato DF, Kiriakopulos K, Mokhtaridoost M, Said A, McMurray BJ, Browning JW, Mattioli K, Meng G, Ellis J, Mital S, Melé M, Maass PG. Systematic characterization of regulatory variants of blood pressure genes. CELL GENOMICS 2023; 3:100330. [PMID: 37492106 PMCID: PMC10363820 DOI: 10.1016/j.xgen.2023.100330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Revised: 03/29/2023] [Accepted: 04/28/2023] [Indexed: 07/27/2023]
Abstract
High blood pressure (BP) is the major risk factor for cardiovascular disease. Genome-wide association studies have identified genetic variants for BP, but functional insights into causality and related molecular mechanisms lag behind. We functionally characterize 4,608 genetic variants in linkage with 135 BP loci in vascular smooth muscle cells and cardiomyocytes by massively parallel reporter assays. High densities of regulatory variants at BP loci (i.e., ULK4, MAP4, CFDP1, PDE5A) indicate that multiple variants drive genetic association. Regulatory variants are enriched in repeats, alter cardiovascular-related transcription factor motifs, and spatially converge with genes controlling specific cardiovascular pathways. Using heuristic scoring, we define likely causal variants, and CRISPR prime editing finally determines causal variants for KCNK9, SFXN2, and PCGF6, which are candidates for developing high BP. Our systems-level approach provides a catalog of functionally relevant variants and their genomic architecture in two trait-relevant cell lines for a better understanding of BP gene regulation.
Collapse
Affiliation(s)
- Winona Oliveros
- Life Sciences Department, Barcelona Supercomputing Center, 08034 Barcelona, Catalonia, Spain
| | - Kate Delfosse
- Genetics & Genome Biology Program, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
| | - Daniella F. Lato
- Genetics & Genome Biology Program, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
| | - Katerina Kiriakopulos
- Genetics & Genome Biology Program, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Milad Mokhtaridoost
- Genetics & Genome Biology Program, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
| | - Abdelrahman Said
- Genetics & Genome Biology Program, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
| | - Brandon J. McMurray
- Genetics & Genome Biology Program, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
| | - Jared W.L. Browning
- Genetics & Genome Biology Program, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Kaia Mattioli
- Division of Genetics, Department of Medicine, Brigham & Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Guoliang Meng
- Developmental and Stem Cell Biology Program, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
| | - James Ellis
- Developmental and Stem Cell Biology Program, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Seema Mital
- Genetics & Genome Biology Program, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
- Ted Rogers Centre for Heart Research, Toronto, ON M5G 1X8, Canada
- Department of Pediatrics, The Hospital for Sick Children, University of Toronto, Toronto, ON M5G 0A4, Canada
| | - Marta Melé
- Life Sciences Department, Barcelona Supercomputing Center, 08034 Barcelona, Catalonia, Spain
| | - Philipp G. Maass
- Genetics & Genome Biology Program, The Hospital for Sick Children, Toronto, ON M5G 0A4, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
31
|
Yuan K, Longchamps RJ, Pardiñas AF, Yu M, Chen TT, Lin SC, Chen Y, Lam M, Liu R, Xia Y, Guo Z, Shi W, Shen C, Daly MJ, Neale BM, Feng YCA, Lin YF, Chen CY, O'Donovan M, Ge T, Huang H. Fine-mapping across diverse ancestries drives the discovery of putative causal variants underlying human complex traits and diseases. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.01.07.23284293. [PMID: 36711496 PMCID: PMC9882563 DOI: 10.1101/2023.01.07.23284293] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Genome-wide association studies (GWAS) of human complex traits or diseases often implicate genetic loci that span hundreds or thousands of genetic variants, many of which have similar statistical significance. While statistical fine-mapping in individuals of European ancestries has made important discoveries, cross-population fine-mapping has the potential to improve power and resolution by capitalizing on the genomic diversity across ancestries. Here we present SuSiEx, an accurate and computationally efficient method for cross-population fine-mapping, which builds on the single-population fine-mapping framework, Sum of Single Effects (SuSiE). SuSiEx integrates data from an arbitrary number of ancestries, explicitly models population-specific allele frequencies and LD patterns, accounts for multiple causal variants in a genomic region, and can be applied to GWAS summary statistics. We comprehensively evaluated SuSiEx using simulations, a range of quantitative traits measured in both UK Biobank and Taiwan Biobank, and schizophrenia GWAS across East Asian and European ancestries. In all evaluations, SuSiEx fine-mapped more association signals, produced smaller credible sets and higher posterior inclusion probability (PIP) for putative causal variants, and captured population-specific causal variants.
Collapse
Affiliation(s)
- Kai Yuan
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, the Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Ryan J Longchamps
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, the Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Antonio F Pardiñas
- MRC Centre for Neuropsychiatric Genetics and Genomics, Cardiff University School of Medicine, Cardiff, UK
| | - Mingrui Yu
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, the Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Tzu-Ting Chen
- Center for Neuropsychiatric Research, National Health Research Institutes, Miaoli, Taiwan
| | - Shu-Chin Lin
- Center for Neuropsychiatric Research, National Health Research Institutes, Miaoli, Taiwan
| | - Yu Chen
- Center for Medical Genetics & Hunan Key Laboratory of Medical Genetics, School of Life Sciences and Department of Psychiatry, The Second Xiangya Hospital, Central South University, Changsha, China
| | - Max Lam
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Human Genetics, Genome Institute of Singapore, Singapore, Singapore
- Division of Psychiatry Research, the Zucker Hillside Hospital, Northwell Health, Glen Oaks, NY, USA
- Research Division Institute of Mental Health Singapore, Singapore, Singapore
| | - Ruize Liu
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, the Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Yan Xia
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, the Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Zhenglin Guo
- Stanley Center for Psychiatric Research, the Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Wenzhao Shi
- Digital Health China Technologies Corp. Ltd., Beijing, China
| | - Chengguo Shen
- Digital Health China Technologies Corp. Ltd., Beijing, China
| | - Mark J Daly
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, the Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Benjamin M Neale
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, the Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Yen-Chen A Feng
- Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan
| | - Yen-Feng Lin
- Center for Neuropsychiatric Research, National Health Research Institutes, Miaoli, Taiwan
- Department of Public Health & Medical Humanities, School of Medicine, National Yang Ming Chiao Tung University, Taipei, Taiwan
- Institute of Behavioral Medicine, College of Medicine, National Cheng Kung University
| | | | - Michael O'Donovan
- MRC Centre for Neuropsychiatric Genetics and Genomics, Cardiff University School of Medicine, Cardiff, UK
| | - Tian Ge
- Stanley Center for Psychiatric Research, the Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Psychiatric and Neurodevelopmental Genetics Unit, Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Psychiatry, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Hailiang Huang
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, the Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
32
|
Nonneman DJ, Lents CA. Functional genomics of reproduction in pigs: Are we there yet? Mol Reprod Dev 2023; 90:436-444. [PMID: 35704517 DOI: 10.1002/mrd.23625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Revised: 05/26/2022] [Accepted: 06/06/2022] [Indexed: 11/09/2022]
Abstract
Reproductive failure is the main reason for culling females in swine herds and is both a financial and sustainability issue. Because reproductive traits are complex and lowly to moderately heritable, genomic selection within populations can achieve substantial genetic gain in reproductive efficiency. A better understanding of the physiological components affecting the expression of these traits will facilitate greater understanding of the genes affecting reproductive traits and is necessary to improve and optimize management strategies to maximize reproductive success of gilts and sows. Large-scale genotyping with single-nucleotide polymorphism (SNP) arrays are used for genome-wide association studies (GWAS) and have facilitated identification of positional candidate genes. Transcriptomic data can be used to weight SNP for GWAS and could lead to previously unidentified candidate genes. Resequencing and fine mapping of candidate genes are necessary to identify putative functional variants and some of these have been incorporated into new genotyping arrays. Sequence imputation and genotype by sequence are newer strategies that could reveal novel functional mutations. In this study, these approaches are discussed. Advantages and limitations are highlighted where additional research is needed.
Collapse
Affiliation(s)
- Dan J Nonneman
- United States Department of Agriculture, Agriculture Research Service, U.S. Meat Animal Research Center, Clay Center, Nebraska, USA
| | - Clay A Lents
- United States Department of Agriculture, Agriculture Research Service, U.S. Meat Animal Research Center, Clay Center, Nebraska, USA
| |
Collapse
|
33
|
Karhunen V, Launonen I, Järvelin MR, Sebert S, Sillanpää MJ. Genetic fine-mapping from summary data using a nonlocal prior improves the detection of multiple causal variants. Bioinformatics 2023; 39:btad396. [PMID: 37348543 PMCID: PMC10326304 DOI: 10.1093/bioinformatics/btad396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 06/09/2023] [Accepted: 06/20/2023] [Indexed: 06/24/2023] Open
Abstract
MOTIVATION Genome-wide association studies (GWAS) have been successful in identifying genomic loci associated with complex traits. Genetic fine-mapping aims to detect independent causal variants from the GWAS-identified loci, adjusting for linkage disequilibrium patterns. RESULTS We present "FiniMOM" (fine-mapping using a product inverse-moment prior), a novel Bayesian fine-mapping method for summarized genetic associations. For causal effects, the method uses a nonlocal inverse-moment prior, which is a natural prior distribution to model non-null effects in finite samples. A beta-binomial prior is set for the number of causal variants, with a parameterization that can be used to control for potential misspecifications in the linkage disequilibrium reference. The results of simulations studies aimed to mimic a typical GWAS on circulating protein levels show improved credible set coverage and power of the proposed method over current state-of-the-art fine-mapping method SuSiE, especially in the case of multiple causal variants within a locus. AVAILABILITY AND IMPLEMENTATION https://vkarhune.github.io/finimom/.
Collapse
Affiliation(s)
- Ville Karhunen
- Research Unit of Mathematical Sciences, University of Oulu, Oulu, P.O.Box 8000, FI-90014, Finland
- Research Unit of Population Health, University of Oulu, Oulu, Finland
| | - Ilkka Launonen
- Research Unit of Mathematical Sciences, University of Oulu, Oulu, P.O.Box 8000, FI-90014, Finland
| | - Marjo-Riitta Järvelin
- Research Unit of Population Health, University of Oulu, Oulu, Finland
- Department of Epidemiology and Biostatistics, Imperial College London, London, United Kingdom
- Department of Life Sciences, College of Health and Life Sciences, Brunel University, London, United Kingdom
| | - Sylvain Sebert
- Research Unit of Population Health, University of Oulu, Oulu, Finland
| | - Mikko J Sillanpää
- Research Unit of Mathematical Sciences, University of Oulu, Oulu, P.O.Box 8000, FI-90014, Finland
| |
Collapse
|
34
|
Schlosser P, Zhang J, Liu H, Surapaneni AL, Rhee EP, Arking DE, Yu B, Boerwinkle E, Welling PA, Chatterjee N, Susztak K, Coresh J, Grams ME. Transcriptome- and proteome-wide association studies nominate determinants of kidney function and damage. Genome Biol 2023; 24:150. [PMID: 37365616 DOI: 10.1186/s13059-023-02993-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Accepted: 06/15/2023] [Indexed: 06/28/2023] Open
Abstract
BACKGROUND The pathophysiological causes of kidney disease are not fully understood. Here we show that the integration of genome-wide genetic, transcriptomic, and proteomic association studies can nominate causal determinants of kidney function and damage. RESULTS Through transcriptome-wide association studies (TWAS) in kidney cortex, kidney tubule, liver, and whole blood and proteome-wide association studies (PWAS) in plasma, we assess for effects of 12,893 genes and 1342 proteins on kidney filtration (glomerular filtration rate (GFR) estimated by creatinine; GFR estimated by cystatin C; and blood urea nitrogen) and kidney damage (albuminuria). We find 1561 associations distributed among 260 genomic regions that are supported as putatively causal. We then prioritize 153 of these genomic regions using additional colocalization analyses. Our genome-wide findings are supported by existing knowledge (animal models for MANBA, DACH1, SH3YL1, INHBB), exceed the underlying GWAS signals (28 region-trait combinations without significant GWAS hit), identify independent gene/protein-trait associations within the same genomic region (INHBC, SPRYD4), nominate tissues underlying the associations (tubule expression of NRBP1), and distinguish markers of kidney filtration from those with a role in creatinine and cystatin C metabolism. Furthermore, we follow up on members of the TGF-beta superfamily of proteins and find a prognostic value of INHBC for kidney disease progression even after adjustment for measured glomerular filtration rate (GFR). CONCLUSION In summary, this study combines multimodal, genome-wide association studies to generate a catalog of putatively causal target genes and proteins relevant to kidney function and damage which can guide follow-up studies in physiology, basic science, and clinical medicine.
Collapse
Affiliation(s)
- Pascal Schlosser
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.
| | - Jingning Zhang
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Hongbo Liu
- Department of Medicine and Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Aditya L Surapaneni
- Welch Center for Prevention Epidemiology and Clinical Research, Johns Hopkins University, Baltimore, MD, USA
- Division of Precision Medicine, New York University Grossman School of Medicine, New York, NY, USA
| | - Eugene P Rhee
- Nephrology Division and Endocrine Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Dan E Arking
- McKusick-Nathans Institute, Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Bing Yu
- Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, University of Texas Health Science Center at Houston, Houston, TX, USA
| | - Eric Boerwinkle
- Epidemiology, Human Genetics and Environmental Sciences, School of Public Health, University of Texas Health Science Center at Houston, Houston, TX, USA
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA
| | - Paul A Welling
- Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Physiology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Nilanjan Chatterjee
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Katalin Susztak
- Department of Medicine and Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Josef Coresh
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Morgan E Grams
- Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
- Division of Precision Medicine, New York University Grossman School of Medicine, New York, NY, USA
| |
Collapse
|
35
|
Fabo T, Khavari P. Functional characterization of human genomic variation linked to polygenic diseases. Trends Genet 2023; 39:462-490. [PMID: 36997428 PMCID: PMC11025698 DOI: 10.1016/j.tig.2023.02.014] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 02/22/2023] [Accepted: 02/23/2023] [Indexed: 03/30/2023]
Abstract
The burden of human disease lies predominantly in polygenic diseases. Since the early 2000s, genome-wide association studies (GWAS) have identified genetic variants and loci associated with complex traits. These have ranged from variants in coding sequences to mutations in regulatory regions, such as promoters and enhancers, as well as mutations affecting mediators of mRNA stability and other downstream regulators, such as 5' and 3'-untranslated regions (UTRs), long noncoding RNA (lncRNA), and miRNA. Recent research advances in genetics have utilized a combination of computational techniques, high-throughput in vitro and in vivo screening modalities, and precise genome editing to impute the function of diverse classes of genetic variants identified through GWAS. In this review, we highlight the vastness of genomic variants associated with polygenic disease risk and address recent advances in how genetic tools can be used to functionally characterize them.
Collapse
Affiliation(s)
- Tania Fabo
- Program in Epithelial Biology, Stanford University, Stanford, CA, USA; Stanford Cancer Institute, Stanford University, Stanford, CA, USA; Graduate Program in Genetics, Stanford University, Stanford, CA, USA; Stanford University School of Medicine, Stanford University, Stanford, CA, USA
| | - Paul Khavari
- Program in Epithelial Biology, Stanford University, Stanford, CA, USA; Stanford Cancer Institute, Stanford University, Stanford, CA, USA; Graduate Program in Genetics, Stanford University, Stanford, CA, USA; Stanford University School of Medicine, Stanford University, Stanford, CA, USA; Veterans Affairs Palo Alto Healthcare System, Palo Alto, CA, USA.
| |
Collapse
|
36
|
López Rodríguez M, Arasu UT, Kaikkonen MU. Exploring the genetic basis of coronary artery disease using functional genomics. Atherosclerosis 2023; 374:87-98. [PMID: 36801133 DOI: 10.1016/j.atherosclerosis.2023.01.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 01/20/2023] [Accepted: 01/24/2023] [Indexed: 02/05/2023]
Abstract
Genome-wide Association Studies (GWAS) have identified more than 300 loci associated with coronary artery disease (CAD), defining the genetic risk map of the disease. However, the translation of the association signals into biological-pathophysiological mechanisms constitute a major challenge. Through a group of examples of studies focused on CAD, we discuss the rationale, basic principles and outcomes of the main methodologies implemented to prioritize and characterize causal variants and their target genes. Additionally, we highlight the strategies as well as the current methods that integrate association and functional genomics data to dissect the cellular specificity underlying the complexity of disease mechanisms. Despite the limitations of existing approaches, the increasing knowledge generated through functional studies helps interpret GWAS maps and opens novel avenues for the clinical usability of association data.
Collapse
Affiliation(s)
- Maykel López Rodríguez
- A. I. Virtanen Institute for Molecular Sciences, University of Eastern Finland, Kuopio, 70211, Finland; Department of Pathology and Laboratory Medicine, University of California, UCLA, Los Angeles, USA.
| | - Uma Thanigai Arasu
- A. I. Virtanen Institute for Molecular Sciences, University of Eastern Finland, Kuopio, 70211, Finland
| | - Minna U Kaikkonen
- A. I. Virtanen Institute for Molecular Sciences, University of Eastern Finland, Kuopio, 70211, Finland.
| |
Collapse
|
37
|
Zhang R, Jia G, Diao X. geneHapR: an R package for gene haplotypic statistics and visualization. BMC Bioinformatics 2023; 24:199. [PMID: 37189023 DOI: 10.1186/s12859-023-05318-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Accepted: 05/04/2023] [Indexed: 05/17/2023] Open
Abstract
BACKGROUND Together with application of next-generation sequencing technologies and increased accumulation of genomic variation data in different organism species, an opportunity for effectively identification of superior alleles of functional genes to facilitate marker-assisted selection is emerging, and the clarification of haplotypes of functional genes is becoming an essential target in recent study works. RESULTS In this paper, we describe an R package 'geneHapR' developed for haplotypes identification, statistics and visualization analysis of candidate genes. This package could integrate genotype data, genomic annotating information and phenotypic variation data to clarify genotype variations, evolutionary-ship, and morphological effects among haplotypes through variants visualization, network construction and phenotypic comparison. 'geneHapR' also provides functions for Linkage Disequilibrium block analysis and visualizing of haplotypes geo-distribution. CONCLUSIONS The R package 'geneHapR' provided an easy-to-use tool for haplotype identification, statistic and visualization for candidate gene and will provide useful clues for gene functional dissection and molecular-assistant pyramiding of beneficial alleles of functional locus in future breeding programs.
Collapse
Affiliation(s)
- Renliang Zhang
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Guanqing Jia
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China.
| | - Xianmin Diao
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, China.
| |
Collapse
|
38
|
Abstract
Autoimmune diseases display a high degree of comorbidity within individuals and families, suggesting shared risk factors. Over the past 15 years, genome-wide association studies have established the polygenic basis of these common conditions and revealed widespread sharing of genetic effects, indicative of a shared immunopathology. Despite ongoing challenges in determining the precise genes and molecular consequences of these risk variants, functional experiments and integration with multimodal genomic data are providing valuable insights into key immune cells and pathways driving these diseases, with potential therapeutic implications. Moreover, genetic studies of ancient populations are shedding light on the contribution of pathogen-driven selection pressures to the increased prevalence of autoimmune disease. This Review summarizes the current understanding of autoimmune disease genetics, including shared effects, mechanisms, and evolutionary origins.
Collapse
Affiliation(s)
- Adil Harroud
- Department of Neurology and Neurosurgery, McGill University, Montréal, Quebec, Canada
- Department of Human Genetics, McGill University, Montréal, Quebec, Canada
- The Neuro (Montreal Neurological Institute and Hospital), McGill University, Montréal, Quebec, Canada
| | - David A Hafler
- Department of Neurology, Yale School of Medicine, New Haven, CT, USA
- Department of Immunobiology, Yale School of Medicine, New Haven, CT, USA
- Broad Institute of MIT and Harvard University, Cambridge, MA, USA
| |
Collapse
|
39
|
Collins MA, Avery R, Albert FW. Substrate-specific effects of natural genetic variation on proteasome activity. PLoS Genet 2023; 19:e1010734. [PMID: 37126494 PMCID: PMC10174532 DOI: 10.1371/journal.pgen.1010734] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 05/11/2023] [Accepted: 04/04/2023] [Indexed: 05/02/2023] Open
Abstract
Protein degradation is an essential biological process that regulates protein abundance and removes misfolded and damaged proteins from cells. In eukaryotes, most protein degradation occurs through the stepwise actions of two functionally distinct entities, the ubiquitin system and the proteasome. Ubiquitin system enzymes attach ubiquitin to cellular proteins, targeting them for degradation. The proteasome then selectively binds and degrades ubiquitinated substrate proteins. Genetic variation in ubiquitin system genes creates heritable differences in the degradation of their substrates. However, the challenges of measuring the degradative activity of the proteasome independently of the ubiquitin system in large samples have limited our understanding of genetic influences on the proteasome. Here, using the yeast Saccharomyces cerevisiae, we built and characterized reporters that provide high-throughput, ubiquitin system-independent measurements of proteasome activity. Using single-cell measurements of proteasome activity from millions of genetically diverse yeast cells, we mapped 15 loci across the genome that influence proteasomal protein degradation. Twelve of these 15 loci exerted specific effects on the degradation of two distinct proteasome substrates, revealing a high degree of substrate-specificity in the genetics of proteasome activity. Using CRISPR-Cas9-based allelic engineering, we resolved a locus to a causal variant in the promoter of RPT6, a gene that encodes a subunit of the proteasome's 19S regulatory particle. The variant increases RPT6 expression, which we show results in increased proteasome activity. Our results reveal the complex genetic architecture of proteasome activity and suggest that genetic influences on the proteasome may be an important source of variation in the many cellular and organismal traits shaped by protein degradation.
Collapse
Affiliation(s)
- Mahlon A. Collins
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Randi Avery
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Frank W. Albert
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Minneapolis, Minnesota, United States of America
| |
Collapse
|
40
|
Link V, Schraiber JG, Fan C, Dinh B, Mancuso N, Chiang CW, Edge MD. Tree-based QTL mapping with expected local genetic relatedness matrices. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.07.536093. [PMID: 37066144 PMCID: PMC10104234 DOI: 10.1101/2023.04.07.536093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/18/2023]
Abstract
Understanding the genetic basis of complex phenotypes is a central pursuit of genetics. Genome-wide Association Studies (GWAS) are a powerful way to find genetic loci associated with phenotypes. GWAS are widely and successfully used, but they face challenges related to the fact that variants are tested for association with a phenotype independently, whereas in reality variants at different sites are correlated because of their shared evolutionary history. One way to model this shared history is through the ancestral recombination graph (ARG), which encodes a series of local coalescent trees. Recent computational and methodological breakthroughs have made it feasible to estimate approximate ARGs from large-scale samples. Here, we explore the potential of an ARG-based approach to quantitative-trait locus (QTL) mapping, echoing existing variance-components approaches. We propose a framework that relies on the conditional expectation of a local genetic relatedness matrix given the ARG (local eGRM). Simulations show that our method is especially beneficial for finding QTLs in the presence of allelic heterogeneity. By framing QTL mapping in terms of the estimated ARG, we can also facilitate the detection of QTLs in understudied populations. We use local eGRM to identify a large-effect BMI locus, the CREBRF gene, in a sample of Native Hawaiians in which it was not previously detectable by GWAS because of a lack of population-specific imputation resources. Our investigations can provide intuition about the benefits of using estimated ARGs in population- and statistical-genetic methods in general.
Collapse
Affiliation(s)
- Vivian Link
- Department of Quantitative and Computational Biology, University of Southern California
| | - Joshua G. Schraiber
- Department of Quantitative and Computational Biology, University of Southern California
| | - Caoqi Fan
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Bryan Dinh
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Nicholas Mancuso
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Charleston W.K. Chiang
- Department of Quantitative and Computational Biology, University of Southern California
- Center for Genetic Epidemiology, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California
| | - Michael D. Edge
- Department of Quantitative and Computational Biology, University of Southern California
| |
Collapse
|
41
|
Jeong R, Bulyk ML. Colocalization of blood cell traits GWAS associations and variation in PU.1 genomic occupancy prioritizes causal noncoding regulatory variants. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.29.534582. [PMID: 37034747 PMCID: PMC10081269 DOI: 10.1101/2023.03.29.534582] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Genome-wide association studies (GWAS) have uncovered numerous trait-associated loci across the human genome, most of which are located in noncoding regions, making interpretations difficult. Moreover, causal variants are hard to statistically fine-map at many loci because of widespread linkage disequilibrium. To address this challenge, we present a strategy utilizing transcription factor (TF) binding quantitative trait loci (bQTLs) for colocalization analysis to identify trait associations likely mediated by TF occupancy variation and to pinpoint likely causal variants using motif scores. We applied this approach to PU.1 bQTLs in lymphoblastoid cell lines and blood cell traits GWAS data. Colocalization analysis revealed 69 blood cell trait GWAS loci putatively driven by PU.1 occupancy variation. We nominate PU.1 motif-altering variants as the likely shared causal variants at 51 loci. Such integration of TF bQTL data with other GWAS data may reveal transcriptional regulatory mechanisms and causal noncoding variants underlying additional complex traits.
Collapse
Affiliation(s)
- Raehoon Jeong
- Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA
- Bioinformatics and Integrative Genomics Graduate Program, Harvard University, Cambridge, MA 02138, USA
| | - Martha L. Bulyk
- Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA
- Bioinformatics and Integrative Genomics Graduate Program, Harvard University, Cambridge, MA 02138, USA
- Department of Pathology, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
42
|
Zheng Y, VanDusen NJ. Massively Parallel Reporter Assays for High-Throughput In Vivo Analysis of Cis-Regulatory Elements. J Cardiovasc Dev Dis 2023; 10:jcdd10040144. [PMID: 37103023 PMCID: PMC10146671 DOI: 10.3390/jcdd10040144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 03/24/2023] [Accepted: 03/27/2023] [Indexed: 03/31/2023] Open
Abstract
The rapid improvement of descriptive genomic technologies has fueled a dramatic increase in hypothesized connections between cardiovascular gene expression and phenotypes. However, in vivo testing of these hypotheses has predominantly been relegated to slow, expensive, and linear generation of genetically modified mice. In the study of genomic cis-regulatory elements, generation of mice featuring transgenic reporters or cis-regulatory element knockout remains the standard approach. While the data obtained is of high quality, the approach is insufficient to keep pace with candidate identification and therefore results in biases introduced during the selection of candidates for validation. However, recent advances across a range of disciplines are converging to enable functional genomic assays that can be conducted in a high-throughput manner. Here, we review one such method, massively parallel reporter assays (MPRAs), in which the activities of thousands of candidate genomic regulatory elements are simultaneously assessed via the next-generation sequencing of a barcoded reporter transcript. We discuss best practices for MPRA design and use, with a focus on practical considerations, and review how this emerging technology has been successfully deployed in vivo. Finally, we discuss how MPRAs are likely to evolve and be used in future cardiovascular research.
Collapse
|
43
|
Zeng B, Bendl J, Deng C, Lee D, Misir R, Reach SM, Kleopoulos SP, Auluck P, Marenco S, Lewis DA, Haroutunian V, Ahituv N, Fullard JF, Hoffman GE, Roussos P. Genetic regulation of cell-type specific chromatin accessibility shapes the etiology of brain diseases. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.02.530826. [PMID: 37090548 PMCID: PMC10120699 DOI: 10.1101/2023.03.02.530826] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/25/2023]
Abstract
Nucleotide variants in cell type-specific gene regulatory elements in the human brain are major risk factors of human disease. We measured chromatin accessibility in sorted neurons and glia from 1,932 samples of human postmortem brain and identified 34,539 open chromatin regions with chromatin accessibility quantitative trait loci (caQTL). Only 10.4% of caQTL are shared between neurons and glia, supporting the cell type specificity of genetic regulation of the brain regulome. Incorporating allele specific chromatin accessibility improves statistical fine-mapping and refines molecular mechanisms underlying disease risk. Using massively parallel reporter assays in induced excitatory neurons, we screened 19,893 brain QTLs, identifying the functional impact of 476 regulatory variants. Combined, this comprehensive resource captures variation in the human brain regulome and provides novel insights into brain disease etiology.
Collapse
Affiliation(s)
- Biao Zeng
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Jaroslav Bendl
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Chengyu Deng
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, 94158, USA
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA, 94158, USA
| | - Donghoon Lee
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Ruth Misir
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Sarah M. Reach
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Steven P. Kleopoulos
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Pavan Auluck
- Human Brain Collection Core, National Institute of Mental Health-Intramural Research Program, Bethesda, MD, USA
| | - Stefano Marenco
- Human Brain Collection Core, National Institute of Mental Health-Intramural Research Program, Bethesda, MD, USA
| | - David A. Lewis
- Translational Neuroscience Program, Department of Psychiatry, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | - Vahram Haroutunian
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Neuroscience, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Mental Illness Research, Education and Clinical Centers, James J. Peters VA Medical Center, Bronx, NY, USA
| | - Nadav Ahituv
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, 94158, USA
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA, 94158, USA
| | - John F. Fullard
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Gabriel E. Hoffman
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Panos Roussos
- Center for Disease Neurogenomics, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA
- Mental Illness Research, Education and Clinical Centers, James J. Peters VA Medical Center, Bronx, NY, USA
| |
Collapse
|
44
|
Koprulu M, Carrasco-Zanini J, Wheeler E, Lockhart S, Kerrison ND, Wareham NJ, Pietzner M, Langenberg C. Proteogenomic links to human metabolic diseases. Nat Metab 2023; 5:516-528. [PMID: 36823471 PMCID: PMC7614946 DOI: 10.1038/s42255-023-00753-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Accepted: 02/01/2023] [Indexed: 02/25/2023]
Abstract
Studying the plasma proteome as the intermediate layer between the genome and the phenome has the potential to identify new disease processes. Here, we conducted a cis-focused proteogenomic analysis of 2,923 plasma proteins measured in 1,180 individuals using antibody-based assays. We (1) identify 256 unreported protein quantitative trait loci (pQTL); (2) demonstrate shared genetic regulation of 224 cis-pQTLs with 575 specific health outcomes, revealing examples for notable metabolic diseases (such as gastrin-releasing peptide as a potential therapeutic target for type 2 diabetes); (3) improve causal gene assignment at 40% (n = 192) of overlapping risk loci; and (4) observe convergence of phenotypic consequences of cis-pQTLs and rare loss-of-function gene burden for 12 proteins, such as TIMD4 for lipoprotein metabolism. Our findings demonstrate the value of integrating complementary proteomic technologies with genomics even at moderate scale to identify new mediators of metabolic diseases with the potential for therapeutic interventions.
Collapse
Affiliation(s)
- Mine Koprulu
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge, UK
| | - Julia Carrasco-Zanini
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge, UK
| | - Eleanor Wheeler
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge, UK
| | - Sam Lockhart
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge, UK
- MRC Metabolic Diseases Unit, Wellcome-MRC Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | - Nicola D Kerrison
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge, UK
| | - Nicholas J Wareham
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge, UK
| | - Maik Pietzner
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge, UK
- Computational Medicine, Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany
- Precision Healthcare University Research Institute, Queen Mary University of London, London, UK
| | - Claudia Langenberg
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge, UK.
- Computational Medicine, Berlin Institute of Health at Charité-Universitätsmedizin Berlin, Berlin, Germany.
- Precision Healthcare University Research Institute, Queen Mary University of London, London, UK.
| |
Collapse
|
45
|
Zhang J, Zhao H. eQTL Studies: from Bulk Tissues to Single Cells. ARXIV 2023:arXiv:2302.11662v1. [PMID: 36866231 PMCID: PMC9980190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
An expression quantitative trait locus (eQTL) is a chromosomal region where genetic variants are associated with the expression levels of certain genes that can be both nearby or distant. The identifications of eQTLs for different tissues, cell types, and contexts have led to better understanding of the dynamic regulations of gene expressions and implications of functional genes and variants for complex traits and diseases. Although most eQTL studies to date have been performed on data collected from bulk tissues, recent studies have demonstrated the importance of cell-type-specific and context-dependent gene regulations in biological processes and disease mechanisms. In this review, we discuss statistical methods that have been developed to enable the detections of cell-type-specific and context-dependent eQTLs from bulk tissues, purified cell types, and single cells. We also discuss the limitations of the current methods and future research opportunities.
Collapse
Affiliation(s)
- Jingfei Zhang
- Information Systems and Operations Management, Emory University
| | - Hongyu Zhao
- Department of Biostatistics, Yale University
| |
Collapse
|
46
|
Deng C, Whalen S, Steyert M, Ziffra R, Przytycki PF, Inoue F, Pereira DA, Capauto D, Norton S, Vaccarino FM, Pollen A, Nowakowski TJ, Ahituv N, Pollard KS. Massively parallel characterization of psychiatric disorder-associated and cell-type-specific regulatory elements in the developing human cortex. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.15.528663. [PMID: 36824845 PMCID: PMC9949039 DOI: 10.1101/2023.02.15.528663] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
Abstract
Nucleotide changes in gene regulatory elements are important determinants of neuronal development and disease. Using massively parallel reporter assays in primary human cells from mid-gestation cortex and cerebral organoids, we interrogated the cis-regulatory activity of 102,767 sequences, including differentially accessible cell-type specific regions in the developing cortex and single-nucleotide variants associated with psychiatric disorders. In primary cells, we identified 46,802 active enhancer sequences and 164 disorder-associated variants that significantly alter enhancer activity. Activity was comparable in organoids and primary cells, suggesting that organoids provide an adequate model for the developing cortex. Using deep learning, we decoded the sequence basis and upstream regulators of enhancer activity. This work establishes a comprehensive catalog of functional gene regulatory elements and variants in human neuronal development.
Collapse
Affiliation(s)
- Chengyu Deng
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco; San Francisco, CA, USA
- Institute for Human Genetics, University of California, San Francisco; San Francisco, CA, USA
| | - Sean Whalen
- Gladstone Institutes; San Francisco, CA, USA
| | - Marilyn Steyert
- Department of Anatomy, University of California, San Francisco; San Francisco, CA, USA
- Department of Psychiatry, University of California, San Francisco; San Francisco, CA, USA
- Department of Neurological Surgery, University of California, San Francisco; San Francisco, CA, USA
| | - Ryan Ziffra
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco; San Francisco, CA, USA
| | | | - Fumitaka Inoue
- Institute for the Advanced Study of Human Biology (WPI-ASHBi), Kyoto University; Kyoto, Japan
| | - Daniela A. Pereira
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco; San Francisco, CA, USA
- Institute for Human Genetics, University of California, San Francisco; San Francisco, CA, USA
- Graduate Program of Genetics, Institute of Biological Sciences, Federal University of Minas Gerais; Belo Horizonte, Minas Gerais, Brazil
| | | | - Scott Norton
- Child Study Center, Yale University; New Haven, CT, USA
| | - Flora M. Vaccarino
- Child Study Center, Yale University; New Haven, CT, USA
- Department of Neuroscience, Yale University; New Haven, CT, USA
| | - Alex Pollen
- Department of Neurology, University of California, San Francisco; San Francisco, CA, USA
- Eli and Edythe Broad Center for Regeneration Medicine and Stem Cell Research, University of California, San Francisco; San Francisco, CA, USA
| | - Tomasz J. Nowakowski
- Department of Anatomy, University of California, San Francisco; San Francisco, CA, USA
- Department of Psychiatry, University of California, San Francisco; San Francisco, CA, USA
- Department of Neurological Surgery, University of California, San Francisco; San Francisco, CA, USA
- Eli and Edythe Broad Center for Regeneration Medicine and Stem Cell Research, University of California, San Francisco; San Francisco, CA, USA
- Chan Zuckerberg Biohub, San Francisco; San Francisco, CA, USA
| | - Nadav Ahituv
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco; San Francisco, CA, USA
- Institute for Human Genetics, University of California, San Francisco; San Francisco, CA, USA
| | - Katherine S. Pollard
- Institute for Human Genetics, University of California, San Francisco; San Francisco, CA, USA
- Gladstone Institutes; San Francisco, CA, USA
- Chan Zuckerberg Biohub, San Francisco; San Francisco, CA, USA
- Department of Epidemiology and Biostatistics, University of California, San Francisco; San Francisco, CA, USA
| |
Collapse
|
47
|
Cui Y, Arnold FJ, Peng F, Wang D, Li JS, Michels S, Wagner EJ, La Spada AR, Li W. Alternative polyadenylation transcriptome-wide association study identifies APA-linked susceptibility genes in brain disorders. Nat Commun 2023; 14:583. [PMID: 36737438 PMCID: PMC9898543 DOI: 10.1038/s41467-023-36311-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Accepted: 01/25/2023] [Indexed: 02/05/2023] Open
Abstract
Alternative polyadenylation (APA) plays an essential role in brain development; however, current transcriptome-wide association studies (TWAS) largely overlook APA in nominating susceptibility genes. Here, we performed a 3' untranslated region (3'UTR) APA TWAS (3'aTWAS) for 11 brain disorders by combining their genome-wide association studies data with 17,300 RNA-seq samples across 2,937 individuals. We identified 354 3'aTWAS-significant genes, including known APA-linked risk genes, such as SNCA in Parkinson's disease. Among these 354 genes, ~57% are not significant in traditional expression- and splicing-TWAS studies, since APA may regulate the translation, localization and protein-protein interaction of the target genes independent of mRNA level expression or splicing. Furthermore, we discovered ATXN3 as a 3'aTWAS-significant gene for amyotrophic lateral sclerosis, and its modulation substantially impacted pathological hallmarks of amyotrophic lateral sclerosis in vitro. Together, 3'aTWAS is a powerful strategy to nominate important APA-linked brain disorder susceptibility genes, most of which are largely overlooked by conventional expression and splicing analyses.
Collapse
Affiliation(s)
- Ya Cui
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA, 92697, USA
| | - Frederick J Arnold
- Departments of Pathology & Laboratory Medicine, Neurology, and Biological Chemistry, School of Medicine, and the UCI Institute for Neurotherapeutics, University of California Irvine, Irvine, CA, 92697, USA
| | - Fanglue Peng
- Department of Molecular and Cellular Biology, University Baylor College of Medicine, Houston, TX, 77030, USA
| | - Dan Wang
- Department of Medicine, Division of Cardiology, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Jason Sheng Li
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA, 92697, USA
| | - Sebastian Michels
- Departments of Pathology & Laboratory Medicine, Neurology, and Biological Chemistry, School of Medicine, and the UCI Institute for Neurotherapeutics, University of California Irvine, Irvine, CA, 92697, USA
| | - Eric J Wagner
- School of Medicine and Dentistry, University of Rochester Medical Center, Rochester, NY, 14642, USA
| | - Albert R La Spada
- Departments of Pathology & Laboratory Medicine, Neurology, and Biological Chemistry, School of Medicine, and the UCI Institute for Neurotherapeutics, University of California Irvine, Irvine, CA, 92697, USA.
| | - Wei Li
- Division of Computational Biomedicine, Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, CA, 92697, USA.
| |
Collapse
|
48
|
Martin-Trujillo A, Garg P, Patel N, Jadhav B, Sharp AJ. Genome-wide evaluation of the effect of short tandem repeat variation on local DNA methylation. Genome Res 2023; 33:184-196. [PMID: 36577521 PMCID: PMC10069470 DOI: 10.1101/gr.277057.122] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Accepted: 12/19/2022] [Indexed: 12/30/2022]
Abstract
Short tandem repeats (STRs) contribute significantly to genetic diversity in humans, including disease-causing variation. Although the effect of STR variation on gene expression has been extensively assessed, their impact on epigenetics has been poorly studied and limited to specific genomic regions. Here, we investigated the hypothesis that some STRs act as independent regulators of local DNA methylation in the human genome and modify risk of common human traits. To address these questions, we first analyzed two independent data sets comprising PCR-free whole-genome sequencing (WGS) and genome-wide DNA methylation levels derived from whole-blood samples in 245 (discovery cohort) and 484 individuals (replication cohort). Using genotypes for 131,635 polymorphic STRs derived from WGS using HipSTR, we identified 11,870 STRs that associated with DNA methylation levels (mSTRs) of 11,774 CpGs (Bonferroni P < 0.001) in our discovery cohort, with 90% successfully replicating in our second cohort. Subsequently, through fine-mapping using CAVIAR we defined 585 of these mSTRs as the likely causal variants underlying the observed associations (fm-mSTRs) and linked a fraction of these to previously reported genome-wide association study signals, providing insights into the mechanisms underlying complex human traits. Furthermore, by integrating gene expression data, we observed that 12.5% of the tested fm-mSTRs also modulate expression levels of nearby genes, reinforcing their regulatory potential. Overall, our findings expand the catalog of functional sequence variants that affect genome regulation, highlighting the importance of incorporating STRs in future genetic association analysis and epigenetics data for the interpretation of trait-associated variants.
Collapse
Affiliation(s)
- Alejandro Martin-Trujillo
- Department of Genetics and Genomic Sciences and Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, Hess Center for Science and Medicine, New York, New York 10029, USA
| | - Paras Garg
- Department of Genetics and Genomic Sciences and Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, Hess Center for Science and Medicine, New York, New York 10029, USA
| | - Nihir Patel
- Department of Genetics and Genomic Sciences and Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, Hess Center for Science and Medicine, New York, New York 10029, USA
| | - Bharati Jadhav
- Department of Genetics and Genomic Sciences and Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, Hess Center for Science and Medicine, New York, New York 10029, USA
| | - Andrew J Sharp
- Department of Genetics and Genomic Sciences and Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, Hess Center for Science and Medicine, New York, New York 10029, USA
| |
Collapse
|
49
|
Li A, Liu S, Bakshi A, Jiang L, Chen W, Zheng Z, Sullivan PF, Visscher PM, Wray NR, Yang J, Zeng J. mBAT-combo: A more powerful test to detect gene-trait associations from GWAS data. Am J Hum Genet 2023; 110:30-43. [PMID: 36608683 PMCID: PMC9892780 DOI: 10.1016/j.ajhg.2022.12.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Accepted: 12/08/2022] [Indexed: 01/07/2023] Open
Abstract
Gene-based association tests aggregate multiple SNP-trait associations into sets defined by gene boundaries and are widely used in post-GWAS analysis. A common approach for gene-based tests is to combine SNPs associations by computing the sum of χ2 statistics. However, this strategy ignores the directions of SNP effects, which could result in a loss of power for SNPs with masking effects, e.g., when the product of two SNP effects and the linkage disequilibrium (LD) correlation is negative. Here, we introduce "mBAT-combo," a set-based test that is better powered than other methods to detect multi-SNP associations in the context of masking effects. We validate the method through simulations and applications to real data. We find that of 35 blood and urine biomarker traits in the UK Biobank, 34 traits show evidence for masking effects in a total of 4,273 gene-trait pairs, indicating that masking effects is common in complex traits. We further validate the improved power of our method in height, body mass index, and schizophrenia with different GWAS sample sizes and show that on average 95.7% of the genes detected only by mBAT-combo with smaller sample sizes can be identified by the single-SNP approach with a 1.7-fold increase in sample sizes. Eleven genes significant only in mBAT-combo for schizophrenia are confirmed by functionally informed fine-mapping or Mendelian randomization integrating gene expression data. The framework of mBAT-combo can be applied to any set of SNPs to refine trait-association signals hidden in genomic regions with complex LD structures.
Collapse
Affiliation(s)
- Ang Li
- Institute for Molecular Biosciences, University of Queensland, Brisbane, QLD, Australia
| | - Shouye Liu
- Institute for Molecular Biosciences, University of Queensland, Brisbane, QLD, Australia
| | - Andrew Bakshi
- Department of Epidemiology and Preventive Medicine, School of Public Health and Preventive Medicine, Monash University, Melbourne, VIC, Australia
| | | | - Wenhan Chen
- Epigenetics Research Laboratory, Genomics and Epigenetics Theme, Garvan Institute of Medical Research, Sydney, NSW, Australia
| | - Zhili Zheng
- Institute for Molecular Biosciences, University of Queensland, Brisbane, QLD, Australia
| | - Patrick F Sullivan
- Department of Medical Epidemiology and Biostatistics, Karolinska Institute, Stockholm, Sweden; Departments of Genetics and Psychiatry, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Peter M Visscher
- Institute for Molecular Biosciences, University of Queensland, Brisbane, QLD, Australia
| | - Naomi R Wray
- Institute for Molecular Biosciences, University of Queensland, Brisbane, QLD, Australia; Queensland Brain Institute, University of Queensland, Brisbane, QLD, Australia
| | - Jian Yang
- School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China; Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang, China
| | - Jian Zeng
- Institute for Molecular Biosciences, University of Queensland, Brisbane, QLD, Australia.
| |
Collapse
|
50
|
Morova T, Ding Y, Huang CCF, Sar F, Schwarz T, Giambartolomei C, Baca S, Grishin D, Hach F, Gusev A, Freedman M, Pasaniuc B, Lack N. Optimized high-throughput screening of non-coding variants identified from genome-wide association studies. Nucleic Acids Res 2022; 51:e18. [PMID: 36546757 PMCID: PMC9943666 DOI: 10.1093/nar/gkac1198] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 11/19/2022] [Accepted: 12/06/2022] [Indexed: 12/24/2022] Open
Abstract
The vast majority of disease-associated single nucleotide polymorphisms (SNP) identified from genome-wide association studies (GWAS) are localized in non-coding regions. A significant fraction of these variants impact transcription factors binding to enhancer elements and alter gene expression. To functionally interrogate the activity of such variants we developed snpSTARRseq, a high-throughput experimental method that can interrogate the functional impact of hundreds to thousands of non-coding variants on enhancer activity. snpSTARRseq dramatically improves signal-to-noise by utilizing a novel sequencing and bioinformatic approach that increases both insert size and the number of variants tested per loci. Using this strategy, we interrogated known prostate cancer (PCa) risk-associated loci and demonstrated that 35% of them harbor SNPs that significantly altered enhancer activity. Combining these results with chromosomal looping data we could identify interacting genes and provide a mechanism of action for 20 PCa GWAS risk regions. When benchmarked to orthogonal methods, snpSTARRseq showed a strong correlation with in vivo experimental allelic-imbalance studies whereas there was no correlation with predictive in silico approaches. Overall, snpSTARRseq provides an integrated experimental and computational framework to functionally test non-coding genetic variants.
Collapse
Affiliation(s)
- Tunc Morova
- Vancouver Prostate Centre, Vancouver, BC V6H 3Z6, Canada
| | - Yi Ding
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | | | - Funda Sar
- Vancouver Prostate Centre, Vancouver, BC V6H 3Z6, Canada
| | - Tommer Schwarz
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Claudia Giambartolomei
- Central RNA Lab, Istituto Italiano di Tecnologia, Genova 16163, Italy,Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Sylvan C Baca
- Department of Medical Oncology, The Center for Functional Cancer Epigenetics, Dana Farber Cancer Institute, Boston, MA 02215, USA
| | - Dennis Grishin
- Department of Medical Oncology, The Center for Functional Cancer Epigenetics, Dana Farber Cancer Institute, Boston, MA 02215, USA
| | - Faraz Hach
- Vancouver Prostate Centre, Vancouver, BC V6H 3Z6, Canada,Department of Urologic Science, University of British Columbia, Vancouver, BC V5Z 1M9, Canada
| | - Alexander Gusev
- Department of Medical Oncology, The Center for Functional Cancer Epigenetics, Dana Farber Cancer Institute, Boston, MA 02215, USA,Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | - Matthew L Freedman
- Department of Medical Oncology, The Center for Functional Cancer Epigenetics, Dana Farber Cancer Institute, Boston, MA 02215, USA,The Center for Cancer Genome Discovery, Dana Farber Cancer Institute, Boston, MA 02215, USA
| | - Bogdan Pasaniuc
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA 90095, USA,Department of Pathology and Laboratory Medicine, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA,Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA,Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA 90095, USA
| | - Nathan A Lack
- To whom correspondence should be addressed. Tel: +1 604 875 4411;
| |
Collapse
|