1
|
Robertson CC, Elgamal RM, Henry-Kanarek BA, Arvan P, Chen S, Dhawan S, Eizirik DL, Kaddis JS, Vahedi G, Parker SCJ, Gaulton KJ, Soleimanpour SA. Untangling the genetics of beta cell dysfunction and death in type 1 diabetes. Mol Metab 2024; 86:101973. [PMID: 38914291 PMCID: PMC11283044 DOI: 10.1016/j.molmet.2024.101973] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Revised: 06/18/2024] [Accepted: 06/19/2024] [Indexed: 06/26/2024] Open
Abstract
BACKGROUND Type 1 diabetes (T1D) is a complex multi-system disease which arises from both environmental and genetic factors, resulting in the destruction of insulin-producing pancreatic beta cells. Over the past two decades, human genetic studies have provided new insight into the etiology of T1D, including an appreciation for the role of beta cells in their own demise. SCOPE OF REVIEW Here, we outline models supported by human genetic data for the role of beta cell dysfunction and death in T1D. We highlight the importance of strong evidence linking T1D genetic associations to bona fide candidate genes for mechanistic and therapeutic consideration. To guide rigorous interpretation of genetic associations, we describe molecular profiling approaches, genomic resources, and disease models that may be used to construct variant-to-gene links and to investigate candidate genes and their role in T1D. MAJOR CONCLUSIONS We profile advances in understanding the genetic causes of beta cell dysfunction and death at individual T1D risk loci. We discuss how genetic risk prediction models can be used to address disease heterogeneity. Further, we present areas where investment will be critical for the future use of genetics to address open questions in the development of new treatment and prevention strategies for T1D.
Collapse
Affiliation(s)
- Catherine C Robertson
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA; Center for Precision Health Research, National Human Genome Research Institute, NIH, Bethesda, MD 20892, USA
| | - Ruth M Elgamal
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, USA
| | - Belle A Henry-Kanarek
- Department of Internal Medicine and Division of Metabolism, Endocrinology, and Diabetes, University of Michigan, Ann Arbor, MI, USA
| | - Peter Arvan
- Department of Internal Medicine and Division of Metabolism, Endocrinology, and Diabetes, University of Michigan, Ann Arbor, MI, USA
| | - Shuibing Chen
- Department of Surgery, Weill Cornell Medicine, New York, NY, USA; Center for Genomic Health, Weill Cornell Medicine, New York, NY, USA
| | - Sangeeta Dhawan
- Department of Translational Research and Cellular Therapeutics, Arthur Riggs Diabetes and Metabolism Research Institute, City of Hope, Duarte, CA, USA
| | - Decio L Eizirik
- ULB Center for Diabetes Research, Université Libre de Bruxelles, Brussels, Belgium
| | - John S Kaddis
- Department of Diabetes and Cancer Discovery Science, Arthur Riggs Diabetes and Metabolism Research Institute, Beckman Research Institute, City of Hope, Duarte, CA, USA
| | - Golnaz Vahedi
- Department of Genetics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA, USA
| | - Stephen C J Parker
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA; Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA; Department of Biostatistics, University of Michigan, Ann Arbor, MI, USA.
| | - Kyle J Gaulton
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, USA.
| | - Scott A Soleimanpour
- Department of Internal Medicine and Division of Metabolism, Endocrinology, and Diabetes, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
2
|
Shen S, Sobczyk MK, Paternoster L, Brown SJ. From GWASs toward Mechanistic Understanding with Case Studies in Dermatogenetics. J Invest Dermatol 2024; 144:1189-1199.e8. [PMID: 38782533 DOI: 10.1016/j.jid.2024.03.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2023] [Revised: 02/13/2024] [Accepted: 03/06/2024] [Indexed: 05/25/2024]
Abstract
Many human skin diseases result from the complex interplay of genetic and environmental mechanisms that are largely unknown. GWASs have yielded insight into the genetic aspect of complex disease by highlighting regions of the genome or specific genetic variants associated with disease. Leveraging this information to identify causal genes and cell types will provide insight into fundamental biology, inform diagnostics, and aid drug discovery. However, the etiological mechanisms from genetic variant to disease are still unestablished in most cases. There now exists an unprecedented wealth of data and computational methods for variant interpretation in a functional context. It can be challenging to decide where to start owing to a lack of consensus on the best way to identify causal genetic mechanisms. This article highlights 3 key aspects of genetic variant interpretation: prioritizing causal genes, cell types, and pathways. We provide a practical overview of the main methods and datasets, giving examples from recent atopic dermatitis studies to provide a blueprint for variant interpretation. A collection of resources, including brief description and links to the packages and web tools, is provided for researchers looking to start in silico follow-up genetic analysis of associated genetic variants.
Collapse
Affiliation(s)
- Silvia Shen
- Centre for Genomic & Experimental Medicine, Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, United Kingdom; Institute for Evolution and Ecology, School of Biological Sciences, The University of Edinburgh, Edinburgh, United Kingdom.
| | - Maria K Sobczyk
- MRC Integrative Epidemiology Unit, Bristol Medical School, University of Bristol, Bristol, United Kingdom
| | - Lavinia Paternoster
- MRC Integrative Epidemiology Unit, Bristol Medical School, University of Bristol, Bristol, United Kingdom
| | - Sara J Brown
- Centre for Genomic & Experimental Medicine, Institute of Genetics and Cancer, The University of Edinburgh, Edinburgh, United Kingdom; Department of Dermatology, NHS Lothian, Edinburgh, United Kingdom
| |
Collapse
|
3
|
Kim A, Zhang Z, Legros C, Lu Z, de Smith A, Moore JE, Mancuso N, Gazal S. Inferring causal cell types of human diseases and risk variants from candidate regulatory elements. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.05.17.24307556. [PMID: 38798383 PMCID: PMC11118635 DOI: 10.1101/2024.05.17.24307556] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
The heritability of human diseases is extremely enriched in candidate regulatory elements (cRE) from disease-relevant cell types. Critical next steps are to infer which and how many cell types are truly causal for a disease (after accounting for co-regulation across cell types), and to understand how individual variants impact disease risk through single or multiple causal cell types. Here, we propose CT-FM and CT-FM-SNP, two methods that leverage cell-type-specific cREs to fine-map causal cell types for a trait and for its candidate causal variants, respectively. We applied CT-FM to 63 GWAS summary statistics (average N = 417K) using nearly one thousand cRE annotations, primarily coming from ENCODE4. CT-FM inferred 81 causal cell types with corresponding SNP-annotations explaining a high fraction of trait SNP-heritability (~2/3 of the SNP-heritability explained by existing cREs), identified 16 traits with multiple causal cell types, highlighted cell-disease relationships consistent with known biology, and uncovered previously unexplored cellular mechanisms in psychiatric and immune-related diseases. Finally, we applied CT-FM-SNP to 39 UK Biobank traits and predicted high confidence causal cell types for 2,798 candidate causal non-coding SNPs. Our results suggest that most SNPs impact a phenotype through a single cell type, and that pleiotropic SNPs target different cell types depending on the phenotype context. Altogether, CT-FM and CT-FM-SNP shed light on how genetic variants act collectively and individually at the cellular level to impact disease risk.
Collapse
Affiliation(s)
- Artem Kim
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Zixuan Zhang
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Come Legros
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Zeyun Lu
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Adam de Smith
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
| | - Jill E Moore
- Department of Genomics and Computational Biology, University of Massachusetts Chan Medical School, Worcester, MA, USA
| | - Nicholas Mancuso
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| | - Steven Gazal
- Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA
- Department of Quantitative and Computational Biology, University of Southern California, Los Angeles, CA, USA
| |
Collapse
|
4
|
Xiong Z, Thach TQ, Zhang YD, Sham PC. Improved estimation of functional enrichment in SNP heritability using feasible generalized least squares. HGG ADVANCES 2024; 5:100272. [PMID: 38327050 PMCID: PMC10901842 DOI: 10.1016/j.xhgg.2024.100272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Revised: 01/23/2024] [Accepted: 01/23/2024] [Indexed: 02/09/2024] Open
Abstract
Functional enrichment results typically implicate tissue or cell-type-specific biological pathways in disease pathogenesis and as therapeutic targets. We propose generalized linkage disequilibrium score regression (g-LDSC) that requires only genome-wide association studies (GWASs) summary-level data to estimate functional enrichment. The method adopts the same assumptions and regression model formulation as stratified linkage disequilibrium score regression (s-LDSC). Although s-LDSC only partially uses LD information, our method uses the whole LD matrix, which accounts for possible correlated error structure via a feasible generalized least-squares estimation. We demonstrate through simulation studies under various scenarios that g-LDSC provides more precise estimates of functional enrichment than s-LDSC, regardless of model misspecification. In an application to GWAS summary statistics of 15 traits from the UK Biobank, estimates of functional enrichment using g-LDSC were lower and more realistic than those obtained from s-LDSC. In addition, g-LDSC detected more significantly enriched functional annotations among 24 functional annotations for the 15 traits than s-LDSC (118 vs. 51).
Collapse
Affiliation(s)
- Zewei Xiong
- Department of Psychiatry, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Thuan-Quoc Thach
- Department of Psychiatry, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Yan Dora Zhang
- Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong SAR, China; Centre for PanorOmic Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China.
| | - Pak Chung Sham
- Department of Psychiatry, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China; Centre for PanorOmic Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China; State Key Laboratory of Brain and Cognitive Sciences, The University of Hong Kong, Hong Kong SAR, China.
| |
Collapse
|
5
|
Kasela S, Aguet F, Kim-Hellmuth S, Brown BC, Nachun DC, Tracy RP, Durda P, Liu Y, Taylor KD, Johnson WC, Van Den Berg D, Gabriel S, Gupta N, Smith JD, Blackwell TW, Rotter JI, Ardlie KG, Manichaikul A, Rich SS, Barr RG, Lappalainen T. Interaction molecular QTL mapping discovers cellular and environmental modifiers of genetic regulatory effects. Am J Hum Genet 2024; 111:133-149. [PMID: 38181730 PMCID: PMC10806864 DOI: 10.1016/j.ajhg.2023.11.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 11/29/2023] [Accepted: 11/29/2023] [Indexed: 01/07/2024] Open
Abstract
Bulk-tissue molecular quantitative trait loci (QTLs) have been the starting point for interpreting disease-associated variants, and context-specific QTLs show particular relevance for disease. Here, we present the results of mapping interaction QTLs (iQTLs) for cell type, age, and other phenotypic variables in multi-omic, longitudinal data from the blood of individuals of diverse ancestries. By modeling the interaction between genotype and estimated cell-type proportions, we demonstrate that cell-type iQTLs could be considered as proxies for cell-type-specific QTL effects, particularly for the most abundant cell type in the tissue. The interpretation of age iQTLs, however, warrants caution because the moderation effect of age on the genotype and molecular phenotype association could be mediated by changes in cell-type composition. Finally, we show that cell-type iQTLs contribute to cell-type-specific enrichment of diseases that, in combination with additional functional data, could guide future functional studies. Overall, this study highlights the use of iQTLs to gain insights into the context specificity of regulatory effects.
Collapse
Affiliation(s)
- Silva Kasela
- New York Genome Center, New York, NY, USA; Department of Systems Biology, Columbia University, New York, NY, USA.
| | | | - Sarah Kim-Hellmuth
- New York Genome Center, New York, NY, USA; Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital LMU Munich, Munich, Germany; Computational Health Center, Institute of Translational Genomics, Helmholtz Munich, Neuherberg, Germany
| | - Brielin C Brown
- New York Genome Center, New York, NY, USA; Data Science Institute, Columbia University, New York, NY, USA
| | - Daniel C Nachun
- Department of Pathology, Stanford University, Stanford, CA, USA
| | - Russell P Tracy
- Pathology and Laboratory Medicine, The University of Vermont, Larner College of Medicine, Burlington, VT, USA
| | - Peter Durda
- Pathology and Laboratory Medicine, The University of Vermont, Larner College of Medicine, Burlington, VT, USA
| | - Yongmei Liu
- Department of Medicine, Duke University, Durham, NC, USA
| | - Kent D Taylor
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - W Craig Johnson
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - David Van Den Berg
- Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA, USA
| | | | - Namrata Gupta
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Joshua D Smith
- Northwest Genomics Center, University of Washington, Seattle, WA, USA
| | - Thomas W Blackwell
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Jerome I Rotter
- The Institute for Translational Genomics and Population Sciences, Department of Pediatrics, The Lundquist Institute for Biomedical Innovation at Harbor-UCLA Medical Center, Torrance, CA, USA
| | | | - Ani Manichaikul
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - Stephen S Rich
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - R Graham Barr
- Departments of Medicine and Epidemiology, Columbia University Medical Center, New York, NY, USA
| | - Tuuli Lappalainen
- New York Genome Center, New York, NY, USA; Department of Systems Biology, Columbia University, New York, NY, USA; Science for Life Laboratory, Department of Gene Technology, KTH Royal Institute of Technology, Stockholm, Sweden.
| |
Collapse
|
6
|
Yuan C, Tang L, Lopdell T, Petrov VA, Oget-Ebrad C, Moreira GCM, Gualdrón Duarte JL, Sartelet A, Cheng Z, Salavati M, Wathes DC, Crowe MA, Coppieters W, Littlejohn M, Charlier C, Druet T, Georges M, Takeda H. An organism-wide ATAC-seq peak catalog for the bovine and its use to identify regulatory variants. Genome Res 2023; 33:1848-1864. [PMID: 37751945 PMCID: PMC10691486 DOI: 10.1101/gr.277947.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2023] [Accepted: 09/19/2023] [Indexed: 09/28/2023]
Abstract
We report the generation of an organism-wide catalog of 976,813 cis-acting regulatory elements for the bovine detected by the assay for transposase accessible chromatin using sequencing (ATAC-seq). We regroup these regulatory elements in 16 components by nonnegative matrix factorization. Correlation between the genome-wide density of peaks and transcription start sites, correlation between peak accessibility and expression of neighboring genes, and enrichment in transcription factor binding motifs support their regulatory potential. Using a previously established catalog of 12,736,643 variants, we show that the proportion of single-nucleotide polymorphisms mapping to ATAC-seq peaks is higher than expected and that this is owing to an approximately 1.3-fold higher mutation rate within peaks. Their site frequency spectrum indicates that variants in ATAC-seq peaks are subject to purifying selection. We generate eQTL data sets for liver and blood and show that variants that drive eQTL fall into liver- and blood-specific ATAC-seq peaks more often than expected by chance. We combine ATAC-seq and eQTL data to estimate that the proportion of regulatory variants mapping to ATAC-seq peaks is approximately one in three and that the proportion of variants mapping to ATAC-seq peaks that are regulatory is approximately one in 25. We discuss the implication of these findings on the utility of ATAC-seq information to improve the accuracy of genomic selection.
Collapse
Affiliation(s)
- Can Yuan
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, 4000 Liège, Belgium
| | - Lijing Tang
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, 4000 Liège, Belgium
| | - Thomas Lopdell
- Research and Development, Livestock Improvement Corporation, Hamilton 3240, New Zealand
| | - Vyacheslav A Petrov
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, 4000 Liège, Belgium
| | - Claire Oget-Ebrad
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, 4000 Liège, Belgium
| | | | - José Luis Gualdrón Duarte
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, 4000 Liège, Belgium
| | - Arnaud Sartelet
- Clinical Department of Ruminant, University of Liège, 4000 Liège, Belgium
| | - Zhangrui Cheng
- Royal Veterinary College, Hatfield, Herts AL9 7TA, United Kingdom
| | - Mazdak Salavati
- Royal Veterinary College, Hatfield, Herts AL9 7TA, United Kingdom
| | - D Claire Wathes
- Royal Veterinary College, Hatfield, Herts AL9 7TA, United Kingdom
| | - Mark A Crowe
- School of Veterinary Medicine, University College Dublin, Dublin 4, Ireland
| | - Wouter Coppieters
- GIGA Genomics platform, GIGA Institute, University of Liège, 4000 Liège, Belgium
| | - Mathew Littlejohn
- Research and Development, Livestock Improvement Corporation, Hamilton 3240, New Zealand
| | - Carole Charlier
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, 4000 Liège, Belgium
| | - Tom Druet
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, 4000 Liège, Belgium
| | - Michel Georges
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, 4000 Liège, Belgium;
| | - Haruko Takeda
- Unit of Animal Genomics, GIGA-R and Faculty of Veterinary Medicine, University of Liège, 4000 Liège, Belgium
| |
Collapse
|
7
|
Fazel-Najafabadi M, Looger LL, Reddy-Rallabandi H, Nath SK. A multilayered post-GWAS analysis pipeline defines functional variants and target genes for systemic lupus erythematosus (SLE). MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.04.07.23288295. [PMID: 37066327 PMCID: PMC10104240 DOI: 10.1101/2023.04.07.23288295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/18/2023]
Abstract
Objectives Systemic lupus erythematosus (SLE), an autoimmune disease with incompletely understood etiology, has a strong genetic component. Although genome-wide association studies (GWAS) have revealed multiple SLE susceptibility loci and associated single nucleotide polymorphisms (SNPs), the precise causal variants, target genes, cell types, tissues, and mechanisms of action remain largely unknown. Methods Here, we report a comprehensive post-GWAS analysis using extensive bioinformatics, molecular modeling, and integrative functional genomic and epigenomic analyses to optimize fine-mapping. We compile and cross-reference immune cell-specific expression quantitative trait loci ( cis - and trans -eQTLs) with promoter-capture Hi-C, allele-specific chromatin accessibility, and massively parallel reporter assay data to define predisposing variants and target genes. We experimentally validate a predicted locus using CRISPR/Cas9 genome editing, qPCR, and Western blot. Results Anchoring on 452 index SNPs, we selected 9,931 high-linkage disequilibrium (r 2 >0.8) SNPs and defined 182 independent non-HLA SLE loci. 3,746 SNPs from 143 loci were identified as regulating 564 unique genes. Target genes are enriched in lupus-related tissues and associated with other autoimmune diseases. Of these, 329 SNPs (106 loci) showed significant allele-specific chromatin accessibility and/or enhancer activity, indicating regulatory potential. Using CRISPR/Cas9, we validated rs57668933 as a functional variant regulating multiple targets, including SLE risk gene ELF1 , in B-cells. Conclusion We demonstrate and validate post-GWAS strategies for utilizing multi-dimensional data to prioritize likely causal variants with cognate gene targets underlying SLE pathogenesis. Our results provide a catalog of significantly SLE-associated SNPs and loci, target genes, and likely biochemical mechanisms, to guide experimental characterization.
Collapse
|
8
|
Meng XH, Liu Z, Chen XD, Deng AM, Mao ZH. Functional Enrichment Analysis Identifying Regulatory Information Associated with Human Fracture. Calcif Tissue Int 2023; 113:286-294. [PMID: 37477662 DOI: 10.1007/s00223-023-01108-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Accepted: 06/05/2023] [Indexed: 07/22/2023]
Abstract
Dozens of loci associated with fracture have been identified by genome-wide association studies (GWASs). However, most of these variants are located in the noncoding regions including introns, long terminal repeats, and intergenic regions. Although combining regulation information helps to identify the causal SNPs and interpret the involvement of these variants in the etiology of human fracture, regulation information which was truly associated with fracture was unknown. A novel functional enrichment method GARFIELD (GWAS Analysis of Regulatory of Functional Information Enrichment with LD correction) was applied to identify fracture-associated regulation information, including transcript factor binding sites, expression quantitative trait loci (eQTLs), chromatin states, enhancer, promoter, dyadic, super enhancer and Epigenome marks. Fracture SNPs were significantly enriched in exon (Bonferroni correction, p value < 7.14 × 10-3) at two GWAS p value thresholds through GARFIELD. High level of fold-enrichment was observed in super enhancer of monocyte and the enhancer of chondrocyte (Bonferroni correction, p value < 4.45 × 10-3). eQTLs of 44 tissues/cells and 10 transcription factors (TFs) were identified to be associated with human fracture. These results provide new insight into the etiology of human fracture, which might increase the identification of the causal SNPs through the fine-mapping study combined with functional annotation, as well as polygenic risk score.
Collapse
Affiliation(s)
- Xiang-He Meng
- Hunan Provincial Key Laboratory of Regional Hereditary Birth Defects Prevention and Control, Changsha Hospital for Maternal & Child Health Care Affiliated to Hunan Normal University, Changsha, 410007, People's Republic of China.
| | - Zhen Liu
- Laboratory of Molecular and Statistical Genetics, College of Life Sciences, Hunan Normal University, Changsha, 410081, Hunan, People's Republic of China
| | - Xiang-Ding Chen
- Laboratory of Molecular and Statistical Genetics, College of Life Sciences, Hunan Normal University, Changsha, 410081, Hunan, People's Republic of China
| | - Ai-Min Deng
- Hunan Provincial Key Laboratory of Regional Hereditary Birth Defects Prevention and Control, Changsha Hospital for Maternal & Child Health Care Affiliated to Hunan Normal University, Changsha, 410007, People's Republic of China.
| | - Zeng-Hui Mao
- Hunan Provincial Key Laboratory of Regional Hereditary Birth Defects Prevention and Control, Changsha Hospital for Maternal & Child Health Care Affiliated to Hunan Normal University, Changsha, 410007, People's Republic of China.
| |
Collapse
|
9
|
Kasela S, Aguet F, Kim-Hellmuth S, Brown BC, Nachun DC, Tracy RP, Durda P, Liu Y, Taylor KD, Craig Johnson W, Berg DVD, Gabriel S, Gupta N, Smith JD, Blackwell TW, Rotter JI, Ardlie KG, Manichaikul A, Rich SS, Graham Barr R, Lappalainen T. Interaction molecular QTL mapping discovers cellular and environmental modifiers of genetic regulatory effects. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.26.546528. [PMID: 37425716 PMCID: PMC10326995 DOI: 10.1101/2023.06.26.546528] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
Bulk tissue molecular quantitative trait loci (QTLs) have been the starting point for interpreting disease-associated variants, while context-specific QTLs show particular relevance for disease. Here, we present the results of mapping interaction QTLs (iQTLs) for cell type, age, and other phenotypic variables in multi-omic, longitudinal data from blood of individuals of diverse ancestries. By modeling the interaction between genotype and estimated cell type proportions, we demonstrate that cell type iQTLs could be considered as proxies for cell type-specific QTL effects. The interpretation of age iQTLs, however, warrants caution as the moderation effect of age on the genotype and molecular phenotype association may be mediated by changes in cell type composition. Finally, we show that cell type iQTLs contribute to cell type-specific enrichment of diseases that, in combination with additional functional data, may guide future functional studies. Overall, this study highlights iQTLs to gain insights into the context-specificity of regulatory effects.
Collapse
Affiliation(s)
- Silva Kasela
- New York Genome Center, New York, NY, USA
- Department of Systems Biology, Columbia University, New York, NY, USA
| | | | - Sarah Kim-Hellmuth
- New York Genome Center, New York, NY, USA
- Department of Pediatrics, Dr. von Hauner Children’s Hospital, University Hospital LMU Munich, Munich, Germany
- Computational Health Center, Institute of Translational Genomics, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Brielin C. Brown
- New York Genome Center, New York, NY, USA
- Data Science Institute, Columbia University, New York, NY, USA
| | | | - Russell P. Tracy
- Pathology and Laboratory Medicine, The University of Vermont, Larner College of Medicine, Burlington, VT, USA
| | - Peter Durda
- Pathology and Laboratory Medicine, The University of Vermont, Larner College of Medicine, Burlington, VT, USA
| | - Yongmei Liu
- Department of Medicine, Duke University, Durham, NC, USA
| | - Kent D. Taylor
- Department of Pediatrics, The Institute for Translational Genomics and Population Sciences, The Lundquist Institute at Harbor-UCLA Medical Center, Torrance, CA, USA
| | - W. Craig Johnson
- Department of Biostatistics, University of Washington, Seattle, WA, USA
| | - David Van Den Berg
- Keck School of Medicine of USC, University of Southern California, Los Angeles, CA, USA
| | | | - Namrata Gupta
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Joshua D. Smith
- Northwest Genomic Center, University of Washington, Seattle, WA, USA
| | - Thomas W. Blackwell
- Department of Biostatistics, University of Michigan School of Public Health, Ann Arbor, MI, USA
| | - Jerome I. Rotter
- Department of Pediatrics, The Institute for Translational Genomics and Population Sciences, The Lundquist Institute at Harbor-UCLA Medical Center, Torrance, CA, USA
| | | | - Ani Manichaikul
- Center for Public health Genomics, University of Virginia, Charlottesville, VA, USA
| | - Stephen S. Rich
- Center for Public health Genomics, University of Virginia, Charlottesville, VA, USA
| | - R. Graham Barr
- Epidemiology and Medicine, Columbia University Medical Center, New York, NY, USA
| | - Tuuli Lappalainen
- New York Genome Center, New York, NY, USA
- Department of Systems Biology, Columbia University, New York, NY, USA
- Science for Life Laboratory, Department of Gene Technology, KTH Royal Institute of Technology, Stockholm, Sweden
| |
Collapse
|
10
|
Sathyanarayanan A, Mueller TT, Ali Moni M, Schueler K, Baune BT, Lio P, Mehta D, Baune BT, Dierssen M, Ebert B, Fabbri C, Fusar-Poli P, Gennarelli M, Harmer C, Howes OD, Janzing JGE, Lio P, Maron E, Mehta D, Minelli A, Nonell L, Pisanu C, Potier MC, Rybakowski F, Serretti A, Squassina A, Stacey D, van Westrhenen R, Xicota L. Multi-omics data integration methods and their applications in psychiatric disorders. Eur Neuropsychopharmacol 2023; 69:26-46. [PMID: 36706689 DOI: 10.1016/j.euroneuro.2023.01.001] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Revised: 11/22/2022] [Accepted: 01/02/2023] [Indexed: 01/27/2023]
Abstract
To study mental illness and health, in the past researchers have often broken down their complexity into individual subsystems (e.g., genomics, transcriptomics, proteomics, clinical data) and explored the components independently. Technological advancements and decreasing costs of high throughput sequencing has led to an unprecedented increase in data generation. Furthermore, over the years it has become increasingly clear that these subsystems do not act in isolation but instead interact with each other to drive mental illness and health. Consequently, individual subsystems are now analysed jointly to promote a holistic understanding of the underlying biological complexity of health and disease. Complementing the increasing data availability, current research is geared towards developing novel methods that can efficiently combine the information rich multi-omics data to discover biologically meaningful biomarkers for diagnosis, treatment, and prognosis. However, clinical translation of the research is still challenging. In this review, we summarise conventional and state-of-the-art statistical and machine learning approaches for discovery of biomarker, diagnosis, as well as outcome and treatment response prediction through integrating multi-omics and clinical data. In addition, we describe the role of biological model systems and in silico multi-omics model designs in clinical translation of psychiatric research from bench to bedside. Finally, we discuss the current challenges and explore the application of multi-omics integration in future psychiatric research. The review provides a structured overview and latest updates in the field of multi-omics in psychiatry.
Collapse
Affiliation(s)
- Anita Sathyanarayanan
- Queensland University of Technology, Centre for Genomics and Personalised Health, School of Biomedical Sciences, Faculty of Health, Kelvin Grove, Queensland 4059, Australia
| | - Tamara T Mueller
- Institute for Artificial Intelligence and Informatics in Medicine, TU Munich, 80333 Munich, Germany
| | - Mohammad Ali Moni
- Artificial Intelligence and Digital Health Data Science, School of Health and Rehabilitation Sciences, Faculty of Health and Behavioural Sciences, The University of Queensland, St Lucia, QLD, 4072, Australia
| | - Katja Schueler
- Clinic for Psychosomatics, Hospital zum Heiligen Geist, Frankfurt am Main, Germany; Frankfurt Psychoanalytic Institute, Frankfurt am Main, Germany
| | - Bernhard T Baune
- Department of Psychiatry and Psychotherapy, University of Münster, Germany; Department of Psychiatry, Melbourne Medical School, University of Melbourne, Australia; The Florey Institute of Neuroscience and Mental Health, The University of Melbourne, Australia
| | - Pietro Lio
- Department of Computer Science and Technology, University of Cambridge, Cambridge, United Kingdom
| | - Divya Mehta
- Queensland University of Technology, Centre for Genomics and Personalised Health, School of Biomedical Sciences, Faculty of Health, Kelvin Grove, Queensland 4059, Australia.
| | | | - Bernhard T Baune
- Department of Psychiatry and Psychotherapy, University of Münster, Germany; Department of Psychiatry, Melbourne Medical School, University of Melbourne, Australia; The Florey Institute of Neuroscience and Mental Health, The University of Melbourne, Australia
| | - Mara Dierssen
- Center for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology; Universitat Pompeu Fabra (UPF), Barcelona, Spain
| | - Bjarke Ebert
- Medical Strategy & Communication, H. Lundbeck A/S, Valby, Denmark
| | - Chiara Fabbri
- Department of Biomedical and NeuroMotor Sciences, University of Bologna, Bologna, Italy; Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, United Kingdom
| | - Paolo Fusar-Poli
- Early Psychosis: Intervention and Clinical-detection (EPIC) Lab, Department of Psychosis Studies, King's College London, United Kingdom; Department of Brain and Behavioral Sciences, University of Pavia, Pavia, Italy
| | - Massimo Gennarelli
- Department of Molecular and Translational Medicine, University of Brescia; Genetics Unit, IRCCS Istituto Centro San Giovanni di Dio Fatebenefratelli, Brescia, Italy
| | | | - Oliver D Howes
- Institute of Psychiatry, Psychology and Neuroscience, King's College London, London, United Kingdom; Psychiatric Imaging, Medical Research Council Clinical Sciences Centre, Imperial College London, Hammersmith Hospital Campus, London, United Kingdom
| | | | - Pietro Lio
- Department of Computer Science and Technology, University of Cambridge, Cambridge, United Kingdom
| | - Eduard Maron
- Department of Psychiatry, University of Tartu, Tartu, Estonia; Centre for Neuropsychopharmacology, Division of Brain Sciences, Imperial College London, London, United Kingdom; Documental Ltd, Tallin, Estonia; West Tallinn Central Hospital, Tallinn, Estonia
| | - Divya Mehta
- Queensland University of Technology, Centre for Genomics and Personalised Health, School of Biomedical Sciences, Faculty of Health, Kelvin Grove, Queensland 4059, Australia
| | - Alessandra Minelli
- Department of Molecular and Translational Medicine, University of Brescia; Genetics Unit, IRCCS Istituto Centro San Giovanni di Dio Fatebenefratelli, Brescia, Italy
| | - Lara Nonell
- MARGenomics, IMIM (Hospital del Mar Research Institute), Barcelona, Spain
| | - Claudia Pisanu
- Department of Biomedical Sciences, Section of Neuroscience and Clinical Pharmacology, University of Cagliari, Cagliari, Italy
| | | | - Filip Rybakowski
- Department of Psychiatry, Poznan University of Medical Sciences, Poznan, Poland
| | - Alessandro Serretti
- Department of Biomedical and NeuroMotor Sciences, University of Bologna, Bologna, Italy
| | - Alessio Squassina
- Department of Biomedical Sciences, Section of Neuroscience and Clinical Pharmacology, University of Cagliari, Cagliari, Italy
| | - David Stacey
- British Heart Foundation Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge, United Kingdom
| | - Roos van Westrhenen
- Parnassia Psychiatric Institute, Amsterdam, the Netherlands; Department of Psychiatry and Neuropsychology, Faculty of Health and Sciences, Maastricht University, Maastricht, the Netherlands; Institute of Psychiatry, Psychology & Neuroscience (IoPPN) King's College London, United Kingdom
| | - Laura Xicota
- Paris Brain Institute ICM, Salpetriere Hospital, Paris, France
| |
Collapse
|
11
|
Gerussi A, Soskic B, Asselta R, Invernizzi P, Gershwin ME. GWAS and autoimmunity: What have we learned and what next. J Autoimmun 2022; 133:102922. [PMID: 36209690 DOI: 10.1016/j.jaut.2022.102922] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 09/22/2022] [Accepted: 09/23/2022] [Indexed: 12/07/2022]
Abstract
Autoimmune diseases are common conditions characterized by loss of tolerance, female predominance and a remarkable heterogeneity among different populations. Most often they are polygenic and several genetic loci have been linked with the risk of developing autoimmune diseases. However, causal inference is difficult. When the genomic revolution began there were high hopes of translating fast genetic analyses to the bedside but this has proven to be challenging. Nonetheless, over the last decade, fine-mapping strategies have greatly improved; one of the most significant research lines focuses on the in vivo and ex vivo definition of the effect of genetic variants within the target tissues and within specific subpopulations of immune cells that are involved in the disease pathogenesis. This strategy also includes the longitudinal tracking of a large number of immunophenotypes in many individuals to build a large reference atlas for variant characterization. In this review, we discuss the results obtained by GWAS in autoimmune diseases and review recent advances in fine mapping strategies. More importantly, we discuss gaps and future directions.
Collapse
Affiliation(s)
- Alessio Gerussi
- Division of Gastroenterology, Center for Autoimmune Liver Diseases, Department of Medicine and Surgery, University of Milano-Bicocca, Monza, Italy; European Reference Network on Hepatological Diseases (ERN RARE-LIVER), San Gerardo Hospital, Monza, Italy.
| | - Blagoje Soskic
- Human Technopole, Viale Rita Levi-Montalcini 1, 20157 Milan, Italy
| | - Rosanna Asselta
- IRCCS Humanitas Research Hospital, Rozzano, Milan, Italy; Department of Biomedical Sciences, Humanitas University, Pieve Emanuele, Milan, Italy
| | - Pietro Invernizzi
- Division of Gastroenterology, Center for Autoimmune Liver Diseases, Department of Medicine and Surgery, University of Milano-Bicocca, Monza, Italy; European Reference Network on Hepatological Diseases (ERN RARE-LIVER), San Gerardo Hospital, Monza, Italy
| | - Merrill E Gershwin
- Division of Rheumatology, Allergy and Clinical Immunology, University of California, Davis, Davis, CA, United States
| |
Collapse
|
12
|
Chen J, Wang L, De Jager PL, Bennett DA, Buchman AS, Yang J. A scalable Bayesian functional GWAS method accounting for multivariate quantitative functional annotations with applications for studying Alzheimer disease. HGG ADVANCES 2022; 3:100143. [PMID: 36204489 PMCID: PMC9530673 DOI: 10.1016/j.xhgg.2022.100143] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 09/14/2022] [Indexed: 11/30/2022] Open
Abstract
Existing methods for integrating functional annotations in genome-wide association studies (GWASs) to fine-map and prioritize potential causal variants are limited to using non-overlapped categorical annotations or limited by the computation burden of modeling genome-wide variants. To overcome these limitations, we propose a scalable Bayesian functional GWAS method to account for multivariate quantitative functional annotations (BFGWAS_QUANT), accompanied by a scalable computation algorithm enabling joint modeling of genome-wide variants. Simulation studies validated the performance of BFGWAS_QUANT for accurately quantifying annotation enrichment and improving GWAS power. Applying BFGWAS_QUANT to study five Alzheimer disease (AD)-related phenotypes using individual-level GWAS data (n = ∼1,000), we found that histone modification annotations have higher enrichment than expression quantitative trait locus (eQTL) annotations for all considered phenotypes, with the highest enrichment in H3K27me3 (polycomb regression). We also found that cis-eQTLs in microglia had higher enrichment than eQTLs of bulk brain frontal cortex tissue for all considered phenotypes. A similar enrichment pattern was also identified using the International Genomics of Alzheimer's Project (IGAP) summary-level GWAS data of AD (n = ∼54,000). The strongest known APOE E4 risk allele was identified for all five phenotypes, and the APOE locus was validated using the IGAP data. BFGWAS_QUANT fine-mapped 32 significant variants from 1,073 genome-wide significant variants in the IGAP data. We also demonstrated that the polygenic risk scores (PRSs) using effect size estimates by BFGWAS_QUANT had a similar prediction accuracy as other methods assuming a sparse causal model. Overall, BFGWAS_QUANT is a useful GWAS tool for quantifying annotation enrichment and prioritizing potential causal variants.
Collapse
Affiliation(s)
- Junyu Chen
- Department of Epidemiology, Emory University School of Public Health, Atlanta, GA 30322, USA
- Center for Computational and Quantitative Genetics, Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA
| | - Lei Wang
- Center for Computational and Quantitative Genetics, Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA
- Department of Biostatistics and Informatics, Colorado School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Philip L. De Jager
- Center for Translational and Computational Neuroimmunology, Department of Neurology and Taub Institute for Research on Alzheimer’s Disease and the Aging Brain, Columbia University Irving Medical Center, New York, NY 10032, USA
| | - David A. Bennett
- Rush Alzheimer’s Disease Center, Rush University Medical Center, Chicago, IL 60612, USA
| | - Aron S. Buchman
- Rush Alzheimer’s Disease Center, Rush University Medical Center, Chicago, IL 60612, USA
| | - Jingjing Yang
- Center for Computational and Quantitative Genetics, Department of Human Genetics, Emory University School of Medicine, Atlanta, GA 30322, USA
| |
Collapse
|
13
|
Das AC, Foroutan A, Qian B, Hosseini Naghavi N, Shabani K, Shooshtari P. Single-Cell Chromatin Accessibility Data Combined with GWAS Improves Detection of Relevant Cell Types in 59 Complex Phenotypes. Int J Mol Sci 2022; 23:11456. [PMID: 36232752 PMCID: PMC9570273 DOI: 10.3390/ijms231911456] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2022] [Revised: 09/14/2022] [Accepted: 09/20/2022] [Indexed: 11/16/2022] Open
Abstract
Several disease risk variants reside on non-coding regions of DNA, particularly in open chromatin regions of specific cell types. Identifying the cell types relevant to complex traits through the integration of chromatin accessibility data and genome-wide association studies (GWAS) data can help to elucidate the mechanisms of these traits. In this study, we created a collection of associations between the combinations of chromatin accessibility data (bulk and single-cell) with an array of 201 complex phenotypes. We integrated the GWAS data of these 201 phenotypes with bulk chromatin accessibility data from 137 cell types measured by DNase-I hypersensitive sequencing and found significant results (FDR adjusted p-value ≤ 0.05) for at least one cell type in 21 complex phenotypes, such as atopic dermatitis, Graves' disease, and body mass index. With the integration of single-cell chromatin accessibility data measured by an assay for transposase-accessible chromatin with high-throughput sequencing (scATAC-seq), taken from 111 adult and 111 fetal cell types, the resolution of association was magnified, enabling the identification of further cell types. This resulted in the identification of significant correlations (FDR adjusted p-value ≤ 0.05) between 15 categories of single-cell subtypes and 59 phenotypes ranging from autoimmune diseases like Graves' disease to cardiovascular traits like diastolic/systolic blood pressure.
Collapse
Affiliation(s)
- Akash Chandra Das
- Department of Pathology and Laboratory Medicine, Western University, London, ON N6A 3K7, Canada
- Children’s Health Research Institute, Lawson Research Institute, London, ON N6C 2R5, Canada
- Department of Biosciences and Bioengineering, Indian Institute of Technology Guwahati, Guwahati 781039, India
| | - Aidin Foroutan
- Department of Pathology and Laboratory Medicine, Western University, London, ON N6A 3K7, Canada
- Children’s Health Research Institute, Lawson Research Institute, London, ON N6C 2R5, Canada
| | - Brian Qian
- Department of Pathology and Laboratory Medicine, Western University, London, ON N6A 3K7, Canada
- Children’s Health Research Institute, Lawson Research Institute, London, ON N6C 2R5, Canada
| | - Nader Hosseini Naghavi
- Children’s Health Research Institute, Lawson Research Institute, London, ON N6C 2R5, Canada
- Department of Computer Science, Western University, London, ON N6A 5B7, Canada
| | - Kayvan Shabani
- Children’s Health Research Institute, Lawson Research Institute, London, ON N6C 2R5, Canada
- Department of Computer Science, Western University, London, ON N6A 5B7, Canada
| | - Parisa Shooshtari
- Department of Pathology and Laboratory Medicine, Western University, London, ON N6A 3K7, Canada
- Children’s Health Research Institute, Lawson Research Institute, London, ON N6C 2R5, Canada
- Department of Computer Science, Western University, London, ON N6A 5B7, Canada
- Ontario Institute for Cancer Research, Toronto, ON M5G 0A3, Canada
| |
Collapse
|
14
|
Abstract
Insights into the genetic basis of human disease are helping to address some of the key challenges in new drug development including the very high rates of failure. Here we review the recent history of an emerging, genomics-assisted approach to pharmaceutical research and development, and its relationship to Mendelian randomization (MR), a well-established analytical approach to causal inference. We demonstrate how human genomic data linked to pharmaceutically relevant phenotypes can be used for (1) drug target identification (mapping relevant drug targets to diseases), (2) drug target validation (inferring the likely effects of drug target perturbation), (3) evaluation of the effectiveness and specificity of compound-target engagement (inferring the extent to which the effects of a compound are exclusive to the target and distinguishing between on-target and off-target compound effects), and (4) the selection of end points in clinical trials (the diseases or conditions to be evaluated as trial outcomes). We show how genomics can help identify indication expansion opportunities for licensed drugs and repurposing of compounds developed to clinical phase that proved safe but ineffective for the original intended indication. We outline statistical and biological considerations in using MR for drug target validation (drug target MR) and discuss the obstacles and challenges for scaled applications of these genomics-based approaches.
Collapse
Affiliation(s)
- Amand F Schmidt
- Institute of Cardiovascular Science, Faculty of Population Health, University College London, London WC1E 6BT, United Kingdom
- UCL British Heart Foundation Research Accelerator, London WC1E 6BT, United Kingdom
- Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, 3584 CX Utrecht, the Netherlands
| | - Aroon D Hingorani
- Institute of Cardiovascular Science, Faculty of Population Health, University College London, London WC1E 6BT, United Kingdom
- UCL British Heart Foundation Research Accelerator, London WC1E 6BT, United Kingdom
- Health Data Research UK, London NW1 2BE, United Kingdom
| | - Chris Finan
- Institute of Cardiovascular Science, Faculty of Population Health, University College London, London WC1E 6BT, United Kingdom
- UCL British Heart Foundation Research Accelerator, London WC1E 6BT, United Kingdom
- Department of Cardiology, Division Heart and Lungs, University Medical Center Utrecht, 3584 CX Utrecht, the Netherlands
- Health Data Research UK, London NW1 2BE, United Kingdom
| |
Collapse
|
15
|
Breeze CE, Haugen E, Reynolds A, Teschendorff A, van Dongen J, Lan Q, Rothman N, Bourque G, Dunham I, Beck S, Stamatoyannopoulos J, Franceschini N, Berndt SI. Integrative analysis of 3604 GWAS reveals multiple novel cell type-specific regulatory associations. Genome Biol 2022; 23:13. [PMID: 34996498 PMCID: PMC8742386 DOI: 10.1186/s13059-021-02560-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Accepted: 11/26/2021] [Indexed: 01/02/2023] Open
Abstract
Background Genome-wide association study (GWAS) single nucleotide polymorphisms (SNPs) are known to preferentially co-locate to active regulatory elements in tissues and cell types relevant to disease aetiology. Further characterisation of associated cell type-specific regulation can broaden our understanding of how GWAS signals may contribute to disease risk. Results To gain insight into potential functional mechanisms underlying GWAS associations, we developed FORGE2 (https://forge2.altiusinstitute.org/), which is an updated version of the FORGE web tool. FORGE2 uses an expanded atlas of cell type-specific regulatory element annotations, including DNase I hotspots, five histone mark categories and 15 hidden Markov model (HMM) chromatin states, to identify tissue- and cell type-specific signals. An analysis of 3,604 GWAS from the NHGRI-EBI GWAS catalogue yielded at least one significant disease/trait-tissue association for 2,057 GWAS, including > 400 associations specific to epigenomic marks in immune tissues and cell types, > 30 associations specific to heart tissue, and > 60 associations specific to brain tissue, highlighting the key potential of tissue- and cell type-specific regulatory elements. Importantly, we demonstrate that FORGE2 analysis can separate previously observed accessible chromatin enrichments into different chromatin states, such as enhancers or active transcription start sites, providing a greater understanding of underlying regulatory mechanisms. Interestingly, tissue-specific enrichments for repressive chromatin states and histone marks were also detected, suggesting a role for tissue-specific repressed regions in GWAS-mediated disease aetiology. Conclusion In summary, we demonstrate that FORGE2 has the potential to uncover previously unreported disease-tissue associations and identify new candidate mechanisms. FORGE2 is a transparent, user-friendly web tool for the integrative analysis of loci discovered from GWAS. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-021-02560-3.
Collapse
Affiliation(s)
- Charles E Breeze
- National Cancer Institute, NIH, Bethesda, MD, 20892, USA. .,Altius Institute for Biomedical Sciences, Seattle, WA, 98121, USA. .,UCL Cancer Institute, University College London, WC1E 6BT, London, UK.
| | - Eric Haugen
- Altius Institute for Biomedical Sciences, Seattle, WA, 98121, USA
| | - Alex Reynolds
- Altius Institute for Biomedical Sciences, Seattle, WA, 98121, USA
| | - Andrew Teschendorff
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Shanghai Institute for Biological Sciences, University of Chinese Academy of Sciences, Chinese Academy of Sciences, 320 Yue Yang Road, Shanghai, 200031, China
| | - Jenny van Dongen
- Department of Biological Psychology, Vrije Universiteit Amsterdam, Amsterdam, 1081BT, The Netherlands
| | - Qing Lan
- National Cancer Institute, NIH, Bethesda, MD, 20892, USA
| | | | - Guillaume Bourque
- Department of Human Genetics, McGill University and Génome Québec Innovation Center, Montréal, H3A 0G1, Canada
| | - Ian Dunham
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Stephan Beck
- UCL Cancer Institute, University College London, WC1E 6BT, London, UK
| | | | - Nora Franceschini
- Department of Epidemiology, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, USA
| | - Sonja I Berndt
- National Cancer Institute, NIH, Bethesda, MD, 20892, USA
| |
Collapse
|
16
|
Schilder BM, Humphrey J, Raj T. echolocatoR: an automated end-to-end statistical and functional genomic fine-mapping pipeline. Bioinformatics 2022; 38:536-539. [PMID: 34529038 PMCID: PMC10060715 DOI: 10.1093/bioinformatics/btab658] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2020] [Revised: 09/06/2021] [Accepted: 09/13/2021] [Indexed: 02/03/2023] Open
Abstract
SUMMARY echolocatoR integrates a diverse suite of statistical and functional fine-mapping tools to identify, test enrichment in, and visualize high-confidence causal consensus variants in any phenotype. It requires minimal input from users (a summary statistics file), can be run in a single R function, and provides extensive access to relevant datasets (e.g. reference linkage disequilibrium panels, quantitative trait loci, genome-wide annotations, cell-type-specific epigenomics), thereby enabling rapid, robust and scalable end-to-end fine-mapping investigations. AVAILABILITY AND IMPLEMENTATION echolocatoR is an open-source R package available through GitHub under the GNU General Public License (Version 3) license: https://github.com/RajLabMSSM/echolocatoR. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Brian M Schilder
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York 10029, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York 10029, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York 10029, NY, USA
- Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York 10029, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York 10029, NY, USA
| | - Jack Humphrey
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York 10029, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York 10029, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York 10029, NY, USA
- Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York 10029, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York 10029, NY, USA
| | - Towfique Raj
- Nash Family Department of Neuroscience & Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York 10029, NY, USA
- Ronald M. Loeb Center for Alzheimer’s Disease, Icahn School of Medicine at Mount Sinai, New York 10029, NY, USA
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York 10029, NY, USA
- Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York 10029, NY, USA
- Estelle and Daniel Maggin Department of Neurology, Icahn School of Medicine at Mount Sinai, New York 10029, NY, USA
| |
Collapse
|
17
|
Dong A, Bao X. GPSmatch: an R package for comparing Genomic-binding Profile Similarity among transcriptional regulators using customizable databases. Bioinformatics 2021; 38:853-855. [PMID: 34672337 PMCID: PMC8756198 DOI: 10.1093/bioinformatics/btab728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2021] [Revised: 09/14/2021] [Accepted: 10/18/2021] [Indexed: 02/03/2023] Open
Abstract
SUMMARY Eukaryotic gene expression requires coordination among hundreds of transcriptional regulators. To characterize a specific transcriptional regulator, identifying how it shares genomic-binding sites with other regulators can generate important insights into its action. As genomic data such as chromatin immunoprecipitation assays with sequencing (ChIP-Seq) are being continously generated from individual labs, there is a demand for timely integration and analysis of these new data. We have developed an R package, GPSmatch (Genomic-binding Profile Similarity match), for calculating the Jaccard index to compare the ChIP-Seq peaks from one experiment to other experiments stored in a user-supplied customizable database. GPSmatch also evaluates the statistical significance of the calculated Jaccard index using a nonparametric Monte Carlo procedure. We show that GPSmatch is suitable for identifying and ranking transcriptional regulators with shared genomic-binding profiles, which may unravel potential mechanistic actions of gene regulation. AVAILABILITY AND IMPLEMENTATION The software is freely available at https://github.com/Bao-Lab/GPSmatch. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Amy Dong
- Hinsdale Central High School, Hinsdale, IL 60521, USA
| | | |
Collapse
|
18
|
Genetic variations of DNA bindings of FOXA1 and co-factors in breast cancer susceptibility. Nat Commun 2021; 12:5318. [PMID: 34518541 PMCID: PMC8438084 DOI: 10.1038/s41467-021-25670-9] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Accepted: 08/25/2021] [Indexed: 11/21/2022] Open
Abstract
Identifying transcription factors (TFs) whose DNA bindings are altered by genetic variants that regulate susceptibility genes is imperative to understand transcriptional dysregulation in disease etiology. Here, we develop a statistical framework to analyze extensive ChIP-seq and GWAS data and identify 22 breast cancer risk-associated TFs. We find that, by analyzing genetic variations of TF-DNA bindings, the interaction of FOXA1 with co-factors such as ESR1 and E2F1, and the interaction of TFs with chromatin features (i.e., enhancers) play a key role in breast cancer susceptibility. Using genetic variants occupied by the 22 TFs, transcriptome-wide association analyses identify 52 previously unreported breast cancer susceptibility genes, including seven with evidence of essentiality from functional screens in breast relevant cell lines. We show that FOXA1 and co-factors form a core TF-transcriptional network regulating the susceptibility genes. Our findings provide additional insights into genetic variations of TF-DNA bindings (particularly for FOXA1) underlying breast cancer susceptibility. The identification of transcription factors (TFs) whose binding sites are affected by risk genetic variants remains crucial. Here, the authors develop a statistical framework to analyse ChIP-seq and GWAS data, identify 22 breast cancer risk-associated TFs and a core TF-transcriptional network for FOXA1 and co-factors.
Collapse
|
19
|
Robertson CC, Inshaw JRJ, Onengut-Gumuscu S, Chen WM, Santa Cruz DF, Yang H, Cutler AJ, Crouch DJM, Farber E, Bridges SL, Edberg JC, Kimberly RP, Buckner JH, Deloukas P, Divers J, Dabelea D, Lawrence JM, Marcovina S, Shah AS, Greenbaum CJ, Atkinson MA, Gregersen PK, Oksenberg JR, Pociot F, Rewers MJ, Steck AK, Dunger DB, Wicker LS, Concannon P, Todd JA, Rich SS. Fine-mapping, trans-ancestral and genomic analyses identify causal variants, cells, genes and drug targets for type 1 diabetes. Nat Genet 2021; 53:962-971. [PMID: 34127860 PMCID: PMC8273124 DOI: 10.1038/s41588-021-00880-5] [Citation(s) in RCA: 114] [Impact Index Per Article: 38.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2020] [Accepted: 05/05/2021] [Indexed: 12/13/2022]
Abstract
We report the largest and most diverse genetic study of type 1 diabetes (T1D) to date (61,427 participants), yielding 78 genome-wide-significant (P < 5 × 10-8) regions, including 36 that are new. We define credible sets of T1D-associated variants and show that they are enriched in immune-cell accessible chromatin, particularly CD4+ effector T cells. Using chromatin-accessibility profiling of CD4+ T cells from 115 individuals, we map chromatin-accessibility quantitative trait loci and identify five regions where T1D risk variants co-localize with chromatin-accessibility quantitative trait loci. We highlight rs72928038 in BACH2 as a candidate causal T1D variant leading to decreased enhancer accessibility and BACH2 expression in T cells. Finally, we prioritize potential drug targets by integrating genetic evidence, functional genomic maps and immune protein-protein interactions, identifying 12 genes implicated in T1D that have been targeted in clinical trials for autoimmune diseases. These findings provide an expanded genomic landscape for T1D.
Collapse
Affiliation(s)
- Catherine C Robertson
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- Department of Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA, USA
| | - Jamie R J Inshaw
- JDRF/Wellcome Diabetes and Inflammation Laboratory, Wellcome Centre for Human Genetics, Nuffield Department of Medicine, NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK
| | - Suna Onengut-Gumuscu
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
| | - Wei-Min Chen
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
| | - David Flores Santa Cruz
- JDRF/Wellcome Diabetes and Inflammation Laboratory, Wellcome Centre for Human Genetics, Nuffield Department of Medicine, NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK
| | - Hanzhi Yang
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - Antony J Cutler
- JDRF/Wellcome Diabetes and Inflammation Laboratory, Wellcome Centre for Human Genetics, Nuffield Department of Medicine, NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK
| | - Daniel J M Crouch
- JDRF/Wellcome Diabetes and Inflammation Laboratory, Wellcome Centre for Human Genetics, Nuffield Department of Medicine, NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK
| | - Emily Farber
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
| | - S Louis Bridges
- Division of Rheumatology, Department of Medicine, Hospital for Special Surgery, New York, NY, USA
- Division of Rheumatology, Department of Medicine, Weill Cornell Medical College, New York, NY, USA
| | - Jeffrey C Edberg
- Division of Clinical Immunology and Rheumatology, Department of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Robert P Kimberly
- Division of Clinical Immunology and Rheumatology, Department of Medicine, University of Alabama at Birmingham, Birmingham, AL, USA
| | - Jane H Buckner
- Center for Translational Immunology, Benaroya Research Institute, Seattle, WA, USA
| | - Panos Deloukas
- Clinical Pharmacology, William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, UK
- Princess Al-Jawhara Al-Brahim Centre of Excellence in Research of Hereditary Disorders (PACER-HD), King Abdulaziz University, Jeddah, Saudi Arabia
| | - Jasmin Divers
- Division of Health Services Research, Department of Foundations of Medicine, New York University Long Island School of Medicine, Mineola, NY, USA
| | - Dana Dabelea
- Colorado School of Public Health and Lifecourse Epidemiology of Adiposity and Diabetes (LEAD) Center, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Jean M Lawrence
- Department of Research and Evaluation, Kaiser Permanente Southern California, Pasadena, CA, USA
| | - Santica Marcovina
- Northwest Lipid Metabolism and Diabetes Research Laboratories, University of Washington, Seattle, WA, USA
- Medpace Reference Laboratories, Cincinnati, OH, USA
| | - Amy S Shah
- Cincinnati Children's Hospital Medical Center and the University of Cincinnati, Cincinnati, OH, USA
| | - Carla J Greenbaum
- Center for Interventional Immunology, Benaroya Research Institute, Seattle, WA, USA
- Diabetes Program, Benaroya Research Institute, Seattle, WA, USA
| | - Mark A Atkinson
- Department of Pathology, Immunology, and Laboratory Medicine, University of Florida, Gainesville, FL, USA
| | - Peter K Gregersen
- Robert S. Boas Center for Genomics and Human Genetics, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA
| | - Jorge R Oksenberg
- Department of Neurology and Weill Institute for Neurosciences, University of California at San Francisco, San Francisco, CA, USA
| | - Flemming Pociot
- Department of Pediatrics, Herlev University Hospital, Copenhagen, Denmark
- Institute of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
- Type 1 Diabetes Biology, Department of Clinical Research, Steno Diabetes Center Copenhagen, Gentofte, Denmark
| | - Marian J Rewers
- Barbara Davis Center for Diabetes, School of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Andrea K Steck
- Barbara Davis Center for Diabetes, School of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - David B Dunger
- Department of Paediatrics, University of Cambridge, Cambridge, UK
- Wellcome Trust Medical Research Council Institute of Metabolic Science, University of Cambridge, Cambridge, UK
| | - Linda S Wicker
- JDRF/Wellcome Diabetes and Inflammation Laboratory, Wellcome Centre for Human Genetics, Nuffield Department of Medicine, NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK
| | - Patrick Concannon
- Department of Pathology, Immunology, and Laboratory Medicine, University of Florida, Gainesville, FL, USA
- Genetics Institute, University of Florida, Gainesville, FL, USA
| | - John A Todd
- JDRF/Wellcome Diabetes and Inflammation Laboratory, Wellcome Centre for Human Genetics, Nuffield Department of Medicine, NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK.
| | - Stephen S Rich
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
| |
Collapse
|
20
|
Meng XH, Xiao HM, Deng HW. Combining artificial intelligence: deep learning with Hi-C data to predict the functional effects of non-coding variants. Bioinformatics 2021; 37:1339-1344. [PMID: 33196774 DOI: 10.1093/bioinformatics/btaa970] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Revised: 09/12/2020] [Accepted: 11/05/2020] [Indexed: 12/20/2022] Open
Abstract
MOTIVATION Although genome-wide association studies (GWASs) have identified thousands of variants for various traits, the causal variants and the mechanisms underlying the significant loci are largely unknown. In this study, we aim to predict non-coding variants that may functionally affect translation initiation through long-range chromatin interaction. RESULTS By incorporating the Hi-C data, we propose a novel and powerful deep learning model of artificial intelligence to classify interacting and non-interacting fragment pairs and predict the functional effects of sequence alteration of single nucleotide on chromatin interaction and thus on gene expression. The changes in chromatin interaction probability between the reference sequence and the altered sequence reflect the degree of functional impact for the variant. The model was effective and efficient with the classification of interacting and non-interacting fragment pairs. The predicted causal SNPs that had a larger impact on chromatin interaction were more likely to be identified by GWAS and eQTL analyses. We demonstrate that an integrative approach combining artificial intelligence-deep learning with high throughput experimental evidence of chromatin interaction leads to prioritizing the functional variants in disease- and phenotype-related loci and thus will greatly expedite uncover of the biological mechanism underlying the association identified in genomic studies. AVAILABILITY AND IMPLEMENTATION Source code used in data preparing and model training is available at the GitHub website (https://github.com/biocai/DeepHiC). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Xiang-He Meng
- Centers of System Biology, Data Information and Reproductive Health, School of Basic Medical Science, Central South University, Changsha, Hunan 410008, China.,Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA 70112, USA.,Centers of System Biology, Data Information and Reproductive Health, Laboratory of Molecular and Statistical Genetics, College of Life Sciences, Hunan Normal University, Changsha, Hunan 410081, China
| | - Hong-Mei Xiao
- Centers of System Biology, Data Information and Reproductive Health, School of Basic Medical Science, Central South University, Changsha, Hunan 410008, China
| | - Hong-Wen Deng
- Centers of System Biology, Data Information and Reproductive Health, School of Basic Medical Science, Central South University, Changsha, Hunan 410008, China.,Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA 70112, USA.,Centers of System Biology, Data Information and Reproductive Health, Laboratory of Molecular and Statistical Genetics, College of Life Sciences, Hunan Normal University, Changsha, Hunan 410081, China
| |
Collapse
|
21
|
Lu J, Wang X, Sun K, Lan X. Chrom-Lasso: a lasso regression-based model to detect functional interactions using Hi-C data. Brief Bioinform 2021; 22:6278150. [PMID: 34013331 PMCID: PMC8574949 DOI: 10.1093/bib/bbab181] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Revised: 05/13/2021] [Indexed: 01/02/2023] Open
Abstract
Hi-C is a genome-wide assay based on Chromosome Conformation Capture and high-throughput sequencing to decipher 3D chromatin organization in the nucleus. However, computational methods to detect functional interactions utilizing Hi-C data face challenges including the correction for various sources of biases and the identification of functional interactions with low counts of interacting fragments. We present Chrom-Lasso, a lasso linear regression model that removes complex biases assumption-free and identifies functional interacting loci with increased power by combining information of local reads distribution surrounding the area of interest. We showed that interacting regions identified by Chrom-Lasso are more enriched for 5C validated interactions and functional GWAS hits than that of GOTHiC and Fit-Hi-C. To further demonstrate the ability of Chrom-Lasso to detect interactions of functional importance, we performed time-series Hi-C and RNA-seq during T cell activation and exhaustion. We showed that the dynamic changes in gene expression and chromatin interactions identified by Chrom-Lasso were largely concordant with each other. Finally, we experimentally confirmed Chrom-Lasso’s finding that Erbb3 was co-regulated with distinct neighboring genes at different states during T cell activation. Our results highlight Chrom-Lasso’s utility in detecting weak functional interaction between cis-regulatory elements, such as promoters and enhancers.
Collapse
Affiliation(s)
- Jingzhe Lu
- School of Medicine, Tsinghua University, Beijing, China
| | - Xu Wang
- School of Medicine and the Tsinghua-Peking Center for Life science, Tsinghua University, Beijing, China
| | - Keyong Sun
- School of Medicine and the Tsinghua-Peking Center for Life science, Tsinghua University, Beijing, China
| | - Xun Lan
- School of Medicine and the Tsinghua-Peking Center for Life science, Tsinghua University, Beijing, China
| |
Collapse
|
22
|
Gasperi C, Chun S, Sunyaev SR, Cotsapas C. Shared associations identify causal relationships between gene expression and immune cell phenotypes. Commun Biol 2021; 4:279. [PMID: 33664438 PMCID: PMC7933159 DOI: 10.1038/s42003-021-01823-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2020] [Accepted: 02/04/2021] [Indexed: 12/22/2022] Open
Abstract
Genetic mapping studies have identified thousands of associations between common variants and hundreds of human traits. Translating these associations into mechanisms is complicated by two factors: they fall into gene regulatory regions; and they are rarely mapped to one causal variant. One way around these limitations is to find groups of traits that share associations, using this genetic link to infer a biological connection. Here, we assess how many trait associations in the same locus are due to the same genetic variant, and thus shared; and if these shared associations are due to causal relationships between traits. We find that only a subset of traits share associations, with many due to causal relationships rather than pleiotropy. We therefore suggest that simply observing overlapping associations at a genetic locus is insufficient to infer causality; direct evidence of shared associations is required to support mechanistic hypotheses in genetic studies of complex traits.
Collapse
Affiliation(s)
- Christiane Gasperi
- Department of Neurology, Yale School of Medicine, New Haven, CT, USA
- Department of Neurology, Klinikum rechts der Isar, TUM School of Medicine, Technical University of Munich, Ismaninger Str. 22, Munich, Germany
| | - Sung Chun
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
- Division of Pulmonary Medicine, Boston Children's Hospital, Boston, MA, USA
| | - Shamil R Sunyaev
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Chris Cotsapas
- Department of Neurology, Yale School of Medicine, New Haven, CT, USA.
- Department of Genetics, Yale School of Medicine, New Haven, CT, USA.
| |
Collapse
|
23
|
Bon-Baret V, Chignon A, Boulanger MC, Li Z, Argaud D, Arsenault BJ, Thériault S, Bossé Y, Mathieu P. System Genetics Including Causal Inference Identify Immune Targets for Coronary Artery Disease and the Lifespan. CIRCULATION. GENOMIC AND PRECISION MEDICINE 2021; 14:e003196. [PMID: 33625251 PMCID: PMC8284374 DOI: 10.1161/circgen.120.003196] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
BACKGROUND Randomized clinical trials indicate that the immune response plays a significant role in coronary artery disease (CAD), a disorder impacting the lifespan potential. However, the identification of targets critical to the immune response in atheroma is still hampered by a lack of solid inference. METHODS Herein, we implemented a system genetics approach to identify causally associated immune targets implicated in atheroma. We leveraged genome-wide association studies to perform mapping and Mendelian randomization to assess causal associations between gene expression in blood cells with CAD and the lifespan. Expressed genes (eGenes) were prioritized in network and in single-cell expression derived from plaque immune cells. RESULTS Among 840 CAD-associated blood eGenes, 37 were predicted causally associated with CAD and 6 were also associated with the parental lifespan in Mendelian randomization. In multivariable Mendelian randomization, the impact of eGenes on the lifespan potential was mediated by the CAD risk. Predicted causal eGenes were central in network. FLT1 and CCR5 were identified as targets of approved drugs, whereas 22 eGenes were deemed tractable for the development of small molecules and antibodies. Analyses of plaque immune single-cell expression identified predicted causal eGenes enriched in macrophages (GPX1, C4orf3) and involved in ligand-receptor interactions (CCR5). CONCLUSIONS We identified 37 blood eGenes predicted causally associated with CAD. The predicted expression for 6 eGenes impacted the lifespan potential through the risk of CAD. Prioritization based on network, annotations, and single-cell expression identified targets deemed tractable for the development of drugs and for drug repurposing.
Collapse
Affiliation(s)
- Valentin Bon-Baret
- Laboratory of Cardiovascular Pathobiology, Quebec Heart and Lung Institute/Research Center, Department of Surgery (V.B.-B., A.C., M.-C.B., Z.L., D.A., P.M.), Laval University, Quebec, Canada
| | - Arnaud Chignon
- Laboratory of Cardiovascular Pathobiology, Quebec Heart and Lung Institute/Research Center, Department of Surgery (V.B.-B., A.C., M.-C.B., Z.L., D.A., P.M.), Laval University, Quebec, Canada
| | - Marie-Chloé Boulanger
- Laboratory of Cardiovascular Pathobiology, Quebec Heart and Lung Institute/Research Center, Department of Surgery (V.B.-B., A.C., M.-C.B., Z.L., D.A., P.M.), Laval University, Quebec, Canada
| | - Zhonglin Li
- Laboratory of Cardiovascular Pathobiology, Quebec Heart and Lung Institute/Research Center, Department of Surgery (V.B.-B., A.C., M.-C.B., Z.L., D.A., P.M.), Laval University, Quebec, Canada
| | - Deborah Argaud
- Laboratory of Cardiovascular Pathobiology, Quebec Heart and Lung Institute/Research Center, Department of Surgery (V.B.-B., A.C., M.-C.B., Z.L., D.A., P.M.), Laval University, Quebec, Canada
| | | | - Sébastien Thériault
- Department of Molecular Biology, Medical Biochemistry and Pathology (S.T.), Laval University, Quebec, Canada
| | - Yohan Bossé
- Department of Molecular Medicine (Y.B.), Laval University, Quebec, Canada
| | - Patrick Mathieu
- Laboratory of Cardiovascular Pathobiology, Quebec Heart and Lung Institute/Research Center, Department of Surgery (V.B.-B., A.C., M.-C.B., Z.L., D.A., P.M.), Laval University, Quebec, Canada
| |
Collapse
|
24
|
Integrative analysis identifies bHLH transcription factors as contributors to Parkinson's disease risk mechanisms. Sci Rep 2021; 11:3502. [PMID: 33568722 PMCID: PMC7875985 DOI: 10.1038/s41598-021-83087-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2019] [Accepted: 01/26/2021] [Indexed: 11/08/2022] Open
Abstract
Genome-wide association studies (GWAS) have identified multiple genetic risk signals for Parkinson’s disease (PD), however translation into underlying biological mechanisms remains scarce. Genomic functional annotations of neurons provide new resources that may be integrated into analyses of GWAS findings. Altered transcription factor binding plays an important role in human diseases. Insight into transcriptional networks involved in PD risk mechanisms may thus improve our understanding of pathogenesis. We analysed overlap between genome-wide association signals in PD and open chromatin in neurons across multiple brain regions, finding a significant enrichment in the superior temporal cortex. The involvement of transcriptional networks was explored in neurons of the superior temporal cortex based on the location of candidate transcription factor motifs identified by two de novo motif discovery methods. Analyses were performed in parallel, both finding that PD risk variants significantly overlap with open chromatin regions harboring motifs of basic Helix-Loop-Helix (bHLH) transcription factors. Our findings show that cortical neurons are likely mediators of genetic risk for PD. The concentration of PD risk variants at sites of open chromatin targeted by members of the bHLH transcription factor family points to an involvement of these transcriptional networks in PD risk mechanisms.
Collapse
|
25
|
Zhu H, Shang L, Zhou X. A Review of Statistical Methods for Identifying Trait-Relevant Tissues and Cell Types. Front Genet 2021; 11:587887. [PMID: 33584792 PMCID: PMC7874162 DOI: 10.3389/fgene.2020.587887] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Accepted: 12/30/2020] [Indexed: 11/17/2022] Open
Abstract
Genome-wide association studies (GWASs) have identified and replicated many genetic variants that are associated with diseases and disease-related complex traits. However, the biological mechanisms underlying these identified associations remain largely elusive. Exploring the biological mechanisms underlying these associations requires identifying trait-relevant tissues and cell types, as genetic variants likely influence complex traits in a tissue- and cell type-specific manner. Recently, several statistical methods have been developed to integrate genomic data with GWASs for identifying trait-relevant tissues and cell types. These methods often rely on different genomic information and use different statistical models for trait-tissue relevance inference. Here, we present a comprehensive technical review to summarize ten existing methods for trait-tissue relevance inference. These methods make use of different genomic information that include functional annotation information, expression quantitative trait loci information, genetically regulated gene expression information, as well as gene co-expression network information. These methods also use different statistical models that range from linear mixed models to covariance network models. We hope that this review can serve as a useful reference both for methodologists who develop methods and for applied analysts who apply these methods for identifying trait relevant tissues and cell types.
Collapse
Affiliation(s)
- Huanhuan Zhu
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, United States
| | - Lulu Shang
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, United States
| | - Xiang Zhou
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, United States
- Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, United States
| |
Collapse
|
26
|
Xin S, Fang W, Li J, Li D, Wang C, Huang Q, Huang M, Zhuang W, Wang X, Chen L. Impact of STAT1 polymorphisms on crizotinib-induced hepatotoxicity in ALK-positive non-small cell lung cancer patients. J Cancer Res Clin Oncol 2021; 147:725-737. [PMID: 33387041 DOI: 10.1007/s00432-020-03476-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Accepted: 11/18/2020] [Indexed: 01/01/2023]
Abstract
PURPOSE Crizotinib is the first-line small molecule tyrosine kinase inhibitor for ALK-positive non-small cell lung cancer. In this study, a retrospective pharmacogenomics investigation was conducted to explore the relationship between genes related to RTK downstream signaling pathways and crizotinib-induced hepatic toxicity in ALK-positive NSCLC patients. METHODS The variable importance analysis of random forest algorithm was applied to identify the significant features which contribute to the crizotinib sensitivity in Cancer Cell Line Encyclopedia (CCLE) database. The KEGG and reactome pathway enrichment analysis were conducted with EnrichR. The differential expression genes were identified with R package DESeq2 in CCLE liver derived cell lines between crizotinib sensitive and resistant groups. From 2012 to 2015, 42 NSCLC patients were enrolled in this study. 90 polymorphisms were genotyped using the Sequenom Massarray system. Sequencing of HGFR (c-Met) genes was carried out on the Ion Torrent Proton. RESULTS In total, 66.7% NSCLC patients suffered from crizotinib-induced liver toxicity and 11.9% progressed to severe hepatic toxicity. The features with the top importance from classification and regression random forest model were enriched in RTK downstream signaling pathways (JAK/STAT, RAS/RAF/MAPK, PI3K/AKT pathways) and immune system-related pathways. Collagen family genes (COL1A1, COL1A2, COL6A1, COL5A1) and other extracellular matrix protein (TNC, TAGLN, TENM2, EDIL3, VCAN, CNN1, SH3BP4, TAGLN), which were closely related to MAPK-ERK signaling pathways, were significantly enriched in crizotinib resistant cell lines. In multiple logistic regression, STAT1 rs10208033 (T > C) was significantly associated with crizotinib-induced liver toxicity (OR = 6.733, 95% CI 1.406-32.24, P = 0.017). Compared with non-CC, OR is 5.5 (95% CI 1.219-24.81, P = 0.027) for STAT1 rs10208033 CC genotype to develop crizotinib-induced liver toxicity. Further cell viability test in human fetal hepatocyte line, L-02, reveals that the STAT1 inhibitor might protect hepatocyte cells from the toxicity caused by crizotinib. CONCLUSION Polymorphism of rs10208033 is a potential biomarker for predicting crizotinib-induced hepatotoxicity. These results suggest that STAT1 plays an important role in crizotinib-induced hepatotoxicity. Further studies are needed to confirm our finding and understand the underlying mechanisms.
Collapse
Affiliation(s)
- Shuang Xin
- Institute of Clinical Pharmacology, School of Pharmaceutical Sciences, Sun Yat-Sen University, Guangzhou, 510060, People's Republic of China.,Department of Medical Oncology, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-Sen University Cancer Center, Guangzhou, 510060, People's Republic of China
| | - Wenfeng Fang
- Department of Medical Oncology, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-Sen University Cancer Center, Guangzhou, 510060, People's Republic of China
| | - Jianwen Li
- CapitalBio Genomics Co., Ltd., Dongguan, 523808, China
| | - Delan Li
- Chemotherapy Department 2, Zhongshan City People's Hospital, Zhongshan, 528403, People's Republic of China
| | - Changzheng Wang
- Department of Computer Science, City University of Hong Kong, Hong Kong, China
| | - Quanfei Huang
- CapitalBio Genomics Co., Ltd., Dongguan, 523808, China
| | - Min Huang
- Institute of Clinical Pharmacology, School of Pharmaceutical Sciences, Sun Yat-Sen University, Guangzhou, 510060, People's Republic of China
| | - Wei Zhuang
- Institute of Clinical Pharmacology, School of Pharmaceutical Sciences, Sun Yat-Sen University, Guangzhou, 510060, People's Republic of China
| | - Xueding Wang
- Institute of Clinical Pharmacology, School of Pharmaceutical Sciences, Sun Yat-Sen University, Guangzhou, 510060, People's Republic of China.
| | - Likun Chen
- Department of Medical Oncology, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Sun Yat-Sen University Cancer Center, Guangzhou, 510060, People's Republic of China.
| |
Collapse
|
27
|
Xue H, Wu C, Pan W. Leveraging existing GWAS summary data of genetically correlated and uncorrelated traits to improve power for a new GWAS. Genet Epidemiol 2020; 44:717-732. [PMID: 32677173 PMCID: PMC7722071 DOI: 10.1002/gepi.22333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2019] [Revised: 06/09/2020] [Accepted: 06/18/2020] [Indexed: 11/08/2022]
Abstract
In spite of the tremendous success of genome-wide association studies (GWAS) in identifying genetic variants associated with complex traits and common diseases, many more are yet to be discovered. Hence, it is always desirable to improve the statistical power of GWAS. Paralleling with the intensive efforts of integrating GWAS with functional annotations or other omic data, we propose leveraging other published GWAS summary data to boost statistical power for a new/focus GWAS; the traits of the published GWAS may or may not be genetically correlated with the target trait of the new GWAS. Building on weighted hypothesis testing with a solid theoretical foundation, we develop a novel and effective method to construct single-nucleotide polymorphism (SNP)-specific weights based on 22 published GWAS data sets with various traits, detecting sometimes dramatically increased numbers of significant SNPs and independent loci as compared to the standard/unweighted analysis. For example, by integrating a schizophrenia GWAS summary data set with 19 other GWAS summary data sets of nonschizophrenia traits, our new method identified 1,585 genome-wide significant SNPs mapping to 15 linkage disequilibrium-independent loci, largely exceeding 818 significant SNPs in 13 independent loci identified by the standard/unweighted analysis; furthermore, using a later and larger schizophrenia GWAS summary data set as the validation data, 1,423 (out of 1,585) significant SNPs identified by the weighted analysis, compared to 705 (out of 818) by the unweighted analysis, were confirmed, while all 15 and 13 independent loci were also confirmed. Similar conclusions were reached with lipids and Alzheimer's disease (AD) traits. We conclude that the proposed approach is simple and cost-effective to improve GWAS power.
Collapse
Affiliation(s)
- Haoran Xue
- School of Statistics, University of Minnesota, Minneapolis, Minnesota
| | - Chong Wu
- Department of Statistics, Florida State University, Tallahassee, Florida
| | - Wei Pan
- Division of Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota
| |
Collapse
|
28
|
Cousminer DL, Freathy RM. Genetics of early growth traits. Hum Mol Genet 2020; 29:R66-R72. [PMID: 32886111 PMCID: PMC7530515 DOI: 10.1093/hmg/ddaa149] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Revised: 06/04/2020] [Accepted: 07/09/2020] [Indexed: 12/11/2022] Open
Abstract
In recent years, genome-wide association studies have shed light on the genetics of early growth and its links with later-life health outcomes. Large-scale datasets and meta-analyses, combined with recently developed analytical methods, have enabled dissection of the maternal and fetal genetic contributions to variation in birth weight. Additionally, longitudinal approaches have shown differences between the genetic contributions to infant, childhood and adult adiposity. In contrast, studies of adult height loci have shown strong associations with early body length and childhood height. Early growth-associated loci provide useful tools for causal analyses: Mendelian randomization (MR) studies have provided evidence that early BMI and height are causally related to a number of adult health outcomes. We advise caution in the design and interpretation of MR studies of birth weight investigating effects of fetal growth on later-life cardiometabolic disease because birth weight is only a crude indicator of fetal growth, and the choice of genetic instrument (maternal or fetal) will greatly influence the interpretation of the results. Most genetic studies of early growth have to date centered on European-ancestry participants and outcomes measured at a single time-point, so key priorities for future studies of early growth genetics are aggregation of large samples of diverse ancestries and longitudinal studies of growth trajectories.
Collapse
Affiliation(s)
- Diana L Cousminer
- Division of Human Genetics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Genetics, University of Pennsylvania, Philadelphia, PA 19104, USA
- Center for Spatial and Functional Genomics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Rachel M Freathy
- Institute of Biomedical and Clinical Science, University of Exeter Medical School, University of Exeter, Exeter EX2 5DW, UK
| |
Collapse
|
29
|
Thaventhiran JED, Lango Allen H, Burren OS, Rae W, Greene D, Staples E, Zhang Z, Farmery JHR, Simeoni I, Rivers E, Maimaris J, Penkett CJ, Stephens J, Deevi SVV, Sanchis-Juan A, Gleadall NS, Thomas MJ, Sargur RB, Gordins P, Baxendale HE, Brown M, Tuijnenburg P, Worth A, Hanson S, Linger RJ, Buckland MS, Rayner-Matthews PJ, Gilmour KC, Samarghitean C, Seneviratne SL, Sansom DM, Lynch AG, Megy K, Ellinghaus E, Ellinghaus D, Jorgensen SF, Karlsen TH, Stirrups KE, Cutler AJ, Kumararatne DS, Chandra A, Edgar JDM, Herwadkar A, Cooper N, Grigoriadou S, Huissoon AP, Goddard S, Jolles S, Schuetz C, Boschann F, Lyons PA, Hurles ME, Savic S, Burns SO, Kuijpers TW, Turro E, Ouwehand WH, Thrasher AJ, Smith KGC. Whole-genome sequencing of a sporadic primary immunodeficiency cohort. Nature 2020; 583:90-95. [PMID: 32499645 PMCID: PMC7334047 DOI: 10.1038/s41586-020-2265-1] [Citation(s) in RCA: 121] [Impact Index Per Article: 30.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2018] [Accepted: 02/26/2020] [Indexed: 12/19/2022]
Abstract
Primary immunodeficiency (PID) is characterized by recurrent and often life-threatening infections, autoimmunity and cancer, and it poses major diagnostic and therapeutic challenges. Although the most severe forms of PID are identified in early childhood, most patients present in adulthood, typically with no apparent family history and a variable clinical phenotype of widespread immune dysregulation: about 25% of patients have autoimmune disease, allergy is prevalent and up to 10% develop lymphoid malignancies1-3. Consequently, in sporadic (or non-familial) PID genetic diagnosis is difficult and the role of genetics is not well defined. Here we address these challenges by performing whole-genome sequencing in a large PID cohort of 1,318 participants. An analysis of the coding regions of the genome in 886 index cases of PID found that disease-causing mutations in known genes that are implicated in monogenic PID occurred in 10.3% of these patients, and a Bayesian approach (BeviMed4) identified multiple new candidate PID-associated genes, including IVNS1ABP. We also examined the noncoding genome, and found deletions in regulatory regions that contribute to disease causation. In addition, we used a genome-wide association study to identify loci that are associated with PID, and found evidence for the colocalization of-and interplay between-novel high-penetrance monogenic variants and common variants (at the PTPN2 and SOCS1 loci). This begins to explain the contribution of common variants to the variable penetrance and phenotypic complexity that are observed in PID. Thus, using a cohort-based whole-genome-sequencing approach in the diagnosis of PID can increase diagnostic yield and further our understanding of the key pathways that influence immune responsiveness in humans.
Collapse
Affiliation(s)
- James E D Thaventhiran
- Cambridge Institute of Therapeutic Immunology and Infectious Disease, Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, Cambridge, UK.
- Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK.
- Medical Research Council Toxicology Unit, School of Biological Sciences, University of Cambridge, Cambridge, UK.
| | - Hana Lango Allen
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- NHS Blood and Transplant, Cambridge Biomedical Campus, Cambridge, UK
- NIHR BioResource, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
- Medical Research Council Epidemiology Unit, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
| | - Oliver S Burren
- Cambridge Institute of Therapeutic Immunology and Infectious Disease, Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, Cambridge, UK
- Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
| | - William Rae
- Cambridge Institute of Therapeutic Immunology and Infectious Disease, Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, Cambridge, UK
- Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
| | - Daniel Greene
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- NIHR BioResource, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
- Medical Research Council Biostatistics Unit, Cambridge Institute of Public Health, Cambridge Biomedical Campus, Cambridge, UK
| | - Emily Staples
- Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
| | - Zinan Zhang
- Cambridge Institute of Therapeutic Immunology and Infectious Disease, Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, Cambridge, UK
- Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- Molecular Development of the Immune System Section, Laboratory of Immune System Biology and Clinical Genomics Program, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, USA
| | - James H R Farmery
- Medical Research Council Biostatistics Unit, Cambridge Institute of Public Health, Cambridge Biomedical Campus, Cambridge, UK
- Cancer Research UK Cambridge Institute, University of Cambridge, Li Ka Shing Centre, Cambridge, UK
| | - Ilenia Simeoni
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- NIHR BioResource, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
| | - Elizabeth Rivers
- UCL Great Ormond Street Institute of Child Health, London, UK
- Great Ormond Street Hospital for Children NHS Foundation Trust, London, UK
| | - Jesmeen Maimaris
- UCL Great Ormond Street Institute of Child Health, London, UK
- Great Ormond Street Hospital for Children NHS Foundation Trust, London, UK
| | - Christopher J Penkett
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- NIHR BioResource, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
| | - Jonathan Stephens
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- NHS Blood and Transplant, Cambridge Biomedical Campus, Cambridge, UK
- NIHR BioResource, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
| | - Sri V V Deevi
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- NIHR BioResource, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
| | - Alba Sanchis-Juan
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- NHS Blood and Transplant, Cambridge Biomedical Campus, Cambridge, UK
- NIHR BioResource, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
| | - Nicholas S Gleadall
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- NHS Blood and Transplant, Cambridge Biomedical Campus, Cambridge, UK
| | - Moira J Thomas
- Department of Immunology, Queen Elizabeth University Hospital, Glasgow, UK
- Gartnavel General Hospital, NHS Greater Glasgow and Clyde, Glasgow, UK
| | - Ravishankar B Sargur
- Sheffield Teaching Hospitals NHS Foundation Trust, Sheffield, UK
- Department of Infection, Immunity and Cardiovascular Disease, University of Sheffield, Sheffield, UK
| | - Pavels Gordins
- East Yorkshire Regional Adult Immunology and Allergy Unit, Hull Royal Infirmary, Hull and East Yorkshire Hospitals NHS Trust, Hull, UK
| | - Helen E Baxendale
- Cambridge Institute of Therapeutic Immunology and Infectious Disease, Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, Cambridge, UK
- Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- Royal Papworth Hospital NHS Foundation Trust, Cambridge, UK
| | - Matthew Brown
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- NIHR BioResource, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
| | - Paul Tuijnenburg
- Department of Pediatric Immunology, Rheumatology and Infectious Diseases, Emma Children's Hospital, Amsterdam, The Netherlands
- Department of Experimental Immunology, Amsterdam University Medical Center (AMC), University of Amsterdam, Amsterdam, The Netherlands
| | - Austen Worth
- UCL Great Ormond Street Institute of Child Health, London, UK
- Great Ormond Street Hospital for Children NHS Foundation Trust, London, UK
| | - Steven Hanson
- Institute of Immunity and Transplantation, University College London, London, UK
- Department of Immunology, Royal Free London NHS Foundation Trust, London, UK
| | - Rachel J Linger
- NIHR BioResource, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
- Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Matthew S Buckland
- Institute of Immunity and Transplantation, University College London, London, UK
- Department of Immunology, Royal Free London NHS Foundation Trust, London, UK
| | - Paula J Rayner-Matthews
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- NIHR BioResource, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
| | - Kimberly C Gilmour
- UCL Great Ormond Street Institute of Child Health, London, UK
- Great Ormond Street Hospital for Children NHS Foundation Trust, London, UK
| | - Crina Samarghitean
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- NIHR BioResource, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
| | - Suranjith L Seneviratne
- Institute of Immunity and Transplantation, University College London, London, UK
- Department of Immunology, Royal Free London NHS Foundation Trust, London, UK
| | - David M Sansom
- Institute of Immunity and Transplantation, University College London, London, UK
- Department of Immunology, Royal Free London NHS Foundation Trust, London, UK
| | - Andy G Lynch
- Cancer Research UK Cambridge Institute, University of Cambridge, Li Ka Shing Centre, Cambridge, UK
- School of Mathematics and Statistics, University of St Andrews, St Andrews, UK
- School of Medicine, University of St Andrews, St Andrews, UK
| | - Karyn Megy
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- NIHR BioResource, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
| | - Eva Ellinghaus
- K.G. Jebsen Inflammation Research Centre, Institute of Clinical Medicine, University of Oslo, Oslo University Hospital, Oslo, Norway
| | - David Ellinghaus
- Department of Transplantation, Institute of Clinical Medicine, University of Oslo, Oslo University Hospital, Oslo, Norway
- Institute of Clinical Molecular Biology, Christian Albrechts University of Kiel, Kiel, Germany
| | - Silje F Jorgensen
- Section of Clinical Immunology and Infectious Diseases, Department of Rheumatology, Dermatology and Infectious Diseases, Oslo University Hospital, Oslo, Norway
- Research Institute of Internal Medicine, Division of Surgery, Inflammatory Diseases and Transplantation, Oslo University Hospital, Oslo, Norway
| | - Tom H Karlsen
- K.G. Jebsen Inflammation Research Centre, Institute of Clinical Medicine, University of Oslo, Oslo University Hospital, Oslo, Norway
| | - Kathleen E Stirrups
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- NIHR BioResource, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
| | - Antony J Cutler
- JDRF/Wellcome Diabetes and Inflammation Laboratory, Wellcome Centre for Human Genetics, Nuffield Department of Medicine, NIHR Oxford Biomedical Research Centre, University of Oxford, Oxford, UK
| | - Dinakantha S Kumararatne
- Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- Department of Clinical Biochemistry and Immunology, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
| | - Anita Chandra
- Cambridge Institute of Therapeutic Immunology and Infectious Disease, Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, Cambridge, UK
- Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- Department of Clinical Biochemistry and Immunology, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
| | - J David M Edgar
- St James's Hospital, Dublin, Ireland
- Trinity College Dublin, Dublin, Ireland
| | | | - Nichola Cooper
- Department of Medicine, Imperial College London, London, UK
| | | | - Aarnoud P Huissoon
- West Midlands Immunodeficiency Centre, University Hospitals Birmingham, Birmingham, UK
- Birmingham Heartlands Hospital, University Hospitals Birmingham NHS Foundation Trust, Birmingham, UK
| | - Sarah Goddard
- University Hospitals of North Midlands NHS Trust, Stoke-on-Trent, UK
| | - Stephen Jolles
- Immunodeficiency Centre for Wales, University Hospital of Wales, Cardiff, UK
| | - Catharina Schuetz
- Department of Pediatric Immunology, University Hospital Carl Gustav Carus, Dresden, Germany
| | - Felix Boschann
- Institute of Medical Genetics and Human Genetics, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Paul A Lyons
- Cambridge Institute of Therapeutic Immunology and Infectious Disease, Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, Cambridge, UK
- Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
| | - Matthew E Hurles
- Department of Human Genetics, Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Sinisa Savic
- Department of Clinical Immunology and Allergy, St James's University Hospital, Leeds, UK
- The NIHR Leeds Biomedical Research Centre, Leeds, UK
- Leeds Institute of Rheumatic and Musculoskeletal Medicine, Leeds, UK
| | - Siobhan O Burns
- Institute of Immunity and Transplantation, University College London, London, UK
- Department of Immunology, Royal Free London NHS Foundation Trust, London, UK
| | - Taco W Kuijpers
- Department of Pediatric Immunology, Rheumatology and Infectious Diseases, Emma Children's Hospital, Amsterdam, The Netherlands
- Department of Experimental Immunology, Amsterdam University Medical Center (AMC), University of Amsterdam, Amsterdam, The Netherlands
- Department of Blood Cell Research, Sanquin, Amsterdam, The Netherlands
| | - Ernest Turro
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- NHS Blood and Transplant, Cambridge Biomedical Campus, Cambridge, UK
- NIHR BioResource, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
- Medical Research Council Biostatistics Unit, Cambridge Institute of Public Health, Cambridge Biomedical Campus, Cambridge, UK
| | - Willem H Ouwehand
- Department of Haematology, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK
- NHS Blood and Transplant, Cambridge Biomedical Campus, Cambridge, UK
- NIHR BioResource, Cambridge University Hospitals, Cambridge Biomedical Campus, Cambridge, UK
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Adrian J Thrasher
- UCL Great Ormond Street Institute of Child Health, London, UK
- Great Ormond Street Hospital for Children NHS Foundation Trust, London, UK
| | - Kenneth G C Smith
- Cambridge Institute of Therapeutic Immunology and Infectious Disease, Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, Cambridge, UK.
- Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, UK.
| |
Collapse
|
30
|
Fortune MD, Wallace C. simGWAS: a fast method for simulation of large scale case-control GWAS summary statistics. Bioinformatics 2020; 35:1901-1906. [PMID: 30371734 PMCID: PMC6546134 DOI: 10.1093/bioinformatics/bty898] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Revised: 09/12/2018] [Accepted: 10/26/2018] [Indexed: 11/23/2022] Open
Abstract
Motivation Methods for analysis of GWAS summary statistics have encouraged data sharing and democratized the analysis of different diseases. Ideal validation for such methods is application to simulated data, where some ‘truth’ is known. As GWAS increase in size, so does the computational complexity of such evaluations; standard practice repeatedly simulates and analyses genotype data for all individuals in an example study. Results We have developed a novel method based on an alternative approach, directly simulating GWAS summary data, without individual data as an intermediate step. We mathematically derive the expected statistics for any set of causal variants and their effect sizes, conditional upon control haplotype frequencies (available from public reference datasets). Simulation of GWAS summary output can be conducted independently of sample size by simulating random variates about these expected values. Across a range of scenarios, our method, produces very similar output to that from simulating individual genotypes with a substantial gain in speed even for modest sample sizes. Fast simulation of GWAS summary statistics will enable more complete and rapid evaluation of summary statistic methods as well as opening new potential avenues of research in fine mapping and gene set enrichment analysis. Availability and implementation Our method is available under a GPL license as an R package from http://github.com/chr1swallace/simGWAS. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mary D Fortune
- MRC Biostatistics Unit, Cambridge Institute of Public Health, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK.,Department of Medicine, University of Cambridge, Addenbrooke's Hospital, Cambridge, UK
| | - Chris Wallace
- MRC Biostatistics Unit, Cambridge Institute of Public Health, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK.,Department of Medicine, University of Cambridge, Addenbrooke's Hospital, Cambridge, UK
| |
Collapse
|
31
|
Thom CS, Jobaliya CD, Lorenz K, Maguire JA, Gagne A, Gadue P, French DL, Voight BF. Tropomyosin 1 genetically constrains in vitro hematopoiesis. BMC Biol 2020; 18:52. [PMID: 32408895 PMCID: PMC7227211 DOI: 10.1186/s12915-020-00783-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Accepted: 04/21/2020] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Identifying causal variants and genes from human genetic studies of hematopoietic traits is important to enumerate basic regulatory mechanisms underlying these traits, and could ultimately augment translational efforts to generate platelets and/or red blood cells in vitro. To identify putative causal genes from these data, we performed computational modeling using available genome-wide association datasets for platelet and red blood cell traits. RESULTS Our model identified a joint collection of genomic features enriched at established trait associations and plausible candidate variants. Additional studies associating variation at these loci with change in gene expression highlighted Tropomyosin 1 (TPM1) among our top-ranked candidate genes. CRISPR/Cas9-mediated TPM1 knockout in human induced pluripotent stem cells (iPSCs) enhanced hematopoietic progenitor development, increasing total megakaryocyte and erythroid cell yields. CONCLUSIONS Our findings may help explain human genetic associations and identify a novel genetic strategy to enhance in vitro hematopoiesis. A similar trait-specific gene prioritization strategy could be employed to help streamline functional validation experiments for virtually any human trait.
Collapse
Affiliation(s)
- Christopher Stephen Thom
- Division of Neonatology, Children's Hospital of Philadelphia, Philadelphia, PA, USA.
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
| | - Chintan D Jobaliya
- Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Kimberly Lorenz
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Jean Ann Maguire
- Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Alyssa Gagne
- Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Paul Gadue
- Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Deborah L French
- Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Benjamin Franklin Voight
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
- Institute of Translational Medicine and Therapeutics, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
32
|
Cano-Gamez E, Trynka G. From GWAS to Function: Using Functional Genomics to Identify the Mechanisms Underlying Complex Diseases. Front Genet 2020; 11:424. [PMID: 32477401 PMCID: PMC7237642 DOI: 10.3389/fgene.2020.00424] [Citation(s) in RCA: 273] [Impact Index Per Article: 68.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Accepted: 04/06/2020] [Indexed: 12/19/2022] Open
Abstract
Genome-wide association studies (GWAS) have successfully mapped thousands of loci associated with complex traits. These associations could reveal the molecular mechanisms altered in common complex diseases and result in the identification of novel drug targets. However, GWAS have also left a number of outstanding questions. In particular, the majority of disease-associated loci lie in non-coding regions of the genome and, even though they are thought to play a role in gene expression regulation, it is unclear which genes they regulate and in which cell types or physiological contexts this regulation occurs. This has hindered the translation of GWAS findings into clinical interventions. In this review we summarize how these challenges have been addressed over the last decade, with a particular focus on the integration of GWAS results with functional genomics datasets. Firstly, we investigate how the tissues and cell types involved in diseases can be identified using methods that test for enrichment of GWAS variants in genomic annotations. Secondly, we explore how to find the genes regulated by GWAS loci using methods that test for colocalization of GWAS signals with molecular phenotypes such as quantitative trait loci (QTLs). Finally, we highlight potential future research avenues such as integrating GWAS results with single-cell sequencing read-outs, designing functionally informed polygenic risk scores (PRS), and validating disease associated genes using genetic engineering. These tools will be crucial to identify new drug targets for common complex diseases.
Collapse
Affiliation(s)
- Eddie Cano-Gamez
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
| | - Gosia Trynka
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, United Kingdom
- Open Targets, Wellcome Genome Campus, Cambridge, United Kingdom
| |
Collapse
|
33
|
Shang L, Smith JA, Zhou X. Leveraging gene co-expression patterns to infer trait-relevant tissues in genome-wide association studies. PLoS Genet 2020; 16:e1008734. [PMID: 32310941 PMCID: PMC7192514 DOI: 10.1371/journal.pgen.1008734] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Revised: 04/30/2020] [Accepted: 03/24/2020] [Indexed: 12/11/2022] Open
Abstract
Genome-wide association studies (GWASs) have identified many SNPs associated with various common diseases. Understanding the biological functions of these identified SNP associations requires identifying disease/trait relevant tissues or cell types. Here, we develop a network method, CoCoNet, to facilitate the identification of trait-relevant tissues or cell types. Different from existing approaches, CoCoNet incorporates tissue-specific gene co-expression networks constructed from either bulk or single cell RNA sequencing (RNAseq) studies with GWAS data for trait-tissue inference. In particular, CoCoNet relies on a covariance regression network model to express gene-level effect measurements for the given GWAS trait as a function of the tissue-specific co-expression adjacency matrix. With a composite likelihood-based inference algorithm, CoCoNet is scalable to tens of thousands of genes. We validate the performance of CoCoNet through extensive simulations. We apply CoCoNet for an in-depth analysis of four neurological disorders and four autoimmune diseases, where we integrate the corresponding GWASs with bulk RNAseq data from 38 tissues and single cell RNAseq data from 10 cell types. In the real data applications, we show how CoCoNet can help identify specific glial cell types relevant for neurological disorders and identify disease-targeted colon tissues as relevant for autoimmune diseases.
Collapse
Affiliation(s)
- Lulu Shang
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, United States of America
| | - Jennifer A. Smith
- Department of Epidemiology, University of Michigan, Ann Arbor, MI, United States of America
- Survey Research Center, Institute for Social Research, University of Michigan, Ann Arbor, MI, United States of America
| | - Xiang Zhou
- Department of Biostatistics, University of Michigan, Ann Arbor, MI, United States of America
- Center for Statistical Genetics, University of Michigan, Ann Arbor, MI, United States of America
| |
Collapse
|
34
|
Wallace C. Eliciting priors and relaxing the single causal variant assumption in colocalisation analyses. PLoS Genet 2020; 16:e1008720. [PMID: 32310995 PMCID: PMC7192519 DOI: 10.1371/journal.pgen.1008720] [Citation(s) in RCA: 142] [Impact Index Per Article: 35.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Revised: 04/30/2020] [Accepted: 03/17/2020] [Indexed: 01/03/2023] Open
Abstract
Horizontal integration of summary statistics from different GWAS traits can be used to evaluate evidence for their shared genetic causality. One popular method to do this is a Bayesian method, coloc, which is attractive in requiring only GWAS summary statistics and no linkage disequilibrium estimates and is now being used routinely to perform thousands of comparisons between traits. Here we show that while most users do not adjust default software values, misspecification of prior parameters can substantially alter posterior inference. We suggest data driven methods to derive sensible prior values, and demonstrate how sensitivity analysis can be used to assess robustness of posterior inference. The flexibility of coloc comes at the expense of an unrealistic assumption of a single causal variant per trait. This assumption can be relaxed by stepwise conditioning, but this requires external software and an LD matrix aligned to study alleles. We have now implemented conditioning within coloc, and propose a new alternative method, masking, that does not require LD and approximates conditioning when causal variants are independent. Importantly, masking can be used in combination with conditioning where allelically aligned LD estimates are available for only a single trait. We have implemented these developments in a new version of coloc which we hope will enable more informed choice of priors and overcome the restriction of the single causal variant assumptions in coloc analysis.
Collapse
Affiliation(s)
- Chris Wallace
- Cambridge Institute for Therapeutic Immunology & Infectious Disease, and MRC Biostatistics Unit, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
35
|
Zhou Y, Sun Y, Huang D, Li MJ. epiCOLOC: Integrating Large-Scale and Context-Dependent Epigenomics Features for Comprehensive Colocalization Analysis. Front Genet 2020; 11:53. [PMID: 32117461 PMCID: PMC7029718 DOI: 10.3389/fgene.2020.00053] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2019] [Accepted: 01/17/2020] [Indexed: 12/18/2022] Open
Abstract
High-throughput genome-wide epigenomic assays, such as ChIP-seq, DNase-seq and ATAC-seq, have profiled a huge number of functional elements across numerous human tissues/cell types, which provide an unprecedented opportunity to interpret human genome and disease in context-dependent manner. Colocalization analysis determines whether genomic features are functionally related to a given search and will facilitate identifying the underlying biological functions characterizing intricate relationships with queries for genomic regions. Existing colocalization methods leveraged diverse assumptions and background models to assess the significance of enrichment, however, they only provided limited and predefined sets of epigenomic features. Here, we comprehensively collected and integrated over 44,385 bulk or single-cell epigenomic assays across 53 human tissues/cell types, such as transcription factor binding, histone modification, open chromatin and transcriptional event. By classifying these profiles into hierarchy of tissue/cell type, we developed a web portal, epiCOLOC (http://mulinlab.org/epicoloc or http://mulinlab.tmu.edu.cn/epicoloc), for users to perform context-dependent colocalization analysis in a convenient way.
Collapse
Affiliation(s)
- Yao Zhou
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
| | - Yongzheng Sun
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
| | - Dandan Huang
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
| | - Mulin Jun Li
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China.,Collaborative Innovation Center of Tianjin for Medical Epigenetics, Tianjin Key Laboratory of Medical Epigenetics, Tianjin Medical University, Tianjin, China
| |
Collapse
|
36
|
Boueiz A, Pham B, Chase R, Lamb A, Lee S, Naing ZZC, Cho MH, Parker MM, Sakornsakolpat P, Hersh CP, Crapo JD, Stergachis AB, Tal-Singer R, DeMeo DL, Silverman EK, Zhou X, Castaldi PJ. Integrative Genomics Analysis Identifies ACVR1B as a Candidate Causal Gene of Emphysema Distribution. Am J Respir Cell Mol Biol 2019; 60:388-398. [PMID: 30335480 DOI: 10.1165/rcmb.2018-0110oc] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Genome-wide association studies (GWAS) have identified multiple associations with emphysema apicobasal distribution (EABD), but the biological functions of these variants are unknown. To characterize the functions of EABD-associated variants, we integrated GWAS results with 1) expression quantitative trait loci (eQTL) from the Genotype Tissue Expression (GTEx) project and subjects in the COPDGene (Genetic Epidemiology of COPD) study and 2) cell type epigenomic marks from the Roadmap Epigenomics project. On the basis of these analyses, we selected a variant near ACVR1B (activin A receptor type 1B) for functional validation. SNPs from 168 loci with P values less than 5 × 10-5 in the largest GWAS meta-analysis of EABD were analyzed. Eighty-four loci overlapped eQTL, with 12 of these loci showing greater than 80% likelihood of harboring a single, shared GWAS and eQTL causal variant. Seventeen cell types were enriched for overlap between EABD loci and Roadmap Epigenomics marks (permutation P < 0.05), with the strongest enrichment observed in CD4+, CD8+, and regulatory T cells. We selected a putative causal variant, rs7962469, associated with ACVR1B expression in lung tissue for additional functional investigation, and reporter assays confirmed allele-specific regulatory activity for this variant in human bronchial epithelial and Jurkat immune cell lines. ACVR1B expression levels exhibit a nominally significant association with emphysema distribution. EABD-associated loci are preferentially enriched in regulatory elements of multiple cell types, most notably T-cell subsets. Multiple EABD loci colocalize to regulatory elements that are active across multiple tissues and cell types, and functional analyses confirm the presence of an EABD-associated functional variant that regulates ACVR1B expression, indicating that transforming growth factor-β signaling plays a role in the EABD phenotype. Clinical trial registered with www.clinicaltrials.gov (NCT00608764).
Collapse
Affiliation(s)
- Adel Boueiz
- 1 Channing Division of Network Medicine.,2 Division of Pulmonary and Critical Care Medicine
| | | | | | | | - Sool Lee
- 1 Channing Division of Network Medicine
| | | | - Michael H Cho
- 1 Channing Division of Network Medicine.,2 Division of Pulmonary and Critical Care Medicine
| | | | | | - Craig P Hersh
- 1 Channing Division of Network Medicine.,2 Division of Pulmonary and Critical Care Medicine
| | - James D Crapo
- 3 Pulmonary Medicine, National Jewish Health, Denver, Colorado; and
| | | | | | - Dawn L DeMeo
- 1 Channing Division of Network Medicine.,2 Division of Pulmonary and Critical Care Medicine
| | - Edwin K Silverman
- 1 Channing Division of Network Medicine.,2 Division of Pulmonary and Critical Care Medicine
| | - Xiaobo Zhou
- 1 Channing Division of Network Medicine.,2 Division of Pulmonary and Critical Care Medicine
| | - Peter J Castaldi
- 1 Channing Division of Network Medicine.,6 Division of General Medicine and Primary Care, Brigham and Women's Hospital, Harvard Medical School, Boston, Massachusetts
| | | |
Collapse
|
37
|
Reynolds RH, Hardy J, Ryten M, Gagliano Taliun SA. Informing disease modelling with brain-relevant functional genomic annotations. Brain 2019; 142:3694-3712. [PMID: 31603214 PMCID: PMC6885670 DOI: 10.1093/brain/awz295] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2019] [Revised: 07/05/2019] [Accepted: 07/29/2019] [Indexed: 12/13/2022] Open
Abstract
The past decade has seen a surge in the number of disease/trait-associated variants, largely because of the union of studies to share genetic data and the availability of electronic health records from large cohorts for research use. Variant discovery for neurological and neuropsychiatric genome-wide association studies, including schizophrenia, Parkinson's disease and Alzheimer's disease, has greatly benefitted; however, the translation of these genetic association results to interpretable biological mechanisms and models is lagging. Interpreting disease-associated variants requires knowledge of gene regulatory mechanisms and computational tools that permit integration of this knowledge with genome-wide association study results. Here, we summarize key conceptual advances in the generation of brain-relevant functional genomic annotations and amongst tools that allow integration of these annotations with association summary statistics, which together provide a new and exciting opportunity to identify disease-relevant genes, pathways and cell types in silico. We discuss the opportunities and challenges associated with these developments and conclude with our perspective on future advances in annotation generation, tool development and the union of the two.
Collapse
Affiliation(s)
- Regina H Reynolds
- Department of Neurodegenerative Disease, University College London (UCL) Institute of Neurology, London, UK
| | - John Hardy
- Department of Neurodegenerative Disease, University College London (UCL) Institute of Neurology, London, UK
- UK Dementia Research Institute at University College London (UCL), London, UK
| | - Mina Ryten
- Department of Neurodegenerative Disease, University College London (UCL) Institute of Neurology, London, UK
| | - Sarah A Gagliano Taliun
- Center for Statistical Genetics and Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA
| |
Collapse
|
38
|
Amariuta T, Luo Y, Knevel R, Okada Y, Raychaudhuri S. Advances in genetics toward identifying pathogenic cell states of rheumatoid arthritis. Immunol Rev 2019; 294:188-204. [PMID: 31782165 DOI: 10.1111/imr.12827] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2019] [Accepted: 11/07/2019] [Indexed: 12/11/2022]
Abstract
Rheumatoid arthritis (RA) risk has a large genetic component (~60%) that is still not fully understood. This has hampered the design of effective treatments that could promise lifelong remission. RA is a polygenic disease with 106 known genome-wide significant associated loci and thousands of small effect causal variants. Our current understanding of RA risk has suggested cell-type-specific contexts for causal variants, implicating CD4 + effector memory T cells, as well as monocytes, B cells and stromal fibroblasts. While these cellular states and categories are still mechanistically broad, future studies may identify causal cell subpopulations. These efforts are propelled by advances in single cell profiling. Identification of causal cell subpopulations may accelerate therapeutic intervention to achieve lifelong remission.
Collapse
Affiliation(s)
- Tiffany Amariuta
- Center for Data Sciences, Harvard Medical School, Boston, MA, USA.,Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.,Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.,Graduate School of Arts and Sciences, Harvard University, Boston, MA, USA
| | - Yang Luo
- Center for Data Sciences, Harvard Medical School, Boston, MA, USA.,Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.,Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Rachel Knevel
- Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.,Department of Rheumatology, Leiden University Medical Centre, Leiden, The Netherlands
| | - Yukinori Okada
- Division of Medicine, Osaka University, Osaka, Japan.,Osaka University Graduate School of Medicine, Osaka, Japan
| | - Soumya Raychaudhuri
- Center for Data Sciences, Harvard Medical School, Boston, MA, USA.,Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.,Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.,Arthritis Research UK Centre for Genetics and Genomics, Centre for Musculoskeletal Research, Manchester Academic Health Science Centre, The University of Manchester, Manchester, UK
| |
Collapse
|
39
|
Thompson DJ, Genovese G, Halvardson J, Ulirsch JC, Wright DJ, Terao C, Davidsson OB, Day FR, Sulem P, Jiang Y, Danielsson M, Davies H, Dennis J, Dunlop MG, Easton DF, Fisher VA, Zink F, Houlston RS, Ingelsson M, Kar S, Kerrison ND, Kinnersley B, Kristjansson RP, Law PJ, Li R, Loveday C, Mattisson J, McCarroll SA, Murakami Y, Murray A, Olszewski P, Rychlicka-Buniowska E, Scott RA, Thorsteinsdottir U, Tomlinson I, Moghadam BT, Turnbull C, Wareham NJ, Gudbjartsson DF, Kamatani Y, Hoffmann ER, Jackson SP, Stefansson K, Auton A, Ong KK, Machiela MJ, Loh PR, Dumanski JP, Chanock SJ, Forsberg LA, Perry JRB. Genetic predisposition to mosaic Y chromosome loss in blood. Nature 2019; 575:652-657. [PMID: 31748747 PMCID: PMC6887549 DOI: 10.1038/s41586-019-1765-3] [Citation(s) in RCA: 178] [Impact Index Per Article: 35.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2019] [Accepted: 10/04/2019] [Indexed: 12/13/2022]
Abstract
Mosaic loss of chromosome Y (LOY) in circulating white blood cells is the most common form of clonal mosaicism1-5, yet our knowledge of the causes and consequences of this is limited. Here, using a computational approach, we estimate that 20% of the male population represented in the UK Biobank study (n = 205,011) has detectable LOY. We identify 156 autosomal genetic determinants of LOY, which we replicate in 757,114 men of European and Japanese ancestry. These loci highlight genes that are involved in cell-cycle regulation and cancer susceptibility, as well as somatic drivers of tumour growth and targets of cancer therapy. We demonstrate that genetic susceptibility to LOY is associated with non-haematological effects on health in both men and women, which supports the hypothesis that clonal haematopoiesis is a biomarker of genomic instability in other tissues. Single-cell RNA sequencing identifies dysregulated expression of autosomal genes in leukocytes with LOY and provides insights into why clonal expansion of these cells may occur. Collectively, these data highlight the value of studying clonal mosaicism to uncover fundamental mechanisms that underlie cancer and other ageing-related diseases.
Collapse
Affiliation(s)
- Deborah J Thompson
- Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Giulio Genovese
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Jonatan Halvardson
- Department of Immunology, Genetics and Pathology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Jacob C Ulirsch
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA
| | - Daniel J Wright
- MRC Epidemiology Unit, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Open Targets Core Genetics, Wellcome Sanger Institute, Hinxton, UK
| | - Chikashi Terao
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
- Clinical Research Center, Shizuoka General Hospital, Shizuoka, Japan
- Department of Applied Genetics, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka, Japan
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| | | | - Felix R Day
- MRC Epidemiology Unit, School of Clinical Medicine, University of Cambridge, Cambridge, UK
- Department of Internal Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
| | | | | | - Marcus Danielsson
- Department of Immunology, Genetics and Pathology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Hanna Davies
- Department of Immunology, Genetics and Pathology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Joe Dennis
- Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Malcolm G Dunlop
- Colon Cancer Genetics Group, Medical Research Council Human Genetics Unit and CRUK Cancer Research Centre, Institute of Genetics and Molecular Medicine, University of Edinburgh, Western General Hospital, Edinburgh, UK
| | - Douglas F Easton
- Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK
| | - Victoria A Fisher
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD, USA
| | | | - Richard S Houlston
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, UK
| | - Martin Ingelsson
- Geriatrics Research Group, Department of Public Health and Caring Sciences, Uppsala University, Uppsala, Sweden
| | - Siddhartha Kar
- Centre for Cancer Genetic Epidemiology, Department of Oncology, University of Cambridge, Cambridge, UK
| | - Nicola D Kerrison
- MRC Epidemiology Unit, School of Clinical Medicine, University of Cambridge, Cambridge, UK
| | - Ben Kinnersley
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, UK
| | | | - Philip J Law
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, UK
| | - Rong Li
- Department of Cell Biology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Chey Loveday
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, UK
| | - Jonas Mattisson
- Department of Immunology, Genetics and Pathology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Steven A McCarroll
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Yoshinori Murakami
- Division of Molecular Pathology, Institute of Medical Science, University of Tokyo, Tokyo, Japan
| | - Anna Murray
- Genetics of Complex Traits, University of Exeter Medical School, University of Exeter, Exeter, UK
| | - Pawel Olszewski
- Faculty of Pharmacy, Medical University of Gdansk, Gdansk, Poland
| | - Edyta Rychlicka-Buniowska
- Department of Immunology, Genetics and Pathology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
- Faculty of Pharmacy, Medical University of Gdansk, Gdansk, Poland
| | - Robert A Scott
- MRC Epidemiology Unit, School of Clinical Medicine, University of Cambridge, Cambridge, UK
| | - Unnur Thorsteinsdottir
- deCODE Genetics, Amgen, Reykjavík, Iceland
- Faculty of Medicine, University of Iceland, Reykjavík, Iceland
| | - Ian Tomlinson
- Cancer Genetics and Evolution Laboratory, Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham, UK
| | - Behrooz Torabi Moghadam
- Department of Immunology, Genetics and Pathology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Clare Turnbull
- Division of Genetics and Epidemiology, The Institute of Cancer Research, London, UK
- William Harvey Research Institute, Queen Mary University, London, UK
| | - Nicholas J Wareham
- MRC Epidemiology Unit, School of Clinical Medicine, University of Cambridge, Cambridge, UK
| | - Daniel F Gudbjartsson
- deCODE Genetics, Amgen, Reykjavík, Iceland
- School of Engineering and Natural Sciences, University of Iceland, Reykjavík, Iceland
| | - Yoichiro Kamatani
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
- Kyoto-McGill International Collaborative School in Genomic Medicine, Kyoto University Graduate School of Medicine, Kyoto, Japan
| | - Eva R Hoffmann
- DNRF Center for Chromosome Stability, Department of Cellular and Molecular Medicine, Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Steve P Jackson
- Department of Biochemistry, University of Cambridge, Cambridge, UK
- Wellcome Trust and Cancer Research UK Gurdon Institute, University of Cambridge, Cambridge, UK
| | - Kari Stefansson
- deCODE Genetics, Amgen, Reykjavík, Iceland
- Faculty of Medicine, University of Iceland, Reykjavík, Iceland
| | | | - Ken K Ong
- MRC Epidemiology Unit, School of Clinical Medicine, University of Cambridge, Cambridge, UK
| | - Mitchell J Machiela
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD, USA
| | - Po-Ru Loh
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Jan P Dumanski
- Department of Immunology, Genetics and Pathology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
- Faculty of Pharmacy, Medical University of Gdansk, Gdansk, Poland
| | - Stephen J Chanock
- Division of Cancer Epidemiology and Genetics, National Cancer Institute, Rockville, MD, USA
| | - Lars A Forsberg
- Department of Immunology, Genetics and Pathology and Science for Life Laboratory, Uppsala University, Uppsala, Sweden
- Beijer Laboratory of Genome Research, Uppsala University, Uppsala, Sweden
| | - John R B Perry
- MRC Epidemiology Unit, School of Clinical Medicine, University of Cambridge, Cambridge, UK.
- Department of Internal Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands.
| |
Collapse
|
40
|
Soskic B, Cano-Gamez E, Smyth DJ, Rowan WC, Nakic N, Esparza-Gordillo J, Bossini-Castillo L, Tough DF, Larminie CGC, Bronson PG, Willé D, Trynka G. Chromatin activity at GWAS loci identifies T cell states driving complex immune diseases. Nat Genet 2019; 51:1486-1493. [PMID: 31548716 PMCID: PMC6872452 DOI: 10.1038/s41588-019-0493-9] [Citation(s) in RCA: 67] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Accepted: 07/30/2019] [Indexed: 12/15/2022]
Abstract
Immune-disease-associated variants are enriched in active chromatin regions of T cells and macrophages. However, whether these variants function in specific cell states is unknown. Here we stimulated T cells and macrophages in the presence of 13 cytokines and profiled active and open chromatin regions. T cell activation induced major chromatin remodeling, while the presence of cytokines fine-tuned the magnitude of changes. We developed a statistical method that accounts for subtle changes in the chromatin landscape to identify SNP enrichment across cell states. Our results point towards the role of immune-disease-associated variants in early rather than late activation of memory CD4+ T cells, with modest differences across cytokines. Furthermore, variants associated with inflammatory bowel disease are enriched in type 1 T helper (TH1) cells, whereas variants associated with Alzheimer's disease are enriched in different macrophage cell states. Our results represent an in-depth analysis of immune-disease-associated variants across a comprehensive panel of activation states of T cells and macrophages.
Collapse
Affiliation(s)
- Blagoje Soskic
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
- Open Targets, Wellcome Genome Campus, Cambridge, UK
| | - Eddie Cano-Gamez
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
- Open Targets, Wellcome Genome Campus, Cambridge, UK
| | - Deborah J Smyth
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
- Open Targets, Wellcome Genome Campus, Cambridge, UK
| | - Wendy C Rowan
- Novel Human Genetics, GSK Research, GSK Medicines Research Centre, Stevenage, UK
| | - Nikolina Nakic
- Functional Genomics, Molecular Science and Technology R&D, GSK Medicines Research Centre, Stevenage, UK
| | | | | | - David F Tough
- Epigenetics RU, Oncology R&D, GSK Medicines Research Centre, Stevenage, UK
| | | | - Paola G Bronson
- Human Target Validation Core, RED Translational Biology, Biogen, Cambridge, MA, USA
| | - David Willé
- Biostatistics, GSK Research, GSK Medicines Research Centre, Stevenage, UK
| | - Gosia Trynka
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK.
- Open Targets, Wellcome Genome Campus, Cambridge, UK.
| |
Collapse
|
41
|
Abstract
PURPOSE OF REVIEW The goal of the review is to provide a comprehensive overview of the current understanding of the mechanisms underlying variation in human stature. RECENT FINDINGS Human height is an anthropometric trait that varies considerably within human populations as well as across the globe. Historically, much research focus was placed on understanding the biology of growth plate chondrocytes and how modifications to core chondrocyte proliferation and differentiation pathways potentially shaped height attainment in normal as well as pathological contexts. Recently, much progress has been made to improve our understanding regarding the mechanisms underlying the normal and pathological range of height variation within as well as between human populations, and today, it is understood to reflect complex interactions among a myriad of genetic, environmental, and evolutionary factors. Indeed, recent improvements in genetics (e.g., GWAS) and breakthroughs in functional genomics (e.g., whole exome sequencing, DNA methylation analysis, ATAC-sequencing, and CRISPR) have shed light on previously unknown pathways/mechanisms governing pathological and common height variation. Additionally, the use of an evolutionary perspective has also revealed important mechanisms that have shaped height variation across the planet. This review provides an overview of the current knowledge of the biological mechanisms underlying height variation by highlighting new research findings on skeletal growth control with an emphasis on previously unknown pathways/mechanisms influencing pathological and common height variation. In this context, this review also discusses how evolutionary forces likely shaped the genomic architecture of height across the globe.
Collapse
Affiliation(s)
| | - Terence D Capellini
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
42
|
Huang D, Yi X, Zhang S, Zheng Z, Wang P, Xuan C, Sham PC, Wang J, Li MJ. GWAS4D: multidimensional analysis of context-specific regulatory variant for human complex diseases and traits. Nucleic Acids Res 2019; 46:W114-W120. [PMID: 29771388 PMCID: PMC6030885 DOI: 10.1093/nar/gky407] [Citation(s) in RCA: 54] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2018] [Accepted: 05/03/2018] [Indexed: 01/04/2023] Open
Abstract
Genome-wide association studies have generated over thousands of susceptibility loci for many human complex traits, and yet for most of these associations the true causal variants remain unknown. Tissue/cell type-specific prediction and prioritization of non-coding regulatory variants will facilitate the identification of causal variants and underlying pathogenic mechanisms for particular complex diseases and traits. By leveraging recent large-scale functional genomics/epigenomics data, we develop an intuitive web server, GWAS4D (http://mulinlab.tmu.edu.cn/gwas4d or http://mulinlab.org/gwas4d), that systematically evaluates GWAS signals and identifies context-specific regulatory variants. The updated web server includes six major features: (i) updates the regulatory variant prioritization method with our new algorithm; (ii) incorporates 127 tissue/cell type-specific epigenomes data; (iii) integrates motifs of 1480 transcriptional regulators from 13 public resources; (iv) uniformly processes Hi-C data and generates significant interactions at 5 kb resolution across 60 tissues/cell types; (v) adds comprehensive non-coding variant functional annotations; (vi) equips a highly interactive visualization function for SNP-target interaction. Using a GWAS fine-mapped set for 161 coronary artery disease risk loci, we demonstrate that GWAS4D is able to efficiently prioritize disease-causal regulatory variants.
Collapse
Affiliation(s)
- Dandan Huang
- Department of Pharmacology, Collaborative Innovation Center of Tianjin for Medical Epigenetics, Tianjin Key Laboratory of Medical Epigenetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China.,Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Xianfu Yi
- School of Biomedical Engineering, Tianjin Medical University, Tianjin, China
| | - Shijie Zhang
- Department of Pharmacology, Collaborative Innovation Center of Tianjin for Medical Epigenetics, Tianjin Key Laboratory of Medical Epigenetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Zhanye Zheng
- Department of Pharmacology, Collaborative Innovation Center of Tianjin for Medical Epigenetics, Tianjin Key Laboratory of Medical Epigenetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Panwen Wang
- Department of Health Sciences Research & Center for Individualized Medicine, Mayo Clinic, Scottsdale, USA
| | - Chenghao Xuan
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Pak Chung Sham
- Center for Genomic Sciences, The University of Hong Kong, Hong Kong SAR, China.,Departments of Psychiatry, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China.,State Key Laboratory of Brain and Cognitive Sciences, The University of Hong Kong, Hong Kong SAR, China
| | - Junwen Wang
- Department of Health Sciences Research & Center for Individualized Medicine, Mayo Clinic, Scottsdale, USA.,Department of Biomedical Informatics, Arizona State University, Scottsdale, USA
| | - Mulin Jun Li
- Department of Pharmacology, Collaborative Innovation Center of Tianjin for Medical Epigenetics, Tianjin Key Laboratory of Medical Epigenetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| |
Collapse
|
43
|
Parker MM, Hao Y, Guo F, Pham B, Chase R, Platig J, Cho MH, Hersh CP, Thannickal VJ, Crapo J, Washko G, Randell SH, Silverman EK, San José Estépar R, Zhou X, Castaldi PJ. Identification of an emphysema-associated genetic variant near TGFB2 with regulatory effects in lung fibroblasts. eLife 2019; 8:e42720. [PMID: 31343404 PMCID: PMC6693893 DOI: 10.7554/elife.42720] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2018] [Accepted: 07/25/2019] [Indexed: 02/06/2023] Open
Abstract
Murine studies have linked TGF-β signaling to emphysema, and human genome-wide association studies (GWAS) studies of lung function and COPD have identified associated regions near genes in the TGF-β superfamily. However, the functional regulatory mechanisms at these loci have not been identified. We performed the largest GWAS of emphysema patterns to date, identifying 10 GWAS loci including an association peak spanning a 200 kb region downstream from TGFB2. Integrative analysis of publicly available eQTL, DNaseI, and chromatin conformation data identified a putative functional variant, rs1690789, that may regulate TGFB2 expression in human fibroblasts. Using chromatin conformation capture, we confirmed that the region containing rs1690789 contacts the TGFB2 promoter in fibroblasts, and CRISPR/Cas-9 targeted deletion of a ~ 100 bp region containing rs1690789 resulted in decreased TGFB2 expression in primary human lung fibroblasts. These data provide novel mechanistic evidence linking genetic variation affecting the TGF-β pathway to emphysema in humans.
Collapse
Affiliation(s)
- Margaret M Parker
- Channing Division of Network MedicineBrigham and Women’s HospitalBostonUnited States
| | - Yuan Hao
- Channing Division of Network MedicineBrigham and Women’s HospitalBostonUnited States
| | - Feng Guo
- Channing Division of Network MedicineBrigham and Women’s HospitalBostonUnited States
| | - Betty Pham
- Channing Division of Network MedicineBrigham and Women’s HospitalBostonUnited States
| | - Robert Chase
- Channing Division of Network MedicineBrigham and Women’s HospitalBostonUnited States
| | - John Platig
- Channing Division of Network MedicineBrigham and Women’s HospitalBostonUnited States
| | - Michael H Cho
- Channing Division of Network MedicineBrigham and Women’s HospitalBostonUnited States
- Division of Pulmonary and Critical Care MedicineBrigham and Women’s HospitalBostonUnited States
| | - Craig P Hersh
- Channing Division of Network MedicineBrigham and Women’s HospitalBostonUnited States
- Division of Pulmonary and Critical Care MedicineBrigham and Women’s HospitalBostonUnited States
| | - Victor J Thannickal
- Division of Pulmonary, Allergy and Critical Care, Department of MedicineSchool of Medicine, University of Alabama at BirminghamBirminghamUnited States
| | - James Crapo
- Division of Pulmonary, Critical Care and Sleep MedicineNational Jewish HealthDenverUnited States
| | - George Washko
- Division of Pulmonary and Critical Care MedicineBrigham and Women’s HospitalBostonUnited States
| | - Scott H Randell
- Marsico Lung InstituteThe University of North Carolina at Chapel HillChapel HillUnited States
| | - Edwin K Silverman
- Channing Division of Network MedicineBrigham and Women’s HospitalBostonUnited States
- Division of Pulmonary and Critical Care MedicineBrigham and Women’s HospitalBostonUnited States
| | | | - Xiaobo Zhou
- Channing Division of Network MedicineBrigham and Women’s HospitalBostonUnited States
| | - Peter J Castaldi
- Channing Division of Network MedicineBrigham and Women’s HospitalBostonUnited States
- Division of General Internal Medicine and Primary CareBrigham and Women’s HospitalBostonUnited States
| |
Collapse
|
44
|
|
45
|
Amariuta T, Luo Y, Gazal S, Davenport EE, van de Geijn B, Ishigaki K, Westra HJ, Teslovich N, Okada Y, Yamamoto K, Price AL, Raychaudhuri S. IMPACT: Genomic Annotation of Cell-State-Specific Regulatory Elements Inferred from the Epigenome of Bound Transcription Factors. Am J Hum Genet 2019; 104:879-895. [PMID: 31006511 PMCID: PMC6506796 DOI: 10.1016/j.ajhg.2019.03.012] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Accepted: 03/14/2019] [Indexed: 12/18/2022] Open
Abstract
Despite significant progress in annotating the genome with experimental methods, much of the regulatory noncoding genome remains poorly defined. Here we assert that regulatory elements may be characterized by leveraging local epigenomic signatures where specific transcription factors (TFs) are bound. To link these two features, we introduce IMPACT, a genome annotation strategy that identifies regulatory elements defined by cell-state-specific TF binding profiles, learned from 515 chromatin and sequence annotations. We validate IMPACT using multiple compelling applications. First, IMPACT distinguishes between bound and unbound TF motif sites with high accuracy (average AUPRC 0.81, SE 0.07; across 8 tested TFs) and outperforms state-of-the-art TF binding prediction methods, MocapG, MocapS, and Virtual ChIP-seq. Second, in eight tested cell types, RNA polymerase II IMPACT annotations capture more cis-eQTL variation than sequence-based annotations, such as promoters and TSS windows (25% average increase in enrichment). Third, integration with rheumatoid arthritis (RA) summary statistics from European (N = 38,242) and East Asian (N = 22,515) populations revealed that the top 5% of CD4+ Treg IMPACT regulatory elements capture 85.7% of RA h2, the most comprehensive explanation for RA h2 to date. In comparison, the average RA h2 captured by compared CD4+ T histone marks is 42.3% and by CD4+ T specifically expressed gene sets is 36.4%. Lastly, we find that IMPACT may be used in many different cell types to identify complex trait associated regulatory elements.
Collapse
Affiliation(s)
- Tiffany Amariuta
- Center for Data Sciences, Harvard Medical School, Boston, MA 02115, USA; Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA; Graduate School of Arts and Sciences, Harvard University, Cambridge, MA 02138, USA
| | - Yang Luo
- Center for Data Sciences, Harvard Medical School, Boston, MA 02115, USA; Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Steven Gazal
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | - Emma E Davenport
- Center for Data Sciences, Harvard Medical School, Boston, MA 02115, USA; Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Bryce van de Geijn
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA
| | - Kazuyoshi Ishigaki
- Center for Data Sciences, Harvard Medical School, Boston, MA 02115, USA; Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Harm-Jan Westra
- Center for Data Sciences, Harvard Medical School, Boston, MA 02115, USA; Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Faculty of Medical Sciences, University of Groningen, Groningen, the Netherlands
| | - Nikola Teslovich
- Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA
| | - Yukinori Okada
- Osaka University Graduate School of Medicine, Osaka, Japan; Immunology Frontier Research Center (WPI-IFReC), Osaka University, Osaka, Japan
| | - Kazuhiko Yamamoto
- RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, Japan
| | - Alkes L Price
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA; Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA 02115, USA.
| | - Soumya Raychaudhuri
- Center for Data Sciences, Harvard Medical School, Boston, MA 02115, USA; Divisions of Genetics and Rheumatology, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA; Graduate School of Arts and Sciences, Harvard University, Cambridge, MA 02138, USA; Arthritis Research UK Centre for Genetics and Genomics, Centre for Musculoskeletal Research, Manchester Academic Health Science Centre, The University of Manchester, Manchester, UK.
| |
Collapse
|
46
|
Kanduri C, Bock C, Gundersen S, Hovig E, Sandve GK. Colocalization analyses of genomic elements: approaches, recommendations and challenges. Bioinformatics 2019; 35:1615-1624. [PMID: 30307532 PMCID: PMC6499241 DOI: 10.1093/bioinformatics/bty835] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 09/03/2018] [Accepted: 10/10/2018] [Indexed: 12/23/2022] Open
Abstract
MOTIVATION Many high-throughput methods produce sets of genomic regions as one of their main outputs. Scientists often use genomic colocalization analysis to interpret such region sets, for example to identify interesting enrichments and to understand the interplay between the underlying biological processes. Although widely used, there is little standardization in how these analyses are performed. Different practices can substantially affect the conclusions of colocalization analyses. RESULTS Here, we describe the different approaches and provide recommendations for performing genomic colocalization analysis, while also discussing common methodological challenges that may influence the conclusions. As illustrated by concrete example cases, careful attention to analysis details is needed in order to meet these challenges and to obtain a robust and biologically meaningful interpretation of genomic region set data. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Chakravarthi Kanduri
- Department of Informatics, University of Oslo, Oslo, Norway
- K. G. Jebsen Coeliac Disease Research Centre, Oslo, Norway
| | - Christoph Bock
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria
- Department of Laboratory Medicine, Medical University of Vienna, Vienna, Austria
- Max Planck Institute for Informatics, Saarbrücken, Germany
| | - Sveinung Gundersen
- Department of Informatics, University of Oslo, Oslo, Norway
- Elixir Norway, Oslo Node, University of Oslo, Oslo, Norway
| | - Eivind Hovig
- Department of Informatics, University of Oslo, Oslo, Norway
- Elixir Norway, Oslo Node, University of Oslo, Oslo, Norway
- Department of Tumor Biology, Institute for Cancer Research, Oslo, Norway
- Institute for Cancer Genetics and Informatics, The Norwegian Radium Hospital, Oslo, Norway, UK
| | - Geir Kjetil Sandve
- Department of Informatics, University of Oslo, Oslo, Norway
- K. G. Jebsen Coeliac Disease Research Centre, Oslo, Norway
| |
Collapse
|
47
|
Fan S, Kelly DE, Beltrame MH, Hansen MEB, Mallick S, Ranciaro A, Hirbo J, Thompson S, Beggs W, Nyambo T, Omar SA, Meskel DW, Belay G, Froment A, Patterson N, Reich D, Tishkoff SA. African evolutionary history inferred from whole genome sequence data of 44 indigenous African populations. Genome Biol 2019; 20:82. [PMID: 31023338 PMCID: PMC6485071 DOI: 10.1186/s13059-019-1679-2] [Citation(s) in RCA: 67] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2018] [Accepted: 03/22/2019] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Africa is the origin of modern humans within the past 300 thousand years. To infer the complex demographic history of African populations and adaptation to diverse environments, we sequenced the genomes of 92 individuals from 44 indigenous African populations. RESULTS Genetic structure analyses indicate that among Africans, genetic ancestry is largely partitioned by geography and language, though we observe mixed ancestry in many individuals, consistent with both short- and long-range migration events followed by admixture. Phylogenetic analysis indicates that the San genetic lineage is basal to all modern human lineages. The San and Niger-Congo, Afroasiatic, and Nilo-Saharan lineages were substantially diverged by 160 kya (thousand years ago). In contrast, the San and Central African rainforest hunter-gatherer (CRHG), Hadza hunter-gatherer, and Sandawe hunter-gatherer lineages were diverged by ~ 120-100 kya. Niger-Congo, Nilo-Saharan, and Afroasiatic lineages diverged more recently by ~ 54-16 kya. Eastern and western CRHG lineages diverged by ~ 50-31 kya, and the western CRHG lineages diverged by ~ 18-12 kya. The San and CRHG populations maintained the largest effective population size compared to other populations prior to 60 kya. Further, we observed signatures of positive selection at genes involved in muscle development, bone synthesis, reproduction, immune function, energy metabolism, and cell signaling, which may contribute to local adaptation of African populations. CONCLUSIONS We observe high levels of genomic variation between ethnically diverse Africans which is largely correlated with geography and language. Our study indicates ancient population substructure and local adaptation of Africans.
Collapse
Affiliation(s)
- Shaohua Fan
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Present Address: State Key Laboratory of Genetic Engineering, Human Phenome Institute, School of Life Sciences, Fudan University, 2005 Songhu Road, Shanghai, China
| | - Derek E Kelly
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Marcia H Beltrame
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Matthew E B Hansen
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Swapan Mallick
- Department of Genetics, Harvard Medical School, Boston, MA, 02115, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, 02115, USA
| | - Alessia Ranciaro
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Jibril Hirbo
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Present Address: Division of Genetic Medicine, Vanderbilt University Medical Center, Vanderbilt University, Nashville, TN, 37232, USA
| | - Simon Thompson
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - William Beggs
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Thomas Nyambo
- Department of Biochemistry, Muhimbili University of Health and Allied Sciences, Dares Salaam, Tanzania
| | - Sabah A Omar
- Center for Biotechnology Research and Development, Kenya Medical Research Institute, Nairobi, Kenya
| | | | - Gurja Belay
- Department of Biology, Addis Ababa University, Addis Ababa, Ethiopia
| | | | - Nick Patterson
- Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
| | - David Reich
- Department of Genetics, Harvard Medical School, Boston, MA, 02115, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, 02142, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, 02115, USA
| | - Sarah A Tishkoff
- Department of Genetics, University of Pennsylvania, Philadelphia, PA, 19104, USA.
- Department of Biology, University of Pennsylvania, Philadelphia, PA, 19104, USA.
| |
Collapse
|
48
|
Ulirsch JC, Lareau CA, Bao EL, Ludwig LS, Guo MH, Benner C, Satpathy AT, Kartha VK, Salem RM, Hirschhorn JN, Finucane HK, Aryee MJ, Buenrostro JD, Sankaran VG. Interrogation of human hematopoiesis at single-cell and single-variant resolution. Nat Genet 2019; 51:683-693. [PMID: 30858613 PMCID: PMC6441389 DOI: 10.1038/s41588-019-0362-6] [Citation(s) in RCA: 109] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2018] [Accepted: 01/28/2019] [Indexed: 11/16/2022]
Abstract
Widespread linkage disequilibrium and incomplete annotation of cell-to-cell state variation represent substantial challenges to elucidating mechanisms of trait-associated genetic variation. Here, we perform genetic fine-mapping for blood cell traits in the UK Biobank to identify putative causal variants. These variants are enriched in genes encoding for proteins in trait-relevant biological pathways and in accessible chromatin of hematopoietic progenitors. For regulatory variants, we explore patterns of developmental enhancer activity, predict molecular mechanisms, and identify likely target genes. In several instances, we localize multiple independent variants to the same regulatory element or gene. We further observe that variants with pleiotropic effects preferentially act in common progenitor populations to direct the production of distinct lineages. Finally, we leverage fine-mapped variants in conjunction with continuous epigenomic annotations to identify trait-cell type enrichments within closely related populations and in single cells. Our study provides a comprehensive framework for single-variant and single-cell analyses of genetic associations. Fine mapping of blood cell traits in UK Biobank identifies putative causal variants and enrichment of fine-mapped variants in accessible chromatin of hematopoietic progenitor cells. The study provides an analytical framework for single-variant and single-cell analyses of genetic associations.
Collapse
Affiliation(s)
- Jacob C Ulirsch
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.,Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA
| | - Caleb A Lareau
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.,Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA.,Department of Pathology, Massachusetts General Hospital, Boston, MA, USA
| | - Erik L Bao
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.,Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Harvard-MIT Health Sciences and Technology, Harvard Medical School, Boston, MA, USA
| | - Leif S Ludwig
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.,Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Michael H Guo
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Division of Endocrinology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.,Department of Genetics, Harvard Medical School, Boston, MA, USA.,Center for Basic and Translational Obesity Research, Boston Children's Hospital, Boston, MA, USA
| | - Christian Benner
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland.,Department of Public Health, University of Helsinki, Helsinki, Finland
| | - Ansuman T Satpathy
- Department of Pathology, Stanford University School of Medicine, Stanford, CA, USA
| | - Vinay K Kartha
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA
| | - Rany M Salem
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Division of Endocrinology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.,Department of Genetics, Harvard Medical School, Boston, MA, USA.,Center for Basic and Translational Obesity Research, Boston Children's Hospital, Boston, MA, USA
| | - Joel N Hirschhorn
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Division of Endocrinology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.,Department of Genetics, Harvard Medical School, Boston, MA, USA.,Center for Basic and Translational Obesity Research, Boston Children's Hospital, Boston, MA, USA
| | - Hilary K Finucane
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Schmidt Fellows Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Martin J Aryee
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Department of Pathology, Massachusetts General Hospital, Boston, MA, USA.,Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Jason D Buenrostro
- Broad Institute of MIT and Harvard, Cambridge, MA, USA. .,Department of Stem Cell and Regenerative Biology, Harvard University, Cambridge, MA, USA.
| | - Vijay G Sankaran
- Division of Hematology/Oncology, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA. .,Department of Pediatric Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA, USA. .,Broad Institute of MIT and Harvard, Cambridge, MA, USA. .,Harvard Stem Cell Institute, Cambridge, MA, USA.
| |
Collapse
|
49
|
Iotchkova V, Ritchie GRS, Geihs M, Morganella S, Min JL, Walter K, Timpson NJ, Dunham I, Birney E, Soranzo N. GARFIELD classifies disease-relevant genomic features through integration of functional annotations with association signals. Nat Genet 2019; 51:343-353. [PMID: 30692680 PMCID: PMC6908448 DOI: 10.1038/s41588-018-0322-6] [Citation(s) in RCA: 117] [Impact Index Per Article: 23.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2016] [Accepted: 11/29/2018] [Indexed: 12/31/2022]
Abstract
Loci discovered by genome-wide association studies predominantly map outside protein-coding genes. The interpretation of the functional consequences of non-coding variants can be greatly enhanced by catalogs of regulatory genomic regions in cell lines and primary tissues. However, robust and readily applicable methods are still lacking by which to systematically evaluate the contribution of these regions to genetic variation implicated in diseases or quantitative traits. Here we propose a novel approach that leverages genome-wide association studies' findings with regulatory or functional annotations to classify features relevant to a phenotype of interest. Within our framework, we account for major sources of confounding not offered by current methods. We further assess enrichment of genome-wide association studies for 19 traits within Encyclopedia of DNA Elements- and Roadmap-derived regulatory regions. We characterize unique enrichment patterns for traits and annotations driving novel biological insights. The method is implemented in standalone software and an R package, to facilitate its application by the research community.
Collapse
Affiliation(s)
- Valentina Iotchkova
- Human Genetics, Wellcome Sanger Institute, Hinxton, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Graham R S Ritchie
- Human Genetics, Wellcome Sanger Institute, Hinxton, UK
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | | | - Sandro Morganella
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Josine L Min
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | | | - Nicholas John Timpson
- MRC Integrative Epidemiology Unit, University of Bristol, Bristol, UK
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, UK
| | - Ian Dunham
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Ewan Birney
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton, UK.
| | - Nicole Soranzo
- Human Genetics, Wellcome Sanger Institute, Hinxton, UK.
- Department of Haematology, University of Cambridge, Cambridge, UK.
- The National Institute for Health Research Blood and Transplant Unit (NIHR BTRU) in Donor Health and Genomics, University of Cambridge, Cambridge, UK.
| |
Collapse
|
50
|
Liu L, Sanderford MD, Patel R, Chandrashekar P, Gibson G, Kumar S. Biological relevance of computationally predicted pathogenicity of noncoding variants. Nat Commun 2019; 10:330. [PMID: 30659175 PMCID: PMC6338804 DOI: 10.1038/s41467-018-08270-y] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2018] [Accepted: 12/19/2018] [Indexed: 11/15/2022] Open
Abstract
Computational prediction of the phenotypic propensities of noncoding single nucleotide variants typically combines annotation of genomic, functional and evolutionary attributes into a single score. Here, we evaluate if the claimed excellent accuracies of these predictions translate into high rates of success in addressing questions important in biological research, such as fine mapping causal variants, distinguishing pathogenic allele(s) at a given position, and prioritizing variants for genetic risk assessment. A significant disconnect is found to exist between the statistical modelling and biological performance of predictive approaches. We discuss fundamental reasons underlying these deficiencies and suggest that future improvements of computational predictions need to address confounding of allelic, positional and regional effects as well as imbalance of the proportion of true positive variants in candidate lists. Researchers can make use of a variety of computational tools to prioritize genetic variants and predict their pathogenicity. Here, the authors evaluate the performance of six of these tools in three typical biological tasks and find generally low concordance of predictions and experimental confirmation.
Collapse
Affiliation(s)
- Li Liu
- College of Health Solutions, Biodesign Institute, Arizona State University, Tempe, AZ, USA
| | - Maxwell D Sanderford
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, USA
| | - Ravi Patel
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, USA.,Department of Biology, Temple University, Philadelphia, PA, USA
| | - Pramod Chandrashekar
- College of Health Solutions, Biodesign Institute, Arizona State University, Tempe, AZ, USA
| | - Greg Gibson
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA.
| | - Sudhir Kumar
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, PA, USA. .,Department of Biology, Temple University, Philadelphia, PA, USA.
| |
Collapse
|