1
|
Kontou PI, Bagos PG. The goldmine of GWAS summary statistics: a systematic review of methods and tools. BioData Min 2024; 17:31. [PMID: 39238044 PMCID: PMC11375927 DOI: 10.1186/s13040-024-00385-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Accepted: 08/27/2024] [Indexed: 09/07/2024] Open
Abstract
Genome-wide association studies (GWAS) have revolutionized our understanding of the genetic architecture of complex traits and diseases. GWAS summary statistics have become essential tools for various genetic analyses, including meta-analysis, fine-mapping, and risk prediction. However, the increasing number of GWAS summary statistics and the diversity of software tools available for their analysis can make it challenging for researchers to select the most appropriate tools for their specific needs. This systematic review aims to provide a comprehensive overview of the currently available software tools and databases for GWAS summary statistics analysis. We conducted a comprehensive literature search to identify relevant software tools and databases. We categorized the tools and databases by their functionality, including data management, quality control, single-trait analysis, and multiple-trait analysis. We also compared the tools and databases based on their features, limitations, and user-friendliness. Our review identified a total of 305 functioning software tools and databases dedicated to GWAS summary statistics, each with unique strengths and limitations. We provide descriptions of the key features of each tool and database, including their input/output formats, data types, and computational requirements. We also discuss the overall usability and applicability of each tool for different research scenarios. This comprehensive review will serve as a valuable resource for researchers who are interested in using GWAS summary statistics to investigate the genetic basis of complex traits and diseases. By providing a detailed overview of the available tools and databases, we aim to facilitate informed tool selection and maximize the effectiveness of GWAS summary statistics analysis.
Collapse
Affiliation(s)
| | - Pantelis G Bagos
- Department of Computer Science and Biomedical Informatics, University of Thessaly, 35131, Lamia, Greece.
| |
Collapse
|
2
|
Zou Y, Carbonetto P, Xie D, Wang G, Stephens M. Fast and flexible joint fine-mapping of multiple traits via the Sum of Single Effects model. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.04.14.536893. [PMID: 37425935 PMCID: PMC10327118 DOI: 10.1101/2023.04.14.536893] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
We introduce mvSuSiE, a multi-trait fine-mapping method for identifying putative causal variants from genetic association data (individual-level or summary data). mvSuSiE learns patterns of shared genetic effects from data, and exploits these patterns to improve power to identify causal SNPs. Comparisons on simulated data show that mvSuSiE is competitive in speed, power and precision with existing multi-trait methods, and uniformly improves on single-trait fine-mapping (SuSiE) in each trait separately. We applied mvSuSiE to jointly fine-map 16 blood cell traits using data from the UK Biobank. By jointly analyzing the traits and modeling heterogeneous effect sharing patterns, we discovered a much larger number of causal SNPs (>3,000) compared with single-trait fine-mapping, and with narrower credible sets. mvSuSiE also more comprehensively characterized the ways in which the genetic variants affect one or more blood cell traits; 68% of causal SNPs showed significant effects in more than one blood cell type.
Collapse
Affiliation(s)
- Yuxin Zou
- Department of Statistics, University of Chicago, Chicago, IL, USA
- Regeneron Genetics Center, Regeneron Pharmaceuticals, Inc., Tarrytown, NY, USA
| | - Peter Carbonetto
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| | - Dongyue Xie
- Department of Statistics, University of Chicago, Chicago, IL, USA
| | - Gao Wang
- Gertrude. H. Sergievsky Center, Department of Neurology, Columbia University, New York, NY, USA
| | - Matthew Stephens
- Department of Statistics, University of Chicago, Chicago, IL, USA
- Department of Human Genetics, University of Chicago, Chicago, IL, USA
| |
Collapse
|
3
|
Lincoln MR, Connally N, Axisa PP, Gasperi C, Mitrovic M, van Heel D, Wijmenga C, Withoff S, Jonkers IH, Padyukov L, Rich SS, Graham RR, Gaffney PM, Langefeld CD, Vyse TJ, Hafler DA, Chun S, Sunyaev SR, Cotsapas C. Genetic mapping across autoimmune diseases reveals shared associations and mechanisms. Nat Genet 2024; 56:838-845. [PMID: 38741015 DOI: 10.1038/s41588-024-01732-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 03/21/2024] [Indexed: 05/16/2024]
Abstract
Autoimmune and inflammatory diseases are polygenic disorders of the immune system. Many genomic loci harbor risk alleles for several diseases, but the limited resolution of genetic mapping prevents determining whether the same allele is responsible, indicating a shared underlying mechanism. Here, using a collection of 129,058 cases and controls across 6 diseases, we show that ~40% of overlapping associations are due to the same allele. We improve fine-mapping resolution for shared alleles twofold by combining cases and controls across diseases, allowing us to identify more expression quantitative trait loci driven by the shared alleles. The patterns indicate widespread sharing of pathogenic mechanisms but not a single global autoimmune mechanism. Our approach can be applied to any set of traits and is particularly valuable as sample collections become depleted.
Collapse
Affiliation(s)
- Matthew R Lincoln
- Department of Neurology, Yale School of Medicine, New Haven, CT, USA
- Division of Neurology at the Department of Medicine, University of Toronto, Toronto, Ontario, Canada
- Keenan Research Centre for Biomedical Science, St. Michael's Hospital, Toronto, Ontario, Canada
| | - Noah Connally
- Department of Neurology, Yale School of Medicine, New Haven, CT, USA
| | - Pierre-Paul Axisa
- Department of Neurology, Yale School of Medicine, New Haven, CT, USA
| | | | - Mitja Mitrovic
- Department of Neurology, Yale School of Medicine, New Haven, CT, USA
- Faculty of Chemistry and Chemical Engineering, University of Maribor, Maribor, Slovenia
| | - David van Heel
- Blizard Institute, Queen Mary University of London, London, UK
| | - Cisca Wijmenga
- Department of Genetics at the University Medical Center Groningen, University of Groningen, Groningen, the Netherlands
| | - Sebo Withoff
- Department of Genetics at the University Medical Center Groningen, University of Groningen, Groningen, the Netherlands
| | - Iris H Jonkers
- Department of Genetics at the University Medical Center Groningen, University of Groningen, Groningen, the Netherlands
| | - Leonid Padyukov
- Division of Rheumatology at the Department of Medicine, Karolinska Institutet and Karolinska University Hospital, Stockholm, Sweden
| | - Stephen S Rich
- Center for Public Health Genomics, University of Virginia, Charlottesville, VA, USA
- Department of Public Health Sciences, University of Virginia, Charlottesville, VA, USA
| | - Robert R Graham
- Maze Therapeutics, South San Francisco, CA, USA
- Genentech, South San Francisco, CA, USA
| | - Patrick M Gaffney
- Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK, USA
| | - Carl D Langefeld
- Department of Biostatistics and Data Science, Wake Forest School of Medicine, Winston-Salem, NC, USA
- Center for Precision Medicine, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Timothy J Vyse
- Department of Medical and Molecular Genetics, Kings College London, London, UK
| | - David A Hafler
- Department of Neurology, Yale School of Medicine, New Haven, CT, USA
| | - Sung Chun
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Shamil R Sunyaev
- Division of Genetics, Brigham and Women's Hospital, Boston, MA, USA
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Chris Cotsapas
- Department of Neurology, Yale School of Medicine, New Haven, CT, USA.
- Department of Genetics, Yale School of Medicine, New Haven, CT, USA.
- Vesalius Therapeutics, Cambridge, MA, USA.
| |
Collapse
|
4
|
Bao C, Tan T, Wang S, Gao C, Lu C, Yang S, Diao Y, Jiang L, Jing D, Chen L, Lv H, Fang H. A cross-disease, pleiotropy-driven approach for therapeutic target prioritization and evaluation. CELL REPORTS METHODS 2024; 4:100757. [PMID: 38631345 PMCID: PMC11046034 DOI: 10.1016/j.crmeth.2024.100757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Revised: 01/08/2024] [Accepted: 03/22/2024] [Indexed: 04/19/2024]
Abstract
Cross-disease genome-wide association studies (GWASs) unveil pleiotropic loci, mostly situated within the non-coding genome, each of which exerts pleiotropic effects across multiple diseases. However, the challenge "W-H-W" (namely, whether, how, and in which specific diseases pleiotropy can inform clinical therapeutics) calls for effective and integrative approaches and tools. We here introduce a pleiotropy-driven approach specifically designed for therapeutic target prioritization and evaluation from cross-disease GWAS summary data, with its validity demonstrated through applications to two systems of disorders (neuropsychiatric and inflammatory). We illustrate its improved performance in recovering clinical proof-of-concept therapeutic targets. Importantly, it identifies specific diseases where pleiotropy informs clinical therapeutics. Furthermore, we illustrate its versatility in accomplishing advanced tasks, including pathway crosstalk identification and downstream crosstalk-based analyses. To conclude, our integrated solution helps bridge the gap between pleiotropy studies and therapeutics discovery.
Collapse
Affiliation(s)
- Chaohui Bao
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Tingting Tan
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Shan Wang
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Chenxu Gao
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Chang Lu
- MRC London Institute of Medical Sciences, Imperial College London, W12 0HS London, UK
| | - Siyue Yang
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China; Faculty of Medical Laboratory Science, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Yizhu Diao
- College of Finance and Statistics, Hunan University, Changsha, Hunan 410079, China
| | - Lulu Jiang
- Translational Health Sciences, University of Bristol, BS1 3NY Bristol, UK
| | - Duohui Jing
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China
| | - Liye Chen
- Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, OX3 7LD Oxford, UK.
| | - Haitao Lv
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China; School of Chinese Medicine, State Key Laboratory of Environmental and Biological Analysis, Hong Kong Chinese Medicine Phenome Research Center, Hong Kong Baptist University, Hong Kong 999077, China.
| | - Hai Fang
- Shanghai Institute of Hematology, State Key Laboratory of Medical Genomics, National Research Center for Translational Medicine at Shanghai, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, Shanghai 200025, China.
| |
Collapse
|
5
|
Li X, Sham PC, Zhang YD. A Bayesian fine-mapping model using a continuous global-local shrinkage prior with applications in prostate cancer analysis. Am J Hum Genet 2024; 111:213-226. [PMID: 38171363 PMCID: PMC10870138 DOI: 10.1016/j.ajhg.2023.12.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 12/04/2023] [Accepted: 12/05/2023] [Indexed: 01/05/2024] Open
Abstract
The aim of fine mapping is to identify genetic variants causally contributing to complex traits or diseases. Existing fine-mapping methods employ Bayesian discrete mixture priors and depend on a pre-specified maximum number of causal variants, which may lead to sub-optimal solutions. In this work, we propose a Bayesian fine-mapping method called h2-D2, utilizing a continuous global-local shrinkage prior. We also present an approach to define credible sets of causal variants in continuous prior settings. Simulation studies demonstrate that h2-D2 outperforms current state-of-the-art fine-mapping methods such as SuSiE and FINEMAP in accurately identifying causal variants and estimating their effect sizes. We further applied h2-D2 to prostate cancer analysis and discovered some previously unknown causal variants. In addition, we inferred 369 target genes associated with the detected causal variants and several pathways that were significantly over-represented by these genes, shedding light on their potential roles in prostate cancer development and progression.
Collapse
Affiliation(s)
- Xiang Li
- Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong SAR, China
| | - Pak Chung Sham
- Department of Psychiatry, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China; Centre for PanorOmic Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Yan Dora Zhang
- Department of Statistics and Actuarial Science, The University of Hong Kong, Hong Kong SAR, China.
| |
Collapse
|
6
|
Zhang X, Jiang W, Zhao H. Integration of expression QTLs with fine mapping via SuSiE. PLoS Genet 2024; 20:e1010929. [PMID: 38271473 PMCID: PMC10846745 DOI: 10.1371/journal.pgen.1010929] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Revised: 02/06/2024] [Accepted: 01/06/2024] [Indexed: 01/27/2024] Open
Abstract
Genome-wide association studies (GWASs) have achieved remarkable success in associating thousands of genetic variants with complex traits. However, the presence of linkage disequilibrium (LD) makes it challenging to identify the causal variants. To address this critical gap from association to causation, many fine-mapping methods have been proposed to assign well-calibrated probabilities of causality to candidate variants, taking into account the underlying LD pattern. In this manuscript, we introduce a statistical framework that incorporates expression quantitative trait locus (eQTL) information to fine-mapping, built on the sum of single-effects (SuSiE) regression model. Our new method, SuSiE2, connects two SuSiE models, one for eQTL analysis and one for genetic fine-mapping. This is achieved by first computing the posterior inclusion probabilities (PIPs) from an eQTL-based SuSiE model with the expression level of the candidate gene as the phenotype. These calculated PIPs are then utilized as prior inclusion probabilities for risk variants in another SuSiE model for the trait of interest. By prioritizing functional variants within the candidate region using eQTL information, SuSiE2 improves SuSiE by increasing the detection rate of causal SNPs and reducing the average size of credible sets. We compared the performance of SuSiE2 with other multi-trait fine-mapping methods with respect to power, coverage, and precision through simulations and applications to the GWAS results of Alzheimer's disease (AD) and body mass index (BMI). Our results demonstrate the better performance of SuSiE2, both when the in-sample linkage disequilibrium (LD) matrix and an external reference panel is used in inference.
Collapse
Affiliation(s)
- Xiangyu Zhang
- Department of Biostatistics, School of Public Health, Yale University, New Haven, Connecticut, United States of America
| | - Wei Jiang
- Department of Biostatistics, School of Public Health, Yale University, New Haven, Connecticut, United States of America
| | - Hongyu Zhao
- Department of Biostatistics, School of Public Health, Yale University, New Haven, Connecticut, United States of America
| |
Collapse
|
7
|
Zhou F, Soremekun O, Chikowore T, Fatumo S, Barroso I, Morris AP, Asimit JL. Leveraging information between multiple population groups and traits improves fine-mapping resolution. Nat Commun 2023; 14:7279. [PMID: 37949886 PMCID: PMC10638399 DOI: 10.1038/s41467-023-43159-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 11/01/2023] [Indexed: 11/12/2023] Open
Abstract
Statistical fine-mapping helps to pinpoint likely causal variants underlying genetic association signals. Its resolution can be improved by (i) leveraging information between traits; and (ii) exploiting differences in linkage disequilibrium structure between diverse population groups. Using association summary statistics, MGflashfm jointly fine-maps signals from multiple traits and population groups; MGfm uses an analogous framework to analyse each trait separately. We also provide a practical approach to fine-mapping with out-of-sample reference panels. In simulation studies we show that MGflashfm and MGfm are well-calibrated and that the mean proportion of causal variants with PP > 0.80 is above 0.75 (MGflashfm) and 0.70 (MGfm). In our analysis of four lipids traits across five population groups, MGflashfm gives a median 99% credible set reduction of 10.5% over MGfm. MGflashfm and MGfm only require summary level data, making them very useful fine-mapping tools in consortia efforts where individual-level data cannot be shared.
Collapse
Affiliation(s)
- Feng Zhou
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| | - Opeyemi Soremekun
- The African Computational Genomic (TACG) Research Group, MRC/UVRI and LSHTM, Entebbe, Uganda
| | - Tinashe Chikowore
- Sydney Brenner Institute for Molecular Bioscience, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
- MRC/Wits Developmental Pathways for Health Research Unit, Department of Paediatrics, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
| | - Segun Fatumo
- The African Computational Genomic (TACG) Research Group, MRC/UVRI and LSHTM, Entebbe, Uganda
- Department of Non-Communicable Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, UK
| | - Inês Barroso
- Exeter Centre of Excellence for Diabetes Research (EXCEED), University of Exeter Medical School, Exeter, UK
| | - Andrew P Morris
- Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, University of Manchester, Manchester, UK
| | | |
Collapse
|
8
|
Kamiza AB, Touré SM, Zhou F, Soremekun O, Cissé C, Wélé M, Touré AM, Nashiru O, Corpas M, Nyirenda M, Crampin A, Shaffer J, Doumbia S, Zeggini E, Morris AP, Asimit JL, Chikowore T, Fatumo S. Multi-trait discovery and fine-mapping of lipid loci in 125,000 individuals of African ancestry. Nat Commun 2023; 14:5403. [PMID: 37669986 PMCID: PMC10480211 DOI: 10.1038/s41467-023-41271-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Accepted: 08/29/2023] [Indexed: 09/07/2023] Open
Abstract
Most genome-wide association studies (GWAS) for lipid traits focus on the separate analysis of lipid traits. Moreover, there are limited GWASs evaluating the genetic variants associated with multiple lipid traits in African ancestry. To further identify and localize loci with pleiotropic effects on lipid traits, we conducted a genome-wide meta-analysis, multi-trait analysis of GWAS (MTAG), and multi-trait fine-mapping (flashfm) in 125,000 individuals of African ancestry. Our meta-analysis and MTAG identified four and 14 novel loci associated with lipid traits, respectively. flashfm yielded an 18% mean reduction in the 99% credible set size compared to single-trait fine-mapping with JAM. Moreover, we identified more genetic variants with a posterior probability of causality >0.9 with flashfm than with JAM. In conclusion, we identified additional novel loci associated with lipid traits, and flashfm reduced the 99% credible set size to identify causal genetic variants associated with multiple lipid traits in African ancestry.
Collapse
Affiliation(s)
- Abram Bunya Kamiza
- The African Computational Genomic (TACG) Research Group, MRC/UVRI and LSHTM, Entebbe, Uganda
- Malawi Epidemiology and Intervention Research Unit, Lilongwe, Malawi
- Sydney Brenner Institute for Molecular Bioscience, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - Sounkou M Touré
- The African Computational Genomic (TACG) Research Group, MRC/UVRI and LSHTM, Entebbe, Uganda
- African Center of Excellence in Bioinformatics, University of Sciences, Techniques and Technologies of Bamako, Bamako, Mali
| | - Feng Zhou
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| | - Opeyemi Soremekun
- The African Computational Genomic (TACG) Research Group, MRC/UVRI and LSHTM, Entebbe, Uganda
| | - Cheickna Cissé
- African Center of Excellence in Bioinformatics, University of Sciences, Techniques and Technologies of Bamako, Bamako, Mali
- Faculty of Sciences and Techniques, University of Sciences, Techniques and Technologies of Bamako, Bamako, Mali
| | - Mamadou Wélé
- Malawi Epidemiology and Intervention Research Unit, Lilongwe, Malawi
- Faculty of Sciences and Techniques, University of Sciences, Techniques and Technologies of Bamako, Bamako, Mali
| | - Aboubacrine M Touré
- Faculty of Sciences and Techniques, University of Sciences, Techniques and Technologies of Bamako, Bamako, Mali
| | - Oyekanmi Nashiru
- H3Africa Bioinformatics Network (H3ABioNet) Node, Center for Genomics Research and Innovation, NABDA/FMST, Abuja, Nigeria
| | - Manuel Corpas
- School of Life sciences, University of Westminster, London, UK
| | - Moffat Nyirenda
- Medical Research Council/Uganda Virus Research Institute and London School of Hygiene & Tropical Medicine Uganda Research Unit, Entebbe, Uganda
| | - Amelia Crampin
- Malawi Epidemiology and Intervention Research Unit, Lilongwe, Malawi
| | - Jeffrey Shaffer
- Department of Biostatistics and Data Science, School of Public Health and Tropical Medicine, Tulane University, New Orleans, LA, USA
| | - Seydou Doumbia
- African Center of Excellence in Bioinformatics, University of Sciences, Techniques and Technologies of Bamako, Bamako, Mali
- Faculty of Medicine and Odonto-stomatology, University of Sciences, Techniques and Technologies of Bamako, Bamako, Mali
| | - Eleftheria Zeggini
- Institute of Translational Genomics, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany
- TUM School of Medicine, Translational Genomics, Technical University of Munich and Klinikum Rechts der Isar, Munich, Germany
| | - Andrew P Morris
- Centre for Genetics and Genomics Versus Arthritis, Centre for Musculoskeletal Research, University of Manchester, Manchester, UK
| | | | - Tinashe Chikowore
- Sydney Brenner Institute for Molecular Bioscience, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
- Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA
- Harvard Medical School, Boston, MA, USA
- MRC/Wits Developmental Pathways for Health Research Unit, Department of Pediatrics, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa
| | - Segun Fatumo
- The African Computational Genomic (TACG) Research Group, MRC/UVRI and LSHTM, Entebbe, Uganda.
- H3Africa Bioinformatics Network (H3ABioNet) Node, Center for Genomics Research and Innovation, NABDA/FMST, Abuja, Nigeria.
- Institute of Translational Genomics, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany.
- Department of Non-Communicable Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, UK.
| |
Collapse
|
9
|
Bartell E, Lin K, Tsuo K, Gan W, Vedantam S, Cole JB, Baronas JM, Yengo L, Marouli E, Amariuta T, Chen Z, Li L, Renthal NE, Jacobsen CM, Salem RM, Walters RG, Hirschhorn JN. Genetics of skeletal proportions in two different populations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.22.541772. [PMID: 37292977 PMCID: PMC10245876 DOI: 10.1101/2023.05.22.541772] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Human height can be divided into sitting height and leg length, reflecting growth of different parts of the skeleton whose relative proportions are captured by the ratio of sitting to total height (as sitting height ratio, SHR). Height is a highly heritable trait, and its genetic basis has been well-studied. However, the genetic determinants of skeletal proportion are much less well-characterized. Expanding substantially on past work, we performed a genome-wide association study (GWAS) of SHR in ∼450,000 individuals with European ancestry and ∼100,000 individuals with East Asian ancestry from the UK and China Kadoorie Biobanks. We identified 565 loci independently associated with SHR, including all genomic regions implicated in prior GWAS in these ancestries. While SHR loci largely overlap height-associated loci (P < 0.001), the fine-mapped SHR signals were often distinct from height. We additionally used fine-mapped signals to identify 36 credible sets with heterogeneous effects across ancestries. Lastly, we used SHR, sitting height, and leg length to identify genetic variation acting on specific body regions rather than on overall human height.
Collapse
|
10
|
Zhou F, Butterworth AS, Asimit JL. Flashfm-ivis: interactive visualization for fine-mapping of multiple quantitative traits. Bioinformatics 2022; 38:4238-4242. [PMID: 35792838 PMCID: PMC9438951 DOI: 10.1093/bioinformatics/btac453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Revised: 05/31/2022] [Accepted: 07/05/2022] [Indexed: 12/24/2022] Open
Abstract
SUMMARY flashfm-ivis provides a suite of interactive visualization plots to view potential causal genetic variants that underlie associations that are shared or distinct between multiple quantitative traits and compares results between single- and multi-trait fine-mapping. Unique features include network diagrams that show joint effects between variants for each trait and regional association plots that integrate fine-mapping results, all with user-controlled zoom features for an interactive exploration of potential causal variants across traits. AVAILABILITY AND IMPLEMENTATION flashfm-ivis is an open-source software under the MIT license. It is available as an interactive web-based tool (http://shiny.mrc-bsu.cam.ac.uk/apps/flashfm-ivis/) and as an R package. Code and documentation are available at https://github.com/fz-cambridge/flashfm-ivis and https://zenodo.org/record/6376244#.YjnarC-l2X0. Additional features can be downloaded as standalone R libraries to encourage reuse. SUPPLEMENTARY INFORMATION Supplementary information are available at Bioinformatics online.
Collapse
Affiliation(s)
- Feng Zhou
- To whom correspondence should be addressed. or
| | - Adam S Butterworth
- British Heart Foundation, Cardiovascular Epidemiology Unit, Department of Public Health and Primary Care, University of Cambridge, Cambridge CB1 8RN, UK,National Institute for Health Research Blood and Transplant Research Unit in Donor Health and Genomics University of Cambridge, Cambridge CB1 8RN, UK,British Heart Foundation Centre of Research Excellence, University of Cambridge, Cambridge CB1 8RN, UK,Health Data Research UK Cambridge, Wellcome Genome Campus and University of Cambridge, Cambridge CB10 1SA, UK
| | | |
Collapse
|
11
|
Ballard JL, O'Connor LJ. Shared components of heritability across genetically correlated traits. Am J Hum Genet 2022; 109:989-1006. [PMID: 35477001 PMCID: PMC9247834 DOI: 10.1016/j.ajhg.2022.04.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Accepted: 04/01/2022] [Indexed: 11/01/2022] Open
Abstract
Most disease-associated genetic variants are pleiotropic, affecting multiple genetically correlated traits. Their pleiotropic associations can be mechanistically informative: if many variants have similar patterns of association, they may act via similar pleiotropic mechanisms, forming a shared component of heritability. We developed pleiotropic decomposition regression (PDR) to identify shared components and their underlying genetic variants. We validated PDR on simulated data and identified limitations of existing methods in recovering the true components. We applied PDR to three clusters of five to six traits genetically correlated with coronary artery disease (CAD), asthma, and type II diabetes (T2D), producing biologically interpretable components. For CAD, PDR identified components related to BMI, hypertension, and cholesterol, and it clarified the relationship among these highly correlated risk factors. We assigned variants to components, calculated their posterior-mean effect sizes, and performed out-of-sample validation. Our posterior-mean effect sizes pool statistical power across traits and substantially boost the correlation (r2) between true and estimated effect sizes (compared with the original summary statistics) by 94% and 70% for asthma and T2D out of sample, respectively, and by a predicted 300% for CAD.
Collapse
Affiliation(s)
- Jenna Lee Ballard
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Luke Jen O'Connor
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|