1
|
Manipur I, Reales G, Sul JH, Shin MK, Longerich S, Cortes A, Wallace C. CoPheScan: phenome-wide association studies accounting for linkage disequilibrium. Nat Commun 2024; 15:5862. [PMID: 38997278 PMCID: PMC11245513 DOI: 10.1038/s41467-024-49990-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 06/25/2024] [Indexed: 07/14/2024] Open
Abstract
Phenome-wide association studies (PheWAS) facilitate the discovery of associations between a single genetic variant with multiple phenotypes. For variants which impact a specific protein, this can help identify additional therapeutic indications or on-target side effects of intervening on that protein. However, PheWAS is restricted by an inability to distinguish confounding due to linkage disequilibrium (LD) from true pleiotropy. Here we describe CoPheScan (Coloc adapted Phenome-wide Scan), a Bayesian approach that enables an intuitive and systematic exploration of causal associations while simultaneously addressing LD confounding. We demonstrate its performance through simulation, showing considerably better control of false positive rates than a conventional approach not accounting for LD. We used CoPheScan to perform PheWAS of protein-truncating variants and fine-mapped variants from disease and pQTL studies, in 2275 disease phenotypes from the UK Biobank. Our results identify the complexity of known pleiotropic genes such as APOE, and suggest a new causal role for TGM3 in skin cancer.
Collapse
Affiliation(s)
- Ichcha Manipur
- Cambridge Institute of Therapeutic Immunology & Infectious Disease (CITIID), Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, University of Cambridge, Cambridge, CB2 0AW, UK.
- Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, CB2 2QQ, UK.
| | - Guillermo Reales
- Cambridge Institute of Therapeutic Immunology & Infectious Disease (CITIID), Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, University of Cambridge, Cambridge, CB2 0AW, UK
- Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, CB2 2QQ, UK
| | | | | | | | - Adrian Cortes
- Human Genetics and Genomics, GSK, Heidelberg, 69117, Germany
| | - Chris Wallace
- Cambridge Institute of Therapeutic Immunology & Infectious Disease (CITIID), Jeffrey Cheah Biomedical Centre, Cambridge Biomedical Campus, University of Cambridge, Cambridge, CB2 0AW, UK
- Department of Medicine, University of Cambridge School of Clinical Medicine, Cambridge Biomedical Campus, Cambridge, CB2 2QQ, UK
- MRC Biostatistics Unit, University of Cambridge, Cambridge, UK
| |
Collapse
|
2
|
Lassen FH, Venkatesh SS, Baya N, Hill B, Zhou W, Bloemendal A, Neale BM, Kessler BM, Whiffin N, Lindgren CM, Palmer DS. Exome-wide evidence of compound heterozygous effects across common phenotypes in the UK Biobank. CELL GENOMICS 2024; 4:100602. [PMID: 38944039 DOI: 10.1016/j.xgen.2024.100602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 03/11/2024] [Accepted: 06/07/2024] [Indexed: 07/01/2024]
Abstract
The phenotypic impact of compound heterozygous (CH) variation has not been investigated at the population scale. We phased rare variants (MAF ∼0.001%) in the UK Biobank (UKBB) exome-sequencing data to characterize recessive effects in 175,587 individuals across 311 common diseases. A total of 6.5% of individuals carry putatively damaging CH variants, 90% of which are only identifiable upon phasing rare variants (MAF < 0.38%). We identify six recessive gene-trait associations (p < 1.68 × 10-7) after accounting for relatedness, polygenicity, nearby common variants, and rare variant burden. Of these, just one is discovered when considering homozygosity alone. Using longitudinal health records, we additionally identify and replicate a novel association between bi-allelic variation in ATP2C2 and an earlier age at onset of chronic obstructive pulmonary disease (COPD) (p < 3.58 × 10-8). Genetic phase contributes to disease risk for gene-trait pairs: ATP2C2-COPD (p = 0.000238), FLG-asthma (p = 0.00205), and USH2A-visual impairment (p = 0.0084). We demonstrate the power of phasing large-scale genetic cohorts to discover phenome-wide consequences of compound heterozygosity.
Collapse
Affiliation(s)
- Frederik H Lassen
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK; Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK.
| | - Samvida S Venkatesh
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK; Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
| | - Nikolas Baya
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK; Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
| | - Barney Hill
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK
| | - Wei Zhou
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Analytical and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Alex Bloemendal
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Novo Nordisk Center for Genomic Mechanisms of Disease, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Benjamin M Neale
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Analytical and Translational Genetics Unit, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Benedikt M Kessler
- Target Discovery Institute, Centre for Medicines Discovery, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Nicola Whiffin
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK; Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Cecilia M Lindgren
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK; Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK; Nuffield Department of Population Health, Medical Sciences Division, University of Oxford, Oxford, UK.
| | - Duncan S Palmer
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, UK; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA; Nuffield Department of Population Health, Medical Sciences Division, University of Oxford, Oxford, UK.
| |
Collapse
|
3
|
Gudmundsson S, Singer-Berk M, Stenton SL, Goodrich JK, Wilson MW, Einson J, Watts NA, Lappalainen T, Rehm HL, MacArthur DG, O’Donnell-Luria A. Exploring penetrance of clinically relevant variants in over 800,000 humans from the Genome Aggregation Database. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.12.593113. [PMID: 38915639 PMCID: PMC11195293 DOI: 10.1101/2024.06.12.593113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/26/2024]
Abstract
Incomplete penetrance, or absence of disease phenotype in an individual with a disease-associated variant, is a major challenge in variant interpretation. Studying individuals with apparent incomplete penetrance can shed light on underlying drivers of altered phenotype penetrance. Here, we investigate clinically relevant variants from ClinVar in 807,162 individuals from the Genome Aggregation Database (gnomAD), demonstrating improved representation in gnomAD version 4. We then conduct a comprehensive case-by-case assessment of 734 predicted loss of function variants (pLoF) in 77 genes associated with severe, early-onset, highly penetrant haploinsufficient disease. We identified explanations for the presumed lack of disease manifestation in 701 of the variants (95%). Individuals with unexplained lack of disease manifestation in this set of disorders rarely occur, underscoring the need and power of deep case-by-case assessment presented here to minimize false assignments of disease risk, particularly in unaffected individuals with higher rates of secondary properties that result in rescue.
Collapse
Affiliation(s)
- Sanna Gudmundsson
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine & Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Division of Genetics and Genomics, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
- Science for Life Laboratory, Department of Gene Technology, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Moriel Singer-Berk
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Sarah L. Stenton
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine & Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Division of Genetics and Genomics, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
| | - Julia K. Goodrich
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Michael W. Wilson
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Nicholas A Watts
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Tuuli Lappalainen
- Science for Life Laboratory, Department of Gene Technology, KTH Royal Institute of Technology, Stockholm, Sweden
- New York Genome Center, New York, NY, USA
| | - Heidi L. Rehm
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine & Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Daniel G. MacArthur
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Centre for Population Genomics, Garvan Institute of Medical Research and UNSW Sydney, Sydney, New South Wales, Australia
- Centre for Population Genomics, Murdoch Children’s Research Institute, Melbourne, Australia
| | - Anne O’Donnell-Luria
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine & Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Division of Genetics and Genomics, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
4
|
Yuan H, Liu Z, Chen M, Xu Q, Jiang Y, Zhang T, Suo C, Chen X. Protein truncating variants in mitochondrial-related nuclear genes and the risk of chronic liver disease. BMC Med 2024; 22:239. [PMID: 38862964 PMCID: PMC11167739 DOI: 10.1186/s12916-024-03466-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/13/2024] [Accepted: 06/05/2024] [Indexed: 06/13/2024] Open
Abstract
BACKGROUND Mitochondrial (MT) dysfunction is a hallmark of liver diseases. However, the effects of functional variants such as protein truncating variants (PTVs) in MT-related genes on the risk of liver diseases have not been extensively explored. METHODS We extracted 60,928 PTVs across 2466 MT-related nucleus genes using whole-exome sequencing data obtained from 442,603 participants in the UK Biobank. We examined their associations with liver dysfunction that represented by the liver-related biomarkers and the risks of chronic liver diseases and liver-related mortality. RESULTS 96.10% of the total participants carried at least one PTV. We identified 866 PTVs that were positively associated with liver dysfunction at the threshold of P value < 8.21e - 07. The coding genes of these PTVs were mainly enriched in pathways related to lipid, fatty acid, amino acid, and carbohydrate metabolisms. The 866 PTVs were presented in 1.07% (4721) of participants. Compared with participants who did not carry any of the PTVs, the carriers had a 5.33-fold (95% CI 4.15-6.85), 2.82-fold (1.69-4.72), and 4.41-fold (3.04-6.41) increased risk for fibrosis and cirrhosis of liver, liver cancer, and liver disease-related mortality, respectively. These adverse effects were consistent across subgroups based on age, sex, body mass index, smoking status, and presence of hypertension, diabetes, dyslipidemia, and metabolic syndrome. CONCLUSIONS Our findings revealed a significant impact of PTVs in MT-related genes on liver disease risk, highlighting the importance of these variants in identifying populations at risk of liver diseases and facilitating early clinical interventions.
Collapse
Affiliation(s)
- Huangbo Yuan
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, and School of Life Sciences, Fudan University, No. 2005 Songhu Road, Shanghai, 200438, China
| | - Zhenqiu Liu
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, and School of Life Sciences, Fudan University, No. 2005 Songhu Road, Shanghai, 200438, China
| | - Mingyang Chen
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, and School of Life Sciences, Fudan University, No. 2005 Songhu Road, Shanghai, 200438, China
| | - Qiaoyi Xu
- Department of Epidemiology, School of Public Health, Fudan University, No. 130 Dongan Road, Shanghai, 200032, China
| | - Yanfeng Jiang
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, and School of Life Sciences, Fudan University, No. 2005 Songhu Road, Shanghai, 200438, China
- Fudan University Taizhou Institute of Health Sciences, Taizhou, China
| | - Tiejun Zhang
- Department of Epidemiology, School of Public Health, Fudan University, No. 130 Dongan Road, Shanghai, 200032, China
- Fudan University Taizhou Institute of Health Sciences, Taizhou, China
| | - Chen Suo
- Department of Epidemiology, School of Public Health, Fudan University, No. 130 Dongan Road, Shanghai, 200032, China.
- Fudan University Taizhou Institute of Health Sciences, Taizhou, China.
| | - Xingdong Chen
- State Key Laboratory of Genetic Engineering, Human Phenome Institute, Zhangjiang Fudan International Innovation Center, and School of Life Sciences, Fudan University, No. 2005 Songhu Road, Shanghai, 200438, China.
- Fudan University Taizhou Institute of Health Sciences, Taizhou, China.
- National Clinical Research Center for Aging and Medicine, Huashan Hospital, Fudan University, Shanghai, China.
- Yiwu Research Institute of Fudan University, Yiwu, China.
| |
Collapse
|
5
|
Papier K, Atkins JR, Tong TYN, Gaitskell K, Desai T, Ogamba CF, Parsaeian M, Reeves GK, Mills IG, Key TJ, Smith-Byrne K, Travis RC. Identifying proteomic risk factors for cancer using prospective and exome analyses of 1463 circulating proteins and risk of 19 cancers in the UK Biobank. Nat Commun 2024; 15:4010. [PMID: 38750076 PMCID: PMC11096312 DOI: 10.1038/s41467-024-48017-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 04/18/2024] [Indexed: 05/18/2024] Open
Abstract
The availability of protein measurements and whole exome sequence data in the UK Biobank enables investigation of potential observational and genetic protein-cancer risk associations. We investigated associations of 1463 plasma proteins with incidence of 19 cancers and 9 cancer subsites in UK Biobank participants (average 12 years follow-up). Emerging protein-cancer associations were further explored using two genetic approaches, cis-pQTL and exome-wide protein genetic scores (exGS). We identify 618 protein-cancer associations, of which 107 persist for cases diagnosed more than seven years after blood draw, 29 of 618 were associated in genetic analyses, and four had support from long time-to-diagnosis ( > 7 years) and both cis-pQTL and exGS analyses: CD74 and TNFRSF1B with NHL, ADAM8 with leukemia, and SFTPA2 with lung cancer. We present multiple blood protein-cancer risk associations, including many detectable more than seven years before cancer diagnosis and that had concordant evidence from genetic analyses, suggesting a possible role in cancer development.
Collapse
Affiliation(s)
- Keren Papier
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK.
| | - Joshua R Atkins
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Tammy Y N Tong
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Kezia Gaitskell
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Trishna Desai
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Chibuzor F Ogamba
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Mahboubeh Parsaeian
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Gillian K Reeves
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Ian G Mills
- Nuffield Department of Surgical Sciences, University of Oxford, Oxford, UK
- Patrick G Johnston Centre for Cancer Research, Queen's University Belfast, Belfast, UK
| | - Tim J Key
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Karl Smith-Byrne
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| | - Ruth C Travis
- Cancer Epidemiology Unit, Nuffield Department of Population Health, University of Oxford, Oxford, UK
| |
Collapse
|
6
|
Guo H, Urban AE, Wong WH. Prioritizing disease-related rare variants by integrating gene expression data. RESEARCH SQUARE 2024:rs.3.rs-4355589. [PMID: 38766095 PMCID: PMC11100897 DOI: 10.21203/rs.3.rs-4355589/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2024]
Abstract
Rare variants, comprising a vast majority of human genetic variations, are likely to have more deleterious impact on human diseases compared to common variants. Here we present carrier statistic, a statistical framework to prioritize disease-related rare variants by integrating gene expression data. By quantifying the impact of rare variants on gene expression, carrier statistic can prioritize those rare variants that have large functional consequence in the diseased patients. Through simulation studies and analyzing real multi-omics dataset, we demonstrated that carrier statistic is applicable in studies with limited sample size (a few hundreds) and achieves substantially higher sensitivity than existing rare variants association methods. Application to Alzheimer's disease reveals 16 rare variants within 15 genes with extreme carrier statistics. We also found strong excess of rare variants among the top prioritized genes in diseased patients compared to that in healthy individuals. The carrier statistic method can be applied to various rare variant types and is adaptable to other omics data modalities, offering a powerful tool for investigating the molecular mechanisms underlying complex diseases.
Collapse
|
7
|
Mohseni Sani N, Talaee M, Akbari A, Ashoori F, Zamani J, Kermani AA, Shahbani Zahiri H, Presley J, Vali H, Akbari Noghabi K. Unveiling the structure-emulsifying function relationship of truncated recombinant forms of the SA01-OmpA protein opens up a new vista in bioemulsifiers. Microbiol Spectr 2024; 12:e0346523. [PMID: 38206002 PMCID: PMC10846152 DOI: 10.1128/spectrum.03465-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Accepted: 12/03/2023] [Indexed: 01/12/2024] Open
Abstract
The emulsifying ability of SA01-OmpA (outer membrane protein A from Acinetobacter sp. SA01) was found to be constrained by challenges like low production efficiency and high costs associated with protein recovery from E. coli inclusion bodies, as described in our previous study. The present study sought to benefit from the advantages of the targeted truncating of SA01-OmpA protein, taking into account the reduced propensity of protein expression as inclusion bodies and cytotoxicity. Here, the structure and activity relationship of two truncated recombinant forms of SA01-OmpA protein was unraveled through a hybrid approach based on experimental data and computational methodologies, representing an innovative bioemulsifier with advantageous emulsifying activity. The recombinant truncated SA01-OmpA variants were cloned and heterologously expressed in E. coli host cells and subsequently purified. The results showed increased emulsifying activity of N-terminally truncated SA01-OmpA (NT-OmpA) compared to full-length SA01-OmpA. Molecular dynamics (MD) simulations analysis demonstrated a direct correlation between the C-terminally truncated SA01-OmpA (CT-OmpA) and its expression as inclusion bodies. Analysis of the structure-activity relationship of truncated variants of SA01-OmpA revealed that, compared to the full-length protein, deletion of the β-barrel portion from the N-terminal of SA01-OmpA increased the emulsifying activity of NT-OmpA while lowering its expression as inclusion bodies. Contrary to the full-length protein, the N-terminally truncated SA01-OmpA was not as cytotoxic, according to the MTT assay, FCM analysis, and AO/EB staining. The findings of this extensive study advance our knowledge of SA01-OmpA at the molecular level as well as the design and development of efficient bioemulsifiers.IMPORTANCEPrevious research (Shahryari et al. 2021, mSystems 6: e01175-20) introduced and characterized the SA01-OmpA protein as a multifaceted protein with a variety of functions, including maintaining cellular homeostasis under oxidative stress conditions, biofilm formation, outer membrane vesicles (OMV) biogenesis, and beneficial emulsifying capacity. By truncating the SA01-OmpA protein, the current study presents a unique method for developing protein-type bioemulsifiers. The findings indicate that the N-terminally truncated SA01-OmpA (NT-OmpA) has the potential to fully replace full-length SA01-OmpA as a novel bioemulsifier with significant emulsifying activity. This study opens up a new frontier in bioemulsifiers, shedding light on a possible relationship between the structure and activity of SA01-OmpA truncated forms.
Collapse
Affiliation(s)
- Naeema Mohseni Sani
- Department of Energy and Environmental Biotechnology, National Institute of Genetic Engineering and Biotechnology (NIGEB), Tehran, Iran
| | - Mahbubeh Talaee
- Department of Energy and Environmental Biotechnology, National Institute of Genetic Engineering and Biotechnology (NIGEB), Tehran, Iran
| | - Ali Akbari
- Department of Energy and Environmental Biotechnology, National Institute of Genetic Engineering and Biotechnology (NIGEB), Tehran, Iran
| | - Faranak Ashoori
- Department of Energy and Environmental Biotechnology, National Institute of Genetic Engineering and Biotechnology (NIGEB), Tehran, Iran
| | - Javad Zamani
- Department of Energy and Environmental Biotechnology, National Institute of Genetic Engineering and Biotechnology (NIGEB), Tehran, Iran
| | - Ali A. Kermani
- Department of Structural Biology, St. Jude Children’s Research Hospital, Memphis, Tennessee, USA
| | - Hossein Shahbani Zahiri
- Department of Energy and Environmental Biotechnology, National Institute of Genetic Engineering and Biotechnology (NIGEB), Tehran, Iran
| | - John Presley
- Department of Anatomy & Cell Biology, McGill University, Montreal, Québec, Canada
| | - Hojatollah Vali
- Department of Anatomy & Cell Biology, McGill University, Montreal, Québec, Canada
| | - Kambiz Akbari Noghabi
- Department of Energy and Environmental Biotechnology, National Institute of Genetic Engineering and Biotechnology (NIGEB), Tehran, Iran
| |
Collapse
|
8
|
Tanigawa Y, Kellis M. Power of inclusion: Enhancing polygenic prediction with admixed individuals. Am J Hum Genet 2023; 110:1888-1902. [PMID: 37890495 PMCID: PMC10645553 DOI: 10.1016/j.ajhg.2023.09.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Revised: 09/22/2023] [Accepted: 09/22/2023] [Indexed: 10/29/2023] Open
Abstract
Admixed individuals offer unique opportunities for addressing limited transferability in polygenic scores (PGSs), given the substantial trans-ancestry genetic correlation in many complex traits. However, they are rarely considered in PGS training, given the challenges in representing ancestry-matched linkage-disequilibrium reference panels for admixed individuals. Here we present inclusive PGS (iPGS), which captures ancestry-shared genetic effects by finding the exact solution for penalized regression on individual-level data and is thus naturally applicable to admixed individuals. We validate our approach in a simulation study across 33 configurations with varying heritability, polygenicity, and ancestry composition in the training set. When iPGS is applied to n = 237,055 ancestry-diverse individuals in the UK Biobank, it shows the greatest improvements in Africans by 48.9% on average across 60 quantitative traits and up to 50-fold improvements for some traits (neutrophil count, R2 = 0.058) over the baseline model trained on the same number of European individuals. When we allowed iPGS to use n = 284,661 individuals, we observed an average improvement of 60.8% for African, 11.6% for South Asian, 7.3% for non-British White, 4.8% for White British, and 17.8% for the other individuals. We further developed iPGS+refit to jointly model the ancestry-shared and -dependent genetic effects when heterogeneous genetic associations were present. For neutrophil count, for example, iPGS+refit showed the highest predictive performance in the African group (R2 = 0.115), which exceeds the best predictive performance for the White British group (R2 = 0.090 in the iPGS model), even though only 1.49% of individuals used in the iPGS training are of African ancestry. Our results indicate the power of including diverse individuals for developing more equitable PGS models.
Collapse
Affiliation(s)
- Yosuke Tanigawa
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Manolis Kellis
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
9
|
Lassen FH, Venkatesh SS, Baya N, Zhou W, Bloemendal A, Neale BM, Kessler BM, Whiffin N, Lindgren CM, Palmer DS. Exome-wide evidence of compound heterozygous effects across common phenotypes in the UK Biobank. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.06.29.23291992. [PMID: 37461573 PMCID: PMC10350147 DOI: 10.1101/2023.06.29.23291992] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/27/2023]
Abstract
Exome-sequencing association studies have successfully linked rare protein-coding variation to risk of thousands of diseases. However, the relationship between rare deleterious compound heterozygous (CH) variation and their phenotypic impact has not been fully investigated. Here, we leverage advances in statistical phasing to accurately phase rare variants (MAF ~ 0.001%) in exome sequencing data from 175,587 UK Biobank (UKBB) participants, which we then systematically annotate to identify putatively deleterious CH coding variation. We show that 6.5% of individuals carry such damaging variants in the CH state, with 90% of variants occurring at MAF < 0.34%. Using a logistic mixed model framework, systematically accounting for relatedness, polygenic risk, nearby common variants, and rare variant burden, we investigate recessive effects in common complex diseases. We find six exome-wide significant (P < 1.68 × 10 - 7 ) and 17 nominally significant (P < 5.25 × 10 - 5 ) gene-trait associations. Among these, only four would have been identified without accounting for CH variation in the gene. We further incorporate age-at-diagnosis information from primary care electronic health records, to show that genetic phase influences lifetime risk of disease across 20 gene-trait combinations (FDR < 5%). Using a permutation approach, we find evidence for genetic phase contributing to disease susceptibility for a collection of gene-trait pairs, including FLG-asthma (P = 0.00205 ) and USH2A-visual impairment (P = 0.0084 ). Taken together, we demonstrate the utility of phasing large-scale genetic sequencing cohorts for robust identification of the phenome-wide consequences of compound heterozygosity.
Collapse
Affiliation(s)
- Frederik H. Lassen
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom
| | - Samvida S. Venkatesh
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom
| | - Nikolas Baya
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom
| | - Wei Zhou
- Program in Medical and Population Genetics Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Stanley Center for Psychiatric Research Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytical and Translational Genetics Unit, Department of Medicine Massachusetts General Hospital, Boston, MA, USA
| | - Alex Bloemendal
- Program in Medical and Population Genetics Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Novo Nordisk Center for Genomic Mechanisms of Disease Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Data Sciences Platform Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Benjamin M. Neale
- Program in Medical and Population Genetics Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Stanley Center for Psychiatric Research Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytical and Translational Genetics Unit, Department of Medicine Massachusetts General Hospital, Boston, MA, USA
| | - Benedikt M. Kessler
- Target Discovery Institute, Centre for Medicines Discovery, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom
| | - Nicola Whiffin
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom
- Program in Medical and Population Genetics Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Cecilia M. Lindgren
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, United Kingdom
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom
- Nuffield Department of Population Health Health, Medical Sciences Division University of Oxford, Oxford, United Kingdom
| | - Duncan S. Palmer
- Big Data Institute, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
10
|
Alemany-Navarro M, Diz-de Almeida S, Cruz R, Riancho JA, Rojas-Martínez A, Lapunzina P, Flores C, Carracedo A. Psychiatric polygenic risk as a predictor of COVID-19 risk and severity: insight into the genetic overlap between schizophrenia and COVID-19. Transl Psychiatry 2023; 13:189. [PMID: 37280221 DOI: 10.1038/s41398-023-02482-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Revised: 04/24/2023] [Accepted: 05/23/2023] [Indexed: 06/08/2023] Open
Abstract
Despite the high contagion and mortality rates that have accompanied the coronavirus disease-19 (COVID-19) pandemic, the clinical presentation of the syndrome varies greatly from one individual to another. Potential host factors that accompany greater risk from COVID-19 have been sought and schizophrenia (SCZ) patients seem to present more severe COVID-19 than control counterparts, with certain gene expression similarities between psychiatric and COVID-19 patients reported. We used summary statistics from the last SCZ, bipolar disorder (BD), and depression (DEP) meta-analyses available on the Psychiatric Genomics Consortium webpage to calculate polygenic risk scores (PRSs) for a target sample of 11,977 COVID-19 cases and 5943 subjects with unknown COVID-19 status. Linkage disequilibrium score (LDSC) regression analysis was performed when positive associations were obtained from the PRS analysis. The SCZ PRS was a significant predictor in the case/control, symptomatic/asymptomatic, and hospitalization/no hospitalization analyses in the total and female samples; and of symptomatic/asymptomatic status in men. No significant associations were found for the BD or DEP PRS or in the LDSC regression analysis. SNP-based genetic risk for SCZ, but not for BD or DEP, may be associated with higher risk of SARS-CoV-2 infection and COVID-19 severity, especially among women; however, predictive accuracy barely exceeded chance level. We believe that the inclusion of sexual loci and rare variations in the analysis of genomic overlap between SCZ and COVID-19 will help to elucidate the genetic commonalities between these conditions.
Collapse
Affiliation(s)
- M Alemany-Navarro
- IBIS (Universidad de Sevilla, HUVR, Junta de Andalucia, CSIC), Sevilla, Spain.
- Centro Singular de Investigación en Medicina Molecular y Enfermedades Crónicas (CIMUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain.
- Fundación Pública Galega de Medicina Xenómica, Sistema Galego de Saúde (SERGAS) Santiago de Compostela, Santiago de Compostela, Spain.
- Grupo de Genética. Instituto de Investigación Sanitaria de Santiago (IDIS), Santiago de Compostela, Spain.
| | - S Diz-de Almeida
- Centro Singular de Investigación en Medicina Molecular y Enfermedades Crónicas (CIMUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Instituto de Salud Carlos III, Madrid, Spain
| | - R Cruz
- Centro Singular de Investigación en Medicina Molecular y Enfermedades Crónicas (CIMUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Instituto de Salud Carlos III, Madrid, Spain
| | - J A Riancho
- IDIVAL, Cantabria, Spain
- Universidad de Cantabria, Cantabria, Spain
- Hospital U M Valdecilla, Cantabria, Spain
| | - A Rojas-Martínez
- Tecnologico de Monterrey, Escuela de Medicina y Ciencias de la Salud, Monterrey, Mexico
| | - P Lapunzina
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Instituto de Salud Carlos III, Madrid, Spain
- Instituto de Genética Médica y Molecular (INGEMM) del Hospital Universitario La Paz, Madrid, Spain
- ERN-ITHACA-European Reference Network, Santa Cruz de Tenerife, Canarias, Spain
| | - C Flores
- Research Unit, Hospital Universitario N.S. de Candelaria, Santa Cruz de Tenerife, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Respiratorias, Instituto de Salud Carlos III, Madrid, Spain
- Genomics Division, Instituto Tecnológico y de Energías Renovables, Santa Cruz de Tenerife, Spain
- Department of Clinical Sciences, University Fernando Pessoa Canarias, Las Palmas de Gran Canaria, Spain
| | - A Carracedo
- Centro Singular de Investigación en Medicina Molecular y Enfermedades Crónicas (CIMUS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
- Fundación Pública Galega de Medicina Xenómica, Sistema Galego de Saúde (SERGAS) Santiago de Compostela, Santiago de Compostela, Spain
- Grupo de Genética. Instituto de Investigación Sanitaria de Santiago (IDIS), Santiago de Compostela, Spain
- Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER-ISCIII), Instituto de Salud Carlos III, Madrid, Spain
| |
Collapse
|
11
|
Chen Z, Lei Y, Finnell RH, Ding Y, Su Z, Wang Y, Xie H, Chen F. Whole-exome sequencing study of hypospadias. iScience 2023; 26:106663. [PMID: 37168556 PMCID: PMC10165268 DOI: 10.1016/j.isci.2023.106663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 03/01/2023] [Accepted: 04/07/2023] [Indexed: 05/13/2023] Open
Abstract
Hypospadias results from the impaired urethral development, which is influenced by androgens, but its genetic etiology is still unknown. Through whole exome sequencing analysis, we identified NR5A1, SRD5A2, and AR as mutational hotspots in the etiology of severe hypospadias, as these genes are related to androgen signaling. Additionally, rare damaging variants in cilia-related outer dynein arm heavy chain (ODNAH) genes (DNAH5, DNAH8, DNAH9, DNAH11, and DNAH17) (p = 8.5 × 10-47) were significantly enriched in hypospadias cases. The Dnah8 KO mice exhibited significantly decreased testosterone levels, which had an impact on urethral development and disrupted steroid biosynthesis. Combined with trios data, transcriptomic, and phenotypical and proteomic characterization of a mouse model, our work links ciliary genes with hypospadias. Overall, a panel of ODNAH genes with rare damaging variants was identified in 24% of hypospadias patients, providing significant insights into the underlying pathogenesis of hypospadias as well as genetic counseling.
Collapse
Affiliation(s)
- Zhongzhong Chen
- Department of Urology, Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200062, China
- Urogenital Development Research Center, Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200062, China
| | - Yunping Lei
- Center for Precision Environmental Health, Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Richard H. Finnell
- Center for Precision Environmental Health, Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX 77030, USA
- Departments of Molecular and Human Genetics and Medicine, Baylor College of Medicine, Houston, TX 77030, USA
| | - Yu Ding
- Department of Urology, Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200062, China
| | - Zhixi Su
- School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Yaping Wang
- Department of Urology, Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200062, China
| | - Hua Xie
- Department of Urology, Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200062, China
| | - Fang Chen
- Department of Urology, Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200062, China
- Clinical Research Center For Hypospadias Pediatric College, Shanghai Jiao Tong University School of Medicine, Shanghai 200062, China
| |
Collapse
|
12
|
Daghlas I, Gill D. Mendelian randomization as a tool to inform drug development using human genetics. CAMBRIDGE PRISMS. PRECISION MEDICINE 2023; 1:e16. [PMID: 38550933 PMCID: PMC10953771 DOI: 10.1017/pcm.2023.5] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Revised: 01/14/2023] [Accepted: 01/30/2023] [Indexed: 04/11/2024]
Abstract
Drug development is essential to the advancement of human health, however, the process is slow, costly, and at high risk of failure at all stages. A promising strategy for expediting and improving the probability of success in the drug development process is the use of naturally randomized human genetic variation for drug target identification and validation. These data can be harnessed using the Mendelian randomization (MR) analytic paradigm to proxy the lifelong consequences of genetic perturbations of drug targets. In this review, we discuss the myriad applications of the MR paradigm for human drug target identification and validation. We review the methodology and applications of MR, key limitations of MR, and potential future opportunities for research. Throughout the review, we refer to illustrative examples of MR analyses investigating the consequences of genetic inhibition of interleukin 6 signaling which, in some cases, have anticipated results from randomized controlled trials. As human genetic data become more widely available, we predict that MR will serve as a key pillar of support for drug development efforts.
Collapse
Affiliation(s)
- Iyas Daghlas
- Department of Neurology, University of California San Francisco, San Francisco, CA, USA
| | - Dipender Gill
- Department of Epidemiology and Biostatistics, School of Public Health, Imperial College London, London, UK
- Chief Scientific Advisor Office, Research and Early Development, Novo Nordisk, Copenhagen, Denmark
| |
Collapse
|
13
|
Zhang Y, Cai Q, Luo Y, Zhang Y, Li H. Integrated top-down and bottom-up proteomics mass spectrometry for the characterization of endogenous ribosomal protein heterogeneity. J Pharm Anal 2023; 13:63-72. [PMID: 36820077 PMCID: PMC9937802 DOI: 10.1016/j.jpha.2022.11.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 11/07/2022] [Accepted: 11/08/2022] [Indexed: 11/16/2022] Open
Abstract
Ribosomes are abundant, large RNA-protein complexes that are the sites of all protein synthesis in cells. Defects in ribosomal proteins (RPs), including proteoforms arising from genetic variations, alternative splicing of RNA transcripts, post-translational modifications and alterations of protein expression level, have been linked to a diverse range of diseases, including cancer and aging. Comprehensive characterization of ribosomal proteoforms is challenging but important for the discovery of potential disease biomarkers or protein targets. In the present work, using E. coli 70S RPs as an example, we first developed a top-down proteomics approach on a Waters Synapt G2 Si mass spectrometry (MS) system, and then applied it to the HeLa 80S ribosome. The results were complemented by a bottom-up approach. In total, 50 out of 55 RPs were identified using the top-down approach. Among these, more than 30 RPs were found to have their N-terminal methionine removed. Additional modifications such as methylation, acetylation, and hydroxylation were also observed, and the modification sites were identified by bottom-up MS. In a HeLa 80S ribosomal sample, we identified 98 ribosomal proteoforms, among which multiple truncated 80S ribosomal proteoforms were observed, the type of information which is often overlooked by bottom-up experiments. Although their relevance to diseases is not yet known, the integration of top-down and bottom-up proteomics approaches paves the way for the discovery of proteoform-specific disease biomarkers or targets.
Collapse
Affiliation(s)
- Ying Zhang
- School of Pharmaceutical Sciences, Sun Yat-sen University, Guangzhou, 510006, China
| | - Qinghua Cai
- Henan Engineering Laboratory for Mammary Bioreactor, School of Life Sciences, Henan University, Kaifeng, Henan, 475004, China
| | - Yuxiang Luo
- School of Pharmaceutical Sciences, Sun Yat-sen University, Guangzhou, 510006, China
| | - Yu Zhang
- The Shennong Laboratory, Zhengzhou, 450002, China
| | - Huilin Li
- School of Pharmaceutical Sciences, Sun Yat-sen University, Guangzhou, 510006, China
- Guangdong Key Laboratory of Chiral Molecule and Drug Discovery, School of Pharmaceutical Sciences, Sun Yat-sen University, Guangzhou, 510006, China
- Corresponding author. School of Pharmaceutical Sciences, Sun Yat-sen University, Guangzhou, 510006, China.
| |
Collapse
|
14
|
Keskin Karakoyun H, Yüksel ŞK, Amanoglu I, Naserikhojasteh L, Yeşilyurt A, Yakıcıer C, Timuçin E, Akyerli CB. Evaluation of AlphaFold structure-based protein stability prediction on missense variations in cancer. Front Genet 2023; 14:1052383. [PMID: 36896237 PMCID: PMC9988940 DOI: 10.3389/fgene.2023.1052383] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 02/08/2023] [Indexed: 02/23/2023] Open
Abstract
Identifying pathogenic missense variants in hereditary cancer is critical to the efforts of patient surveillance and risk-reduction strategies. For this purpose, many different gene panels consisting of different number and/or set of genes are available and we are particularly interested in a panel of 26 genes with a varying degree of hereditary cancer risk consisting of ABRAXAS1, ATM, BARD1, BLM, BRCA1, BRCA2, BRIP1, CDH1, CHEK2, EPCAM, MEN1, MLH1, MRE11, MSH2, MSH6, MUTYH, NBN, PALB2, PMS2, PTEN, RAD50, RAD51C, RAD51D, STK11, TP53, and XRCC2. In this study, we have compiled a collection of the missense variations reported in any of these 26 genes. More than a thousand missense variants were collected from ClinVar and the targeted screen of a breast cancer cohort of 355 patients which contributed to this set with 160 novel missense variations. We analyzed the impact of the missense variations on protein stability by five different predictors including both sequence- (SAAF2EC and MUpro) and structure-based (Maestro, mCSM, CUPSAT) predictors. For the structure-based tools, we have utilized the AlphaFold (AF2) protein structures which comprise the first structural analysis of this hereditary cancer proteins. Our results agreed with the recent benchmarks that computed the power of stability predictors in discriminating the pathogenic variants. Overall, we reported a low-to-medium-level performance for the stability predictors in discriminating pathogenic variants, except MUpro which had an AUROC of 0.534 (95% CI [0.499-0.570]). The AUROC values ranged between 0.614-0.719 for the total set and 0.596-0.682 for the set with high AF2 confidence regions. Furthermore, our findings revealed that the confidence score for a given variant in the AF2 structure could alone predict pathogenicity more robustly than any of the tested stability predictors with an AUROC of 0.852. Altogether, this study represents the first structural analysis of the 26 hereditary cancer genes underscoring 1) the thermodynamic stability predicted from AF2 structures as a moderate and 2) the confidence score of AF2 as a strong descriptor for variant pathogenicity.
Collapse
Affiliation(s)
- Hilal Keskin Karakoyun
- Department of Biochemistry and Molecular Biology, Institute of Health Sciences, Acibadem Mehmet Ali Aydinlar University, Istanbul, Türkiye
| | - Şirin K Yüksel
- Department of Biochemistry and Molecular Biology, Institute of Health Sciences, Acibadem Mehmet Ali Aydinlar University, Istanbul, Türkiye
| | - Ilayda Amanoglu
- Department of Biostatistics and Bioinformatics, Institute of Health Sciences, Acibadem Mehmet Ali Aydinlar University, Istanbul, Türkiye
| | - Lara Naserikhojasteh
- Department of Biostatistics and Bioinformatics, Institute of Health Sciences, Acibadem Mehmet Ali Aydinlar University, Istanbul, Türkiye
| | - Ahmet Yeşilyurt
- Acibadem Labgen Genetic Diagnosis Centre, Acibadem Health Group, Istanbul, Türkiye
| | - Cengiz Yakıcıer
- Acibadem Pathology Laboratories, Acibadem Health Group, Istanbul, Türkiye
| | - Emel Timuçin
- Department of Biostatistics and Medical Informatics, School of Medicine, Acibadem Mehmet Ali Aydinlar University, Istanbul, Türkiye
| | - Cemaliye B Akyerli
- Department of Medical Biology, School of Medicine, Acibadem Mehmet Ali Aydinlar University, Istanbul, Türkiye
| |
Collapse
|
15
|
Berling E, Nicolle R, Laforêt P, Ronzitti G. Gene therapy review: Duchenne muscular dystrophy case study. Rev Neurol (Paris) 2023; 179:90-105. [PMID: 36517287 DOI: 10.1016/j.neurol.2022.11.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 11/17/2022] [Indexed: 12/14/2022]
Abstract
Gene therapy, i.e., any therapeutic approach involving the use of genetic material as a drug and more largely altering the transcription or translation of one or more genes, covers a wide range of innovative methods for treating diseases, including neurological disorders. Although they share common principles, the numerous gene therapy approaches differ greatly in their mechanisms of action. They also differ in their maturity for some are already used in clinical practice while others have never been used in humans. The aim of this review is to present the whole range of gene therapy techniques through the example of Duchenne muscular dystrophy (DMD). DMD is a severe myopathy caused by mutations in the dystrophin gene leading to the lack of functional dystrophin protein. It is a disease known to all neurologists and in which almost all gene therapy methods were applied. Here we discuss the mechanisms of gene transfer techniques with or without viral vectors, DNA editing with or without matrix repair and those acting at the RNA level (RNA editing, exon skipping and STOP-codon readthrough). For each method, we present the results obtained in DMD with a particular focus on clinical data. This review aims also to outline the advantages, limitations and risks of gene therapy related to the approach used.
Collapse
Affiliation(s)
- E Berling
- Neurology department, Raymond Poincaré university hospital, AP-HP, Garches, France; Nord-Est-Île-de-France neuromuscular reference center, FHU PHENIX, Garches, France; U 1179 Inserm, université Paris-Saclay, Montigny-Le-Bretonneux, France.
| | - R Nicolle
- Université Paris Cité, Inserm UMR1163, Imagine Institute, Clinical Bioinformatics laboratory, 75015 Paris, France
| | - P Laforêt
- Neurology department, Raymond Poincaré university hospital, AP-HP, Garches, France; Nord-Est-Île-de-France neuromuscular reference center, FHU PHENIX, Garches, France; U 1179 Inserm, université Paris-Saclay, Montigny-Le-Bretonneux, France
| | - G Ronzitti
- Université Paris Cité, Inserm UMR1163, Imagine Institute, Clinical Bioinformatics laboratory, 75015 Paris, France; Genethon, Evry, France
| |
Collapse
|
16
|
Kunizheva SS, Volobaev VP, Plotnikova MY, Kupriyanova DA, Kuznetsova IL, Tyazhelova TV, Rogaev EI. Current Trends and Approaches to the Search for Genetic Determinants of Aging and Longevity. RUSS J GENET+ 2022; 58:1427-1443. [PMID: 36590179 PMCID: PMC9794410 DOI: 10.1134/s1022795422120067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Revised: 07/06/2022] [Accepted: 07/07/2022] [Indexed: 12/29/2022]
Abstract
Aging is a natural process of extinction of the body and the main aspect that determines the life expectancy for individuals who have survived to the post-reproductive period. The process of aging is accompanied by certain physiological, immune, and metabolic changes in the body, as well as the development of age-related diseases. The contribution of genetic factors to human life expectancy is estimated at about 25-30%. Despite the success in identifying genes and metabolic pathways that may be involved in the life extension process in model organisms, the key question remains to what extent these data can be extrapolated to humans, for example, because of the complexity of its biological and sociocultural systems, as well as possible species differences in life expectancy and causes of mortality. New molecular genetic methods have significantly expanded the possibilities for searching for genetic factors of human life expectancy and identifying metabolic pathways of aging, the interaction of genes and transcription factors, the regulation of gene expression at the level of transcription, and epigenetic modifications. The review presents the latest research and current strategies for studying the genetic basis of human aging and longevity: the study of individual candidate genes in genetic population studies, variations identified by the GWAS method, immunogenetic differences in aging, and genomic studies to identify factors of "healthy aging." Understanding the mechanisms of the interaction between factors affecting the life expectancy and the possibility of their regulation can become the basis for developing comprehensive measures to achieve healthy longevity. Supplementary Information The online version contains supplementary material available at 10.1134/S1022795422120067.
Collapse
Affiliation(s)
- S. S. Kunizheva
- Center for Genetics and Life Sciences, Sirius University of Science and Technology, 354340 Sochi, Russia
- Moscow State University, 119234 Moscow, Russia
- Vavilov Institute of General Genetics, Russian Academy of Sciences, 119991 Moscow, Russia
| | - V. P. Volobaev
- Center for Genetics and Life Sciences, Sirius University of Science and Technology, 354340 Sochi, Russia
| | - M. Yu. Plotnikova
- Center for Genetics and Life Sciences, Sirius University of Science and Technology, 354340 Sochi, Russia
- Moscow State University, 119234 Moscow, Russia
| | - D. A. Kupriyanova
- Center for Genetics and Life Sciences, Sirius University of Science and Technology, 354340 Sochi, Russia
| | - I. L. Kuznetsova
- Center for Genetics and Life Sciences, Sirius University of Science and Technology, 354340 Sochi, Russia
- Vavilov Institute of General Genetics, Russian Academy of Sciences, 119991 Moscow, Russia
| | - T. V. Tyazhelova
- Vavilov Institute of General Genetics, Russian Academy of Sciences, 119991 Moscow, Russia
| | - E. I. Rogaev
- Center for Genetics and Life Sciences, Sirius University of Science and Technology, 354340 Sochi, Russia
- Moscow State University, 119234 Moscow, Russia
- Vavilov Institute of General Genetics, Russian Academy of Sciences, 119991 Moscow, Russia
- University of Massachusetts Chan Medical School, 01545 Shrewsbury, MA United States
| |
Collapse
|
17
|
The contribution of common and rare genetic variants to variation in metabolic traits in 288,137 East Asians. Nat Commun 2022; 13:6642. [PMID: 36333282 PMCID: PMC9636136 DOI: 10.1038/s41467-022-34163-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Accepted: 10/17/2022] [Indexed: 11/06/2022] Open
Abstract
Metabolic traits are heritable phenotypes widely-used in assessing the risk of various diseases. We conduct a genome-wide association analysis (GWAS) of nine metabolic traits (including glycemic, lipid, liver enzyme levels) in 125,872 Korean subjects genotyped with the Korea Biobank Array. Following meta-analysis with GWAS from Biobank Japan identify 144 novel signals (MAF ≥ 1%), of which 57.0% are replicated in UK Biobank. Additionally, we discover 66 rare (MAF < 1%) variants, 94.4% of them co-incident to common loci, adding to allelic series. Although rare variants have limited contribution to overall trait variance, these lead, in carriers, substantial loss of predictive accuracy from polygenic predictions of disease risk from common variant alone. We capture groups with up to 16-fold variation in type 2 diabetes (T2D) prevalence by integration of genetic risk scores of fasting plasma glucose and T2D and the I349F rare protective variant. This study highlights the need to consider the joint contribution of both common and rare variants on inherited risk of metabolic traits and related diseases.
Collapse
|
18
|
Whole-Genome Profile of Greek Patients with Teratozοοspermia: Identification of Candidate Variants and Genes. Genes (Basel) 2022; 13:genes13091606. [PMID: 36140773 PMCID: PMC9498395 DOI: 10.3390/genes13091606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 08/23/2022] [Accepted: 08/30/2022] [Indexed: 01/09/2023] Open
Abstract
Male infertility is a global health problem that affects a large number of couples worldwide. It can be categorized into specific subtypes, including teratozoospermia. The present study aimed to identify new variants associated with teratozoospermia in the Greek population and to explore the role of genes on which these were identified. For this reason, whole-genome sequencing (WGS) was performed on normozoospermic and teratozoospermic individuals, and after selecting only variants found in teratozoospermic men, these were further prioritized using a wide range of tools, functional and predictive algorithms, etc. An average of 600,000 variants were identified, and of them, 61 were characterized as high impact and 153 as moderate impact. Many of these are mapped in genes previously associated with male infertility, yet others are related for the first time to teratozoospermia. Furthermore, pathway enrichment analysis and Gene ontology (GO) analyses revealed the important role of the extracellular matrix in teratozoospermia. Therefore, the present study confirms the contribution of genes studied in the past to male infertility and sheds light on new molecular mechanisms by providing a list of variants and candidate genes associated with teratozoospermia in the Greek population.
Collapse
|
19
|
Potapova NA. Nonsense Mutations in Eukaryotes. BIOCHEMISTRY. BIOKHIMIIA 2022; 87:400-412. [PMID: 35790376 DOI: 10.1134/s0006297922050029] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Revised: 02/14/2022] [Accepted: 03/22/2022] [Indexed: 06/15/2023]
Abstract
Nonsense mutations are a type of mutations which results in a premature termination codon occurrence. In general, these mutations have been considered to be among the most harmful ones which lead to premature protein translation termination and result in shortened nonfunctional polypeptide. However, there is evidence that not all nonsense mutations are harmful as well as some molecular mechanisms exist which allow to avoid pathogenic effects of these mutations. This review addresses relevant information on nonsense mutations in eukaryotic genomes, characteristics of these mutations, and different molecular mechanisms preventing or mitigating harmful effects thereof.
Collapse
Affiliation(s)
- Nadezhda A Potapova
- Kharkevich Institute for Information Transmission Problems (IITP), Russian Academy of Sciences, Moscow, 127051, Russia.
| |
Collapse
|
20
|
Adiliaghdam F, Amatullah H, Digumarthi S, Saunders TL, Rahman RU, Wong LP, Sadreyev R, Droit L, Paquette J, Goyette P, Rioux J, Hodin R, Mihindukulasuriya KA, Handley SA, Jeffrey KL. Human enteric viruses autonomously shape inflammatory bowel disease phenotype through divergent innate immunomodulation. Sci Immunol 2022; 7:eabn6660. [PMID: 35394816 PMCID: PMC9416881 DOI: 10.1126/sciimmunol.abn6660] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
Altered enteric microorganisms in concert with host genetics shape inflammatory bowel disease (IBD) phenotypes. However, insight is limited to bacteria and fungi. We found that eukaryotic viruses and bacteriophages (collectively, the virome), enriched from non-IBD, noninflamed human colon resections, actively elicited atypical anti-inflammatory innate immune programs. Conversely, ulcerative colitis or Crohn's disease colon resection viromes provoked inflammation, which was successfully dampened by non-IBD viromes. The IBD colon tissue virome was perturbed, including an increase in the enterovirus B species of eukaryotic picornaviruses, not previously detected in fecal virome studies. Mice humanized with non-IBD colon tissue viromes were protected from intestinal inflammation, whereas IBD virome mice exhibited exacerbated inflammation in a nucleic acid sensing-dependent fashion. Furthermore, there were detrimental consequences for IBD patient-derived intestinal epithelial cells bearing loss-of-function mutations within virus sensor MDA5 when exposed to viromes. Our results demonstrate that innate recognition of IBD or non-IBD human viromes autonomously influences intestinal homeostasis and disease phenotypes. Thus, perturbations in the intestinal virome, or an altered ability to sense the virome due to genetic variation, contribute to the induction of IBD. Harnessing the virome may offer therapeutic and biomarker potential.
Collapse
Affiliation(s)
- Fatemeh Adiliaghdam
- Department of Medicine, Division of Gastroenterology and the Center for the Study of Inflammatory Bowel Disease, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA
| | - Hajera Amatullah
- Department of Medicine, Division of Gastroenterology and the Center for the Study of Inflammatory Bowel Disease, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA
| | - Sreehaas Digumarthi
- Department of Medicine, Division of Gastroenterology and the Center for the Study of Inflammatory Bowel Disease, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA
| | - Tahnee L. Saunders
- Department of Medicine, Division of Gastroenterology and the Center for the Study of Inflammatory Bowel Disease, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA
| | - Raza-Ur Rahman
- Department of Medicine, Division of Gastroenterology and the Center for the Study of Inflammatory Bowel Disease, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA
| | - Lai Ping Wong
- Department of Molecular Biology, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA
- Department of Genetics, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA
| | - Ruslan Sadreyev
- Department of Molecular Biology, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA
- Department of Pathology, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA
| | - Lindsay Droit
- Department of Pathology and Immunology, Washington University School of Medicine, St Louis, MO, USA
| | - Jean Paquette
- Montreal Heart Institute, Montreal Quebec Canada H1T 1C8
| | | | - John Rioux
- Montreal Heart Institute, Montreal Quebec Canada H1T 1C8
- Université de Montréal, Montreal Quebec Canada H3C 3J7
| | - Richard Hodin
- Department of Surgery, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA
| | | | - Scott A. Handley
- Department of Pathology and Immunology, Washington University School of Medicine, St Louis, MO, USA
| | - Kate L. Jeffrey
- Department of Medicine, Division of Gastroenterology and the Center for the Study of Inflammatory Bowel Disease, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA
- Center for Microbiome Informatics and Therapeutics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| |
Collapse
|
21
|
Pathophysiological Heterogeneity of the BBSOA Neurodevelopmental Syndrome. Cells 2022; 11:cells11081260. [PMID: 35455940 PMCID: PMC9024734 DOI: 10.3390/cells11081260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Revised: 03/17/2022] [Accepted: 03/29/2022] [Indexed: 11/17/2022] Open
Abstract
The formation and maturation of the human brain is regulated by highly coordinated developmental events, such as neural cell proliferation, migration and differentiation. Any impairment of these interconnected multi-factorial processes can affect brain structure and function and lead to distinctive neurodevelopmental disorders. Here, we review the pathophysiology of the Bosch–Boonstra–Schaaf Optic Atrophy Syndrome (BBSOAS; OMIM 615722; ORPHA 401777), a recently described monogenic neurodevelopmental syndrome caused by the haploinsufficiency of NR2F1 gene, a key transcriptional regulator of brain development. Although intellectual disability, developmental delay and visual impairment are arguably the most common symptoms affecting BBSOAS patients, multiple additional features are often reported, including epilepsy, autistic traits and hypotonia. The presence of specific symptoms and their variable level of severity might depend on still poorly characterized genotype–phenotype correlations. We begin with an overview of the several mutations of NR2F1 identified to date, then further focuses on the main pathological features of BBSOAS patients, providing evidence—whenever possible—for the existing genotype–phenotype correlations. On the clinical side, we lay out an up-to-date list of clinical examinations and therapeutic interventions recommended for children with BBSOAS. On the experimental side, we describe state-of-the-art in vivo and in vitro studies aiming at deciphering the role of mouse Nr2f1, in physiological conditions and in pathological contexts, underlying the BBSOAS features. Furthermore, by modeling distinct NR2F1 genetic alterations in terms of dimer formation and nuclear receptor binding efficiencies, we attempt to estimate the total amounts of functional NR2F1 acting in developing brain cells in normal and pathological conditions. Finally, using the NR2F1 gene and BBSOAS as a paradigm of monogenic rare neurodevelopmental disorder, we aim to set the path for future explorations of causative links between impaired brain development and the appearance of symptoms in human neurological syndromes.
Collapse
|
22
|
Tessadori F, Duran K, Knapp K, Fellner M, Smithson S, Beleza Meireles A, Elting MW, Waisfisz Q, O’Donnell-Luria A, Nowak C, Douglas J, Ronan A, Brunet T, Kotzaeridou U, Svihovec S, Saenz MS, Thiffault I, Del Viso F, Devine P, Rego S, Tenney J, van Haeringen A, Ruivenkamp CA, Koene S, Robertson SP, Deshpande C, Pfundt R, Verbeek N, van de Kamp JM, Weiss JM, Ruiz A, Gabau E, Banne E, Pepler A, Bottani A, Laurent S, Guipponi M, Bijlsma E, Bruel AL, Sorlin A, Willis M, Powis Z, Smol T, Vincent-Delorme C, Baralle D, Colin E, Revencu N, Calpena E, Wilkie AO, Chopra M, Cormier-Daire V, Keren B, Afenjar A, Niceta M, Terracciano A, Specchio N, Tartaglia M, Rio M, Barcia G, Rondeau S, Colson C, Bakkers J, Mace PD, Bicknell LS, van Haaften G, van Haaften G. Recurrent de novo missense variants across multiple histone H4 genes underlie a neurodevelopmental syndrome. Am J Hum Genet 2022; 109:750-758. [PMID: 35202563 PMCID: PMC9069069 DOI: 10.1016/j.ajhg.2022.02.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 02/03/2022] [Indexed: 12/12/2022] Open
Abstract
Chromatin is essentially an array of nucleosomes, each of which consists of the DNA double-stranded fiber wrapped around a histone octamer. This organization supports cellular processes such as DNA replication, DNA transcription, and DNA repair in all eukaryotes. Human histone H4 is encoded by fourteen canonical histone H4 genes, all differing at the nucleotide level but encoding an invariant protein. Here, we present a cohort of 29 subjects with de novo missense variants in six H4 genes (H4C3, H4C4, H4C5, H4C6, H4C9, and H4C11) identified by whole-exome sequencing and matchmaking. All individuals present with neurodevelopmental features of intellectual disability and motor and/or gross developmental delay, while non-neurological features are more variable. Ten amino acids are affected, six recurrently, and are all located within the H4 core or C-terminal tail. These variants cluster to specific regions of the core H4 globular domain, where protein-protein interactions occur with either other histone subunits or histone chaperones. Functional consequences of the identified variants were evaluated in zebrafish embryos, which displayed abnormal general development, defective head organs, and reduced body axis length, providing compelling evidence for the causality of the reported disorder(s). While multiple developmental syndromes have been linked to chromatin-associated factors, missense-bearing histone variants (e.g., H3 oncohistones) are only recently emerging as a major cause of pathogenicity. Our findings establish a broader involvement of H4 variants in developmental syndromes.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Gijs van Haaften
- Department of Genetics, Center for Molecular Medicine, University Medical Center Utrecht, Utrecht University, 3584 Utrecht, the Netherlands.
| |
Collapse
|
23
|
Liu JZ, Chen CY, Tsai EA, Whelan CD, Sexton D, John S, Runz H. The burden of rare protein-truncating genetic variants on human lifespan. NATURE AGING 2022; 2:289-294. [PMID: 37117740 PMCID: PMC10154195 DOI: 10.1038/s43587-022-00182-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2021] [Accepted: 01/20/2022] [Indexed: 04/30/2023]
Abstract
Genetic predisposition has been shown to contribute substantially to the age at which we die. Genome-wide association studies (GWASs) have linked more than 20 loci to phenotypes related to human lifespan1. However, little is known about how lifespan is impacted by gene loss of function. Through whole-exome sequencing of 352,338 UK Biobank participants of European ancestry, we assessed the relevance of protein-truncating variant (PTV) gene burden on individual and parental survival. We identified four exome-wide significant (P < 4.2 × 10-7) human lifespan genes, BRCA1, BRCA2, ATM and TET2. Gene and gene-set, PTV-burden, phenome-wide association studies support known roles of these genes in cancer to impact lifespan at the population level. The TET2 PTV burden was associated with a lifespan through somatic mutation events presumably due to clonal hematopoiesis. The overlap between PTV burden and common variant-based lifespan GWASs was modest, underscoring the value of exome sequencing in well-powered biobank cohorts to complement GWASs for identifying genes underlying complex traits.
Collapse
Affiliation(s)
- Jimmy Z Liu
- Translational Biology, Research & Development, Biogen Inc., Cambridge, MA, USA.
| | - Chia-Yen Chen
- Translational Biology, Research & Development, Biogen Inc., Cambridge, MA, USA
| | - Ellen A Tsai
- Translational Biology, Research & Development, Biogen Inc., Cambridge, MA, USA
| | | | - David Sexton
- Translational Biology, Research & Development, Biogen Inc., Cambridge, MA, USA
| | - Sally John
- Translational Biology, Research & Development, Biogen Inc., Cambridge, MA, USA
| | - Heiko Runz
- Translational Biology, Research & Development, Biogen Inc., Cambridge, MA, USA.
| |
Collapse
|
24
|
Tanigawa Y, Qian J, Venkataraman G, Justesen JM, Li R, Tibshirani R, Hastie T, Rivas MA. Significant sparse polygenic risk scores across 813 traits in UK Biobank. PLoS Genet 2022; 18:e1010105. [PMID: 35324888 PMCID: PMC8946745 DOI: 10.1371/journal.pgen.1010105] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2021] [Accepted: 02/15/2022] [Indexed: 01/05/2023] Open
Abstract
We present a systematic assessment of polygenic risk score (PRS) prediction across more than 1,500 traits using genetic and phenotype data in the UK Biobank. We report 813 sparse PRS models with significant (p < 2.5 x 10-5) incremental predictive performance when compared against the covariate-only model that considers age, sex, types of genotyping arrays, and the principal component loadings of genotypes. We report a significant correlation between the number of genetic variants selected in the sparse PRS model and the incremental predictive performance (Spearman's ⍴ = 0.61, p = 2.2 x 10-59 for quantitative traits, ⍴ = 0.21, p = 9.6 x 10-4 for binary traits). The sparse PRS model trained on European individuals showed limited transferability when evaluated on non-European individuals in the UK Biobank. We provide the PRS model weights on the Global Biobank Engine (https://biobankengine.stanford.edu/prs).
Collapse
Affiliation(s)
- Yosuke Tanigawa
- Department of Biomedical Data Science, Stanford University, Stanford, California, United States of America
- Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Junyang Qian
- Department of Statistics, Stanford University, Stanford, California, United States of America
| | - Guhan Venkataraman
- Department of Biomedical Data Science, Stanford University, Stanford, California, United States of America
| | - Johanne Marie Justesen
- Department of Biomedical Data Science, Stanford University, Stanford, California, United States of America
| | - Ruilin Li
- Institute for Computational and Mathematical Engineering, Stanford University, Stanford, California, United States of America
| | - Robert Tibshirani
- Department of Biomedical Data Science, Stanford University, Stanford, California, United States of America
- Department of Statistics, Stanford University, Stanford, California, United States of America
| | - Trevor Hastie
- Department of Biomedical Data Science, Stanford University, Stanford, California, United States of America
- Department of Statistics, Stanford University, Stanford, California, United States of America
| | - Manuel A. Rivas
- Department of Biomedical Data Science, Stanford University, Stanford, California, United States of America
| |
Collapse
|
25
|
Pietzner M, Wheeler E, Carrasco-Zanini J, Kerrison ND, Oerton E, Koprulu M, Luan J, Hingorani AD, Williams SA, Wareham NJ, Langenberg C. Synergistic insights into human health from aptamer- and antibody-based proteomic profiling. Nat Commun 2021; 12:6822. [PMID: 34819519 PMCID: PMC8613205 DOI: 10.1038/s41467-021-27164-0] [Citation(s) in RCA: 74] [Impact Index Per Article: 24.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Accepted: 11/03/2021] [Indexed: 01/09/2023] Open
Abstract
Affinity-based proteomics has enabled scalable quantification of thousands of protein targets in blood enhancing biomarker discovery, understanding of disease mechanisms, and genetic evaluation of drug targets in humans through protein quantitative trait loci (pQTLs). Here, we integrate two partly complementary techniques-the aptamer-based SomaScan® v4 assay and the antibody-based Olink assays-to systematically assess phenotypic consequences of hundreds of pQTLs discovered for 871 protein targets across both platforms. We create a genetically anchored cross-platform proteome-phenome network comprising 547 protein-phenotype connections, 36.3% of which were only seen with one of the two platforms suggesting that both techniques capture distinct aspects of protein biology. We further highlight discordance of genetically predicted effect directions between assays, such as for PILRA and Alzheimer's disease. Our results showcase the synergistic nature of these technologies to better understand and identify disease mechanisms and provide a benchmark for future cross-platform discoveries.
Collapse
Affiliation(s)
- Maik Pietzner
- grid.5335.00000000121885934MRC Epidemiology Unit, University of Cambridge, Cambridge, UK ,grid.6363.00000 0001 2218 4662Computational Medicine, Berlin Institute of Health (BIH) at Charité – Universitätsmedizin Berlin, Berlin, Germany
| | - Eleanor Wheeler
- grid.5335.00000000121885934MRC Epidemiology Unit, University of Cambridge, Cambridge, UK
| | - Julia Carrasco-Zanini
- grid.5335.00000000121885934MRC Epidemiology Unit, University of Cambridge, Cambridge, UK
| | - Nicola D. Kerrison
- grid.5335.00000000121885934MRC Epidemiology Unit, University of Cambridge, Cambridge, UK
| | - Erin Oerton
- grid.5335.00000000121885934MRC Epidemiology Unit, University of Cambridge, Cambridge, UK
| | - Mine Koprulu
- grid.5335.00000000121885934MRC Epidemiology Unit, University of Cambridge, Cambridge, UK
| | - Jian’an Luan
- grid.5335.00000000121885934MRC Epidemiology Unit, University of Cambridge, Cambridge, UK
| | - Aroon D. Hingorani
- grid.83440.3b0000000121901201Institute of Cardiovascular Science, Faculty of Population Health, University College London, London, WC1E 6BT UK ,grid.83440.3b0000000121901201UCL BHF Research Accelerator Centre, London, UK ,grid.507332.0Health Data Research UK, London, UK
| | | | - Nicholas J. Wareham
- grid.5335.00000000121885934MRC Epidemiology Unit, University of Cambridge, Cambridge, UK ,grid.507332.0Health Data Research UK, London, UK
| | - Claudia Langenberg
- MRC Epidemiology Unit, University of Cambridge, Cambridge, UK. .,Computational Medicine, Berlin Institute of Health (BIH) at Charité - Universitätsmedizin Berlin, Berlin, Germany. .,Health Data Research UK, London, UK.
| |
Collapse
|
26
|
Ntunzwenimana JC, Boucher G, Paquette J, Gosselin H, Alikashani A, Morin N, Beauchamp C, Thauvette L, Rivard MÈ, Dupuis F, Deschênes S, Foisy S, Latour F, Lavallée G, Daly MJ, Xavier RJ, Charron G, Goyette P, Rioux JD. Functional screen of inflammatory bowel disease genes reveals key epithelial functions. Genome Med 2021; 13:181. [PMID: 34758847 PMCID: PMC8582123 DOI: 10.1186/s13073-021-00996-7] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Accepted: 10/21/2021] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND Genetic studies have been tremendously successful in identifying genomic regions associated with a wide variety of phenotypes, although the success of these studies in identifying causal genes, their variants, and their functional impacts has been more limited. METHODS We identified 145 genes from IBD-associated genomic loci having endogenous expression within the intestinal epithelial cell compartment. We evaluated the impact of lentiviral transfer of the open reading frame (ORF) of these IBD genes into the HT-29 intestinal epithelial cell line via transcriptomic analyses. By comparing the genes in which expression was modulated by each ORF, as well as the functions enriched within these gene lists, we identified ORFs with shared impacts and their putative disease-relevant biological functions. RESULTS Analysis of the transcriptomic data for cell lines expressing the ORFs for known causal genes such as HNF4a, IFIH1, and SMAD3 identified functions consistent with what is already known for these genes. These analyses also identified two major clusters of genes: Cluster 1 contained the known IBD causal genes IFIH1, SBNO2, NFKB1, and NOD2, as well as genes from other IBD loci (ZFP36L1, IRF1, GIGYF1, OTUD3, AIRE and PITX1), whereas Cluster 2 contained the known causal gene KSR1 and implicated DUSP16 from another IBD locus. Our analyses highlight how multiple IBD gene candidates can impact on epithelial structure and function, including the protection of the mucosa from intestinal microbiota, and demonstrate that DUSP16 acts a regulator of MAPK activity and contributes to mucosal defense, in part via its regulation of the polymeric immunoglobulin receptor, involved in the protection of the intestinal mucosa from enteric microbiota. CONCLUSIONS This functional screen, based on expressing IBD genes within an appropriate cellular context, in this instance intestinal epithelial cells, resulted in changes to the cell's transcriptome that are relevant to their endogenous biological function(s). This not only helped in identifying likely causal genes within genetic loci but also provided insight into their biological functions. Furthermore, this work has highlighted the central role of intestinal epithelial cells in IBD pathophysiology, providing a scientific rationale for a drug development strategy that targets epithelial functions in addition to the current therapies targeting immune functions.
Collapse
Affiliation(s)
- Jessy Carol Ntunzwenimana
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
- Université de Montréal, Montreal, Quebec, Canada
| | - Gabrielle Boucher
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
| | - Jean Paquette
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
| | - Hugues Gosselin
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
| | - Azadeh Alikashani
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
| | - Nicolas Morin
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
| | - Claudine Beauchamp
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
| | - Louise Thauvette
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
| | - Marie-Ève Rivard
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
| | - Frédérique Dupuis
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
| | - Sonia Deschênes
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
| | - Sylvain Foisy
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
| | - Frédéric Latour
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
| | - Geneviève Lavallée
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
| | - Mark J Daly
- Massachusetts General Hospital, Boston, MA, USA
- The Broad Institute, Cambridge, MA, USA
| | - Ramnik J Xavier
- Massachusetts General Hospital, Boston, MA, USA
- The Broad Institute, Cambridge, MA, USA
| | - Guy Charron
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
| | - Philippe Goyette
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada
| | - John D Rioux
- Montreal Heart Institute Research Centre, 5000 rue Bélanger, S-6201, Montreal, Quebec, Canada.
- Université de Montréal, Montreal, Quebec, Canada.
| |
Collapse
|
27
|
Ramensky VE, Ershova AI, Zaicenoka M, Kiseleva AV, Zharikova AA, Vyatkin YV, Sotnikova EA, Efimova IA, Divashuk MG, Kurilova OV, Skirko OP, Muromtseva GA, Belova OA, Rachkova SA, Pokrovskaya MS, Shalnova SA, Meshkov AN, Drapkina OM. Targeted Sequencing of 242 Clinically Important Genes in the Russian Population From the Ivanovo Region. Front Genet 2021; 12:709419. [PMID: 34691145 PMCID: PMC8529250 DOI: 10.3389/fgene.2021.709419] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2021] [Accepted: 08/31/2021] [Indexed: 12/16/2022] Open
Abstract
We performed a targeted sequencing of 242 clinically important genes mostly associated with cardiovascular diseases in a representative population sample of 1,658 individuals from the Ivanovo region northeast of Moscow. Approximately 11% of 11,876 detected variants were not found in the Single Nucleotide Polymorphism Database (dbSNP) or reported earlier in the Russian population. Most novel variants were singletons and doubletons in our sample, and virtually no novel alleles presumably specific for the Russian population were able to reach the frequencies above 0.1-0.2%. The overwhelming majority (99.3%) of variants detected in this study in three or more copies were shared with other populations. We found two dominant and seven recessive known pathogenic variants with allele frequencies significantly increased compared to those in the gnomAD non-Finnish Europeans. Of the 242 targeted genes, 28 were in the list of 59 genes for which the American College of Medical Genetics and Genomics (ACMG) recommended the reporting of incidental findings. Based on the number of variants detected in the sequenced subset of ACMG59 genes, we approximated the prevalence of known pathogenic and novel or rare protein-truncating variants in the complete set of ACMG59 genes in the Ivanovo population at 1.4 and 2.8%, respectively. We analyzed the available clinical data and observed the incomplete penetrance of known pathogenic variants in the 28 ACMG59 genes: only 1 individual out of 12 with such variants had the phenotype most likely related to the variant. When known pathogenic and novel or rare protein-truncating variants were considered together, the overall rate of confirmed phenotypes was about 19%, with maximum in the subset of novel protein-truncating variants. We report three novel protein truncating variants in APOB and one in MYH7 observed in individuals with hypobetalipoproteinemia and hypertrophic cardiomyopathy, respectively. Our results provide a valuable reference for the clinical interpretation of gene sequencing in Russian and other populations.
Collapse
Affiliation(s)
- Vasily E Ramensky
- National Medical Research Center for Therapy and Preventive Medicine, Moscow, Russia.,Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
| | - Alexandra I Ershova
- National Medical Research Center for Therapy and Preventive Medicine, Moscow, Russia
| | - Marija Zaicenoka
- Moscow Institute of Physics and Technology, Dolgoprudny, Moscow, Russia
| | - Anna V Kiseleva
- National Medical Research Center for Therapy and Preventive Medicine, Moscow, Russia
| | - Anastasia A Zharikova
- National Medical Research Center for Therapy and Preventive Medicine, Moscow, Russia.,Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia
| | - Yuri V Vyatkin
- National Medical Research Center for Therapy and Preventive Medicine, Moscow, Russia.,Novosibirsk State University, Novosibirsk, Russia
| | - Evgeniia A Sotnikova
- National Medical Research Center for Therapy and Preventive Medicine, Moscow, Russia
| | - Irina A Efimova
- National Medical Research Center for Therapy and Preventive Medicine, Moscow, Russia
| | - Mikhail G Divashuk
- National Medical Research Center for Therapy and Preventive Medicine, Moscow, Russia.,All-Russia Research Institute of Agricultural Biotechnology, Moscow, Russia
| | - Olga V Kurilova
- National Medical Research Center for Therapy and Preventive Medicine, Moscow, Russia
| | - Olga P Skirko
- National Medical Research Center for Therapy and Preventive Medicine, Moscow, Russia
| | - Galina A Muromtseva
- National Medical Research Center for Therapy and Preventive Medicine, Moscow, Russia
| | | | | | - Maria S Pokrovskaya
- National Medical Research Center for Therapy and Preventive Medicine, Moscow, Russia
| | - Svetlana A Shalnova
- National Medical Research Center for Therapy and Preventive Medicine, Moscow, Russia
| | - Alexey N Meshkov
- National Medical Research Center for Therapy and Preventive Medicine, Moscow, Russia
| | - Oxana M Drapkina
- National Medical Research Center for Therapy and Preventive Medicine, Moscow, Russia
| |
Collapse
|
28
|
Tan M, Brusgaard K, Gerdes AM, Mortensen MB, Detlefsen S, Schaffalitzky de Muckadell OB, Joergensen MT. Whole genome sequencing identifies rare germline variants enriched in cancer related genes in first degree relatives of familial pancreatic cancer patients. Clin Genet 2021; 100:551-562. [PMID: 34313325 PMCID: PMC9291090 DOI: 10.1111/cge.14038] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2021] [Revised: 07/22/2021] [Accepted: 07/23/2021] [Indexed: 12/20/2022]
Abstract
First-degree relatives (FDRs) of familial pancreatic cancer (FPC) patients have increased risk of developing pancreatic ductal adenocarcinoma (PDAC). Investigating and understanding the genetic basis for PDAC susceptibility in FPC predisposed families may contribute toward future risk-assessment and management of high-risk individuals. Using a Danish cohort of 27 FPC families, we performed whole-genome sequencing of 61 FDRs of FPC patients focusing on rare genetic variants that may contribute to familial aggregation of PDAC. Statistical analysis was performed using the gnomAD database as external controls. Through analysis of heterozygous premature truncating variants (PTV), we identified cancer-related genes and cancer-driver genes harboring multiple germline mutations. Association analysis detected 20 significant genes with false discovery rate, q < 0.05 including: PALD1, LRP1B, COL4A2, CYLC2, ZFYVE9, BRD3, AHDC1, etc. Functional annotation showed that the significant genes were enriched by gene clusters encoding for extracellular matrix and associated proteins. PTV genes were over-represented by functions related to transport of small molecules, innate immune system, ion channel transport, and stimuli-sensing channels. In conclusion, FDRs of FPC patients carry rare germline variants related to cancer pathogenesis that may contribute to increased susceptibility to PDAC. The identified variants may potentially be useful for risk prediction of high-risk individuals in predisposed families.
Collapse
Affiliation(s)
- Ming Tan
- Department of Clinical Research, University of Southern Denmark, Odense, Denmark.,Department of Medical Gastroenterology, Odense University Hospital, Odense, Denmark.,Odense Pancreas Center (OPAC), Odense University Hospital, Odense, Denmark
| | - Klaus Brusgaard
- Department of Clinical Research, University of Southern Denmark, Odense, Denmark.,Department of Clinical Genetics, Odense University Hospital, Odense, Denmark
| | - Anne-Marie Gerdes
- Department of Clinical Genetics, Rigshospitalet, Copenhagen, Denmark
| | - Michael Bau Mortensen
- Department of Clinical Research, University of Southern Denmark, Odense, Denmark.,Odense Pancreas Center (OPAC), Odense University Hospital, Odense, Denmark.,Department of Surgery, Odense University Hospital, Odense, Denmark
| | - Sönke Detlefsen
- Department of Clinical Research, University of Southern Denmark, Odense, Denmark.,Odense Pancreas Center (OPAC), Odense University Hospital, Odense, Denmark.,Department of Pathology, Odense University Hospital, Odense, Denmark
| | - Ove B Schaffalitzky de Muckadell
- Department of Clinical Research, University of Southern Denmark, Odense, Denmark.,Department of Medical Gastroenterology, Odense University Hospital, Odense, Denmark.,Odense Pancreas Center (OPAC), Odense University Hospital, Odense, Denmark
| | - Maiken Thyregod Joergensen
- Department of Clinical Research, University of Southern Denmark, Odense, Denmark.,Department of Medical Gastroenterology, Odense University Hospital, Odense, Denmark.,Odense Pancreas Center (OPAC), Odense University Hospital, Odense, Denmark
| |
Collapse
|
29
|
Whole-exome imputation within UK Biobank powers rare coding variant association and fine-mapping analyses. Nat Genet 2021; 53:1260-1269. [PMID: 34226706 PMCID: PMC8349845 DOI: 10.1038/s41588-021-00892-1] [Citation(s) in RCA: 69] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Accepted: 05/28/2021] [Indexed: 02/06/2023]
Abstract
Exome association studies to date have generally been underpowered to systematically evaluate the phenotypic impact of very rare coding variants. We leveraged extensive haplotype sharing between 49,960 exome-sequenced UK Biobank participants and the remainder of the cohort (total N~500K) to impute exome-wide variants with accuracy (R2>0.5) down to minor allele frequency (MAF) ~0.00005. Association and fine-mapping analyses of 54 quantitative traits identified 1,189 significant associations (P<5 x 10−8) involving 675 distinct rare protein-altering variants (MAF<0.01) that passed stringent filters for likely causality. Across all traits, 49% of associations (578/1,189) occurred in genes with two or more hits; follow-up analyses of these genes identified allelic series containing up to 45 distinct likely-causal variants. Our results demonstrate the utility of within-cohort imputation in population-scale GWAS cohorts, provide a catalog of likely-causal, large-effect coding variant associations, and foreshadow the insights that will be revealed as genetic biobank studies continue to grow.
Collapse
|
30
|
Xu H, Zhen Q, Bai M, Fang L, Zhang Y, Li B, Ge H, Moon S, Chen W, Fu W, Xu Q, Zhou Y, Yu Y, Lin L, Yong L, Zhang T, Chen S, Liu S, Zhang H, Chen R, Cao L, Zhang Y, Zhang R, Yang H, Hu X, Akey JM, Jin X, Sun L. Deep sequencing of 1320 genes reveals the landscape of protein-truncating variants and their contribution to psoriasis in 19,973 Chinese individuals. Genome Res 2021; 31:1150-1158. [PMID: 34155038 PMCID: PMC8256863 DOI: 10.1101/gr.267963.120] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2020] [Accepted: 05/10/2021] [Indexed: 12/30/2022]
Abstract
Protein-truncating variants (PTVs) have important impacts on phenotype diversity and disease. However, their population genetics characteristics in more globally diverse populations are not well defined. Here, we describe patterns of PTVs in 1320 genes sequenced in 10,539 healthy controls and 9434 patients with psoriasis, all of Han Chinese ancestry. We identify 8720 PTVs, of which 77% are novel, and estimate 88% of all PTVs are deleterious and subject to purifying selection. Furthermore, we show that individuals with psoriasis have a significantly higher burden of PTVs compared to controls (P = 0.02). Finally, we identified 18 PTVs in 14 genes with unusually high levels of population differentiation, consistent with the action of local adaptation. Our study provides insights into patterns and consequences of PTVs.
Collapse
Affiliation(s)
- Huixin Xu
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
- School of Medicine, South China University of Technology, Guangzhou 510006, China
| | - Qi Zhen
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Anhui, Hefei 230032, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei 230032, China
- Anhui Provincial Institute of Translational Medicine, Hefei 230032, China
| | - Mingzhou Bai
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
| | - Lin Fang
- Guangdong Engineering Research Center of Life Sciences Bigdata, Shenzhen 518083, China
- Department of Biology, University of Copenhagen, Copenhagen 2200, Denmark
| | - Yong Zhang
- Guangdong Engineering Research Center of Life Sciences Bigdata, Shenzhen 518083, China
- Department of Biology, University of Copenhagen, Copenhagen 2200, Denmark
| | - Bao Li
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Anhui, Hefei 230032, China
| | - Huiyao Ge
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Anhui, Hefei 230032, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei 230032, China
- Anhui Provincial Institute of Translational Medicine, Hefei 230032, China
| | - Sunjin Moon
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08540, USA
| | - Weiwei Chen
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Anhui, Hefei 230032, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei 230032, China
- Anhui Provincial Institute of Translational Medicine, Hefei 230032, China
| | - Wenqing Fu
- Microsoft Corporation, Redmond, Washington 98052, USA
| | - Qiongqiong Xu
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Anhui, Hefei 230032, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei 230032, China
- Anhui Provincial Institute of Translational Medicine, Hefei 230032, China
| | - Yuwen Zhou
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yafeng Yu
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Anhui, Hefei 230032, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei 230032, China
- Anhui Provincial Institute of Translational Medicine, Hefei 230032, China
| | - Long Lin
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Liang Yong
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Anhui, Hefei 230032, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei 230032, China
- Anhui Provincial Institute of Translational Medicine, Hefei 230032, China
| | - Tao Zhang
- Department of Biology, University of Copenhagen, Copenhagen 2200, Denmark
| | - Shirui Chen
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Anhui, Hefei 230032, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei 230032, China
- Anhui Provincial Institute of Translational Medicine, Hefei 230032, China
| | - Siyang Liu
- School of Public Health (Shenzhen), Sun Yat-sen University, Shenzhen 510006, Guangdong, China
| | - Hui Zhang
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Anhui, Hefei 230032, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei 230032, China
- Anhui Provincial Institute of Translational Medicine, Hefei 230032, China
| | - Ruoyan Chen
- The First Affiliated Hospital of Shenzhen University, Shenzhen Second People's Hospital, Shenzhen 518035, China
| | - Lu Cao
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Anhui, Hefei 230032, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei 230032, China
- Anhui Provincial Institute of Translational Medicine, Hefei 230032, China
| | - Yuanwei Zhang
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
| | - Ruixue Zhang
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Anhui, Hefei 230032, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei 230032, China
- Anhui Provincial Institute of Translational Medicine, Hefei 230032, China
| | - Huanjie Yang
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
| | - Xia Hu
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Anhui, Hefei 230032, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei 230032, China
- Anhui Provincial Institute of Translational Medicine, Hefei 230032, China
| | - Joshua M Akey
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08540, USA
| | - Xin Jin
- School of Medicine, South China University of Technology, Guangzhou 510006, China
| | - Liangdan Sun
- Department of Dermatology, the First Affiliated Hospital of Anhui Medical University, Hefei 230032, China
- Key Laboratory of Dermatology (Anhui Medical University), Ministry of Education, Anhui, Hefei 230032, China
- Inflammation and Immune Mediated Diseases Laboratory of Anhui Province, Hefei 230032, China
- Anhui Provincial Institute of Translational Medicine, Hefei 230032, China
| |
Collapse
|
31
|
Li R, Chang C, Tanigawa Y, Narasimhan B, Hastie T, Tibshirani R, Rivas MA. Fast Numerical Optimization for Genome Sequencing Data in Population Biobanks. Bioinformatics 2021; 37:4148-4155. [PMID: 34146108 PMCID: PMC9206591 DOI: 10.1093/bioinformatics/btab452] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2021] [Revised: 06/08/2021] [Accepted: 06/15/2021] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Large-scale and high-dimensional genome sequencing data poses computational challenges. General purpose optimization tools are usually not optimal in terms of computational and memory performance for genetic data. RESULTS We develop two efficient solvers for optimization problems arising from large-scale regularized regressions on millions of genetic variants sequenced from hundreds of thousands of individuals. These genetic variants are encoded by the values in the set {0, 1, 2, NA}. We take advantage of this fact and use two bits to represent each entry in a genetic matrix, which reduces memory requirement by a factor of 32 compared to a double precision floating point representation. Using this representation, we implemented an iteratively reweighted least square algorithm to solve Lasso regressions on genetic matrices, which we name snpnet-2.0. When the dataset contains many rare variants, the predictors can be encoded in a sparse matrix. We utilize the sparsity in the predictor matrix to further reduce memory requirement and computational speed. Our sparse genetic matrix implementation uses both the compact 2-bit representation and a simplified version of compressed sparse block format so that matrix-vector multiplications can be effectively parallelized on multiple CPU cores. To demonstrate the effectiveness of this representation, we implement an accelerated proximal gradient method to solve group Lasso on these sparse genetic matrices. This solver is named sparse-snpnet, and will also be included as part of snpnet R package. Our implementation is able to solve Lasso and group Lasso, linear, logistic and Cox regression problems on sparse genetic matrices that contain 1,000,000 variants and almost 100,000 individuals within 10 minutes and using less than 32GB of memory. AVAILABILITY https://github.com/rivas-lab/snpnet/tree/compact.
Collapse
Affiliation(s)
- Ruilin Li
- Institute for Computational and Mathematical Engineering, Stanford University, Stanford, 94305, United States
| | | | - Yosuke Tanigawa
- Department of Biomedical Data Science, Stanford University, Stanford, 94305, United States
| | - Balasubramanian Narasimhan
- Department of Biomedical Data Science, Stanford University, Stanford, 94305, United States.,Department of Statistics, Stanford University, Stanford, 94305, United States
| | - Trevor Hastie
- Department of Biomedical Data Science, Stanford University, Stanford, 94305, United States.,Department of Statistics, Stanford University, Stanford, 94305, United States
| | - Robert Tibshirani
- Department of Biomedical Data Science, Stanford University, Stanford, 94305, United States.,Department of Statistics, Stanford University, Stanford, 94305, United States
| | - Manuel A Rivas
- Department of Statistics, Stanford University, Stanford, 94305, United States
| |
Collapse
|
32
|
Gui H, Levin AM, Hu D, Sleiman P, Xiao S, Mak ACY, Yang M, Barczak AJ, Huntsman S, Eng C, Hochstadt S, Zhang E, Whitehouse K, Simons S, Cabral W, Takriti S, Abecasis G, Blackwell TW, Kang HM, Nickerson DA, Germer S, Lanfear DE, Gilliland F, Gauderman WJ, Kumar R, Erle DJ, Martinez FD, Hakonarson H, Burchard EG, Williams LK. Mapping the 17q12-21.1 Locus for Variants Associated with Early-Onset Asthma in African Americans. Am J Respir Crit Care Med 2021; 203:424-436. [PMID: 32966749 PMCID: PMC7885840 DOI: 10.1164/rccm.202006-2623oc] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Accepted: 09/21/2020] [Indexed: 01/12/2023] Open
Abstract
Rationale: The 17q12-21.1 locus is one of the most highly replicated genetic associations with asthma. Individuals of African descent have lower linkage disequilibrium in this region, which could facilitate identifying causal variants.Objectives: To identify functional variants at 17q12-21.1 associated with early-onset asthma among African American individuals.Methods: We evaluated African American participants from SAPPHIRE (Study of Asthma Phenotypes and Pharmacogenomic Interactions by Race-Ethnicity) (n = 1,940), SAGE II (Study of African Americans, Asthma, Genes and Environment) (n = 885), and GCPD-A (Study of the Genetic Causes of Complex Pediatric Disorders-Asthma) (n = 2,805). Associations with asthma onset at ages under 5 years were meta-analyzed across cohorts. The lead signal was reevaluated considering haplotypes informed by genetic ancestry (i.e., African vs. European). Both an expression-quantitative trait locus analysis and a phenome-wide association study were performed on the lead variant.Measurements and Main Results: The meta-analyzed results from SAPPHIRE, SAGE II, and the GCPD-A identified rs11078928 as the top association for early-onset asthma. A haplotype analysis suggested that the asthma association partitioned most closely with the rs11078928 genotype. Genetic ancestry did not appear to influence the effect of this variant. In the expression-quantitative trait locus analysis, rs11078928 was related to alternative splicing of GSDMB (gasdermin-B) transcripts. The phenome-wide association study of rs11078928 suggested that this variant was predominantly associated with asthma and asthma-associated symptoms.Conclusions: A splice-acceptor polymorphism appears to be a causal variant for asthma at the 17q12-21.1 locus. This variant appears to have the same magnitude of effect in individuals of African and European descent.
Collapse
Affiliation(s)
- Hongsheng Gui
- Department of Internal Medicine, Center for Individualized and Genomic Medicine Research and
| | - Albert M. Levin
- Department of Public Health Sciences, Henry Ford Health System, Detroit, Michigan
| | | | - Patrick Sleiman
- Center for Applied Genomics, Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania
- Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Shujie Xiao
- Department of Internal Medicine, Center for Individualized and Genomic Medicine Research and
| | | | - Mao Yang
- Department of Internal Medicine, Center for Individualized and Genomic Medicine Research and
| | | | | | | | - Samantha Hochstadt
- Department of Internal Medicine, Center for Individualized and Genomic Medicine Research and
| | - Ellen Zhang
- Department of Internal Medicine, Center for Individualized and Genomic Medicine Research and
| | - Kyle Whitehouse
- Department of Internal Medicine, Center for Individualized and Genomic Medicine Research and
| | - Samantha Simons
- Department of Internal Medicine, Center for Individualized and Genomic Medicine Research and
| | - Whitney Cabral
- Department of Internal Medicine, Center for Individualized and Genomic Medicine Research and
| | - Sami Takriti
- Department of Internal Medicine, Center for Individualized and Genomic Medicine Research and
| | - Gonçalo Abecasis
- Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan
| | - Thomas W. Blackwell
- Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan
| | - Hyun Min Kang
- Center for Statistical Genetics, University of Michigan, Ann Arbor, Michigan
| | - Deborah A. Nickerson
- Department of Genome Sciences, University of Washington, Seattle, Washington
- Northwest Genomics Center, Seattle, Washington
- Brotman Baty Institute, Seattle, Washington
| | | | - David E. Lanfear
- Department of Internal Medicine, Center for Individualized and Genomic Medicine Research and
| | - Frank Gilliland
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - W. James Gauderman
- Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Rajesh Kumar
- Department of Pediatrics, Feinberg School of Medicine, Northwestern University, Chicago, Illinois; and
| | - David J. Erle
- Department of Medicine
- Lung Biology Center
- CoLabs, and
| | - Fernando D. Martinez
- Arizona Respiratory Center and
- Department of Pediatrics, University of Arizona, Tucson, Arizona
| | - Hakon Hakonarson
- Center for Applied Genomics, Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania
- Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania
| | - Esteban G. Burchard
- Department of Medicine
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California
| | - L. Keoki Williams
- Department of Internal Medicine, Center for Individualized and Genomic Medicine Research and
| |
Collapse
|
33
|
Li R, Tanigawa Y, Justesen JM, Taylor J, Hastie T, Tibshirani R, Rivas MA. Survival Analysis on Rare Events Using Group-Regularized Multi-Response Cox Regression. Bioinformatics 2021; 37:4437-4443. [PMID: 33560296 PMCID: PMC8652035 DOI: 10.1093/bioinformatics/btab095] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Revised: 01/27/2021] [Accepted: 02/05/2021] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION The prediction performance of Cox proportional hazard model suffers when there are only few uncensored events in the training data. RESULTS We propose a Sparse-Group regularized Cox regression method to improve the prediction performance of large-scale and high-dimensional survival data with few observed events. Our approach is applicable when there is one or more other survival responses that 1. has a large number of observed events; 2. share a common set of associated predictors with the rare event response. This scenario is common in the UK Biobank (Sudlow et al., 2015) dataset where records for a large number of common and less prevalent diseases of the same set of individuals are available. By analyzing these responses together, we hope to achieve higher prediction performance than when they are analyzed individually. To make this approach practical for large-scale data, we developed an accelerated proximal gradient optimization algorithm as well as a screening procedure inspired by Qian et al. (2020). AVAILABILITY https://github.com/rivas-lab/multisnpnet-Cox. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ruilin Li
- Institute for Computational and Mathematical Engineering, Stanford University, Stanford, United States
| | - Yosuke Tanigawa
- Department of Biomedical Data Science, Stanford University, Stanford, United States
| | - Johanne M Justesen
- Department of Biomedical Data Science, Stanford University, Stanford, United States
| | - Jonathan Taylor
- Department of Statistics, Stanford University, Stanford, United States
| | - Trevor Hastie
- Department of Biomedical Data Science, Stanford University, Stanford, United States.,Department of Statistics, Stanford University, Stanford, United States
| | - Robert Tibshirani
- Department of Biomedical Data Science, Stanford University, Stanford, United States.,Department of Statistics, Stanford University, Stanford, United States
| | - Manuel A Rivas
- Department of Biomedical Data Science, Stanford University, Stanford, United States
| |
Collapse
|
34
|
Aguirre M, Tanigawa Y, Venkataraman GR, Tibshirani R, Hastie T, Rivas MA. Polygenic risk modeling with latent trait-related genetic components. Eur J Hum Genet 2021; 29:1071-1081. [PMID: 33558700 DOI: 10.1038/s41431-021-00813-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2020] [Revised: 12/26/2020] [Accepted: 01/14/2021] [Indexed: 02/06/2023] Open
Abstract
Polygenic risk models have led to significant advances in understanding complex diseases and their clinical presentation. While polygenic risk scores (PRS) can effectively predict outcomes, they do not generally account for disease subtypes or pathways which underlie within-trait diversity. Here, we introduce a latent factor model of genetic risk based on components from Decomposition of Genetic Associations (DeGAs), which we call the DeGAs polygenic risk score (dPRS). We compute DeGAs using genetic associations for 977 traits and find that dPRS performs comparably to standard PRS while offering greater interpretability. We show how to decompose an individual's genetic risk for a trait across DeGAs components, with examples for body mass index (BMI) and myocardial infarction (heart attack) in 337,151 white British individuals in the UK Biobank, with replication in a further set of 25,486 non-British white individuals. We find that BMI polygenic risk factorizes into components related to fat-free mass, fat mass, and overall health indicators like physical activity. Most individuals with high dPRS for BMI have strong contributions from both a fat-mass component and a fat-free mass component, whereas a few "outlier" individuals have strong contributions from only one of the two components. Overall, our method enables fine-scale interpretation of the drivers of genetic risk for complex traits.
Collapse
Affiliation(s)
- Matthew Aguirre
- Department of Biomedical Data Science, School of Medicine, Stanford University, Stanford, CA, USA.,Department of Pediatrics, School of Medicine, Stanford University, Stanford, CA, USA
| | - Yosuke Tanigawa
- Department of Biomedical Data Science, School of Medicine, Stanford University, Stanford, CA, USA
| | - Guhan Ram Venkataraman
- Department of Biomedical Data Science, School of Medicine, Stanford University, Stanford, CA, USA
| | - Rob Tibshirani
- Department of Biomedical Data Science, School of Medicine, Stanford University, Stanford, CA, USA.,Department of Statistics, Stanford University, Stanford, CA, USA
| | - Trevor Hastie
- Department of Biomedical Data Science, School of Medicine, Stanford University, Stanford, CA, USA.,Department of Statistics, Stanford University, Stanford, CA, USA
| | - Manuel A Rivas
- Department of Biomedical Data Science, School of Medicine, Stanford University, Stanford, CA, USA.
| |
Collapse
|
35
|
Abstract
Clinical laboratory tests are a critical component of the continuum of care. We evaluate the genetic basis of 35 blood and urine laboratory measurements in the UK Biobank (n = 363,228 individuals). We identify 1,857 loci associated with at least one trait, containing 3,374 fine-mapped associations and additional sets of large-effect (>0.1 s.d.) protein-altering, human leukocyte antigen (HLA) and copy number variant (CNV) associations. Through Mendelian randomization (MR) analysis, we discover 51 causal relationships, including previously known agonistic effects of urate on gout and cystatin C on stroke. Finally, we develop polygenic risk scores (PRSs) for each biomarker and build 'multi-PRS' models for diseases using 35 PRSs simultaneously, which improved chronic kidney disease, type 2 diabetes, gout and alcoholic cirrhosis genetic risk stratification in an independent dataset (FinnGen; n = 135,500) relative to single-disease PRSs. Together, our results delineate the genetic basis of biomarkers and their causal influences on diseases and improve genetic risk stratification for common diseases.
Collapse
|
36
|
Benet S, Gálvez C, Drobniewski F, Kontsevaya I, Arias L, Monguió-Tortajada M, Erkizia I, Urrea V, Ong RY, Luquin M, Dupont M, Chojnacki J, Dalmau J, Cardona P, Neyrolles O, Lugo-Villarino G, Vérollet C, Julián E, Furrer H, Günthard HF, Crocker PR, Tapia G, Borràs FE, Fellay J, McLaren PJ, Telenti A, Cardona PJ, Clotet B, Vilaplana C, Martinez-Picado J, Izquierdo-Useros N. Dissemination of Mycobacterium tuberculosis is associated to a SIGLEC1 null variant that limits antigen exchange via trafficking extracellular vesicles. J Extracell Vesicles 2021; 10:e12046. [PMID: 33489013 PMCID: PMC7807485 DOI: 10.1002/jev2.12046] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Revised: 10/28/2020] [Accepted: 12/01/2020] [Indexed: 12/14/2022] Open
Abstract
The identification of individuals with null alleles enables studying how the loss of gene function affects infection. We previously described a non-functional variant in SIGLEC1, which encodes the myeloid-cell receptor Siglec-1/CD169 implicated in HIV-1 cell-to-cell transmission. Here we report a significant association between the SIGLEC1 null variant and extrapulmonary dissemination of Mycobacterium tuberculosis (Mtb) in two clinical cohorts comprising 6,256 individuals. Local spread of bacteria within the lung is apparent in Mtb-infected Siglec-1 knockout mice which, despite having similar bacterial load, developed more extensive lesions compared to wild type mice. We find that Siglec-1 is necessary to induce antigen presentation through extracellular vesicle uptake. We postulate that lack of Siglec-1 delays the onset of protective immunity against Mtb by limiting antigen exchange via extracellular vesicles, allowing for an early local spread of mycobacteria that increases the risk for extrapulmonary dissemination.
Collapse
Affiliation(s)
- Susana Benet
- Department of Retrovirology IrsiCaixa AIDS Research Institute Badalona Spain.,Department of Retrovirology Universitat Autònoma de Barcelona Cerdanyola del Vallès Spain
| | - Cristina Gálvez
- Department of Retrovirology IrsiCaixa AIDS Research Institute Badalona Spain.,Department of Retrovirology Universitat Autònoma de Barcelona Cerdanyola del Vallès Spain
| | | | - Irina Kontsevaya
- Department of Retrovirology Imperial College London UK.,Department of Retrovirology Research Center Borstel, Borstel Germany.,Department of Retrovirology N.V. Postnikov Samara Region Clinical Tuberculosis Dispensary Samara Russia
| | - Lilibeth Arias
- Experimental Tuberculosis Unit (UTE) Germans Trias i Pujol Health Science Research Institute Can Ruti Campus Badalona Spain.,Departament de Genètica i de Microbiologia Facultat de Biociències Universitat Autònoma de Barcelona Cerdanyola del Vallès Spain.,Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBERES) Madrid Spain
| | - Marta Monguió-Tortajada
- REMAR-IVECAT Group Germans Trias i Pujol Health Science Research Institute Can Ruti Campus Badalona Spain.,ICREC Research Program Germans Trias i Pujol Health Science Research Institute Can Ruti Campus Badalona Spain.,Department of Cell Biology Physiology and Immunology Universitat Autònoma de Barcelona Cerdanyola del Vallès Spain
| | - Itziar Erkizia
- Department of Retrovirology IrsiCaixa AIDS Research Institute Badalona Spain
| | - Victor Urrea
- Department of Retrovirology IrsiCaixa AIDS Research Institute Badalona Spain
| | - Ruo-Yan Ong
- Division of Cell Signalling and Immunology University of Dundee Dundee UK
| | - Marina Luquin
- Departament de Genètica i de Microbiologia Facultat de Biociències Universitat Autònoma de Barcelona Cerdanyola del Vallès Spain
| | - Maeva Dupont
- Institut de Pharmacologie et Biologie Structurale IPBS CNRS UPS Université de Toulouse Toulouse France.,International associated laboratory (LIA) CNRS "IM-TB/HIV" (1167) France and Buenos Aires Toulouse Argentina
| | - Jakub Chojnacki
- Department of Retrovirology IrsiCaixa AIDS Research Institute Badalona Spain
| | - Judith Dalmau
- Department of Retrovirology IrsiCaixa AIDS Research Institute Badalona Spain
| | - Paula Cardona
- Experimental Tuberculosis Unit (UTE) Germans Trias i Pujol Health Science Research Institute Can Ruti Campus Badalona Spain.,Departament de Genètica i de Microbiologia Facultat de Biociències Universitat Autònoma de Barcelona Cerdanyola del Vallès Spain.,Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBERES) Madrid Spain
| | - Olivier Neyrolles
- Institut de Pharmacologie et Biologie Structurale IPBS CNRS UPS Université de Toulouse Toulouse France.,International associated laboratory (LIA) CNRS "IM-TB/HIV" (1167) France and Buenos Aires Toulouse Argentina
| | - Geanncarlo Lugo-Villarino
- Institut de Pharmacologie et Biologie Structurale IPBS CNRS UPS Université de Toulouse Toulouse France.,International associated laboratory (LIA) CNRS "IM-TB/HIV" (1167) France and Buenos Aires Toulouse Argentina
| | - Christel Vérollet
- Institut de Pharmacologie et Biologie Structurale IPBS CNRS UPS Université de Toulouse Toulouse France.,International associated laboratory (LIA) CNRS "IM-TB/HIV" (1167) France and Buenos Aires Toulouse Argentina
| | - Esther Julián
- Departament de Genètica i de Microbiologia Facultat de Biociències Universitat Autònoma de Barcelona Cerdanyola del Vallès Spain
| | - Hansjakob Furrer
- Department of Infectious Diseases Bern University Hospital University of Bern Bern Switzerland
| | - Huldrych F Günthard
- Division of Infectious Diseases and Hospital Epidemiology University Hospital Zurich Zurich Switzerland.,Institute of Medical Virology University of Zurich Zurich Switzerland
| | - Paul R Crocker
- Division of Cell Signalling and Immunology University of Dundee Dundee UK
| | - Gustavo Tapia
- Department of Retrovirology Universitat Autònoma de Barcelona Cerdanyola del Vallès Spain.,Pathology Department Hospital Universitario Germans Trias i Pujol Badalona Spain.,Germans Trias i Pujol Research Institute (IGTP) Can Ruti Campus Badalona Spain
| | - Francesc E Borràs
- REMAR-IVECAT Group Germans Trias i Pujol Health Science Research Institute Can Ruti Campus Badalona Spain.,Nephrology Department Germans Trias i Pujol University Hospital Badalona Spain
| | - Jacques Fellay
- School of Life Sciences École Polytechnique Fédérale de Lausanne Lausanne Switzerland.,Swiss Institute of Bioinformatics Lausanne Switzerland.,Precision Medicine Unit Lausanne University Hospital and University of Lausanne Lausanne Switzerland
| | - Paul J McLaren
- JC Wilt Infectious Diseases Research Centre Public Health Agency of Canada Winnipeg Manitoba Canada.,Department of Medical Microbiology and Infectious Diseases University of Manitoba Winnipeg Manitoba Canada
| | | | - Pere-Joan Cardona
- Experimental Tuberculosis Unit (UTE) Germans Trias i Pujol Health Science Research Institute Can Ruti Campus Badalona Spain.,Departament de Genètica i de Microbiologia Facultat de Biociències Universitat Autònoma de Barcelona Cerdanyola del Vallès Spain.,Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBERES) Madrid Spain
| | - Bonaventura Clotet
- Department of Retrovirology IrsiCaixa AIDS Research Institute Badalona Spain.,Germans Trias i Pujol Research Institute (IGTP) Can Ruti Campus Badalona Spain.,AIDS and Related Illnesses Centre for Health and Social Care Research (CESS) Faculty of Medicine University of Vic - Central University of Catalonia (UVic - UCC) Vic Spain
| | - Cristina Vilaplana
- Experimental Tuberculosis Unit (UTE) Germans Trias i Pujol Health Science Research Institute Can Ruti Campus Badalona Spain.,Departament de Genètica i de Microbiologia Facultat de Biociències Universitat Autònoma de Barcelona Cerdanyola del Vallès Spain.,Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBERES) Madrid Spain
| | - Javier Martinez-Picado
- Department of Retrovirology IrsiCaixa AIDS Research Institute Badalona Spain.,Germans Trias i Pujol Research Institute (IGTP) Can Ruti Campus Badalona Spain.,AIDS and Related Illnesses Centre for Health and Social Care Research (CESS) Faculty of Medicine University of Vic - Central University of Catalonia (UVic - UCC) Vic Spain.,Catalan Institution for Research and Advanced Studies (ICREA) Barcelona Spain
| | - Nuria Izquierdo-Useros
- Department of Retrovirology IrsiCaixa AIDS Research Institute Badalona Spain.,Germans Trias i Pujol Research Institute (IGTP) Can Ruti Campus Badalona Spain
| |
Collapse
|
37
|
Amar D, Sinnott-Armstrong N, Ashley EA, Rivas MA. Graphical analysis for phenome-wide causal discovery in genotyped population-scale biobanks. Nat Commun 2021; 12:350. [PMID: 33441555 PMCID: PMC7806647 DOI: 10.1038/s41467-020-20516-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2019] [Accepted: 12/02/2020] [Indexed: 11/19/2022] Open
Abstract
Causal inference via Mendelian randomization requires making strong assumptions about horizontal pleiotropy, where genetic instruments are connected to the outcome not only through the exposure. Here, we present causal Graphical Analysis Using Genetics (cGAUGE), a pipeline that overcomes these limitations using instrument filters with provable properties. This is achievable by identifying conditional independencies while examining multiple traits. cGAUGE also uses ExSep (Exposure-based Separation), a novel test for the existence of causal pathways that does not require selecting instruments. In simulated data we illustrate how cGAUGE can reduce the empirical false discovery rate by up to 30%, while retaining the majority of true discoveries. On 96 complex traits from 337,198 subjects from the UK Biobank, our results cover expected causal links and many new ones that were previously suggested by correlation-based observational studies. Notably, we identify multiple risk factors for cardiovascular disease, including red blood cell distribution width.
Collapse
Affiliation(s)
- David Amar
- Center for Inherited Cardiovascular Disease, Stanford University, Stanford, CA, USA
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
| | | | - Euan A Ashley
- Center for Inherited Cardiovascular Disease, Stanford University, Stanford, CA, USA
| | - Manuel A Rivas
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA.
| |
Collapse
|
38
|
Rare variant contribution to human disease in 281,104 UK Biobank exomes. Nature 2021; 597:527-532. [PMID: 34375979 PMCID: PMC8458098 DOI: 10.1038/s41586-021-03855-y] [Citation(s) in RCA: 178] [Impact Index Per Article: 59.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2020] [Accepted: 07/28/2021] [Indexed: 02/08/2023]
Abstract
Genome-wide association studies have uncovered thousands of common variants associated with human disease, but the contribution of rare variants to common disease remains relatively unexplored. The UK Biobank contains detailed phenotypic data linked to medical records for approximately 500,000 participants, offering an unprecedented opportunity to evaluate the effect of rare variation on a broad collection of traits1,2. Here we study the relationships between rare protein-coding variants and 17,361 binary and 1,419 quantitative phenotypes using exome sequencing data from 269,171 UK Biobank participants of European ancestry. Gene-based collapsing analyses revealed 1,703 statistically significant gene-phenotype associations for binary traits, with a median odds ratio of 12.4. Furthermore, 83% of these associations were undetectable via single-variant association tests, emphasizing the power of gene-based collapsing analysis in the setting of high allelic heterogeneity. Gene-phenotype associations were also significantly enriched for loss-of-function-mediated traits and approved drug targets. Finally, we performed ancestry-specific and pan-ancestry collapsing analyses using exome sequencing data from 11,933 UK Biobank participants of African, East Asian or South Asian ancestry. Our results highlight a significant contribution of rare variants to common disease. Summary statistics are publicly available through an interactive portal ( http://azphewas.com/ ).
Collapse
|
39
|
Momozawa Y, Mizukami K. Unique roles of rare variants in the genetics of complex diseases in humans. J Hum Genet 2021; 66:11-23. [PMID: 32948841 PMCID: PMC7728599 DOI: 10.1038/s10038-020-00845-2] [Citation(s) in RCA: 62] [Impact Index Per Article: 20.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2020] [Accepted: 09/06/2020] [Indexed: 12/19/2022]
Abstract
Genome-wide association studies have identified >10,000 genetic variants associated with various phenotypes and diseases. Although the majority are common variants, rare variants with >0.1% of minor allele frequency have been investigated by imputation and using disease-specific custom SNP arrays. Rare variants sequencing analysis mainly revealed have played unique roles in the genetics of complex diseases in humans due to their distinctive features, in contrast to common variants. Unique roles are hypothesis-free evidence for gene causality, a precise target of functional analysis for understanding disease mechanisms, a new favorable target for drug development, and a genetic marker with high disease risk for personalized medicine. As whole-genome sequencing continues to identify more rare variants, the roles associated with rare variants will also increase. However, a better estimation of the functional impact of rare variants across whole genome is needed to enhance their contribution to improvements in human health.
Collapse
Affiliation(s)
- Yukihide Momozawa
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan.
- Laboratory for Molecular Science for Drug Discovery, Graduate School of Medical Life Science, Yokohama City University, Kanagawa, Japan.
| | - Keijiro Mizukami
- Laboratory for Genotyping Development, RIKEN Center for Integrative Medical Sciences, Kanagawa, Japan
| |
Collapse
|
40
|
A phenome-wide association study of 26 mendelian genes reveals phenotypic expressivity of common and rare variants within the general population. PLoS Genet 2020; 16:e1008802. [PMID: 33226994 PMCID: PMC7735621 DOI: 10.1371/journal.pgen.1008802] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2019] [Revised: 12/14/2020] [Accepted: 04/27/2020] [Indexed: 02/06/2023] Open
Abstract
The clinical evaluation of a genetic syndrome relies upon recognition of a characteristic pattern of signs or symptoms to guide targeted genetic testing for confirmation of the diagnosis. However, individuals displaying a single phenotype of a complex syndrome may not meet criteria for clinical diagnosis or genetic testing. Here, we present a phenome-wide association study (PheWAS) approach to systematically explore the phenotypic expressivity of common and rare alleles in genes associated with four well-described syndromic diseases (Alagille (AS), Marfan (MS), DiGeorge (DS), and Noonan (NS) syndromes) in the general population. Using human phenotype ontology (HPO) terms, we systematically mapped 60 phenotypes related to AS, MS, DS and NS in 337,198 unrelated white British from the UK Biobank (UKBB) based on their hospital admission records, self-administrated questionnaires, and physiological measurements. We performed logistic regression adjusting for age, sex, and the first 5 genetic principal components, for each phenotype and each variant in the target genes (JAG1, NOTCH2 FBN1, PTPN1 and RAS-opathy genes, and genes in the 22q11.2 locus) and performed a gene burden test. Overall, we observed multiple phenotype-genotype correlations, such as the association between variation in JAG1, FBN1, PTPN11 and SOS2 with diastolic and systolic blood pressure; and pleiotropy among multiple variants in syndromic genes. For example, rs11066309 in PTPN11 was significantly associated with a lower body mass index, an increased risk of hypothyroidism and a smaller size for gestational age, all in concordance with NS-related phenotypes. Similarly, rs589668 in FBN1 was associated with an increase in body height and blood pressure, and a reduced body fat percentage as observed in Marfan syndrome. Our findings suggest that the spectrum of associations of common and rare variants in genes involved in syndromic diseases can be extended to individual phenotypes within the general population. Standard medical evaluation of genetic syndromes relies upon recognizing a characteristic pattern of signs or symptoms to guide targeted genetic testing for confirmation of the diagnosis. This may lead to missing diagnoses in patients with silent or a low expressed form of the syndrome. Here we take advantage of a rich electronic health record, various phenotypic measurements, and genetic information in 337,198 unrelated white British from the UKBB, to study the relation between single syndromic disease phenotypes and genes related to syndromic disease. We show multiple phenotype-genotype associations in concordance with phenotypes variations found in syndromic diseases. For example, we show that a commonly found variant in FBN1 was associated with high standing/sitting height ratio and reduced body fat percentage as observed in individuals with Marfan syndrome. Our findings suggest that common and rare alleles in syndromic disease genes are causative of individual component phenotypes present in a general population; further research is needed to characterize the pleiotropic effect of alleles in syndromic genes in persons without the syndromic disease.
Collapse
|
41
|
Qian J, Tanigawa Y, Du W, Aguirre M, Chang C, Tibshirani R, Rivas MA, Hastie T. A fast and scalable framework for large-scale and ultrahigh-dimensional sparse regression with application to the UK Biobank. PLoS Genet 2020; 16:e1009141. [PMID: 33095761 PMCID: PMC7641476 DOI: 10.1371/journal.pgen.1009141] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2020] [Revised: 11/04/2020] [Accepted: 09/04/2020] [Indexed: 11/18/2022] Open
Abstract
The UK Biobank is a very large, prospective population-based cohort study across the United Kingdom. It provides unprecedented opportunities for researchers to investigate the relationship between genotypic information and phenotypes of interest. Multiple regression methods, compared with genome-wide association studies (GWAS), have already been showed to greatly improve the prediction performance for a variety of phenotypes. In the high-dimensional settings, the lasso, since its first proposal in statistics, has been proved to be an effective method for simultaneous variable selection and estimation. However, the large-scale and ultrahigh dimension seen in the UK Biobank pose new challenges for applying the lasso method, as many existing algorithms and their implementations are not scalable to large applications. In this paper, we propose a computational framework called batch screening iterative lasso (BASIL) that can take advantage of any existing lasso solver and easily build a scalable solution for very large data, including those that are larger than the memory size. We introduce snpnet, an R package that implements the proposed algorithm on top of glmnet and optimizes for single nucleotide polymorphism (SNP) datasets. It currently supports ℓ1-penalized linear model, logistic regression, Cox model, and also extends to the elastic net with ℓ1/ℓ2 penalty. We demonstrate results on the UK Biobank dataset, where we achieve competitive predictive performance for all four phenotypes considered (height, body mass index, asthma, high cholesterol) using only a small fraction of the variants compared with other established polygenic risk score methods.
Collapse
Affiliation(s)
- Junyang Qian
- Department of Statistics, Stanford University, Stanford, CA, United States of America
| | - Yosuke Tanigawa
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States of America
| | - Wenfei Du
- Department of Statistics, Stanford University, Stanford, CA, United States of America
| | - Matthew Aguirre
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States of America
| | - Chris Chang
- Grail, Inc., Menlo Park, CA, United States of America
| | - Robert Tibshirani
- Department of Statistics, Stanford University, Stanford, CA, United States of America
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States of America
| | - Manuel A. Rivas
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States of America
| | - Trevor Hastie
- Department of Statistics, Stanford University, Stanford, CA, United States of America
- Department of Biomedical Data Science, Stanford University, Stanford, CA, United States of America
- * E-mail:
| |
Collapse
|
42
|
Cui H, Zuo S, Liu Z, Liu H, Wang J, You T, Zheng Z, Zhou Y, Qian X, Yao H, Xie L, Liu T, Sham PC, Yu Y, Li MJ. The support of genetic evidence for cardiovascular risk induced by antineoplastic drugs. SCIENCE ADVANCES 2020; 6:6/42/eabb8543. [PMID: 33055159 PMCID: PMC7556838 DOI: 10.1126/sciadv.abb8543] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2020] [Accepted: 08/28/2020] [Indexed: 05/04/2023]
Abstract
Cardiovascular dysfunction is one of the most common complications of long-term cancer treatment. Growing evidence has shown that antineoplastic drugs can increase cardiovascular risk during cancer therapy, seriously affecting patient survival. However, little is known about the genetic factors associated with the cardiovascular risk of antineoplastic drugs. We established a compendium of genetic evidence that supports cardiovascular risk induced by antineoplastic drugs. Most of this genetic evidence is attributed to causal alleles altering the expression of cardiovascular disease genes. We found that antineoplastic drugs predicted to induce cardiovascular risk are significantly enriched in drugs associated with cardiovascular adverse reactions, including many first-line cancer treatments. Functional experiments validated that retinoid X receptor agonists can reduce triglyceride lipolysis, thus modulating cardiovascular risk. Our results establish a link between the causal allele of cardiovascular disease genes and the direction of pharmacological modulation, which could facilitate cancer drug discovery and clinical trial design.
Collapse
Affiliation(s)
- Hui Cui
- School of Life Science and Technology, ShanghaiTech University, Shanghai, China
- Key Laboratory of Food Safety Research, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute for Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Shengkai Zuo
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Zipeng Liu
- Centre for PanorOmic Sciences-Genomics and Bioinformatics Cores, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Huanhuan Liu
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Jianhua Wang
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Tianyi You
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Zhanye Zheng
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Yao Zhou
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Xinyi Qian
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Hongcheng Yao
- Centre for PanorOmic Sciences-Genomics and Bioinformatics Cores, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Lu Xie
- Shanghai Center for Bioinformation Technology, Shanghai Academy of Science and Technology, Shanghai, China
| | - Tong Liu
- Tianjin Key Laboratory of Ionic-Molecular Function of Cardiovascular Disease, Department of Cardiology, Tianjin Institute of Cardiology, Second Hospital of Tianjin Medical University, Tianjin, China
| | - Pak Chung Sham
- Centre for PanorOmic Sciences-Genomics and Bioinformatics Cores, LKS Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Ying Yu
- School of Life Science and Technology, ShanghaiTech University, Shanghai, China.
- Key Laboratory of Food Safety Research, CAS Center for Excellence in Molecular Cell Science, Shanghai Institute for Nutrition and Health, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, China
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Mulin Jun Li
- Department of Pharmacology, Tianjin Key Laboratory of Inflammation Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China.
- Department of Epidemiology and Biostatistics, Tianjin Key Laboratory of Molecular Cancer Epidemiology, National Clinical Research Center for Cancer, Tianjin Medical University Cancer Institute and Hospital, Tianjin Medical University, Tianjin, China
| |
Collapse
|
43
|
Hujoel MLA, Parmigiani G, Braun D. Statistical approaches for meta-analysis of genetic mutation prevalence. Genet Epidemiol 2020; 45:154-170. [PMID: 33000511 PMCID: PMC10391692 DOI: 10.1002/gepi.22364] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2020] [Revised: 08/23/2020] [Accepted: 09/03/2020] [Indexed: 12/13/2022]
Abstract
Estimating the prevalence of rare germline genetic mutations in the general population is of interest as it can inform genetic counseling and risk management. Most studies that estimate the prevalence of mutations are performed in high-risk populations, and each study is designed with differing inclusion criteria, resulting in ascertained populations. Quantifying the effects of ascertainment is necessary to estimate the prevalence in the general population. This quantification is difficult as the inclusion criteria is often based on disease status and/or family history. Combining estimates from multiple studies through a meta-analysis is challenging due to the variety of study designs and ascertainment mechanisms as well as the complexity of quantifying the effect of these mechanisms. We provide guidelines on how to quantify the ascertainment mechanism for a wide range of settings and propose a general approach for conducting a meta-analysis in these complex settings by incorporating study-specific ascertainment mechanisms into a joint likelihood function. We implement the proposed likelihood-based approach using both frequentist and Bayesian methodologies. We evaluate these approaches in simulations and show that the methods are robust and produce unbiased estimates of the prevalence. An advantage of the Bayesian approach is that it can easily incorporate uncertainty in ascertainment probability values. We apply our methods to estimate the prevalence of PALB2 mutations in the United States by combining data from multiple studies and obtain a prevalence estimate of around 0.02%.
Collapse
Affiliation(s)
- Margaux L A Hujoel
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA.,Division of Biostatistics, Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| | - Giovanni Parmigiani
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA.,Division of Biostatistics, Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| | - Danielle Braun
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, Massachusetts, USA.,Division of Biostatistics, Dana-Farber Cancer Institute, Boston, Massachusetts, USA
| |
Collapse
|
44
|
Hujoel MLA, Parmigiani G, Braun D. Statistical approaches for meta‐analysis of genetic mutation prevalence. Genet Epidemiol 2020. [DOI: 10.7560/746435-017] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Affiliation(s)
- Margaux L. A. Hujoel
- Department of Biostatistics Harvard T.H. Chan School of Public Health Boston Massachusetts USA
- Division of Biostatistics Dana‐Farber Cancer Institute Boston Massachusetts USA
| | - Giovanni Parmigiani
- Department of Biostatistics Harvard T.H. Chan School of Public Health Boston Massachusetts USA
- Division of Biostatistics Dana‐Farber Cancer Institute Boston Massachusetts USA
| | - Danielle Braun
- Department of Biostatistics Harvard T.H. Chan School of Public Health Boston Massachusetts USA
- Division of Biostatistics Dana‐Farber Cancer Institute Boston Massachusetts USA
| |
Collapse
|
45
|
Flynn E, Tanigawa Y, Rodriguez F, Altman RB, Sinnott-Armstrong N, Rivas MA. Sex-specific genetic effects across biomarkers. Eur J Hum Genet 2020; 29:154-163. [PMID: 32873964 DOI: 10.1038/s41431-020-00712-w] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Revised: 07/28/2020] [Accepted: 08/04/2020] [Indexed: 11/09/2022] Open
Abstract
Sex differences have been shown in laboratory biomarkers; however, the extent to which this is due to genetics is unknown. In this study, we infer sex-specific genetic parameters (heritability and genetic correlation) across 33 quantitative biomarker traits in 181,064 females and 156,135 males from the UK Biobank study. We apply a Bayesian Mixture Model, Sex Effects Mixture Model (SEMM), to Genome-wide Association Study summary statistics in order to (1) estimate the contributions of sex to the genetic variance of these biomarkers and (2) identify variants whose statistical association with these traits is sex-specific. We find that the genetics of most biomarker traits are shared between males and females, with the notable exception of testosterone, where we identify 119 female and 445 male-specific variants. These include protein-altering variants in steroid hormone production genes (POR, UGT2B7). Using the sex-specific variants as genetic instruments for Mendelian randomization, we find evidence for causal links between testosterone levels and height, body mass index, waist and hip circumference, and type 2 diabetes. We also show that sex-specific polygenic risk score models for testosterone outperform a combined model. Overall, these results demonstrate that while sex has a limited role in the genetics of most biomarker traits, sex plays an important role in testosterone genetics.
Collapse
Affiliation(s)
- Emily Flynn
- Biomedical Informatics Training Program, Stanford University, Stanford, CA, USA.
| | - Yosuke Tanigawa
- Biomedical Informatics Training Program, Stanford University, Stanford, CA, USA
| | - Fatima Rodriguez
- Department of Medicine, Division of Cardiovascular Medicine and the Cardiovascular Institute, Stanford University, Stanford, CA, USA
| | - Russ B Altman
- Biomedical Informatics Training Program, Stanford University, Stanford, CA, USA.,Department of Medicine, Stanford University, Stanford, CA, USA.,Department of Bioengineering, Stanford University, Stanford, CA, USA.,Department of Genetics, Stanford University, Stanford, CA, USA
| | | | - Manuel A Rivas
- Biomedical Informatics Training Program, Stanford University, Stanford, CA, USA.
| |
Collapse
|
46
|
Ratnakumar A, Weinhold N, Mar JC, Riaz N. Protein-Protein interactions uncover candidate 'core genes' within omnigenic disease networks. PLoS Genet 2020; 16:e1008903. [PMID: 32678846 PMCID: PMC7390454 DOI: 10.1371/journal.pgen.1008903] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Revised: 07/29/2020] [Accepted: 06/01/2020] [Indexed: 01/09/2023] Open
Abstract
Genome wide association studies (GWAS) of human diseases have generally identified many loci associated with risk with relatively small effect sizes. The omnigenic model attempts to explain this observation by suggesting that diseases can be thought of as networks, where genes with direct involvement in disease-relevant biological pathways are named ‘core genes’, while peripheral genes influence disease risk via their interactions or regulatory effects on core genes. Here, we demonstrate a method for identifying candidate core genes solely from genes in or near disease-associated SNPs (GWAS hits) in conjunction with protein-protein interaction network data. Applied to 1,381 GWAS studies from 5 ancestries, we identify a total of 1,865 candidate core genes in 343 GWAS studies. Our analysis identifies several well-known disease-related genes that are not identified by GWAS, including BRCA1 in Breast Cancer, Amyloid Precursor Protein (APP) in Alzheimer’s Disease, INS in A1C measurement and Type 2 Diabetes, and PCSK9 in LDL cholesterol, amongst others. Notably candidate core genes are preferentially enriched for disease relevance over GWAS hits and are enriched for both Clinvar pathogenic variants and known drug targets—consistent with the predictions of the omnigenic model. We subsequently use parent term annotations provided by the GWAS catalog, to merge related GWAS studies and identify candidate core genes in over-arching disease processes such as cancer–where we identify 109 candidate core genes. A recent theory suggests that only a small number of genes underpin the biology of a disease, these genes are called ‘core genes’, and for most diseases, these core genes remain unknown. The suggested methods for finding them requires complex and expensive experiments. We reasoned that if we merge currently available datasets in smart ways, we may be able to uncover these ‘core genes’. Our method finds “hub” proteins by merging lists of genes previously linked with disease to information on how proteins interact with each other. We found that many of these hub proteins have central roles in disease, such as insulin for both A1C measurement and Type 2 Diabetes, BRCA1 in Breast cancer, and Amyloid Precursor Protein in Alzheimer’s Disease. We think these ‘hub’ proteins are candidate ‘core genes’, and offer our method as a way to find ‘core genes’ by utilizing publicly available reference datasets.
Collapse
Affiliation(s)
- Abhirami Ratnakumar
- Department of Radiation Oncology, Memorial Sloan Kettering Cancer Center, New York, New York, United States of America
- * E-mail:
| | - Nils Weinhold
- Department of Radiation Oncology, Memorial Sloan Kettering Cancer Center, New York, New York, United States of America
| | - Jessica C. Mar
- Australian Institute for Bioengineering and Nanotechnology, University of Queensland, Brisbane, Australia
| | - Nadeem Riaz
- Department of Radiation Oncology, Memorial Sloan Kettering Cancer Center, New York, New York, United States of America
| |
Collapse
|
47
|
Abstract
The prevalence and clinical characteristics of depressive disorders differ between women and men; however, the genetic contribution to sex differences in depressive disorders has not been elucidated. To evaluate sex-specific differences in the genetic architecture of depression, whole exome sequencing of samples from 1000 patients (70.7% female) with depressive disorder was conducted. Control data from healthy individuals with no psychiatric disorder (n = 72, 26.4% female) and East-Asian subpopulation 1000 Genome Project data (n = 207, 50.7% female) were included. The genetic variation between men and women was directly compared using both qualitative and quantitative research designs. Qualitative analysis identified five genetic markers potentially associated with increased risk of depressive disorder in females, including three variants (rs201432982 within PDE4A, and rs62640397 and rs79442975 within FDX1L) mapping to chromosome 19p13.2 and two novel variants (rs820182 and rs820148) within MYO15B at the chromosome 17p25.1 locus. Depressed patients homozygous for these variants showed more severe depressive symptoms and higher suicidality than those who were not homozygotes (i.e., heterozygotes and homozygotes for the non-associated allele). Quantitative analysis demonstrated that the genetic burden of protein-truncating and deleterious variants was higher in males than females, even after permutation testing. Our study provides novel genetic evidence that the higher prevalence of depressive disorders in women may be attributable to inherited variants.
Collapse
|
48
|
DeBoever C, Tanigawa Y, Aguirre M, McInnes G, Lavertu A, Rivas MA. Assessing Digital Phenotyping to Enhance Genetic Studies of Human Diseases. Am J Hum Genet 2020; 106:611-622. [PMID: 32275883 PMCID: PMC7212271 DOI: 10.1016/j.ajhg.2020.03.007] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2019] [Accepted: 03/11/2020] [Indexed: 12/17/2022] Open
Abstract
Population-scale biobanks that combine genetic data and high-dimensional phenotyping for a large number of participants provide an exciting opportunity to perform genome-wide association studies (GWAS) to identify genetic variants associated with diverse quantitative traits and diseases. A major challenge for GWAS in population biobanks is ascertaining disease cases from heterogeneous data sources such as hospital records, digital questionnaire responses, or interviews. In this study, we use genetic parameters, including genetic correlation, to evaluate whether GWAS performed using cases in the UK Biobank ascertained from hospital records, questionnaire responses, and family history of disease implicate similar disease genetics across a range of effect sizes. We find that hospital record and questionnaire GWAS largely identify similar genetic effects for many complex phenotypes and that combining together both phenotyping methods improves power to detect genetic associations. We also show that family history GWAS using cases ascertained on family history of disease agrees with combined hospital record and questionnaire GWAS and that family history GWAS has better power to detect genetic associations for some phenotypes. Overall, this work demonstrates that digital phenotyping and unstructured phenotype data can be combined with structured data such as hospital records to identify cases for GWAS in biobanks and improve the ability of such studies to identify genetic associations.
Collapse
Affiliation(s)
| | - Yosuke Tanigawa
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
| | - Matthew Aguirre
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
| | - Greg McInnes
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
| | - Adam Lavertu
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA
| | - Manuel A Rivas
- Department of Biomedical Data Science, Stanford University, Stanford, CA, USA.
| |
Collapse
|
49
|
Minikel EV, Karczewski KJ, Martin HC, Cummings BB, Whiffin N, Rhodes D, Alföldi J, Trembath RC, van Heel DA, Daly MJ, Schreiber SL, MacArthur DG. Evaluating drug targets through human loss-of-function genetic variation. Nature 2020; 581:459-464. [PMID: 32461653 PMCID: PMC7272226 DOI: 10.1038/s41586-020-2267-z] [Citation(s) in RCA: 96] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2019] [Accepted: 02/10/2020] [Indexed: 12/15/2022]
Abstract
Naturally occurring human genetic variants that are predicted to inactivate protein-coding genes provide an in vivo model of human gene inactivation that complements knockout studies in cells and model organisms. Here we report three key findings regarding the assessment of candidate drug targets using human loss-of-function variants. First, even essential genes, in which loss-of-function variants are not tolerated, can be highly successful as targets of inhibitory drugs. Second, in most genes, loss-of-function variants are sufficiently rare that genotype-based ascertainment of homozygous or compound heterozygous 'knockout' humans will await sample sizes that are approximately 1,000 times those presently available, unless recruitment focuses on consanguineous individuals. Third, automated variant annotation and filtering are powerful, but manual curation remains crucial for removing artefacts, and is a prerequisite for recall-by-genotype efforts. Our results provide a roadmap for human knockout studies and should guide the interpretation of loss-of-function variants in drug development.
Collapse
Affiliation(s)
- Eric Vallabh Minikel
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Chemical Biology and Therapeutics Science Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA.
- Henry and Allison McCance Center for Brain Health, Massachusetts General Hospital, Boston, MA, USA.
- Department of Neurology, Massachusetts General Hospital, Boston, MA, USA.
- Prion Alliance, Cambridge, MA, USA.
| | - Konrad J Karczewski
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | | | - Beryl B Cummings
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA
| | - Nicola Whiffin
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- National Heart and Lung Institute and MRC London Institute of Medical Sciences, Imperial College London, London, UK
| | - Daniel Rhodes
- Centre for Translational Bioinformatics, William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London and Barts Health NHS Trust, London, UK
| | - Jessica Alföldi
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Richard C Trembath
- School of Basic and Medical Biosciences, Faculty of Life Sciences and Medicine, King's College London, London, UK
| | - David A van Heel
- Blizard Institute, Barts and The London School of Medicine and Dentistry, Queen Mary University of London, London, UK
| | - Mark J Daly
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Stuart L Schreiber
- Chemical Biology and Therapeutics Science Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Chemistry & Chemical Biology, Harvard University, Cambridge, MA, USA
| | - Daniel G MacArthur
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Analytical and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.
- Centre for Population Genomics, Garvan Institute of Medical Research and UNSW Sydney, Sydney, Australia.
- Centre for Population Genomics, Murdoch Children's Research Institute, Melbourne, Australia.
| |
Collapse
|
50
|
Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, Collins RL, Laricchia KM, Ganna A, Birnbaum DP, Gauthier LD, Brand H, Solomonson M, Watts NA, Rhodes D, Singer-Berk M, England EM, Seaby EG, Kosmicki JA, Walters RK, Tashman K, Farjoun Y, Banks E, Poterba T, Wang A, Seed C, Whiffin N, Chong JX, Samocha KE, Pierce-Hoffman E, Zappala Z, O'Donnell-Luria AH, Minikel EV, Weisburd B, Lek M, Ware JS, Vittal C, Armean IM, Bergelson L, Cibulskis K, Connolly KM, Covarrubias M, Donnelly S, Ferriera S, Gabriel S, Gentry J, Gupta N, Jeandet T, Kaplan D, Llanwarne C, Munshi R, Novod S, Petrillo N, Roazen D, Ruano-Rubio V, Saltzman A, Schleicher M, Soto J, Tibbetts K, Tolonen C, Wade G, Talkowski ME, Neale BM, Daly MJ, MacArthur DG. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 2020; 581:434-443. [PMID: 32461654 PMCID: PMC7334197 DOI: 10.1038/s41586-020-2308-7] [Citation(s) in RCA: 5196] [Impact Index Per Article: 1299.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2019] [Accepted: 03/26/2020] [Indexed: 12/04/2022]
Abstract
Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes that are crucial for the function of an organism will be depleted of such variants in natural populations, whereas non-essential genes will tolerate their accumulation. However, predicted loss-of-function variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes1. Here we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence predicted loss-of-function variants in this cohort after filtering for artefacts caused by sequencing and annotation errors. Using an improved model of human mutation rates, we classify human protein-coding genes along a spectrum that represents tolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve the power of gene discovery for both common and rare diseases.
Collapse
Affiliation(s)
- Konrad J Karczewski
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.
| | - Laurent C Francioli
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Grace Tiao
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Beryl B Cummings
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA
| | - Jessica Alföldi
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Qingbo Wang
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Program in Bioinformatics and Integrative Genomics, Harvard Medical School, Boston, MA, USA
| | - Ryan L Collins
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Program in Bioinformatics and Integrative Genomics, Harvard Medical School, Boston, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Kristen M Laricchia
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Andrea Ganna
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Institute for Molecular Medicine Finland, Helsinki, Finland
| | - Daniel P Birnbaum
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Laura D Gauthier
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Harrison Brand
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Matthew Solomonson
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Nicholas A Watts
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Daniel Rhodes
- Centre for Translational Bioinformatics, William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London and Barts Health NHS Trust, London, UK
| | - Moriel Singer-Berk
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Eleina M England
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Eleanor G Seaby
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Jack A Kosmicki
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Program in Bioinformatics and Integrative Genomics, Harvard Medical School, Boston, MA, USA
| | - Raymond K Walters
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Katherine Tashman
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Yossi Farjoun
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Eric Banks
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Timothy Poterba
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Arcturus Wang
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Cotton Seed
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Nicola Whiffin
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- National Heart & Lung Institute and MRC London Institute of Medical Sciences, Imperial College London, London, UK
- Cardiovascular Research Centre, Royal Brompton & Harefield Hospitals NHS Trust, London, UK
| | - Jessica X Chong
- Department of Pediatrics, University of Washington, Seattle, WA, USA
| | - Kaitlin E Samocha
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Emma Pierce-Hoffman
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Zachary Zappala
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Vertex Pharmaceuticals Inc, Boston, MA, USA
| | - Anne H O'Donnell-Luria
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, USA
| | - Eric Vallabh Minikel
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Ben Weisburd
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Monkol Lek
- Department of Genetics, Yale School of Medicine, New Haven, CT, USA
| | - James S Ware
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- National Heart & Lung Institute and MRC London Institute of Medical Sciences, Imperial College London, London, UK
- Cardiovascular Research Centre, Royal Brompton & Harefield Hospitals NHS Trust, London, UK
| | - Christopher Vittal
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Irina M Armean
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
| | - Louis Bergelson
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Kristian Cibulskis
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Miguel Covarrubias
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Stacey Donnelly
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Steven Ferriera
- Broad Genomics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Stacey Gabriel
- Broad Genomics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Jeff Gentry
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Namrata Gupta
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Broad Genomics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Thibault Jeandet
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Diane Kaplan
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Ruchi Munshi
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Sam Novod
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Nikelle Petrillo
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - David Roazen
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Andrea Saltzman
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Molly Schleicher
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Jose Soto
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Kathleen Tibbetts
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Charlotte Tolonen
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Gordon Wade
- Data Sciences Platform, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Michael E Talkowski
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Neurology, Harvard Medical School, Boston, MA, USA
| | - Benjamin M Neale
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Mark J Daly
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA
- Institute for Molecular Medicine Finland, Helsinki, Finland
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Daniel G MacArthur
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.
- Centre for Population Genomics, Garvan Institute of Medical Research, and UNSW Sydney, Sydney, New South Wales, Australia.
- Centre for Population Genomics, Murdoch Children's Research Institute, Melbourne, Victoria, Australia.
| |
Collapse
|