1
|
González-Padilla D, Camara MD, Lauschke VM, Zhou Y. Population-scale variability of the human UDP-glycosyltransferase gene family. J Genet Genomics 2024:S1673-8527(24)00161-9. [PMID: 38969258 DOI: 10.1016/j.jgg.2024.06.018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2024] [Revised: 06/10/2024] [Accepted: 06/26/2024] [Indexed: 07/07/2024]
Abstract
Human UDP-glycosyltransferases (UGTs) are responsible for the glucuronidation of a wide variety of endogenous substrates and commonly prescribed drugs. Different genetic polymorphisms in UGT genes are implicated in interindividual differences in drug response and cancer risk. However, the genetic complexity beyond these variants has not been comprehensively assessed. We here leveraged whole-exome and whole-genome sequencing data from 141,456 unrelated individuals across 7 major human populations to provide a comprehensive profile of genetic variability across the human UGT gene family. Overall, 9666 exonic variants were observed of which 98.9% were rare. To interpret the functional impact of UGT missense variants, we developed a gene family-specific variant effect predictor. This algorithm identified a total of 1208 deleterious variants, most of which were found in African and South Asian populations. Structural analysis corroborated the predicted effects for multiple variations in substrate binding sites. Combined, our analyses provide a systematic overview of UGT variability, which can yield insights into interindividual differences in phase 2 metabolism and facilitate the translation of sequencing data into personalized predictions of UGT substrate disposition.
Collapse
Affiliation(s)
| | - Mahamadou D Camara
- Department of Physiology and Pharmacology, Karolinska Institutet, Stockholm, Sweden; Center for Molecular Medicine, Karolinska Institutet and University Hospital, Stockholm, Sweden
| | - Volker M Lauschke
- Department of Physiology and Pharmacology, Karolinska Institutet, Stockholm, Sweden; Center for Molecular Medicine, Karolinska Institutet and University Hospital, Stockholm, Sweden; Dr Margarete Fischer-Bosch Institute of Clinical Pharmacology, Stuttgart, Germany; University of Tübingen, Tübingen, Germany.
| | - Yitian Zhou
- Department of Physiology and Pharmacology, Karolinska Institutet, Stockholm, Sweden; Center for Molecular Medicine, Karolinska Institutet and University Hospital, Stockholm, Sweden.
| |
Collapse
|
2
|
Jänes J, Müller M, Selvaraj S, Manoel D, Stephenson J, Gonçalves C, Lafita A, Polacco B, Obernier K, Alasoo K, Lemos MC, Krogan N, Martin M, Saraiva LR, Burke D, Beltrao P. Predicted mechanistic impacts of human protein missense variants. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.29.596373. [PMID: 38854010 PMCID: PMC11160786 DOI: 10.1101/2024.05.29.596373] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2024]
Abstract
Genome sequencing efforts have led to the discovery of tens of millions of protein missense variants found in the human population with the majority of these having no annotated role and some likely contributing to trait variation and disease. Sequence-based artificial intelligence approaches have become highly accurate at predicting variants that are detrimental to the function of proteins but they do not inform on mechanisms of disruption. Here we combined sequence and structure-based methods to perform proteome-wide prediction of deleterious variants with information on their impact on protein stability, protein-protein interactions and small-molecule binding pockets. AlphaFold2 structures were used to predict approximately 100,000 small-molecule binding pockets and stability changes for over 200 million variants. To inform on protein-protein interfaces we used AlphaFold2 to predict structures for nearly 500,000 protein complexes. We illustrate the value of mechanism-aware variant effect predictions to study the relation between protein stability and abundance and the structural properties of interfaces underlying trans protein quantitative trait loci (pQTLs). We characterised the distribution of mechanistic impacts of protein variants found in patients and experimentally studied example disease linked variants in FGFR1.
Collapse
Affiliation(s)
- Jürgen Jänes
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Marc Müller
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
| | - Senthil Selvaraj
- Sidra Medicine, Doha, Qatar
- College of Health and Life Sciences, Hamad Bin Khalifa University, Doha, Qatar
| | - Diogo Manoel
- Sidra Medicine, Doha, Qatar
- College of Health and Life Sciences, Hamad Bin Khalifa University, Doha, Qatar
| | - James Stephenson
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge CB10 1SD, UK
- Open Targets, Wellcome Genome Campus, Cambridge, CB10 1SA, UK
| | - Catarina Gonçalves
- Sidra Medicine, Doha, Qatar
- College of Health and Life Sciences, Hamad Bin Khalifa University, Doha, Qatar
| | | | - Benjamin Polacco
- Quantitative Biosciences Institute (QBI), University of California, San Francisco, CA, USA
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, CA, USA
| | - Kirsten Obernier
- Quantitative Biosciences Institute (QBI), University of California, San Francisco, CA, USA
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, CA, USA
| | - Kaur Alasoo
- Institute of Computer Science, University of Tartu, Tartu, Estonia
| | - Manuel C. Lemos
- CICS-UBI, Health Sciences Research Centre, University of Beira Interior, 6200-506, Covilhã, Portugal
| | - Nevan Krogan
- Quantitative Biosciences Institute (QBI), University of California, San Francisco, CA, USA
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, CA, USA
- J. David Gladstone Institutes, San Francisco, CA, USA
| | - Maria Martin
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge CB10 1SD, UK
- Open Targets, Wellcome Genome Campus, Cambridge, CB10 1SA, UK
| | - Luis R. Saraiva
- Sidra Medicine, Doha, Qatar
- College of Health and Life Sciences, Hamad Bin Khalifa University, Doha, Qatar
| | - David Burke
- Faculty of Life Sciences and Medicine, King’s College, London, UK
| | - Pedro Beltrao
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge CB10 1SD, UK
- Open Targets, Wellcome Genome Campus, Cambridge, CB10 1SA, UK
| |
Collapse
|
3
|
Ingelman-Sundberg M, Nebert DW, Lauschke VM. Emerging trends in pharmacogenomics: from common variant associations toward comprehensive genomic profiling. Hum Genomics 2023; 17:105. [PMID: 37996916 PMCID: PMC10668450 DOI: 10.1186/s40246-023-00554-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Accepted: 11/14/2023] [Indexed: 11/25/2023] Open
Affiliation(s)
| | - Daniel W Nebert
- Department of Environmental and Public Health Sciences, Center for Environmental Genetics, University of Cincinnati College of Medicine, Cincinnati, OH, USA
- Department of Pediatrics and Molecular & Developmental Biology, Cincinnati Children's Research Center, Cincinnati, OH, 45229, USA
| | - Volker M Lauschke
- Department of Physiology and Pharmacology, Karolinska Institutet, 171 77, Stockholm, Sweden.
- Dr Margarete Fischer-Bosch Institute of Clinical Pharmacology, Stuttgart, Germany.
- University of Tübingen, Tübingen, Germany.
| |
Collapse
|
4
|
Park Y, Lauschke V. Towards more accurate pharmacogenomic variant effect predictions. Pharmacogenomics 2023; 24:841-844. [PMID: 37846582 DOI: 10.2217/pgs-2023-0187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2023] Open
Abstract
Tweetable abstract Accurate variant interpretation has become a key bottleneck for the translation of an individual's pharmacogenome into actionable recommendations. We recommend an integrated use of multiplexed assays, structure-based predictions and biobank data to develop more accurate effect predictors.
Collapse
Affiliation(s)
- Yoomi Park
- Seoul National University Biomedical Informatics (SNUBI), Seoul National University College of Medicine, Seoul, South Korea
- Medical Research Center, Seoul National University College of Medicine, Seoul, South Korea
| | - Volker Lauschke
- Dr Margarete Fischer-Bosch Institute of Clinical Pharmacology, Stuttgart, Germany
- Department of Physiology & Pharmacology, Karolinska Institutet, Stockholm, Sweden
- University of Tübingen, Tübingen, Germany
| |
Collapse
|
5
|
Geck RC, Powell NR, Dunham MJ. Functional interpretation, cataloging, and analysis of 1,341 glucose-6-phosphate dehydrogenase variants. Am J Hum Genet 2023; 110:228-239. [PMID: 36681081 PMCID: PMC9943724 DOI: 10.1016/j.ajhg.2023.01.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 01/03/2023] [Indexed: 01/22/2023] Open
Abstract
Glucose-6-phosphate dehydrogenase (G6PD) deficiency affects over 500 million individuals who can experience anemia in response to oxidative stressors such as certain foods and drugs. Recently, the World Health Organization (WHO) called for revisiting G6PD variant classification as a priority to implement genetic medicine in low- and middle-income countries. Toward this goal, we sought to collect reports of G6PD variants and provide interpretations. We identified 1,341 G6PD variants in population and clinical databases. Using the ACMG standards and guidelines for the interpretation of sequence variants, we provided interpretations for 268 variants, including 186 variants that were not reported or of uncertain significance in ClinVar, bringing the total number of variants with non-conflicting interpretations to 400. For 414 variants with functional or clinical data, we analyzed associations between activity, stability, and current classification systems, including the new 2022 WHO classification. We corroborated known challenges with classification systems, including phenotypic variation, emphasizing the importance of comparing variant effects across individuals and studies. Biobank data made available by All of Us illustrate the benefit of large-scale sequencing and phenotyping by adding additional support connecting variants to G6PD-deficient anemia. By leveraging available data and interpretation guidelines, we created a repository for information on G6PD variants and nearly doubled the number of variants with clinical interpretations. These tools enable better interpretation of G6PD variants for the implementation of genetic medicine.
Collapse
Affiliation(s)
- Renee C Geck
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Nicholas R Powell
- Division of Clinical Pharmacology, Department of Medicine, School of Medicine, Indiana University, Indianapolis, IN 46202, USA
| | - Maitreya J Dunham
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
6
|
Fu Y, Bedő J, Papenfuss AT, Rubin AF. Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants. Gigascience 2022; 12:giad073. [PMID: 37721410 PMCID: PMC10506130 DOI: 10.1093/gigascience/giad073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 07/02/2023] [Accepted: 08/23/2023] [Indexed: 09/19/2023] Open
Abstract
BACKGROUND Evaluating the impact of amino acid variants has been a critical challenge for studying protein function and interpreting genomic data. High-throughput experimental methods like deep mutational scanning (DMS) can measure the effect of large numbers of variants in a target protein, but because DMS studies have not been performed on all proteins, researchers also model DMS data computationally to estimate variant impacts by predictors. RESULTS In this study, we extended a linear regression-based predictor to explore whether incorporating data from alanine scanning (AS), a widely used low-throughput mutagenesis method, would improve prediction results. To evaluate our model, we collected 146 AS datasets, mapping to 54 DMS datasets across 22 distinct proteins. CONCLUSIONS We show that improved model performance depends on the compatibility of the DMS and AS assays, and the scale of improvement is closely related to the correlation between DMS and AS results.
Collapse
Affiliation(s)
- Yunfan Fu
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
| | - Justin Bedő
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
| | - Anthony T Papenfuss
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
- Peter MacCallum Cancer Centre, Melbourne, Victoria 3000, Australia
| | - Alan F Rubin
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
| |
Collapse
|
7
|
Tabet D, Parikh V, Mali P, Roth FP, Claussnitzer M. Scalable Functional Assays for the Interpretation of Human Genetic Variation. Annu Rev Genet 2022; 56:441-465. [PMID: 36055970 DOI: 10.1146/annurev-genet-072920-032107] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Scalable sequence-function studies have enabled the systematic analysis and cataloging of hundreds of thousands of coding and noncoding genetic variants in the human genome. This has improved clinical variant interpretation and provided insights into the molecular, biophysical, and cellular effects of genetic variants at an astonishing scale and resolution across the spectrum of allele frequencies. In this review, we explore current applications and prospects for the field and outline the principles underlying scalable functional assay design, with a focus on the study of single-nucleotide coding and noncoding variants.
Collapse
Affiliation(s)
- Daniel Tabet
- Donnelly Centre, Department of Molecular Genetics, and Department of Computer Science, University of Toronto, Toronto, Ontario, Canada;
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, Ontario, Canada
| | - Victoria Parikh
- Center for Inherited Cardiovascular Disease, Division of Cardiovascular Medicine, Stanford University School of Medicine, Stanford, California, USA
| | - Prashant Mali
- Department of Bioengineering, University of California, San Diego, California, USA
| | - Frederick P Roth
- Donnelly Centre, Department of Molecular Genetics, and Department of Computer Science, University of Toronto, Toronto, Ontario, Canada;
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, Ontario, Canada
| | - Melina Claussnitzer
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
- Center for Genomic Medicine and Endocrine Division, Massachusetts General Hospital, Boston, Massachusetts, USA
- Harvard Medical School, Harvard University, Boston, Massachusetts, USA;
| |
Collapse
|
8
|
Brown KE, Staples JW, Woodahl EL. Keeping pace with CYP2D6 haplotype discovery: innovative methods to assign function. Pharmacogenomics 2022; 23:255-262. [PMID: 35083931 PMCID: PMC8890136 DOI: 10.2217/pgs-2021-0149] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
The discovery of haplotypes with unknown or uncertain function in the CYP2D6 pharmacogene is outpacing the capabilities of traditional in vitro and in vivo approaches to characterize their function. This challenge will undoubtedly grow as pharmacogenomic research becomes more inclusive of globally diverse populations. As accurate phenotypic assignment is paramount to the utility of pharmacogenomics, high-throughput technologies are needed for this complex pharmacogene. We describe the evolving landscape of innovative approaches to assign function to CYP2D6 haplotypes and possibilities for adopting these technologies into cohesive processes. Promising approaches include ADME-optimized prediction frameworks, machine learning algorithms, deep mutational scanning and phenoconversion predictions. Implementing these approaches will lead to improved personalization of treatment for patients.
Collapse
Affiliation(s)
- Karen E Brown
- Department of Biomedical & Pharmaceutical Sciences, Skaggs School of Pharmacy, University of Montana, Missoula, MT 59812, USA,Skaggs Institute for Health Innovation, University of Montana, Missoula, MT 59812, USA
| | - Jack W Staples
- Department of Biomedical & Pharmaceutical Sciences, Skaggs School of Pharmacy, University of Montana, Missoula, MT 59812, USA,Skaggs Institute for Health Innovation, University of Montana, Missoula, MT 59812, USA
| | - Erica L Woodahl
- Department of Biomedical & Pharmaceutical Sciences, Skaggs School of Pharmacy, University of Montana, Missoula, MT 59812, USA,Skaggs Institute for Health Innovation, University of Montana, Missoula, MT 59812, USA,Author for correspondence: Tel.: +1 406 243 4129;
| |
Collapse
|