1
|
Kwon S, Safer J, Nguyen DT, Hoksza D, May P, Arbesfeld JA, Rubin AF, Campbell AJ, Burgin A, Iqbal S. Genomics 2 Proteins portal: a resource and discovery tool for linking genetic screening outputs to protein sequences and structures. Nat Methods 2024:10.1038/s41592-024-02409-0. [PMID: 39294369 DOI: 10.1038/s41592-024-02409-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 08/09/2024] [Indexed: 09/20/2024]
Abstract
Recent advances in AI-based methods have revolutionized the field of structural biology. Concomitantly, high-throughput sequencing and functional genomics have generated genetic variants at an unprecedented scale. However, efficient tools and resources are needed to link disparate data types-to 'map' variants onto protein structures, to better understand how the variation causes disease, and thereby design therapeutics. Here we present the Genomics 2 Proteins portal ( https://g2p.broadinstitute.org/ ): a human proteome-wide resource that maps 20,076,998 genetic variants onto 42,413 protein sequences and 77,923 structures, with a comprehensive set of structural and functional features. Additionally, the Genomics 2 Proteins portal allows users to interactively upload protein residue-wise annotations (for example, variants and scores) as well as the protein structure beyond databases to establish the connection between genomics to proteins. The portal serves as an easy-to-use discovery tool for researchers and scientists to hypothesize the structure-function relationship between natural or synthetic variations and their molecular phenotypes.
Collapse
Affiliation(s)
- Seulki Kwon
- Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Jordan Safer
- Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Duyen T Nguyen
- PATTERN, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - David Hoksza
- Department of Software Engineering, Faculty of Mathematics and Physics, Charles University, Prague, Czech Republic
| | - Patrick May
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Jeremy A Arbesfeld
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children's Hospital, Columbus, OH, USA
| | - Alan F Rubin
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria, Australia
- Department of Medical Biology, University of Melbourne, Parkville, Victoria, Australia
| | - Arthur J Campbell
- Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Alex Burgin
- Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Sumaiya Iqbal
- Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, MA, USA.
- Cancer Data Sciences, Dana-Farber/Harvard Cancer Center, Boston, MA, USA.
| |
Collapse
|
3
|
Haque B, Guirguis G, Curtis M, Mohsin H, Walker S, Morrow MM, Costain G. A comparative medical genomics approach may facilitate the interpretation of rare missense variation. J Med Genet 2024; 61:817-821. [PMID: 38508706 PMCID: PMC11287553 DOI: 10.1136/jmg-2023-109760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Accepted: 03/12/2024] [Indexed: 03/22/2024]
Abstract
PURPOSE To determine the degree to which likely causal missense variants of single-locus traits in domesticated species have features suggestive of pathogenicity in a human genomic context. METHODS We extracted missense variants from the Online Mendelian Inheritance in Animals database for nine animals (cat, cattle, chicken, dog, goat, horse, pig, rabbit and sheep), mapped coordinates to the human reference genome and annotated variants using genome analysis tools. We also searched a private commercial laboratory database of genetic testing results from >400 000 individuals with suspected rare disorders. RESULTS Of 339 variants that were mappable to the same residue and gene in the human genome, 56 had been previously classified with respect to pathogenicity: 31 (55.4%) pathogenic/likely pathogenic, 1 (1.8%) benign/likely benign and 24 (42.9%) uncertain/other. The odds ratio for a pathogenic/likely pathogenic classification in ClinVar was 7.0 (95% CI 4.1 to 12.0, p<0.0001), compared with all other germline missense variants in these same 220 genes. The remaining 283 variants disproportionately had allele frequencies and REVEL scores that supported pathogenicity. CONCLUSION Cross-species comparisons could facilitate the interpretation of rare missense variation. These results provide further support for comparative medical genomics approaches that connect big data initiatives in human and veterinary genetics.
Collapse
Affiliation(s)
- Bushra Haque
- Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - George Guirguis
- Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Meredith Curtis
- Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Hera Mohsin
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Susan Walker
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
| | | | - Gregory Costain
- Program in Genetics and Genome Biology, SickKids Research Institute, Toronto, Ontario, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- The Centre for Applied Genomics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Division of Clinical and Metabolic Genetics, The Hospital for Sick Children, Toronto, Ontario, Canada
- Department of Paediatrics, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|
5
|
Nil Z, Deshwar AR, Huang Y, Barish S, Zhang X, Choufani S, Le Quesne Stabej P, Hayes I, Yap P, Haldeman-Englert C, Wilson C, Prescott T, Tveten K, Vøllo A, Haynes D, Wheeler PG, Zon J, Cytrynbaum C, Jobling R, Blyth M, Banka S, Afenjar A, Mignot C, Robin-Renaldo F, Keren B, Kanca O, Mao X, Wegner DJ, Sisco K, Shinawi M, Wangler MF, Weksberg R, Yamamoto S, Costain G, Bellen HJ. Rare de novo gain-of-function missense variants in DOT1L are associated with developmental delay and congenital anomalies. Am J Hum Genet 2023; 110:1919-1937. [PMID: 37827158 PMCID: PMC10645550 DOI: 10.1016/j.ajhg.2023.09.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 09/18/2023] [Accepted: 09/18/2023] [Indexed: 10/14/2023] Open
Abstract
Misregulation of histone lysine methylation is associated with several human cancers and with human developmental disorders. DOT1L is an evolutionarily conserved gene encoding a lysine methyltransferase (KMT) that methylates histone 3 lysine-79 (H3K79) and was not previously associated with a Mendelian disease in OMIM. We have identified nine unrelated individuals with seven different de novo heterozygous missense variants in DOT1L through the Undiagnosed Disease Network (UDN), the SickKids Complex Care genomics project, and GeneMatcher. All probands had some degree of global developmental delay/intellectual disability, and most had one or more major congenital anomalies. To assess the pathogenicity of the DOT1L variants, functional studies were performed in Drosophila and human cells. The fruit fly DOT1L ortholog, grappa, is expressed in most cells including neurons in the central nervous system. The identified DOT1L variants behave as gain-of-function alleles in flies and lead to increased H3K79 methylation levels in flies and human cells. Our results show that human DOT1L and fly grappa are required for proper development and that de novo heterozygous variants in DOT1L are associated with a Mendelian disease.
Collapse
Affiliation(s)
- Zelha Nil
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Ashish R Deshwar
- Division of Clinical and Metabolic Genetics, The Hospital for Sick Children, Toronto, ON, Canada; Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON, Canada
| | - Yan Huang
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Scott Barish
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Xi Zhang
- Department of Neurology, Xiangya Hospital, Central South University, Changsha 410008, China; National Health Commission Key Laboratory for Birth Defect Research and Prevention, Hunan Provincial Maternal and Child Health Care Hospital, Changsha 410005, China
| | - Sanaa Choufani
- Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON, Canada
| | - Polona Le Quesne Stabej
- Department of Molecular Medicine and Pathology, Faculty of Medical and Health Sciences, the University of Auckland, Auckland, New Zealand
| | - Ian Hayes
- Genetic Health Service New Zealand- Northern Hub, Auckland District Health Board, Auckland, New Zealand
| | - Patrick Yap
- Genetic Health Service New Zealand- Northern Hub, Auckland District Health Board, Auckland, New Zealand
| | | | - Carolyn Wilson
- Mission Fullerton Genetics Center, Asheville, NC 28803, USA
| | - Trine Prescott
- Department of Medical Genetics, Telemark Hospital Trust, 3710 Skien, Norway
| | - Kristian Tveten
- Department of Medical Genetics, Telemark Hospital Trust, 3710 Skien, Norway
| | - Arve Vøllo
- Department of Pediatrics, Hospital of Østfold, 1714 Grålum, Norway
| | - Devon Haynes
- Division of Genetics, Arnold Palmer Hospital for Children - Orlando Health, Orlando, FL, USA; Clinical Genetics Service, Guy's Hospital, Guy's and St Thomas' NHS Trust, London, England, UK
| | - Patricia G Wheeler
- Division of Genetics, Arnold Palmer Hospital for Children - Orlando Health, Orlando, FL, USA
| | - Jessica Zon
- Division of Clinical and Metabolic Genetics, The Hospital for Sick Children, Toronto, ON, Canada
| | - Cheryl Cytrynbaum
- Division of Clinical and Metabolic Genetics, The Hospital for Sick Children, Toronto, ON, Canada; Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON, Canada
| | - Rebekah Jobling
- Division of Clinical and Metabolic Genetics, The Hospital for Sick Children, Toronto, ON, Canada
| | - Moira Blyth
- North of Scotland Regional Genetics Service, Clinical Genetics Centre, Ashgrove House, Foresterhill, Aberdeen, UK
| | - Siddharth Banka
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, M13 9WL Manchester, UK; Manchester Centre for Genomic Medicine, St Mary's Hospital, Manchester University NHS Foundation Trust, Health Innovation Manchester, M13 9WL Manchester, UK
| | - Alexandra Afenjar
- Service de génétique, CRMR des malformations et maladies congénitales du cervelet et CRMR déficience intellectuelle, hôpital Trousseau, AP-HP, Paris, France
| | - Cyril Mignot
- Sorbonne Université, Département de Génétique, Groupe Hospitalier Pitié-Salpêtrière and Hôpital Trousseau, Paris, France; Centre de Référence Déficiences Intellectuelles de Causes Rares, Paris, France
| | | | - Boris Keren
- AP-HP, Hôpital de la Pitié-Salpêtrière, Département de Génétique, 75013 Paris, France
| | - Oguz Kanca
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Xiao Mao
- National Health Commission Key Laboratory for Birth Defect Research and Prevention, Hunan Provincial Maternal and Child Health Care Hospital, Changsha 410005, China; Clinical Research Center for Placental Medicine in Hunan Province, Hunan Provincial Maternal and Child Health Care Hospital, Changsha 410005, China
| | - Daniel J Wegner
- Department of Pediatrics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Kathleen Sisco
- Department of Pediatrics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Marwan Shinawi
- Department of Pediatrics, Washington University School of Medicine, St. Louis, MO 63110, USA
| | - Michael F Wangler
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA
| | - Rosanna Weksberg
- Division of Clinical and Metabolic Genetics, The Hospital for Sick Children, Toronto, ON, Canada; Department of Neurology, Xiangya Hospital, Central South University, Changsha 410008, China; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Shinya Yamamoto
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA; Department of Neuroscience, Baylor College of Medicine, Houston, TX 77030, USA
| | - Gregory Costain
- Division of Clinical and Metabolic Genetics, The Hospital for Sick Children, Toronto, ON, Canada; Program in Genetics and Genome Biology, The Hospital for Sick Children, Toronto, ON, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.
| | - Hugo J Bellen
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Jan and Dan Duncan Neurological Research Institute, Texas Children's Hospital, Houston, TX 77030, USA; Department of Neuroscience, Baylor College of Medicine, Houston, TX 77030, USA.
| |
Collapse
|
6
|
Snyder A, Ryan VH, Hawrot J, Lawton S, Ramos DM, Qi YA, Johnson K, Reed X, Johnson NL, Kollasch AW, Duffy M, VandeVrede L, Cochran JN, Miller BL, Toro C, Bielekova B, Yokoyama JS, Marks DS, Kwan JY, Cookson MR, Ward ME. An ANXA11 P93S variant dysregulates TDP-43 and causes corticobasal syndrome. RESEARCH SQUARE 2023:rs.3.rs-3462973. [PMID: 37886540 PMCID: PMC10602153 DOI: 10.21203/rs.3.rs-3462973/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/28/2023]
Abstract
As genetic testing has become more accessible and affordable, variants of uncertain significance (VUS) are increasingly identified, and determining whether these variants play causal roles in disease is a major challenge. The known disease-associated Annexin A11 (ANXA11) mutations result in ANXA11 aggregation, alterations in lysosomal-RNA granule co-trafficking, and TDP-43 mis-localization and present as amyotrophic lateral sclerosis or frontotemporal dementia. We identified a novel VUS in ANXA11 (P93S) in a kindred with corticobasal syndrome and unique radiographic features that segregated with disease. We then queried neurodegenerative disorder clinic databases to identify the phenotypic spread of ANXA11 mutations. Multi-modal computational analysis of this variant was performed and the effect of this VUS on ANXA11 function and TDP-43 biology was characterized in iPSC-derived neurons. Single-cell sequencing and proteomic analysis of iPSC-derived neurons and microglia were used to determine the multiomic signature of this VUS. Mutations in ANXA11 were found in association with clinically diagnosed corticobasal syndrome, thereby establishing corticobasal syndrome as part of ANXA11 clinical spectrum. In iPSC-derived neurons expressing mutant ANXA11, we found decreased colocalization of lysosomes and decreased neuritic RNA as well as decreased nuclear TDP-43 and increased formation of cryptic exons compared to controls. Multiomic assessment of the P93S variant in iPSC-derived neurons and microglia indicates that the pathogenic omic signature in neurons is modest compared to microglia. Additionally, omic studies reveal that immune dysregulation and interferon signaling pathways in microglia are central to disease. Collectively, these findings identify a new pathogenic variant in ANXA11, expand the range of clinical syndromes caused by ANXA11 mutations, and implicate both neuronal and microglia dysfunction in ANXA11 pathophysiology. This work illustrates the potential for iPSC-derived cellular models to revolutionize the variant annotation process and provides a generalizable approach to determining causality of novel variants across genes.
Collapse
Affiliation(s)
- Allison Snyder
- Neurogenetics Branch, National Institute of Neurological Disorders and Stroke
| | - Veronica H Ryan
- Center for Alzheimer's and Related Dementias, National Institutes of Health
| | - James Hawrot
- Neurogenetics Branch, National Institute of Neurological Disorders and Stroke
| | - Sydney Lawton
- Neurogenetics Branch, National Institute of Neurological Disorders and Stroke
| | - Daniel M Ramos
- Center for Alzheimer's and Related Dementias, National Institutes of Health
| | - Y Andy Qi
- Center for Alzheimer's and Related Dementias, National Institutes of Health
| | - Kory Johnson
- Intramural Bioinformatics, National Institute of Neurological Disorders and Stroke
| | - Xylena Reed
- Center for Alzheimer's and Related Dementias, National Institutes of Health
| | | | | | - Megan Duffy
- Cell Biology and Gene Expression Section, Laboratory of Neurogenetics, National Institute on Aging
| | - Lawren VandeVrede
- Memory and Aging Center, Department of Neurology, Weill Institute for Neurosciences, University of California, San Francisco
| | | | - Bruce L Miller
- Memory and Aging Center, Department of Neurology, Weill Institute for Neurosciences, University of California, San Francisco
| | - Camilo Toro
- Undiagnosed Diseases Program, National Human Genome Research Institute
| | - Bibiana Bielekova
- Neuroimmunological Diseases Section, National Institute of Allergy and Infectious Disease
| | - Jennifer S Yokoyama
- Memory and Aging Center, Department of Neurology, Weill Institute for Neurosciences, University of California, San Francisco
| | - Debora S Marks
- Department of Systems Biology, Harvard Medical School, Boston
| | - Justin Y Kwan
- Office of the Clinical Director, National Institute of Neurological Disorders and Stroke
| | - Mark R Cookson
- Cell Biology and Gene Expression Section, Laboratory of Neurogenetics, National Institute on Aging
| | - Michael E Ward
- Neurogenetics Branch, National Institute of Neurological Disorders and Stroke
| |
Collapse
|