1
|
Bridges Y, de Souza V, Cortes KG, Haendel M, Harris NL, Korn DR, Marinakis NM, Matentzoglu N, McLaughlin JA, Mungall CJ, Osumi-Sutherland D, Robinson PN, Smedley D, Jacobsen JO. Towards a standard benchmark for variant and gene prioritisation algorithms: PhEval - Phenotypic inference Evaluation framework. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.13.598672. [PMID: 38915571 PMCID: PMC11195176 DOI: 10.1101/2024.06.13.598672] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/26/2024]
Abstract
Background Computational approaches to support rare disease diagnosis are challenging to build, requiring the integration of complex data types such as ontologies, gene-to-phenotype associations, and cross-species data into variant and gene prioritisation algorithms (VGPAs). However, the performance of VGPAs has been difficult to measure and is impacted by many factors, for example, ontology structure, annotation completeness or changes to the underlying algorithm. Assertions of the capabilities of VGPAs are often not reproducible, in part because there is no standardised, empirical framework and openly available patient data to assess the efficacy of VGPAs - ultimately hindering the development of effective prioritisation tools. Results In this paper, we present our benchmarking tool, PhEval, which aims to provide a standardised and empirical framework to evaluate phenotype-driven VGPAs. The inclusion of standardised test corpora and test corpus generation tools in the PhEval suite of tools allows open benchmarking and comparison of methods on standardised data sets. Conclusions PhEval and the standardised test corpora solve the issues of patient data availability and experimental tooling configuration when benchmarking and comparing rare disease VGPAs. By providing standardised data on patient cohorts from real-world case-reports and controlling the configuration of evaluated VGPAs, PhEval enables transparent, portable, comparable and reproducible benchmarking of VGPAs. As these tools are often a key component of many rare disease diagnostic pipelines, a thorough and standardised method of assessment is essential for improving patient diagnosis and care.
Collapse
Affiliation(s)
- Yasemin Bridges
- William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, EC1M 6BQ, UK
| | - Vinicius de Souza
- European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridgeshire, CB10 1SD, UK
| | - Katherina G Cortes
- School of Public Health, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA
| | - Melissa Haendel
- Department of Genetics, University of North Carolina, Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Nomi L Harris
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Daniel R Korn
- Department of Genetics, University of North Carolina, Chapel Hill, Chapel Hill, NC, 27599, USA
| | - Nikolaos M Marinakis
- Laboratory of Medical Genetics, National and Kapodistrian University of Athens, Athens, 11527, Greece
| | | | - James A McLaughlin
- Samples, Phenotypes, and Ontologies (SPOT), European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridgeshire, CB10 1SD, UK
| | - Christopher J Mungall
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | | | - Peter N Robinson
- Berlin Institute of Health, Charité - Universitätsmedizin Berlin, Berlin, 10117, Germany
| | - Damian Smedley
- William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, EC1M 6BQ, UK
| | - Julius Ob Jacobsen
- William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, EC1M 6BQ, UK
| |
Collapse
|
2
|
Hejla D, Huynh S, Samra S, Richmond PA, Dalmann J, Del Bel KL, Byres L, Lehman A, Turvey SE, Boerkoel CF. Naturally occurring splice variants dissect the functional domains of BHC80 and emphasize the need for RNA analysis. Am J Med Genet A 2024; 194:e63548. [PMID: 38264805 DOI: 10.1002/ajmg.a.63548] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 01/04/2024] [Accepted: 01/12/2024] [Indexed: 01/25/2024]
Abstract
Pathogenic PHF21A variation causes PHF21A-related neurodevelopmental disorders (NDDs). Although amorphic alleles, including haploinsufficiency, have been established as a disease mechanism, increasing evidence suggests that missense variants as well as frameshift variants extending the BHC80 carboxyl terminus also cause disease. Expanding on these, we report a proposita with intellectual disability and overgrowth and a novel de novo heterozygous PHF21A splice variant (NM_001352027.3:c.[153+1G>C];[=]) causing skipping of exon 6, which encodes an in-frame BHC80 deletion (p.(Asn30_Gln51del)). This deletion disrupts a predicted leucine zipper domain and implicates this domain in BHC80 function and as a target of variation causing PHF21A-related NDDs. This extension of understanding emphasizes the application of RNA analysis in precision genomic medicine practice.
Collapse
Affiliation(s)
- Duha Hejla
- Department of Pediatrics, University of British Columbia and Children's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | - Stephanie Huynh
- Provincial Medical Genetics Program, B.C. Women's Hospital, Vancouver, British Columbia, Canada
- Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
| | - Simran Samra
- The Rare Disease Discovery Hub, BC Children's Hospital Research Institute, University of British Columbia and Children's Hospital of British Columbia, Vancouver, British Columbia, Canada
- Experimental Medicine Program, Department of Medicine, University of British Columbia, Vancouver, British Columbia, Canada
| | - Phillip A Richmond
- The Rare Disease Discovery Hub, BC Children's Hospital Research Institute, University of British Columbia and Children's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | - Joshua Dalmann
- The Rare Disease Discovery Hub, BC Children's Hospital Research Institute, University of British Columbia and Children's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | - Kate L Del Bel
- The Rare Disease Discovery Hub, BC Children's Hospital Research Institute, University of British Columbia and Children's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | - Loryn Byres
- The Rare Disease Discovery Hub, BC Children's Hospital Research Institute, University of British Columbia and Children's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | - Anna Lehman
- The Rare Disease Discovery Hub, BC Children's Hospital Research Institute, University of British Columbia and Children's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | - Stuart E Turvey
- The Rare Disease Discovery Hub, BC Children's Hospital Research Institute, University of British Columbia and Children's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | - Cornelius F Boerkoel
- Provincial Medical Genetics Program, B.C. Women's Hospital, Vancouver, British Columbia, Canada
- Department of Medical Genetics, University of British Columbia, Vancouver, British Columbia, Canada
| |
Collapse
|
3
|
Danis D, Bamshad MJ, Bridges Y, Cacheiro P, Carmody LC, Chong JX, Coleman B, Dalgleish R, Freeman PJ, Graefe ASL, Groza T, Jacobsen JOB, Klocperk A, Kusters M, Ladewig MS, Marcello AJ, Mattina T, Mungall CJ, Munoz-Torres MC, Reese JT, Rehburg F, Reis BCS, Schuetz C, Smedley D, Strauss T, Sundaramurthi JC, Thun S, Wissink K, Wagstaff JF, Zocche D, Haendel MA, Robinson PN. A corpus of GA4GH Phenopackets: case-level phenotyping for genomic diagnostics and discovery. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2024:2024.05.29.24308104. [PMID: 38854034 PMCID: PMC11160806 DOI: 10.1101/2024.05.29.24308104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2024]
Abstract
The Global Alliance for Genomics and Health (GA4GH) Phenopacket Schema was released in 2022 and approved by ISO as a standard for sharing clinical and genomic information about an individual, including phenotypic descriptions, numerical measurements, genetic information, diagnoses, and treatments. A phenopacket can be used as an input file for software that supports phenotype-driven genomic diagnostics and for algorithms that facilitate patient classification and stratification for identifying new diseases and treatments. There has been a great need for a collection of phenopackets to test software pipelines and algorithms. Here, we present phenopacket-store. Version 0.1.12 of phenopacket-store includes 4916 phenopackets representing 277 Mendelian and chromosomal diseases associated with 236 genes, and 2872 unique pathogenic alleles curated from 605 different publications. This represents the first large-scale collection of case-level, standardized phenotypic information derived from case reports in the literature with detailed descriptions of the clinical data and will be useful for many purposes, including the development and testing of software for prioritizing genes and diseases in diagnostic genomics, machine learning analysis of clinical phenotype data, patient stratification, and genotype-phenotype correlations. This corpus also provides best-practice examples for curating literature-derived data using the GA4GH Phenopacket Schema.
Collapse
Affiliation(s)
- Daniel Danis
- The Jackson Institute for Genomic Medicine, 10 Discovery Drive, Farmington CT 06032, USA
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Michael J Bamshad
- Department of Pediatrics, Division of Genetic Medicine, University of Washington, 1959 NE Pacific Street, Box 357371, Seattle, WA 98195, USA
- Brotman-Baty Institute for Precision Medicine, 1959 NE Pacific Street, Box 357657, Seattle WA 98195, USA
- Department of Pediatrics, Division of Genetic Medicine, Seattle Children's Hospital, Seattle, WA 98195, USA
| | - Yasemin Bridges
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | - Pilar Cacheiro
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | - Leigh C Carmody
- The Jackson Institute for Genomic Medicine, 10 Discovery Drive, Farmington CT 06032, USA
| | - Jessica X Chong
- Department of Pediatrics, Division of Genetic Medicine, University of Washington, 1959 NE Pacific Street, Box 357371, Seattle, WA 98195, USA
- Brotman-Baty Institute for Precision Medicine, 1959 NE Pacific Street, Box 357657, Seattle WA 98195, USA
| | - Ben Coleman
- Department of Genetics and Genome Sciences, University of Connecticut Health Center, Farmington, CT, USA
- The Jackson Institute for Genomic Medicine, 10 Discovery Drive, Farmington CT 06032, USA
| | - Raymond Dalgleish
- Department of Genetics and Genome Biology, University of Leicester, Leicester, UK
| | - Peter J Freeman
- Division of Informatics, Imaging and Data Science, The University of Manchester, Manchester, UK
| | - Adam S L Graefe
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Tudor Groza
- Rare Care Centre, Perth Children's Hospital, Nedlands, WA 6009, Australia
- SingHealth Duke-NUS Institute of Precision Medicine, 5 Hospital Drive Level 9, Singapore 169609, Singapore
- Telethon Kids Institute, Nedlands, WA 6009, Australia
| | - Julius O B Jacobsen
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | - Adam Klocperk
- Department of Immunology, 2nd Faculty of Medicine, Charles University and University Hospital in Motol, Prague, Czech Republic
| | - Maaike Kusters
- Department of Paediatric Immunology, Great Ormond Street Hospital for Children NHS Foundation Trust, London, UK
- University College London Institute of Child Health, London, United Kingdom
| | - Markus S Ladewig
- Department of Ophthalmology, University Clinic Marburg - Campus Fulda, Fulda, Germany
| | - Anthony J Marcello
- Department of Pediatrics, Division of Genetic Medicine, University of Washington, 1959 NE Pacific Street, Box 357371, Seattle, WA 98195, USA
| | - Teresa Mattina
- Medica Genetics University of Catania Italy
- Morgagni foundation and Clinic, Catania, Italy
| | - Christopher J Mungall
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Monica C Munoz-Torres
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Ccampus
| | - Justin T Reese
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Filip Rehburg
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Bárbara C S Reis
- Department of Immunology, National Institute of Women's, Children's and Adolescents' Health Fernandes Figueira, Rio de Janeiro, Brazil
- High Complexity Laboratory, National Institute of Women's, Children's and Adolescents' Health Fernandes Figueira, Rio de Janeiro, Brazil
| | - Catharina Schuetz
- Department of Pediatrics, Faculty of Medicine and University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
- University Center for Rare Diseases, Faculty of Medicine and University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | - Damian Smedley
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | - Timmy Strauss
- Department of Pediatrics, Faculty of Medicine and University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
- University Center for Rare Diseases, Faculty of Medicine and University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany
| | | | - Sylvia Thun
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Kyran Wissink
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
- Utrecht University, Utrecht, the Netherlands
| | | | - David Zocche
- North West Thames Regional Genetics Service, Northwick Park & St Mark's Hospitals, London, UK
| | | | - Peter N Robinson
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
- The Jackson Institute for Genomic Medicine, 10 Discovery Drive, Farmington CT 06032, USA
- ELLIS-European Laboratory for Learning and Intelligent Systems
| |
Collapse
|
4
|
Mao D, Liu C, Wang L, Ai-Ouran R, Deisseroth C, Pasupuleti S, Kim SY, Li L, Rosenfeld JA, Meng L, Burrage LC, Wangler MF, Yamamoto S, Santana M, Perez V, Shukla P, Eng CM, Lee B, Yuan B, Xia F, Bellen HJ, Liu P, Liu Z. AI-MARRVEL - A Knowledge-Driven AI System for Diagnosing Mendelian Disorders. NEJM AI 2024; 1:10.1056/aioa2300009. [PMID: 38962029 PMCID: PMC11221788 DOI: 10.1056/aioa2300009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/05/2024]
Abstract
BACKGROUND Diagnosing genetic disorders requires extensive manual curation and interpretation of candidate variants, a labor-intensive task even for trained geneticists. Although artificial intelligence (AI) shows promise in aiding these diagnoses, existing AI tools have only achieved moderate success for primary diagnosis. METHODS AI-MARRVEL (AIM) uses a random-forest machine-learning classifier trained on over 3.5 million variants from thousands of diagnosed cases. AIM additionally incorporates expert-engineered features into training to recapitulate the intricate decision-making processes in molecular diagnosis. The online version of AIM is available at https://ai.marrvel.org. To evaluate AIM, we benchmarked it with diagnosed patients from three independent cohorts. RESULTS AIM improved the rate of accurate genetic diagnosis, doubling the number of solved cases as compared with benchmarked methods, across three distinct real-world cohorts. To better identify diagnosable cases from the unsolved pools accumulated over time, we designed a confidence metric on which AIM achieved a precision rate of 98% and identified 57% of diagnosable cases out of a collection of 871 cases. Furthermore, AIM's performance improved after being fine-tuned for targeted settings including recessive disorders and trio analysis. Finally, AIM demonstrated potential for novel disease gene discovery by correctly predicting two newly reported disease genes from the Undiagnosed Diseases Network. CONCLUSIONS AIM achieved superior accuracy compared with existing methods for genetic diagnosis. We anticipate that this tool may aid in primary diagnosis, reanalysis of unsolved cases, and the discovery of novel disease genes. (Funded by the NIH Common Fund and others.).
Collapse
Affiliation(s)
- Dongxue Mao
- Department of Pediatrics, Baylor College of Medicine, Houston
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston
- Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston
| | - Chaozhong Liu
- Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston
- Graduate School of Biomedical Sciences, Program in Quantitative and Computational Biosciences, Baylor College of Medicine, Houston
| | - Linhua Wang
- Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston
- Graduate School of Biomedical Sciences, Program in Quantitative and Computational Biosciences, Baylor College of Medicine, Houston
| | - Rami Ai-Ouran
- Department of Pediatrics, Baylor College of Medicine, Houston
- Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston
- Department of Data Science and AI, Al Hussein Technical University, Amman, Jordan
| | - Cole Deisseroth
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston
- Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston
| | - Sasidhar Pasupuleti
- Department of Pediatrics, Baylor College of Medicine, Houston
- Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston
| | - Seon Young Kim
- Department of Pediatrics, Baylor College of Medicine, Houston
- Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston
| | - Lucian Li
- Department of Pediatrics, Baylor College of Medicine, Houston
- Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston
| | - Jill A Rosenfeld
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston
| | - Linyan Meng
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston
- Baylor Genetics, Houston7
| | - Lindsay C Burrage
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston
| | - Michael F Wangler
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston
- Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston
| | - Shinya Yamamoto
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston
- Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston
| | | | | | | | - Christine M Eng
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston
- Baylor Genetics, Houston7
| | - Brendan Lee
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston
| | - Bo Yuan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston
- Human Genome Sequencing Center, Baylor College of Medicine, Houston
| | - Fan Xia
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston
- Baylor Genetics, Houston7
| | - Hugo J Bellen
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston
- Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston
- Department of Neuroscience, Baylor College of Medicine, Houston
| | - Pengfei Liu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston
- Baylor Genetics, Houston7
| | - Zhandong Liu
- Department of Pediatrics, Baylor College of Medicine, Houston
- Jan and Dan Duncan Neurological Research Institute at Texas Children's Hospital, Houston
| |
Collapse
|
5
|
Duyzend MH, Cacheiro P, Jacobsen JO, Giordano J, Brand H, Wapner RJ, Talkowski ME, Robinson PN, Smedley D. Improving prenatal diagnosis through standards and aggregation. Prenat Diagn 2024; 44:454-464. [PMID: 38242839 PMCID: PMC11006584 DOI: 10.1002/pd.6522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 12/17/2023] [Accepted: 12/22/2023] [Indexed: 01/21/2024]
Abstract
Advances in sequencing and imaging technologies enable enhanced assessment in the prenatal space, with a goal to diagnose and predict the natural history of disease, to direct targeted therapies, and to implement clinical management, including transfer of care, election of supportive care, and selection of surgical interventions. The current lack of standardization and aggregation stymies variant interpretation and gene discovery, which hinders the provision of prenatal precision medicine, leaving clinicians and patients without an accurate diagnosis. With large amounts of data generated, it is imperative to establish standards for data collection, processing, and aggregation. Aggregated and homogeneously processed genetic and phenotypic data permits dissection of the genomic architecture of prenatal presentations of disease and provides a dataset on which data analysis algorithms can be tuned to the prenatal space. Here we discuss the importance of generating aggregate data sets and how the prenatal space is driving the development of interoperable standards and phenotype-driven tools.
Collapse
Affiliation(s)
- Michael H. Duyzend
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Division of Genetics and Genomics, Department of Pediatrics, Boston Children’s Hospital and Harvard Medical School, Boston, MA, USA
| | - Pilar Cacheiro
- William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London EC1M 6BQ, UK
| | - Julius O.B. Jacobsen
- William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London EC1M 6BQ, UK
| | - Jessica Giordano
- Department of Obstetrics & Gynecology, Columbia University Medical Center, New York, NY, USA
| | - Harrison Brand
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Neurology, Harvard Medical School, Boston, MA, USA
| | - Ronald J. Wapner
- Department of Obstetrics & Gynecology, Columbia University Medical Center, New York, NY, USA
| | - Michael E. Talkowski
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Program in Medical and Population Genetics, The Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Neurology, Harvard Medical School, Boston, MA, USA
- Program in Biological and Biomedical Sciences, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
- Program in Bioinformatics and Integrative Genomics, Division of Medical Sciences, Harvard Medical School, Boston, MA, USA
| | - Peter N. Robinson
- The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032, USA
- Institute for Systems Genomics, University of Connecticut, Farmington, CT 06032, USA
| | - Damian Smedley
- William Harvey Research Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London EC1M 6BQ, UK
| |
Collapse
|
6
|
Kim HH, Kim DW, Woo J, Lee K. Explicable prioritization of genetic variants by integration of rule-based and machine learning algorithms for diagnosis of rare Mendelian disorders. Hum Genomics 2024; 18:28. [PMID: 38509596 PMCID: PMC10956189 DOI: 10.1186/s40246-024-00595-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 03/03/2024] [Indexed: 03/22/2024] Open
Abstract
BACKGROUND In the process of finding the causative variant of rare diseases, accurate assessment and prioritization of genetic variants is essential. Previous variant prioritization tools mainly depend on the in-silico prediction of the pathogenicity of variants, which results in low sensitivity and difficulty in interpreting the prioritization result. In this study, we propose an explainable algorithm for variant prioritization, named 3ASC, with higher sensitivity and ability to annotate evidence used for prioritization. 3ASC annotates each variant with the 28 criteria defined by the ACMG/AMP genome interpretation guidelines and features related to the clinical interpretation of the variants. The system can explain the result based on annotated evidence and feature contributions. RESULTS We trained various machine learning algorithms using in-house patient data. The performance of variant ranking was assessed using the recall rate of identifying causative variants in the top-ranked variants. The best practice model was a random forest classifier that showed top 1 recall of 85.6% and top 3 recall of 94.4%. The 3ASC annotates the ACMG/AMP criteria for each genetic variant of a patient so that clinical geneticists can interpret the result as in the CAGI6 SickKids challenge. In the challenge, 3ASC identified causal genes for 10 out of 14 patient cases, with evidence of decreased gene expression for 6 cases. Among them, two genes (HDAC8 and CASK) had decreased gene expression profiles confirmed by transcriptome data. CONCLUSIONS 3ASC can prioritize genetic variants with higher sensitivity compared to previous methods by integrating various features related to clinical interpretation, including features related to false positive risk such as quality control and disease inheritance pattern. The system allows interpretation of each variant based on the ACMG/AMP criteria and feature contribution assessed using explainable AI techniques.
Collapse
Affiliation(s)
- Ho Heon Kim
- Research and Development Center, 3billion, 14th floor, 416 Teheran-ro, Gangnam-gu, Seoul, 06193, Republic of Korea
| | - Dong-Wook Kim
- Research and Development Center, 3billion, 14th floor, 416 Teheran-ro, Gangnam-gu, Seoul, 06193, Republic of Korea
| | - Junwoo Woo
- Research and Development Center, 3billion, 14th floor, 416 Teheran-ro, Gangnam-gu, Seoul, 06193, Republic of Korea
| | - Kyoungyeul Lee
- Research and Development Center, 3billion, 14th floor, 416 Teheran-ro, Gangnam-gu, Seoul, 06193, Republic of Korea.
| |
Collapse
|
7
|
Bhasin MA, Knaus A, Incardona P, Schmid A, Holtgrewe M, Elbracht M, Krawitz PM, Hsieh TC. Enhancing Variant Prioritization in VarFish through On-Premise Computational Facial Analysis. Genes (Basel) 2024; 15:370. [PMID: 38540429 PMCID: PMC10969976 DOI: 10.3390/genes15030370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Revised: 03/03/2024] [Accepted: 03/13/2024] [Indexed: 06/14/2024] Open
Abstract
Genomic variant prioritization is crucial for identifying disease-associated genetic variations. Integrating facial and clinical feature analyses into this process enhances performance. This study demonstrates the integration of facial analysis (GestaltMatcher) and Human Phenotype Ontology analysis (CADA) within VarFish, an open-source variant analysis framework. Challenges related to non-open-source components were addressed by providing an open-source version of GestaltMatcher, facilitating on-premise facial analysis to address data privacy concerns. Performance evaluation on 163 patients recruited from a German multi-center study of rare diseases showed PEDIA's superior accuracy in variant prioritization compared to individual scores. This study highlights the importance of further benchmarking and future integration of advanced facial analysis approaches aligned with ACMG guidelines to enhance variant classification.
Collapse
Affiliation(s)
- Meghna Ahuja Bhasin
- Institute for Genomic Statistics and Bioinformatics, University Hospital Bonn, Rheinische Friedrich-Wilhelms-Universität Bonn, 53127 Bonn, Germany; (M.A.B.); (A.K.); (P.I.); (A.S.); (P.M.K.)
| | - Alexej Knaus
- Institute for Genomic Statistics and Bioinformatics, University Hospital Bonn, Rheinische Friedrich-Wilhelms-Universität Bonn, 53127 Bonn, Germany; (M.A.B.); (A.K.); (P.I.); (A.S.); (P.M.K.)
| | - Pietro Incardona
- Institute for Genomic Statistics and Bioinformatics, University Hospital Bonn, Rheinische Friedrich-Wilhelms-Universität Bonn, 53127 Bonn, Germany; (M.A.B.); (A.K.); (P.I.); (A.S.); (P.M.K.)
- Core Unit for Bioinformatics Data Analysis, Medical Faculty, University of Bonn, 53127 Bonn, Germany
| | - Alexander Schmid
- Institute for Genomic Statistics and Bioinformatics, University Hospital Bonn, Rheinische Friedrich-Wilhelms-Universität Bonn, 53127 Bonn, Germany; (M.A.B.); (A.K.); (P.I.); (A.S.); (P.M.K.)
| | - Manuel Holtgrewe
- CUBI—Core Unit Bioinformatics, Berlin Institute of Health, 10117 Berlin, Germany;
| | - Miriam Elbracht
- Institute for Human Genetics and Genomic Medicine, Medical Faculty, RWTH Aachen University, 52062 Aachen, Germany;
| | - Peter M. Krawitz
- Institute for Genomic Statistics and Bioinformatics, University Hospital Bonn, Rheinische Friedrich-Wilhelms-Universität Bonn, 53127 Bonn, Germany; (M.A.B.); (A.K.); (P.I.); (A.S.); (P.M.K.)
| | - Tzung-Chien Hsieh
- Institute for Genomic Statistics and Bioinformatics, University Hospital Bonn, Rheinische Friedrich-Wilhelms-Universität Bonn, 53127 Bonn, Germany; (M.A.B.); (A.K.); (P.I.); (A.S.); (P.M.K.)
| |
Collapse
|
8
|
Yuan X, Su J, Wang J, Dai B, Sun Y, Zhang K, Li Y, Chuan J, Tang C, Yu Y, Gong Q. Refined preferences of prioritizers improve intelligent diagnosis for Mendelian diseases. Sci Rep 2024; 14:2845. [PMID: 38310124 PMCID: PMC10838329 DOI: 10.1038/s41598-024-53461-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Accepted: 01/31/2024] [Indexed: 02/05/2024] Open
Abstract
Phenotype-guided gene prioritizers have proved a highly efficient approach to identifying causal genes for Mendelian diseases. In our previous study, we preliminarily evaluated the performance of ten prioritizers. However, all the selected software was run based on default settings and singleton mode. With a large-scale family dataset from Deciphering Developmental Disorders (DDD) project (N = 305) and an in-house trio cohort (N = 152), the four optimal performers in our prior study including Exomiser, PhenIX, AMELIE, and LIRCIAL were further assessed through parameter optimization and/or the utilization of trio mode. The in-depth assessment revealed high diagnostic yields of the four prioritizers with refined preferences, each alone or together: (1) 83.3-91.8% of the causal genes were presented among the first ten candidates in the final ranking lists of the four tools; (2) Over 97.7% of the causal genes were successfully captured within the top 50 by either of the four software. Exomiser did best in directly hitting the target (ranking the causal gene at the very top) while LIRICAL displayed a predominant overall detection capability. Besides, cases affected by low-penetrance and high-frequency pathogenic variants were found misjudged during the automated prioritization process. The discovery of the limitations shed light on the specific directions of future enhancement for causal-gene ranking tools.
Collapse
Affiliation(s)
- Xiao Yuan
- Changsha Kingmed Center for Clinical Laboratory, Lutian Road 28, Changsha, 410000, Hunan, China
| | - Jieqiong Su
- Changsha Kingmed Center for Clinical Laboratory, Lutian Road 28, Changsha, 410000, Hunan, China
| | - Jing Wang
- Changsha Kingmed Center for Clinical Laboratory, Lutian Road 28, Changsha, 410000, Hunan, China
| | - Bing Dai
- Changsha Kingmed Center for Clinical Laboratory, Lutian Road 28, Changsha, 410000, Hunan, China
| | - Yanfang Sun
- Changsha Kingmed Center for Clinical Laboratory, Lutian Road 28, Changsha, 410000, Hunan, China
| | - Keke Zhang
- Changsha Kingmed Center for Clinical Laboratory, Lutian Road 28, Changsha, 410000, Hunan, China
| | - Yinghua Li
- Guangzhou Kingmed Center for Clinical Laboratory, Guangzhou, Guangdong, China
| | - Jun Chuan
- Genetalks Biotech. Co., Ltd., Changsha, Hunan, China
| | - Chunyan Tang
- Changsha Kingmed Center for Clinical Laboratory, Lutian Road 28, Changsha, 410000, Hunan, China
| | - Yan Yu
- Changsha Kingmed Center for Clinical Laboratory, Lutian Road 28, Changsha, 410000, Hunan, China.
| | - Qiang Gong
- Changsha Kingmed Center for Clinical Laboratory, Lutian Road 28, Changsha, 410000, Hunan, China.
| |
Collapse
|
9
|
Yang J, Shu L, Han M, Pan J, Chen L, Yuan T, Tan L, Shu Q, Duan H, Li H. RDmaster: A novel phenotype-oriented dialogue system supporting differential diagnosis of rare disease. Comput Biol Med 2024; 169:107924. [PMID: 38181610 DOI: 10.1016/j.compbiomed.2024.107924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 12/18/2023] [Accepted: 01/01/2024] [Indexed: 01/07/2024]
Abstract
BACKGROUND Clinicians often lack the necessary expertise to differentially diagnose multiple underlying rare diseases (RDs) due to their complex and overlapping clinical features, leading to misdiagnoses and delayed treatments. The aim of this study is to develop a novel electronic differential diagnostic support system for RDs. METHOD Through integrating two Bayesian diagnostic methods, a candidate list was generated with enhance clinical interpretability for the further Q&A based differential diagnosis (DDX). To achieve an efficient Q&A dialogue strategy, we introduce a novel metric named the adaptive information gain and Gini index (AIGGI) to evaluate the expected gain of interrogated phenotypes within real-time diagnostic states. RESULTS This DDX tool called RDmaster has been implemented as a web-based platform (http://rdmaster.nbscn.org/). A diagnostic trial involving 238 published RD patients revealed that RDmaster outperformed existing RD diagnostic tools, as well as ChatGPT, and was shown to enhance the diagnostic accuracy through its Q&A system. CONCLUSIONS The RDmaster offers an effective multi-omics differential diagnostic technique and outperforms existing tools and popular large language models, particularly enhancing differential diagnosis in collecting diagnostically beneficial phenotypes.
Collapse
Affiliation(s)
- Jian Yang
- Clinical Data Center, The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Zhejiang, China; The College of Biomedical Engineering and Instrument Science, Zhejiang University, Zhejiang, China
| | - Liqi Shu
- Rhode Island Hospital, Warren Alpert Medical School of Brown University, Rhode Island, USA
| | - Mingyu Han
- Neonatal Department, The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Zhejiang, China
| | - Jiarong Pan
- Neonatal Department, The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Zhejiang, China
| | - Lihua Chen
- Neonatal Department, The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Zhejiang, China
| | - Tianming Yuan
- Neonatal Department, The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Zhejiang, China
| | - Linhua Tan
- Surgical Intensive Care Unit, The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Zhejiang, China
| | - Qiang Shu
- Clinical Data Center, The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Zhejiang, China
| | - Huilong Duan
- The College of Biomedical Engineering and Instrument Science, Zhejiang University, Zhejiang, China
| | - Haomin Li
- Clinical Data Center, The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Zhejiang, China.
| |
Collapse
|
10
|
Lagorce D, Lebreton E, Matalonga L, Hongnat O, Chahdil M, Piscia D, Paramonov I, Ellwanger K, Köhler S, Robinson P, Graessner H, Beltran S, Lucano C, Hanauer M, Rath A. Phenotypic similarity-based approach for variant prioritization for unsolved rare disease: a preliminary methodological report. Eur J Hum Genet 2024; 32:182-189. [PMID: 37926714 PMCID: PMC10853199 DOI: 10.1038/s41431-023-01486-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 09/13/2023] [Accepted: 10/05/2023] [Indexed: 11/07/2023] Open
Abstract
Rare diseases (RD) have a prevalence of not more than 1/2000 persons in the European population, and are characterised by the difficulty experienced in obtaining a correct and timely diagnosis. According to Orphanet, 72.5% of RD have a genetic origin although 35% of them do not yet have an identified causative gene. A significant proportion of patients suspected to have a genetic RD receive an inconclusive exome/genome sequencing. Working towards the International Rare Diseases Research Consortium (IRDiRC)'s goal for 2027 to ensure that all people living with a RD receive a diagnosis within one year of coming to medical attention, the Solve-RD project aims to identify the molecular causes underlying undiagnosed RD. As part of this strategy, we developed a phenotypic similarity-based variant prioritization methodology comparing submitted cases with other submitted cases and with known RD in Orphanet. Three complementary approaches based on phenotypic similarity calculations using the Human Phenotype Ontology (HPO), the Orphanet Rare Diseases Ontology (ORDO) and the HPO-ORDO Ontological Module (HOOM) were developed; genomic data reanalysis was performed by the RD-Connect Genome-Phenome Analysis Platform (GPAP). The methodology was tested in 4 exemplary cases discussed with experts from European Reference Networks. Variants of interest (pathogenic or likely pathogenic) were detected in 8.8% of the 725 cases clustered by similarity calculations. Diagnostic hypotheses were validated in 42.1% of them and needed further exploration in another 10.9%. Based on the promising results, we are devising an automated standardized phenotypic-based re-analysis pipeline to be applied to the entire unsolved cases cohort.
Collapse
Affiliation(s)
- David Lagorce
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, 75014, Paris, France.
| | - Emeline Lebreton
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, 75014, Paris, France
| | - Leslie Matalonga
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, Barcelona, 08028, Spain
| | - Oscar Hongnat
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, 75014, Paris, France
| | - Maroua Chahdil
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, 75014, Paris, France
| | - Davide Piscia
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, Barcelona, 08028, Spain
| | - Ida Paramonov
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, Barcelona, 08028, Spain
| | - Kornelia Ellwanger
- Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany
- Centre for Rare Diseases, University of Tübingen, Tübingen, Germany
| | | | - Peter Robinson
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06032, USA
| | - Holm Graessner
- Institute of Medical Genetics and Applied Genomics, University of Tübingen, Tübingen, Germany
- Centre for Rare Diseases, University of Tübingen, Tübingen, Germany
| | - Sergi Beltran
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Baldiri Reixac 4, Barcelona, 08028, Spain
| | - Caterina Lucano
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, 75014, Paris, France
| | - Marc Hanauer
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, 75014, Paris, France
| | - Ana Rath
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, 75014, Paris, France
| |
Collapse
|
11
|
Yang J, Liu C, Deng W, Wu D, Weng C, Zhou Y, Wang K. Enhancing phenotype recognition in clinical notes using large language models: PhenoBCBERT and PhenoGPT. PATTERNS (NEW YORK, N.Y.) 2024; 5:100887. [PMID: 38264716 PMCID: PMC10801236 DOI: 10.1016/j.patter.2023.100887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 10/25/2023] [Accepted: 11/06/2023] [Indexed: 01/25/2024]
Abstract
To enhance phenotype recognition in clinical notes of genetic diseases, we developed two models-PhenoBCBERT and PhenoGPT-for expanding the vocabularies of Human Phenotype Ontology (HPO) terms. While HPO offers a standardized vocabulary for phenotypes, existing tools often fail to capture the full scope of phenotypes due to limitations from traditional heuristic or rule-based approaches. Our models leverage large language models to automate the detection of phenotype terms, including those not in the current HPO. We compare these models with PhenoTagger, another HPO recognition tool, and found that our models identify a wider range of phenotype concepts, including previously uncharacterized ones. Our models also show strong performance in case studies on biomedical literature. We evaluate the strengths and weaknesses of BERT- and GPT-based models in aspects such as architecture and accuracy. Overall, our models enhance automated phenotype detection from clinical texts, improving downstream analyses on human diseases.
Collapse
Affiliation(s)
- Jingye Yang
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Mathematics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Cong Liu
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, USA
| | - Wendy Deng
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Da Wu
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Chunhua Weng
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, USA
| | - Yunyun Zhou
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Biostatistics and Bioinformatics Facility, Fox Chase Cancer Center, Philadelphia, PA 19111, USA
| | - Kai Wang
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
12
|
Gargano MA, Matentzoglu N, Coleman B, Addo-Lartey EB, Anagnostopoulos A, Anderton J, Avillach P, Bagley AM, Bakštein E, Balhoff JP, Baynam G, Bello SM, Berk M, Bertram H, Bishop S, Blau H, Bodenstein DF, Botas P, Boztug K, Čady J, Callahan TJ, Cameron R, Carbon S, Castellanos F, Caufield JH, Chan LE, Chute C, Cruz-Rojo J, Dahan-Oliel N, Davids JR, de Dieuleveult M, de Souza V, de Vries BBA, de Vries E, DePaulo JR, Derfalvi B, Dhombres F, Diaz-Byrd C, Dingemans AJM, Donadille B, Duyzend M, Elfeky R, Essaid S, Fabrizzi C, Fico G, Firth HV, Freudenberg-Hua Y, Fullerton JM, Gabriel DL, Gilmour K, Giordano J, Goes FS, Moses RG, Green I, Griese M, Groza T, Gu W, Guthrie J, Gyori B, Hamosh A, Hanauer M, Hanušová K, He Y(O, Hegde H, Helbig I, Holasová K, Hoyt CT, Huang S, Hurwitz E, Jacobsen JOB, Jiang X, Joseph L, Keramatian K, King B, Knoflach K, Koolen DA, Kraus M, Kroll C, Kusters M, Ladewig MS, Lagorce D, Lai MC, Lapunzina P, Laraway B, Lewis-Smith D, Li X, Lucano C, Majd M, Marazita ML, Martinez-Glez V, McHenry TH, McInnis MG, McMurry JA, Mihulová M, Millett CE, Mitchell PB, Moslerová V, Narutomi K, Nematollahi S, Nevado J, Nierenberg AA, Čajbiková NN, Nurnberger JI, Ogishima S, Olson D, Ortiz A, Pachajoa H, Perez de Nanclares G, Peters A, Putman T, Rapp CK, Rath A, Reese J, Rekerle L, Roberts A, Roy S, Sanders SJ, Schuetz C, Schulte EC, Schulze TG, Schwarz M, Scott K, Seelow D, Seitz B, Shen Y, Similuk MN, Simon ES, Singh B, Smedley D, Smith CL, Smolinsky JT, Sperry S, Stafford E, Stefancsik R, Steinhaus R, Strawbridge R, Sundaramurthi JC, Talapova P, Tenorio Castano JA, Tesner P, Thomas RH, Thurm A, Turnovec M, van Gijn ME, Vasilevsky NA, Vlčková M, Walden A, Wang K, Wapner R, Ware JS, Wiafe AA, Wiafe SA, Wiggins LD, Williams AE, Wu C, Wyrwoll MJ, Xiong H, Yalin N, Yamamoto Y, Yatham LN, Yocum AK, Young AH, Yüksel Z, Zandi PP, Zankl A, Zarante I, Zvolský M, Toro S, Carmody LC, Harris NL, Munoz-Torres MC, Danis D, Mungall CJ, Köhler S, Haendel MA, Robinson PN. The Human Phenotype Ontology in 2024: phenotypes around the world. Nucleic Acids Res 2024; 52:D1333-D1346. [PMID: 37953324 PMCID: PMC10767975 DOI: 10.1093/nar/gkad1005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 10/12/2023] [Accepted: 10/19/2023] [Indexed: 11/14/2023] Open
Abstract
The Human Phenotype Ontology (HPO) is a widely used resource that comprehensively organizes and defines the phenotypic features of human disease, enabling computational inference and supporting genomic and phenotypic analyses through semantic similarity and machine learning algorithms. The HPO has widespread applications in clinical diagnostics and translational research, including genomic diagnostics, gene-disease discovery, and cohort analytics. In recent years, groups around the world have developed translations of the HPO from English to other languages, and the HPO browser has been internationalized, allowing users to view HPO term labels and in many cases synonyms and definitions in ten languages in addition to English. Since our last report, a total of 2239 new HPO terms and 49235 new HPO annotations were developed, many in collaboration with external groups in the fields of psychiatry, arthrogryposis, immunology and cardiology. The Medical Action Ontology (MAxO) is a new effort to model treatments and other measures taken for clinical management. Finally, the HPO consortium is contributing to efforts to integrate the HPO and the GA4GH Phenopacket Schema into electronic health records (EHRs) with the goal of more standardized and computable integration of rare disease data in EHRs.
Collapse
Affiliation(s)
| | | | - Ben Coleman
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | | | - Joel Anderton
- Center for Craniofacial and Dental Genetics, Department of Oral and Craniofacial Sciences, School of Dental Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | | | - Anita M Bagley
- Shriners Children's Northern California, Sacramento, CA, USA
| | - Eduard Bakštein
- National Institute of Mental Health, Klecany, Czech Republic
| | - James P Balhoff
- Renaissance Computing Institute, University of North Carolina, Chapel Hill, NC 27517, USA
| | - Gareth Baynam
- Rare Care Centre, Perth Children's Hospital, Perth, Australia
| | | | - Michael Berk
- Deakin University, IMPACT - the Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Barwon Health, Geelong, Australia
| | - Holli Bertram
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | - Somer Bishop
- Department of Psychiatry and Behavioral Sciences, UCSF Weil Institute for Neuroscience, San Francisco, CA, USA
| | - Hannah Blau
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - David F Bodenstein
- Department of Pharmacology and Toxicology, University of Toronto, Toronto, ON, Canada
| | | | - Kaan Boztug
- St. Anna Children's Cancer Research Institute (CCRI), Vienna, Austria
| | - Jolana Čady
- Institute of Health Information and Statistics of the Czech Republic, Prague, Czech Republic
| | - Tiffany J Callahan
- Department of Biomedical Informatics, Columbia University Irving Medical Center, NY, NY, USA
| | | | - Seth J Carbon
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | | | - J Harry Caufield
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Lauren E Chan
- College of Public Health and Human Sciences, Oregon State University, Corvallis, OR 97331, USA
| | - Christopher G Chute
- Schools of Medicine, Public Health, and Nursing, Johns Hopkins University, Baltimore, MD 21287, USA
| | - Jaime Cruz-Rojo
- UDISGEN (Dysmorphology and Genetics Unit), 12 de Octubre Hospital, Madrid, Spain
| | - Noémi Dahan-Oliel
- Department of Clinical Research, Shriners Hospitals for Children, Montreal, Quebec, Canada
| | - Jon R Davids
- Shriners Children's Northern California, Sacramento, CA, USA
| | - Maud de Dieuleveult
- Département I&D, AP-HP, Banque Nationale de Données Maladies Rares, Paris, France
| | - Vinicius de Souza
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Bert B A de Vries
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, Netherlands
| | | | - J Raymond DePaulo
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Beata Derfalvi
- Department of Pediatrics, Dalhousie University, Halifax, NS, Canada
| | - Ferdinand Dhombres
- Fetal Medicine Department, Armand Trousseau Hospital, Sorbonne University, GRC26, INSERM, Limics, Paris, France
| | - Claudia Diaz-Byrd
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | - Alexander J M Dingemans
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, Netherlands
| | - Bruno Donadille
- St Antoine Hospital, Reference Center for Rare Growth Endocrine Disorders, Sorbonne University, AP-HP, INSERM, US14 - Orphanet, Plateforme Maladies Rares, Paris, France
| | | | - Reem Elfeky
- Department of Immunology, GOS Hospital for Children NHS Foundation Trust, University College London, London, UK
| | - Shahim Essaid
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | | | - Giovanna Fico
- Bipolar and Depressive Disorders Unit, Institute of Neuroscience, Hospital Clinic, University of Barcelona, IDIBAPS, CIBERSAM, Barcelona, Catalonia, Spain
| | - Helen V Firth
- Addenbrooke's Hospital, Cambridge University Hospitals, Cambridge, UK
| | - Yun Freudenberg-Hua
- Department of Psychiatry, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA
| | | | - Davera L Gabriel
- School of Medicine, Johns Hopkins University, Baltimore, MD 21287, USA
| | | | - Jessica Giordano
- Department of Obstetrics and Gynecology, Columbia University Irving Medical Center, New York, NY, USA
| | - Fernando S Goes
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Rachel Gore Moses
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Ian Green
- SNOMED International, London W2 6BD, UK
| | - Matthias Griese
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, German center for Lung research (DZL), Munich, Germany
| | - Tudor Groza
- Rare Care Centre, Perth Children's Hospital, Perth, Australia
| | | | - Julia Guthrie
- Department of Structural and Computational Biology, University of Vienna; Max Perutz Labs, Vienna, Austria
| | - Benjamin Gyori
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Ada Hamosh
- Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Marc Hanauer
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, Paris, France
| | - Kateřina Hanušová
- Institute of Health Information and Statistics of the Czech Republic, Prague, Czech Republic
| | | | - Harshad Hegde
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Ingo Helbig
- Neurology, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Kateřina Holasová
- Institute of Health Information and Statistics of the Czech Republic, Prague, Czech Republic
| | - Charles Tapley Hoyt
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | | | - Eric Hurwitz
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Julius O B Jacobsen
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | | | - Lisa Joseph
- Neurodevelopmental and Behavioral Phenotyping Service, National Institute of Mental Health, Bethesda, MD, USA
| | - Kamyar Keramatian
- Department of Psychiatry, University of British Columbia, Vancouver, BC, Canada
| | - Bryan King
- Department of Psychiatry and Behavioral Sciences, UCSF Weil Institute for Neuroscience, San Francisco, CA, USA
| | - Katrin Knoflach
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, German center for Lung research (DZL), Munich, Germany
| | - David A Koolen
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, Netherlands
| | - Megan L Kraus
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Carlo Kroll
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | - Maaike Kusters
- Immunology, NIHR Great Ormond Street Hospital BRC, London, UK
| | - Markus S Ladewig
- Department of Ophthalmology, University Clinic Marburg - Campus Fulda, Fulda, Germany
| | - David Lagorce
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, Paris, France
| | - Meng-Chuan Lai
- Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Toronto, ON, Canada
| | - Pablo Lapunzina
- Institute of Medical and Molecular Genetics, Hospital Univ. La Paz, Madrid, Spain
| | - Bryan Laraway
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - David Lewis-Smith
- Translational and Clinical Research Institute, Henry Wellcome Building, Framlington Place, Newcastle University, Newcastle-Upon-Tyne NE14LP, UK
| | | | - Caterina Lucano
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, Paris, France
| | - Marzieh Majd
- Department of Psychiatry, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Mary L Marazita
- Center for Craniofacial and Dental Genetics, Department of Oral and Craniofacial Sciences, School of Dental Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | - Victor Martinez-Glez
- Center for Genomic Medicine, Parc Taulí Hospital Universitari, Institut d’Investigació i Innovació Parc Taulí (I3PT-CERCA), Sabadell, Spain
| | - Toby H McHenry
- Center for Craniofacial and Dental Genetics, Department of Oral and Craniofacial Sciences, School of Dental Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | - Melvin G McInnis
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | - Julie A McMurry
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Michaela Mihulová
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Caitlin E Millett
- Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA
| | - Philip B Mitchell
- Discipline of Psychiatry & Mental Health, School of Clinical Medicine, Faculty of Medicine & Health, University of New South Wales, Sydney, NSW, Australia
| | - Veronika Moslerová
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Kenji Narutomi
- Okinawa Prefectural Nanbu Medical Center & Children's Medical Center
| | - Shahrzad Nematollahi
- School of Physical and Occupational Therapy, McGill University, Montreal, Quebec, Canada
| | - Julian Nevado
- Institute of Medical and Molecular Genetics, Hospital Univ. La Paz, Madrid, Spain
| | - Andrew A Nierenberg
- Dauten Family Center for Bipolar Treatment Innovation, Massachusetts General Hospital, Boston, MA, USA
| | - Nikola Novák Čajbiková
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - John I Nurnberger
- Stark Neurosciences Research Institute, Departments of Psychiatry and Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA
| | | | - Daniel Olson
- Data Collaboration Center, Data Science, Critical Path Institute, Tucson, AZ, USA
| | - Abigail Ortiz
- Department of Psychiatry, University of Toronto, Toronto, ON, Canada
| | - Harry Pachajoa
- Centro de Investigaciones en Anomalías Congénitas y Enfermedades Raras (CIACER), Universidad Icesi, Cali, Colombia
| | - Guiomar Perez de Nanclares
- Molecular (epi) genetics lab, Bioaraba Health Research Institute, Araba University Hospital, Vitoria-Gasteiz, Spain
| | - Amy Peters
- Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA
| | - Tim Putman
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Christina K Rapp
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, German center for Lung research (DZL), Munich, Germany
| | - Ana Rath
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, Paris, France
| | - Justin Reese
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Lauren Rekerle
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Angharad M Roberts
- National Heart & Lung Institute & MRC London Institute of Medical Sciences, Imperial College London, London W12 0HS, UK
| | - Suzy Roy
- SNOMED International, London W2 6BD, UK
| | - Stephan J Sanders
- Department of Paediatrics, Institute of Developmental and Regenerative Medicine, University of Oxford, Oxford, UK
| | - Catharina Schuetz
- Universitätsklinikum Carl Gustav Carus, Medizinische Fakultät, TU, Dresden, Germany
| | - Eva C Schulte
- Institute of Psychiatric Phenomics and Genomics (IPPG), LMU University Hospital, LMU Munich, Munich, Germany
| | - Thomas G Schulze
- Department of Psychiatry and Behavioral Sciences, SUNY Upstate Medical University, Syracuse, NY, USA
| | - Martin Schwarz
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Katie Scott
- Department of Psychiatry, Dalhousie University, Halifax, NS, Canada
| | - Dominik Seelow
- Exploratory Diagnostic Sciences, Berliner Institut für Gesundheitsforschung - Charité, Berlin, Germany
| | - Berthold Seitz
- Department of Ophthalmology, Saarland University Medical Center UKS, Homburg/Saar, Germany
| | | | - Morgan N Similuk
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Eric S Simon
- Eisenberg Family Depression Center, University of Michigan, Ann Arbor, MI, USA
| | - Balwinder Singh
- Department of Psychiatry and Psychology, Mayo Clinic, Rochester, MN, USA
| | - Damian Smedley
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | | | - Jake T Smolinsky
- Human Genetics Institute of New Jersey, Rutgers University, Piscataway, NJ, USA
| | - Sarah Sperry
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | | | - Ray Stefancsik
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Robin Steinhaus
- Exploratory Diagnostic Sciences, Berliner Institut für Gesundheitsforschung - Charité, Berlin, Germany
| | - Rebecca Strawbridge
- Department of Psychological Medicine, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
| | | | - Polina Talapova
- Institute for Research and Health Policy Studies, Tufts Medicine, Boston, MA 2111, USA
| | | | - Pavel Tesner
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Rhys H Thomas
- Translational and Clinical Research Institute, Henry Wellcome Building, Framlington Place, Newcastle University, Newcastle-Upon-Tyne NE14LP, UK
| | - Audrey Thurm
- Neurodevelopmental and Behavioral Phenotyping Service, National Institute of Mental Health, Bethesda, MD, USA
| | - Marek Turnovec
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Marielle E van Gijn
- Department of Genetics, University Medical Center Groningen, Groningen, Netherlands
| | | | - Markéta Vlčková
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Anita Walden
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Kai Wang
- Chinese HPO Consortium, Beijing, China
| | - Ron Wapner
- Department of Obstetrics and Gynecology, Columbia University Irving Medical Center, New York, NY, USA
| | - James S Ware
- National Heart & Lung Institute & MRC London Institute of Medical Sciences, Imperial College London, London W12 0HS, UK
| | | | | | - Lisa D Wiggins
- National Center on Birth Defects and Developmental Disabilities, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Andrew E Williams
- Institute for Research and Health Policy Studies, Tufts Medicine, Boston, MA 2111, USA
| | - Chen Wu
- Chinese HPO Consortium, Beijing, China
| | - Margot J Wyrwoll
- Centre for Regenerative Medicine, Institute for Regeneration and Repair, Institute for Stem Cell Research, University of Edinburgh, Edinburgh, UK
| | - Hui Xiong
- Chinese HPO Consortium, Beijing, China
| | - Nefize Yalin
- Department of Psychological Medicine, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
| | - Yasunori Yamamoto
- Database Center for Life Science, Joint Support-Center for Data Science Research, Research Organization of Information and Systems, Japan
| | - Lakshmi N Yatham
- Department of Psychiatry, University of British Columbia, Vancouver, BC, Canada
| | - Anastasia K Yocum
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | - Allan H Young
- Psychological Medicine, Institute of Psychiatry, Psychology and Neuroscience, King's College London & South London and Maudsley NHS Foundation Trust, Bethlem Royal Hospital, Monks Orchard Road, Beckenham, Kent, London SE5 8AF, UK
| | - Zafer Yüksel
- Department of Human Genetics, Bioscientia Healthcare GmbH, Ingelheim, Germany
| | - Peter P Zandi
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Andreas Zankl
- Faculty of Medicine and Health, The University of Sydney, Camperdown, Australia
| | - Ignacio Zarante
- Institute of Human Genetics, Pontificia Universidad Javeriana, Bogotá, Colombia
| | - Miroslav Zvolský
- Institute of Health Information and Statistics of the Czech Republic, Prague, Czech Republic
| | - Sabrina Toro
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Leigh C Carmody
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Nomi L Harris
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Monica C Munoz-Torres
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Daniel Danis
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Christopher J Mungall
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | | | - Melissa A Haendel
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Peter N Robinson
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| |
Collapse
|
13
|
Yang J, Liu C, Deng W, Wu D, Weng C, Zhou Y, Wang K. Enhancing Phenotype Recognition in Clinical Notes Using Large Language Models: PhenoBCBERT and PhenoGPT. ARXIV 2023:arXiv:2308.06294v2. [PMID: 37986722 PMCID: PMC10659449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/22/2023]
Abstract
To enhance phenotype recognition in clinical notes of genetic diseases, we developed two models - PhenoBCBERT and PhenoGPT - for expanding the vocabularies of Human Phenotype Ontology (HPO) terms. While HPO offers a standardized vocabulary for phenotypes, existing tools often fail to capture the full scope of phenotypes, due to limitations from traditional heuristic or rule-based approaches. Our models leverage large language models (LLMs) to automate the detection of phenotype terms, including those not in the current HPO. We compared these models to PhenoTagger, another HPO recognition tool, and found that our models identify a wider range of phenotype concepts, including previously uncharacterized ones. Our models also showed strong performance in case studies on biomedical literature. We evaluated the strengths and weaknesses of BERT-based and GPT-based models in aspects such as architecture and accuracy. Overall, our models enhance automated phenotype detection from clinical texts, improving downstream analyses on human diseases.
Collapse
Affiliation(s)
- Jingye Yang
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Mathematics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Cong Liu
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, USA
| | - Wendy Deng
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Da Wu
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Chunhua Weng
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, USA
| | - Yunyun Zhou
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Biostatistics and Bioinformatics facility, Fox Chase Cancer Center, Philadelphia, PA 19111, USA
| | - Kai Wang
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
14
|
Alsentzer E, Finlayson SG, Li MM, Kobren SN, Kohane IS. Simulation of undiagnosed patients with novel genetic conditions. Nat Commun 2023; 14:6403. [PMID: 37828001 PMCID: PMC10570269 DOI: 10.1038/s41467-023-41980-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 09/26/2023] [Indexed: 10/14/2023] Open
Abstract
Rare Mendelian disorders pose a major diagnostic challenge and collectively affect 300-400 million patients worldwide. Many automated tools aim to uncover causal genes in patients with suspected genetic disorders, but evaluation of these tools is limited due to the lack of comprehensive benchmark datasets that include previously unpublished conditions. Here, we present a computational pipeline that simulates realistic clinical datasets to address this deficit. Our framework jointly simulates complex phenotypes and challenging candidate genes and produces patients with novel genetic conditions. We demonstrate the similarity of our simulated patients to real patients from the Undiagnosed Diseases Network and evaluate common gene prioritization methods on the simulated cohort. These prioritization methods recover known gene-disease associations but perform poorly on diagnosing patients with novel genetic disorders. Our publicly-available dataset and codebase can be utilized by medical genetics researchers to evaluate, compare, and improve tools that aid in the diagnostic process.
Collapse
Grants
- U01 HG007690 NHGRI NIH HHS
- U54 NS108251 NINDS NIH HHS
- U01 HG010219 NHGRI NIH HHS
- U01 HG007672 NHGRI NIH HHS
- U01 HG010233 NHGRI NIH HHS
- U01 HG010230 NHGRI NIH HHS
- U01 HG007943 NHGRI NIH HHS
- U01 HG010217 NHGRI NIH HHS
- U01 HG007942 NHGRI NIH HHS
- U01 HG010215 NHGRI NIH HHS
- U01 HG007708 NHGRI NIH HHS
- T32 HG002295 NHGRI NIH HHS
- T32 GM007753 NIGMS NIH HHS
- U01 HG007674 NHGRI NIH HHS
- U01 TR001395 NCATS NIH HHS
- U01 HG007709 NHGRI NIH HHS
- U54 NS093793 NINDS NIH HHS
- U01 HG007530 NHGRI NIH HHS
- U01 TR002471 NCATS NIH HHS
- U01 HG007703 NHGRI NIH HHS
- UDN research reported in this manuscript was supported by the NIH Common Fund, through the Office of Strategic Coordination/Office of the NIH Director under Award Number(s) U01HG007709, U01HG010219, U01HG010230, U01HG010217, U01HG010233, U01HG010215, U01HG007672, U01HG007690, U01HG007708, U01HG007703, U01HG007674, U01HG007530, U01HG007942, U01HG007943, U01TR001395, U01TR002471, U54NS108251, and U54NS093793.
- E.A. is supported by a Microsoft Research PhD Fellowship.
- S.F. is supported by award Number T32GM007753 from the National Institute of General Medical Sciences.
- M.L. is supported by T32HG002295 from the National Human Genome Research Institute and a National Science Foundation Graduate Research Fellowship.
Collapse
Affiliation(s)
- Emily Alsentzer
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
- Program in Health Sciences and Technology, MIT, Cambridge, MA, 02139, USA
| | - Samuel G Finlayson
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
- Program in Health Sciences and Technology, MIT, Cambridge, MA, 02139, USA
- Department of Pediatrics, Division of Genetic Medicine, Seattle Children's Hospital, Seattle, WA, 98105, USA
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA, 98105, USA
| | - Michelle M Li
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
- Bioinformatics and Integrative Genomics, Harvard Medical School, Boston, MA, 02115, USA
| | - Shilpa N Kobren
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA.
| | - Isaac S Kohane
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA.
| |
Collapse
|
15
|
Maassen W, Legger G, Kul Cinar O, van Daele P, Gattorno M, Bader-Meunier B, Wouters C, Briggs T, Johansson L, van der Velde J, Swertz M, Omoyinmi E, Hoppenreijs E, Belot A, Eleftheriou D, Caorsi R, Aeschlimann F, Boursier G, Brogan P, Haimel M, van Gijn M. Curation and expansion of the Human Phenotype Ontology for systemic autoinflammatory diseases improves phenotype-driven disease-matching. Front Immunol 2023; 14:1215869. [PMID: 37781402 PMCID: PMC10536149 DOI: 10.3389/fimmu.2023.1215869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Accepted: 08/09/2023] [Indexed: 10/03/2023] Open
Abstract
Introduction Accurate and standardized phenotypic descriptions are essential in diagnosing rare diseases and discovering new diseases, and the Human Phenotype Ontology (HPO) system was developed to provide a rich collection of hierarchical phenotypic descriptions. However, although the HPO terms for inborn errors of immunity have been improved and curated, it has not been investigated whether this curation improves the diagnosis of systemic autoinflammatory disease (SAID) patients. Here, we aimed to study if improved HPO annotation for SAIDs enhanced SAID identification and to demonstrate the potential of phenotype-driven genome diagnostics using curated HPO terms for SAIDs. Methods We collected HPO terms from 98 genetically confirmed SAID patients across eight different European SAID expertise centers and used the LIRICAL (Likelihood Ratio Interpretation of Clinical Abnormalities) computational algorithm to estimate the effect of HPO curation on the prioritization of the correct SAID for each patient. Results Our results show that the percentage of correct diagnoses increased from 66% to 86% and that the number of diagnoses with the highest ranking increased from 38 to 45. In a further pilot study, curation also improved HPO-based whole-exome sequencing (WES) analysis, diagnosing 10/12 patients before and 12/12 after curation. In addition, the average number of candidate diseases that needed to be interpreted decreased from 35 to 2. Discussion This study demonstrates that curation of HPO terms can increase identification of the correct diagnosis, emphasizing the high potential of HPO-based genome diagnostics for SAIDs.
Collapse
Affiliation(s)
- Willem Maassen
- Genomics Coordination Centre, Department of Genetics, University Medical Centre Groningen, University of Groningen, Groningen, Netherlands
| | - Geertje Legger
- Department of Rheumatology and Clinical Immunology, University Medical Centre Groningen, University of Groningen, Groningen, Netherlands
| | - Ovgu Kul Cinar
- Department of Paediatric Rheumatology, Great Ormond Street Hospital for Children National Health Service Trust, London, United Kingdom
| | - Paul van Daele
- Department of Internal Medicine, Erasmus Medical Centre, Rotterdam, Netherlands
- Department of Immunology, Erasmus Medical Centre, Rotterdam, Netherlands
| | - Marco Gattorno
- UOC Reumatologia e Malattie Autoinfiammatorie, IRCCS Istituto Giannini Gaslini, Genoa, Italy
| | - Brigitte Bader-Meunier
- Department of Paediatric Immunology-Hematology and Rheumatology, Necker University Hospital - APHP, Paris, France
- Laboratory of Immunogenetics of Paediatric Autoimmune Diseases, UMR 1163, Imagine Institute, INSERM, Paris, France
| | - Carine Wouters
- Department of Pediatric Rheumatology, University Hospital Leuven, Leuven, Belgium
| | - Tracy Briggs
- Division of Evolution and Genomic Sciences, School of Biological Sciences, University of Manchester, Manchester, United Kingdom
- Manchester Centre for Genomic Medicine, St Mary’s Hospital, Manchester University Hospitals National Health Service Foundation Trust, Manchester, United Kingdom
| | - Lennart Johansson
- Genomics Coordination Centre, Department of Genetics, University Medical Centre Groningen, University of Groningen, Groningen, Netherlands
| | - Joeri van der Velde
- Genomics Coordination Centre, Department of Genetics, University Medical Centre Groningen, University of Groningen, Groningen, Netherlands
| | - Morris Swertz
- Genomics Coordination Centre, Department of Genetics, University Medical Centre Groningen, University of Groningen, Groningen, Netherlands
| | - Ebun Omoyinmi
- Department of Paediatric Rheumatology, Great Ormond Street Hospital for Children National Health Service Trust, London, United Kingdom
| | - Esther Hoppenreijs
- Department of Pediatric Rheumatology, Pediatrics, Radboud University Medical Center, Nijmegen, Netherlands
| | - Alexandre Belot
- National Referee Centre for Rheumatic and AutoImmune and Systemic Diseases in Children (RAISE), Pediatric Nephrology, Rheumatology, Dermatology Unit, INSERM, Hospital of Mother and Child, Hospices Civils of Lyon, Lyon, France
- International Center of Infectiology Research (CIRI), University of Lyon, INSERM, Claude Bernard University, Lyon, France
| | - Despina Eleftheriou
- Department of Paediatric Rheumatology, Great Ormond Street Hospital for Children National Health Service Trust, London, United Kingdom
| | - Roberta Caorsi
- UOC Reumatologia e Malattie Autoinfiammatorie, IRCCS Istituto Giannini Gaslini, Genoa, Italy
| | - Florence Aeschlimann
- Department of Paediatric Immunology-Hematology and Rheumatology, Necker University Hospital - APHP, Paris, France
- Division of Pediatric Rheumatology, University Children’s Hospital Basel, Basel, Switzerland
| | - Guilaine Boursier
- Laboratory of Rare and Autoinflammatory Genetic Diseases and Reference Centre for Autoinflammatory Diseases and Amyloidosis (CEREMAIA), Department of Medical Genetics, Rare Diseases and Personalized Medicine, CHU Montpellier, University of Montpellier, Montpellier, France
| | - Paul Brogan
- Inflammation and Rheumatology Section, University College London Great Ormond Street Institute of Child Health, London, United Kingdom
| | | | - Marielle van Gijn
- Department of Genetics, University Medical Centre Groningen, University of Groningen, Groningen, Netherlands
| |
Collapse
|
16
|
Huang D, Jiang J, Zhao T, Wu S, Li P, Lyu Y, Feng J, Wei M, Zhu Z, Gu J, Ren Y, Yu G, Lu H. diseaseGPS: auxiliary diagnostic system for genetic disorders based on genotype and phenotype. Bioinformatics 2023; 39:btad517. [PMID: 37647638 PMCID: PMC10500091 DOI: 10.1093/bioinformatics/btad517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2023] [Revised: 07/24/2023] [Accepted: 08/29/2023] [Indexed: 09/01/2023] Open
Abstract
SUMMARY The next-generation sequencing brought opportunities for the diagnosis of genetic disorders due to its high-throughput capabilities. However, the majority of existing methods were limited to only sequencing candidate variants, and the process of linking these variants to a diagnosis of genetic disorders still required medical professionals to consult databases. Therefore, we introduce diseaseGPS, an integrated platform for the diagnosis of genetic disorders that combines both phenotype and genotype data for analysis. It offers not only a user-friendly GUI web application for those without a programming background but also scripts that can be executed in batch mode for bioinformatics professionals. The genetic and phenotypic data are integrated using the ACMG-Bayes method and a novel phenotypic similarity method, to prioritize the results of genetic disorders. diseaseGPS was evaluated on 6085 cases from Deciphering Developmental Disorders project and 187 cases from Shanghai Children's hospital. The results demonstrated that diseaseGPS performed better than other commonly used methods. AVAILABILITY AND IMPLEMENTATION diseaseGPS is available to freely accessed at https://diseasegps.sjtu.edu.cn with source code at https://github.com/BioHuangDY/diseaseGPS.
Collapse
Affiliation(s)
- Daoyi Huang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
- SJTU-Yale Joint Center for Biostatistics and Data Science, National Center for Translational Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Jianping Jiang
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
- SJTU-Yale Joint Center for Biostatistics and Data Science, National Center for Translational Medicine, Shanghai Jiao Tong University, Shanghai, China
- Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Tingting Zhao
- Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
- Shanghai Engineering Research Center for Big Data in Pediatric Precision Medicine, Shanghai, China
| | - Shengnan Wu
- Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Pin Li
- Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Yongfen Lyu
- Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Jincai Feng
- Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Mingyue Wei
- Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Zhixing Zhu
- Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
- Shanghai Engineering Research Center for Big Data in Pediatric Precision Medicine, Shanghai, China
| | - Jianlei Gu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
- SJTU-Yale Joint Center for Biostatistics and Data Science, National Center for Translational Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Yongyong Ren
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
- SJTU-Yale Joint Center for Biostatistics and Data Science, National Center for Translational Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Guangjun Yu
- Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
- Shanghai Engineering Research Center for Big Data in Pediatric Precision Medicine, Shanghai, China
- School of Medicine, The Chinese University of Hong Kong, Shenzhen, Guangdong, China
| | - Hui Lu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences, Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
- SJTU-Yale Joint Center for Biostatistics and Data Science, National Center for Translational Medicine, Shanghai Jiao Tong University, Shanghai, China
- Shanghai Children’s Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China
| |
Collapse
|
17
|
Forwood C, Ashton K, Zhu Y, Zhang F, Dias K, Standen K, Evans C, Carey L, Cardamone M, Shalhoub C, Katf H, Riveros C, Hsieh T, Krawitz P, Robinson PN, Dudding‐Byth T, Sadikovic B, Pinner J, Buckley MF, Roscioli T. Integration of EpiSign, facial phenotyping, and likelihood ratio interpretation of clinical abnormalities in the re-classification of an ARID1B missense variant. AMERICAN JOURNAL OF MEDICAL GENETICS. PART C, SEMINARS IN MEDICAL GENETICS 2023; 193:e32056. [PMID: 37654076 PMCID: PMC10952833 DOI: 10.1002/ajmg.c.32056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 07/14/2023] [Indexed: 09/02/2023]
Abstract
Heterozygous ARID1B variants result in Coffin-Siris syndrome. Features may include hypoplastic nails, slow growth, characteristic facial features, hypotonia, hypertrichosis, and sparse scalp hair. Most reported cases are due to ARID1B loss of function variants. We report a boy with developmental delay, feeding difficulties, aspiration, recurrent respiratory infections, slow growth, and hypotonia without a clinical diagnosis, where a previously unreported ARID1B missense variant was classified as a variant of uncertain significance. The pathogenicity of this variant was refined through combined methodologies including genome-wide methylation signature analysis (EpiSign), Machine Learning (ML) facial phenotyping, and LIRICAL. Trio exome sequencing and EpiSign were performed. ML facial phenotyping compared facial images using FaceMatch and GestaltMatcher to syndrome-specific libraries to prioritize the trio exome bioinformatic pipeline gene list output. Phenotype-driven variant prioritization was performed with LIRICAL. A de novo heterozygous missense variant, ARID1B p.(Tyr1268His), was reported as a variant of uncertain significance. The ACMG classification was refined to likely pathogenic by a supportive methylation signature, ML facial phenotyping, and prioritization through LIRICAL. The ARID1B genotype-phenotype has been expanded through an extended analysis of missense variation through genome-wide methylation signatures, ML facial phenotyping, and likelihood-ratio gene prioritization.
Collapse
Affiliation(s)
- Caitlin Forwood
- NSW Health Pathology Randwick GenomicsPrince of Wales HospitalSydneyAustralia
- Centre for Clinical GeneticsSydney Children's HospitalRandwickAustralia
- Neuroscience Research Australia (NeuRA)University of New South WalesSydneyAustralia
| | - Katie Ashton
- NSW Health Pathology Randwick GenomicsPrince of Wales HospitalSydneyAustralia
| | - Ying Zhu
- NSW Health Pathology Randwick GenomicsPrince of Wales HospitalSydneyAustralia
| | - Futao Zhang
- NSW Health Pathology Randwick GenomicsPrince of Wales HospitalSydneyAustralia
| | - Kerith‐Rae Dias
- Neuroscience Research Australia (NeuRA)University of New South WalesSydneyAustralia
| | - Krystle Standen
- NSW Health Pathology Randwick GenomicsPrince of Wales HospitalSydneyAustralia
| | - Carey‐Anne Evans
- Neuroscience Research Australia (NeuRA)University of New South WalesSydneyAustralia
| | - Louise Carey
- NSW Health Pathology Randwick GenomicsPrince of Wales HospitalSydneyAustralia
| | - Michael Cardamone
- Sydney Children's HospitalRandwickAustralia
- School of Women's and Children's HealthUNSWSydneyAustralia
| | - Carolyn Shalhoub
- Centre for Clinical GeneticsSydney Children's HospitalRandwickAustralia
| | - Hala Katf
- Sydney Children's HospitalRandwickAustralia
| | - Carlos Riveros
- Bioinformatics, Hunter Medical Research InstituteNewcastleAustralia
| | - Tzung‐Chien Hsieh
- Institute for Genomic Statistics and BioinformaticsUniversity Hospital BonnBonnGermany
| | - Peter Krawitz
- Institute for Genomic Statistics and BioinformaticsUniversity Hospital BonnBonnGermany
| | - Peter N Robinson
- JAX Center for Precision GeneticsThe JAX Cancer CenterFarmingtonConnecticutUSA
| | | | - Bekim Sadikovic
- London Health Sciences Centre, Verspeeten Clinical Genome CentreWestern UniversityLondonCanada
| | - Jason Pinner
- Centre for Clinical GeneticsSydney Children's HospitalRandwickAustralia
- School of Women's and Children's HealthUNSWSydneyAustralia
| | - Michael F. Buckley
- NSW Health Pathology Randwick GenomicsPrince of Wales HospitalSydneyAustralia
| | - Tony Roscioli
- NSW Health Pathology Randwick GenomicsPrince of Wales HospitalSydneyAustralia
- Neuroscience Research Australia (NeuRA)University of New South WalesSydneyAustralia
- School of Clinical MedicineUNSWSydneyAustralia
| |
Collapse
|
18
|
Dingemans AJM, Hinne M, Truijen KMG, Goltstein L, van Reeuwijk J, de Leeuw N, Schuurs-Hoeijmakers J, Pfundt R, Diets IJ, den Hoed J, de Boer E, Coenen-van der Spek J, Jansen S, van Bon BW, Jonis N, Ockeloen CW, Vulto-van Silfhout AT, Kleefstra T, Koolen DA, Campeau PM, Palmer EE, Van Esch H, Lyon GJ, Alkuraya FS, Rauch A, Marom R, Baralle D, van der Sluijs PJ, Santen GWE, Kooy RF, van Gerven MAJ, Vissers LELM, de Vries BBA. PhenoScore quantifies phenotypic variation for rare genetic diseases by combining facial analysis with other clinical features using a machine-learning framework. Nat Genet 2023; 55:1598-1607. [PMID: 37550531 DOI: 10.1038/s41588-023-01469-w] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 07/05/2023] [Indexed: 08/09/2023]
Abstract
Several molecular and phenotypic algorithms exist that establish genotype-phenotype correlations, including facial recognition tools. However, no unified framework that investigates both facial data and other phenotypic data directly from individuals exists. We developed PhenoScore: an open-source, artificial intelligence-based phenomics framework, combining facial recognition technology with Human Phenotype Ontology data analysis to quantify phenotypic similarity. Here we show PhenoScore's ability to recognize distinct phenotypic entities by establishing recognizable phenotypes for 37 of 40 investigated syndromes against clinical features observed in individuals with other neurodevelopmental disorders and show it is an improvement on existing approaches. PhenoScore provides predictions for individuals with variants of unknown significance and enables sophisticated genotype-phenotype studies by testing hypotheses on possible phenotypic (sub)groups. PhenoScore confirmed previously known phenotypic subgroups caused by variants in the same gene for SATB1, SETBP1 and DEAF1 and provides objective clinical evidence for two distinct ADNP-related phenotypes, already established functionally.
Collapse
Affiliation(s)
- Alexander J M Dingemans
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
- Department of Artificial Intelligence, Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, the Netherlands
| | - Max Hinne
- Department of Artificial Intelligence, Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, the Netherlands
| | - Kim M G Truijen
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Lia Goltstein
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Jeroen van Reeuwijk
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Nicole de Leeuw
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Janneke Schuurs-Hoeijmakers
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Rolph Pfundt
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Illja J Diets
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Joery den Hoed
- Language and Genetics Department, Max Planck Institute for Psycholinguistics, Nijmegen, the Netherlands
| | - Elke de Boer
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Jet Coenen-van der Spek
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Sandra Jansen
- Department of Human Genetics, Amsterdam UMC, University of Amsterdam, Amsterdam, the Netherlands
| | - Bregje W van Bon
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Noraly Jonis
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Charlotte W Ockeloen
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Anneke T Vulto-van Silfhout
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Tjitske Kleefstra
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - David A Koolen
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Philippe M Campeau
- Department of Pediatrics, University of Montreal, Montreal, Quebec, Canada
| | - Elizabeth E Palmer
- Faculty of Medicine and Health, UNSW Sydney, Sydney, New South Wales, Australia
- Sydney Children's Hospitals Network, Sydney, New South Wales, Australia
| | - Hilde Van Esch
- Center for Human Genetics, University Hospitals Leuven, University of Leuven, Leuven, Belgium
| | - Gholson J Lyon
- Department of Human Genetics and George A. Jervis Clinic, Institute for Basic Research in Developmental Disabilities (IBR), Staten Island, NY, USA
- Biology PhD Program, The Graduate Center, The City University of New York, New York City, NY, USA
| | - Fowzan S Alkuraya
- Department of Translational Genomics, Center for Genomic Medicine, King Faisal Specialist Hospital and Research Center, Riyadh, Saudi Arabia
| | - Anita Rauch
- Institute of Medical Genetics, University of Zürich, Zürich, Switzerland
| | - Ronit Marom
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Diana Baralle
- Faculty of Medicine, University of Southampton, Southampton, UK
| | | | - Gijs W E Santen
- Department of Clinical Genetics, Leiden University Medical Center, Leiden, the Netherlands
| | - R Frank Kooy
- Department of Medical Genetics, University of Antwerp, Antwerp, Belgium
| | - Marcel A J van Gerven
- Department of Artificial Intelligence, Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, the Netherlands
| | - Lisenka E L M Vissers
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands.
| | - Bert B A de Vries
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands.
| |
Collapse
|
19
|
Moller-Hansen A, Hejla D, Lee HK, Lyles JB, Yang Y, Chen K, Li WL, Thomas G, Boerkoel CF. Do PACS1 variants impeding adaptor protein binding predispose to syndromic intellectual disability? Am J Med Genet A 2023; 191:2181-2187. [PMID: 37141437 PMCID: PMC10524240 DOI: 10.1002/ajmg.a.63232] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2023] [Revised: 04/16/2023] [Accepted: 04/20/2023] [Indexed: 05/06/2023]
Abstract
To date, PACS1-neurodevelopmental disorder (PACS1-NDD) has been associated with recurrent variation of Arg203 and is considered diagnostic of PACS1-NDD, an autosomal dominant syndromic intellectual disability disorder. Although incompletely defined, the proposed disease mechanism for this variant is altered PACS1 affinity for its client proteins. Given this proposed mechanism, we hypothesized that PACS1 variants that interfere with binding of adaptor proteins might also give rise to syndromic intellectual disability. Herein, we report a proposita and her mother with phenotypic features overlapping PACS1-NDD and a novel PACS1 variant (NM_018026.3:c.[755C > T];[=], p.(Ser252Phe)) that impedes binding of the adaptor protein GGA3 (Golgi-associated, gamma-adaptin ear-containing, ARF-binding protein 3). We hypothesize that attenuating PACS1 binding of GGA3 also gives rise to a disorder with features overlapping those of PACS1-NDD. This observation better delineates the mechanism by which PACS1 variation predisposes to syndromic intellectual disability.
Collapse
Affiliation(s)
- Ashley Moller-Hansen
- Department of Medical Genetics and Provincial Medical Genetics Program, University of British Columbia and Women's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | - Duha Hejla
- Department of Pediatrics, University of British Columbia and Children's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | - Hyun Kyung Lee
- Department of Medical Genetics and Provincial Medical Genetics Program, University of British Columbia and Women's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | - Jenea Barbara Lyles
- Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA
- University of Pittsburgh Cancer Institute, Pittsburgh, Pennsylvania, USA
- University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA
| | - Yunhan Yang
- Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA
- University of Pittsburgh Cancer Institute, Pittsburgh, Pennsylvania, USA
- University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA
| | - Kun Chen
- Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA
- University of Pittsburgh Cancer Institute, Pittsburgh, Pennsylvania, USA
- University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA
| | | | - Gary Thomas
- Department of Microbiology and Molecular Genetics, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA
- University of Pittsburgh Cancer Institute, Pittsburgh, Pennsylvania, USA
- University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA
| | - Cornelius F Boerkoel
- Department of Medical Genetics and Provincial Medical Genetics Program, University of British Columbia and Women's Hospital of British Columbia, Vancouver, British Columbia, Canada
| |
Collapse
|
20
|
Lesmann H, Klinkhammer H, M. Krawitz PDMDPP. The future role of facial image analysis in ACMG classification guidelines. MED GENET-BERLIN 2023; 35:115-121. [PMID: 38840866 PMCID: PMC10842539 DOI: 10.1515/medgen-2023-2014] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2024]
Abstract
The use of next-generation sequencing (NGS) has dramatically improved the diagnosis of rare diseases. However, the analysis of genomic data has become complex with the increasing detection of variants by exome and genome sequencing. The American College of Medical Genetics and Genomics (ACMG) and the Association for Molecular Pathology (AMP) developed a 5-tier classification scheme in 2015 for variant interpretation, that has since been widely adopted. Despite efforts to minimise discrepancies in the application of these criteria, inconsistencies still occur. Further specifications for individual genes were developed by Variant Curation Expert Panels (VCEPs) of the Clinical Genome Resource (ClinGen) consortium, that also take into consideration gene or disease specific features. For instance, in disorders with a highly characerstic facial gestalt a "phenotypic match" (PP4) has higher pathogenic evidence than e.g. in a non-syndromic form of intellectual disability. With computational approaches for quantifying the similarity of dysmorphic features results of such analysis can now be used in a refined Bayesian framework for the ACMG/AMP criteria.
Collapse
Affiliation(s)
- Hellen Lesmann
- University of Bonn, Medical Faculty & University Hospital BonnInstitute of Human GeneticsVenusberg-Campus 153127BonnGermany
| | - Hannah Klinkhammer
- University of BonnInstitute for Genomic Statistics and BioinformaticsBonnGermany
| | | |
Collapse
|
21
|
Danis D, Jacobsen JOB, Wagner AH, Groza T, Beckwith MA, Rekerle L, Carmody LC, Reese J, Hegde H, Ladewig MS, Seitz B, Munoz-Torres M, Harris NL, Rambla J, Baudis M, Mungall CJ, Haendel MA, Robinson PN. Phenopacket-tools: Building and validating GA4GH Phenopackets. PLoS One 2023; 18:e0285433. [PMID: 37196000 PMCID: PMC10191354 DOI: 10.1371/journal.pone.0285433] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2023] [Accepted: 04/21/2023] [Indexed: 05/19/2023] Open
Abstract
The Global Alliance for Genomics and Health (GA4GH) is a standards-setting organization that is developing a suite of coordinated standards for genomics. The GA4GH Phenopacket Schema is a standard for sharing disease and phenotype information that characterizes an individual person or biosample. The Phenopacket Schema is flexible and can represent clinical data for any kind of human disease including rare disease, complex disease, and cancer. It also allows consortia or databases to apply additional constraints to ensure uniform data collection for specific goals. We present phenopacket-tools, an open-source Java library and command-line application for construction, conversion, and validation of phenopackets. Phenopacket-tools simplifies construction of phenopackets by providing concise builders, programmatic shortcuts, and predefined building blocks (ontology classes) for concepts such as anatomical organs, age of onset, biospecimen type, and clinical modifiers. Phenopacket-tools can be used to validate the syntax and semantics of phenopackets as well as to assess adherence to additional user-defined requirements. The documentation includes examples showing how to use the Java library and the command-line tool to create and validate phenopackets. We demonstrate how to create, convert, and validate phenopackets using the library or the command-line application. Source code, API documentation, comprehensive user guide and a tutorial can be found at https://github.com/phenopackets/phenopacket-tools. The library can be installed from the public Maven Central artifact repository and the application is available as a standalone archive. The phenopacket-tools library helps developers implement and standardize the collection and exchange of phenotypic and other clinical data for use in phenotype-driven genomic diagnostics, translational research, and precision medicine applications.
Collapse
Affiliation(s)
- Daniel Danis
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, United States of America
| | - Julius O. B. Jacobsen
- William Harvey Research Institute, Queen Mary University of London, London, United Kingdom
| | - Alex H. Wagner
- Departments of Pediatrics and Biomedical Informatics, The Ohio State University College of Medicine, Columbus, OH, United States of America
- The Steve and Cindy Rasmussen Institute for Genomic Medicine, Nationwide Children’s Hospital, Columbus, OH, United States of America
| | | | - Martha A. Beckwith
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, United States of America
| | - Lauren Rekerle
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, United States of America
| | - Leigh C. Carmody
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, United States of America
| | - Justin Reese
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, United States of America
| | - Harshad Hegde
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, United States of America
| | - Markus S. Ladewig
- Department of Ophthalmology, Klinikum Saarbrücken, Saarbrücken, Germany
| | - Berthold Seitz
- Department of Ophthalmology, Saarland University Medical Center, Homburg/Saar, Germany
| | - Monica Munoz-Torres
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, United States of America
| | - Nomi L. Harris
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, United States of America
| | - Jordi Rambla
- European Genome-Phenome Archive (EGA) in the Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Michael Baudis
- University of Zurich and Swiss Institute of Bioinformatics, Zurich, Switzerland
| | - Christopher J. Mungall
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, United States of America
| | - Melissa A. Haendel
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, United States of America
| | - Peter N. Robinson
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, United States of America
- Institute for Systems Genomics, University of Connecticut, Farmington, CT, United States of America
| |
Collapse
|
22
|
Licata L, Via A, Turina P, Babbi G, Benevenuta S, Carta C, Casadio R, Cicconardi A, Facchiano A, Fariselli P, Giordano D, Isidori F, Marabotti A, Martelli PL, Pascarella S, Pinelli M, Pippucci T, Russo R, Savojardo C, Scafuri B, Valeriani L, Capriotti E. Resources and tools for rare disease variant interpretation. Front Mol Biosci 2023; 10:1169109. [PMID: 37234922 PMCID: PMC10206239 DOI: 10.3389/fmolb.2023.1169109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2023] [Accepted: 04/25/2023] [Indexed: 05/28/2023] Open
Abstract
Collectively, rare genetic disorders affect a substantial portion of the world's population. In most cases, those affected face difficulties in receiving a clinical diagnosis and genetic characterization. The understanding of the molecular mechanisms of these diseases and the development of therapeutic treatments for patients are also challenging. However, the application of recent advancements in genome sequencing/analysis technologies and computer-aided tools for predicting phenotype-genotype associations can bring significant benefits to this field. In this review, we highlight the most relevant online resources and computational tools for genome interpretation that can enhance the diagnosis, clinical management, and development of treatments for rare disorders. Our focus is on resources for interpreting single nucleotide variants. Additionally, we present use cases for interpreting genetic variants in clinical settings and review the limitations of these results and prediction tools. Finally, we have compiled a curated set of core resources and tools for analyzing rare disease genomes. Such resources and tools can be utilized to develop standardized protocols that will enhance the accuracy and effectiveness of rare disease diagnosis.
Collapse
Affiliation(s)
- Luana Licata
- Department of Biology, University of Rome Tor Vergata, Roma, Italy
| | - Allegra Via
- Department of Biochemical Sciences “A. Rossi Fanelli”, University of Rome “La Sapienza”, Roma, Italy
| | - Paola Turina
- Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| | - Giulia Babbi
- Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| | | | - Claudio Carta
- National Centre for Rare Diseases, Istituto Superiore di Sanità, Roma, Italy
| | - Rita Casadio
- Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| | - Andrea Cicconardi
- Department of Physics, University of Genova, Genova, Italy
- Italiano di Tecnologia—IIT, Genova, Italy
| | - Angelo Facchiano
- National Research Council, Institute of Food Science, Avellino, Italy
| | - Piero Fariselli
- Department of Medical Sciences, University of Torino, Torino, Italy
| | - Deborah Giordano
- National Research Council, Institute of Food Science, Avellino, Italy
| | - Federica Isidori
- Medical Genetics Unit, IRCCS Azienda Ospedaliero-Universitaria di Bologna, Bologna, Italy
| | - Anna Marabotti
- Department of Chemistry and Biology “A. Zambelli”, University of Salerno, Fisciano, SA, Italy
| | - Pier Luigi Martelli
- Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| | - Stefano Pascarella
- Department of Biochemical Sciences “A. Rossi Fanelli”, University of Rome “La Sapienza”, Roma, Italy
| | - Michele Pinelli
- Department of Molecular Medicine and Medical Biotechnology, University of Naples Federico II, Napoli, Italy
| | - Tommaso Pippucci
- Medical Genetics Unit, IRCCS Azienda Ospedaliero-Universitaria di Bologna, Bologna, Italy
| | - Roberta Russo
- Department of Molecular Medicine and Medical Biotechnology, University of Naples Federico II, Napoli, Italy
- CEINGE Biotecnologie Avanzate Franco Salvatore, Napoli, Italy
| | - Castrense Savojardo
- Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| | - Bernardina Scafuri
- Department of Chemistry and Biology “A. Zambelli”, University of Salerno, Fisciano, SA, Italy
| | | | - Emidio Capriotti
- Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| |
Collapse
|
23
|
Yang J, Shu L, Duan H, Li H. A Robust Phenotype-driven Likelihood Ratio Analysis Approach Assisting Interpretable Clinical Diagnosis of Rare Diseases. J Biomed Inform 2023; 142:104372. [PMID: 37105510 DOI: 10.1016/j.jbi.2023.104372] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Revised: 02/20/2023] [Accepted: 04/20/2023] [Indexed: 04/29/2023]
Abstract
Phenotype-based prioritization of candidate genes and diseases has become a well-established approach for multi-omics diagnostics of rare diseases. Most current algorithms exploit semantic analysis and probabilistic statistics based on Human Phenotype Ontology and are commonly superior to naive search methods. However, these algorithms are mostly less interpretable and do not perform well in real clinical scenarios due to noise and imprecision of query terms, and the fact that individuals may not display all phenotypes of the disease they belong to. We present a Phenotype-driven Likelihood Ratio analysis approach (PheLR) assisting interpretable clinical diagnosis of rare diseases. With a likelihood ratio paradigm, PheLR estimates the posterior probability of candidate diseases and how much a phenotypic feature contributes to the prioritization result. Benchmarked using simulated and realistic patients, PheLR shows significant advantages over current approaches and is robust to noise and inaccuracy. To facilitate clinical practice and visualized differential diagnosis, PheLR is implemented as an online web tool (http://phelr.nbscn.org).
Collapse
Affiliation(s)
- Jian Yang
- Clinical Data Center, The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Zhejiang, China; The College of Biomedical Engineering and Instrument Science, Zhejiang University, Zhejiang, China
| | - Liqi Shu
- Rhode Island Hospital, Warren Alpert Medical School of Brown University, Rhode Island, USA
| | - Huilong Duan
- The College of Biomedical Engineering and Instrument Science, Zhejiang University, Zhejiang, China
| | - Haomin Li
- Clinical Data Center, The Children's Hospital, Zhejiang University School of Medicine, National Clinical Research Center for Child Health, Zhejiang, China.
| |
Collapse
|
24
|
Ladewig MS, Jacobsen JOB, Wagner AH, Danis D, El Kassaby B, Gargano M, Groza T, Baudis M, Steinhaus R, Seelow D, Bechrakis NE, Mungall CJ, Schofield PN, Elemento O, Smith L, McMurry JA, Munoz‐Torres M, Haendel MA, Robinson PN. GA4GH Phenopackets: A Practical Introduction. ADVANCED GENETICS (HOBOKEN, N.J.) 2023; 4:2200016. [PMID: 36910590 PMCID: PMC10000265 DOI: 10.1002/ggn2.202200016] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 06/30/2022] [Indexed: 11/08/2022]
Abstract
The Global Alliance for Genomics and Health (GA4GH) is developing a suite of coordinated standards for genomics for healthcare. The Phenopacket is a new GA4GH standard for sharing disease and phenotype information that characterizes an individual person, linking that individual to detailed phenotypic descriptions, genetic information, diagnoses, and treatments. A detailed example is presented that illustrates how to use the schema to represent the clinical course of a patient with retinoblastoma, including demographic information, the clinical diagnosis, phenotypic features and clinical measurements, an examination of the extirpated tumor, therapies, and the results of genomic analysis. The Phenopacket Schema, together with other GA4GH data and technical standards, will enable data exchange and provide a foundation for the computational analysis of disease and phenotype information to improve our ability to diagnose and conduct research on all types of disorders, including cancer and rare diseases.
Collapse
Affiliation(s)
- Markus S. Ladewig
- Department of OphthalmologyKlinikum Saarbrücken66119SaarbrückenGermany
| | - Julius O. B. Jacobsen
- William Harvey Research InstituteCharterhouse SquareBarts and the London School of Medicine and Dentistry QueenQueen Mary University of LondonLondonEC1M 6BQUK
| | - Alex H. Wagner
- Departments of Pediatrics and Biomedical InformaticsThe Ohio State University College of MedicineColumbusOH43210USA
- The Steve and Cindy Rasmussen Institute for Genomic MedicineNationwide Children's HospitalColumbusOH43215USA
| | - Daniel Danis
- The Jackson Laboratory for Genomic Medicine10 Discovery DriveFarmingtonCT06032USA
| | - Baha El Kassaby
- The Jackson Laboratory for Genomic Medicine10 Discovery DriveFarmingtonCT06032USA
| | - Michael Gargano
- The Jackson Laboratory for Genomic Medicine10 Discovery DriveFarmingtonCT06032USA
| | - Tudor Groza
- European Molecular Biology LaboratoryEuropean Bioinformatics Institute (EMBL‐EBI)CambridgeCB10 1SDUK
| | - Michael Baudis
- Department of Molecular Life Sciences and Swiss Institute of BioinformaticsUniversity of ZurichZurichSwitzerland
| | - Robin Steinhaus
- Exploratory Diagnostic SciencesBerlin Institute of Health at Charité – Universitätsmedizin Berlin10178BerlinGermany
- Institute of Medical Genetics and Human GeneticsCharité – Universitätsmedizin BerlinCorporate Member of Freie Universität Berlin and Humboldt‐Universität zu Berlin13353BerlinGermany
| | - Dominik Seelow
- Exploratory Diagnostic SciencesBerlin Institute of Health at Charité – Universitätsmedizin Berlin10178BerlinGermany
- Institute of Medical Genetics and Human GeneticsCharité – Universitätsmedizin BerlinCorporate Member of Freie Universität Berlin and Humboldt‐Universität zu Berlin13353BerlinGermany
| | | | - Christopher J. Mungall
- Lawrence Berkeley National LaboratoryEnvironmental Genomics and Systems BiologyBerkeleyCA94720USA
| | - Paul N. Schofield
- Department of Physiology Development and NeuroscienceUniversity of CambridgeDowning StreetCambridgeCB2 3EGUK
- The Jackson LaboratoryBar HarborME04609USA
| | - Olivier Elemento
- Caryl and Israel Englander Institute for Precision MedicineWeill Cornell MedicineNew YorkNY10021USA
| | - Lindsay Smith
- Ontario Institute for Cancer ResearchAdaptive OncologyTorontoCAM5G0A3USA
- Global Alliance for Genomics and HealthTorontoCAM5G0A3USA
| | - Julie A. McMurry
- Center for Health AIUniversity of Colorado Anschutz Medical CampusAuroraCO80045USA
| | - Monica Munoz‐Torres
- Center for Health AIUniversity of Colorado Anschutz Medical CampusAuroraCO80045USA
| | - Melissa A. Haendel
- Center for Health AIUniversity of Colorado Anschutz Medical CampusAuroraCO80045USA
| | - Peter N. Robinson
- The Jackson Laboratory for Genomic Medicine10 Discovery DriveFarmingtonCT06032USA
- Institute for Systems GenomicsUniversity of ConnecticutFarmingtonCT06032USA
| |
Collapse
|
25
|
Sparks TN, Dugoff L. How to choose a test for prenatal genetic diagnosis: a practical overview. Am J Obstet Gynecol 2023; 228:178-186. [PMID: 36029833 PMCID: PMC9877133 DOI: 10.1016/j.ajog.2022.08.039] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 07/29/2022] [Accepted: 08/05/2022] [Indexed: 01/28/2023]
Abstract
Establishing the diagnosis of a fetal genetic disease in utero expands decision-making opportunities for individuals during pregnancy and enables providers to tailor prenatal care and surveillance to disease-specific risks. The selection of prenatal genetic tests is guided by key details from fetal imaging, family and obstetrical history, suspected diagnoses and mechanisms of disease, an accurate understanding of what abnormalities each test is designed to detect, and, at times, the gestational age at which testing is initiated. Pre- and posttest counseling, by or in conjunction with providers trained in genetics, ensure an accurate understanding of genetic tests, their potential results and limitations, estimated turnaround time for results, and the clinical implications of their findings. As prenatal diagnosis and testing options continue to expand rapidly, it is increasingly important for obstetrical providers to understand how to choose appropriate genetic testing and contextualize the clinical implications of their results.
Collapse
Affiliation(s)
- Teresa N Sparks
- Division of Maternal-Fetal Medicine, Department of Obstetrics, Gynecology, and Reproductive Sciences, University of California, San Francisco, San Francisco, CA; Institute for Human Genetics, University of California, San Francisco, San Francisco, CA.
| | - Lorraine Dugoff
- Divisions of Reproductive Genetics and Maternal-Fetal Medicine, Department of Obstetrics and Gynecology, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA
| |
Collapse
|
26
|
Handra J, Elbert A, Gazzaz N, Moller-Hansen A, Hyunh S, Lee HK, Boerkoel P, Alderman E, Anderson E, Clarke L, Hamilton S, Hamman R, Hughes S, Ip S, Langlois S, Lee M, Li L, Mackenzie F, Patel MS, Prentice LM, Sangha K, Sato L, Seath K, Seppelt M, Swenerton A, Warnock L, Zambonin JL, Boerkoel CF, Chin HL, Armstrong L. The practice of genomic medicine: A delineation of the process and its governing principles. Front Med (Lausanne) 2023; 9:1071348. [PMID: 36714130 PMCID: PMC9877428 DOI: 10.3389/fmed.2022.1071348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2022] [Accepted: 12/23/2022] [Indexed: 01/13/2023] Open
Abstract
Genomic medicine, an emerging medical discipline, applies the principles of evolution, developmental biology, functional genomics, and structural genomics within clinical care. Enabling widespread adoption and integration of genomic medicine into clinical practice is key to achieving precision medicine. We delineate a biological framework defining diagnostic utility of genomic testing and map the process of genomic medicine to inform integration into clinical practice. This process leverages collaboration and collective cognition of patients, principal care providers, clinical genomic specialists, laboratory geneticists, and payers. We detail considerations for referral, triage, patient intake, phenotyping, testing eligibility, variant analysis and interpretation, counseling, and management within the utilitarian limitations of health care systems. To reduce barriers for clinician engagement in genomic medicine, we provide several decision-making frameworks and tools and describe the implementation of the proposed workflow in a prototyped electronic platform that facilitates genomic care. Finally, we discuss a vision for the future of genomic medicine and comment on areas for continued efforts.
Collapse
Affiliation(s)
- Julia Handra
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Adrienne Elbert
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Nour Gazzaz
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada,Department of Pediatrics, University of British Columbia, Vancouver, BC, Canada,Department of Pediatrics, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Ashley Moller-Hansen
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Stephanie Hyunh
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Hyun Kyung Lee
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Pierre Boerkoel
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada
| | - Emily Alderman
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Erin Anderson
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Lorne Clarke
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Sara Hamilton
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Ronnalea Hamman
- Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Shevaun Hughes
- Clinical Research Informatics, Provincial Health Services Authority, Vancouver, BC, Canada
| | - Simon Ip
- Process & Systems Improvement, Provincial Health Services Authority, Vancouver, BC, Canada
| | - Sylvie Langlois
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Mary Lee
- Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Laura Li
- Breakthrough Genomics, Irvine, CA, United States
| | - Frannie Mackenzie
- Women’s Health Research Institute, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Millan S. Patel
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Leah M. Prentice
- Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Karan Sangha
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Laura Sato
- Process & Systems Improvement, Provincial Health Services Authority, Vancouver, BC, Canada
| | - Kimberly Seath
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Margaret Seppelt
- Process & Systems Improvement, Provincial Health Services Authority, Vancouver, BC, Canada
| | - Anne Swenerton
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Lynn Warnock
- Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Jessica L. Zambonin
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Cornelius F. Boerkoel
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| | - Hui-Lin Chin
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada,Khoo Teck Puat-National University Children’s Medical Institute, National University Hospital, Singapore, Singapore,*Correspondence: Hui-Lin Chin,
| | - Linlea Armstrong
- Department of Medical Genetics, University of British Columbia, Vancouver, BC, Canada,Provincial Medical Genetics Program, British Columbia Women’s Hospital and Health Centre, Vancouver, BC, Canada
| |
Collapse
|
27
|
Reese JT, Blau H, Casiraghi E, Bergquist T, Loomba JJ, Callahan TJ, Laraway B, Antonescu C, Coleman B, Gargano M, Wilkins KJ, Cappelletti L, Fontana T, Ammar N, Antony B, Murali TM, Caufield JH, Karlebach G, McMurry JA, Williams A, Moffitt R, Banerjee J, Solomonides AE, Davis H, Kostka K, Valentini G, Sahner D, Chute CG, Madlock-Brown C, Haendel MA, Robinson PN. Generalisable long COVID subtypes: findings from the NIH N3C and RECOVER programmes. EBioMedicine 2023; 87:104413. [PMID: 36563487 PMCID: PMC9769411 DOI: 10.1016/j.ebiom.2022.104413] [Citation(s) in RCA: 43] [Impact Index Per Article: 43.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Revised: 11/23/2022] [Accepted: 11/29/2022] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Stratification of patients with post-acute sequelae of SARS-CoV-2 infection (PASC, or long COVID) would allow precision clinical management strategies. However, long COVID is incompletely understood and characterised by a wide range of manifestations that are difficult to analyse computationally. Additionally, the generalisability of machine learning classification of COVID-19 clinical outcomes has rarely been tested. METHODS We present a method for computationally modelling PASC phenotype data based on electronic healthcare records (EHRs) and for assessing pairwise phenotypic similarity between patients using semantic similarity. Our approach defines a nonlinear similarity function that maps from a feature space of phenotypic abnormalities to a matrix of pairwise patient similarity that can be clustered using unsupervised machine learning. FINDINGS We found six clusters of PASC patients, each with distinct profiles of phenotypic abnormalities, including clusters with distinct pulmonary, neuropsychiatric, and cardiovascular abnormalities, and a cluster associated with broad, severe manifestations and increased mortality. There was significant association of cluster membership with a range of pre-existing conditions and measures of severity during acute COVID-19. We assigned new patients from other healthcare centres to clusters by maximum semantic similarity to the original patients, and showed that the clusters were generalisable across different hospital systems. The increased mortality rate originally identified in one cluster was consistently observed in patients assigned to that cluster in other hospital systems. INTERPRETATION Semantic phenotypic clustering provides a foundation for assigning patients to stratified subgroups for natural history or therapy studies on PASC. FUNDING NIH (TR002306/OT2HL161847-01/OD011883/HG010860), U.S.D.O.E. (DE-AC02-05CH11231), Donald A. Roux Family Fund at Jackson Laboratory, Marsico Family at CU Anschutz.
Collapse
Affiliation(s)
- Justin T Reese
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Hannah Blau
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, USA
| | - Elena Casiraghi
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA; AnacletoLab, Dipartimento di Informatica, Università Degli Studi di Milano, Milan, Italy
| | | | - Johanna J Loomba
- The Integrated Translational Health Research Institute of Virginia (iTHRIV), University of Virginia, Charlottesville, VA, USA
| | - Tiffany J Callahan
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA
| | - Bryan Laraway
- Departments of Biomedical Informatics and Pediatrics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | | | - Ben Coleman
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, USA
| | - Michael Gargano
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, USA
| | - Kenneth J Wilkins
- Biostatistics Program, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Luca Cappelletti
- AnacletoLab, Dipartimento di Informatica, Università Degli Studi di Milano, Milan, Italy
| | - Tommaso Fontana
- AnacletoLab, Dipartimento di Informatica, Università Degli Studi di Milano, Milan, Italy
| | - Nariman Ammar
- Health Science Center, University of Tennessee, Memphis, TN, USA
| | - Blessy Antony
- Department of Computer Science, Virginia Tech, Blacksburg, VA, USA
| | - T M Murali
- Department of Computer Science, Virginia Tech, Blacksburg, VA, USA
| | - J Harry Caufield
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Guy Karlebach
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, USA
| | - Julie A McMurry
- Departments of Biomedical Informatics and Pediatrics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Andrew Williams
- Tufts Medical Center Clinical and Translational Science Institute, Tufts Medical Center, Boston, MA, USA; Tufts University School of Medicine, Institute for Clinical Research and Health Policy Studies, Boston, MA, USA; Northeastern University, OHDSI Center at the Roux Institute, Boston, MA, USA
| | - Richard Moffitt
- Department of Biomedical Informatics and Stony Brook Cancer Center, Stony Brook University, Stony Brook, NY, USA
| | | | | | | | - Kristin Kostka
- Northeastern University, OHDSI Center at the Roux Institute, Boston, MA, USA
| | - Giorgio Valentini
- AnacletoLab, Dipartimento di Informatica, Università Degli Studi di Milano, Milan, Italy
| | | | - Christopher G Chute
- Schools of Medicine, Public Health and Nursing, Johns Hopkins University, Baltimore, MD, USA
| | | | - Melissa A Haendel
- Departments of Biomedical Informatics and Pediatrics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Peter N Robinson
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, USA; Institute for Systems Genomics, University of Connecticut, Farmington, CT, USA.
| |
Collapse
|
28
|
Phenotype-aware prioritisation of rare Mendelian disease variants. Trends Genet 2022; 38:1271-1283. [PMID: 35934592 PMCID: PMC9950798 DOI: 10.1016/j.tig.2022.07.002] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Revised: 06/06/2022] [Accepted: 07/05/2022] [Indexed: 01/24/2023]
Abstract
A molecular diagnosis from the analysis of sequencing data in rare Mendelian diseases has a huge impact on the management of patients and their families. Numerous patient phenotype-aware variant prioritisation (VP) tools have been developed to help automate this process, and shorten the diagnostic odyssey, but performance statistics on real patient data are limited. Here we identify, assess, and compare the performance of all up-to-date, freely available, and programmatically accessible tools using a whole-exome, retinal disease dataset from 134 individuals with a molecular diagnosis. All tools were able to identify around two-thirds of the genetic diagnoses as the top-ranked candidate, with LIRICAL performing best overall. Finally, we discuss the challenges to overcome most cases remaining undiagnosed after current, state-of-the-art practices.
Collapse
|
29
|
Aitken S, Firth HV, Wright CF, Hurles ME, FitzPatrick DR, Semple CA. IMPROVE-DD: Integrating multiple phenotype resources optimizes variant evaluation in genetically determined developmental disorders. HGG ADVANCES 2022; 4:100162. [PMID: 36561149 PMCID: PMC9763511 DOI: 10.1016/j.xhgg.2022.100162] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Accepted: 11/22/2022] [Indexed: 11/27/2022] Open
Abstract
Diagnosing rare developmental disorders using genome-wide sequencing data commonly necessitates review of multiple plausible candidate variants, often using ontologies of categorical clinical terms. We show that Integrating Multiple Phenotype Resources Optimizes Variant Evaluation in Developmental Disorders (IMPROVE-DD) by incorporating additional classes of data commonly available to clinicians and recorded in health records. In doing so, we quantify the distinct contributions of sex, growth, and development in addition to Human Phenotype Ontology (HPO) terms and demonstrate added value from these readily available information sources. We use likelihood ratios for nominal and quantitative data and propose a classifier for HPO terms in this framework. This Bayesian framework results in more robust diagnoses. Using data systematically collected in the Deciphering Developmental Disorders study, we considered 77 genes with pathogenic/likely pathogenic variants in ≥10 individuals. All genes showed at least a satisfactory prediction by receiver operating characteristic when testing on training data (AUC ≥ 0.6), and HPO terms were the best predictor for the majority of genes, though a minority (13/77) of genes were better predicted by other phenotypic data types. Overall, classifiers based upon multiple integrated phenotypic data sources performed better than those based upon any individual source, and importantly, integrated models produced notably fewer false positives. Finally, we show that IMPROVE-DD models with good predictive performance on cross-validation can be constructed from relatively few individuals. This suggests new strategies for candidate gene prioritization and highlights the value of systematic clinical data collection to support diagnostic programs.
Collapse
Affiliation(s)
- Stuart Aitken
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Helen V. Firth
- Wellcome Sanger Institute, Hinxton, Cambridgeshire CB10 1SA, UK,Clinical Genetics Department, Addenbrooke’s Hospital Cambridge University Hospitals, Cambridge CB2 0QQ, UK
| | - Caroline F. Wright
- University of Exeter Medical School, Royal Devon & Exeter Hospital, Barrack Road, Exeter EX2 5DW, UK
| | | | - David R. FitzPatrick
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh EH4 2XU, UK
| | - Colin A. Semple
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh EH4 2XU, UK,Corresponding author
| |
Collapse
|
30
|
Lewis-Smith D, Parthasarathy S, Xian J, Kaufman MC, Ganesan S, Galer PD, Thomas RH, Helbig I. Computational analysis of neurodevelopmental phenotypes: Harmonization empowers clinical discovery. Hum Mutat 2022; 43:1642-1658. [PMID: 35460582 PMCID: PMC9560951 DOI: 10.1002/humu.24389] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Revised: 03/23/2022] [Accepted: 04/21/2022] [Indexed: 11/09/2022]
Abstract
Making a specific diagnosis in neurodevelopmental disorders is traditionally based on recognizing clinical features of a distinct syndrome, which guides testing of its possible genetic etiologies. Scalable frameworks for genomic diagnostics, however, have struggled to integrate meaningful measurements of clinical phenotypic features. While standardization has enabled generation and interpretation of genomic data for clinical diagnostics at unprecedented scale, making the equivalent breakthrough for clinical data has proven challenging. However, increasingly clinical features are being recorded using controlled dictionaries with machine readable formats such as the Human Phenotype Ontology (HPO), which greatly facilitates their use in the diagnostic space. Improving the tractability of large-scale clinical information will present new opportunities to inform genomic research and diagnostics from a clinical perspective. Here, we describe novel approaches for computational phenotyping to harmonize clinical features, improve data translation through revising domain-specific dictionaries, quantify phenotypic features, and determine clinical relatedness. We demonstrate how these concepts can be applied to longitudinal phenotypic information, which represents a critical element of developmental disorders and pediatric conditions. Finally, we expand our discussion to clinical data derived from electronic medical records, a largely untapped resource of deep clinical information with distinct strengths and weaknesses.
Collapse
Affiliation(s)
- David Lewis-Smith
- Translational and Clinical Research Institute, Newcastle University, Newcastle-upon-Tyne, UK
- Department of Clinical Neurosciences, Royal Victoria Infirmary, Newcastle-upon-Tyne, UK
- Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Shridhar Parthasarathy
- Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Julie Xian
- Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Michael C. Kaufman
- Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Shiva Ganesan
- Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Peter D. Galer
- Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Center for Neuroengineering and Therapeutics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, USA
| | - Rhys H. Thomas
- Translational and Clinical Research Institute, Newcastle University, Newcastle-upon-Tyne, UK
- Department of Clinical Neurosciences, Royal Victoria Infirmary, Newcastle-upon-Tyne, UK
| | - Ingo Helbig
- Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Neurology, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, USA
| |
Collapse
|
31
|
Liu Y, Yeung WSB, Chiu PCN, Cao D. Computational approaches for predicting variant impact: An overview from resources, principles to applications. Front Genet 2022; 13:981005. [PMID: 36246661 PMCID: PMC9559863 DOI: 10.3389/fgene.2022.981005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2022] [Accepted: 08/08/2022] [Indexed: 11/13/2022] Open
Abstract
One objective of human genetics is to unveil the variants that contribute to human diseases. With the rapid development and wide use of next-generation sequencing (NGS), massive genomic sequence data have been created, making personal genetic information available. Conventional experimental evidence is critical in establishing the relationship between sequence variants and phenotype but with low efficiency. Due to the lack of comprehensive databases and resources which present clinical and experimental evidence on genotype-phenotype relationship, as well as accumulating variants found from NGS, different computational tools that can predict the impact of the variants on phenotype have been greatly developed to bridge the gap. In this review, we present a brief introduction and discussion about the computational approaches for variant impact prediction. Following an innovative manner, we mainly focus on approaches for non-synonymous variants (nsSNVs) impact prediction and categorize them into six classes. Their underlying rationale and constraints, together with the concerns and remedies raised from comparative studies are discussed. We also present how the predictive approaches employed in different research. Although diverse constraints exist, the computational predictive approaches are indispensable in exploring genotype-phenotype relationship.
Collapse
Affiliation(s)
- Ye Liu
- Shenzhen Key Laboratory of Fertility Regulation, Reproductive Medicine Center, The University of Hong Kong-Shenzhen Hospital, Shenzhen, China
| | - William S. B. Yeung
- Shenzhen Key Laboratory of Fertility Regulation, Reproductive Medicine Center, The University of Hong Kong-Shenzhen Hospital, Shenzhen, China
- Department of Obstetrics and Gynaecology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
| | - Philip C. N. Chiu
- Shenzhen Key Laboratory of Fertility Regulation, Reproductive Medicine Center, The University of Hong Kong-Shenzhen Hospital, Shenzhen, China
- Department of Obstetrics and Gynaecology, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong SAR, China
- *Correspondence: Philip C. N. Chiu, ; Dandan Cao,
| | - Dandan Cao
- Shenzhen Key Laboratory of Fertility Regulation, Reproductive Medicine Center, The University of Hong Kong-Shenzhen Hospital, Shenzhen, China
- *Correspondence: Philip C. N. Chiu, ; Dandan Cao,
| |
Collapse
|
32
|
Swanson K, Norton ME, Lianoglou BR, Jelin AC, Hodoglugil U, Van Ziffle J, Devine P, Sparks TN. The utility of pathologic examination and comprehensive phenotyping for accurate diagnosis with perinatal exome sequencing. Prenat Diagn 2022; 42:1288-1294. [PMID: 35723972 PMCID: PMC9531346 DOI: 10.1002/pd.6197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 05/29/2022] [Accepted: 06/13/2022] [Indexed: 11/09/2022]
Abstract
OBJECTIVE Exome sequencing (ES) offers the ability to assess for variants in thousands of genes and is particularly useful in the setting of fetal anomalies. However, the ES pipeline relies on a thorough understanding of an individual patient's phenotype, which may be limited in the prenatal setting. Additional pathology evaluations in the pre- and postnatal settings can add phenotypic details important for clearly establishing and characterizing a diagnosis. METHODS This is a case series of prenatal ES performed at our institution in which pathology evaluations, including autopsy, dysmorphology examination, histology, and peripheral blood smear, augmented the understanding of the fetal phenotype. ES was performed at our institution and a multidisciplinary panel reviewed and classified the variants for each case. RESULTS We present four cases wherein pathology evaluations were beneficial for supporting a perinatal diagnosis identified with ES. In each of these cases, pathology findings provided additional data to support a more complete understanding of the relationship between the perinatal phenotype and variants identified with ES. CONCLUSION These cases highlight challenges of perinatal ES related to incomplete prenatal phenotyping, demonstrate the utility of pathology evaluations to support diagnoses identified with ES, and further characterize the disease manifestations of specific genetic variants.
Collapse
Affiliation(s)
- Kate Swanson
- Department of Obstetrics, Gynecology and Reproductive Sciences, Division of Maternal-Fetal Medicine, University of California, San Francisco, California, USA
- Department of Pediatrics, Division of Medical Genetics, University of California, San Francisco, California, USA
| | - Mary E. Norton
- Department of Obstetrics, Gynecology and Reproductive Sciences, Division of Maternal-Fetal Medicine, University of California, San Francisco, California, USA
- Department of Pediatrics, Division of Medical Genetics, University of California, San Francisco, California, USA
- Fetal Treatment Center, University of California, San Francisco, California, USA
- Institute for Human Genetics, University of California, San Francisco, California, USA
| | - Billie R. Lianoglou
- Fetal Treatment Center, University of California, San Francisco, California, USA
| | - Angie C. Jelin
- Department of Gynecology and Obstetrics, Johns Hopkins Hospital, Baltimore, Maryland, USA
| | - Ugur Hodoglugil
- Genomic Medicine Laboratory, University of California, San Francisco, California, USA
| | - Jessica Van Ziffle
- Genomic Medicine Laboratory, University of California, San Francisco, California, USA
| | - Patrick Devine
- Genomic Medicine Laboratory, University of California, San Francisco, California, USA
| | - Teresa N. Sparks
- Department of Obstetrics, Gynecology and Reproductive Sciences, Division of Maternal-Fetal Medicine, University of California, San Francisco, California, USA
- Fetal Treatment Center, University of California, San Francisco, California, USA
- Institute for Human Genetics, University of California, San Francisco, California, USA
| |
Collapse
|
33
|
Jacobsen JOB, Kelly C, Cipriani V, Research Consortium GE, Mungall CJ, Reese J, Danis D, Robinson PN, Smedley D. Phenotype-driven approaches to enhance variant prioritization and diagnosis of rare disease. Hum Mutat 2022; 43:1071-1081. [PMID: 35391505 PMCID: PMC9288531 DOI: 10.1002/humu.24380] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Revised: 01/25/2022] [Accepted: 04/03/2022] [Indexed: 11/20/2022]
Abstract
Rare disease diagnostics and disease gene discovery have been revolutionized by whole-exome and genome sequencing but identifying the causative variant(s) from the millions in each individual remains challenging. The use of deep phenotyping of patients and reference genotype-phenotype knowledge, alongside variant data such as allele frequency, segregation, and predicted pathogenicity, has proved an effective strategy to tackle this issue. Here we review the numerous tools that have been developed to automate this approach and demonstrate the power of such an approach on several thousand diagnosed cases from the 100,000 Genomes Project. Finally, we discuss the challenges that need to be overcome if we are going to improve detection rates and help the majority of patients that still remain without a molecular diagnosis after state-of-the-art genomic interpretation.
Collapse
Affiliation(s)
- Julius O. B. Jacobsen
- William Harvey Research Institute, Charterhouse Square, Barts and the London School of Medicine and Dentistry QueenQueen Mary University of LondonLondonUK
| | - Catherine Kelly
- William Harvey Research Institute, Charterhouse Square, Barts and the London School of Medicine and Dentistry QueenQueen Mary University of LondonLondonUK
| | - Valentina Cipriani
- William Harvey Research Institute, Charterhouse Square, Barts and the London School of Medicine and Dentistry QueenQueen Mary University of LondonLondonUK
| | | | - Christopher J. Mungall
- Environmental Genomics and Systems Biology, Lawrence Berkeley National LaboratoryBerkeleyCaliforniaUSA
| | - Justin Reese
- Environmental Genomics and Systems Biology, Lawrence Berkeley National LaboratoryBerkeleyCaliforniaUSA
| | - Daniel Danis
- The Jackson Laboratory for Genomic MedicineFarmingtonConnecticutUSA
| | | | - Damian Smedley
- William Harvey Research Institute, Charterhouse Square, Barts and the London School of Medicine and Dentistry QueenQueen Mary University of LondonLondonUK
| |
Collapse
|
34
|
Reese JT, Blau H, Bergquist T, Loomba JJ, Callahan T, Laraway B, Antonescu C, Casiraghi E, Coleman B, Gargano M, Wilkins KJ, Cappelletti L, Fontana T, Ammar N, Antony B, Murali TM, Karlebach G, McMurry JA, Williams A, Moffitt R, Banerjee J, Solomonides AE, Davis H, Kostka K, Valentini G, Sahner D, Chute CG, Madlock-Brown C, Haendel MA, Robinson PN. Generalizable Long COVID Subtypes: Findings from the NIH N3C and RECOVER Programs. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2022:2022.05.24.22275398. [PMID: 35665012 PMCID: PMC9164456 DOI: 10.1101/2022.05.24.22275398] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/11/2023]
Abstract
Accurate stratification of patients with post-acute sequelae of SARS-CoV-2 infection (PASC, or long COVID) would allow precision clinical management strategies. However, the natural history of long COVID is incompletely understood and characterized by an extremely wide range of manifestations that are difficult to analyze computationally. In addition, the generalizability of machine learning classification of COVID-19 clinical outcomes has rarely been tested. We present a method for computationally modeling PASC phenotype data based on electronic healthcare records (EHRs) and for assessing pairwise phenotypic similarity between patients using semantic similarity. Our approach defines a nonlinear similarity function that maps from a feature space of phenotypic abnormalities to a matrix of pairwise patient similarity that can be clustered using unsupervised machine learning procedures. Using k-means clustering of this similarity matrix, we found six distinct clusters of PASC patients, each with distinct profiles of phenotypic abnormalities. There was a significant association of cluster membership with a range of pre-existing conditions and with measures of severity during acute COVID-19. Two of the clusters were associated with severe manifestations and displayed increased mortality. We assigned new patients from other healthcare centers to one of the six clusters on the basis of maximum semantic similarity to the original patients. We show that the identified clusters were generalizable across different hospital systems and that the increased mortality rate was consistently observed in two of the clusters. Semantic phenotypic clustering can provide a foundation for assigning patients to stratified subgroups for natural history or therapy studies on PASC.
Collapse
|
35
|
Chin HL, Gazzaz N, Huynh S, Handra I, Warnock L, Moller-Hansen A, Boerkoel P, Jacobsen JOB, du Souich C, Zhang N, Shefchek K, Prentice LM, Washington N, Haendel M, Armstrong L, Clarke L, Li WL, Smedley D, Robinson PN, Boerkoel CF. The Clinical Variant Analysis Tool: Analyzing the evidence supporting reported genomic variation in clinical practice. Genet Med 2022; 24:1512-1522. [PMID: 35442193 PMCID: PMC9363005 DOI: 10.1016/j.gim.2022.03.013] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2021] [Revised: 03/15/2022] [Accepted: 03/16/2022] [Indexed: 01/03/2023] Open
Abstract
PURPOSE Genomic test results, regardless of laboratory variant classification, require clinical practitioners to judge the applicability of a variant for medical decisions. Teaching and standardizing clinical interpretation of genomic variation calls for a methodology or tool. METHODS To generate such a tool, we distilled the Clinical Genome Resource framework of causality and the American College of Medical Genetics/Association of Molecular Pathology and Quest Diagnostic Laboratory scoring of variant deleteriousness into the Clinical Variant Analysis Tool (CVAT). Applying this to 289 clinical exome reports, we compared the performance of junior practitioners with that of experienced medical geneticists and assessed the utility of reported variants. RESULTS CVAT enabled performance comparable to that of experienced medical geneticists. In total, 124 of 289 (42.9%) exome reports and 146 of 382 (38.2%) reported variants supported a diagnosis. Overall, 10.5% (1 pathogenic [P] or likely pathogenic [LP] variant and 39 variants of uncertain significance [VUS]) of variants were reported in genes without established disease association; 20.2% (23 P/LP and 54 VUS) were in genes without sufficient phenotypic concordance; 7.3% (15 P/LP and 13 VUS) conflicted with the known molecular disease mechanism; and 24% (91 VUS) had insufficient evidence for deleteriousness. CONCLUSION Implementation of CVAT standardized clinical interpretation of genomic variation and emphasized the need for collaborative and transparent reporting of genomic variation.
Collapse
Affiliation(s)
- Hui-Lin Chin
- Department of Medical Genetics, Faculty of Medicine, The University of British Columbia, Vancouver, British Columbia, Canada; Provincial Medical Genetics Program, Women's Hospital of British Columbia, Vancouver, British Columbia, Canada; Khoo Teck Puat-National University Children's Medical Institute, National University Hospital, Singapore, Singapore
| | - Nour Gazzaz
- Department of Medical Genetics, Faculty of Medicine, The University of British Columbia, Vancouver, British Columbia, Canada; Provincial Medical Genetics Program, Women's Hospital of British Columbia, Vancouver, British Columbia, Canada; Department of Pediatrics, Faculty of Medicine, The University of British Columbia, Vancouver, British Columbia, Canada; Department of Pediatrics, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Stephanie Huynh
- Department of Medical Genetics, Faculty of Medicine, The University of British Columbia, Vancouver, British Columbia, Canada; Provincial Medical Genetics Program, Women's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | - Iulia Handra
- Department of Medical Genetics, Faculty of Medicine, The University of British Columbia, Vancouver, British Columbia, Canada; Provincial Medical Genetics Program, Women's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | - Lynn Warnock
- Provincial Medical Genetics Program, Women's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | - Ashley Moller-Hansen
- Provincial Medical Genetics Program, Women's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | - Pierre Boerkoel
- MD Undergraduate Program, Faculty of Medicine, University of British Columbia, Vancouver, British Columbia, Canada
| | - Julius O B Jacobsen
- William Harvey Research Institute, Barts & The London School of Medicine & Dentistry, Queen Mary University of London, London, United Kingdom
| | - Christèle du Souich
- Department of Medical Genetics, Faculty of Medicine, The University of British Columbia, Vancouver, British Columbia, Canada
| | | | - Kent Shefchek
- Oregon Clinical and Translational Science Institute, Oregon Health & Science University, Portland, OR
| | - Leah M Prentice
- Provincial Laboratory Medicine Services, Provincial Health Services Authority, Vancouver, British Columbia, Canada
| | | | - Melissa Haendel
- Oregon Clinical and Translational Science Institute, Oregon Health & Science University, Portland, OR
| | - Linlea Armstrong
- Department of Medical Genetics, Faculty of Medicine, The University of British Columbia, Vancouver, British Columbia, Canada; Provincial Medical Genetics Program, Women's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | - Lorne Clarke
- Department of Medical Genetics, Faculty of Medicine, The University of British Columbia, Vancouver, British Columbia, Canada; Provincial Medical Genetics Program, Women's Hospital of British Columbia, Vancouver, British Columbia, Canada
| | | | - Damian Smedley
- William Harvey Research Institute, Barts & The London School of Medicine & Dentistry, Queen Mary University of London, London, United Kingdom
| | | | - Cornelius F Boerkoel
- Department of Medical Genetics, Faculty of Medicine, The University of British Columbia, Vancouver, British Columbia, Canada; Provincial Medical Genetics Program, Women's Hospital of British Columbia, Vancouver, British Columbia, Canada.
| |
Collapse
|
36
|
Yates T, Lain A, Campbell J, FitzPatrick DR, Simpson TI. Creation and evaluation of full-text literature-derived, feature-weighted disease models of genetically determined developmental disorders. Database (Oxford) 2022; 2022:baac038. [PMID: 35670729 PMCID: PMC9216525 DOI: 10.1093/database/baac038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2021] [Revised: 03/26/2022] [Accepted: 05/25/2022] [Indexed: 11/24/2022]
Abstract
There are >2500 different genetically determined developmental disorders (DD), which, as a group, show very high levels of both locus and allelic heterogeneity. This has led to the wide-spread use of evidence-based filtering of genome-wide sequence data as a diagnostic tool in DD. Determining whether the association of a filtered variant at a specific locus is a plausible explanation of the phenotype in the proband is crucial and commonly requires extensive manual literature review by both clinical scientists and clinicians. Access to a database of weighted clinical features extracted from rigorously curated literature would increase the efficiency of this process and facilitate the development of robust phenotypic similarity metrics. However, given the large and rapidly increasing volume of published information, conventional biocuration approaches are becoming impractical. Here, we present a scalable, automated method for the extraction of categorical phenotypic descriptors from the full-text literature. Papers identified through literature review were downloaded and parsed using the Cadmus custom retrieval package. Human Phenotype Ontology terms were extracted using MetaMap, with 76-84% precision and 65-73% recall. Mean terms per paper increased from 9 in title + abstract, to 68 using full text. We demonstrate that these literature-derived disease models plausibly reflect true disease expressivity more accurately than widely used manually curated models, through comparison with prospectively gathered data from the Deciphering Developmental Disorders study. The area under the curve for receiver operating characteristic (ROC) curves increased by 5-10% through the use of literature-derived models. This work shows that scalable automated literature curation increases performance and adds weight to the need for this strategy to be integrated into informatic variant analysis pipelines. Database URL: https://doi.org/10.1093/database/baac038.
Collapse
Affiliation(s)
- T.M Yates
- MRC Human Genetics Unit, Western General Hospital, Institute of Genetics and Cancer, The University of Edinburgh, Crewe Road South, Edinburgh EH4 2XU, UK
- Transforming Genetic Medicine Initiative, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - A Lain
- Institute for Adaptive and Neural Computation, Informatics Forum, The University of Edinburgh, 10 Crichton Street, Edinburgh EH8 9AB, UK
| | - J Campbell
- MRC Human Genetics Unit, Western General Hospital, Institute of Genetics and Cancer, The University of Edinburgh, Crewe Road South, Edinburgh EH4 2XU, UK
- Simons Initiative for the Developing Brain, The University of Edinburgh, Hugh Robson Building, George Square, Edinburgh EH8 9XF, UK
| | - D R FitzPatrick
- MRC Human Genetics Unit, Western General Hospital, Institute of Genetics and Cancer, The University of Edinburgh, Crewe Road South, Edinburgh EH4 2XU, UK
- Transforming Genetic Medicine Initiative, European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Simons Initiative for the Developing Brain, The University of Edinburgh, Hugh Robson Building, George Square, Edinburgh EH8 9XF, UK
| | - T I Simpson
- Institute for Adaptive and Neural Computation, Informatics Forum, The University of Edinburgh, 10 Crichton Street, Edinburgh EH8 9AB, UK
- Simons Initiative for the Developing Brain, The University of Edinburgh, Hugh Robson Building, George Square, Edinburgh EH8 9XF, UK
| |
Collapse
|
37
|
Cohen ASA, Farrow EG, Abdelmoity AT, Alaimo JT, Amudhavalli SM, Anderson JT, Bansal L, Bartik L, Baybayan P, Belden B, Berrios CD, Biswell RL, Buczkowicz P, Buske O, Chakraborty S, Cheung WA, Coffman KA, Cooper AM, Cross LA, Curran T, Dang TTT, Elfrink MM, Engleman KL, Fecske ED, Fieser C, Fitzgerald K, Fleming EA, Gadea RN, Gannon JL, Gelineau-Morel RN, Gibson M, Goldstein J, Grundberg E, Halpin K, Harvey BS, Heese BA, Hein W, Herd SM, Hughes SS, Ilyas M, Jacobson J, Jenkins JL, Jiang S, Johnston JJ, Keeler K, Korlach J, Kussmann J, Lambert C, Lawson C, Le Pichon JB, Leeder JS, Little VC, Louiselle DA, Lypka M, McDonald BD, Miller N, Modrcin A, Nair A, Neal SH, Oermann CM, Pacicca DM, Pawar K, Posey NL, Price N, Puckett LMB, Quezada JF, Raje N, Rowell WJ, Rush ET, Sampath V, Saunders CJ, Schwager C, Schwend RM, Shaffer E, Smail C, Soden S, Strenk ME, Sullivan BR, Sweeney BR, Tam-Williams JB, Walter AM, Welsh H, Wenger AM, Willig LK, Yan Y, Younger ST, Zhou D, Zion TN, Thiffault I, Pastinen T. Genomic answers for children: Dynamic analyses of >1000 pediatric rare disease genomes. Genet Med 2022; 24:1336-1348. [PMID: 35305867 DOI: 10.1016/j.gim.2022.02.007] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 02/05/2022] [Accepted: 02/07/2022] [Indexed: 12/17/2022] Open
Abstract
PURPOSE This study aimed to provide comprehensive diagnostic and candidate analyses in a pediatric rare disease cohort through the Genomic Answers for Kids program. METHODS Extensive analyses of 960 families with suspected genetic disorders included short-read exome sequencing and short-read genome sequencing (srGS); PacBio HiFi long-read genome sequencing (HiFi-GS); variant calling for single nucleotide variants (SNV), structural variant (SV), and repeat variants; and machine-learning variant prioritization. Structured phenotypes, prioritized variants, and pedigrees were stored in PhenoTips database, with data sharing through controlled access the database of Genotypes and Phenotypes. RESULTS Diagnostic rates ranged from 11% in patients with prior negative genetic testing to 34.5% in naive patients. Incorporating SVs from genome sequencing added up to 13% of new diagnoses in previously unsolved cases. HiFi-GS yielded increased discovery rate with >4-fold more rare coding SVs compared with srGS. Variants and genes of unknown significance remain the most common finding (58% of nondiagnostic cases). CONCLUSION Computational prioritization is efficient for diagnostic SNVs. Thorough identification of non-SNVs remains challenging and is partly mitigated using HiFi-GS sequencing. Importantly, community research is supported by sharing real-time data to accelerate gene validation and by providing HiFi variant (SNV/SV) resources from >1000 human alleles to facilitate implementation of new sequencing platforms for rare disease diagnoses.
Collapse
Affiliation(s)
- Ana S A Cohen
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO; Department of Pathology and Laboratory Medicine, Children's Mercy Kansas City, Kansas City, MO; UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO
| | - Emily G Farrow
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO; UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO; Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | | | - Joseph T Alaimo
- Department of Pathology and Laboratory Medicine, Children's Mercy Kansas City, Kansas City, MO; UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO
| | - Shivarajan M Amudhavalli
- UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO; Division of Genetics, Children's Mercy Kansas City, Kansas City, MO
| | - John T Anderson
- Department of Orthopaedic Surgery, Children's Mercy Kansas City, Kansas City, MO
| | - Lalit Bansal
- Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Lauren Bartik
- UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO; Division of Genetics, Children's Mercy Kansas City, Kansas City, MO
| | | | - Bradley Belden
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO
| | | | - Rebecca L Biswell
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO
| | | | | | | | - Warren A Cheung
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO
| | - Keith A Coffman
- Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Ashley M Cooper
- Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Laura A Cross
- Division of Genetics, Children's Mercy Kansas City, Kansas City, MO
| | - Tom Curran
- Children's Mercy Research Institute, Kansas City, MO
| | - Thuy Tien T Dang
- Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Mary M Elfrink
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO
| | | | - Erin D Fecske
- Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Cynthia Fieser
- Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Keely Fitzgerald
- Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Emily A Fleming
- Division of Genetics, Children's Mercy Kansas City, Kansas City, MO
| | - Randi N Gadea
- Division of Genetics, Children's Mercy Kansas City, Kansas City, MO
| | | | - Rose N Gelineau-Morel
- UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO; Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Margaret Gibson
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO
| | - Jeffrey Goldstein
- Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Elin Grundberg
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO
| | - Kelsee Halpin
- UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO; Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Brian S Harvey
- Department of Orthopaedic Surgery, Children's Mercy Kansas City, Kansas City, MO
| | - Bryce A Heese
- Division of Genetics, Children's Mercy Kansas City, Kansas City, MO
| | - Wendy Hein
- Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Suzanne M Herd
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO
| | - Susan S Hughes
- Division of Genetics, Children's Mercy Kansas City, Kansas City, MO
| | - Mohammed Ilyas
- UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO; Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Jill Jacobson
- UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO; Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Janda L Jenkins
- Division of Genetics, Children's Mercy Kansas City, Kansas City, MO
| | | | | | - Kathryn Keeler
- Department of Orthopaedic Surgery, Children's Mercy Kansas City, Kansas City, MO
| | - Jonas Korlach
- Pacific Biosciences of California, Inc, Menlo Park, CA
| | | | | | - Caitlin Lawson
- Division of Genetics, Children's Mercy Kansas City, Kansas City, MO
| | | | | | - Vicki C Little
- Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | | | | | | | - Neil Miller
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO; UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO; Division of Allergy Immunology Pulmonary and Sleep Medicine, Children's Mercy Kansas City, Kansas City, MO
| | - Ann Modrcin
- Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Annapoorna Nair
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO
| | - Shelby H Neal
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO
| | | | - Donna M Pacicca
- Department of Orthopaedic Surgery, Children's Mercy Kansas City, Kansas City, MO
| | - Kailash Pawar
- Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Nyshele L Posey
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO
| | - Nigel Price
- Department of Orthopaedic Surgery, Children's Mercy Kansas City, Kansas City, MO
| | - Laura M B Puckett
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO
| | - Julio F Quezada
- UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO; Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Nikita Raje
- UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO; Division of Neonatology, Children's Mercy Kansas City, Kansas City, MO
| | | | - Eric T Rush
- UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO; Division of Genetics, Children's Mercy Kansas City, Kansas City, MO; Department of Internal Medicine, University of Kansas School of Medicine, Kansas City, MO
| | - Venkatesh Sampath
- Division of Neonatology, Children's Mercy Hospital Kansas City, Kansas City, MO
| | - Carol J Saunders
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO; Department of Pathology and Laboratory Medicine, Children's Mercy Kansas City, Kansas City, MO; UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO
| | - Caitlin Schwager
- Division of Genetics, Children's Mercy Kansas City, Kansas City, MO
| | - Richard M Schwend
- Department of Orthopaedic Surgery, Children's Mercy Kansas City, Kansas City, MO
| | - Elizabeth Shaffer
- Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Craig Smail
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO
| | - Sarah Soden
- Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Meghan E Strenk
- Division of Genetics, Children's Mercy Kansas City, Kansas City, MO
| | | | - Brooke R Sweeney
- UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO; Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | | | - Adam M Walter
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO
| | - Holly Welsh
- Division of Genetics, Children's Mercy Kansas City, Kansas City, MO
| | | | - Laurel K Willig
- Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Yun Yan
- UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO; Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO
| | - Scott T Younger
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO
| | - Dihong Zhou
- Division of Genetics, Children's Mercy Kansas City, Kansas City, MO
| | - Tricia N Zion
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO; UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO; Department of Pediatrics, Children's Mercy Kansas City, Kansas City, MO; Division of Genetics, Children's Mercy Kansas City, Kansas City, MO
| | - Isabelle Thiffault
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO; Department of Pathology and Laboratory Medicine, Children's Mercy Kansas City, Kansas City, MO; UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO.
| | - Tomi Pastinen
- Genomic Medicine Center, Children's Mercy Kansas City, Kansas City, MO; UKMC School of Medicine, University of Missouri Kansas City, Kansas City, MO; Children's Mercy Research Institute, Kansas City, MO.
| |
Collapse
|
38
|
Dhombres F, Morgan P, Chaudhari BP, Filges I, Sparks TN, Lapunzina P, Roscioli T, Agarwal U, Aggarwal S, Beneteau C, Cacheiro P, Carmody LC, Collardeau‐Frachon S, Dempsey EA, Dufke A, Duyzend MH, el Ghosh M, Giordano JL, Glad R, Grinfelde I, Iliescu DG, Ladewig MS, Munoz‐Torres MC, Pollazzon M, Radio FC, Rodo C, Silva RG, Smedley D, Sundaramurthi JC, Toro S, Valenzuela I, Vasilevsky NA, Wapner RJ, Zemet R, Haendel MA, Robinson PN. Prenatal phenotyping: A community effort to enhance the Human Phenotype Ontology. AMERICAN JOURNAL OF MEDICAL GENETICS. PART C, SEMINARS IN MEDICAL GENETICS 2022; 190:231-242. [PMID: 35872606 PMCID: PMC9588534 DOI: 10.1002/ajmg.c.31989] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2022] [Accepted: 07/01/2022] [Indexed: 01/07/2023]
Abstract
Technological advances in both genome sequencing and prenatal imaging are increasing our ability to accurately recognize and diagnose Mendelian conditions prenatally. Phenotype-driven early genetic diagnosis of fetal genetic disease can help to strategize treatment options and clinical preventive measures during the perinatal period, to plan in utero therapies, and to inform parental decision-making. Fetal phenotypes of genetic diseases are often unique and at present are not well understood; more comprehensive knowledge about prenatal phenotypes and computational resources have an enormous potential to improve diagnostics and translational research. The Human Phenotype Ontology (HPO) has been widely used to support diagnostics and translational research in human genetics. To better support prenatal usage, the HPO consortium conducted a series of workshops with a group of domain experts in a variety of medical specialties, diagnostic techniques, as well as diseases and phenotypes related to prenatal medicine, including perinatal pathology, musculoskeletal anomalies, neurology, medical genetics, hydrops fetalis, craniofacial malformations, cardiology, neonatal-perinatal medicine, fetal medicine, placental pathology, prenatal imaging, and bioinformatics. We expanded the representation of prenatal phenotypes in HPO by adding 95 new phenotype terms under the Abnormality of prenatal development or birth (HP:0001197) grouping term, and revised definitions, synonyms, and disease annotations for most of the 152 terms that existed before the beginning of this effort. The expansion of prenatal phenotypes in HPO will support phenotype-driven prenatal exome and genome sequencing for precision genetic diagnostics of rare diseases to support prenatal care.
Collapse
Affiliation(s)
- Ferdinand Dhombres
- Sorbonne University, GRC26, INSERM, Limics, Armand Trousseau Hospital, Fetal Medicine Department, APHPParisFrance
| | - Patricia Morgan
- American College of Medical Genetics and Genomics, Newborn Screening Translational Research NetworkBethesdaMarylandUSA
| | - Bimal P. Chaudhari
- Institute for Genomic MedicineNationwide Children's HospitalColumbusOhioUSA
| | - Isabel Filges
- University Hospital Basel and University of Basel, Medical GeneticsBaselSwitzerland
| | - Teresa N. Sparks
- Department of Obstetrics, Gynecology, & Reproductive SciencesUniversity of California, San FranciscoSan FranciscoCaliforniaUSA
| | - Pablo Lapunzina
- CIBERER and Hospital Universitario La Paz, INGEMM‐Institute of Medical and Molecular GeneticsMadridSpain
| | - Tony Roscioli
- Neuroscience Research Australia (NeuRA), University of New South WalesSydneyNew South WalesAustralia
| | - Umber Agarwal
- Department of Maternal and Fetal MedicineLiverpool Women's NHS Foundation TrustLiverpoolUK
| | - Shagun Aggarwal
- Department of Medical GeneticsNizam's Institute of Medical SciencesHyderabadTelanganaIndia
| | - Claire Beneteau
- Service de Génétique Médicale, UF 9321 de Fœtopathologie et Génétique, CHU de NantesNantesFrance
| | - Pilar Cacheiro
- William Harvey Research InstituteQueen Mary University of LondonLondonUK
| | - Leigh C. Carmody
- Department of Genomic MedicineThe Jackson LaboratoryFarmingtonConnecticutUSA
| | | | - Esther A. Dempsey
- St George's University of London, Molecular and Clinical Sciences Research InstituteLondonUK
| | - Andreas Dufke
- University of Tübingen, Institute of Medical Genetics and Applied GenomicsTübingenGermany
| | | | | | - Jessica L. Giordano
- Department of Obstetrics and GynecologyColumbia University Irving Medical CenterNew YorkNew YorkUSA
| | - Ragnhild Glad
- Department of Obstetrics and GynecologyUniversity Hospital of North NorwayTromsøNorway
| | - Ieva Grinfelde
- Department of Medical Genetics and Prenatal diagnosisChildren's University HospitalRigaLatvia
| | - Dominic G. Iliescu
- Department of Obstetrics and GynecologyUniversity of Medicine and Pharmacy CraiovaCraiovaDoljRomania
| | - Markus S. Ladewig
- Department of OphthalmologyKlinikum SaarbrückenSaarbrückenSaarlandGermany
| | - Monica C. Munoz‐Torres
- Department of Biochemistry and Molecular GeneticsUniversity of Colorado Anschutz Medical CampusAuroraColoradoUSA
| | - Marzia Pollazzon
- Azienda USL‐IRCCS di Reggio EmiliaMedical Genetics UnitReggio EmiliaItaly
| | | | - Carlota Rodo
- Vall d'Hebron Hospital Campus, Maternal & Fetal MedicineBarcelonaSpain
| | - Raquel Gouveia Silva
- Hospital Santa Maria, Serviço de Genética, Departamento de PediatriaHospital de Santa Maria, Centro Hospitalar Universitário Lisboa Norte, Centro Académico de Medicina de LisboaLisboaPortugal
| | - Damian Smedley
- William Harvey Research InstituteQueen Mary University of LondonLondonUK
| | | | - Sabrina Toro
- Department of Biochemistry and Molecular GeneticsUniversity of Colorado Anschutz Medical CampusAuroraColoradoUSA
| | - Irene Valenzuela
- Hospital Vall d'Hebron, Clinical and Molecular Genetics AreaBarcelonaSpain
| | - Nicole A. Vasilevsky
- Department of Biochemistry and Molecular GeneticsUniversity of Colorado Anschutz Medical CampusAuroraColoradoUSA
| | - Ronald J. Wapner
- Department of Obstetrics and GynecologyColumbia University Irving Medical CenterNew YorkNew YorkUSA
| | - Roni Zemet
- Department of Molecular and Human GeneticsBaylor College of MedicineHoustonTexasUSA
| | - Melissa A Haendel
- Department of Biochemistry and Molecular GeneticsUniversity of Colorado Anschutz Medical CampusAuroraColoradoUSA
| | - Peter N. Robinson
- Department of Genomic MedicineThe Jackson LaboratoryFarmingtonConnecticutUSA
| |
Collapse
|
39
|
Slavotinek A, Prasad H, Yip T, Rego S, Hoban H, Kvale M. Predicting genes from phenotypes using human phenotype ontology (HPO) terms. Hum Genet 2022; 141:1749-1760. [PMID: 35357580 DOI: 10.1007/s00439-022-02449-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 03/16/2022] [Indexed: 11/28/2022]
Abstract
The interpretation of genomic variants following whole exome sequencing (WES) can be aided using human phenotype ontology (HPO) terms to standardize clinical features and predict causative genes. We performed WES on 453 patients diagnosed prior to 18 years of age and identified 114 pathogenic (P) or likely pathogenic (LP) variants in 112 patients. We utilized PhenoDB to extract HPO terms from provider notes and then used Phen2Gene to generate a gene score and gene ranking from each list of HPO terms. We assigned Phen2Gene gene rankings to 6 rank classes, with class 1 covering raw gene rankings of 1 to 10 and class 2 covering rankings from 11 to 50 out of a total of 17,126 possible gene rankings. Phen2Gene ranked causative genes into rank class 1 or 2 in 27.7% of cases and the genes in rank class 1 were all associated with well-characterized phenotypes. We found significant associations between the gene score and the number of years, since the gene was first published, the number of HPO terms with an hierarchical depth greater or equal to 11, and the number of Online Mendelian Inheritance in Man terms associated with the phenotype and gene. We conclude that genes associated with recognizable phenotypes and terms deep in the HPO hierarchy have the best chance of producing a high gene score and ranking in class 1 to 2 using Phen2Gene software with HPO terms. Clinicians and laboratory staff should consider these results when HPO terms are employed to prioritize candidate genes.
Collapse
Affiliation(s)
- Anne Slavotinek
- Division of Genetics, Department of Pediatrics, University of California San Francisco, San Francisco, CA, USA.
| | - Hannah Prasad
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| | - Tiffany Yip
- Division of Genetics, Department of Pediatrics, University of California San Francisco, San Francisco, CA, USA
| | - Shannon Rego
- Division of Genetics, Department of Pediatrics, University of California San Francisco, San Francisco, CA, USA
| | - Hannah Hoban
- Division of Genetics, Department of Pediatrics, University of California San Francisco, San Francisco, CA, USA
| | - Mark Kvale
- Institute for Human Genetics, University of California San Francisco, San Francisco, CA, USA
| |
Collapse
|
40
|
Fujiwara T, Shin JM, Yamaguchi A. Advances in the development of PubCaseFinder, including the new application programming interface and matching algorithm. Hum Mutat 2022; 43:734-742. [PMID: 35143083 PMCID: PMC9305291 DOI: 10.1002/humu.24341] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Revised: 01/17/2022] [Accepted: 02/07/2022] [Indexed: 11/11/2022]
Abstract
Over 10,000 rare genetic diseases have been identified, and millions of newborns are affected by severe rare genetic diseases each year. A variety of Human Phenotype Ontology (HPO)-based clinical decision support systems (CDSS) and patient repositories have been developed to support clinicians in diagnosing patients with suspected rare genetic diseases. In September 2017, we released PubCaseFinder (https://pubcasefinder.dbcls.jp), a web-based CDSS that provides ranked lists of genetic and rare diseases using HPO-based phenotypic similarities, where top-listed diseases represent the most likely differential diagnosis. We also developed a Matchmaker Exchange (MME) application programming interface (API) to query PubCaseFinder, which has been adopted by several patient repositories. In this paper, we describe notable updates regarding PubCaseFinder, the GeneYenta matching algorithm implemented in PubCaseFinder, and the PubCaseFinder API. The updated GeneYenta matching algorithm improves the performance of the CDSS automated differential diagnosis function. Moreover, the updated PubCaseFinder and new API empower patient repositories participating in MME and medical professionals to actively use HPO-based resources. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Toyofumi Fujiwara
- Database Center for Life Science, Joint Support-Center for Data Science Research, Research Organization of Information and Systems, Kashiwa-shi, Chiba-ken, 277-0871, Japan
| | - Jae-Moon Shin
- Database Center for Life Science, Joint Support-Center for Data Science Research, Research Organization of Information and Systems, Kashiwa-shi, Chiba-ken, 277-0871, Japan
| | - Atsuko Yamaguchi
- Graduate School of Integrative Science and Engineering, Tokyo City University, Setagaya-ku, Tokyo, 158-8557, Japan
| |
Collapse
|
41
|
Yuan X, Wang J, Dai B, Sun Y, Zhang K, Chen F, Peng Q, Huang Y, Zhang X, Chen J, Xu X, Chuan J, Mu W, Li H, Fang P, Gong Q, Zhang P. Evaluation of phenotype-driven gene prioritization methods for Mendelian diseases. Brief Bioinform 2022; 23:6521702. [PMID: 35134823 PMCID: PMC8921623 DOI: 10.1093/bib/bbac019] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 01/10/2022] [Accepted: 01/13/2022] [Indexed: 12/31/2022] Open
Abstract
It’s challenging work to identify disease-causing genes from the next-generation sequencing (NGS) data of patients with Mendelian disorders. To improve this situation, researchers have developed many phenotype-driven gene prioritization methods using a patient’s genotype and phenotype information, or phenotype information only as input to rank the candidate’s pathogenic genes. Evaluations of these ranking methods provide practitioners with convenience for choosing an appropriate tool for their workflows, but retrospective benchmarks are underpowered to provide statistically significant results in their attempt to differentiate. In this research, the performance of ten recognized causal-gene prioritization methods was benchmarked using 305 cases from the Deciphering Developmental Disorders (DDD) project and 209 in-house cases via a relatively unbiased methodology. The evaluation results show that methods using Human Phenotype Ontology (HPO) terms and Variant Call Format (VCF) files as input achieved better overall performance than those using phenotypic data alone. Besides, LIRICAL and AMELIE, two of the best methods in our benchmark experiments, complement each other in cases with the causal genes ranked highly, suggesting a possible integrative approach to further enhance the diagnostic efficiency. Our benchmarking provides valuable reference information to the computer-assisted rapid diagnosis in Mendelian diseases and sheds some light on the potential direction of future improvement on disease-causing gene prioritization methods.
Collapse
Affiliation(s)
- Xiao Yuan
- Changsha KingMed Center for Clinical Laboratory, Changsha, China.,Guangzhou Kingmed Center for Clinical Laboratory, Guangzhou, China.,Genetalks Biotech. Co., Ltd., Changsha, China
| | - Jing Wang
- Changsha KingMed Center for Clinical Laboratory, Changsha, China
| | - Bing Dai
- Changsha KingMed Center for Clinical Laboratory, Changsha, China
| | - Yanfang Sun
- Changsha KingMed Center for Clinical Laboratory, Changsha, China
| | - Keke Zhang
- Changsha KingMed Center for Clinical Laboratory, Changsha, China
| | - Fangfang Chen
- Changsha KingMed Center for Clinical Laboratory, Changsha, China
| | - Qian Peng
- Changsha KingMed Center for Clinical Laboratory, Changsha, China
| | - Yixuan Huang
- Beijing Geneworks Technology Co., Ltd., Beijing, China
| | - Xinlei Zhang
- Reproductive & Genetics Hospital of Citic & Xiangya, Changsha, China
| | - Junru Chen
- Genetalks Biotech. Co., Ltd., Changsha, China
| | - Xilin Xu
- Guangzhou Kingmed Center for Clinical Laboratory, Guangzhou, China
| | - Jun Chuan
- Changsha KingMed Center for Clinical Laboratory, Changsha, China.,Guangzhou Kingmed Center for Clinical Laboratory, Guangzhou, China
| | - Wenbo Mu
- Guangzhou Kingmed Center for Clinical Laboratory, Guangzhou, China
| | - Huiyuan Li
- Changsha KingMed Center for Clinical Laboratory, Changsha, China.,Guangzhou Kingmed Center for Clinical Laboratory, Guangzhou, China
| | - Ping Fang
- Guangzhou Kingmed Center for Clinical Laboratory, Guangzhou, China
| | - Qiang Gong
- Changsha KingMed Center for Clinical Laboratory, Changsha, China.,Guangzhou Kingmed Center for Clinical Laboratory, Guangzhou, China
| | - Peng Zhang
- Beijing Key Laboratory for Genetics of Birth Defects, Beijing Pediatric Research Institute, Beijing Children's Hospital, Capital Medical University, National Center for Children's Health, Beijing, China
| |
Collapse
|
42
|
Ruscheinski A, Reimler AL, Ewald R, Uhrmacher AM. VPMBench: a test bench for variant prioritization methods. BMC Bioinformatics 2021; 22:543. [PMID: 34749640 PMCID: PMC8576923 DOI: 10.1186/s12859-021-04458-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Accepted: 10/23/2021] [Indexed: 11/18/2022] Open
Abstract
Background Clinical diagnostics of whole-exome and whole-genome sequencing data requires geneticists to consider thousands of genetic variants for each patient. Various variant prioritization methods have been developed over the last years to aid clinicians in identifying variants that are likely disease-causing. Each time a new method is developed, its effectiveness must be evaluated and compared to other approaches based on the most recently available evaluation data. Doing so in an unbiased, systematic, and replicable manner requires significant effort. Results The open-source test bench “VPMBench” automates the evaluation of variant prioritization methods. VPMBench introduces a standardized interface for prioritization methods and provides a plugin system that makes it easy to evaluate new methods. It supports different input data formats and custom output data preparation. VPMBench exploits declaratively specified information about the methods, e.g., the variants supported by the methods. Plugins may also be provided in a technology-agnostic manner via containerization. Conclusions VPMBench significantly simplifies the evaluation of both custom and published variant prioritization methods. As we expect variant prioritization methods to become ever more critical with the advent of whole-genome sequencing in clinical diagnostics, such tool support is crucial to facilitate methodological research.
Collapse
Affiliation(s)
- Andreas Ruscheinski
- Modeling and Simulation Group, Institute for Visual and Analytic Computing, University of Rostock, Albert-Einstein-Straße 22, 18051, Rostock, Germany.
| | - Anna Lena Reimler
- Modeling and Simulation Group, Institute for Visual and Analytic Computing, University of Rostock, Albert-Einstein-Straße 22, 18051, Rostock, Germany
| | - Roland Ewald
- Limbus Medical Technologies GmbH, Lindenstraße 2, 18055, Rostock, Germany
| | - Adelinde M Uhrmacher
- Modeling and Simulation Group, Institute for Visual and Analytic Computing, University of Rostock, Albert-Einstein-Straße 22, 18051, Rostock, Germany
| |
Collapse
|
43
|
De La Vega FM, Chowdhury S, Moore B, Frise E, McCarthy J, Hernandez EJ, Wong T, James K, Guidugli L, Agrawal PB, Genetti CA, Brownstein CA, Beggs AH, Löscher BS, Franke A, Boone B, Levy SE, Õunap K, Pajusalu S, Huentelman M, Ramsey K, Naymik M, Narayanan V, Veeraraghavan N, Billings P, Reese MG, Yandell M, Kingsmore SF. Artificial intelligence enables comprehensive genome interpretation and nomination of candidate diagnoses for rare genetic diseases. Genome Med 2021; 13:153. [PMID: 34645491 PMCID: PMC8515723 DOI: 10.1186/s13073-021-00965-0] [Citation(s) in RCA: 43] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Accepted: 08/27/2021] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND Clinical interpretation of genetic variants in the context of the patient's phenotype is becoming the largest component of cost and time expenditure for genome-based diagnosis of rare genetic diseases. Artificial intelligence (AI) holds promise to greatly simplify and speed genome interpretation by integrating predictive methods with the growing knowledge of genetic disease. Here we assess the diagnostic performance of Fabric GEM, a new, AI-based, clinical decision support tool for expediting genome interpretation. METHODS We benchmarked GEM in a retrospective cohort of 119 probands, mostly NICU infants, diagnosed with rare genetic diseases, who received whole-genome or whole-exome sequencing (WGS, WES). We replicated our analyses in a separate cohort of 60 cases collected from five academic medical centers. For comparison, we also analyzed these cases with current state-of-the-art variant prioritization tools. Included in the comparisons were trio, duo, and singleton cases. Variants underpinning diagnoses spanned diverse modes of inheritance and types, including structural variants (SVs). Patient phenotypes were extracted from clinical notes by two means: manually and using an automated clinical natural language processing (CNLP) tool. Finally, 14 previously unsolved cases were reanalyzed. RESULTS GEM ranked over 90% of the causal genes among the top or second candidate and prioritized for review a median of 3 candidate genes per case, using either manually curated or CNLP-derived phenotype descriptions. Ranking of trios and duos was unchanged when analyzed as singletons. In 17 of 20 cases with diagnostic SVs, GEM identified the causal SVs as the top candidate and in 19/20 within the top five, irrespective of whether SV calls were provided or inferred ab initio by GEM using its own internal SV detection algorithm. GEM showed similar performance in absence of parental genotypes. Analysis of 14 previously unsolved cases resulted in a novel finding for one case, candidates ultimately not advanced upon manual review for 3 cases, and no new findings for 10 cases. CONCLUSIONS GEM enabled diagnostic interpretation inclusive of all variant types through automated nomination of a very short list of candidate genes and disorders for final review and reporting. In combination with deep phenotyping by CNLP, GEM enables substantial automation of genetic disease diagnosis, potentially decreasing cost and expediting case review.
Collapse
Affiliation(s)
- Francisco M. De La Vega
- Fabric Genomics Inc., Oakland, CA USA
- Department of Biomedical Data Science, Stanford University School of Medicine, Stanford, CA USA
- Current Address: Tempus Labs Inc., Redwood City, CA 94065 USA
| | - Shimul Chowdhury
- Rady Children’s Institute for Genomic Medicine, San Diego, CA USA
| | - Barry Moore
- Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, UT USA
| | | | | | - Edgar Javier Hernandez
- Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, UT USA
| | - Terence Wong
- Rady Children’s Institute for Genomic Medicine, San Diego, CA USA
| | - Kiely James
- Rady Children’s Institute for Genomic Medicine, San Diego, CA USA
| | - Lucia Guidugli
- Rady Children’s Institute for Genomic Medicine, San Diego, CA USA
| | - Pankaj B. Agrawal
- Division of Genetics and Genomics, The Manton Center for Orphan Disease Research, Boston Children’s Hospital, Harvard Medical School, Boston, MA USA
- Division of Newborn Medicine, Boston Children’s Hospital, Boston, MA USA
| | - Casie A. Genetti
- Division of Genetics and Genomics, The Manton Center for Orphan Disease Research, Boston Children’s Hospital, Harvard Medical School, Boston, MA USA
| | - Catherine A. Brownstein
- Division of Genetics and Genomics, The Manton Center for Orphan Disease Research, Boston Children’s Hospital, Harvard Medical School, Boston, MA USA
| | - Alan H. Beggs
- Division of Genetics and Genomics, The Manton Center for Orphan Disease Research, Boston Children’s Hospital, Harvard Medical School, Boston, MA USA
| | - Britt-Sabina Löscher
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel & University Hospital Schleswig-Holstein, Kiel, Germany
| | - Andre Franke
- Institute of Clinical Molecular Biology, Christian-Albrechts-University of Kiel & University Hospital Schleswig-Holstein, Kiel, Germany
| | - Braden Boone
- HudsonAlpha Institute for Biotechnology, Huntsville, AL USA
| | - Shawn E. Levy
- HudsonAlpha Institute for Biotechnology, Huntsville, AL USA
| | - Katrin Õunap
- Department of Clinical Genetics, United Laboratories, Tartu University Hospital, Tartu, Estonia
- Department of Clinical Genetics, Institute of Clinical Medicine, University of Tartu, Tartu, Estonia
| | - Sander Pajusalu
- Department of Clinical Genetics, United Laboratories, Tartu University Hospital, Tartu, Estonia
- Department of Clinical Genetics, Institute of Clinical Medicine, University of Tartu, Tartu, Estonia
| | - Matt Huentelman
- Center for Rare Childhood Disorders, Translational Genomics Research Institute, Phoenix, AZ USA
| | - Keri Ramsey
- Center for Rare Childhood Disorders, Translational Genomics Research Institute, Phoenix, AZ USA
| | - Marcus Naymik
- Center for Rare Childhood Disorders, Translational Genomics Research Institute, Phoenix, AZ USA
| | - Vinodh Narayanan
- Center for Rare Childhood Disorders, Translational Genomics Research Institute, Phoenix, AZ USA
| | | | | | | | - Mark Yandell
- Fabric Genomics Inc., Oakland, CA USA
- Department of Human Genetics, Utah Center for Genetic Discovery, University of Utah, Salt Lake City, UT USA
| | | |
Collapse
|
44
|
Peng C, Dieck S, Schmid A, Ahmad A, Knaus A, Wenzel M, Mehnert L, Zirn B, Haack T, Ossowski S, Wagner M, Brunet T, Ehmke N, Danyel M, Rosnev S, Kamphans T, Nadav G, Fleischer N, Fröhlich H, Krawitz P. CADA: phenotype-driven gene prioritization based on a case-enriched knowledge graph. NAR Genom Bioinform 2021; 3:lqab078. [PMID: 34514393 PMCID: PMC8415429 DOI: 10.1093/nargab/lqab078] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Revised: 08/16/2021] [Accepted: 08/31/2021] [Indexed: 12/11/2022] Open
Abstract
Many rare syndromes can be well described and delineated from other disorders by a combination of characteristic symptoms. These phenotypic features are best documented with terms of the Human Phenotype Ontology (HPO), which are increasingly used in electronic health records (EHRs), too. Many algorithms that perform HPO-based gene prioritization have also been developed; however, the performance of many such tools suffers from an over-representation of atypical cases in the medical literature. This is certainly the case if the algorithm cannot handle features that occur with reduced frequency in a disorder. With Cada, we built a knowledge graph based on both case annotations and disorder annotations. Using network representation learning, we achieve gene prioritization by link prediction. Our results suggest that Cada exhibits superior performance particularly for patients that present with the pathognomonic findings of a disease. Additionally, information about the frequency of occurrence of a feature can readily be incorporated, when available. Crucial in the design of our approach is the use of the growing amount of phenotype–genotype information that diagnostic labs deposit in databases such as ClinVar. By this means, Cada is an ideal reference tool for differential diagnostics in rare disorders that can also be updated regularly.
Collapse
Affiliation(s)
- Chengyao Peng
- Institute for Genomic Statistics, University Bonn, 53129 Bonn, Germany
| | - Simon Dieck
- Institute for Genomic Statistics, University Bonn, 53129 Bonn, Germany
| | - Alexander Schmid
- Institute for Genomic Statistics, University Bonn, 53129 Bonn, Germany
| | - Ashar Ahmad
- Fraunhofer SCAI, Department of Bioinformatics, 53757 Sankt Augustin, Germany
| | - Alexej Knaus
- Institute for Genomic Statistics, University Bonn, 53129 Bonn, Germany
| | - Maren Wenzel
- Genetikum Counseling Center, 70173 Stuttgart, Germany
| | - Laura Mehnert
- Genetikum Counseling Center, 70173 Stuttgart, Germany
| | - Birgit Zirn
- Genetikum Counseling Center, 70173 Stuttgart, Germany
| | - Tobias Haack
- Institute of Medical Genetics and Applied Genomics, University Tübingen, 72076 Tübingen, Germany
| | - Stephan Ossowski
- Institute of Medical Genetics and Applied Genomics, University Tübingen, 72076 Tübingen, Germany
| | - Matias Wagner
- Institute for Human Genetics, Technical University Munich, 81675 Munich, Germany
| | - Theresa Brunet
- Institute for Human Genetics, Technical University Munich, 81675 Munich, Germany
| | - Nadja Ehmke
- Institute for Medical Genetics, Charité University Medicine, 13353 Berlin, Germany
| | - Magdalena Danyel
- Institute for Medical Genetics, Charité University Medicine, 13353 Berlin, Germany
| | | | | | | | | | - Holger Fröhlich
- Fraunhofer SCAI, Department of Bioinformatics, 53757 Sankt Augustin, Germany
| | | |
Collapse
|
45
|
Dhombres F, Charlet J. Knowledge Representation and Management: Interest in New Solutions for Ontology Curation. Yearb Med Inform 2021; 30:185-190. [PMID: 34479390 PMCID: PMC8416227 DOI: 10.1055/s-0041-1726508] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Objective:
To select, present and summarize some of the best papers in the field of Knowledge Representation and Management (KRM) published in 2020.
Methods:
A comprehensive and standardized review of the medical informatics literature was performed to select the most interesting papers of KRM published in 2020, based on PubMed queries. This review was conducted according to the IMIA Yearbook guidelines.
Results:
Four best papers were selected among 1,175 publications. In contrast with the papers selected last year, the four best papers of 2020 demonstrated a significant focus on methods and tools for ontology curation and design. The usual KRM application domains (bioinformatics, machine learning, and electronic health records) were also represented.
Conclusion:
In 2020, ontology curation emerges as a significant topic of research interest. Bioinformatics, machine learning, and electronics health records remain significant research areas in the KRM community with various applications. Knowledge representations are key to advance machine learning by providing context and to develop novel bioinformatics metrics. As in 2019, representations serve a great variety of applications across many medical domains, with actionable results and now with growing adhesion to the open science initiative.
Collapse
Affiliation(s)
- Ferdinand Dhombres
- Sorbonne Université, INSERM, Univ Sorbonne Paris Nord, LIMICS, Paris, France.,Sorbonne Université, Service de Médecine Fœtale, DMU Origyne, AP-HP, Hôpital Armand Trousseau, Paris, France
| | - Jean Charlet
- Sorbonne Université, INSERM, Univ Sorbonne Paris Nord, LIMICS, Paris, France.,AP-HP, DRCI, Paris, France
| | | |
Collapse
|
46
|
González-Ortega G, Llamas-Velasco S, Arteche-López A, Quesada-Espinosa JF, Puertas-Martín V, Gómez-Grande A, López-Álvarez J, Saiz Díaz RA, Lezana-Rosales JM, Villarejo-Galende A, González de la Aleja J. Early-Onset Dementia Associated with a Heterozygous, Nonsense, and de novo Variant in the MBD5 Gene. J Alzheimers Dis 2021; 84:73-78. [PMID: 34459404 DOI: 10.3233/jad-210648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
The haploinsufficiency of the methyl-binding domain protein 5 (MBD5) gene has been identified as the determinant cause of the neuropsychiatric disorders grouped under the name MBD5-neurodevelopment disorders (MAND). MAND includes patients with intellectual disability, behavioral problems, and seizures with a static clinical course. However, a few reports have suggested regression. We describe a non-intellectually disabled female, with previous epilepsy and personality disorder, who developed early-onset dementia. The extensive etiologic study revealed a heterozygous nonsense de novo pathogenic variant in the MBD5 gene. This finding could support including the MBD5 gene in the study of patients with atypical early-onset dementia.
Collapse
Affiliation(s)
| | - Sara Llamas-Velasco
- Department of Neurology, Hospital Universitario 12 de Octubre, Madrid, Spain.,Group of Neurodegenerative Diseases, Instituto de Investigación Hospital 12 de Octubre (I+12), Madrid, Spain.,Biomedical Research Networking Center in Neurodegenerative diseases CIBERNED, Madrid, Spain
| | - Ana Arteche-López
- Department of Genetics, Hospital Universitario 12 de Octubre, Madrid, Spain
| | | | - Verónica Puertas-Martín
- Department of Neurology, Hospital Universitario 12 de Octubre, Madrid, Spain.,Universidad Internacional de La Rioja (UNIR), Logroño, Spain
| | - Adolfo Gómez-Grande
- Department of Nuclear Medicine, Hospital Universitario 12 de Octubre, Madrid, Spain
| | - Jorge López-Álvarez
- Department of Psychiatry, Hospital Universitario 12 de Octubre, Madrid, Spain
| | - Rosa Ana Saiz Díaz
- Department of Neurology, Hospital Universitario 12 de Octubre, Madrid, Spain.,Department of Medicine, School of Medicine, Complutense University, Madrid, Spain.,Epilepsy-EEG Unit, Hospital Universitario 12 de Octubre, Madrid, Spain
| | | | - Alberto Villarejo-Galende
- Department of Neurology, Hospital Universitario 12 de Octubre, Madrid, Spain.,Group of Neurodegenerative Diseases, Instituto de Investigación Hospital 12 de Octubre (I+12), Madrid, Spain.,Biomedical Research Networking Center in Neurodegenerative diseases CIBERNED, Madrid, Spain.,Department of Medicine, School of Medicine, Complutense University, Madrid, Spain
| | - Jesús González de la Aleja
- Department of Neurology, Hospital Universitario 12 de Octubre, Madrid, Spain.,Epilepsy-EEG Unit, Hospital Universitario 12 de Octubre, Madrid, Spain
| |
Collapse
|
47
|
Pearson NM, Stolte C, Shi K, Beren F, Abul-Husn NS, Bertier G, Brown K, Diaz GA, Odgis JA, Suckiel SA, Horowitz CR, Wasserstein M, Gelb BD, Kenny EE, Gagnon C, Jobanputra V, Bloom T, Greally JM. GenomeDiver: a platform for phenotype-guided medical genomic diagnosis. Genet Med 2021; 23:1998-2002. [PMID: 34113009 PMCID: PMC8488006 DOI: 10.1038/s41436-021-01219-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2020] [Revised: 05/06/2021] [Accepted: 05/07/2021] [Indexed: 12/11/2022] Open
Abstract
Purpose: Making a diagnosis from clinical genomic sequencing requires well-structured phenotypic data to guide genotype interpretation. A patient’s phenotypic features can be documented using the Human Phenotype Ontology (HPO), generating terms used to prioritize genes potentially causing the patient’s disease. We have developed GenomeDiver to provide a user interface for clinicians that allows more effective collaboration with the clinical diagnostic laboratory, with the goal of improving the success of the diagnostic process. Methods: GenomeDiver uses genomic data to prompt reverse phenotyping of patients undergoing genetic testing, enriching the amount and quality of structured phenotype data for the diagnostic laboratory, and helping clinicians to explore and flag diseases potentially causing their patient’s presentation. Results: We show how GenomeDiver communicates the clinician’s informed insights to the diagnostic lab in the form of HPO terms for interpretation of genomic sequencing data. We describe our user-driven design process, the engineering of the software for efficiency, security and portability, and examples of the performance of GenomeDiver using genomic testing data. Conclusions: GenomeDiver is a first step in a new approach to genomic diagnostics that enhances laboratory-clinician interactions, with the goal of directly engaging clinicians to improve the outcome of genomic diagnostic testing.
Collapse
Affiliation(s)
| | - Christian Stolte
- New York Genome Center, New York, NY, USA.,Stolte Design, Islesboro, ME, USA
| | - Kevin Shi
- New York Genome Center, New York, NY, USA
| | - Faygel Beren
- Columbia University, Graduate School of Arts and Sciences, New York, NY, USA
| | - Noura S Abul-Husn
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA.,Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.,Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | | | - Kaitlyn Brown
- Division of Genetics, Department of Pediatrics, Children's Hospital at Montefiore, Bronx, NY, USA
| | - George A Diaz
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA.,Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Jacqueline A Odgis
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Sabrina A Suckiel
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | | | - Melissa Wasserstein
- Division of Genetics, Department of Pediatrics, Children's Hospital at Montefiore, Bronx, NY, USA.,Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Bruce D Gelb
- Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Eimear E Kenny
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY, USA.,Department of Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, USA.,Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | | | | | - Toby Bloom
- New York Genome Center, New York, NY, USA.,eGenesis, Inc., Cambridge, MA, USA
| | - John M Greally
- Division of Genetics, Department of Pediatrics, Children's Hospital at Montefiore, Bronx, NY, USA. .,Department of Genetics, Albert Einstein College of Medicine, Bronx, NY, USA. .,Center for Epigenomics, Albert Einstein College of Medicine, Bronx, NY, USA.
| |
Collapse
|
48
|
Köhler S, Gargano M, Matentzoglu N, Carmody LC, Lewis-Smith D, Vasilevsky NA, Danis D, Balagura G, Baynam G, Brower AM, Callahan TJ, Chute CG, Est JL, Galer PD, Ganesan S, Griese M, Haimel M, Pazmandi J, Hanauer M, Harris NL, Hartnett M, Hastreiter M, Hauck F, He Y, Jeske T, Kearney H, Kindle G, Klein C, Knoflach K, Krause R, Lagorce D, McMurry JA, Miller JA, Munoz-Torres M, Peters RL, Rapp CK, Rath AM, Rind SA, Rosenberg A, Segal MM, Seidel MG, Smedley D, Talmy T, Thomas Y, Wiafe SA, Xian J, Yüksel Z, Helbig I, Mungall CJ, Haendel MA, Robinson PN. The Human Phenotype Ontology in 2021. Nucleic Acids Res 2021; 49:D1207-D1217. [PMID: 33264411 PMCID: PMC7778952 DOI: 10.1093/nar/gkaa1043] [Citation(s) in RCA: 532] [Impact Index Per Article: 177.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 10/11/2020] [Accepted: 11/16/2020] [Indexed: 12/21/2022] Open
Abstract
The Human Phenotype Ontology (HPO, https://hpo.jax.org) was launched in 2008 to provide a comprehensive logical standard to describe and computationally analyze phenotypic abnormalities found in human disease. The HPO is now a worldwide standard for phenotype exchange. The HPO has grown steadily since its inception due to considerable contributions from clinical experts and researchers from a diverse range of disciplines. Here, we present recent major extensions of the HPO for neurology, nephrology, immunology, pulmonology, newborn screening, and other areas. For example, the seizure subontology now reflects the International League Against Epilepsy (ILAE) guidelines and these enhancements have already shown clinical validity. We present new efforts to harmonize computational definitions of phenotypic abnormalities across the HPO and multiple phenotype ontologies used for animal models of disease. These efforts will benefit software such as Exomiser by improving the accuracy and scope of cross-species phenotype matching. The computational modeling strategy used by the HPO to define disease entities and phenotypic features and distinguish between them is explained in detail.We also report on recent efforts to translate the HPO into indigenous languages. Finally, we summarize recent advances in the use of HPO in electronic health record systems.
Collapse
Affiliation(s)
| | - Michael Gargano
- Monarch Initiative
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Nicolas Matentzoglu
- Monarch Initiative
- Semanticly Ltd, London, UK
- European Bioinformatics Institute (EMBL-EBI)
| | - Leigh C Carmody
- Monarch Initiative
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - David Lewis-Smith
- Translational and Clinical Research Institute, Newcastle University, Newcastle upon Tyne, UK
- Clinical Neurosciences, Newcastle upon Tyne Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
| | - Nicole A Vasilevsky
- Monarch Initiative
- Oregon Clinical & Translational Research Institute, Oregon Health & Science University
| | | | - Ganna Balagura
- Department of Neurosciences, Rehabilitation, Ophthalmology, Genetics, and Maternal and Child Health, University of Genoa, Genoa, Italy
- Pediatric Neurology and Muscular Diseases Unit, IRCCS ‘G. Gaslini’ Institute, Genoa, Italy
| | - Gareth Baynam
- Western Australian Register of Developmental Anomalies, King Edward memorial Hospital, Perth, Australia
- Telethon Kids Institute and the Division of Paediatrics, Faculty of Helath and Medical Sciences, University of Western Australia, Perth, Australia
| | - Amy M Brower
- American College of Medical Genetics and Genomics (ACMG), Bethesda, MD, USA
| | - Tiffany J Callahan
- Computational Bioscience Program, University of Colorado Anschutz Medical Campus, Colorado, USA
| | | | - Johanna L Est
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Peter D Galer
- Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Shiva Ganesan
- Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Matthias Griese
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
- Ludwig-Maximilians University, German Center for Lung Research (DZL), Munich, Germany
| | - Matthias Haimel
- Ludwig Boltzmann Institute for Rare and Undiagnosed Diseases, Vienna, Austria
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria
| | - Julia Pazmandi
- Ludwig Boltzmann Institute for Rare and Undiagnosed Diseases, Vienna, Austria
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria
- Institute for Systems Genomics, University of Connecticut, Farmington, CT 06032, USA
| | - Marc Hanauer
- INSERM, US14––Orphanet, Plateforme Maladies Rares, Paris, France
| | - Nomi L Harris
- Monarch Initiative
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley CA, USA
| | - Michael J Hartnett
- American College of Medical Genetics and Genomics (ACMG), Bethesda, MD, USA
| | - Maximilian Hastreiter
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Fabian Hauck
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
- German Centre for Infection Research (DZIF), Munich, Germany
| | - Yongqun He
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
| | - Tim Jeske
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Hugh Kearney
- FutureNeuro, SFI Research Centre for Chronic and Rare Neurological Diseases, Ireland
| | - Gerhard Kindle
- Institute for Immunodeficiency, Center for Chronic Immunodeficiency (CCI). Faculty of Medicine, Medical Center - University of Freiburg, Freiburg, Germany
- Centre for Biobanking FREEZE, Faculty of Medicine, Medical Center - University of Freiburg, Freiburg, Germany
| | - Christoph Klein
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Katrin Knoflach
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
- Ludwig-Maximilians University, German Center for Lung Research (DZL), Munich, Germany
| | - Roland Krause
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4367 Belvaux, Luxembourg
| | - David Lagorce
- INSERM, US14––Orphanet, Plateforme Maladies Rares, Paris, France
| | - Julie A McMurry
- Monarch Initiative
- Translational and Integrative Sciences Center, Department of Environmental and Molecular Toxicology, Oregon State University, OR, USA
| | - Jillian A Miller
- American College of Medical Genetics and Genomics (ACMG), Bethesda, MD, USA
| | - Monica C Munoz-Torres
- Monarch Initiative
- Translational and Integrative Sciences Center, Department of Environmental and Molecular Toxicology, Oregon State University, OR, USA
| | - Rebecca L Peters
- American College of Medical Genetics and Genomics (ACMG), Bethesda, MD, USA
| | - Christina K Rapp
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
- Ludwig-Maximilians University, German Center for Lung Research (DZL), Munich, Germany
| | - Ana M Rath
- INSERM, US14––Orphanet, Plateforme Maladies Rares, Paris, France
| | - Shahmir A Rind
- WA Register of Developmental Anomalies
- Curtin University, Western Australia, Australia
| | - Avi Z Rosenberg
- Division of Kidney-Urologic Pathology, Johns Hopkins University, Baltimore, MD 21205, USA
| | | | - Markus G Seidel
- Research Unit for Pediatric Hematology and Immunology, Division of Pediatric Hemato-Oncology, Department of Pediatrics and Adolescent Medicine, Medical University of Graz, Graz, Austria
| | - Damian Smedley
- The William Harvey Research Institute, Charterhouse Square Barts and the London School of Medicine and Dentistry Queen Mary University of London, London EC1M 6BQ, UK
| | - Tomer Talmy
- Genomic Research Department, Emedgene Technologies, Tel Aviv, Israel
- Faculty of Medicine, Hebrew University Hadassah Medical School, Jerusalem, Israel
| | - Yarlalu Thomas
- West Australian Register of Developmental Anomalies, East Perth, WA, Australia
| | | | - Julie Xian
- Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children's Hospital of Philadelphia, PA, USA
| | - Zafer Yüksel
- Human Genetics, Bioscientia GmbH, Ingelheim, Germany
| | - Ingo Helbig
- Department of Neurology, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Christopher J Mungall
- Monarch Initiative
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley CA, USA
| | - Melissa A Haendel
- Monarch Initiative
- Oregon Clinical & Translational Research Institute, Oregon Health & Science University
- Translational and Integrative Sciences Center, Department of Environmental and Molecular Toxicology, Oregon State University, OR, USA
| | - Peter N Robinson
- Monarch Initiative
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- Institute for Systems Genomics, University of Connecticut, Farmington, CT 06032, USA
| |
Collapse
|