1
|
Jin X, He W, Liu M, Wang L, Zhang Y, Xu Y, Ma L, Huang Y, Xie M. Mining functional gene modules by multi-view NMF of phenome-genome association. BMC Genomics 2025; 23:868. [PMID: 39789452 PMCID: PMC11720361 DOI: 10.1186/s12864-024-11120-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Accepted: 12/02/2024] [Indexed: 01/12/2025] Open
Abstract
BACKGROUND Mining functional gene modules from genomic data is an important step to detect gene members of pathways or other relations such as protein-protein interactions. This work explores the plausibility of detecting functional gene modules by factorizing gene-phenotype association matrix from the phenotype ontology data rather than the conventionally used gene expression data. Recently, the hierarchical structure of phenotype ontologies has not been sufficiently utilized in gene clustering while functionally related genes are consistently associated with phenotypes on the same path in phenotype ontologies. RESULTS This work demonstrates a hierarchical Nonnegative Matrix Factorization (NMF) framework, called Consistent Multi-view Nonnegative Matrix Factorization (CMNMF), which factorizes genome-phenome association matrix at consecutive levels of the hierarchical structure in phenotype ontology to mine functional gene modules. CMNMF constrains the gene clusters from the association matrices at two consecutive levels to be consistent since the genes are annotated with both the child-level phenotypes and the parent-level phenotypes in two levels. CMNMF also restricts the identified gene clusters to be densely connected in the phenotype ontology hierarchy. In the experiments on mining functionally related genes from mouse phenotype ontology and human phenotype ontology, CMNMF effectively improves clustering performance over the baseline methods. Gene ontology enrichment analysis is also conducted to verify its practical effectiveness to reveal meaningful gene modules. CONCLUSIONS Utilizing the information in the hierarchical structure of phenotype ontology, CMNMF can identify functional gene modules with more biological significance than conventional methods. CMNMF can also be a better tool for predicting members of gene pathways and protein-protein interactions.
Collapse
Affiliation(s)
- Xu Jin
- College of Software, Nankai University, TianJin, China
| | - WenQian He
- College of Software, Nankai University, TianJin, China
| | - MingMing Liu
- College of Software, Nankai University, TianJin, China
| | - Lin Wang
- College of Software, Nankai University, TianJin, China
| | - YaoGong Zhang
- College of Software, Nankai University, TianJin, China
| | - YingJie Xu
- College of Software, Nankai University, TianJin, China
| | - Ling Ma
- College of Software, Nankai University, TianJin, China
| | - YaLou Huang
- TianJin International Joint Academy of Biomedicine, TianJin, China
| | - MaoQiang Xie
- College of Software, Nankai University, TianJin, China.
| |
Collapse
|
2
|
Beckwith MA, Danis D, Bridges Y, Jacobsen JOB, Smedley D, Robinson PN. Leveraging clinical intuition to improve accuracy of phenotype-driven prioritization. Genet Med 2025; 27:101292. [PMID: 39396132 DOI: 10.1016/j.gim.2024.101292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Revised: 10/03/2024] [Accepted: 10/04/2024] [Indexed: 10/14/2024] Open
Abstract
PURPOSE Clinical intuition is commonly incorporated into the differential diagnosis as an assessment of the likelihood of candidate diagnoses based either on the patient population being seen in a specific clinic or on the signs and symptoms of the initial presentation. Algorithms to support diagnostic sequencing in individuals with a suspected rare genetic disease do not yet incorporate intuition and instead assume that each Mendelian disease has an equal pretest probability. METHODS The LIkelihood Ratio Interpretation of Clinical AbnormaLities (LIRICAL) algorithm calculates the likelihood ratio of clinical manifestations represented by Human Phenotype Ontology terms to rank candidate diagnoses. The initial version of LIRICAL assumed an equal pretest probability for each disease in its calculation of the posttest probability (where the test is diagnostic exome or genome sequencing). We introduce Clinical Intuition for Likelihood Ratios (ClintLR), an extension of the LIRICAL algorithm that boosts the pretest probability of groups of related diseases deemed to be more likely. RESULTS The average rank of the correct diagnosis in simulations using ClintLR showed a statistically significant improvement over a range of adjustment factors. CONCLUSION ClintLR successfully encodes clinical intuition to improve ranking of rare diseases in diagnostic sequencing. ClintLR is freely available at https://github.com/TheJacksonLaboratory/ClintLR.
Collapse
Affiliation(s)
| | - Daniel Danis
- The Jackson Laboratory for Genomic Medicine, Farmington, CT
| | - Yasemin Bridges
- William Harvey Research Institute, Queen Mary University of London, London, United Kingdom
| | - Julius O B Jacobsen
- William Harvey Research Institute, Queen Mary University of London, London, United Kingdom
| | - Damian Smedley
- William Harvey Research Institute, Queen Mary University of London, London, United Kingdom
| | - Peter N Robinson
- The Jackson Laboratory for Genomic Medicine, Farmington, CT; Berlin Institute of Health, Charité-Universitätsmedizin Berlin, Berlin, Germany.
| |
Collapse
|
3
|
Kakar N, Mascarenhas S, Ali A, Azmatullah, Ijlal Haider SM, Badiger VA, Ghofrani MS, Kruse N, Hashmi SN, Pozojevic J, Balachandran S, Toft M, Malik S, Händler K, Fatima A, Iqbal Z, Shukla A, Spielmann M, Radhakrishnan P. Further evidence of biallelic NAV3 variants associated with recessive neurodevelopmental disorder with dysmorphism, developmental delay, intellectual disability, and behavioral abnormalities. Hum Genet 2025; 144:55-65. [PMID: 39708122 PMCID: PMC11754320 DOI: 10.1007/s00439-024-02718-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2024] [Accepted: 11/27/2024] [Indexed: 12/23/2024]
Abstract
Neuron navigators (NAVs) are cytoskeleton-associated proteins well known for their role in axonal guidance, neuronal migration, and neurite growth necessary for neurodevelopment. Neuron navigator 3 (NAV3) is one of the three NAV proteins highly expressed in the embryonic and adult brain. However, the role of the NAV3 gene in human disease is not well-studied. Recently, five bi-allelic and three mono-allelic variants in NAV3 were reported in 12 individuals from eight unrelated families with neurodevelopmental disorder (NDD). Here, we report five patients from three unrelated consanguineous families segregating autosomal recessive NDD. Patients have symptoms of dysmorphism, intellectual disability, developmental delay, and behavioral abnormalities. Exome sequencing (ES) was performed on two affected individuals from one large family, and one affected individual from each of the other two families. ES revealed two homozygous nonsense c.6325C > T; p.(Gln2109Ter) and c.6577C > T; p.(Arg2193Ter) and a homozygous splice site (c.243 + 1G > T) variants in the NAV3 (NM_001024383.2). Analysis of single-cell sequencing datasets from embryonic and young adult human brains revealed that NAV3 is highly expressed in the excitatory neurons, inhibitory neurons, and microglia, consistent with its role in neurodevelopment. In conclusion, in this study, we further validate biallelic protein truncating variants in NAV3 as a cause of NDD, expanding the spectrum of pathogenic variants in this newly discovered NDD gene.
Collapse
Affiliation(s)
- Naseebullah Kakar
- Institut für Humangenetik, Universitätsklinikum Schleswig-Holstein, University of Lübeck and University of Kiel, 23562, Lübeck, Germany
- Department for Biotechnology, FLS&I, BUITEMS, Quetta, Pakistan
| | - Selinda Mascarenhas
- Department of Medical Genetics, Kasturba Medical College, Manipal, Manipal Academy of Higher Education, Manipal, Karnataka, India
| | - Asmat Ali
- Department of Biological and Biomedical Science, The Aga Khan University, Stadium Road, Karachi, 78400, Pakistan
| | - Azmatullah
- Department of Zoology, Human Genetics Program, Quaid-i-Azam University, Islamabad, Pakistan
| | | | - Vaishnavi Ashok Badiger
- Department of Medical Genetics, Kasturba Medical College, Manipal, Manipal Academy of Higher Education, Manipal, Karnataka, India
| | - Mobina Shadman Ghofrani
- Institut für Humangenetik, Universitätsklinikum Schleswig-Holstein, University of Lübeck and University of Kiel, 23562, Lübeck, Germany
| | - Nathalie Kruse
- Institut für Humangenetik, Universitätsklinikum Schleswig-Holstein, University of Lübeck and University of Kiel, 23562, Lübeck, Germany
| | - Sohana Nadeem Hashmi
- Department of Biological and Biomedical Science, The Aga Khan University, Stadium Road, Karachi, 78400, Pakistan
| | - Jelena Pozojevic
- Institut für Humangenetik, Universitätsklinikum Schleswig-Holstein, University of Lübeck and University of Kiel, 23562, Lübeck, Germany
| | - Saranya Balachandran
- Institut für Humangenetik, Universitätsklinikum Schleswig-Holstein, University of Lübeck and University of Kiel, 23562, Lübeck, Germany
| | - Mathias Toft
- Institute of Clinical Medicine, University of Oslo, P.O Box 1171, 0318, Oslo, Norway
- Department of Neurology, Oslo University Hospital, Nydalen, P.O. Box 4950, 0424, Oslo, Norway
| | - Sajid Malik
- Department of Zoology, Human Genetics Program, Quaid-i-Azam University, Islamabad, Pakistan
| | - Kristian Händler
- Institut für Humangenetik, Universitätsklinikum Schleswig-Holstein, University of Lübeck and University of Kiel, 23562, Lübeck, Germany
| | - Ambrin Fatima
- Department of Biological and Biomedical Science, The Aga Khan University, Stadium Road, Karachi, 78400, Pakistan
| | - Zafar Iqbal
- Department of Neurology, Oslo University Hospital, Nydalen, P.O. Box 4950, 0424, Oslo, Norway
| | - Anju Shukla
- Department of Medical Genetics, Kasturba Medical College, Manipal, Manipal Academy of Higher Education, Manipal, Karnataka, India
| | - Malte Spielmann
- Institut für Humangenetik, Universitätsklinikum Schleswig-Holstein, University of Lübeck and University of Kiel, 23562, Lübeck, Germany.
| | - Periyasamy Radhakrishnan
- Department of Medical Genetics, Kasturba Medical College, Manipal, Manipal Academy of Higher Education, Manipal, Karnataka, India.
| |
Collapse
|
4
|
Zhang X, Jacobs KA, Raygor KP, Li S, Li J, Wang RA. Arterial endothelial deletion of hereditary hemorrhagic telangiectasia 2/ Alk1 causes epistaxis and cerebral microhemorrhage with aberrant arteries and defective smooth muscle coverage. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.11.25.622742. [PMID: 39651127 PMCID: PMC11623514 DOI: 10.1101/2024.11.25.622742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2024]
Abstract
Hereditary Hemorrhagic Telangiectasia (HHT) is an autosomal dominant vascular disorder with manifestations including severe nose bleeding and microhemorrhage in brains. Despite being the second most common inherited bleeding disorder, the pathophysiological mechanism underlying HHT-associated hemorrhage is poorly understood. HHT pathogenesis is thought to follow a Knudsonian two-hit model, requiring a second somatic mutation for lesion formation. Mutations in activin receptor-like kinase 1 ( ALK1 ) gene cause HHT type 2. We hypothesize that somatic mutation of Alk1 in arterial endothelial cells (AECs) leads to arterial defects and hemorrhage. Here, we mutated Alk1 in AECs in postnatal mice using Bmx(PAC)-Cre ERT2 and found that somatic arterial endothelial mutation of Alk1 was sufficient to induce spontaneous epistaxis and multifocal cerebral microhemorrhage. This bleeding occurred in the presence of tortuous and enlarged blood vessels, loss of arterial molecular marker Efnb2 , disorganization of vascular smooth muscle, and impaired vasoregulation. Our data suggest that arterial endothelial deletion of Alk1 leading to reduced arterial identity and disrupted vascular smooth muscle cell coverage is a plausible molecular mechanism for HHT-associated severe epistaxis. This work provides the first evidence that somatic Alk1 mutation in AECs can cause hemorrhagic vascular lesions, offering a novel preclinical model critically needed for studying HHT-associated epistaxis, and delineating an arterial mechanism to HHT pathophysiology.
Collapse
|
5
|
Huang QQ, Wigdor EM, Malawsky DS, Campbell P, Samocha KE, Chundru VK, Danecek P, Lindsay S, Marchant T, Koko M, Amanat S, Bonfanti D, Sheridan E, Radford EJ, Barrett JC, Wright CF, Firth HV, Warrier V, Strudwick Young A, Hurles ME, Martin HC. Examining the role of common variants in rare neurodevelopmental conditions. Nature 2024; 636:404-411. [PMID: 39567701 PMCID: PMC11634775 DOI: 10.1038/s41586-024-08217-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2024] [Accepted: 10/15/2024] [Indexed: 11/22/2024]
Abstract
Although rare neurodevelopmental conditions have a large Mendelian component1, common genetic variants also contribute to risk2,3. However, little is known about how this polygenic risk is distributed among patients with these conditions and their parents nor its interplay with rare variants. It is also unclear whether polygenic background affects risk directly through alleles transmitted from parents to children, or whether indirect genetic effects mediated through the family environment4 also play a role. Here we addressed these questions using genetic data from 11,573 patients with rare neurodevelopmental conditions, 9,128 of their parents and 26,869 controls. Common variants explained around 10% of variance in risk. Patients with a monogenic diagnosis had significantly less polygenic risk than those without, supporting a liability threshold model5. A polygenic score for neurodevelopmental conditions showed only a direct genetic effect. By contrast, polygenic scores for educational attainment and cognitive performance showed no direct genetic effect, but the non-transmitted alleles in the parents were correlated with the child's risk, potentially due to indirect genetic effects and/or parental assortment for these traits4. Indeed, as expected under parental assortment, we show that common variant predisposition for neurodevelopmental conditions is correlated with the rare variant component of risk. These findings indicate that future studies should investigate the possible role and nature of indirect genetic effects on rare neurodevelopmental conditions, and consider the contribution of common and rare variants simultaneously when studying cognition-related phenotypes.
Collapse
Affiliation(s)
| | | | | | - Patrick Campbell
- Wellcome Sanger Institute, Hinxton, UK
- Department of Medical and Molecular Genetics, King's College London, London, UK
| | - Kaitlin E Samocha
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - V Kartik Chundru
- Wellcome Sanger Institute, Hinxton, UK
- Institute of Biomedical and Clinical Science, University of Exeter, Exeter, UK
| | | | | | | | | | | | | | - Eamonn Sheridan
- Wellcome Sanger Institute, Hinxton, UK
- Leeds Institute of Medical Research, University of Leeds, St. James's University Hospital, Leeds, UK
- Yorkshire Regional Genetics Service, Chapel Allerton Hospital, Leeds, UK
| | - Elizabeth J Radford
- Wellcome Sanger Institute, Hinxton, UK
- Department of Paediatrics, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK
| | | | - Caroline F Wright
- Institute of Biomedical and Clinical Science, University of Exeter, Exeter, UK
| | - Helen V Firth
- Wellcome Sanger Institute, Hinxton, UK
- Cambridge University Hospitals Foundation Trust, Addenbrooke's Hospital, Cambridge, UK
| | - Varun Warrier
- Department of Psychiatry, University of Cambridge, Cambridge, UK
- Department of Psychology, University of Cambridge, Cambridge, UK
| | - Alexander Strudwick Young
- University of California Los Angeles Anderson School of Management, Los Angeles, CA, USA
- Human Genetics Department, UCLA David Geffen School of Medicine, Los Angeles, CA, USA
| | | | | |
Collapse
|
6
|
Ridsdale AM, Dickerson A, Chundru VK, Firth HV, Wright CF. Phenotypic spectrum of dual diagnoses in developmental disorders. Am J Hum Genet 2024; 111:2382-2391. [PMID: 39353430 PMCID: PMC11568748 DOI: 10.1016/j.ajhg.2024.08.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2024] [Revised: 08/30/2024] [Accepted: 08/30/2024] [Indexed: 10/04/2024] Open
Abstract
As more patients receive genome-wide sequencing, the number of individuals diagnosed with multiple monogenic conditions is increasing. We sought to investigate the relative phenotypic contribution of dual diagnoses using both manual curation and computational approaches. First, we computed 1,003,236 semantic similarity scores for all possible pairs of 1,417 genes in the Developmental Disorder Gene2Phenotype (DDG2P) database using Human Phenotype Ontology terms. Next, for 62 probands with two molecular diagnoses in the Deciphering Developmental Disorders study, we computed semantic similarity scores between the probands' phenotypes and DDG2P phenotypes associated with the two disorders and compared the results with manual attribution of proband phenotypes to none, one, or both of the genes. We found a spectrum of phenotypic similarity for dual diagnoses, both across all DDG2P genes and within dual diagnosed probands, from phenotypically distinct through blended to indistinguishable conditions. Pairwise semantic similarity scores between two DDG2P genes were a good predictor of the extent of phenotypic blending observed in probands. Dual diagnoses involving genes linked with synergistic phenotypes can result in more extreme presentations while those involving antagonistic phenotypes have spuriously high pairwise semantic similarity scores despite a potentially milder atypical presentation. We suggest that the phenotypic contribution of two molecular diagnoses may contain discrete, synergistic, or antagonistic elements. Conceptual recognition of this phenotypic spectrum is important for making a final clinico-molecular diagnosis and providing accurate genetic counseling.
Collapse
Affiliation(s)
- Alys M Ridsdale
- Department of Clinical and Biomedical Sciences, Medical School, University of Exeter, St Luke's Campus, Magdalen Road, Exeter EX1 2LU, UK
| | - Anna Dickerson
- Department of Clinical and Biomedical Sciences, Medical School, University of Exeter, St Luke's Campus, Magdalen Road, Exeter EX1 2LU, UK
| | - V Kartik Chundru
- Department of Clinical and Biomedical Sciences, Medical School, University of Exeter, St Luke's Campus, Magdalen Road, Exeter EX1 2LU, UK
| | - Helen V Firth
- East Anglian Medical Genetics Service, Clinical Genetics, Box 134, Addenbrooke's Treatment Centre, Level 6, Addenbrooke's Hospital, Hills Road, Cambridge CB2 0QQ, UK; Wellcome Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Saffron Walden CB10 1RQ, UK
| | - Caroline F Wright
- Department of Clinical and Biomedical Sciences, Medical School, University of Exeter, St Luke's Campus, Magdalen Road, Exeter EX1 2LU, UK.
| |
Collapse
|
7
|
Toro S, Anagnostopoulos AV, Bello SM, Blumberg K, Cameron R, Carmody L, Diehl AD, Dooley DM, Duncan WD, Fey P, Gaudet P, Harris NL, Joachimiak MP, Kiani L, Lubiana T, Munoz-Torres MC, O'Neil S, Osumi-Sutherland D, Puig-Barbe A, Reese JT, Reiser L, Robb SM, Ruemping T, Seager J, Sid E, Stefancsik R, Weber M, Wood V, Haendel MA, Mungall CJ. Dynamic Retrieval Augmented Generation of Ontologies using Artificial Intelligence (DRAGON-AI). J Biomed Semantics 2024; 15:19. [PMID: 39415214 PMCID: PMC11484368 DOI: 10.1186/s13326-024-00320-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2024] [Accepted: 09/08/2024] [Indexed: 10/18/2024] Open
Abstract
BACKGROUND Ontologies are fundamental components of informatics infrastructure in domains such as biomedical, environmental, and food sciences, representing consensus knowledge in an accurate and computable form. However, their construction and maintenance demand substantial resources and necessitate substantial collaboration between domain experts, curators, and ontology experts. We present Dynamic Retrieval Augmented Generation of Ontologies using AI (DRAGON-AI), an ontology generation method employing Large Language Models (LLMs) and Retrieval Augmented Generation (RAG). DRAGON-AI can generate textual and logical ontology components, drawing from existing knowledge in multiple ontologies and unstructured text sources. RESULTS We assessed performance of DRAGON-AI on de novo term construction across ten diverse ontologies, making use of extensive manual evaluation of results. Our method has high precision for relationship generation, but has slightly lower precision than from logic-based reasoning. Our method is also able to generate definitions deemed acceptable by expert evaluators, but these scored worse than human-authored definitions. Notably, evaluators with the highest level of confidence in a domain were better able to discern flaws in AI-generated definitions. We also demonstrated the ability of DRAGON-AI to incorporate natural language instructions in the form of GitHub issues. CONCLUSIONS These findings suggest DRAGON-AI's potential to substantially aid the manual ontology construction process. However, our results also underscore the importance of having expert curators and ontology editors drive the ontology generation process.
Collapse
Affiliation(s)
- Sabrina Toro
- University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | | | | | - Kai Blumberg
- Department of Agriculture, Beltsville Human Nutrition Research Center, Beltsville, MD, USA
| | | | - Leigh Carmody
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | | | | | - Petra Fey
- Northwestern University, Evanston, IL, USA
| | - Pascale Gaudet
- SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
| | - Nomi L Harris
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | | | - Leila Kiani
- Independent Scientific Information Analyst, Philadelphia, USA
| | | | | | - Shawn O'Neil
- University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | | | | | - Justin T Reese
- Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | | | - Sofia Mc Robb
- Stowers Institute for Medical Research, Kansas City, MO, USA
| | | | | | - Eric Sid
- National Center for Advancing Translational Sciences, Bethesda, MD, USA
| | - Ray Stefancsik
- European Bioinformatics Institute (EMBL-EBI), Hinxton, UK
| | - Magalie Weber
- INRAE, French National Research Institute for Agriculture, Food and Environment, UR BIA, Nantes, France
| | | | | | | |
Collapse
|
8
|
Fischer SN, Claussen ER, Kourtis S, Sdelci S, Orchard S, Hermjakob H, Kustatscher G, Drew K. hu.MAP3.0: Atlas of human protein complexes by integration of > 25,000 proteomic experiments. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.11.617930. [PMID: 39464102 PMCID: PMC11507723 DOI: 10.1101/2024.10.11.617930] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/29/2024]
Abstract
Macromolecular protein complexes carry out most functions in the cell including essential functions required for cell survival. Unfortunately, we lack the subunit composition for all human protein complexes. To address this gap we integrated >25,000 mass spectrometry experiments using a machine learning approach to identify > 15,000 human protein complexes. We show our map of protein complexes is highly accurate and more comprehensive than previous maps, placing ~75% of human proteins into their physical contexts. We globally characterize our complexes using protein co-variation data (ProteomeHD.2) and identify co-varying complexes suggesting common functional associations. Our map also generates testable functional hypotheses for 472 uncharacterized proteins which we support using AlphaFold modeling. Additionally, we use AlphaFold modeling to identify 511 mutually exclusive protein pairs in hu.MAP3.0 complexes suggesting complexes serve different functional roles depending on their subunit composition. We identify expression as the primary way cells and organisms relieve the conflict of mutually exclusive subunits. Finally, we import our complexes to EMBL-EBI's Complex Portal (https://www.ebi.ac.uk/complexportal/home) as well as provide complexes through our hu.MAP3.0 web interface (https://humap3.proteincomplexes.org/). We expect our resource to be highly impactful to the broader research community.
Collapse
Affiliation(s)
- Samantha N. Fischer
- Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL 60607
| | - Erin R. Claussen
- Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL 60607
| | - Savvas Kourtis
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Sara Sdelci
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Sandra Orchard
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Georg Kustatscher
- Wellcome Centre for Cell Biology, University of Edinburgh, Edinburgh EH9 3BF, UK
| | - Kevin Drew
- Department of Biological Sciences, University of Illinois at Chicago, Chicago, IL 60607
| |
Collapse
|
9
|
Copeland H, Low KJ, Wynn SL, Ahmed A, Arthur V, Balasubramanian M, Bennett K, Berg J, Bertoli M, Bryson L, Bucknall C, Campbell J, Chandler K, Chauhan J, Clarkson A, Coles R, Conti H, Costello P, Coupar T, Craig A, Dean J, Dillon A, Dixit A, Drew K, Eason J, Forzano F, Foulds N, Gardham A, Ghali N, Green A, Hanna W, Harrison R, Hegarty M, Higgs J, Holder M, Irving R, Jain V, Johnson K, Jolley R, Jones WD, Jones G, Joss S, Kalinauskiene R, Kanani F, Kavanagh K, Khan M, Khan N, Kivuva E, Lahiri N, Lakhani N, Lampe A, Lynch SA, Mansour S, Marsden A, Massey H, McKee S, Mohammed S, Naik S, Nesarajah M, Newbury-Ecob R, Osborne F, Parker MJ, Patterson J, Pottinger C, Prapa M, Prescott K, Quinn S, Radley JA, Robart S, Ross A, Rosti G, Sansbury FH, Sarkar A, Searle C, Shannon N, Shears D, Smithson S, Stewart H, Suri M, Tadros S, Theobald R, Thomas R, Tsoulaki O, Vasudevan P, Rodriguez MV, Vittery E, Whyte S, Woods E, Wright T, Zocche D, Firth HV, Wright CF. Large-scale evaluation of outcomes after a genetic diagnosis in children with severe developmental disorders. GENETICS IN MEDICINE OPEN 2024; 2:101864. [PMID: 39822267 PMCID: PMC11736166 DOI: 10.1016/j.gimo.2024.101864] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 06/19/2024] [Accepted: 06/20/2024] [Indexed: 01/19/2025]
Abstract
Purpose We sought to evaluate outcomes for clinical management after a genetic diagnosis from the Deciphering Developmental Disorders study. Methods Individuals in the Deciphering Developmental Disorders study who had a pathogenic/likely pathogenic genotype in the DECIPHER database were selected for inclusion (n = 5010). Clinical notes from regional clinical genetics services notes were reviewed to assess predefined clinical outcomes relating to interventions, prenatal choices, and information provision. Results Outcomes were recorded for 4237 diagnosed probands (85% of those eligible) from all 24 recruiting centers across the United Kingdom and Ireland. Clinical management was reported to have changed in 28% of affected individuals. Where individual-level interventions were recorded, additional diagnostic or screening tests were started in 903 (21%) probands through referral to a range of different clinical specialties, and stopped or avoided in a further 26 (0.6%). Disease-specific treatment was started in 85 (2%) probands, including seizure-control medications and dietary supplements, and contra-indicated medications were stopped or avoided in a further 20 (0.5%). The option of prenatal/preimplantation genetic testing was discussed with 1204 (28%) families, despite the relatively advanced age of the parents at the time of diagnosis. Importantly, condition-specific information or literature was given to 3214 (76%) families, and 880 (21%) were involved in family support groups. In the most common condition (KBG syndrome; 79 [2%] probands), clinical interventions only partially reflected the temporal development of phenotypes, highlighting the importance of consensus management guidelines and patient support groups. Conclusion Our results underscore the importance of achieving a clinico-molecular diagnosis to ensure timely onward referral of patients, enabling appropriate care and anticipatory surveillance, and for accessing relevant patient support groups.
Collapse
Affiliation(s)
- Harriet Copeland
- Peninsula Clinical Genetics, Clinical Genetics, Royal Devon University Healthcare NHS Foundation Trust, Exeter, United Kingdom
| | - Karen J. Low
- Bristol Regional Clinical Genetics Service, Level B, St Michael’s Hospital, Bristol, United Kingdom
- Centre for Academic Child Health, Bristol Medical School, University of Bristol, Bristol, United Kingdom
| | - Sarah L. Wynn
- Unique (Rare Chromosome Disorder Support Group), Oxted, Surrey, United Kingdom
| | - Ayesha Ahmed
- All Wales Medical Genomics Service, Wales Genomic Health Centre, Cardiff Edge Business Park, Whitchurch, Cardiff, United Kingdom
| | - Victoria Arthur
- Manchester Centre for Genomic Medicine, St Mary’s Hospital, Manchester, United Kingdom
| | - Meena Balasubramanian
- Sheffield Clinical Genomics Service, Sheffield Children’s NHS Foundation Trust, Sheffield, United Kingdom
| | - Katya Bennett
- Liverpool Centre for Genomic Medicine, Liverpool Women’s Hospital, Liverpool, United Kingdom
| | - Jonathan Berg
- Clinical Genetics, Human Genetics Unit, Ninewells Hospital, Dundee, United Kingdom
| | - Marta Bertoli
- Northern Genetics Service, The Newcastle upon Tyne Hospitals NHS Foundation Trust, Institute of Genetic Medicine, International Centre for Life, Newcastle upon Tyne, United Kingdom
| | - Lisa Bryson
- West of Scotland Centre for Genomic Medicine, Laboratory Medicine Building, Queen Elizabeth University Hospital, Glasgow, United Kingdom
| | - Catrin Bucknall
- All Wales Medical Genomics Service, Wales Genomic Health Centre, Cardiff Edge Business Park, Whitchurch, Cardiff, United Kingdom
| | - Jamie Campbell
- North of Scotland Regional Genetics Service, Clinical Genetics Centre, Ashgrove House, Aberdeen Royal Infirmary, Foresterhill, Aberdeen, United Kingdom
| | - Kate Chandler
- Manchester Centre for Genomic Medicine, St Mary’s Hospital, Manchester, United Kingdom
| | - Jaynee Chauhan
- Yorkshire Regional Genetics Service, Chapel Allerton Hospital, Leeds, United Kingdom
| | - Amy Clarkson
- Northern Genetics Service, The Newcastle upon Tyne Hospitals NHS Foundation Trust, Institute of Genetic Medicine, International Centre for Life, Newcastle upon Tyne, United Kingdom
| | - Rachel Coles
- North West Thames Regional Genetics Service, London North West University Healthcare NHS Trust, Northwick Park Hospital, Harrow, United Kingdom
| | - Hector Conti
- All Wales Medical Genomics Service, Wales Genomic Health Centre, Cardiff Edge Business Park, Whitchurch, Cardiff, United Kingdom
| | - Philandra Costello
- Wessex Clinical Genetics Service, Princess Anne Hospital, Southampton, United Kingdom
| | - Tessa Coupar
- Clinical Genetics, Human Genetics Unit, Ninewells Hospital, Dundee, United Kingdom
| | - Amy Craig
- South West Thames Centre for Genomics, St. George's University Hospital, Tooting, London, United Kingdom
| | - John Dean
- North of Scotland Regional Genetics Service, Clinical Genetics Centre, Ashgrove House, Aberdeen Royal Infirmary, Foresterhill, Aberdeen, United Kingdom
| | - Amy Dillon
- Manchester Centre for Genomic Medicine, St Mary’s Hospital, Manchester, United Kingdom
| | - Abhijit Dixit
- Nottingham Regional Genetics Service, Nottingham City Hospital Campus, The Gables, Nottingham, United Kingdom
| | - Kathryn Drew
- West Midlands Regional Genetics Service, Department of Clinical Genetics, Birmingham Women’s Hospital, Edgbaston, United Kingdom
| | - Jacqueline Eason
- Nottingham Regional Genetics Service, Nottingham City Hospital Campus, The Gables, Nottingham, United Kingdom
| | - Francesca Forzano
- Department of Clinical Genetics, Guy's Hospital, London, United Kingdom
| | - Nicola Foulds
- Wessex Clinical Genetics Service, Princess Anne Hospital, Southampton, United Kingdom
| | - Alice Gardham
- North West Thames Regional Genetics Service, London North West University Healthcare NHS Trust, Northwick Park Hospital, Harrow, United Kingdom
| | - Neeti Ghali
- North West Thames Regional Genetics Service, London North West University Healthcare NHS Trust, Northwick Park Hospital, Harrow, United Kingdom
| | - Andrew Green
- Department of Clinical Genetics, Children’s Health Ireland, Crumlin, Ireland
| | - William Hanna
- Northern Ireland Regional Genetics Service, Medical Genetics Department, Belfast City Hospital, Belfast, United Kingdom
| | - Rachel Harrison
- Nottingham Regional Genetics Service, Nottingham City Hospital Campus, The Gables, Nottingham, United Kingdom
| | - Mairead Hegarty
- Northern Ireland Regional Genetics Service, Medical Genetics Department, Belfast City Hospital, Belfast, United Kingdom
| | - Jenny Higgs
- Liverpool Centre for Genomic Medicine, Liverpool Women’s Hospital, Liverpool, United Kingdom
| | - Muriel Holder
- Department of Clinical Genetics, Guy's Hospital, London, United Kingdom
| | - Rachel Irving
- All Wales Medical Genomics Service, Wales Genomic Health Centre, Cardiff Edge Business Park, Whitchurch, Cardiff, United Kingdom
| | - Vani Jain
- All Wales Medical Genomics Service, Wales Genomic Health Centre, Cardiff Edge Business Park, Whitchurch, Cardiff, United Kingdom
| | - Katie Johnson
- Nottingham Regional Genetics Service, Nottingham City Hospital Campus, The Gables, Nottingham, United Kingdom
| | - Rachel Jolley
- West Midlands Regional Genetics Service, Department of Clinical Genetics, Birmingham Women’s Hospital, Edgbaston, United Kingdom
| | - Wendy D. Jones
- North East Thames Regional Genetics Service, Clinical Genetics Unit, Great Ormond Street Hospital NHS Trust, London, United Kingdom
| | - Gabriela Jones
- Nottingham Regional Genetics Service, Nottingham City Hospital Campus, The Gables, Nottingham, United Kingdom
| | - Shelagh Joss
- West of Scotland Centre for Genomic Medicine, Laboratory Medicine Building, Queen Elizabeth University Hospital, Glasgow, United Kingdom
| | | | - Farah Kanani
- Sheffield Clinical Genomics Service, Sheffield Children’s NHS Foundation Trust, Sheffield, United Kingdom
| | - Karl Kavanagh
- Department of Clinical Genetics, Children’s Health Ireland, Crumlin, Ireland
| | - Mahmudur Khan
- North West Thames Regional Genetics Service, London North West University Healthcare NHS Trust, Northwick Park Hospital, Harrow, United Kingdom
| | - Naz Khan
- Manchester Centre for Genomic Medicine, St Mary’s Hospital, Manchester, United Kingdom
| | - Emma Kivuva
- Peninsula Clinical Genetics, Clinical Genetics, Royal Devon University Healthcare NHS Foundation Trust, Exeter, United Kingdom
| | - Nayana Lahiri
- South West Thames Centre for Genomics, St. George's University Hospital, Tooting, London, United Kingdom
| | - Neeta Lakhani
- Leicestershire, Northamptonshire and Rutland Genomic Medicine Service, Leicester Royal Infirmary, University Hospitals of Leicester NHS Trust, Leicester, United Kingdom
| | - Anne Lampe
- South East Scotland Clinical Genetics Service, Western General Hospital, Edinburgh, United Kingdom
| | - Sally Ann Lynch
- Department of Clinical Genetics, Children’s Health Ireland, Crumlin, Ireland
| | - Sahar Mansour
- South West Thames Centre for Genomics, St. George's University Hospital, Tooting, London, United Kingdom
| | - Alice Marsden
- Liverpool Centre for Genomic Medicine, Liverpool Women’s Hospital, Liverpool, United Kingdom
| | - Hannah Massey
- North of Scotland Regional Genetics Service, Clinical Genetics Centre, Ashgrove House, Aberdeen Royal Infirmary, Foresterhill, Aberdeen, United Kingdom
| | - Shane McKee
- Northern Ireland Regional Genetics Service, Medical Genetics Department, Belfast City Hospital, Belfast, United Kingdom
| | - Shehla Mohammed
- Department of Clinical Genetics, Guy's Hospital, London, United Kingdom
| | - Swati Naik
- West Midlands Regional Genetics Service, Department of Clinical Genetics, Birmingham Women’s Hospital, Edgbaston, United Kingdom
| | - Mithushanaa Nesarajah
- Manchester Centre for Genomic Medicine, St Mary’s Hospital, Manchester, United Kingdom
| | - Ruth Newbury-Ecob
- Bristol Regional Clinical Genetics Service, Level B, St Michael’s Hospital, Bristol, United Kingdom
| | - Fiona Osborne
- South East Scotland Clinical Genetics Service, Western General Hospital, Edinburgh, United Kingdom
| | - Michael J. Parker
- Sheffield Clinical Genomics Service, Sheffield Children’s NHS Foundation Trust, Sheffield, United Kingdom
| | - Jenny Patterson
- West of Scotland Centre for Genomic Medicine, Laboratory Medicine Building, Queen Elizabeth University Hospital, Glasgow, United Kingdom
| | - Caroline Pottinger
- All Wales Medical Genomics Service, Wales Genomic Health Centre, Cardiff Edge Business Park, Whitchurch, Cardiff, United Kingdom
| | - Matina Prapa
- East Anglian Medical Genetics Service, Clinical Genetics, Addenbrooke’s Treatment Centre, Addenbrooke’s Hospital, Cambridge, United Kingdom
| | - Katrina Prescott
- Yorkshire Regional Genetics Service, Chapel Allerton Hospital, Leeds, United Kingdom
| | - Shauna Quinn
- Department of Clinical Genetics, Children’s Health Ireland, Crumlin, Ireland
| | - Jessica A. Radley
- North West Thames Regional Genetics Service, London North West University Healthcare NHS Trust, Northwick Park Hospital, Harrow, United Kingdom
| | - Sarah Robart
- North East Thames Regional Genetics Service, Clinical Genetics Unit, Great Ormond Street Hospital NHS Trust, London, United Kingdom
| | - Alison Ross
- North of Scotland Regional Genetics Service, Clinical Genetics Centre, Ashgrove House, Aberdeen Royal Infirmary, Foresterhill, Aberdeen, United Kingdom
| | - Giulia Rosti
- Department of Clinical Genetics, Guy's Hospital, London, United Kingdom
| | - Francis H. Sansbury
- All Wales Medical Genomics Service, Wales Genomic Health Centre, Cardiff Edge Business Park, Whitchurch, Cardiff, United Kingdom
| | - Ajoy Sarkar
- Nottingham Regional Genetics Service, Nottingham City Hospital Campus, The Gables, Nottingham, United Kingdom
| | - Claire Searle
- Nottingham Regional Genetics Service, Nottingham City Hospital Campus, The Gables, Nottingham, United Kingdom
| | - Nora Shannon
- Nottingham Regional Genetics Service, Nottingham City Hospital Campus, The Gables, Nottingham, United Kingdom
| | - Debbie Shears
- Oxford Centre for Genomic Medicine, Department of Clinical Genetics, Churchill Hospital, Headington, Oxford, United Kingdom
| | - Sarah Smithson
- Bristol Regional Clinical Genetics Service, Level B, St Michael’s Hospital, Bristol, United Kingdom
| | - Helen Stewart
- Oxford Centre for Genomic Medicine, Department of Clinical Genetics, Churchill Hospital, Headington, Oxford, United Kingdom
| | - Mohnish Suri
- Nottingham Regional Genetics Service, Nottingham City Hospital Campus, The Gables, Nottingham, United Kingdom
| | - Shereen Tadros
- North East Thames Regional Genetics Service, Clinical Genetics Unit, Great Ormond Street Hospital NHS Trust, London, United Kingdom
| | - Rachel Theobald
- Northern Genetics Service, The Newcastle upon Tyne Hospitals NHS Foundation Trust, Institute of Genetic Medicine, International Centre for Life, Newcastle upon Tyne, United Kingdom
| | - Rhian Thomas
- South West Thames Centre for Genomics, St. George's University Hospital, Tooting, London, United Kingdom
| | - Olga Tsoulaki
- Sheffield Clinical Genomics Service, Sheffield Children’s NHS Foundation Trust, Sheffield, United Kingdom
| | - Pradeep Vasudevan
- Leicestershire, Northamptonshire and Rutland Genomic Medicine Service, Leicester Royal Infirmary, University Hospitals of Leicester NHS Trust, Leicester, United Kingdom
| | | | - Emma Vittery
- Northern Genetics Service, The Newcastle upon Tyne Hospitals NHS Foundation Trust, Institute of Genetic Medicine, International Centre for Life, Newcastle upon Tyne, United Kingdom
| | - Sinead Whyte
- North East Thames Regional Genetics Service, Clinical Genetics Unit, Great Ormond Street Hospital NHS Trust, London, United Kingdom
| | - Emily Woods
- Sheffield Clinical Genomics Service, Sheffield Children’s NHS Foundation Trust, Sheffield, United Kingdom
| | - Thomas Wright
- Manchester Centre for Genomic Medicine, St Mary’s Hospital, Manchester, United Kingdom
| | - David Zocche
- Oxford Centre for Genomic Medicine, Department of Clinical Genetics, Churchill Hospital, Headington, Oxford, United Kingdom
| | - Helen V. Firth
- East Anglian Medical Genetics Service, Clinical Genetics, Addenbrooke’s Treatment Centre, Addenbrooke’s Hospital, Cambridge, United Kingdom
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Caroline F. Wright
- Department of Clinical and Biomedical Sciences, Medical School, University of Exeter, St Luke’s Campus, Exeter, United Kingdom
| | - the DDD Study28
- Peninsula Clinical Genetics, Clinical Genetics, Royal Devon University Healthcare NHS Foundation Trust, Exeter, United Kingdom
- Bristol Regional Clinical Genetics Service, Level B, St Michael’s Hospital, Bristol, United Kingdom
- Centre for Academic Child Health, Bristol Medical School, University of Bristol, Bristol, United Kingdom
- Unique (Rare Chromosome Disorder Support Group), Oxted, Surrey, United Kingdom
- All Wales Medical Genomics Service, Wales Genomic Health Centre, Cardiff Edge Business Park, Whitchurch, Cardiff, United Kingdom
- Manchester Centre for Genomic Medicine, St Mary’s Hospital, Manchester, United Kingdom
- Sheffield Clinical Genomics Service, Sheffield Children’s NHS Foundation Trust, Sheffield, United Kingdom
- Liverpool Centre for Genomic Medicine, Liverpool Women’s Hospital, Liverpool, United Kingdom
- Clinical Genetics, Human Genetics Unit, Ninewells Hospital, Dundee, United Kingdom
- Northern Genetics Service, The Newcastle upon Tyne Hospitals NHS Foundation Trust, Institute of Genetic Medicine, International Centre for Life, Newcastle upon Tyne, United Kingdom
- West of Scotland Centre for Genomic Medicine, Laboratory Medicine Building, Queen Elizabeth University Hospital, Glasgow, United Kingdom
- North of Scotland Regional Genetics Service, Clinical Genetics Centre, Ashgrove House, Aberdeen Royal Infirmary, Foresterhill, Aberdeen, United Kingdom
- Yorkshire Regional Genetics Service, Chapel Allerton Hospital, Leeds, United Kingdom
- North West Thames Regional Genetics Service, London North West University Healthcare NHS Trust, Northwick Park Hospital, Harrow, United Kingdom
- Wessex Clinical Genetics Service, Princess Anne Hospital, Southampton, United Kingdom
- South West Thames Centre for Genomics, St. George's University Hospital, Tooting, London, United Kingdom
- Nottingham Regional Genetics Service, Nottingham City Hospital Campus, The Gables, Nottingham, United Kingdom
- West Midlands Regional Genetics Service, Department of Clinical Genetics, Birmingham Women’s Hospital, Edgbaston, United Kingdom
- Department of Clinical Genetics, Guy's Hospital, London, United Kingdom
- Department of Clinical Genetics, Children’s Health Ireland, Crumlin, Ireland
- Northern Ireland Regional Genetics Service, Medical Genetics Department, Belfast City Hospital, Belfast, United Kingdom
- North East Thames Regional Genetics Service, Clinical Genetics Unit, Great Ormond Street Hospital NHS Trust, London, United Kingdom
- Leicestershire, Northamptonshire and Rutland Genomic Medicine Service, Leicester Royal Infirmary, University Hospitals of Leicester NHS Trust, Leicester, United Kingdom
- South East Scotland Clinical Genetics Service, Western General Hospital, Edinburgh, United Kingdom
- East Anglian Medical Genetics Service, Clinical Genetics, Addenbrooke’s Treatment Centre, Addenbrooke’s Hospital, Cambridge, United Kingdom
- Oxford Centre for Genomic Medicine, Department of Clinical Genetics, Churchill Hospital, Headington, Oxford, United Kingdom
- Department of Clinical and Biomedical Sciences, Medical School, University of Exeter, St Luke’s Campus, Exeter, United Kingdom
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom
| |
Collapse
|
10
|
Johnson R, Stephens AV, Mester R, Knyazev S, Kohn LA, Freund MK, Bondhus L, Hill BL, Schwarz T, Zaitlen N, Arboleda VA, Bastarache LA, Pasaniuc B, Butte MJ. Electronic health record signatures identify undiagnosed patients with common variable immunodeficiency disease. Sci Transl Med 2024; 16:eade4510. [PMID: 38691621 PMCID: PMC11402387 DOI: 10.1126/scitranslmed.ade4510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 04/10/2024] [Indexed: 05/03/2024]
Abstract
Human inborn errors of immunity include rare disorders entailing functional and quantitative antibody deficiencies due to impaired B cells called the common variable immunodeficiency (CVID) phenotype. Patients with CVID face delayed diagnoses and treatments for 5 to 15 years after symptom onset because the disorders are rare (prevalence of ~1/25,000), and there is extensive heterogeneity in CVID phenotypes, ranging from infections to autoimmunity to inflammatory conditions, overlapping with other more common disorders. The prolonged diagnostic odyssey drives excessive system-wide costs before diagnosis. Because there is no single causal mechanism, there are no genetic tests to definitively diagnose CVID. Here, we present PheNet, a machine learning algorithm that identifies patients with CVID from their electronic health records (EHRs). PheNet learns phenotypic patterns from verified CVID cases and uses this knowledge to rank patients by likelihood of having CVID. PheNet could have diagnosed more than half of our patients with CVID 1 or more years earlier than they had been diagnosed. When applied to a large EHR dataset, followed by blinded chart review of the top 100 patients ranked by PheNet, we found that 74% were highly probable to have CVID. We externally validated PheNet using >6 million records from disparate medical systems in California and Tennessee. As artificial intelligence and machine learning make their way into health care, we show that algorithms such as PheNet can offer clinical benefits by expediting the diagnosis of rare diseases.
Collapse
Affiliation(s)
- Ruth Johnson
- Department of Computer Science, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Pathology and Laboratory Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Alexis V. Stephens
- Department of Pediatrics, Division of Immunology, Allergy and Rheumatology, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Rachel Mester
- Department of Computer Science, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Sergey Knyazev
- Department of Pathology and Laboratory Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Lisa A. Kohn
- Department of Pediatrics, Division of Immunology, Allergy and Rheumatology, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Malika K. Freund
- Department of Human Genetics, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Leroy Bondhus
- Department of Human Genetics, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Brian L. Hill
- Department of Computer Science, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Tommer Schwarz
- Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Noah Zaitlen
- Department of Neurology, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Valerie A. Arboleda
- Department of Pathology and Laboratory Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Human Genetics, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Computational Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Lisa A. Bastarache
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN, USA 37203
| | - Bogdan Pasaniuc
- Department of Pathology and Laboratory Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Human Genetics, University of California Los Angeles, Los Angeles, CA 90095, USA
- Bioinformatics Interdepartmental Program, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Computational Medicine, University of California Los Angeles, Los Angeles, CA 90095, USA
| | - Manish J. Butte
- Department of Pediatrics, Division of Immunology, Allergy and Rheumatology, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Human Genetics, University of California Los Angeles, Los Angeles, CA 90095, USA
- Department of Microbiology, Immunology, and Molecular Genetics, University of California Los Angeles, Los Angeles, CA 90095, USA
| |
Collapse
|
11
|
Lee B, Nasanovsky L, Shen L, Maglinte DT, Pan Y, Gai X, Schmidt RJ, Raca G, Biegel JA, Roytman M, An P, Saunders CJ, Farrow EG, Shams S, Ji J. Significance Associated with Phenotype Score Aids in Variant Prioritization for Exome Sequencing Analysis. J Mol Diagn 2024; 26:337-348. [PMID: 38360210 DOI: 10.1016/j.jmoldx.2024.01.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 12/04/2023] [Accepted: 01/29/2024] [Indexed: 02/17/2024] Open
Abstract
Several in silico annotation-based methods have been developed to prioritize variants in exome sequencing analysis. This study introduced a novel metric Significance Associated with Phenotypes (SAP) score, which generates a statistical score by comparing an individual's observed phenotypes against existing gene-phenotype associations. To evaluate the SAP score, a retrospective analysis was performed on 219 exomes. Among them, 82 family-based and 35 singleton exomes had at least one disease-causing variant that explained the patient's clinical features. SAP scores were calculated, and the rank of the disease-causing variant was compared with a known method, Exomiser. Using the SAP score, the known causative variant was ranked in the top 10 retained variants for 94% (77 of 82) of the family-based exomes and in first place for 73% of these cases. For singleton exomes, the SAP score analysis ranked the known pathogenic variants within the top 10 for 80% (28 of 35) of cases. The SAP score, which is independent of detected variants, demonstrates comparable performance with Exomiser, which considers both phenotype and variant-level evidence simultaneously. Among 102 cases with negative results or variants of uncertain significance, SAP score analysis revealed two cases with a potential new diagnosis based on rank. The SAP score, a phenotypic quantitative metric, can be used in conjunction with standard variant filtration and annotation to enhance variant prioritization in exome analysis.
Collapse
Affiliation(s)
- Brian Lee
- Bionano Genomics, San Diego, California
| | | | - Lishuang Shen
- Center for Personalized Medicine, Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Los Angeles, California
| | - Dennis T Maglinte
- Center for Personalized Medicine, Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Los Angeles, California
| | - Yachen Pan
- Center for Personalized Medicine, Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Los Angeles, California
| | - Xiaowu Gai
- Center for Personalized Medicine, Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Los Angeles, California; Department of Pathology, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Ryan J Schmidt
- Center for Personalized Medicine, Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Los Angeles, California; Department of Pathology, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Gordana Raca
- Center for Personalized Medicine, Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Los Angeles, California; Department of Pathology, Keck School of Medicine, University of Southern California, Los Angeles, California
| | - Jaclyn A Biegel
- Center for Personalized Medicine, Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Los Angeles, California; Department of Pathology, Keck School of Medicine, University of Southern California, Los Angeles, California
| | | | - Paul An
- Bionano Genomics, San Diego, California
| | - Carol J Saunders
- Department of Pathology and Laboratory Medicine, Children's Mercy Hospital, Kansas City, Missouri; University of Missouri-Kansas City School of Medicine, Kansas City, Missouri
| | - Emily G Farrow
- Department of Pathology and Laboratory Medicine, Children's Mercy Hospital, Kansas City, Missouri; University of Missouri-Kansas City School of Medicine, Kansas City, Missouri
| | | | - Jianling Ji
- Center for Personalized Medicine, Department of Pathology and Laboratory Medicine, Children's Hospital Los Angeles, Los Angeles, California; Department of Pathology, Keck School of Medicine, University of Southern California, Los Angeles, California.
| |
Collapse
|
12
|
Gravel B, Renaux A, Papadimitriou S, Smits G, Nowé A, Lenaerts T. Prioritization of oligogenic variant combinations in whole exomes. Bioinformatics 2024; 40:btae184. [PMID: 38603604 PMCID: PMC11037482 DOI: 10.1093/bioinformatics/btae184] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 01/29/2024] [Accepted: 04/10/2024] [Indexed: 04/13/2024] Open
Abstract
MOTIVATION Whole exome sequencing (WES) has emerged as a powerful tool for genetic research, enabling the collection of a tremendous amount of data about human genetic variation. However, properly identifying which variants are causative of a genetic disease remains an important challenge, often due to the number of variants that need to be screened. Expanding the screening to combinations of variants in two or more genes, as would be required under the oligogenic inheritance model, simply blows this problem out of proportion. RESULTS We present here the High-throughput oligogenic prioritizer (Hop), a novel prioritization method that uses direct oligogenic information at the variant, gene and gene pair level to detect digenic variant combinations in WES data. This method leverages information from a knowledge graph, together with specialized pathogenicity predictions in order to effectively rank variant combinations based on how likely they are to explain the patient's phenotype. The performance of Hop is evaluated in cross-validation on 36 120 synthetic exomes for training and 14 280 additional synthetic exomes for independent testing. Whereas the known pathogenic variant combinations are found in the top 20 in approximately 60% of the cross-validation exomes, 71% are found in the same ranking range when considering the independent set. These results provide a significant improvement over alternative approaches that depend simply on a monogenic assessment of pathogenicity, including early attempts for digenic ranking using monogenic pathogenicity scores. AVAILABILITY AND IMPLEMENTATION Hop is available at https://github.com/oligogenic/HOP.
Collapse
Affiliation(s)
- Barbara Gravel
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, 1050 Brussels, Belgium
- Department of Computer Science, Machine Learning Group, Université Libre de Bruxelles, 1050 Brussels, Belgium
- Department of Computer Science, Artificial Intelligence Laboratory, Vrije Universiteit Brussels, 1050 Brussels, Belgium
| | - Alexandre Renaux
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, 1050 Brussels, Belgium
- Department of Computer Science, Machine Learning Group, Université Libre de Bruxelles, 1050 Brussels, Belgium
- Department of Computer Science, Artificial Intelligence Laboratory, Vrije Universiteit Brussels, 1050 Brussels, Belgium
| | - Sofia Papadimitriou
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, 1050 Brussels, Belgium
- Department of Computer Science, Machine Learning Group, Université Libre de Bruxelles, 1050 Brussels, Belgium
- Brussels Interuniversity Genomics High Throughput core (BRIGHTcore), UZ Brussel, Vrije Universiteit Brussel (VUB) - Université Libre de Bruxelles (ULB), 1090 Brussels, Belgium
| | - Guillaume Smits
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, 1050 Brussels, Belgium
- Center of Human Genetics, Hôpital Erasme, Hôpital Universitaire de Bruxelles, Université Libre de Bruxelles, 1070 Brussels, Belgium
| | - Ann Nowé
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, 1050 Brussels, Belgium
- Department of Computer Science, Artificial Intelligence Laboratory, Vrije Universiteit Brussels, 1050 Brussels, Belgium
| | - Tom Lenaerts
- Interuniversity Institute of Bioinformatics in Brussels, Université Libre de Bruxelles-Vrije Universiteit Brussel, 1050 Brussels, Belgium
- Department of Computer Science, Machine Learning Group, Université Libre de Bruxelles, 1050 Brussels, Belgium
- Department of Computer Science, Artificial Intelligence Laboratory, Vrije Universiteit Brussels, 1050 Brussels, Belgium
| |
Collapse
|
13
|
Kim HH, Kim DW, Woo J, Lee K. Explicable prioritization of genetic variants by integration of rule-based and machine learning algorithms for diagnosis of rare Mendelian disorders. Hum Genomics 2024; 18:28. [PMID: 38509596 PMCID: PMC10956189 DOI: 10.1186/s40246-024-00595-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Accepted: 03/03/2024] [Indexed: 03/22/2024] Open
Abstract
BACKGROUND In the process of finding the causative variant of rare diseases, accurate assessment and prioritization of genetic variants is essential. Previous variant prioritization tools mainly depend on the in-silico prediction of the pathogenicity of variants, which results in low sensitivity and difficulty in interpreting the prioritization result. In this study, we propose an explainable algorithm for variant prioritization, named 3ASC, with higher sensitivity and ability to annotate evidence used for prioritization. 3ASC annotates each variant with the 28 criteria defined by the ACMG/AMP genome interpretation guidelines and features related to the clinical interpretation of the variants. The system can explain the result based on annotated evidence and feature contributions. RESULTS We trained various machine learning algorithms using in-house patient data. The performance of variant ranking was assessed using the recall rate of identifying causative variants in the top-ranked variants. The best practice model was a random forest classifier that showed top 1 recall of 85.6% and top 3 recall of 94.4%. The 3ASC annotates the ACMG/AMP criteria for each genetic variant of a patient so that clinical geneticists can interpret the result as in the CAGI6 SickKids challenge. In the challenge, 3ASC identified causal genes for 10 out of 14 patient cases, with evidence of decreased gene expression for 6 cases. Among them, two genes (HDAC8 and CASK) had decreased gene expression profiles confirmed by transcriptome data. CONCLUSIONS 3ASC can prioritize genetic variants with higher sensitivity compared to previous methods by integrating various features related to clinical interpretation, including features related to false positive risk such as quality control and disease inheritance pattern. The system allows interpretation of each variant based on the ACMG/AMP criteria and feature contribution assessed using explainable AI techniques.
Collapse
Affiliation(s)
- Ho Heon Kim
- Research and Development Center, 3billion, 14th floor, 416 Teheran-ro, Gangnam-gu, Seoul, 06193, Republic of Korea
| | - Dong-Wook Kim
- Research and Development Center, 3billion, 14th floor, 416 Teheran-ro, Gangnam-gu, Seoul, 06193, Republic of Korea
| | - Junwoo Woo
- Research and Development Center, 3billion, 14th floor, 416 Teheran-ro, Gangnam-gu, Seoul, 06193, Republic of Korea
| | - Kyoungyeul Lee
- Research and Development Center, 3billion, 14th floor, 416 Teheran-ro, Gangnam-gu, Seoul, 06193, Republic of Korea.
| |
Collapse
|
14
|
Coban-Akdemir Z, Song X, Ceballos FC, Pehlivan D, Karaca E, Bayram Y, Mitani T, Gambin T, Bozkurt-Yozgatli T, Jhangiani SN, Muzny DM, Lewis RA, Liu P, Boerwinkle E, Hamosh A, Gibbs RA, Sutton VR, Sobreira N, Carvalho CM, Shaw CA, Posey JE, Valle D, Lupski JR. The impact of the Turkish population variome on the genomic architecture of rare disease traits. GENETICS IN MEDICINE OPEN 2024; 2:101830. [PMID: 39669594 PMCID: PMC11613692 DOI: 10.1016/j.gimo.2024.101830] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 02/03/2024] [Accepted: 02/07/2024] [Indexed: 12/14/2024]
Abstract
Purpose The variome of the Turkish (TK) population, a population with a considerable history of admixture and consanguinity, has not been deeply investigated for insights on the genomic architecture of disease. Methods We generated and analyzed a database of variants derived from exome sequencing data of 773 TK unrelated, clinically affected individuals with various suspected Mendelian disease traits and 643 unaffected relatives. Results Using uniform manifold approximation and projection, we showed that the TK genomes are more similar to those of Europeans and consist of 2 main subpopulations: clusters 1 and 2 (N = 235 and 1181, respectively), which differ in admixture proportion and variome (https://turkishvariomedb.shinyapps.io/tvdb/). Furthermore, the higher inbreeding coefficient values observed in the TK affected compared with unaffected individuals correlated with a larger median span of long-sized (>2.64 Mb) runs of homozygosity (ROH) regions (P value = 2.09e-18). We show that long-sized ROHs are more likely to be formed on recently configured haplotypes enriched for rare homozygous deleterious variants in the TK affected compared with TK unaffected individuals (P value = 3.35e-11). Analysis of genotype-phenotype correlations reveals that genes with rare homozygous deleterious variants in long-sized ROHs provide the most comprehensive set of molecular diagnoses for the observed disease traits with a systematic quantitative analysis of Human Phenotype Ontology terms. Conclusion Our findings support the notion that novel rare variants on newly configured haplotypes arising within the recent past generations of a family or clan contribute significantly to recessive disease traits in the TK population.
Collapse
Affiliation(s)
- Zeynep Coban-Akdemir
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX
| | - Xiaofei Song
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
- Department of Biostatistics and Bioinformatics, H. Lee Moffitt Cancer Center & Research Institute, Tampa, FL
| | | | - Davut Pehlivan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
- Section of Neurology, Department of Pediatrics, Baylor College of Medicine, Houston, TX
| | - Ender Karaca
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
- Department of Pathology, Baylor University Medical Center, Dallas, TX
- Texas A&M School of Medicine, Texas A&M University, Dallas, TX
| | - Yavuz Bayram
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
- Division of Genomic Diagnostics, Department of Pathology and Laboratory Medicine, Children's Hospital of Philadelphia, Philadelphia, PA
- Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA
| | - Tadahiro Mitani
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
| | - Tomasz Gambin
- Institute of Computer Science, Warsaw University of Technology, Warsaw, Poland
| | - Tugce Bozkurt-Yozgatli
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX
- Department of Biostatistics and Bioinformatics, Institute of Health Sciences, Acibadem Mehmet Ali Aydinlar University, Istanbul, Turkey
| | | | - Donna M. Muzny
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX
| | - Richard A. Lewis
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
- Department of Pediatrics, Baylor College of Medicine, Houston, TX
- Department of Ophthalmology, Cullen Eye Institute, Baylor College of Medicine, Houston, TX
| | - Pengfei Liu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
| | - Eric Boerwinkle
- Human Genetics Center, Department of Epidemiology, Human Genetics, and Environmental Sciences, School of Public Health, The University of Texas Health Science Center at Houston, Houston, TX
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX
| | - Ada Hamosh
- McKusick-Nathans Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD
| | - Richard A. Gibbs
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX
| | - V. Reid Sutton
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
- Texas Children’s Hospital, Houston, TX
| | - Nara Sobreira
- McKusick-Nathans Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD
| | - Claudia M.B. Carvalho
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
- Pacific Northwest Research Institute, Seattle, WA
| | - Chad A. Shaw
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
- Baylor Genetics, Houston, TX
| | - Jennifer E. Posey
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
| | - David Valle
- McKusick-Nathans Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD
| | - James R. Lupski
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX
- Department of Pediatrics, Baylor College of Medicine, Houston, TX
- Texas Children’s Hospital, Houston, TX
| |
Collapse
|
15
|
Wyrwoll MJ, van der Heijden GW, Krausz C, Aston KI, Kliesch S, McLachlan R, Ramos L, Conrad DF, O'Bryan MK, Veltman JA, Tüttelmann F. Improved phenotypic classification of male infertility to promote discovery of genetic causes. Nat Rev Urol 2024; 21:91-101. [PMID: 37723288 DOI: 10.1038/s41585-023-00816-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/16/2023] [Indexed: 09/20/2023]
Abstract
An increasing number of genes are being described in the context of non-syndromic male infertility. Linking the underlying genetic causes of non-syndromic male infertility with clinical data from patients is important to establish new genotype-phenotype correlations. This process can be facilitated by using universal nomenclature, but no standardized vocabulary is available in the field of non-syndromic male infertility. The International Male Infertility Genomics Consortium aimed at filling this gap, providing a standardized vocabulary containing nomenclature based on the Human Phenotype Ontology (HPO). The "HPO tree" was substantially revised compared with the previous version and is based on the clinical work-up of infertile men, including physical examination and hormonal assessment. Some causes of male infertility can already be suspected based on the patient's clinical history, whereas in other instances, a testicular biopsy is needed for diagnosis. We assembled 49 HPO terms that are linked in a logical hierarchy and showed examples of morphological features of spermatozoa and testicular histology of infertile men with identified genetic diagnoses to describe the phenotypes. This work will help to record patients' phenotypes systematically and facilitate communication between geneticists and andrologists. Collaboration across institutions will improve the identification of patients with the same phenotypes, which will promote the discovery of novel genetic causes for non-syndromic male infertility.
Collapse
Affiliation(s)
- Margot J Wyrwoll
- Institute of Reproductive Genetics, University of Münster, Münster, Germany
| | | | - Csilla Krausz
- Department of Biomedical, Experimental and Clinical Sciences "Mario Serio", University of Florence, University Hospital of Careggi (AOUC), Florence, Italy
| | - Kenneth I Aston
- Andrology and IVF Laboratory, Department of Surgery (Urology), University of Utah, Salt Lake City, UT, USA
| | - Sabine Kliesch
- Centre of Reproductive Medicine and Andrology, Department of Clinical and Surgical Andrology, University of Münster, Münster, Germany
| | - Robert McLachlan
- Department of Clinical Research, Hudson Institute of Medical Research, Melbourne, Victoria, Australia
| | - Liliana Ramos
- Department of Obstetrics and Gynecology, Radboud University Medical Center, Nijmegen, Netherlands
| | - Donald F Conrad
- Department of Genetics, Oregon National Primate Research Center, Oregon Health and Science University, Beaverton, OR, USA
| | - Moira K O'Bryan
- School of BioSciences and Bio21 Institute, The University of Melbourne, Parkville, Victoria, Australia
| | - Joris A Veltman
- Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - Frank Tüttelmann
- Institute of Reproductive Genetics, University of Münster, Münster, Germany.
| |
Collapse
|
16
|
Gargano MA, Matentzoglu N, Coleman B, Addo-Lartey EB, Anagnostopoulos A, Anderton J, Avillach P, Bagley AM, Bakštein E, Balhoff JP, Baynam G, Bello SM, Berk M, Bertram H, Bishop S, Blau H, Bodenstein DF, Botas P, Boztug K, Čady J, Callahan TJ, Cameron R, Carbon S, Castellanos F, Caufield JH, Chan LE, Chute C, Cruz-Rojo J, Dahan-Oliel N, Davids JR, de Dieuleveult M, de Souza V, de Vries BBA, de Vries E, DePaulo JR, Derfalvi B, Dhombres F, Diaz-Byrd C, Dingemans AJM, Donadille B, Duyzend M, Elfeky R, Essaid S, Fabrizzi C, Fico G, Firth HV, Freudenberg-Hua Y, Fullerton JM, Gabriel DL, Gilmour K, Giordano J, Goes FS, Moses RG, Green I, Griese M, Groza T, Gu W, Guthrie J, Gyori B, Hamosh A, Hanauer M, Hanušová K, He Y(O, Hegde H, Helbig I, Holasová K, Hoyt CT, Huang S, Hurwitz E, Jacobsen JOB, Jiang X, Joseph L, Keramatian K, King B, Knoflach K, Koolen DA, Kraus M, Kroll C, Kusters M, Ladewig MS, Lagorce D, Lai MC, Lapunzina P, Laraway B, Lewis-Smith D, Li X, Lucano C, Majd M, Marazita ML, Martinez-Glez V, McHenry TH, McInnis MG, McMurry JA, Mihulová M, Millett CE, Mitchell PB, Moslerová V, Narutomi K, Nematollahi S, Nevado J, Nierenberg AA, Čajbiková NN, Nurnberger JI, Ogishima S, Olson D, Ortiz A, Pachajoa H, Perez de Nanclares G, Peters A, Putman T, Rapp CK, Rath A, Reese J, Rekerle L, Roberts A, Roy S, Sanders SJ, Schuetz C, Schulte EC, Schulze TG, Schwarz M, Scott K, Seelow D, Seitz B, Shen Y, Similuk MN, Simon ES, Singh B, Smedley D, Smith CL, Smolinsky JT, Sperry S, Stafford E, Stefancsik R, Steinhaus R, Strawbridge R, Sundaramurthi JC, Talapova P, Tenorio Castano JA, Tesner P, Thomas RH, Thurm A, Turnovec M, van Gijn ME, Vasilevsky NA, Vlčková M, Walden A, Wang K, Wapner R, Ware JS, Wiafe AA, Wiafe SA, Wiggins LD, Williams AE, Wu C, Wyrwoll MJ, Xiong H, Yalin N, Yamamoto Y, Yatham LN, Yocum AK, Young AH, Yüksel Z, Zandi PP, Zankl A, Zarante I, Zvolský M, Toro S, Carmody LC, Harris NL, Munoz-Torres MC, Danis D, Mungall CJ, Köhler S, Haendel MA, Robinson PN. The Human Phenotype Ontology in 2024: phenotypes around the world. Nucleic Acids Res 2024; 52:D1333-D1346. [PMID: 37953324 PMCID: PMC10767975 DOI: 10.1093/nar/gkad1005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 10/12/2023] [Accepted: 10/19/2023] [Indexed: 11/14/2023] Open
Abstract
The Human Phenotype Ontology (HPO) is a widely used resource that comprehensively organizes and defines the phenotypic features of human disease, enabling computational inference and supporting genomic and phenotypic analyses through semantic similarity and machine learning algorithms. The HPO has widespread applications in clinical diagnostics and translational research, including genomic diagnostics, gene-disease discovery, and cohort analytics. In recent years, groups around the world have developed translations of the HPO from English to other languages, and the HPO browser has been internationalized, allowing users to view HPO term labels and in many cases synonyms and definitions in ten languages in addition to English. Since our last report, a total of 2239 new HPO terms and 49235 new HPO annotations were developed, many in collaboration with external groups in the fields of psychiatry, arthrogryposis, immunology and cardiology. The Medical Action Ontology (MAxO) is a new effort to model treatments and other measures taken for clinical management. Finally, the HPO consortium is contributing to efforts to integrate the HPO and the GA4GH Phenopacket Schema into electronic health records (EHRs) with the goal of more standardized and computable integration of rare disease data in EHRs.
Collapse
Affiliation(s)
| | | | - Ben Coleman
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | | | - Joel Anderton
- Center for Craniofacial and Dental Genetics, Department of Oral and Craniofacial Sciences, School of Dental Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | | | - Anita M Bagley
- Shriners Children's Northern California, Sacramento, CA, USA
| | - Eduard Bakštein
- National Institute of Mental Health, Klecany, Czech Republic
| | - James P Balhoff
- Renaissance Computing Institute, University of North Carolina, Chapel Hill, NC 27517, USA
| | - Gareth Baynam
- Rare Care Centre, Perth Children's Hospital, Perth, Australia
| | | | - Michael Berk
- Deakin University, IMPACT - the Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Barwon Health, Geelong, Australia
| | - Holli Bertram
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | - Somer Bishop
- Department of Psychiatry and Behavioral Sciences, UCSF Weil Institute for Neuroscience, San Francisco, CA, USA
| | - Hannah Blau
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - David F Bodenstein
- Department of Pharmacology and Toxicology, University of Toronto, Toronto, ON, Canada
| | | | - Kaan Boztug
- St. Anna Children's Cancer Research Institute (CCRI), Vienna, Austria
| | - Jolana Čady
- Institute of Health Information and Statistics of the Czech Republic, Prague, Czech Republic
| | - Tiffany J Callahan
- Department of Biomedical Informatics, Columbia University Irving Medical Center, NY, NY, USA
| | | | - Seth J Carbon
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | | | - J Harry Caufield
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Lauren E Chan
- College of Public Health and Human Sciences, Oregon State University, Corvallis, OR 97331, USA
| | - Christopher G Chute
- Schools of Medicine, Public Health, and Nursing, Johns Hopkins University, Baltimore, MD 21287, USA
| | - Jaime Cruz-Rojo
- UDISGEN (Dysmorphology and Genetics Unit), 12 de Octubre Hospital, Madrid, Spain
| | - Noémi Dahan-Oliel
- Department of Clinical Research, Shriners Hospitals for Children, Montreal, Quebec, Canada
| | - Jon R Davids
- Shriners Children's Northern California, Sacramento, CA, USA
| | - Maud de Dieuleveult
- Département I&D, AP-HP, Banque Nationale de Données Maladies Rares, Paris, France
| | - Vinicius de Souza
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Bert B A de Vries
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, Netherlands
| | | | - J Raymond DePaulo
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Beata Derfalvi
- Department of Pediatrics, Dalhousie University, Halifax, NS, Canada
| | - Ferdinand Dhombres
- Fetal Medicine Department, Armand Trousseau Hospital, Sorbonne University, GRC26, INSERM, Limics, Paris, France
| | - Claudia Diaz-Byrd
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | - Alexander J M Dingemans
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, Netherlands
| | - Bruno Donadille
- St Antoine Hospital, Reference Center for Rare Growth Endocrine Disorders, Sorbonne University, AP-HP, INSERM, US14 - Orphanet, Plateforme Maladies Rares, Paris, France
| | | | - Reem Elfeky
- Department of Immunology, GOS Hospital for Children NHS Foundation Trust, University College London, London, UK
| | - Shahim Essaid
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | | | - Giovanna Fico
- Bipolar and Depressive Disorders Unit, Institute of Neuroscience, Hospital Clinic, University of Barcelona, IDIBAPS, CIBERSAM, Barcelona, Catalonia, Spain
| | - Helen V Firth
- Addenbrooke's Hospital, Cambridge University Hospitals, Cambridge, UK
| | - Yun Freudenberg-Hua
- Department of Psychiatry, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA
| | | | - Davera L Gabriel
- School of Medicine, Johns Hopkins University, Baltimore, MD 21287, USA
| | | | - Jessica Giordano
- Department of Obstetrics and Gynecology, Columbia University Irving Medical Center, New York, NY, USA
| | - Fernando S Goes
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Rachel Gore Moses
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Ian Green
- SNOMED International, London W2 6BD, UK
| | - Matthias Griese
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, German center for Lung research (DZL), Munich, Germany
| | - Tudor Groza
- Rare Care Centre, Perth Children's Hospital, Perth, Australia
| | | | - Julia Guthrie
- Department of Structural and Computational Biology, University of Vienna; Max Perutz Labs, Vienna, Austria
| | - Benjamin Gyori
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Ada Hamosh
- Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Marc Hanauer
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, Paris, France
| | - Kateřina Hanušová
- Institute of Health Information and Statistics of the Czech Republic, Prague, Czech Republic
| | | | - Harshad Hegde
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Ingo Helbig
- Neurology, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Kateřina Holasová
- Institute of Health Information and Statistics of the Czech Republic, Prague, Czech Republic
| | - Charles Tapley Hoyt
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | | | - Eric Hurwitz
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Julius O B Jacobsen
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | | | - Lisa Joseph
- Neurodevelopmental and Behavioral Phenotyping Service, National Institute of Mental Health, Bethesda, MD, USA
| | - Kamyar Keramatian
- Department of Psychiatry, University of British Columbia, Vancouver, BC, Canada
| | - Bryan King
- Department of Psychiatry and Behavioral Sciences, UCSF Weil Institute for Neuroscience, San Francisco, CA, USA
| | - Katrin Knoflach
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, German center for Lung research (DZL), Munich, Germany
| | - David A Koolen
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, Netherlands
| | - Megan L Kraus
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Carlo Kroll
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | - Maaike Kusters
- Immunology, NIHR Great Ormond Street Hospital BRC, London, UK
| | - Markus S Ladewig
- Department of Ophthalmology, University Clinic Marburg - Campus Fulda, Fulda, Germany
| | - David Lagorce
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, Paris, France
| | - Meng-Chuan Lai
- Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Toronto, ON, Canada
| | - Pablo Lapunzina
- Institute of Medical and Molecular Genetics, Hospital Univ. La Paz, Madrid, Spain
| | - Bryan Laraway
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - David Lewis-Smith
- Translational and Clinical Research Institute, Henry Wellcome Building, Framlington Place, Newcastle University, Newcastle-Upon-Tyne NE14LP, UK
| | | | - Caterina Lucano
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, Paris, France
| | - Marzieh Majd
- Department of Psychiatry, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Mary L Marazita
- Center for Craniofacial and Dental Genetics, Department of Oral and Craniofacial Sciences, School of Dental Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | - Victor Martinez-Glez
- Center for Genomic Medicine, Parc Taulí Hospital Universitari, Institut d’Investigació i Innovació Parc Taulí (I3PT-CERCA), Sabadell, Spain
| | - Toby H McHenry
- Center for Craniofacial and Dental Genetics, Department of Oral and Craniofacial Sciences, School of Dental Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | - Melvin G McInnis
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | - Julie A McMurry
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Michaela Mihulová
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Caitlin E Millett
- Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA
| | - Philip B Mitchell
- Discipline of Psychiatry & Mental Health, School of Clinical Medicine, Faculty of Medicine & Health, University of New South Wales, Sydney, NSW, Australia
| | - Veronika Moslerová
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Kenji Narutomi
- Okinawa Prefectural Nanbu Medical Center & Children's Medical Center
| | - Shahrzad Nematollahi
- School of Physical and Occupational Therapy, McGill University, Montreal, Quebec, Canada
| | - Julian Nevado
- Institute of Medical and Molecular Genetics, Hospital Univ. La Paz, Madrid, Spain
| | - Andrew A Nierenberg
- Dauten Family Center for Bipolar Treatment Innovation, Massachusetts General Hospital, Boston, MA, USA
| | - Nikola Novák Čajbiková
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - John I Nurnberger
- Stark Neurosciences Research Institute, Departments of Psychiatry and Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA
| | | | - Daniel Olson
- Data Collaboration Center, Data Science, Critical Path Institute, Tucson, AZ, USA
| | - Abigail Ortiz
- Department of Psychiatry, University of Toronto, Toronto, ON, Canada
| | - Harry Pachajoa
- Centro de Investigaciones en Anomalías Congénitas y Enfermedades Raras (CIACER), Universidad Icesi, Cali, Colombia
| | - Guiomar Perez de Nanclares
- Molecular (epi) genetics lab, Bioaraba Health Research Institute, Araba University Hospital, Vitoria-Gasteiz, Spain
| | - Amy Peters
- Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA
| | - Tim Putman
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Christina K Rapp
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, German center for Lung research (DZL), Munich, Germany
| | - Ana Rath
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, Paris, France
| | - Justin Reese
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Lauren Rekerle
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Angharad M Roberts
- National Heart & Lung Institute & MRC London Institute of Medical Sciences, Imperial College London, London W12 0HS, UK
| | - Suzy Roy
- SNOMED International, London W2 6BD, UK
| | - Stephan J Sanders
- Department of Paediatrics, Institute of Developmental and Regenerative Medicine, University of Oxford, Oxford, UK
| | - Catharina Schuetz
- Universitätsklinikum Carl Gustav Carus, Medizinische Fakultät, TU, Dresden, Germany
| | - Eva C Schulte
- Institute of Psychiatric Phenomics and Genomics (IPPG), LMU University Hospital, LMU Munich, Munich, Germany
| | - Thomas G Schulze
- Department of Psychiatry and Behavioral Sciences, SUNY Upstate Medical University, Syracuse, NY, USA
| | - Martin Schwarz
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Katie Scott
- Department of Psychiatry, Dalhousie University, Halifax, NS, Canada
| | - Dominik Seelow
- Exploratory Diagnostic Sciences, Berliner Institut für Gesundheitsforschung - Charité, Berlin, Germany
| | - Berthold Seitz
- Department of Ophthalmology, Saarland University Medical Center UKS, Homburg/Saar, Germany
| | | | - Morgan N Similuk
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Eric S Simon
- Eisenberg Family Depression Center, University of Michigan, Ann Arbor, MI, USA
| | - Balwinder Singh
- Department of Psychiatry and Psychology, Mayo Clinic, Rochester, MN, USA
| | - Damian Smedley
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | | | - Jake T Smolinsky
- Human Genetics Institute of New Jersey, Rutgers University, Piscataway, NJ, USA
| | - Sarah Sperry
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | | | - Ray Stefancsik
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Robin Steinhaus
- Exploratory Diagnostic Sciences, Berliner Institut für Gesundheitsforschung - Charité, Berlin, Germany
| | - Rebecca Strawbridge
- Department of Psychological Medicine, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
| | | | - Polina Talapova
- Institute for Research and Health Policy Studies, Tufts Medicine, Boston, MA 2111, USA
| | | | - Pavel Tesner
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Rhys H Thomas
- Translational and Clinical Research Institute, Henry Wellcome Building, Framlington Place, Newcastle University, Newcastle-Upon-Tyne NE14LP, UK
| | - Audrey Thurm
- Neurodevelopmental and Behavioral Phenotyping Service, National Institute of Mental Health, Bethesda, MD, USA
| | - Marek Turnovec
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Marielle E van Gijn
- Department of Genetics, University Medical Center Groningen, Groningen, Netherlands
| | | | - Markéta Vlčková
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Anita Walden
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Kai Wang
- Chinese HPO Consortium, Beijing, China
| | - Ron Wapner
- Department of Obstetrics and Gynecology, Columbia University Irving Medical Center, New York, NY, USA
| | - James S Ware
- National Heart & Lung Institute & MRC London Institute of Medical Sciences, Imperial College London, London W12 0HS, UK
| | | | | | - Lisa D Wiggins
- National Center on Birth Defects and Developmental Disabilities, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Andrew E Williams
- Institute for Research and Health Policy Studies, Tufts Medicine, Boston, MA 2111, USA
| | - Chen Wu
- Chinese HPO Consortium, Beijing, China
| | - Margot J Wyrwoll
- Centre for Regenerative Medicine, Institute for Regeneration and Repair, Institute for Stem Cell Research, University of Edinburgh, Edinburgh, UK
| | - Hui Xiong
- Chinese HPO Consortium, Beijing, China
| | - Nefize Yalin
- Department of Psychological Medicine, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
| | - Yasunori Yamamoto
- Database Center for Life Science, Joint Support-Center for Data Science Research, Research Organization of Information and Systems, Japan
| | - Lakshmi N Yatham
- Department of Psychiatry, University of British Columbia, Vancouver, BC, Canada
| | - Anastasia K Yocum
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | - Allan H Young
- Psychological Medicine, Institute of Psychiatry, Psychology and Neuroscience, King's College London & South London and Maudsley NHS Foundation Trust, Bethlem Royal Hospital, Monks Orchard Road, Beckenham, Kent, London SE5 8AF, UK
| | - Zafer Yüksel
- Department of Human Genetics, Bioscientia Healthcare GmbH, Ingelheim, Germany
| | - Peter P Zandi
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Andreas Zankl
- Faculty of Medicine and Health, The University of Sydney, Camperdown, Australia
| | - Ignacio Zarante
- Institute of Human Genetics, Pontificia Universidad Javeriana, Bogotá, Colombia
| | - Miroslav Zvolský
- Institute of Health Information and Statistics of the Czech Republic, Prague, Czech Republic
| | - Sabrina Toro
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Leigh C Carmody
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Nomi L Harris
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Monica C Munoz-Torres
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Daniel Danis
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Christopher J Mungall
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | | | - Melissa A Haendel
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Peter N Robinson
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| |
Collapse
|
17
|
Shen L, Falk MJ, Gai X. MSeqDR Quick-Mitome (QM): Combining Phenotype-Guided Variant Interpretation and Machine Learning Classifiers to Aid Primary Mitochondrial Disease Genetic Diagnosis. Curr Protoc 2024; 4:e955. [PMID: 38284225 PMCID: PMC11046528 DOI: 10.1002/cpz1.955] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2024]
Abstract
The international Mitochondrial Disease Sequence Data Resource Consortium (MSeqDR) Quick-Mitome (QM) is a web-based platform enabling automated variant interpretation of whole-exome sequencing (WES) datasets for the genetic diagnosis of primary mitochondrial diseases (PMD). Designed specifically to address the unique dual genome nature of PMD etiologies, QM includes features for both nuclear and mitochondrial DNA (mtDNA) genome analysis. QM requires VCF variant lists, HPO ID clinical phenotypes, and pedigree files for multiple-sample VCF inputs. QM maps phenotypes to HPO terms before analysis. QM analysis requires 2 to 20 min for 100,000 variants on an 8-vCPU AWS server using Exomiser's "PASS_ONLY" mode for nuclear variants. QM ranks variants based on allele frequency, phenotype-gene association, functional impact, and inheritance mode. Variants are further annotated with multiple data sources such as OMIM, ClinVar, dbNSFP, gnoMAD, MITOMAP, and MSeqDR. In addition to standard Exomiser results, QM generates an Analysis Report and QM Integrated Report with add-on mtDNA-specific analyses, including haplogroup prediction with Phy-Mer, heteroplasmy calculation, and mvTool annotations. We developed the Mitochondrial Disease Variant (MDV) classifier using XGBoost to predict variant pathogenicity for PMD. The MDV classifier was trained on >120 features and performance benchmarking showed that it correctly classified >98% of nuclear gene variants as being pathogenic or benign, and predicted PMD-causing variants with 94% precision. The MSeqDR QM server is an open-access resource for phenotype-driven dual-genome analyses for PMD diagnosis by the global mitochondrial disease community. It is publicly available for non-commercial, non-clinical research use at https://mseqdr.org/quickmitome.php. © 2024 Wiley Periodicals LLC. Basic Protocol 1: Standardizing clinical phenotypes into human phenotype ontology (HPO) terms as the phenotype input for Quick-Mitome (QM) Basic Protocol 2: Prepare the pedigree input for multiple-sample VCF Basic Protocol 3: Quick-Mitome (QM) analysis Basic Protocol 4: Reviewing and understanding the QM Integrated Report and Analysis Report.
Collapse
Affiliation(s)
- Lishuang Shen
- Center for Personalized Medicine, Department of Pathology & Laboratory Medicine, Children’s Hospital Los Angeles, Los Angeles, California, USA
| | - Marni J. Falk
- Mitochondrial Medicine Frontier Program, Division of Human Genetics, Department of Pediatrics, The Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
- University of Pennsylvania Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Xiaowu Gai
- Center for Personalized Medicine, Department of Pathology & Laboratory Medicine, Children’s Hospital Los Angeles, Los Angeles, California, USA
- Keck School of Medicine, University of Southern California, California, USA
| |
Collapse
|
18
|
Patalano SD, Fuxman Bass P, Fuxman Bass JI. Transcription factors in the development and treatment of immune disorders. Transcription 2023:1-23. [PMID: 38100543 DOI: 10.1080/21541264.2023.2294623] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 12/08/2023] [Indexed: 12/17/2023] Open
Abstract
Immune function is highly controlled at the transcriptional level by the binding of transcription factors (TFs) to promoter and enhancer elements. Several TF families play major roles in immune gene expression, including NF-κB, STAT, IRF, AP-1, NRs, and NFAT, which trigger anti-pathogen responses, promote cell differentiation, and maintain immune system homeostasis. Aberrant expression, activation, or sequence of isoforms and variants of these TFs can result in autoimmune and inflammatory diseases as well as hematological and solid tumor cancers. For this reason, TFs have become attractive drug targets, even though most were previously deemed "undruggable" due to their lack of small molecule binding pockets and the presence of intrinsically disordered regions. However, several aspects of TF structure and function can be targeted for therapeutic intervention, such as ligand-binding domains, protein-protein interactions between TFs and with cofactors, TF-DNA binding, TF stability, upstream signaling pathways, and TF expression. In this review, we provide an overview of each of the important TF families, how they function in immunity, and some related diseases they are involved in. Additionally, we discuss the ways of targeting TFs with drugs along with recent research developments in these areas and their clinical applications, followed by the advantages and disadvantages of targeting TFs for the treatment of immune disorders.
Collapse
Affiliation(s)
- Samantha D Patalano
- Biology Department, Boston University, Boston, MA, USA
- Molecular Biology, Cellular Biology and Biochemistry Program, Boston University, Boston, MA, USA
| | - Paula Fuxman Bass
- Facultad de Medicina, Universidad de Buenos Aires, Ciudad Autónoma de Buenos Aires, Argentina
| | - Juan I Fuxman Bass
- Biology Department, Boston University, Boston, MA, USA
- Molecular Biology, Cellular Biology and Biochemistry Program, Boston University, Boston, MA, USA
- Bioinformatics Program, Boston University, Boston, MA, USA
| |
Collapse
|
19
|
Carmody LC, Gargano MA, Toro S, Vasilevsky NA, Adam MP, Blau H, Chan LE, Gomez-Andres D, Horvath R, Kraus ML, Ladewig MS, Lewis-Smith D, Lochmüller H, Matentzoglu NA, Munoz-Torres MC, Schuetz C, Seitz B, Similuk MN, Sparks TN, Strauss T, Swietlik EM, Thompson R, Zhang XA, Mungall CJ, Haendel MA, Robinson PN. The Medical Action Ontology: A tool for annotating and analyzing treatments and clinical management of human disease. MED 2023; 4:913-927.e3. [PMID: 37963467 PMCID: PMC10842845 DOI: 10.1016/j.medj.2023.10.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 08/31/2023] [Accepted: 10/14/2023] [Indexed: 11/16/2023]
Abstract
BACKGROUND Navigating the clinical literature to determine the optimal clinical management for rare diseases presents significant challenges. We introduce the Medical Action Ontology (MAxO), an ontology specifically designed to organize medical procedures, therapies, and interventions. METHODS MAxO incorporates logical structures that link MAxO terms to numerous other ontologies within the OBO Foundry. Term development involves a blend of manual and semi-automated processes. Additionally, we have generated annotations detailing diagnostic modalities for specific phenotypic abnormalities defined by the Human Phenotype Ontology (HPO). We introduce a web application, POET, that facilitates MAxO annotations for specific medical actions for diseases using the Mondo Disease Ontology. FINDINGS MAxO encompasses 1,757 terms spanning a wide range of biomedical domains, from human anatomy and investigations to the chemical and protein entities involved in biological processes. These terms annotate phenotypic features associated with specific disease (using HPO and Mondo). Presently, there are over 16,000 MAxO diagnostic annotations that target HPO terms. Through POET, we have created 413 MAxO annotations specifying treatments for 189 rare diseases. CONCLUSIONS MAxO offers a computational representation of treatments and other actions taken for the clinical management of patients. Its development is closely coupled to Mondo and HPO, broadening the scope of our computational modeling of diseases and phenotypic features. We invite the community to contribute disease annotations using POET (https://poet.jax.org/). MAxO is available under the open-source CC-BY 4.0 license (https://github.com/monarch-initiative/MAxO). FUNDING NHGRI 1U24HG011449-01A1 and NHGRI 5RM1HG010860-04.
Collapse
Affiliation(s)
- Leigh C Carmody
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | - Sabrina Toro
- University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | | | - Margaret P Adam
- University of Washington School of Medicine, Seattle, WA, USA
| | - Hannah Blau
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | - David Gomez-Andres
- Pediatric Neurology, Vall d'Hebron Institut de Recerca (VHIR), Hospital Universitari Vall d'Hebron, Vall d'Hebron Barcelona Hospital Campus, Passeig Vall d'Hebron 119-129, 08035 Barcelona, Spain
| | - Rita Horvath
- Department of Clinical Neurosciences, University of Cambridge, Robinson Way, Cambridge CB2 0PY, UK
| | - Megan L Kraus
- University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Markus S Ladewig
- Department of Ophthalmology, Klinikum Saarbrücken, Saarbrücken, Germany
| | - David Lewis-Smith
- Translational and Clinical Research Institute, Newcastle University, Newcastle upon Tyne NE2 4HH, UK
| | - Hanns Lochmüller
- Children's Hospital of Eastern Ontario Research Institute, Ottowa, Canada; Division of Neurology, Department of Medicine, The Ottawa Hospital, Ottawa, Canada; Brain and Mind Research Institute, University of Ottawa, Ottawa, Canada; Department of Neuropediatrics and Muscle Disorders, Medical Center - University of Freiburg, Faculty of Medicine, Freiburg, Germany; Centro Nacional de Análisis Genómico, Barcelona, Spain
| | | | | | - Catharina Schuetz
- Department of Pediatrics, Medizinische Fakultät Carl Gustav Carus, Technische Universität Dresden, 01307 Dresden, Germany
| | - Berthold Seitz
- Department of Ophthalmology, Saarland University Medical Center UKS, Homburg, Saar, Germany
| | - Morgan N Similuk
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Teresa N Sparks
- Department of Obstetrics, Gynecology, & Reproductive Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Timmy Strauss
- Department of Pediatrics, Medizinische Fakultät Carl Gustav Carus, Technische Universität Dresden, 01307 Dresden, Germany
| | - Emilia M Swietlik
- Department of Medicine, University of Cambridge, Heart and Lung Research Institute, Cambridge CB2 0BB, UK
| | - Rachel Thompson
- Children's Hospital of Eastern Ontario Research Institute, Ottowa, Canada
| | | | | | | | - Peter N Robinson
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
| |
Collapse
|
20
|
Izumi H, Demura M, Imai A, Ogawa R, Fukuchi M, Okubo T, Tabata T, Mori H, Yoshida T. Developmental synapse pathology triggered by maternal exposure to the herbicide glufosinate ammonium. Front Mol Neurosci 2023; 16:1298238. [PMID: 38098940 PMCID: PMC10720911 DOI: 10.3389/fnmol.2023.1298238] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 11/09/2023] [Indexed: 12/17/2023] Open
Abstract
Environmental and genetic factors influence synapse formation. Numerous animal experiments have revealed that pesticides, including herbicides, can disturb normal intracellular signals, gene expression, and individual animal behaviors. However, the mechanism underlying the adverse outcomes of pesticide exposure remains elusive. Herein, we investigated the effect of maternal exposure to the herbicide glufosinate ammonium (GLA) on offspring neuronal synapse formation in vitro. Cultured cerebral cortical neurons prepared from mouse embryos with maternal GLA exposure demonstrated impaired synapse formation induced by synaptic organizer neuroligin 1 (NLGN1)-coated beads. Conversely, the direct administration of GLA to the neuronal cultures exhibited negligible effect on the NLGN1-induced synapse formation. The comparison of the transcriptomes of cultured neurons from embryos treated with maternal GLA or vehicle and a subsequent bioinformatics analysis of differentially expressed genes (DEGs) identified "nervous system development," including "synapse," as the top-ranking process for downregulated DEGs in the GLA group. In addition, we detected lower densities of parvalbumin (Pvalb)-positive neurons at the postnatal developmental stage in the medial prefrontal cortex (mPFC) of offspring born to GLA-exposed dams. These results suggest that maternal GLA exposure induces synapse pathology, with alterations in the expression of genes that regulate synaptic development via an indirect pathway distinct from the effect of direct GLA action on neurons.
Collapse
Affiliation(s)
- Hironori Izumi
- Department of Molecular Neuroscience, Faculty of Medicine, University of Toyama, Toyama, Japan
- Research Center for Idling Brain Science, University of Toyama, Toyama, Japan
| | - Maina Demura
- Department of Molecular Neuroscience, Faculty of Medicine, University of Toyama, Toyama, Japan
| | - Ayako Imai
- Department of Molecular Neuroscience, Faculty of Medicine, University of Toyama, Toyama, Japan
- Research Center for Idling Brain Science, University of Toyama, Toyama, Japan
| | - Ryohei Ogawa
- Department of Radiology, Faculty of Medicine, University of Toyama, Toyama, Japan
| | - Mamoru Fukuchi
- Laboratory of Molecular Neuroscience, Faculty of Pharmacy, Takasaki University of Health and Welfare, Gunma, Japan
| | - Taisaku Okubo
- Laboratory for Biological Information Processing, Faculty of Engineering, University of Toyama, Toyama, Japan
| | - Toshihide Tabata
- Laboratory for Biological Information Processing, Faculty of Engineering, University of Toyama, Toyama, Japan
| | - Hisashi Mori
- Department of Molecular Neuroscience, Faculty of Medicine, University of Toyama, Toyama, Japan
- Research Center for Idling Brain Science, University of Toyama, Toyama, Japan
- Research Center for Pre-Disease Science, University of Toyama, Toyama, Japan
| | - Tomoyuki Yoshida
- Department of Molecular Neuroscience, Faculty of Medicine, University of Toyama, Toyama, Japan
- Research Center for Idling Brain Science, University of Toyama, Toyama, Japan
| |
Collapse
|
21
|
Sadeghi-Alavijeh O, Chan MMY, Moochhala SH, Howles S, Gale DP, Böckenhauer D. Rare variants in the sodium-dependent phosphate transporter gene SLC34A3 explain missing heritability of urinary stone disease. Kidney Int 2023; 104:975-984. [PMID: 37414395 DOI: 10.1016/j.kint.2023.06.019] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Revised: 05/10/2023] [Accepted: 06/15/2023] [Indexed: 07/08/2023]
Abstract
Urinary stone disease (USD) is a major health burden affecting over 10% of the United Kingdom population. While stone disease is associated with lifestyle, genetic factors also strongly contribute. Common genetic variants at multiple loci from genome-wide association studies account for 5% of the estimated 45% heritability of the disorder. Here, we investigated the extent to which rare genetic variation contributes to the unexplained heritability of USD. Among participants of the United Kingdom 100,000-genome project, 374 unrelated individuals were identified and assigned diagnostic codes indicative of USD. Whole genome gene-based rare variant testing and polygenic risk scoring against a control population of 24,930 ancestry-matched controls was performed. We observed (and replicated in an independent dataset) exome-wide significant enrichment of monoallelic rare, predicted damaging variants in the SLC34A3 gene for a sodium-dependent phosphate transporter that were present in 5% cases compared with 1.6% of controls. This gene was previously associated with autosomal recessive disease. The effect on USD risk of having a qualifying SLC34A3 variant was greater than that of a standard deviation increase in polygenic risk derived from GWAS. Addition of the rare qualifying variants in SLC34A3 to a linear model including polygenic score increased the liability-adjusted heritability from 5.1% to 14.2% in the discovery cohort. We conclude that rare variants in SLC34A3 represent an important genetic risk factor for USD, with effect size intermediate between the fully penetrant rare variants linked with Mendelian disorders and common variants associated with USD. Thus, our findings explain some of the heritability unexplained by prior common variant genome-wide association studies.
Collapse
Affiliation(s)
| | - Melanie M Y Chan
- Department of Renal Medicine, University College London, London, UK
| | | | - Sarah Howles
- Nuffield Department of Surgical Sciences, University of Oxford, Oxford, UK
| | - Daniel P Gale
- Department of Renal Medicine, University College London, London, UK.
| | | |
Collapse
|
22
|
O'Connell GC, Wang J, Smothers C. Donor white blood cell differential is the single largest determinant of whole blood gene expression patterns. Genomics 2023; 115:110708. [PMID: 37730167 PMCID: PMC10872590 DOI: 10.1016/j.ygeno.2023.110708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 08/18/2023] [Accepted: 09/17/2023] [Indexed: 09/22/2023]
Abstract
It has become widely accepted that sample cellular composition is a significant determinant of the gene expression patterns observed in any transcriptomic experiment performed with bulk tissue. Despite this, many investigations currently performed with whole blood do not experimentally account for possible inter-specimen differences in cellularity, and often assume that any observed gene expression differences are a result of true differences in nuclear transcription. In order to determine how confounding of an assumption this may be, in this study, we recruited a large cohort of human donors (n = 138) and used a combination of next generation sequencing and flow cytometry to quantify and compare the underlying contributions of variance in leukocyte counts versus variance in other biological factors to overall variance in whole blood transcript levels. Our results suggest that the combination of donor neutrophil and lymphocyte counts alone are the primary determinants of whole blood transcript levels for up to 75% of the protein-coding genes expressed in peripheral circulation, whereas the other factors such as age, sex, race, ethnicity, and common disease states have comparatively minimal influence. Broadly, this infers that a majority of gene expression differences observed in experiments performed with whole blood are driven by latent differences in leukocyte counts, and that cell count heterogeneity must be accounted for to meaningfully biologically interpret the results.
Collapse
Affiliation(s)
- Grant C O'Connell
- Molecular Biomarker Core, Case Western Reserve University, Cleveland, OH, USA; School of Nursing, Case Western Reserve University, Cleveland, OH, USA.
| | - Jing Wang
- Molecular Biomarker Core, Case Western Reserve University, Cleveland, OH, USA; School of Nursing, Case Western Reserve University, Cleveland, OH, USA
| | | |
Collapse
|
23
|
Pan Y, Li R, Li W, Lv L, Guan J, Zhou S. HPC-Atlas: Computationally Constructing A Comprehensive Atlas of Human Protein Complexes. GENOMICS, PROTEOMICS & BIOINFORMATICS 2023; 21:976-990. [PMID: 37730114 PMCID: PMC10928439 DOI: 10.1016/j.gpb.2023.05.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/19/2022] [Revised: 04/23/2023] [Accepted: 05/08/2023] [Indexed: 09/22/2023]
Abstract
A fundamental principle of biology is that proteins tend to form complexes to play important roles in the core functions of cells. For a complete understanding of human cellular functions, it is crucial to have a comprehensive atlas of human protein complexes. Unfortunately, we still lack such a comprehensive atlas of experimentally validated protein complexes, which prevents us from gaining a complete understanding of the compositions and functions of human protein complexes, as well as the underlying biological mechanisms. To fill this gap, we built Human Protein Complexes Atlas (HPC-Atlas), as far as we know, the most accurate and comprehensive atlas of human protein complexes available to date. We integrated two latest protein interaction networks, and developed a novel computational method to identify nearly 9000 protein complexes, including many previously uncharacterized complexes. Compared with the existing methods, our method achieved outstanding performance on both testing and independent datasets. Furthermore, with HPC-Atlas we identified 751 severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)-affected human protein complexes, and 456 multifunctional proteins that contain many potential moonlighting proteins. These results suggest that HPC-Atlas can serve as not only a computing framework to effectively identify biologically meaningful protein complexes by integrating multiple protein data sources, but also a valuable resource for exploring new biological findings. The HPC-Atlas webserver is freely available at http://www.yulpan.top/HPC-Atlas.
Collapse
Affiliation(s)
- Yuliang Pan
- Department of Computer Science and Technology, College of Electronic and Information Engineering, Tongji University, Shanghai 201804, China
| | - Ruiyi Li
- Translational Medical Center for Stem Cell Therapy, Shanghai East Hospital, School of Medicine, Tongji University, Shanghai 200120, China
| | - Wengen Li
- Department of Computer Science and Technology, College of Electronic and Information Engineering, Tongji University, Shanghai 201804, China
| | - Liuzhenghao Lv
- Department of Computer Science and Technology, College of Electronic and Information Engineering, Tongji University, Shanghai 201804, China
| | - Jihong Guan
- Department of Computer Science and Technology, College of Electronic and Information Engineering, Tongji University, Shanghai 201804, China.
| | - Shuigeng Zhou
- Shanghai Key Laboratory of Intelligent Information Processing, School of Computer Science, Fudan University, Shanghai 200433, China.
| |
Collapse
|
24
|
Carmody LC, Gargano MA, Toro S, Vasilevsky NA, Adam MP, Blau H, Chan LE, Gomez-Andres D, Horvath R, Kraus ML, Ladewig MS, Lewis-Smith D, Lochmüller H, Matentzoglu NA, Munoz-Torres MC, Schuetz C, Seitz B, Similuk MN, Sparks TN, Strauss T, Swietlik EM, Thompson R, Zhang XA, Mungall CJ, Haendel MA, Robinson PN. The Medical Action Ontology: A Tool for Annotating and Analyzing Treatments and Clinical Management of Human Disease. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.07.13.23292612. [PMID: 37503136 PMCID: PMC10370244 DOI: 10.1101/2023.07.13.23292612] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Navigating the vast landscape of clinical literature to find optimal treatments and management strategies can be a challenging task, especially for rare diseases. To address this task, we introduce the Medical Action Ontology (MAxO), the first ontology specifically designed to organize medical procedures, therapies, and interventions in a structured way. Currently, MAxO contains 1757 medical action terms added through a combination of manual and semi-automated processes. MAxO was developed with logical structures that make it compatible with several other ontologies within the Open Biological and Biomedical Ontologies (OBO) Foundry. These cover a wide range of biomedical domains, from human anatomy and investigations to the chemical and protein entities involved in biological processes. We have created a database of over 16000 annotations that describe diagnostic modalities for specific phenotypic abnormalities as defined by the Human Phenotype Ontology (HPO). Additionally, 413 annotations are provided for medical actions for 189 rare diseases. We have developed a web application called POET (https://poet.jax.org/) for the community to use to contribute MAxO annotations. MAxO provides a computational representation of treatments and other actions taken for the clinical management of patients. The development of MAxO is closely coupled to the Mondo Disease Ontology (Mondo) and the Human Phenotype Ontology (HPO) and expands the scope of our computational modeling of diseases and phenotypic features to include diagnostics and therapeutic actions. MAxO is available under the open-source CC-BY 4.0 license (https://github.com/monarch-initiative/MAxO).
Collapse
Affiliation(s)
- Leigh C Carmody
- The Jackson Laboratory for Genomic Medicine,Farmington,CT,United States
| | - Michael A Gargano
- The Jackson Laboratory for Genomic Medicine,Farmington,CT,United States
| | - Sabrina Toro
- University of Colorado Anschutz Medical Campus,Aurora,CO,United States
| | | | - Margaret P Adam
- University of Washington School of Medicine, Seattle, WA, United States
| | - Hannah Blau
- The Jackson Laboratory for Genomic Medicine,Farmington,CT,United States
| | | | - David Gomez-Andres
- Pediatric Neurology, Vall d'Hebron Institut de Recerca (VHIR), Hospital Universitari Vall d'Hebron, Vall d'Hebron Barcelona Hospital Campus., Passeig Vall d'Hebron 119-129, 08035 Barcelona, Spain
| | - Rita Horvath
- Department of Clinical Neurosciences, University of Cambridge, Robinson Way CB2 0PY, Cambridge UK
| | - Megan L Kraus
- University of Colorado Anschutz Medical Campus,Aurora,CO,United States
| | - Markus S Ladewig
- Department of Ophthalmology,Klinikum Saarbrücken,Saarbrücken,Germany
| | - David Lewis-Smith
- Translational and Clinical Research Institute, Newcastle University, Newcastle upon Tyne, NE2 4HH, United Kingdom
| | | | | | | | - Catharina Schuetz
- Department of Pediatrics, Medizinische Fakultät Carl Gustav Carus, Technische Universität Dresden, 01307 Dresden, Germany
| | - Berthold Seitz
- Department of Ophthalmology,Saarland University Hospital UKS,Homburg/Saar Germany
| | - Morgan N Similuk
- National Institute of Allergy and Infectious Diseases,National Institutes of Health,Bethesda,MD,United States
| | - Teresa N Sparks
- Department of Obstetrics, Gynecology, & Reproductive Sciences, University of California, San Francisco, San Francisco, CA 94143
| | - Timmy Strauss
- Department of Pediatrics, Medizinische Fakultät Carl Gustav Carus, Technische Universität Dresden, 01307 Dresden, Germany
| | - Emilia M Swietlik
- Department of Medicine, University of Cambridge, Heart and Lung Research Institute, CB2 0BB, Cambridge, UK
| | | | | | | | - Melissa A Haendel
- University of Colorado Anschutz Medical Campus,Aurora,CO,United States
| | - Peter N Robinson
- The Jackson Laboratory for Genomic Medicine,Farmington,CT,United States
| |
Collapse
|
25
|
Liu X, Gao L, Peng Y, Fang Z, Wang J. PheSom: a term frequency-based method for measuring human phenotype similarity on the basis of MeSH vocabulary. Front Genet 2023; 14:1185790. [PMID: 37496714 PMCID: PMC10366691 DOI: 10.3389/fgene.2023.1185790] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Accepted: 06/21/2023] [Indexed: 07/28/2023] Open
Abstract
Background: Phenotype similarity calculation should be used to help improve drug repurposing. In this study, based on the MeSH terms describing the phenotypes deposited in OMIM, we proposed a method, namely, PheSom (Phenotype Similarity On MeSH), to measure the similarity between phenotypes. PheSom counted the number of overlapping MeSH terms between two phenotypes and then took the weight of every MeSH term within each phenotype into account according to the term frequency-inverse document frequency (FIDC). Phenotype-related genes were used for the evaluation of our method. Results: A 7,739 × 7,739 similarity score matrix was finally obtained and the number of phenotype pairs was dramatically decreased with the increase of similarity score. Besides, the overlapping rates of phenotype-related genes were remarkably increased with the increase of similarity score between phenotypes, which supports the reliability of our method. Conclusion: We anticipate our method can be applied to identifying novel therapeutic methods for complex diseases.
Collapse
Affiliation(s)
- Xinhua Liu
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hangzhou Normal University, Hangzhou, Zhejiang, China
- School of Biomedical Engineering and Technology, Tianjin Medical University, Tianjin, China
| | - Ling Gao
- Department of Biochemistry and Molecular Biology, School of Basic Medical Sciences, Hangzhou Normal University, Hangzhou, Zhejiang, China
| | - Yonglin Peng
- Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, Shanghai, China
| | - Zhonghai Fang
- School of Biomedical Engineering and Technology, Tianjin Medical University, Tianjin, China
| | - Ju Wang
- School of Biomedical Engineering and Technology, Tianjin Medical University, Tianjin, China
| |
Collapse
|
26
|
Garza Flores A, Nordgren I, Pettersson M, Dias-Santagata D, Nilsson D, Hammarsjö A, Lindstrand A, Batkovskyte D, Wiggs J, Walton DS, Goldenberg P, Eisfeldt J, Lin AE, Lachman RS, Nishimura G, Grigelioniene G. Case report: Extending the spectrum of clinical and molecular findings in FOXC1 haploinsufficiency syndrome. Front Genet 2023; 14:1174046. [PMID: 37424725 PMCID: PMC10326848 DOI: 10.3389/fgene.2023.1174046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Accepted: 06/12/2023] [Indexed: 07/11/2023] Open
Abstract
FOXC1 is a ubiquitously expressed forkhead transcription factor that plays a critical role during early development. Germline pathogenic variants in FOXC1 are associated with anterior segment dysgenesis and Axenfeld-Rieger syndrome (ARS, #602482), an autosomal dominant condition with ophthalmologic anterior segment abnormalities, high risk for glaucoma and extraocular findings including distinctive facial features, as well as dental, skeletal, audiologic, and cardiac anomalies. De Hauwere syndrome is an ultrarare condition previously associated with 6p microdeletions and characterized by anterior segment dysgenesis, joint instability, short stature, hydrocephalus, and skeletal abnormalities. Here, we report clinical findings of two unrelated adult females with FOXC1 haploinsufficiency who have ARS and skeletal abnormalities. Final molecular diagnoses of both patients were achieved using genome sequencing. Patient 1 had a complex rearrangement involving a 4.9 kB deletion including FOXC1 coding region (Hg19; chr6:1,609,721-1,614,709), as well as a 7 MB inversion (Hg19; chr6:1,614,710-8,676,899) and a second deletion of 7.1 kb (Hg19; chr6:8,676,900-8,684,071). Patient 2 had a heterozygous single nucleotide deletion, resulting in a frameshift and a premature stop codon in FOXC1 (NM_001453.3): c.467del, p.(Pro156Argfs*25). Both individuals had moderate short stature, skeletal abnormalities, anterior segment dysgenesis, glaucoma, joint laxity, pes planovalgus, dental anomalies, hydrocephalus, distinctive facial features, and normal intelligence. Skeletal surveys revealed dolichospondyly, epiphyseal hypoplasia of femoral and humeral heads, dolichocephaly with frontal bossin gand gracile long bones. We conclude that haploinsufficiency of FOXC1 causes ARS and a broad spectrum of symptoms with variable expressivity that at its most severe end also includes a phenotype overlapping with De Hauwere syndrome.
Collapse
Affiliation(s)
- Alexandra Garza Flores
- Medical Genetics, Mass General for Children, Boston, MA, United States
- Genetics Department, Cook Children´s Hospital, Fort Worth, TX, United States
| | - Ida Nordgren
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
| | - Maria Pettersson
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
- Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Dora Dias-Santagata
- Department of Pathology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, United States
| | - Daniel Nilsson
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
- Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Anna Hammarsjö
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
- Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Anna Lindstrand
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
- Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Dominyka Batkovskyte
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
| | - Janey Wiggs
- Department of Ophthalmology, Ocular Genomics Institute, Mass Eye and Ear Infirmary, Harvard Medical School, Boston, MA, United States
| | - David S. Walton
- Department of Ophthalmology, Harvard Medical School, Boston, MA, United States
| | - Paula Goldenberg
- Medical Genetics, Mass General for Children, Boston, MA, United States
| | - Jesper Eisfeldt
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
- Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Angela E. Lin
- Medical Genetics, Mass General for Children, Boston, MA, United States
| | - Ralph S. Lachman
- Department of Radiological Sciences and Pediatrics, UCLA School of Medicine, Los Angeles, CA, United States
- Department of Radiological Sciences Stanford University, Stanford, CA, United States
- Orthopedic Department, International Skeletal Dysplasia Registry, UCLA School of Medicine, Los Angeles, CA, United States
| | - Gen Nishimura
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
- Department of Radiology, Musashino-Yowakai Hospital, Musashino, Tokyo, Japan
| | - Giedre Grigelioniene
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
- Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
- Endocrine Unit, Massachusetts General Hospital, Boston, MA, United States
| |
Collapse
|
27
|
Geraghty RM, Orr S, Olinger E, Neatu R, Barroso-Gil M, Mabillard H, Consortium GER, Wilson I, Sayer JA. Use of whole genome sequencing to determine the genetic basis of visceral myopathies including Prune Belly syndrome. JOURNAL OF RARE DISEASES (BERLIN, GERMANY) 2023; 2:9. [PMID: 37288276 PMCID: PMC10241726 DOI: 10.1007/s44162-023-00012-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Accepted: 05/21/2023] [Indexed: 06/09/2023]
Abstract
Objectives/aims The visceral myopathies (VM) are a group of disorders characterised by poorly contractile or acontractile smooth muscle. They manifest in both the GI and GU tracts, ranging from megacystis to Prune Belly syndrome. We aimed to apply a bespoke virtual genetic panel and describe novel variants associated with this condition using whole genome sequencing data within the Genomics England 100,000 Genomes Project. Methods We screened the Genomics England 100,000 Genomes Project rare diseases database for patients with VM-related phenotypes. These patients were screened for sequence variants and copy number variants (CNV) in ACTG2, ACTA2, MYH11, MYLK, LMOD1, CHRM3, MYL9, FLNA and KNCMA1 by analysing whole genome sequencing data. The identified variants were analysed using variant effect predictor online tool, and any possible segregation in other family members and novel missense mutations was modelled using in silico tools. The VM cohort was also used to perform a genome-wide variant burden test in order to identify confirm gene associations in this cohort. Results We identified 76 patients with phenotypes consistent with a diagnosis of VM. The range of presentations included megacystis/microcolon hypoperistalsis syndrome, Prune Belly syndrome and chronic intestinal pseudo-obstruction. Of the patients in whom we identified heterozygous ACTG2 variants, 7 had likely pathogenic variants including 1 novel likely pathogenic allele. There were 4 patients in whom we identified a heterozygous MYH11 variant of uncertain significance which leads to a frameshift and a predicted protein elongation. We identified one family in whom we found a heterozygous variant of uncertain significance in KCNMA1 which in silico models predicted to be disease causing and may explain the VM phenotype seen. We did not find any CNV changes in known genes leading to VM-related disease phenotypes. In this phenotype selected cohort, ACTG2 is the largest monogenic cause of VM-related disease accounting for 9% of the cohort, supported by a variant burden test approach, which identified ACTG2 variants as the largest contributor to VM-related phenotypes. Conclusions VM are a group of disorders that are not easily classified and may be given different diagnostic labels depending on their phenotype. Molecular genetic analysis of these patients is valuable as it allows precise diagnosis and aids understanding of the underlying disease manifestations. We identified ACTG2 as the most frequent genetic cause of VM. We recommend a nomenclature change to 'autosomal dominant ACTG2 visceral myopathy' for patients with pathogenic variants in ACTG2 and associated VM phenotypes. Supplementary Information The online version contains supplementary material available at 10.1007/s44162-023-00012-z.
Collapse
Affiliation(s)
- Robert M. Geraghty
- Renal Services, The Newcastle Upon Tyne Hospitals NHS Foundation Trust, Freeman Road, Newcastle Upon Tyne, NE7 7DN UK
- Faculty of Medical Sciences, Translational and Clinical Institute, Newcastle University, Central Parkway, Newcastle Upon Tyne, NE1 3BZ UK
| | - Sarah Orr
- Faculty of Medical Sciences, Translational and Clinical Institute, Newcastle University, Central Parkway, Newcastle Upon Tyne, NE1 3BZ UK
| | - Eric Olinger
- Faculty of Medical Sciences, Translational and Clinical Institute, Newcastle University, Central Parkway, Newcastle Upon Tyne, NE1 3BZ UK
| | - Ruxandra Neatu
- Faculty of Medical Sciences, Translational and Clinical Institute, Newcastle University, Central Parkway, Newcastle Upon Tyne, NE1 3BZ UK
| | - Miguel Barroso-Gil
- Faculty of Medical Sciences, Translational and Clinical Institute, Newcastle University, Central Parkway, Newcastle Upon Tyne, NE1 3BZ UK
| | - Holly Mabillard
- Faculty of Medical Sciences, Translational and Clinical Institute, Newcastle University, Central Parkway, Newcastle Upon Tyne, NE1 3BZ UK
| | - Genomics England Research Consortium
- Renal Services, The Newcastle Upon Tyne Hospitals NHS Foundation Trust, Freeman Road, Newcastle Upon Tyne, NE7 7DN UK
- Faculty of Medical Sciences, Translational and Clinical Institute, Newcastle University, Central Parkway, Newcastle Upon Tyne, NE1 3BZ UK
- Faculty of Medical Sciences, Biosciences Institute, Newcastle University, Central Parkway, Newcastle Upon Tyne, NE1 3BZ UK
- National Institute for Health Research Newcastle Biomedical Research Centre, Newcastle Upon Tyne, NE4 5PL UK
| | - Ian Wilson
- Faculty of Medical Sciences, Biosciences Institute, Newcastle University, Central Parkway, Newcastle Upon Tyne, NE1 3BZ UK
| | - John A. Sayer
- Renal Services, The Newcastle Upon Tyne Hospitals NHS Foundation Trust, Freeman Road, Newcastle Upon Tyne, NE7 7DN UK
- Faculty of Medical Sciences, Translational and Clinical Institute, Newcastle University, Central Parkway, Newcastle Upon Tyne, NE1 3BZ UK
- National Institute for Health Research Newcastle Biomedical Research Centre, Newcastle Upon Tyne, NE4 5PL UK
| |
Collapse
|
28
|
Hazari Y, Urra H, Garcia Lopez VA, Diaz J, Tamburini G, Milani M, Pihan P, Durand S, Aprahamia F, Baxter R, Huang M, Dong XC, Vihinen H, Batista-Gonzalez A, Godoy P, Criollo A, Ratziu V, Foufelle F, Hengstler JG, Jokitalo E, Bailly-Maitre B, Maiers JL, Plate L, Kroemer G, Hetz C. The endoplasmic reticulum stress sensor IRE1 regulates collagen secretion through the enforcement of the proteostasis factor P4HB/PDIA1 contributing to liver damage and fibrosis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.02.538835. [PMID: 37205565 PMCID: PMC10187203 DOI: 10.1101/2023.05.02.538835] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Collagen is one the most abundant proteins and the main cargo of the secretory pathway, contributing to hepatic fibrosis and cirrhosis due to excessive deposition of extracellular matrix. Here we investigated the possible contribution of the unfolded protein response, the main adaptive pathway that monitors and adjusts the protein production capacity at the endoplasmic reticulum, to collagen biogenesis and liver disease. Genetic ablation of the ER stress sensor IRE1 reduced liver damage and diminished collagen deposition in models of liver fibrosis triggered by carbon tetrachloride (CCl 4 ) administration or by high fat diet. Proteomic and transcriptomic profiling identified the prolyl 4-hydroxylase (P4HB, also known as PDIA1), which is known to be critical for collagen maturation, as a major IRE1-induced gene. Cell culture studies demonstrated that IRE1 deficiency results in collagen retention at the ER and altered secretion, a phenotype rescued by P4HB overexpression. Taken together, our results collectively establish a role of the IRE1/P4HB axis in the regulation of collagen production and its significance in the pathogenesis of various disease states.
Collapse
|
29
|
Daniali M, Galer PD, Lewis-Smith D, Parthasarathy S, Kim E, Salvucci DD, Miller JM, Haag S, Helbig I. Enriching representation learning using 53 million patient notes through human phenotype ontology embedding. Artif Intell Med 2023; 139:102523. [PMID: 37100502 PMCID: PMC10782859 DOI: 10.1016/j.artmed.2023.102523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 02/17/2023] [Accepted: 02/23/2023] [Indexed: 03/04/2023]
Abstract
The Human Phenotype Ontology (HPO) is a dictionary of >15,000 clinical phenotypic terms with defined semantic relationships, developed to standardize phenotypic analysis. Over the last decade, the HPO has been used to accelerate the implementation of precision medicine into clinical practice. In addition, recent research in representation learning, specifically in graph embedding, has led to notable progress in automated prediction via learned features. Here, we present a novel approach to phenotype representation by incorporating phenotypic frequencies based on 53 million full-text health care notes from >1.5 million individuals. We demonstrate the efficacy of our proposed phenotype embedding technique by comparing our work to existing phenotypic similarity-measuring methods. Using phenotype frequencies in our embedding technique, we are able to identify phenotypic similarities that surpass current computational models. Furthermore, our embedding technique exhibits a high degree of agreement with domain experts' judgment. By transforming complex and multidimensional phenotypes from the HPO format into vectors, our proposed method enables efficient representation of these phenotypes for downstream tasks that require deep phenotyping. This is demonstrated in a patient similarity analysis and can further be applied to disease trajectory and risk prediction.
Collapse
Affiliation(s)
- Maryam Daniali
- Department of Computer Science, Drexel University, Philadelphia, PA, USA; Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Peter D Galer
- Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA; Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, USA; The Epilepsy Neuro Genetics Initiative (ENGIN), Children's Hospital of Philadelphia, Philadelphia, PA, USA; Center for Neuroengineering and Therapeutics, University of Pennsylvania, Philadelphia, PA, USA
| | - David Lewis-Smith
- Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA; Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, USA; The Epilepsy Neuro Genetics Initiative (ENGIN), Children's Hospital of Philadelphia, Philadelphia, PA, USA; Translational and Clinical Research Institute, Newcastle University, Newcastle-upon-Tyne, UK; Department of Clinical Neurosciences, Royal Victoria Infirmary, Newcastle-upon-Tyne, UK
| | - Shridhar Parthasarathy
- Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA; Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, USA; The Epilepsy Neuro Genetics Initiative (ENGIN), Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Edward Kim
- Department of Computer Science, Drexel University, Philadelphia, PA, USA
| | - Dario D Salvucci
- Department of Computer Science, Drexel University, Philadelphia, PA, USA
| | - Jeffrey M Miller
- Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Scott Haag
- Department of Computer Science, Drexel University, Philadelphia, PA, USA; Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Ingo Helbig
- Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA; Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, USA; The Epilepsy Neuro Genetics Initiative (ENGIN), Children's Hospital of Philadelphia, Philadelphia, PA, USA; Department of Neurology, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, USA.
| |
Collapse
|
30
|
Wright CF, Campbell P, Eberhardt RY, Aitken S, Perrett D, Brent S, Danecek P, Gardner EJ, Chundru VK, Lindsay SJ, Andrews K, Hampstead J, Kaplanis J, Samocha KE, Middleton A, Foreman J, Hobson RJ, Parker MJ, Martin HC, FitzPatrick DR, Hurles ME, Firth HV. Genomic Diagnosis of Rare Pediatric Disease in the United Kingdom and Ireland. N Engl J Med 2023; 388:1559-1571. [PMID: 37043637 PMCID: PMC7614484 DOI: 10.1056/nejmoa2209046] [Citation(s) in RCA: 82] [Impact Index Per Article: 41.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 04/14/2023]
Abstract
BACKGROUND Pediatric disorders include a range of highly penetrant, genetically heterogeneous conditions amenable to genomewide diagnostic approaches. Finding a molecular diagnosis is challenging but can have profound lifelong benefits. METHODS We conducted a large-scale sequencing study involving more than 13,500 families with probands with severe, probably monogenic, difficult-to-diagnose developmental disorders from 24 regional genetics services in the United Kingdom and Ireland. Standardized phenotypic data were collected, and exome sequencing and microarray analyses were performed to investigate novel genetic causes. We developed an iterative variant analysis pipeline and reported candidate variants to clinical teams for validation and diagnostic interpretation to inform communication with families. Multiple regression analyses were performed to evaluate factors affecting the probability of diagnosis. RESULTS A total of 13,449 probands were included in the analyses. On average, we reported 1.0 candidate variant per parent-offspring trio and 2.5 variants per singleton proband. Using clinical and computational approaches to variant classification, we made a diagnosis in approximately 41% of probands (5502 of 13,449). Of 3599 probands in trios who received a diagnosis by clinical assertion, approximately 76% had a pathogenic de novo variant. Another 22% of probands (2997 of 13,449) had variants of uncertain significance in genes that were strongly linked to monogenic developmental disorders. Recruitment in a parent-offspring trio had the largest effect on the probability of diagnosis (odds ratio, 4.70; 95% confidence interval [CI], 4.16 to 5.31). Probands were less likely to receive a diagnosis if they were born extremely prematurely (i.e., 22 to 27 weeks' gestation; odds ratio, 0.39; 95% CI, 0.22 to 0.68), had in utero exposure to antiepileptic medications (odds ratio, 0.44; 95% CI, 0.29 to 0.67), had mothers with diabetes (odds ratio, 0.52; 95% CI, 0.41 to 0.67), or were of African ancestry (odds ratio, 0.51; 95% CI, 0.31 to 0.78). CONCLUSIONS Among probands with severe, probably monogenic, difficult-to-diagnose developmental disorders, multimodal analysis of genomewide data had good diagnostic power, even after previous attempts at diagnosis. (Funded by the Health Innovation Challenge Fund and Wellcome Sanger Institute.).
Collapse
Affiliation(s)
- Caroline F. Wright
- Department of Clinical and Biomedical Sciences, University of Exeter Medical School, RILD Building, Royal Devon & Exeter Hospital, Barrack Road, Exeter UK, EX2 5DW
| | - Patrick Campbell
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
- Cambridge University Hospitals Foundation Trust, Addenbrooke’s Hospital, Cambridge UK, CB2 0QQ
| | - Ruth Y. Eberhardt
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
| | - Stuart Aitken
- MRC Human Genetics Unit, Institute of Genetic and Cancer, University of Edinburgh, Edinburgh UK, EH4 2XU
| | - Daniel Perrett
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SD
| | - Simon Brent
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SD
| | - Petr Danecek
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
| | - Eugene J. Gardner
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
| | - V. Kartik Chundru
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
| | - Sarah J. Lindsay
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
| | - Katrina Andrews
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
| | - Juliet Hampstead
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
| | - Joanna Kaplanis
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
| | - Kaitlin E. Samocha
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
| | - Anna Middleton
- Wellcome Connecting Science, Wellcome Genome Campus, Hinxton, Cambridge, UK, CB10 1SA
| | - Julia Foreman
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SD
| | - Rachel J. Hobson
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
| | - Michael J. Parker
- Wellcome Centre for Ethics and Humanities/Ethox Centre, Oxford Population Health, University of Oxford, Big Data Institute, Old Road Campus, Oxford, UK, OX3 7LF
| | - Hilary C. Martin
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
| | - David R. FitzPatrick
- MRC Human Genetics Unit, Institute of Genetic and Cancer, University of Edinburgh, Edinburgh UK, EH4 2XU
| | - Matthew E. Hurles
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
| | - Helen V. Firth
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge UK, CB10 1SA
- Cambridge University Hospitals Foundation Trust, Addenbrooke’s Hospital, Cambridge UK, CB2 0QQ
| |
Collapse
|
31
|
Sanders LM, Scott RT, Yang JH, Qutub AA, Garcia Martin H, Berrios DC, Hastings JJA, Rask J, Mackintosh G, Hoarfrost AL, Chalk S, Kalantari J, Khezeli K, Antonsen EL, Babdor J, Barker R, Baranzini SE, Beheshti A, Delgado-Aparicio GM, Glicksberg BS, Greene CS, Haendel M, Hamid AA, Heller P, Jamieson D, Jarvis KJ, Komarova SV, Komorowski M, Kothiyal P, Mahabal A, Manor U, Mason CE, Matar M, Mias GI, Miller J, Myers JG, Nelson C, Oribello J, Park SM, Parsons-Wingerter P, Prabhu RK, Reynolds RJ, Saravia-Butler A, Saria S, Sawyer A, Singh NK, Snyder M, Soboczenski F, Soman K, Theriot CA, Van Valen D, Venkateswaran K, Warren L, Worthey L, Zitnik M, Costes SV. Biological research and self-driving labs in deep space supported by artificial intelligence. NAT MACH INTELL 2023. [DOI: 10.1038/s42256-023-00618-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/28/2023]
|
32
|
Wang X, Gan M, Dong X, Lu Y, Zhou W. An Integrated Pipeline for Trio-Rapid Genome Sequencing in Critically Ill Infants. Curr Protoc 2023; 3:e706. [PMID: 36971344 DOI: 10.1002/cpz1.706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/29/2023]
Abstract
Trio-rapid genome sequencing (trio-rGS) can assist the genetic diagnosis of critically ill infants given its ability to detect a broad range of pathogenic variants, as well as microbes, simultaneously with high efficiency. To achieve more comprehensive clinical diagnoses, it is essential to propose a recommended protocol in clinical practice. Here, we introduced an integrated pipeline to detect germline variants and microorganisms simultaneously from trio-RGS in critically ill infants, which provides step-by-step criteria for the semi-automatic processing procedures. With this pipeline in clinical application, only 1 ml of peripheral blood is needed for clinicians to provide both genetic and infectious causal information to a patient. The establishment and clinical practice of the method is of great significance for further mining of high-throughput sequencing data and for assisting clinicians in promoting diagnosis efficiency and accuracy. © 2023 Wiley Periodicals LLC. Basic Protocol 1: Experimental pipeline for rapid whole-genome sequencing for the simultaneous detection of germline variants and microorganisms Basic Protocol 2: Computational pipeline for rapid whole-genome sequencing for the simultaneous detection of germline variants and microorganisms.
Collapse
Affiliation(s)
- Xiao Wang
- Center for Molecular Medicine, Children's Hospital of Fudan University, National Children's Medical Center, Shanghai, China
| | - Mingyu Gan
- Center for Molecular Medicine, Children's Hospital of Fudan University, National Children's Medical Center, Shanghai, China
| | - Xinran Dong
- Center for Molecular Medicine, Children's Hospital of Fudan University, National Children's Medical Center, Shanghai, China
| | - Yulan Lu
- Center for Molecular Medicine, Children's Hospital of Fudan University, National Children's Medical Center, Shanghai, China
- Center for Big Data in Clinical Research, Children's Hospital of Fudan University, National Children's Medical Center, Shanghai, China
| | - Wenhao Zhou
- Center for Molecular Medicine, Children's Hospital of Fudan University, National Children's Medical Center, Shanghai, China
| |
Collapse
|
33
|
Hier DB, Yelugam R, Carrithers MD, Wunsch DC. The visualization of Orphadata neurology phenotypes. Front Digit Health 2023; 5:1064936. [PMID: 36778102 PMCID: PMC9911440 DOI: 10.3389/fdgth.2023.1064936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Accepted: 01/10/2023] [Indexed: 01/28/2023] Open
Abstract
Disease phenotypes are characterized by signs (what a physician observes during the examination of a patient) and symptoms (the complaints of a patient to a physician). Large repositories of disease phenotypes are accessible through the Online Mendelian Inheritance of Man, Human Phenotype Ontology, and Orphadata initiatives. Many of the diseases in these datasets are neurologic. For each repository, the phenotype of neurologic disease is represented as a list of concepts of variable length where the concepts are selected from a restricted ontology. Visualizations of these concept lists are not provided. We address this limitation by using subsumption to reduce the number of descriptive features from 2,946 classes into thirty superclasses. Phenotype feature lists of variable lengths were converted into fixed-length vectors. Phenotype vectors were aggregated into matrices and visualized as heat maps that allowed side-by-side disease comparisons. Individual diseases (representing a row in the matrix) were visualized as word clouds. We illustrate the utility of this approach by visualizing the neuro-phenotypes of 32 dystonic diseases from Orphadata. Subsumption can collapse phenotype features into superclasses, phenotype lists can be vectorized, and phenotypes vectors can be visualized as heat maps and word clouds.
Collapse
Affiliation(s)
- Daniel B Hier
- Applied Computational Intelligence Laboratory, Department of Electrical & Computer Engineering, Missouri University of Science & Technology, Rolla, MO, United States.,Department of Neurology and Rehabilitation, University of Illinois at Chicago, Chicago, IL, United States
| | - Raghu Yelugam
- Applied Computational Intelligence Laboratory, Department of Electrical & Computer Engineering, Missouri University of Science & Technology, Rolla, MO, United States
| | - Michael D Carrithers
- Department of Neurology and Rehabilitation, University of Illinois at Chicago, Chicago, IL, United States
| | - Donald C Wunsch
- National Institute of Diabetes and Digestive and Kidney Diseases, Liver Diseases Branch, Bethesda, MD, United States
| |
Collapse
|
34
|
Paredes-Fuentes AJ, Oliva C, Urreizti R, Yubero D, Artuch R. Laboratory testing for mitochondrial diseases: biomarkers for diagnosis and follow-up. Crit Rev Clin Lab Sci 2023; 60:270-289. [PMID: 36694353 DOI: 10.1080/10408363.2023.2166013] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
The currently available biomarkers generally lack the specificity and sensitivity needed for the diagnosis and follow-up of patients with mitochondrial diseases (MDs). In this group of rare genetic disorders (mutations in approximately 350 genes associated with MDs), all clinical presentations, ages of disease onset and inheritance types are possible. Blood, urine, and cerebrospinal fluid surrogates are well-established biomarkers that are used in clinical practice to assess MD. One of the main challenges is validating specific and sensitive biomarkers for the diagnosis of disease and prediction of disease progression. Profiling of lactate, amino acids, organic acids, and acylcarnitine species is routinely conducted to assess MD patients. New biomarkers, including some proteins and circulating cell-free mitochondrial DNA, with increased diagnostic specificity have been identified in the last decade and have been proposed as potentially useful in the assessment of clinical outcomes. Despite these advances, even these new biomarkers are not sufficiently specific and sensitive to assess MD progression, and new biomarkers that indicate MD progression are urgently needed to monitor the success of novel therapeutic strategies. In this report, we review the mitochondrial biomarkers that are currently analyzed in clinical laboratories, new biomarkers, an overview of the most common laboratory diagnostic techniques, and future directions regarding targeted versus untargeted metabolomic and genomic approaches in the clinical laboratory setting. Brief descriptions of the current methodologies are also provided.
Collapse
Affiliation(s)
- Abraham J Paredes-Fuentes
- Division of Inborn Errors of Metabolism-IBC, Biochemistry and Molecular Genetics Department, Hospital Clínic de Barcelona, Barcelona, Spain
| | - Clara Oliva
- Clinical Biochemistry Department, Institut de Recerca Sant Joan de Déu, Hospital Sant Joan de Déu, Barcelona, Spain
| | - Roser Urreizti
- Clinical Biochemistry Department, Institut de Recerca Sant Joan de Déu, Hospital Sant Joan de Déu, Barcelona, Spain.,Biomedical Network Research Centre on Rare Diseases (CIBERER), Instituto de Salud Carlos III, Madrid, Spain
| | - Delia Yubero
- Biomedical Network Research Centre on Rare Diseases (CIBERER), Instituto de Salud Carlos III, Madrid, Spain.,Department of Genetic and Molecular Medicine-IPER, Institut de Recerca Sant Joan de Déu, Barcelona, Spain
| | - Rafael Artuch
- Clinical Biochemistry Department, Institut de Recerca Sant Joan de Déu, Hospital Sant Joan de Déu, Barcelona, Spain.,Biomedical Network Research Centre on Rare Diseases (CIBERER), Instituto de Salud Carlos III, Madrid, Spain
| |
Collapse
|
35
|
Starr AL, Gokhman D, Fraser HB. Accounting for cis-regulatory constraint prioritizes genes likely to affect species-specific traits. Genome Biol 2023; 24:11. [PMID: 36658652 PMCID: PMC9850818 DOI: 10.1186/s13059-023-02846-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Accepted: 01/04/2023] [Indexed: 01/20/2023] Open
Abstract
Measuring allele-specific expression in interspecies hybrids is a powerful way to detect cis-regulatory changes underlying adaptation. However, it remains difficult to identify genes most likely to explain species-specific traits. Here, we outline a simple strategy that leverages population-scale allele-specific RNA-seq data to identify genes that show constrained cis-regulation within species yet show divergence between species. Applying this strategy to data from human-chimpanzee hybrid cortical organoids, we identify signatures of lineage-specific selection on genes related to saccharide metabolism, neurodegeneration, and primary cilia. We also highlight cis-regulatory divergence in CUX1 and EDNRB that may shape the trajectory of human brain development.
Collapse
Affiliation(s)
| | - David Gokhman
- Department of Biology, Stanford University, Stanford, CA, USA
- Present Address: Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
| | - Hunter B Fraser
- Department of Biology, Stanford University, Stanford, CA, USA.
| |
Collapse
|
36
|
Reese JT, Blau H, Casiraghi E, Bergquist T, Loomba JJ, Callahan TJ, Laraway B, Antonescu C, Coleman B, Gargano M, Wilkins KJ, Cappelletti L, Fontana T, Ammar N, Antony B, Murali TM, Caufield JH, Karlebach G, McMurry JA, Williams A, Moffitt R, Banerjee J, Solomonides AE, Davis H, Kostka K, Valentini G, Sahner D, Chute CG, Madlock-Brown C, Haendel MA, Robinson PN. Generalisable long COVID subtypes: findings from the NIH N3C and RECOVER programmes. EBioMedicine 2023; 87:104413. [PMID: 36563487 PMCID: PMC9769411 DOI: 10.1016/j.ebiom.2022.104413] [Citation(s) in RCA: 75] [Impact Index Per Article: 37.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2022] [Revised: 11/23/2022] [Accepted: 11/29/2022] [Indexed: 12/24/2022] Open
Abstract
BACKGROUND Stratification of patients with post-acute sequelae of SARS-CoV-2 infection (PASC, or long COVID) would allow precision clinical management strategies. However, long COVID is incompletely understood and characterised by a wide range of manifestations that are difficult to analyse computationally. Additionally, the generalisability of machine learning classification of COVID-19 clinical outcomes has rarely been tested. METHODS We present a method for computationally modelling PASC phenotype data based on electronic healthcare records (EHRs) and for assessing pairwise phenotypic similarity between patients using semantic similarity. Our approach defines a nonlinear similarity function that maps from a feature space of phenotypic abnormalities to a matrix of pairwise patient similarity that can be clustered using unsupervised machine learning. FINDINGS We found six clusters of PASC patients, each with distinct profiles of phenotypic abnormalities, including clusters with distinct pulmonary, neuropsychiatric, and cardiovascular abnormalities, and a cluster associated with broad, severe manifestations and increased mortality. There was significant association of cluster membership with a range of pre-existing conditions and measures of severity during acute COVID-19. We assigned new patients from other healthcare centres to clusters by maximum semantic similarity to the original patients, and showed that the clusters were generalisable across different hospital systems. The increased mortality rate originally identified in one cluster was consistently observed in patients assigned to that cluster in other hospital systems. INTERPRETATION Semantic phenotypic clustering provides a foundation for assigning patients to stratified subgroups for natural history or therapy studies on PASC. FUNDING NIH (TR002306/OT2HL161847-01/OD011883/HG010860), U.S.D.O.E. (DE-AC02-05CH11231), Donald A. Roux Family Fund at Jackson Laboratory, Marsico Family at CU Anschutz.
Collapse
Affiliation(s)
- Justin T Reese
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Hannah Blau
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, USA
| | - Elena Casiraghi
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA; AnacletoLab, Dipartimento di Informatica, Università Degli Studi di Milano, Milan, Italy
| | | | - Johanna J Loomba
- The Integrated Translational Health Research Institute of Virginia (iTHRIV), University of Virginia, Charlottesville, VA, USA
| | - Tiffany J Callahan
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA
| | - Bryan Laraway
- Departments of Biomedical Informatics and Pediatrics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | | | - Ben Coleman
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, USA
| | - Michael Gargano
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, USA
| | - Kenneth J Wilkins
- Biostatistics Program, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Luca Cappelletti
- AnacletoLab, Dipartimento di Informatica, Università Degli Studi di Milano, Milan, Italy
| | - Tommaso Fontana
- AnacletoLab, Dipartimento di Informatica, Università Degli Studi di Milano, Milan, Italy
| | - Nariman Ammar
- Health Science Center, University of Tennessee, Memphis, TN, USA
| | - Blessy Antony
- Department of Computer Science, Virginia Tech, Blacksburg, VA, USA
| | - T M Murali
- Department of Computer Science, Virginia Tech, Blacksburg, VA, USA
| | - J Harry Caufield
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Guy Karlebach
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, USA
| | - Julie A McMurry
- Departments of Biomedical Informatics and Pediatrics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Andrew Williams
- Tufts Medical Center Clinical and Translational Science Institute, Tufts Medical Center, Boston, MA, USA; Tufts University School of Medicine, Institute for Clinical Research and Health Policy Studies, Boston, MA, USA; Northeastern University, OHDSI Center at the Roux Institute, Boston, MA, USA
| | - Richard Moffitt
- Department of Biomedical Informatics and Stony Brook Cancer Center, Stony Brook University, Stony Brook, NY, USA
| | | | | | | | - Kristin Kostka
- Northeastern University, OHDSI Center at the Roux Institute, Boston, MA, USA
| | - Giorgio Valentini
- AnacletoLab, Dipartimento di Informatica, Università Degli Studi di Milano, Milan, Italy
| | | | - Christopher G Chute
- Schools of Medicine, Public Health and Nursing, Johns Hopkins University, Baltimore, MD, USA
| | | | - Melissa A Haendel
- Departments of Biomedical Informatics and Pediatrics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Peter N Robinson
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT, USA; Institute for Systems Genomics, University of Connecticut, Farmington, CT, USA.
| |
Collapse
|
37
|
Rojnueangnit K, Anthanont P, Khetkham T, Puttamanee S, Ittiwut C. Genetic diagnosis for adult patients at a genetic clinic. Cold Spring Harb Mol Case Stud 2022; 8:a006235. [PMID: 36265913 PMCID: PMC9808555 DOI: 10.1101/mcs.a006235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2022] [Accepted: 10/04/2022] [Indexed: 01/31/2023] Open
Abstract
Clinical utility of genetic testing has rapidly increased in the past decade to identify the definitive diagnosis, etiology, and specific management. The majority of patients receiving testing are children. There are several barriers for genetic tests in adult patients; barriers may arise from either patients or clinicians. Our study aims to realize the detection rate and the benefits of genetic tests in adults. We conducted a prospective study of 10 adult patients who were referred to a genetic clinic. Exome sequencing (ES) was pursued in all cases, and chromosomal microarray (CMA) was performed for six cases. Our result is impressive; six cases (60%) received likely pathogenic and pathogenic variants. Four definitive diagnosis cases had known pathogenic variants in KCNJ2, TGFBR1, SCN1A, and FBN1, whereas another two cases revealed novel likely pathogenic and pathogenic variants in GNB1 and DNAH9. Our study demonstrates the success in genetic diagnosis in adult patients: four cases with definitive, two cases with possible, and one case with partial diagnosis. The advantage of diagnosis is beyond obtaining the diagnosis itself, but also relieving any doubt for the patient regarding any previous questionable diagnosis, guide for management, and recurrence risk in their children or family members. Therefore, this supports the value of genetic testing in adult patients.
Collapse
Affiliation(s)
- Kitiwan Rojnueangnit
- Department of Pediatrics, Faculty of Medicine, Thammasat University, Pathumthani, 12120 Thailand
| | - Pimjai Anthanont
- Department of Medicine, Faculty of Medicine, Thammasat University, Pathumthani, 12120 Thailand
| | - Thanitchet Khetkham
- Division of Forensic Medicine, Thammasat University Hospital, 12120 Thailand
| | - Sukita Puttamanee
- Faculty of Medicine, Thammasat University, Pathumthani, 12120 Thailand
| | - Chupong Ittiwut
- Center of Excellence for Medical Genomics, Department of Pediatrics, Faculty of Medicine, Chulalongkorn University, Bangkok, 10330 Thailand
- Excellence Center for Genomics and Precision Medicine, King Chulalongkorn Memorial Hospital, the Thai Red Cross Society, Bangkok, 10330 Thailand
| |
Collapse
|
38
|
Lewis-Smith D, Parthasarathy S, Xian J, Kaufman MC, Ganesan S, Galer PD, Thomas RH, Helbig I. Computational analysis of neurodevelopmental phenotypes: Harmonization empowers clinical discovery. Hum Mutat 2022; 43:1642-1658. [PMID: 35460582 PMCID: PMC9560951 DOI: 10.1002/humu.24389] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2022] [Revised: 03/23/2022] [Accepted: 04/21/2022] [Indexed: 11/09/2022]
Abstract
Making a specific diagnosis in neurodevelopmental disorders is traditionally based on recognizing clinical features of a distinct syndrome, which guides testing of its possible genetic etiologies. Scalable frameworks for genomic diagnostics, however, have struggled to integrate meaningful measurements of clinical phenotypic features. While standardization has enabled generation and interpretation of genomic data for clinical diagnostics at unprecedented scale, making the equivalent breakthrough for clinical data has proven challenging. However, increasingly clinical features are being recorded using controlled dictionaries with machine readable formats such as the Human Phenotype Ontology (HPO), which greatly facilitates their use in the diagnostic space. Improving the tractability of large-scale clinical information will present new opportunities to inform genomic research and diagnostics from a clinical perspective. Here, we describe novel approaches for computational phenotyping to harmonize clinical features, improve data translation through revising domain-specific dictionaries, quantify phenotypic features, and determine clinical relatedness. We demonstrate how these concepts can be applied to longitudinal phenotypic information, which represents a critical element of developmental disorders and pediatric conditions. Finally, we expand our discussion to clinical data derived from electronic medical records, a largely untapped resource of deep clinical information with distinct strengths and weaknesses.
Collapse
Affiliation(s)
- David Lewis-Smith
- Translational and Clinical Research Institute, Newcastle University, Newcastle-upon-Tyne, UK
- Department of Clinical Neurosciences, Royal Victoria Infirmary, Newcastle-upon-Tyne, UK
- Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Shridhar Parthasarathy
- Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Julie Xian
- Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Michael C. Kaufman
- Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Shiva Ganesan
- Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Peter D. Galer
- Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Center for Neuroengineering and Therapeutics, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, USA
| | - Rhys H. Thomas
- Translational and Clinical Research Institute, Newcastle University, Newcastle-upon-Tyne, UK
- Department of Clinical Neurosciences, Royal Victoria Infirmary, Newcastle-upon-Tyne, UK
| | - Ingo Helbig
- Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Neurology, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, USA
| |
Collapse
|
39
|
Chen T, Xu B, Chen H, Sun Y, Song J, Sun X, Zhang X, Hua W. Transcription factor NFE2L3 promotes the proliferation of esophageal squamous cell carcinoma cells and causes radiotherapy resistance by regulating IL-6. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2022; 226:107102. [PMID: 36108571 DOI: 10.1016/j.cmpb.2022.107102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 08/23/2022] [Accepted: 08/29/2022] [Indexed: 06/15/2023]
Abstract
OBJECTIVE To scrutinize the impact of overexpression and interference of NFE2L3 on radiosensitivity of esophageal squamous cell carcinoma cells (ESCC) and its downstream mechanism and to assess whether NFE2L3 expression alters in vivo radiosensitivity of ESCC by developing a subcutaneous tumor model in mice. METHODS Through RNA-Seq, we compared the differentially expressed genes between the ECA-109R cell line and its parent ECA-109 cell line. The differentially expressed genes were selected and verified by qRT-PCR. Transfection of ESCC cell lines with NFE2L3 inhibitor or mimic lentivirus constructs was done to study the activity of NFE2L3. To assess the effect of NFE2L3 on cellular growth and proliferation, clonogenic survival assay, EdU incorporation assay, and CCK-8 assay were done after irradiation. To probe how many irradiated DNA double-strand breaks were produced, the corresponding intensity of γ-H2AX foci were detected by immunofluorescence. Apoptotic cells were assayed by flow cytometry assay after irradiation; To investigate the downstream genes of NFE2L3, we knocked NFE2L3, and RNA-Seq was used to find out the downstream genes. qRT-PCR and western blot ensued to score associated protein profiles. The in vivo ESCC cell radiosensitivity was scrutinized by nude mouse xenograft models. RESULTS The differential genes between ECA-109R cells and its parent ECA-109 cells were compared by qRT-PCR to unveil a significant increase in NFE2L3 expression. Functional analysis indicated that NFE2L3 increased radioresistance in ESCC cells. Then, through high-throughput sequencing and bioinformatics analysis, IL-6 was found to be a hub gene that played a role downstream of NFE2L3 and was verified by qRT-PCR, western blot, and double luciferase reporter gene experiment. NFE2L3 could regulate ESCC cell radiosensitivity via the IL-6-STAT3 signaling pathway, and downregulation of IL-6 expression could reverse the effects of highly expressed NFE2L3. In vivo tumor xenograft experiments confirmed that NFE2L3 affects the sensitivity to radiation therapy. CONCLUSION NFE2L3 can affect the radiosensitivity of ESCC cells through IL-6 transcription and IL-6/STAT3 signaling pathway. This makes NFE2L3 a putative target to regulate ESCC cell radiosensitivity.
Collapse
Affiliation(s)
- Tingting Chen
- Department of Oncology, Clinical Medical College, Yangzhou University, Yangzhou, Jiangsu Province, PR China
| | - Bing Xu
- Department of Oncology, Clinical Medical College, Yangzhou University, Yangzhou, Jiangsu Province, PR China
| | - Hui Chen
- Department of Radiation Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, Jiangsu Province, PR China
| | - Yuanyuan Sun
- Department of Radiation Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, Jiangsu Province, PR China
| | - Jiahang Song
- Department of Radiation Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, Jiangsu Province, PR China
| | - Xinchen Sun
- Department of Radiation Oncology, The First Affiliated Hospital of Nanjing Medical University, Nanjing, Jiangsu Province, PR China.
| | - Xizhi Zhang
- Department of Oncology, Clinical Medical College, Yangzhou University, Yangzhou, Jiangsu Province, PR China.
| | - Wei Hua
- Department of Oncology, Clinical Medical College, Yangzhou University, Yangzhou, Jiangsu Province, PR China.
| |
Collapse
|
40
|
Genome-wide rare variant score associates with morphological subtypes of autism spectrum disorder. Nat Commun 2022; 13:6463. [PMID: 36309498 PMCID: PMC9617891 DOI: 10.1038/s41467-022-34112-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Accepted: 10/13/2022] [Indexed: 02/06/2023] Open
Abstract
Defining different genetic subtypes of autism spectrum disorder (ASD) can enable the prediction of developmental outcomes. Based on minor physical and major congenital anomalies, we categorize 325 Canadian children with ASD into dysmorphic and nondysmorphic subgroups. We develop a method for calculating a patient-level, genome-wide rare variant score (GRVS) from whole-genome sequencing (WGS) data. GRVS is a sum of the number of variants in morphology-associated coding and non-coding regions, weighted by their effect sizes. Probands with dysmorphic ASD have a significantly higher GRVS compared to those with nondysmorphic ASD (P = 0.03). Using the polygenic transmission disequilibrium test, we observe an over-transmission of ASD-associated common variants in nondysmorphic ASD probands (P = 2.9 × 10-3). These findings replicate using WGS data from 442 ASD probands with accompanying morphology data from the Simons Simplex Collection. Our results provide support for an alternative genomic classification of ASD subgroups using morphology data, which may inform intervention protocols.
Collapse
|
41
|
van den Hurk M, Lau S, Marchetto MC, Mertens J, Stern S, Corti O, Brice A, Winner B, Winkler J, Gage FH, Bardy C. Druggable transcriptomic pathways revealed in Parkinson's patient-derived midbrain neurons. NPJ Parkinsons Dis 2022; 8:134. [PMID: 36258029 PMCID: PMC9579158 DOI: 10.1038/s41531-022-00400-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Accepted: 09/28/2022] [Indexed: 11/06/2022] Open
Abstract
Complex genetic predispositions accelerate the chronic degeneration of midbrain substantia nigra neurons in Parkinson’s disease (PD). Deciphering the human molecular makeup of PD pathophysiology can guide the discovery of therapeutics to slow the disease progression. However, insights from human postmortem brain studies only portray the latter stages of PD, and there is a lack of data surrounding molecular events preceding the neuronal loss in patients. We address this gap by identifying the gene dysregulation of live midbrain neurons reprogrammed in vitro from the skin cells of 42 individuals, including sporadic and familial PD patients and matched healthy controls. To minimize bias resulting from neuronal reprogramming and RNA-seq methods, we developed an analysis pipeline integrating PD transcriptomes from different RNA-seq datasets (unsorted and sorted bulk vs. single-cell and Patch-seq) and reprogramming strategies (induced pluripotency vs. direct conversion). This PD cohort’s transcriptome is enriched for human genes associated with known clinical phenotypes of PD, regulation of locomotion, bradykinesia and rigidity. Dysregulated gene expression emerges strongest in pathways underlying synaptic transmission, metabolism, intracellular trafficking, neural morphogenesis and cellular stress/immune responses. We confirmed a synaptic impairment with patch-clamping and identified pesticides and endoplasmic reticulum stressors as the most significant gene-chemical interactions in PD. Subsequently, we associated the PD transcriptomic profile with candidate pharmaceuticals in a large database and a registry of current clinical trials. This study highlights human transcriptomic pathways that can be targeted therapeutically before the irreversible neuronal loss. Furthermore, it demonstrates the preclinical relevance of unbiased large transcriptomic assays of reprogrammed patient neurons.
Collapse
Affiliation(s)
- Mark van den Hurk
- grid.430453.50000 0004 0565 2606South Australian Health and Medical Research Institute (SAHMRI), Laboratory for Human Neurophysiology and Genetics, Adelaide, SA Australia
| | - Shong Lau
- grid.250671.70000 0001 0662 7144Laboratory of Genetics, Salk Institute for Biological Studies, La Jolla, CA USA
| | - Maria C. Marchetto
- grid.266100.30000 0001 2107 4242Department of Anthropology, University of California San Diego, La Jolla, CA USA
| | - Jerome Mertens
- grid.250671.70000 0001 0662 7144Laboratory of Genetics, Salk Institute for Biological Studies, La Jolla, CA USA ,grid.5771.40000 0001 2151 8122Neural Aging Laboratory, Institute of Molecular Biology, CMBI, Leopold-Franzens-University Innsbruck, Innsbruck, Tyrol Austria
| | - Shani Stern
- grid.250671.70000 0001 0662 7144Laboratory of Genetics, Salk Institute for Biological Studies, La Jolla, CA USA ,grid.18098.380000 0004 1937 0562Sagol Department of Neurobiology, Faculty of Natural Sciences, University of Haifa, Haifa, Israel
| | - Olga Corti
- grid.425274.20000 0004 0620 5939Sorbonne Université, Institut du Cerveau - Paris Brain Institute - ICM, Inserm, CNRS, AP-HP, Hôpital de la Pitié Salpêtrière, DMU BioGeM, Paris, France
| | - Alexis Brice
- grid.425274.20000 0004 0620 5939Sorbonne Université, Institut du Cerveau - Paris Brain Institute - ICM, Inserm, CNRS, AP-HP, Hôpital de la Pitié Salpêtrière, DMU BioGeM, Paris, France
| | - Beate Winner
- grid.411668.c0000 0000 9935 6525Department of Stem Cell Biology, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg (FAU), Erlangen, Germany ,grid.411668.c0000 0000 9935 6525Center of Rare Diseases Erlangen (ZSEER), University Hospital Erlangen, FAU Erlangen-Nürnberg, Erlangen, Germany ,grid.411668.c0000 0000 9935 6525Department of Molecular Neurology, University Hospital Erlangen, FAU Erlangen-Nürnberg, Erlangen, Germany
| | - Jürgen Winkler
- grid.411668.c0000 0000 9935 6525Department of Stem Cell Biology, University Hospital Erlangen, Friedrich-Alexander University Erlangen-Nürnberg (FAU), Erlangen, Germany ,grid.411668.c0000 0000 9935 6525Center of Rare Diseases Erlangen (ZSEER), University Hospital Erlangen, FAU Erlangen-Nürnberg, Erlangen, Germany ,grid.411668.c0000 0000 9935 6525Department of Molecular Neurology, University Hospital Erlangen, FAU Erlangen-Nürnberg, Erlangen, Germany
| | - Fred H. Gage
- grid.250671.70000 0001 0662 7144Laboratory of Genetics, Salk Institute for Biological Studies, La Jolla, CA USA
| | - Cedric Bardy
- grid.430453.50000 0004 0565 2606South Australian Health and Medical Research Institute (SAHMRI), Laboratory for Human Neurophysiology and Genetics, Adelaide, SA Australia ,grid.1014.40000 0004 0367 2697Flinders Health and Medical Research Institute, Flinders University, Adelaide, SA Australia
| |
Collapse
|
42
|
Lindstrand A, Ek M, Kvarnung M, Anderlid BM, Björck E, Carlsten J, Eisfeldt J, Grigelioniene G, Gustavsson P, Hammarsjö A, Helgadóttir HT, Hellström-Pigg M, Kuchinskaya E, Lagerstedt-Robinson K, Levin LÅ, Lieden A, Lindelöf H, Malmgren H, Nilsson D, Svensson E, Paucar M, Sahlin E, Tesi B, Tham E, Winberg J, Winerdal M, Wincent J, Johansson Soller M, Pettersson M, Nordgren A. Genome sequencing is a sensitive first-line test to diagnose individuals with intellectual disability. Genet Med 2022; 24:2296-2307. [PMID: 36066546 DOI: 10.1016/j.gim.2022.07.022] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 07/22/2022] [Accepted: 07/25/2022] [Indexed: 10/14/2022] Open
Abstract
PURPOSE Individuals with intellectual disability (ID) and/or neurodevelopment disorders (NDDs) are currently investigated with several different approaches in clinical genetic diagnostics. METHODS We compared the results from 3 diagnostic pipelines in patients with ID/NDD: genome sequencing (GS) first (N = 100), GS as a secondary test (N = 129), or chromosomal microarray (CMA) with or without FMR1 analysis (N = 421). RESULTS The diagnostic yield was 35% (GS-first), 26% (GS as a secondary test), and 11% (CMA/FMR1). Notably, the age of diagnosis was delayed by 1 year when GS was performed as a secondary test and the cost per diagnosed individual was 36% lower with GS first than with CMA/FMR1. Furthermore, 91% of those with a negative result after CMA/FMR1 analysis (338 individuals) have not yet been referred for additional genetic testing and remain undiagnosed. CONCLUSION Our findings strongly suggest that genome analysis outperforms other testing strategies and should replace traditional CMA and FMR1 analysis as a first-line genetic test in individuals with ID/NDD. GS is a sensitive, time- and cost-effective method that results in a confirmed molecular diagnosis in 35% of all referred patients.
Collapse
Affiliation(s)
- Anna Lindstrand
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden.
| | - Marlene Ek
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Malin Kvarnung
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Britt-Marie Anderlid
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Erik Björck
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Jonas Carlsten
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Jesper Eisfeldt
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden; Science for Life Laboratory, Department of Molecular Medicine and Surgery, Karolinska Institutet, Solna, Sweden
| | - Giedre Grigelioniene
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Peter Gustavsson
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Anna Hammarsjö
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Hafdís T Helgadóttir
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Maritta Hellström-Pigg
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Ekaterina Kuchinskaya
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Kristina Lagerstedt-Robinson
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Lars-Åke Levin
- Department of Health, Medicine and Caring Sciences, Linköping University, Linköping, Sweden
| | - Agne Lieden
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Hillevi Lindelöf
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Helena Malmgren
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Daniel Nilsson
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden; Science for Life Laboratory, Department of Molecular Medicine and Surgery, Karolinska Institutet, Solna, Sweden
| | - Eva Svensson
- Department of Pediatric Neurology, Karolinska University Hospital, Huddinge, Sweden
| | - Martin Paucar
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Ellika Sahlin
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Bianca Tesi
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Emma Tham
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Johanna Winberg
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Max Winerdal
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Josephine Wincent
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Maria Johansson Soller
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Maria Pettersson
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| | - Ann Nordgren
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden; Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden
| |
Collapse
|
43
|
Integration of Human Protein Sequence and Protein-Protein Interaction Data by Graph Autoencoder to Identify Novel Protein-Abnormal Phenotype Associations. Cells 2022; 11:cells11162485. [PMID: 36010562 PMCID: PMC9406402 DOI: 10.3390/cells11162485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Revised: 07/31/2022] [Accepted: 08/05/2022] [Indexed: 11/18/2022] Open
Abstract
Understanding gene functions and their associated abnormal phenotypes is crucial in the prevention, diagnosis and treatment against diseases. The Human Phenotype Ontology (HPO) is a standardized vocabulary for describing the phenotype abnormalities associated with human diseases. However, the current HPO annotations are far from completion, and only a small fraction of human protein-coding genes has HPO annotations. Thus, it is necessary to predict protein-phenotype associations using computational methods. Protein sequences can indicate the structure and function of the proteins, and interacting proteins are more likely to have same function. It is promising to integrate these features for predicting HPO annotations of human protein. We developed GraphPheno, a semi-supervised method based on graph autoencoders, which does not require feature engineering to capture deep features from protein sequences, while also taking into account the topological properties in the protein–protein interaction network to predict the relationships between human genes/proteins and abnormal phenotypes. Cross validation and independent dataset tests show that GraphPheno has satisfactory prediction performance. The algorithm is further confirmed on automatic HPO annotation for no-knowledge proteins under the benchmark of the second Critical Assessment of Functional Annotation, 2013–2014 (CAFA2), where GraphPheno surpasses most existing methods. Further bioinformatics analysis shows that predicted certain phenotype-associated genes using GraphPheno share similar biological properties with known ones. In a case study on the phenotype of abnormality of mitochondrial respiratory chain, top prioritized genes are validated by recent papers. We believe that GraphPheno will help to reveal more associations between genes and phenotypes, and contribute to the discovery of drug targets.
Collapse
|
44
|
Yang DD, Rio M, Michot C, Boddaert N, Yacoub W, Garcelon N, Thierry B, Bonnet D, Rondeau S, Herve D, Guey S, Angoulvant F, Cormier-Daire V. Natural history of Myhre syndrome. Orphanet J Rare Dis 2022; 17:304. [PMID: 35907855 PMCID: PMC9338657 DOI: 10.1186/s13023-022-02447-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Accepted: 07/14/2022] [Indexed: 11/16/2022] Open
Abstract
Background Myhre syndrome (MS) is a rare genetic disease characterized by skeletal disorders, facial features and joint limitation, caused by a gain of function mutation in SMAD4 gene. The natural history of MS remains incompletely understood.
Methods We recruited in a longitudinal retrospective study patients with molecular confirmed MS from the French reference center for rare skeletal dysplasia. We described natural history by chaining data from medical reports, clinical data warehouse, medical imaging and photographies.
Results We included 12 patients. The median age was 22 years old (y/o). Intrauterine and postnatal growth retardation were consistently reported. In preschool age, neurodevelopment disorders were reported in 80% of children. Specifics facial and skeletal features, thickened skin and joint limitation occured mainly in school age children. The adolescence was marked by the occurrence of pulmonary arterial hypertension (PAH) and vascular stenosis. We reported for the first time recurrent strokes from the age of 26 y/o, caused by a moyamoya syndrome in one patient. Two patients died at late adolescence and in their 20 s respectively from PAH crises and mesenteric ischemia. Conclusion Myhre syndrome is a progressive disease with severe multisystemic impairement and life-threathning complication requiring multidisciplinary monitoring.
Collapse
Affiliation(s)
- David Dawei Yang
- Centre de Recherche Des Cordeliers, INSERM UMRS 1138 Team 22, Université de Paris, 75006, Paris, France.,Pediatric Emergency Department, AP-HP, Hôpital Universitaire Necker-Enfants Malades, 75015, Paris, France
| | - Marlene Rio
- Université de Paris, Institut IMAGINE, Developmental Brain Disorders Laboratory, INSERM UMR1163, 75015, Paris, France.,Departement of Medical Genetics, AP-HP, Hôpital Universitaire Necker-Enfants Malades, 75015, Paris, France
| | - Caroline Michot
- Departement of Medical Genetics, AP-HP, Hôpital Universitaire Necker-Enfants Malades, 75015, Paris, France.,Université de Paris, Institut IMAGINE, Molecular and Physiopathological Bases of Osteochondrodysplasia, INSERM UMR1163, 75015, Paris, France
| | - Nathalie Boddaert
- Paediatric Radiology Department, AP-HP, Hôpital Universitaire Necker Enfants Malades, 75015, Paris, France.,Université de Paris, Institut IMAGINE, INSERM1163, 75015, Paris, France
| | - Wael Yacoub
- Paediatric Radiology Department, AP-HP, Hôpital Universitaire Necker Enfants Malades, 75015, Paris, France.,Université de Paris, Institut IMAGINE, INSERM1163, 75015, Paris, France
| | - Nicolas Garcelon
- Centre de Recherche Des Cordeliers, INSERM UMRS 1138 Team 22, Université de Paris, 75006, Paris, France.,Université de Paris, Institut IMAGINE, Data Science Platform, INSERM UMR1163, 75015, Paris, France
| | - Briac Thierry
- Department of Pediatric Otolaryngology-Head and Neck Surgery, AP-HP, Hôpital Universitaire Necker - Enfants Malades, 75015, Paris, France.,Université de Paris, Human Immunology, Pathophysiology, Immunotherapy/HIPI/INSERM UMR976, Stem Cell Biotechnologies, 75010, Paris, France
| | - Damien Bonnet
- Université de Paris, Institut IMAGINE, INSERM1163, 75015, Paris, France.,M3C-Paediatric Cardiology, AP-HP, Hôpital Universitaire Necker Enfants Malades, 75015, Paris, France
| | - Sophie Rondeau
- Departement of Medical Genetics, AP-HP, Hôpital Universitaire Necker-Enfants Malades, 75015, Paris, France.,Université de Paris, Institut IMAGINE, Molecular and Physiopathological Bases of Osteochondrodysplasia, INSERM UMR1163, 75015, Paris, France
| | - Dominique Herve
- Department of Neurology, AP-HP Nord, Referral Center for Rare Vascular Diseases of the Brain and Retina (CERVCO), DHU NeuroVasc, INSERM U 1161, 75010, Paris, France
| | - Stephanie Guey
- Department of Neurology, AP-HP, Hôpital Lariboisière, UMR-S1161, 75010, Paris, France
| | - Francois Angoulvant
- Centre de Recherche Des Cordeliers, INSERM UMRS 1138 Team 22, Université de Paris, 75006, Paris, France.,Pediatric Emergency Department, AP-HP, Hôpital Universitaire Necker-Enfants Malades, 75015, Paris, France
| | - Valerie Cormier-Daire
- Departement of Medical Genetics, AP-HP, Hôpital Universitaire Necker-Enfants Malades, 75015, Paris, France. .,Université de Paris, Institut IMAGINE, Molecular and Physiopathological Bases of Osteochondrodysplasia, INSERM UMR1163, 75015, Paris, France.
| |
Collapse
|
45
|
Reese JT, Blau H, Bergquist T, Loomba JJ, Callahan T, Laraway B, Antonescu C, Casiraghi E, Coleman B, Gargano M, Wilkins KJ, Cappelletti L, Fontana T, Ammar N, Antony B, Murali TM, Karlebach G, McMurry JA, Williams A, Moffitt R, Banerjee J, Solomonides AE, Davis H, Kostka K, Valentini G, Sahner D, Chute CG, Madlock-Brown C, Haendel MA, Robinson PN. Generalizable Long COVID Subtypes: Findings from the NIH N3C and RECOVER Programs. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2022:2022.05.24.22275398. [PMID: 35665012 PMCID: PMC9164456 DOI: 10.1101/2022.05.24.22275398] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/11/2023]
Abstract
Accurate stratification of patients with post-acute sequelae of SARS-CoV-2 infection (PASC, or long COVID) would allow precision clinical management strategies. However, the natural history of long COVID is incompletely understood and characterized by an extremely wide range of manifestations that are difficult to analyze computationally. In addition, the generalizability of machine learning classification of COVID-19 clinical outcomes has rarely been tested. We present a method for computationally modeling PASC phenotype data based on electronic healthcare records (EHRs) and for assessing pairwise phenotypic similarity between patients using semantic similarity. Our approach defines a nonlinear similarity function that maps from a feature space of phenotypic abnormalities to a matrix of pairwise patient similarity that can be clustered using unsupervised machine learning procedures. Using k-means clustering of this similarity matrix, we found six distinct clusters of PASC patients, each with distinct profiles of phenotypic abnormalities. There was a significant association of cluster membership with a range of pre-existing conditions and with measures of severity during acute COVID-19. Two of the clusters were associated with severe manifestations and displayed increased mortality. We assigned new patients from other healthcare centers to one of the six clusters on the basis of maximum semantic similarity to the original patients. We show that the identified clusters were generalizable across different hospital systems and that the increased mortality rate was consistently observed in two of the clusters. Semantic phenotypic clustering can provide a foundation for assigning patients to stratified subgroups for natural history or therapy studies on PASC.
Collapse
|
46
|
Joyce KE, Onabanjo E, Brownlow S, Nur F, Olupona K, Fakayode K, Sroya M, Thomas GA, Ferguson T, Redhead J, Millar CM, Cooper N, Layton DM, Boardman-Pretty F, Caulfield MJ, Shovlin CL. Whole genome sequences discriminate hereditary hemorrhagic telangiectasia phenotypes by non-HHT deleterious DNA variation. Blood Adv 2022; 6:3956-3969. [PMID: 35316832 PMCID: PMC9278305 DOI: 10.1182/bloodadvances.2022007136] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 02/21/2022] [Indexed: 11/20/2022] Open
Abstract
The abnormal vascular structures of hereditary hemorrhagic telangiectasia (HHT) often cause severe anemia due to recurrent hemorrhage, but HHT causal genes do not predict the severity of hematological complications. We tested for chance inheritance and clinical associations of rare deleterious variants in which loss-of-function causes bleeding or hemolytic disorders in the general population. In double-blinded analyses, all 104 patients with HHT from a single reference center recruited to the 100 000 Genomes Project were categorized on new MALO (more/as-expected/less/opposite) sub-phenotype severity scales, and whole genome sequencing data were tested for high impact variants in 75 HHT-independent genes encoding coagulation factors, or platelet, hemoglobin, erythrocyte enzyme, and erythrocyte membrane constituents. Rare variants (all gnomAD allele frequencies <0.003) were identified in 56 (75%) of these 75 HHT-unrelated genes. Deleteriousness assignments by Combined Annotation Dependent Depletion (CADD) scores >15 were supported by gene-level mutation significance cutoff scores. CADD >15 variants were identified in 38/104 (36.5%) patients with HHT, found for 1 in 10 patients within platelet genes; 1 in 8 within coagulation genes; and 1 in 4 within erythrocyte hemolytic genes. In blinded analyses, patients with greater hemorrhagic severity that had been attributed solely to HHT vessels had more CADD-deleterious variants in platelet (Spearman ρ = 0.25; P = .008) and coagulation (Spearman ρ = 0.21; P = .024) genes. However, the HHT cohort had 60% fewer deleterious variants in platelet and coagulation genes than expected (Mann-Whitney test P = .021). In conclusion, patients with HHT commonly have rare variants in genes of relevance to their phenotype, offering new therapeutic targets and opportunities for informed, personalized medicine strategies.
Collapse
Affiliation(s)
- Katie E. Joyce
- Imperial College School of Medicine, Imperial College, London, United Kingdom
- Genomics England Respiratory Clinical Interpretation Partnership (GeCIP), London, United Kingdom
| | - Ebun Onabanjo
- West London Genomic Medicine Centre, Imperial College Healthcare NHS Trust, London, United Kingdom
| | - Sheila Brownlow
- West London Genomic Medicine Centre, Imperial College Healthcare NHS Trust, London, United Kingdom
| | - Fadumo Nur
- West London Genomic Medicine Centre, Imperial College Healthcare NHS Trust, London, United Kingdom
| | - Kike Olupona
- West London Genomic Medicine Centre, Imperial College Healthcare NHS Trust, London, United Kingdom
| | - Kehinde Fakayode
- West London Genomic Medicine Centre, Imperial College Healthcare NHS Trust, London, United Kingdom
| | - Manveer Sroya
- Department of Surgery and Cancer, Imperial College, London, United Kingdom
| | | | - Teena Ferguson
- West London Genomic Medicine Centre, Imperial College Healthcare NHS Trust, London, United Kingdom
| | - Julian Redhead
- West London Genomic Medicine Centre, Imperial College Healthcare NHS Trust, London, United Kingdom
| | - Carolyn M. Millar
- West London Genomic Medicine Centre, Imperial College Healthcare NHS Trust, London, United Kingdom
- Centre for Haematology, Department of Immunology and Inflammation, Imperial College, London, United Kingdom
| | - Nichola Cooper
- West London Genomic Medicine Centre, Imperial College Healthcare NHS Trust, London, United Kingdom
- Centre for Haematology, Department of Immunology and Inflammation, Imperial College, London, United Kingdom
| | - D. Mark Layton
- West London Genomic Medicine Centre, Imperial College Healthcare NHS Trust, London, United Kingdom
- Centre for Haematology, Department of Immunology and Inflammation, Imperial College, London, United Kingdom
| | | | - Mark J. Caulfield
- Genomics England Research Consortium, Genomics England, London, United Kingdom
- William Harvey Research Institute, Queen Mary University of London, London, United Kingdom; and
| | | | - Claire L. Shovlin
- Genomics England Respiratory Clinical Interpretation Partnership (GeCIP), London, United Kingdom
- West London Genomic Medicine Centre, Imperial College Healthcare NHS Trust, London, United Kingdom
- National Heart and Lung Institute, Imperial College, London, United Kingdom
| |
Collapse
|
47
|
Han JH, Ryan G, Guy A, Liu L, Quinodoz M, Helbling I, Lai-Cheong JE, Barwell J, Folcher M, McGrath JA, Moss C, Rivolta C. Mutations in the ribosome biogenesis factor gene LTV1 are linked to LIPHAK syndrome, a novel poikiloderma-like disorder. Hum Mol Genet 2022; 31:1970-1978. [PMID: 34999892 PMCID: PMC9239743 DOI: 10.1093/hmg/ddab368] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Revised: 11/18/2021] [Accepted: 12/13/2021] [Indexed: 11/12/2022] Open
Abstract
In the framework of the UK 100 000 Genomes Project, we investigated the genetic origin of a previously undescribed recessive dermatological condition, which we named LIPHAK (LTV1-associated Inflammatory Poikiloderma with Hair abnormalities and Acral Keratoses), in four affected individuals from two UK families of Pakistani and Indian origins, respectively. Our analysis showed that only one gene, LTV1, carried rare biallelic variants that were shared in all affected individuals, and specifically they bore the NM_032860.5:c.503A > G, p.(Asn168Ser) change, found homozygously in all of them. In addition, high-resolution homozygosity mapping revealed the presence of a small 652-kb stretch on chromosome 6, encompassing LTV1, that was haploidentical and common to all affected individuals. The c.503A > G variant was predicted by in silico tools to affect the correct splicing of LTV1's exon 5. Minigene-driven splicing assays in HEK293T cells and in a skin sample from one of the patients confirmed that this variant was indeed responsible for the creation of a new donor splice site, resulting in aberrant splicing and in a premature termination codon in exon 6 of this gene. LTV1 encodes one of the ribosome biogenesis factors that promote the assembly of the small (40S) ribosomal subunit. In yeast, defects in LTV1 alter the export of nascent ribosomal subunits to the cytoplasm; however, the role of this gene in human pathology is unknown to date. Our data suggest that LIPHAK could be a previously unrecognized ribosomopathy.
Collapse
Affiliation(s)
- Ji Hoon Han
- Institute of Molecular and Clinical Ophthalmology Basel (IOB), 4031 Basel, Switzerland
- Department of Ophthalmology, University of Basel, 4031 Basel, Switzerland
| | - Gavin Ryan
- West Midlands Regional Genetics Laboratory, Central and South Genomic Laboratory Hub, Birmingham B15 2TG, UK
| | - Alyson Guy
- Viapath, St Thomas' Hospital, London SE1 7EH, UK
| | - Lu Liu
- Viapath, St Thomas' Hospital, London SE1 7EH, UK
| | - Mathieu Quinodoz
- Institute of Molecular and Clinical Ophthalmology Basel (IOB), 4031 Basel, Switzerland
- Department of Ophthalmology, University of Basel, 4031 Basel, Switzerland
- Department of Genetics and Genome Biology, University of Leicester, Leicester LE1 7RH, UK
| | - Ingrid Helbling
- Department of Dermatology, University Hospitals of Leicester NHS Trust, Leicester LE1 5WW, UK
| | | | | | - Julian Barwell
- Department of Genetics and Genome Biology, University of Leicester, Leicester LE1 7RH, UK
- Department of Clinical Genetics, University Hospitals of Leicester NHS Trust, Leicester LE1 5WW, UK
| | - Marc Folcher
- Institute of Molecular and Clinical Ophthalmology Basel (IOB), 4031 Basel, Switzerland
- Department of Ophthalmology, University of Basel, 4031 Basel, Switzerland
| | - John A McGrath
- NIHR Biomedical Research Centre, Guy's and St Thomas' NHS Foundation Trust and King's College London, London SE1 9RT, UK
- St John's Institute of Dermatology, King's College London (Guy's campus), London SE1 9RT, UK
| | - Celia Moss
- Department of Paediatric Dermatology, Birmingham Women’s and Children’s Hospital NHS FT, Birmingham B4 6NH, UK
- College of Medical and Dental Sciences, University of Birmingham, Birmingham B15 2TT, UK
| | - Carlo Rivolta
- Institute of Molecular and Clinical Ophthalmology Basel (IOB), 4031 Basel, Switzerland
- Department of Ophthalmology, University of Basel, 4031 Basel, Switzerland
- Department of Genetics and Genome Biology, University of Leicester, Leicester LE1 7RH, UK
| |
Collapse
|
48
|
Driver HG, Hartley T, Price EM, Turinsky AL, Buske OJ, Osmond M, Ramani AK, Kirby E, Kernohan KD, Couse M, Elrick H, Lu K, Mashouri P, Mohan A, So D, Klamann C, Le HGBH, Herscovich A, Marshall CR, Statia A, Canada Consortium C, Knoppers BM, Brudno M, Boycott KM. Genomics4RD: An integrated platform to share Canadian deep-phenotype and multiomic data for international rare disease gene discovery. Hum Mutat 2022; 43:800-811. [PMID: 35181971 PMCID: PMC9311832 DOI: 10.1002/humu.24354] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Revised: 02/08/2022] [Accepted: 02/16/2022] [Indexed: 11/06/2022]
Abstract
Despite recent progress in the understanding of the genetic etiologies of rare diseases (RDs), a significant number remain intractable to diagnostic and discovery efforts. Broad data collection and sharing of information among RD researchers is therefore critical. In 2018, the Care4Rare Canada Consortium launched the project C4R‐SOLVE, a subaim of which was to collect, harmonize, and share both retrospective and prospective Canadian clinical and multiomic data. Here, we introduce Genomics4RD, an integrated web‐accessible platform to share Canadian phenotypic and multiomic data between researchers, both within Canada and internationally, for the purpose of discovering the mechanisms that cause RDs. Genomics4RD has been designed to standardize data collection and processing, and to help users systematically collect, prioritize, and visualize participant information. Data storage, authorization, and access procedures have been developed in collaboration with policy experts and stakeholders to ensure the trusted and secure access of data by external researchers. The breadth and standardization of data offered by Genomics4RD allows researchers to compare candidate disease genes and variants between participants (i.e., matchmaking) for discovery purposes, while facilitating the development of computational approaches for multiomic data analyses and enabling clinical translation efforts for new genetic technologies in the future.
Collapse
Affiliation(s)
- Hannah G. Driver
- Children's Hospital of Eastern Ontario Research InstituteUniversity of OttawaOttawaCanada
| | - Taila Hartley
- Children's Hospital of Eastern Ontario Research InstituteUniversity of OttawaOttawaCanada
| | - E. Magda Price
- Children's Hospital of Eastern Ontario Research InstituteUniversity of OttawaOttawaCanada
| | - Andrei L. Turinsky
- Centre for Computational MedicineThe Hospital for Sick ChildrenTorontoCanada
| | | | - Matthew Osmond
- Children's Hospital of Eastern Ontario Research InstituteUniversity of OttawaOttawaCanada
| | - Arun K. Ramani
- Centre for Computational MedicineThe Hospital for Sick ChildrenTorontoCanada
| | - Emily Kirby
- Centre of Genomics and PolicyMcGill UniversityMontrealCanada
| | - Kristin D. Kernohan
- Children's Hospital of Eastern Ontario Research InstituteUniversity of OttawaOttawaCanada
- Newborn Screening OntarioChildren's Hospital of Eastern OntarioOttawaCanada
- Genomics4RD Steering CommitteeChildren's Hospital of Eastern Ontario Research InstituteOttawaCanada
| | - Madeline Couse
- Centre for Computational MedicineThe Hospital for Sick ChildrenTorontoCanada
| | - Hillary Elrick
- Centre for Computational MedicineThe Hospital for Sick ChildrenTorontoCanada
| | - Kevin Lu
- Centre for Computational MedicineThe Hospital for Sick ChildrenTorontoCanada
| | - Pouria Mashouri
- Centre for Computational MedicineThe Hospital for Sick ChildrenTorontoCanada
| | - Aarthi Mohan
- Centre for Computational MedicineThe Hospital for Sick ChildrenTorontoCanada
| | - Delvin So
- Centre for Computational MedicineThe Hospital for Sick ChildrenTorontoCanada
| | - Conor Klamann
- Centre for Computational MedicineThe Hospital for Sick ChildrenTorontoCanada
| | - Hannah G. B. H. Le
- Centre for Computational MedicineThe Hospital for Sick ChildrenTorontoCanada
| | - Andrea Herscovich
- Genomics4RD Steering CommitteeChildren's Hospital of Eastern Ontario Research InstituteOttawaCanada
| | - Christian R. Marshall
- Genomics4RD Steering CommitteeChildren's Hospital of Eastern Ontario Research InstituteOttawaCanada
- Genome DiagnosticsThe Hospital for Sick ChildrenTorontoCanada
| | - Andrew Statia
- Genomics4RD Steering CommitteeChildren's Hospital of Eastern Ontario Research InstituteOttawaCanada
| | | | | | - Michael Brudno
- PhenoTips, The Hospital for Sick ChildrenTorontoCanada
- Genomics4RD Steering CommitteeChildren's Hospital of Eastern Ontario Research InstituteOttawaCanada
- Techna InstituteUniversity Health NetworkTorontoCanada
- Department of Computer ScienceUniversity of TorontoTorontoCanada
| | - Kym M. Boycott
- Children's Hospital of Eastern Ontario Research InstituteUniversity of OttawaOttawaCanada
- Genomics4RD Steering CommitteeChildren's Hospital of Eastern Ontario Research InstituteOttawaCanada
| |
Collapse
|
49
|
Middleton L, Harper AR, Nag A, Wang Q, Reznichenko A, Vitsios D, Petrovski S. Gene-SCOUT: identifying genes with similar continuous trait fingerprints from phenome-wide association analyses. Nucleic Acids Res 2022; 50:4289-4301. [PMID: 35474393 PMCID: PMC9071452 DOI: 10.1093/nar/gkac274] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Revised: 03/31/2022] [Accepted: 04/09/2022] [Indexed: 12/24/2022] Open
Abstract
Large-scale phenome-wide association studies performed using densely-phenotyped cohorts such as the UK Biobank (UKB), reveal many statistically robust gene-phenotype relationships for both clinical and continuous traits. Here, we present Gene-SCOUT, a tool used to identify genes with similar continuous trait fingerprints to a gene of interest. A fingerprint reflects the continuous traits identified to be statistically associated with a gene of interest based on multiple underlying rare variant genetic architectures. Similarities between genes are evaluated by the cosine similarity measure, to capture concordant effect directionality, elucidating clusters of genes in a high dimensional space. The underlying gene-biomarker population-scale association statistics were obtained from a gene-level rare variant collapsing analysis performed on over 1500 continuous traits using 394 692 UKB participant exomes, with additional metabolomic trait associations provided through Nightingale Health's recent study of 121 394 of these participants. We demonstrate that gene similarity estimates from Gene-SCOUT provide stronger enrichments for clinical traits compared to existing methods. Furthermore, we provide a fully interactive web-resource (http://genescout.public.cgr.astrazeneca.com) to explore the pre-calculated exome-wide similarities. This resource enables a user to examine the biological relevance of the most similar genes for Gene Ontology (GO) enrichment and UKB clinical trait enrichment statistics, as well as a detailed breakdown of the traits underpinning a given fingerprint.
Collapse
Affiliation(s)
- Lawrence Middleton
- Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK
| | - Andrew R Harper
- Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK
| | - Abhishek Nag
- Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK
| | - Quanli Wang
- Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Waltham, MA, USA
| | - Anna Reznichenko
- Translational Science & Experimental Medicine, Research and Early Development, Cardiovascular, Renal and Metabolism, BioPharmaceuticals R&D, AstraZeneca, Gothenburg, Sweden
| | - Dimitrios Vitsios
- Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK
| | - Slavé Petrovski
- Centre for Genomics Research, Discovery Sciences, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK
- Department of Medicine, University of Melbourne, Austin Health, Melbourne, Victoria, Australia
| |
Collapse
|
50
|
Shi JW, Cao H, Hong L, Ma J, Cui L, Zhang Y, Song X, Liu J, Yang Y, Lv Q, Zhang L, Wang J, Xie M. Diagnostic yield of whole exome data in fetuses aborted for conotruncal malformations. Prenat Diagn 2022; 42:852-861. [PMID: 35420166 DOI: 10.1002/pd.6147] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Revised: 04/01/2022] [Accepted: 04/07/2022] [Indexed: 11/05/2022]
Abstract
OBJECTIVE We investigated a custom congenital heart disease (CHD) geneset to assess the diagnostic value of whole-exome sequencing (WES) in karyotype- and copy number variation (CNV)-negative aborted fetuses with conotruncal defects (CTD), and to explore the impact of postnatal phenotyping on genetic diagnosis. METHODS We sequentially analyzed CNV-seq and WES data from 47 CTD fetuses detected by prenatal ultrasonography. Fetuses with either a confirmed aneuploidy or pathogenic CNV were excluded from the WES analyses, which were performed following the American College of Medical Genetics and Genomics recommendations and a custom CHD-geneset. Imaging and autopsy were applied to obtain postnatal phenotypic information about aborted fetuses. RESULTS CNV-seq identified aneuploidy in 7/47 cases while 13/47 fetuses were CNV-positive. Eighty-five rare deleterious variants in 61 genes (from custom geneset) were identified by WES in the remaining fetuses. Of these, five (likely) pathogenic variants (LPV/PV) were identified in five fetuses, revealing a 10.6% incremental diagnostic yield. Furthermore, RERE:c.2461_2472delGGGATGTGGCGA was reclassified as LPV based on postnatal phenotypic data. CONCLUSION We have developed and defined a CHD gene panel that can be utilized in a subset of fetuses with CTDs. We demonstrate the utility of incorporating both prenatal and postnatal phenotypic information may facilitate WES diagnostics. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Jia-Wei Shi
- Department of Ultrasound, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.,Clinical Research Center for Medical Imaging in Hubei Province, Wuhan, 430022, China.,Hubei Province Key Laboratory of Molecular Imaging, Wuhan, 430022, China
| | - Haiyan Cao
- Department of Ultrasound, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.,Clinical Research Center for Medical Imaging in Hubei Province, Wuhan, 430022, China.,Hubei Province Key Laboratory of Molecular Imaging, Wuhan, 430022, China
| | - Liu Hong
- Department of Ultrasound, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.,Clinical Research Center for Medical Imaging in Hubei Province, Wuhan, 430022, China.,Hubei Province Key Laboratory of Molecular Imaging, Wuhan, 430022, China
| | - Jing Ma
- Department of Ultrasound, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.,Clinical Research Center for Medical Imaging in Hubei Province, Wuhan, 430022, China.,Hubei Province Key Laboratory of Molecular Imaging, Wuhan, 430022, China
| | - Li Cui
- Department of Ultrasound, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.,Clinical Research Center for Medical Imaging in Hubei Province, Wuhan, 430022, China.,Hubei Province Key Laboratory of Molecular Imaging, Wuhan, 430022, China
| | - Yi Zhang
- Department of Ultrasound, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.,Clinical Research Center for Medical Imaging in Hubei Province, Wuhan, 430022, China.,Hubei Province Key Laboratory of Molecular Imaging, Wuhan, 430022, China
| | - Xiaoyan Song
- Department of Ultrasound, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.,Clinical Research Center for Medical Imaging in Hubei Province, Wuhan, 430022, China.,Hubei Province Key Laboratory of Molecular Imaging, Wuhan, 430022, China
| | - Juanjuan Liu
- Department of Ultrasound, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.,Clinical Research Center for Medical Imaging in Hubei Province, Wuhan, 430022, China.,Hubei Province Key Laboratory of Molecular Imaging, Wuhan, 430022, China
| | - Yali Yang
- Department of Ultrasound, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.,Clinical Research Center for Medical Imaging in Hubei Province, Wuhan, 430022, China.,Hubei Province Key Laboratory of Molecular Imaging, Wuhan, 430022, China
| | - Qing Lv
- Department of Ultrasound, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.,Clinical Research Center for Medical Imaging in Hubei Province, Wuhan, 430022, China.,Hubei Province Key Laboratory of Molecular Imaging, Wuhan, 430022, China
| | - Li Zhang
- Department of Ultrasound, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.,Clinical Research Center for Medical Imaging in Hubei Province, Wuhan, 430022, China.,Hubei Province Key Laboratory of Molecular Imaging, Wuhan, 430022, China
| | - Jing Wang
- Department of Ultrasound, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.,Clinical Research Center for Medical Imaging in Hubei Province, Wuhan, 430022, China.,Hubei Province Key Laboratory of Molecular Imaging, Wuhan, 430022, China
| | - Mingxing Xie
- Department of Ultrasound, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430022, China.,Clinical Research Center for Medical Imaging in Hubei Province, Wuhan, 430022, China.,Hubei Province Key Laboratory of Molecular Imaging, Wuhan, 430022, China
| |
Collapse
|