1
|
Groza T, Gration D, Baynam G, Robinson PN. FastHPOCR: pragmatic, fast, and accurate concept recognition using the human phenotype ontology. Bioinformatics 2024; 40:btae406. [PMID: 38913850 PMCID: PMC11227366 DOI: 10.1093/bioinformatics/btae406] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Revised: 05/18/2024] [Accepted: 06/19/2024] [Indexed: 06/26/2024] Open
Abstract
MOTIVATION Human Phenotype Ontology (HPO)-based phenotype concept recognition (CR) underpins a faster and more effective mechanism to create patient phenotype profiles or to document novel phenotype-centred knowledge statements. While the increasing adoption of large language models (LLMs) for natural language understanding has led to several LLM-based solutions, we argue that their intrinsic resource-intensive nature is not suitable for realistic management of the phenotype CR lifecycle. Consequently, we propose to go back to the basics and adopt a dictionary-based approach that enables both an immediate refresh of the ontological concepts as well as efficient re-analysis of past data. RESULTS We developed a dictionary-based approach using a pre-built large collection of clusters of morphologically equivalent tokens-to address lexical variability and a more effective CR step by reducing the entity boundary detection strictly to candidates consisting of tokens belonging to ontology concepts. Our method achieves state-of-the-art results (0.76 F1 on the GSC+ corpus) and a processing efficiency of 10 000 publication abstracts in 5 s. AVAILABILITY AND IMPLEMENTATION FastHPOCR is available as a Python package installable via pip. The source code is available at https://github.com/tudorgroza/fast_hpo_cr. A Java implementation of FastHPOCR will be made available as part of the Fenominal Java library available at https://github.com/monarch-initiative/fenominal. The up-to-date GCS-2024 corpus is available at https://github.com/tudorgroza/code-for-papers/tree/main/gsc-2024.
Collapse
Affiliation(s)
- Tudor Groza
- Rare Care Centre, Perth Children’s Hospital, Nedlands, WA 6009, Australia
- Telethon Kids Institute, Nedlands, WA 6009, Australia
- School of Electrical Engineering, Computing and Mathematical Sciences, Curtin University, Bentley, WA 6102, Australia
- SingHealth Duke-NUS Institute of Precision Medicine, Singapore 169609, Singapore
| | - Dylan Gration
- Western Australian Register of Developmental Anomalies, King Edward Memorial Hospital, Subiaco, WA 6008, Australia
| | - Gareth Baynam
- Rare Care Centre, Perth Children’s Hospital, Nedlands, WA 6009, Australia
- Telethon Kids Institute, Nedlands, WA 6009, Australia
- Western Australian Register of Developmental Anomalies, King Edward Memorial Hospital, Subiaco, WA 6008, Australia
- Faculty of Health and Medical Sciences, University of Western Australia, Crawley, WA 6009, Australia
| | - Peter N Robinson
- Berlin Institute of Health at Charité – Universitätsmedizin Berlin, Charitéplatz 1, 10117 Berlin, Germany
- The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032, United States
| |
Collapse
|
2
|
Nédellec C, Sauvion C, Bossy R, Borovikova M, Deléger L. TaeC: A manually annotated text dataset for trait and phenotype extraction and entity linking in wheat breeding literature. PLoS One 2024; 19:e0305475. [PMID: 38870159 PMCID: PMC11175518 DOI: 10.1371/journal.pone.0305475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 05/31/2024] [Indexed: 06/15/2024] Open
Abstract
Wheat varieties show a large diversity of traits and phenotypes. Linking them to genetic variability is essential for shorter and more efficient wheat breeding programs. A growing number of plant molecular information networks provide interlinked interoperable data to support the discovery of gene-phenotype interactions. A large body of scientific literature and observational data obtained in-field and under controlled conditions document wheat breeding experiments. The cross-referencing of this complementary information is essential. Text from databases and scientific publications has been identified early on as a relevant source of information. However, the wide variety of terms used to refer to traits and phenotype values makes it difficult to find and cross-reference the textual information, e.g. simple dictionary lookup methods miss relevant terms. Corpora with manually annotated examples are thus needed to evaluate and train textual information extraction methods. While several corpora contain annotations of human and animal phenotypes, no corpus is available for plant traits. This hinders the evaluation of text mining-based crop knowledge graphs (e.g. AgroLD, KnetMiner, WheatIS-FAIDARE) and limits the ability to train machine learning methods and improve the quality of information. The Triticum aestivum trait Corpus is a new gold standard for traits and phenotypes of wheat. It consists of 528 PubMed references that are fully annotated by trait, phenotype, and species. We address the interoperability challenge of crossing sparse assay data and publications by using the Wheat Trait and Phenotype Ontology to normalize trait mentions and the species taxonomy of the National Center for Biotechnology Information to normalize species. The paper describes the construction of the corpus. A study of the performance of state-of-the-art language models for both named entity recognition and linking tasks trained on the corpus shows that it is suitable for training and evaluation. This corpus is currently the most comprehensive manually annotated corpus for natural language processing studies on crop phenotype information from the literature.
Collapse
Affiliation(s)
- Claire Nédellec
- Université Paris-Saclay, INRAE, MaIAGE, Jouy-en-Josas, France
| | - Clara Sauvion
- Université Paris-Saclay, INRAE, MaIAGE, Jouy-en-Josas, France
| | - Robert Bossy
- Université Paris-Saclay, INRAE, MaIAGE, Jouy-en-Josas, France
| | - Mariya Borovikova
- Université Paris-Saclay, INRAE, MaIAGE, Jouy-en-Josas, France
- TETIS, Univ. Montpellier, AgroParisTech, CIRAD, CNRS, INRAE, Montpellier, France
| | - Louise Deléger
- Université Paris-Saclay, INRAE, MaIAGE, Jouy-en-Josas, France
| |
Collapse
|
3
|
Ferreira T, Polavarapu K, Olimpio C, Paramonov I, Lochmüller H, Horvath R. Variants in mitochondrial disease genes are common causes of inherited peripheral neuropathies. J Neurol 2024; 271:3546-3553. [PMID: 38549004 PMCID: PMC11136726 DOI: 10.1007/s00415-024-12319-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Revised: 03/08/2024] [Accepted: 03/08/2024] [Indexed: 05/30/2024]
Abstract
BACKGROUND Peripheral neuropathies in mitochondrial disease are caused by mutations in nuclear genes encoding mitochondrial proteins, or in the mitochondrial genome. Whole exome or genome sequencing enable parallel testing of nuclear and mtDNA genes, and it has significantly advanced the genetic diagnosis of inherited diseases. Despite this, approximately 40% of all Charcot-Marie-Tooth (CMT) cases remain undiagnosed. METHODS The genome-phenome analysis platform (GPAP) in RD-Connect was utilised to create a cohort of 2087 patients with at least one Human Phenotype Ontology (HPO) term suggestive of a peripheral neuropathy, from a total of 10,935 patients. These patients' genetic data were then analysed and searched for variants in known mitochondrial disease genes. RESULTS A total of 1,379 rare variants were identified, 44 of which were included in this study as either reported pathogenic or likely causative in 42 patients from 36 families. The most common genes found to be likely causative for an autosomal dominant neuropathy were GDAP1 and GARS1. We also detected heterozygous likely pathogenic variants in DNA2, MFN2, DNM2, PDHA1, SDHA, and UCHL1. Biallelic variants in SACS, SPG7, GDAP1, C12orf65, UCHL1, NDUFS6, ETFDH and DARS2 and variants in the mitochondrial DNA (mtDNA)-encoded MT-ATP6 and MT-TK were also causative for mitochondrial CMT. Only 50% of these variants were already reported as solved in GPAP. CONCLUSION Variants in mitochondrial disease genes are frequent in patients with inherited peripheral neuropathies. Due to the clinical overlap between mitochondrial disease and CMT, agnostic exome or genome sequencing have better diagnostic yields than targeted gene panels.
Collapse
Affiliation(s)
- Tomas Ferreira
- Department of Clinical Neurosciences, John Van Geest Centre for Brain Repair, School of Clinical Medicine, University of Cambridge, Robinson Way, Cambridge, CB2 0PY, UK
| | - Kiran Polavarapu
- Children's Hospital of Eastern Ontario Research Institute, Ottawa, ON, Canada
| | - Catarina Olimpio
- Department of Clinical Neurosciences, John Van Geest Centre for Brain Repair, School of Clinical Medicine, University of Cambridge, Robinson Way, Cambridge, CB2 0PY, UK
- East Anglian Medical Genetics Service, Cambridge University Hospitals NHS Foundation Trust, Cambridge, UK
| | - Ida Paramonov
- Centro Nacional de Análisis Genómico, Barcelona, Spain
| | - Hanns Lochmüller
- Children's Hospital of Eastern Ontario Research Institute, Ottawa, ON, Canada
- Centro Nacional de Análisis Genómico, Barcelona, Spain
- Division of Neurology, Department of Medicine, The Ottawa Hospital, Ottawa, Canada
- Brain and Mind Research Institute, University of Ottawa, Ottawa, Canada
- Department of Neuropediatrics and Muscle Disorders, Faculty of Medicine, Medical Center - University of Freiburg, Freiburg, Germany
| | - Rita Horvath
- Department of Clinical Neurosciences, John Van Geest Centre for Brain Repair, School of Clinical Medicine, University of Cambridge, Robinson Way, Cambridge, CB2 0PY, UK.
| |
Collapse
|
4
|
Thompson MD, Knaus A. Rare Genetic Developmental Disabilities: Mabry Syndrome (MIM 239300) Index Cases and Glycophosphatidylinositol (GPI) Disorders. Genes (Basel) 2024; 15:619. [PMID: 38790248 PMCID: PMC11121671 DOI: 10.3390/genes15050619] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Revised: 04/08/2024] [Accepted: 04/09/2024] [Indexed: 05/26/2024] Open
Abstract
The case report by Mabry et al. (1970) of a family with four children with elevated tissue non-specific alkaline phosphatase, seizures and profound developmental disability, became the basis for phenotyping children with the features that became known as Mabry syndrome. Aside from improvements in the services available to patients and families, however, the diagnosis and treatment of this, and many other developmental disabilities, did not change significantly until the advent of massively parallel sequencing. As more patients with features of the Mabry syndrome were identified, exome and genome sequencing were used to identify the glycophosphatidylinositol (GPI) biosynthesis disorders (GPIBDs) as a group of congenital disorders of glycosylation (CDG). Biallelic variants of the phosphatidylinositol glycan (PIG) biosynthesis, type V (PIGV) gene identified in Mabry syndrome became evidence of the first in a phenotypic series that is numbered HPMRS1-6 in the order of discovery. HPMRS1 [MIM: 239300] is the phenotype resulting from inheritance of biallelic PIGV variants. Similarly, HPMRS2 (MIM 614749), HPMRS5 (MIM 616025) and HPMRS6 (MIM 616809) result from disruption of the PIGO, PIGW and PIGY genes expressed in the endoplasmic reticulum. By contrast, HPMRS3 (MIM 614207) and HPMRS4 (MIM 615716) result from disruption of post attachment to proteins PGAP2 (HPMRS3) and PGAP3 (HPMRS4). The GPI biosynthesis disorders (GPIBDs) are currently numbered GPIBD1-21. Working with Dr. Mabry, in 2020, we were able to use improved laboratory diagnostics to complete the molecular diagnosis of patients he had originally described in 1970. We identified biallelic variants of the PGAP2 gene in the first reported HPMRS patients. We discuss the longevity of the Mabry syndrome index patients in the context of the utility of pyridoxine treatment of seizures and evidence for putative glycolipid storage in patients with HPMRS3. From the perspective of the laboratory innovations made that enabled the identification of the HPMRS phenotype in Dr. Mabry's patients, the need for treatment innovations that will benefit patients and families affected by developmental disabilities is clear.
Collapse
Affiliation(s)
- Miles D. Thompson
- Krembil Brain Institute, Toronto Western Hospital, 399 Bathurst Street, Toronto, ON M5T 2S8, Canada
| | - Alexej Knaus
- Institute for Genomic Statistics and Bioinformatics, University Hospital Bonn, University of Bonn, 53127 Bonn, Germany;
| |
Collapse
|
5
|
Althagafi A, Zhapa-Camacho F, Hoehndorf R. Prioritizing genomic variants through neuro-symbolic, knowledge-enhanced learning. Bioinformatics 2024; 40:btae301. [PMID: 38696757 PMCID: PMC11132820 DOI: 10.1093/bioinformatics/btae301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Revised: 04/05/2024] [Accepted: 04/30/2024] [Indexed: 05/04/2024] Open
Abstract
MOTIVATION Whole-exome and genome sequencing have become common tools in diagnosing patients with rare diseases. Despite their success, this approach leaves many patients undiagnosed. A common argument is that more disease variants still await discovery, or the novelty of disease phenotypes results from a combination of variants in multiple disease-related genes. Interpreting the phenotypic consequences of genomic variants relies on information about gene functions, gene expression, physiology, and other genomic features. Phenotype-based methods to identify variants involved in genetic diseases combine molecular features with prior knowledge about the phenotypic consequences of altering gene functions. While phenotype-based methods have been successfully applied to prioritizing variants, such methods are based on known gene-disease or gene-phenotype associations as training data and are applicable to genes that have phenotypes associated, thereby limiting their scope. In addition, phenotypes are not assigned uniformly by different clinicians, and phenotype-based methods need to account for this variability. RESULTS We developed an Embedding-based Phenotype Variant Predictor (EmbedPVP), a computational method to prioritize variants involved in genetic diseases by combining genomic information and clinical phenotypes. EmbedPVP leverages a large amount of background knowledge from human and model organisms about molecular mechanisms through which abnormal phenotypes may arise. Specifically, EmbedPVP incorporates phenotypes linked to genes, functions of gene products, and the anatomical site of gene expression, and systematically relates them to their phenotypic effects through neuro-symbolic, knowledge-enhanced machine learning. We demonstrate EmbedPVP's efficacy on a large set of synthetic genomes and genomes matched with clinical information. AVAILABILITY AND IMPLEMENTATION EmbedPVP and all evaluation experiments are freely available at https://github.com/bio-ontology-research-group/EmbedPVP.
Collapse
Affiliation(s)
- Azza Althagafi
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), 4700 KAUST, Thuwal 23955, Saudi Arabia
- Computer Science Program, Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST), 4700 KAUST, Thuwal 23955, Saudi Arabia
- Computer Science Department, College of Computers and Information Technology, Taif University, Taif 26571, Saudi Arabia
| | - Fernando Zhapa-Camacho
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), 4700 KAUST, Thuwal 23955, Saudi Arabia
- Computer Science Program, Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST), 4700 KAUST, Thuwal 23955, Saudi Arabia
| | - Robert Hoehndorf
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), 4700 KAUST, Thuwal 23955, Saudi Arabia
- Computer Science Program, Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST), 4700 KAUST, Thuwal 23955, Saudi Arabia
- SDAIA-KAUST Center of Excellence in Data Science and Artificial Intelligence, King Abdullah University of Science and Technology (KAUST), 4700 KAUST, Thuwal 23955, Saudi Arabia
| |
Collapse
|
6
|
Novoa J, Fernandez-Dumont A, Mills ENC, Moreno FJ, Pazos F. Advancing the allergenicity assessment of new proteins using a text mining resource. Food Chem Toxicol 2024; 187:114638. [PMID: 38582341 DOI: 10.1016/j.fct.2024.114638] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2024] [Revised: 03/11/2024] [Accepted: 03/31/2024] [Indexed: 04/08/2024]
Abstract
With a society increasingly demanding alternative protein food sources, new strategies for evaluating protein safety issues, such as allergenic potential, are needed. Large-scale and systemic studies on allergenic proteins are hindered by the limited and non-harmonized clinical information available for these substances in dedicated databases. A missing key information is that representing the symptomatology of the allergens, especially given in terms of standard vocabularies, that would allow connecting with other biomedical resources to carry out different studies related to human health. In this work, we have generated the first resource with a comprehensive annotation of allergens' symptomatology, using a text-mining approach that extracts significant co-mentions between these entities from the scientific literature (PubMed, ∼36 million abstracts). The method identifies statistically significant co-mentions between the textual descriptions of the two types of entities in the literature as indication of relationship. 1,180 clinical signs extracted from the Human Phenotype Ontology, the Medical Subject Heading terms of PubMed together with other allergen-specific symptoms, were linked to 1,036 unique allergens annotated in two main allergen-related public databases via 14,009 relationships. This novel resource, publicly available through an interactive web interface, could serve as a starting point for future manually curated compilation of allergen symptomatology.
Collapse
Affiliation(s)
- Jorge Novoa
- Computational Systems Biology Group, National Centre for Biotechnology (CNB-CSIC), 28049, Madrid, Spain
| | | | - E N Clare Mills
- School of Biosciences and Medicine, The University of Surrey, Guildford, GU2 7XH, UK
| | - F Javier Moreno
- Instituto de Investigación en Ciencias de La Alimentación (CIAL), CSIC-UAM, CEI (UAM+CSIC), 28049, Madrid, Spain.
| | - Florencio Pazos
- Computational Systems Biology Group, National Centre for Biotechnology (CNB-CSIC), 28049, Madrid, Spain.
| |
Collapse
|
7
|
Simunovic MP, Mammo Z. Mechanisms of cone sensitivity loss in retinitis pigmentosa. Ophthalmic Physiol Opt 2024; 44:605-612. [PMID: 38351866 DOI: 10.1111/opo.13280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 01/10/2024] [Accepted: 01/14/2024] [Indexed: 04/08/2024]
Abstract
PURPOSE To explore the mechanisms of cone sensitivity loss in retinitis pigmentosa by combining two-colour perimetry with threshold versus intensity (tvi) testing. METHODS Seven subjects with autosomal recessive retinitis pigmentosa and 10 normal subjects were recruited and underwent perimetric testing of one eye using 480- and 640-nm Goldman size V targets presented under scotopic conditions (no background illumination) and against a white background ranging in luminance from -1.5 to 2 log cd m-2 in 0.5 log cd m-2 steps. Data were fitted with tvi functions of the form logT = logT0 + log ((A + A0)/A0)n, where T is the threshold, T0 is the absolute threshold, A is the background intensity, A0 is the 'dark-light' constant and n is a gain constant. RESULTS Reliable tvi functions could not be obtained within the region of the visual field corresponding to loss of the ellipsoid zone on optical coherence tomography. At fixation, changes in both T0 and A0 were observed, consistent with a d1 mechanism loss, which resulted in an upwards and rightwards shift of the tvi function. Losses at [±3°, ±3°] demonstrated changes in T0, consistent with a d3 mechanism loss, resulting in an upwards translation of the tvi curve. CONCLUSIONS Although the absolute cone threshold was elevated at each location, shifts in the tvi function (so-called d1 mechanism loss) at fixation minimise threshold elevation in the presence of white adapting backgrounds, such as those typically employed in standard two-colour perimetry. At more peripheral testing locations, changes in threshold occurred independent of background luminance (so-called d3 mechanism loss). These findings suggest that backgrounds which selectively adapt rods while maintaining cones at, or near, absolute threshold may be preferable to conventional two-colour perimetry for assessing loss of cone sensitivity, especially at the point of fixation.
Collapse
Affiliation(s)
- Matthew P Simunovic
- Save Sight Institute, University of Sydney, Sydney, New South Wales, Australia
- Retinal Unit, Sydney Eye Hospital, Sydney, New South Wales, Australia
| | - Zaid Mammo
- Save Sight Institute, University of Sydney, Sydney, New South Wales, Australia
- Retinal Unit, Sydney Eye Hospital, Sydney, New South Wales, Australia
| |
Collapse
|
8
|
Badawy MT, Salama AA, Salama M. Novel Variants Linked to the Prodromal Stage of Parkinson's Disease (PD) Patients. Diagnostics (Basel) 2024; 14:929. [PMID: 38732343 PMCID: PMC11083733 DOI: 10.3390/diagnostics14090929] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Revised: 04/24/2024] [Accepted: 04/25/2024] [Indexed: 05/13/2024] Open
Abstract
BACKGROUND AND OBJECTIVE The symptoms of most neurodegenerative diseases, including Parkinson's disease (PD), usually do not occur until substantial neuronal loss occurs. This makes the process of early diagnosis very challenging. Hence, this research used variant call format (VCF) analysis to detect variants and novel genes that could be used as prognostic indicators in the early diagnosis of prodromal PD. MATERIALS AND METHODS Data were obtained from the Parkinson's Progression Markers Initiative (PPMI), and we analyzed prodromal patients with gVCF data collected in the 2021 cohort. A total of 304 participants were included, including 100 healthy controls, 146 prodromal genetic individuals, 21 prodromal hyposmia individuals, and 37 prodromal individuals with RBD. A pipeline was developed to process the samples from gVCF to reach variant annotation and pathway and disease association analysis. RESULTS Novel variant percentages were detected in the analyzed prodromal subgroups. The prodromal subgroup analysis revealed novel variations of 1.0%, 1.2%, 0.6%, 0.3%, 0.5%, and 0.4% for the genetic male, genetic female, hyposmia male, hyposmia female, RBD male, and RBD female groups, respectively. Interestingly, 12 potentially novel loci (MTF2, PIK3CA, ADD1, SYBU, IRS2, USP8, PIGL, FASN, MYLK2, USP25, EP300, and PPP6R2) that were recently detected in PD patients were detected in the prodromal stage of PD. CONCLUSIONS Genetic biomarkers are crucial for the early detection of Parkinson's disease and its prodromal stage. The novel PD genes detected in prodromal patients could aid in the use of gene biomarkers for early diagnosis of the prodromal stage without relying only on phenotypic traits.
Collapse
Affiliation(s)
- Marwa T. Badawy
- Biology Department, School of Sciences and Engineering, The American University in Cairo, New Cairo 11835, Egypt;
| | - Aya A. Salama
- Applied Science, Windows and Web Experience, Microsoft, Cairo 11561, Egypt;
| | - Mohamed Salama
- Institute of Global Health and Human Ecology (IGHHE), The American University in Cairo, New Cairo 11835, Egypt
- Toxicology Department, Faculty of Medicine, Mansoura University, Mansoura 35516, Egypt
- Global Brain Health Institute, Trinity College Dublin, D02 X9W9 Dublin, Ireland
| |
Collapse
|
9
|
Chen J, Landback P, Arsala D, Guzzetta A, Xia S, Atlas J, Sosa D, Zhang YE, Cheng J, Shen B, Long M. Evolutionarily new genes in humans with disease phenotypes reveal functional enrichment patterns shaped by adaptive innovation and sexual selection. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.14.567139. [PMID: 38045239 PMCID: PMC10690195 DOI: 10.1101/2023.11.14.567139] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
New genes (or young genes) are genetic novelties pivotal in mammalian evolution. Their phenotypic impacts and evolutionary pattern over time, however, remain elusive in humans due to the technical and ethical complexities in functional studies. By combining human gene age dating and Mendelian disease phenotyping, our research reveals a gradual increase in disease gene proportions with gene age. Logistic regression modeling indicates that this increase could be related to longer protein lengths and higher burdens of deleterious de novo germline variants (DNVs) for older genes. We also find a steady integration of new genes with biomedical phenotypes into the human genome over macroevolutionary timescales (~0.07% per million years). Despite this stable pace, we observe distinct patterns in phenotypic enrichment, pleiotropy, and selective pressures across gene ages. Notably, young genes show significant enrichment in diseases related to the male reproductive system, indicating strong sexual selection. Young genes also exhibit disease-related functions in tissues and systems potentially linked to human phenotypic innovations, such as increased brain size, musculoskeletal phenotypes, and color vision. We further reveal a logistic growth pattern of pleiotropy over evolutionary time, indicating a diminishing marginal growth of new functions for older genes due to intensifying selective constraints over time. We propose a "pleiotropy-barrier" model that delineates higher potentials of phenotypic innovation for young genes than for older genes, a process subject to natural selection. Our study demonstrates that evolutionary new genes are critical in influencing human reproductive evolution and adaptive phenotypic innovations driven by sexual and natural selection, with low pleiotropy as a selective advantage.
Collapse
Affiliation(s)
- Jianhai Chen
- Department of Ecology and Evolution, The University of Chicago, 1101 E 57th Street, Chicago, IL 60637
- Institutes for Systems Genetics, West China University Hospital, Chengdu 610041, China
| | - Patrick Landback
- Department of Ecology and Evolution, The University of Chicago, 1101 E 57th Street, Chicago, IL 60637
| | - Deanna Arsala
- Department of Ecology and Evolution, The University of Chicago, 1101 E 57th Street, Chicago, IL 60637
| | - Alexander Guzzetta
- Department of Pathology, The University of Chicago, 1101 E 57th Street, Chicago, IL 60637
| | - Shengqian Xia
- Department of Ecology and Evolution, The University of Chicago, 1101 E 57th Street, Chicago, IL 60637
| | - Jared Atlas
- Department of Ecology and Evolution, The University of Chicago, 1101 E 57th Street, Chicago, IL 60637
| | - Dylan Sosa
- Department of Ecology and Evolution, The University of Chicago, 1101 E 57th Street, Chicago, IL 60637
| | - Yong E. Zhang
- Key Laboratory of Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beijing 100101, China
| | - Jingqiu Cheng
- Institutes for Systems Genetics, West China University Hospital, Chengdu 610041, China
| | - Bairong Shen
- Institutes for Systems Genetics, West China University Hospital, Chengdu 610041, China
| | - Manyuan Long
- Department of Ecology and Evolution, The University of Chicago, 1101 E 57th Street, Chicago, IL 60637
| |
Collapse
|
10
|
Yao X, Ouyang S, Lian Y, Peng Q, Zhou X, Huang F, Hu X, Shi F, Xia J. PheSeq, a Bayesian deep learning model to enhance and interpret the gene-disease association studies. Genome Med 2024; 16:56. [PMID: 38627848 PMCID: PMC11020195 DOI: 10.1186/s13073-024-01330-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Accepted: 04/02/2024] [Indexed: 04/19/2024] Open
Abstract
Despite the abundance of genotype-phenotype association studies, the resulting association outcomes often lack robustness and interpretations. To address these challenges, we introduce PheSeq, a Bayesian deep learning model that enhances and interprets association studies through the integration and perception of phenotype descriptions. By implementing the PheSeq model in three case studies on Alzheimer's disease, breast cancer, and lung cancer, we identify 1024 priority genes for Alzheimer's disease and 818 and 566 genes for breast cancer and lung cancer, respectively. Benefiting from data fusion, these findings represent moderate positive rates, high recall rates, and interpretation in gene-disease association studies.
Collapse
Affiliation(s)
- Xinzhi Yao
- College of Informatics, Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
| | - Sizhuo Ouyang
- College of Informatics, Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
| | - Yulong Lian
- College of Science, Huazhong Agricultural University, Wuhan, China
| | - Qianqian Peng
- College of Informatics, Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
| | - Xionghui Zhou
- College of Informatics, Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
| | - Feier Huang
- College of Life Science and Technology, Huazhong Agricultural University, Wuhan, China
| | - Xuehai Hu
- College of Informatics, Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China
| | - Feng Shi
- College of Science, Huazhong Agricultural University, Wuhan, China
| | - Jingbo Xia
- College of Informatics, Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China.
- Hubei Key Laboratory of Agricultural Bioinformatics, Huazhong Agricultural University, Wuhan, China.
| |
Collapse
|
11
|
von Hardenberg S, Klefenz I, Steinemann D, Di Donato N, Baumann U, Auber B, Klemann C. Current genetic diagnostics in inborn errors of immunity. Front Pediatr 2024; 12:1279112. [PMID: 38659694 PMCID: PMC11039790 DOI: 10.3389/fped.2024.1279112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 03/28/2024] [Indexed: 04/26/2024] Open
Abstract
New technologies in genetic diagnostics have revolutionized the understanding and management of rare diseases. This review highlights the significant advances and latest developments in genetic diagnostics in inborn errors of immunity (IEI), which encompass a diverse group of disorders characterized by defects in the immune system, leading to increased susceptibility to infections, autoimmunity, autoinflammatory diseases, allergies, and malignancies. Various diagnostic approaches, including targeted gene sequencing panels, whole exome sequencing, whole genome sequencing, RNA sequencing, or proteomics, have enabled the identification of causative genetic variants of rare diseases. These technologies not only facilitated the accurate diagnosis of IEI but also provided valuable insights into the underlying molecular mechanisms. Emerging technologies, currently mainly used in research, such as optical genome mapping, single cell sequencing or the application of artificial intelligence will allow even more insights in the aetiology of hereditary immune defects in the near future. The integration of genetic diagnostics into clinical practice significantly impacts patient care. Genetic testing enables early diagnosis, facilitating timely interventions and personalized treatment strategies. Additionally, establishing a genetic diagnosis is necessary for genetic counselling and prognostic assessments. Identifying specific genetic variants associated with inborn errors of immunity also paved the way for the development of targeted therapies and novel therapeutic approaches. This review emphasizes the challenges related with genetic diagnosis of rare diseases and provides future directions, specifically focusing on IEI. Despite the tremendous progress achieved over the last years, several obstacles remain or have become even more important due to the increasing amount of genetic data produced for each patient. This includes, first and foremost, the interpretation of variants of unknown significance (VUS) in known IEI genes and of variants in genes of unknown significance (GUS). Although genetic diagnostics have significantly contributed to the understanding and management of IEI and other rare diseases, further research, exchange between experts from different clinical disciplines, data integration and the establishment of comprehensive guidelines are crucial to tackle the remaining challenges and maximize the potential of genetic diagnostics in the field of rare diseases, such as IEI.
Collapse
Affiliation(s)
| | - Isabel Klefenz
- Department of Human Genetics, Hannover Medical School, Hannover, Germany
| | - Doris Steinemann
- Department of Human Genetics, Hannover Medical School, Hannover, Germany
| | - Nataliya Di Donato
- Department of Human Genetics, Hannover Medical School, Hannover, Germany
| | - Ulrich Baumann
- Department of Pediatric Pneumology, Allergology and Neonatology, Hannover Medical School, Hannover, Germany
| | - Bernd Auber
- Department of Human Genetics, Hannover Medical School, Hannover, Germany
| | - Christian Klemann
- Department of Human Genetics, Hannover Medical School, Hannover, Germany
- Department of Pediatric Immunology, Rheumatology and Infectiology, Hospital for Children and Adolescents, University of Leipzig, Leipzig, Germany
| |
Collapse
|
12
|
Atalaia A, Wandrei D, Lalout N, Thompson R, Tassoni A, 't Hoen PAC, Athanasiou D, Baker SA, Sakellariou P, Paliouras G, D'Angelo C, Horvath R, Mancuso M, van der Beek N, Kornblum C, Kirschner J, Pareyson D, Bassez G, Blacas L, Jacoupy M, Eng C, Lamy F, Plançon JP, Haberlova J, Brusse E, Hoeijmakers JGJ, de Visser M, Claeys KG, Paradas C, Toscano A, Silani V, Gyenge M, Reviers E, Hamroun D, Vroom E, Wilkinson MD, Lochmuller H, Evangelista T. EURO-NMD registry: federated FAIR infrastructure, innovative technologies and concepts of a patient-centred registry for rare neuromuscular disorders. Orphanet J Rare Dis 2024; 19:66. [PMID: 38355534 PMCID: PMC10865673 DOI: 10.1186/s13023-024-03059-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Accepted: 02/03/2024] [Indexed: 02/16/2024] Open
Abstract
BACKGROUND The EURO-NMD Registry collects data from all neuromuscular patients seen at EURO-NMD's expert centres. In-kind contributions from three patient organisations have ensured that the registry is patient-centred, meaningful, and impactful. The consenting process covers other uses, such as research, cohort finding and trial readiness. RESULTS The registry has three-layered datasets, with European Commission-mandated data elements (EU-CDEs), a set of cross-neuromuscular data elements (NMD-CDEs) and a dataset of disease-specific data elements that function modularly (DS-DEs). The registry captures clinical, neuromuscular imaging, neuromuscular histopathology, biological and genetic data and patient-reported outcomes in a computer-interpretable format using selected ontologies and classifications. The EURO-NMD registry is connected to the EURO-NMD Registry Hub through an interoperability layer. The Hub provides an entry point to other neuromuscular registries that follow the FAIR data stewardship principles and enable GDPR-compliant information exchange. Four national or disease-specific patient registries are interoperable with the EURO-NMD Registry, allowing for federated analysis across these different resources. CONCLUSIONS Collectively, the Registry Hub brings together data that are currently siloed and fragmented to improve healthcare and advance research for neuromuscular diseases.
Collapse
Affiliation(s)
- Antonio Atalaia
- Inserm Center of Research in Myology, Neuro-Myology Service G.H. Pitié-Salpêtrière, Sorbonne Université, Paris, France.
| | - Dagmar Wandrei
- Clinical Trials Unit, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Nawel Lalout
- Medical BioSciences Department, Radboud University Medical Center, Nijmegen, Netherlands
- Duchenne Parent Project, Veenendaal, The Netherlands
| | - Rachel Thompson
- Children's Hospital of Eastern Ontario Research Institute, Ottawa, Canada
| | - Adrian Tassoni
- Clinical Trials Unit, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany
| | - Peter A C 't Hoen
- Medical BioSciences Department, Radboud University Medical Center, Nijmegen, Netherlands
| | | | | | | | | | - Carla D'Angelo
- European Reference Network for Rare Neuromuscular Diseases EURO-NMD, Institute of Myology, University Hospital Pitie-Salpetriere-APHP, Paris, France
| | - Rita Horvath
- Department of Clinical Neurosciences, University of Cambridge, Cambridge, UK
| | - Michelangelo Mancuso
- Department of Clinical and Experimental Medicine, Neurological Institute, University of Pisa, Pisa, Italy
| | - Nadine van der Beek
- Department of Neurology/Center for Lysosomal and Metabolic Diseases, Erasmus MC University Medical Center, Rotterdam, The Netherlands
| | - Cornelia Kornblum
- Department of Neurology, Neuromuscular Diseases Section, University Hospital Bonn, Bonn, Germany
| | - Janbernd Kirschner
- Department of Neuropediatrics and Muscle Disorders, Faculty of Medicine, Medical Center - University of Freiburg, Freiburg, Germany
| | - Davide Pareyson
- Unit of Rare Neurological Diseases. Department of Clinical Neurosciences, Fondazione IRCCS Istituto Neurologico Carlo Besta, Milan, Italy
| | - Guillaume Bassez
- Neuromuscular Diseases Reference Center, Pitié-Salpêtrière University Hospital, APHP Paris, Paris, France
| | - Laura Blacas
- Association Institute of Myology, Hôpital Pitié-Salpêtrière, Paris, France
| | - Maxime Jacoupy
- Association Institute of Myology, Hôpital Pitié-Salpêtrière, Paris, France
| | - Catherine Eng
- Association Française Contre Les Myopathies, AFM-Téléthon, Evry, France
| | - François Lamy
- Association Française Contre Les Myopathies, AFM-Téléthon, Evry, France
| | - Jean-Philippe Plançon
- European Patient Organisation for Dysimmune and Inflammatory Neuropathies, Paris, France
| | - Jana Haberlova
- Neuromuscular Center, University Hospital Motol, Prague, Czech Republic
| | - Esther Brusse
- Department of Neurology/Center for Lysosomal and Metabolic Diseases, Erasmus MC University Medical Center, Rotterdam, The Netherlands
| | - Janneke G J Hoeijmakers
- Department of Neurology, Maastricht University Medical Center+, and MHeNS, School for Mental Health and Neuroscience, Maastricht University, Maastricht, The Netherlands
| | - Marianne de Visser
- Department of Neurology, Amsterdam University Medical Center, Location Academic Medical Center, Amsterdam, The Netherlands
| | - Kristl G Claeys
- Department of Neurology, University Hospitals Leuven, and Laboratory for Muscle Diseases and Neuropathies, Department of Neurosciences, KU Leuven, and Leuven Brain Institute (LBI), Louvain, Belgium
| | - Carmen Paradas
- Hospital Universitario Virgen del Rocío/IBiS, Avda Manuel Siurot S/N, 41013, Seville, Andalucía, Spain
| | - Antonio Toscano
- Department of Clinical and Experimental Medicine, AOU G. Martino Di Messina, University of Messina, Messina, Italy
| | - Vincenzo Silani
- Department of Neurology and Laboratory of Neuroscience, IRCCS Istituto Auxologico Italiano, Milan, Italy
| | - Melinda Gyenge
- Neuromuscular Diseases Reference Center, Pitié-Salpêtrière University Hospital, APHP Paris, Paris, France
| | | | - Dalil Hamroun
- CHRU de Montpellier, Direction de la Recherche et de L'Innovation, Hôpital La Colombière, Montpellier, France
| | | | - Mark D Wilkinson
- Departamento de Biotecnología-Biología Vegetal, Escuela Técnica Superior de Ingeniería Agronómica, Alimentaria y de Biosistemas, Centro de Biotecnología y Genómica de Plantas UPM-INIA, Universidad Politécnica de Madrid (UPM), Instituto Nacional de Investigación y Tecnología Agraria y Alimentaria (INIA/CSIC), 28223, Madrid, ES, Spain
| | - Hanns Lochmuller
- Children's Hospital of Eastern Ontario Research Institute, Ottawa, Canada
- Department of Neuropediatrics and Muscle Disorders, Faculty of Medicine, Medical Center - University of Freiburg, Freiburg, Germany
| | - Teresinha Evangelista
- Neuromuscular Pathology Functional Unit; Neuropathology Service, Institute of Myology, University Hospital Pitié-Salpêtrière-APHP, Paris, France
| |
Collapse
|
13
|
Kilicoglu H, Ensan F, McInnes B, Wang LL. Semantics-enabled biomedical literature analytics. J Biomed Inform 2024; 150:104588. [PMID: 38244957 DOI: 10.1016/j.jbi.2024.104588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 01/10/2024] [Indexed: 01/22/2024]
Affiliation(s)
- Halil Kilicoglu
- School of Information Sciences, University of Illinois Urbana Champaign, Champaign, IL, USA.
| | - Faezeh Ensan
- Department of Electrical, Computer, and Biomedical Engineering, Toronto Metropolitan University, Toronto, ON, Canada.
| | - Bridget McInnes
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.
| | - Lucy Lu Wang
- Information School, University of Washington, Seattle, WA, USA.
| |
Collapse
|
14
|
Kwak SH, Srinivasan S, Chen L, Todd J, Mercader JM, Jensen ET, Divers J, Mottl AK, Pihoker C, Gandica RG, Laffel LM, Isganaitis E, Haymond MW, Levitsky LL, Pollin TI, Florez JC, Flannick J. Genetic architecture and biology of youth-onset type 2 diabetes. Nat Metab 2024; 6:226-237. [PMID: 38278947 PMCID: PMC10896722 DOI: 10.1038/s42255-023-00970-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Accepted: 12/20/2023] [Indexed: 01/28/2024]
Abstract
The prevalence of youth-onset type 2 diabetes (T2D) and childhood obesity has been rising steadily1, producing a growing public health concern1 that disproportionately affects minority groups2. The genetic basis of youth-onset T2D and its relationship to other forms of diabetes are unclear3. Here we report a detailed genetic characterization of youth-onset T2D by analysing exome sequences and common variant associations for 3,005 individuals with youth-onset T2D and 9,777 adult control participants matched for ancestry, including both males and females. We identify monogenic diabetes variants in 2.4% of individuals and three exome-wide significant (P < 2.6 × 10-6) gene-level associations (HNF1A, MC4R, ATXN2L). Furthermore, we report rare variant association enrichments within 25 gene sets related to obesity, monogenic diabetes and β-cell function. Many youth-onset T2D associations are shared with adult-onset T2D, but genetic risk factors of all frequencies-and rare variants in particular-are enriched within youth-onset T2D cases (5.0-fold increase in the rare variant and 3.4-fold increase in common variant genetic liability relative to adult-onset cases). The clinical presentation of participants with youth-onset T2D is influenced in part by the frequency of genetic risk factors within each individual. These findings portray youth-onset T2D as a heterogeneous disease situated on a spectrum between monogenic diabetes and adult-onset T2D.
Collapse
Affiliation(s)
- Soo Heon Kwak
- Department of Internal Medicine, Seoul National University Hospital, Seoul National University College of Medicine, Seoul, Republic of Korea
- Programs in Metabolism and Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Shylaja Srinivasan
- Division of Pediatric Endocrinology, University of California at San Francisco, San Francisco, CA, USA
| | - Ling Chen
- Center for Genomic Medicine, and Diabetes Unit, Endocrine Division, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
| | - Jennifer Todd
- Division of Pediatric Endocrinology, Department of Pediatrics, University of Vermont, Burlington, VT, USA
| | - Josep M Mercader
- Programs in Metabolism and Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, and Diabetes Unit, Endocrine Division, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Elizabeth T Jensen
- Department of Epidemiology and Prevention, Wake Forest School of Medicine, Winston-Salem, NC, USA
| | - Jasmin Divers
- Department of Foundations of Medicine, NYU Langone Health, New York, NY, USA
| | - Amy K Mottl
- Division of Nephrology and Hypertension, University of North Carolina, Chapel Hill, NC, USA
| | - Catherine Pihoker
- Department of Pediatrics, University of Washington, Seattle, WA, USA
| | - Rachelle G Gandica
- Naomi Berrie Diabetes Center, Columbia University Irving Medical Center, New York, NY, USA
| | - Lori M Laffel
- Joslin Diabetes Center, Harvard Medical School, Boston, MA, USA
| | | | - Morey W Haymond
- Department of Pediatrics, Children's Nutrition Research Center, Baylor College of Medicine, Houston, TX, USA
| | - Lynne L Levitsky
- Pediatric Endocrine Division, Department of Pediatrics, Massachusetts General Hospital for Children and Harvard Medical School, Boston, MA, USA
| | - Toni I Pollin
- Division of Endocrinology, Diabetes, and Nutrition, Department of Medicine, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Jose C Florez
- Programs in Metabolism and Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Center for Genomic Medicine, and Diabetes Unit, Endocrine Division, Department of Medicine, Massachusetts General Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Jason Flannick
- Programs in Metabolism and Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
- Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA, USA.
- Department of Pediatrics, Boston Children's Hospital, Boston, MA, USA.
| |
Collapse
|
15
|
Mohsin N, Hunt D, Yan J, Jabbour AJ, Nghiem P, Choi J, Zhang Y, Freeman AF, Bergerson JRE, Dell’Orso S, Lachance K, Kulikauskas R, Collado L, Cao W, Lack J, Similuk M, Seifert BA, Ghosh R, Walkiewicz MA, Brownell I. Genetic Risk Factors for Early-Onset Merkel Cell Carcinoma. JAMA Dermatol 2024; 160:172-178. [PMID: 38170500 PMCID: PMC10765310 DOI: 10.1001/jamadermatol.2023.5362] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2023] [Accepted: 11/03/2023] [Indexed: 01/05/2024]
Abstract
Importance Merkel cell carcinoma (MCC) is a rare, aggressive neuroendocrine skin cancer. Of the patients who develop MCC annually, only 4% are younger than 50 years. Objective To identify genetic risk factors for early-onset MCC via genomic sequencing. Design, Setting, and Participants The study represents a multicenter collaboration between the National Institute of Arthritis and Musculoskeletal and Skin Diseases (NIAMS), the National Institute of Allergy and Infectious Diseases (NIAID), and the University of Washington. Participants with early-onset and later-onset MCC were prospectively enrolled in an institutional review board-approved study at the University of Washington between January 2003 and May 2019. Unrelated controls were enrolled in the NIAID Centralized Sequencing Program (CSP) between September 2017 and September 2021. Analysis was performed from September 2021 and March 2023. Early-onset MCC was defined as disease occurrence in individuals younger than 50 years. Later-onset MCC was defined as disease occurrence at age 50 years or older. Unrelated controls were evaluated by the NIAID CSP for reasons other than familial cancer syndromes, including immunological, neurological, and psychiatric disorders. Results This case-control analysis included 1012 participants: 37 with early-onset MCC, 45 with later-onset MCC, and 930 unrelated controls. Among 37 patients with early-onset MCC, 7 (19%) had well-described variants in genes associated with cancer predisposition. Six patients had variants associated with hereditary cancer syndromes (ATM = 2, BRCA1 = 2, BRCA2 = 1, and TP53 = 1) and 1 patient had a variant associated with immunodeficiency and lymphoma (MAGT1). Compared with 930 unrelated controls, the early-onset MCC cohort was significantly enriched for cancer-predisposing pathogenic or likely pathogenic variants in these 5 genes (odds ratio, 30.35; 95% CI, 8.89-106.30; P < .001). No germline disease variants in these genes were identified in 45 patients with later-onset MCC. Additional variants in DNA repair genes were also identified among patients with MCC. Conclusions and Relevance Because variants in certain DNA repair and cancer predisposition genes are associated with early-onset MCC, genetic counseling and testing should be considered for patients presenting at younger than 50 years.
Collapse
Affiliation(s)
- Noreen Mohsin
- Dermatology Branch, National Institute of Arthritis and Musculoskeletal and Skin Diseases (NIAMS), National Institutes of Health (NIH), Bethesda, Maryland
| | - Devin Hunt
- Division of Intramural Research, National Institute of Allergy and Infectious Diseases (NIAID), NIH, Bethesda, Maryland
| | - Jia Yan
- Division of Intramural Research, National Institute of Allergy and Infectious Diseases (NIAID), NIH, Bethesda, Maryland
| | | | - Paul Nghiem
- Division of Dermatology, University of Washington, Seattle
| | - Jaehyuk Choi
- Northwestern University Department of Dermatology and Department of Biochemistry and Molecular Genetics, Chicago, Illinois
| | - Yue Zhang
- Northwestern University Department of Dermatology and Department of Biochemistry and Molecular Genetics, Chicago, Illinois
| | - Alexandra F. Freeman
- Laboratory of Clinical Immunology and Microbiology, NIAID, NIH, Bethesda, Maryland
| | | | | | | | | | - Loren Collado
- Dermatology Branch, National Institute of Arthritis and Musculoskeletal and Skin Diseases (NIAMS), National Institutes of Health (NIH), Bethesda, Maryland
| | - Wenjia Cao
- Collaborative Bioinformatics Resource, NIAID, NIH, Bethesda, Maryland
| | - Justin Lack
- Collaborative Bioinformatics Resource, NIAID, NIH, Bethesda, Maryland
| | - Morgan Similuk
- Division of Intramural Research, National Institute of Allergy and Infectious Diseases (NIAID), NIH, Bethesda, Maryland
| | - Bryce A. Seifert
- Division of Intramural Research, National Institute of Allergy and Infectious Diseases (NIAID), NIH, Bethesda, Maryland
| | - Rajarshi Ghosh
- Division of Intramural Research, National Institute of Allergy and Infectious Diseases (NIAID), NIH, Bethesda, Maryland
| | - Magdalena A. Walkiewicz
- Division of Intramural Research, National Institute of Allergy and Infectious Diseases (NIAID), NIH, Bethesda, Maryland
| | - Isaac Brownell
- Dermatology Branch, National Institute of Arthritis and Musculoskeletal and Skin Diseases (NIAMS), National Institutes of Health (NIH), Bethesda, Maryland
| |
Collapse
|
16
|
Wyrwoll MJ, van der Heijden GW, Krausz C, Aston KI, Kliesch S, McLachlan R, Ramos L, Conrad DF, O'Bryan MK, Veltman JA, Tüttelmann F. Improved phenotypic classification of male infertility to promote discovery of genetic causes. Nat Rev Urol 2024; 21:91-101. [PMID: 37723288 DOI: 10.1038/s41585-023-00816-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/16/2023] [Indexed: 09/20/2023]
Abstract
An increasing number of genes are being described in the context of non-syndromic male infertility. Linking the underlying genetic causes of non-syndromic male infertility with clinical data from patients is important to establish new genotype-phenotype correlations. This process can be facilitated by using universal nomenclature, but no standardized vocabulary is available in the field of non-syndromic male infertility. The International Male Infertility Genomics Consortium aimed at filling this gap, providing a standardized vocabulary containing nomenclature based on the Human Phenotype Ontology (HPO). The "HPO tree" was substantially revised compared with the previous version and is based on the clinical work-up of infertile men, including physical examination and hormonal assessment. Some causes of male infertility can already be suspected based on the patient's clinical history, whereas in other instances, a testicular biopsy is needed for diagnosis. We assembled 49 HPO terms that are linked in a logical hierarchy and showed examples of morphological features of spermatozoa and testicular histology of infertile men with identified genetic diagnoses to describe the phenotypes. This work will help to record patients' phenotypes systematically and facilitate communication between geneticists and andrologists. Collaboration across institutions will improve the identification of patients with the same phenotypes, which will promote the discovery of novel genetic causes for non-syndromic male infertility.
Collapse
Affiliation(s)
- Margot J Wyrwoll
- Institute of Reproductive Genetics, University of Münster, Münster, Germany
| | | | - Csilla Krausz
- Department of Biomedical, Experimental and Clinical Sciences "Mario Serio", University of Florence, University Hospital of Careggi (AOUC), Florence, Italy
| | - Kenneth I Aston
- Andrology and IVF Laboratory, Department of Surgery (Urology), University of Utah, Salt Lake City, UT, USA
| | - Sabine Kliesch
- Centre of Reproductive Medicine and Andrology, Department of Clinical and Surgical Andrology, University of Münster, Münster, Germany
| | - Robert McLachlan
- Department of Clinical Research, Hudson Institute of Medical Research, Melbourne, Victoria, Australia
| | - Liliana Ramos
- Department of Obstetrics and Gynecology, Radboud University Medical Center, Nijmegen, Netherlands
| | - Donald F Conrad
- Department of Genetics, Oregon National Primate Research Center, Oregon Health and Science University, Beaverton, OR, USA
| | - Moira K O'Bryan
- School of BioSciences and Bio21 Institute, The University of Melbourne, Parkville, Victoria, Australia
| | - Joris A Veltman
- Biosciences Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne, UK
| | - Frank Tüttelmann
- Institute of Reproductive Genetics, University of Münster, Münster, Germany.
| |
Collapse
|
17
|
Groza T, Caufield H, Gration D, Baynam G, Haendel MA, Robinson PN, Mungall CJ, Reese JT. An evaluation of GPT models for phenotype concept recognition. BMC Med Inform Decis Mak 2024; 24:30. [PMID: 38297371 PMCID: PMC10829255 DOI: 10.1186/s12911-024-02439-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 01/24/2024] [Indexed: 02/02/2024] Open
Abstract
OBJECTIVE Clinical deep phenotyping and phenotype annotation play a critical role in both the diagnosis of patients with rare disorders as well as in building computationally-tractable knowledge in the rare disorders field. These processes rely on using ontology concepts, often from the Human Phenotype Ontology, in conjunction with a phenotype concept recognition task (supported usually by machine learning methods) to curate patient profiles or existing scientific literature. With the significant shift in the use of large language models (LLMs) for most NLP tasks, we examine the performance of the latest Generative Pre-trained Transformer (GPT) models underpinning ChatGPT as a foundation for the tasks of clinical phenotyping and phenotype annotation. MATERIALS AND METHODS The experimental setup of the study included seven prompts of various levels of specificity, two GPT models (gpt-3.5-turbo and gpt-4.0) and two established gold standard corpora for phenotype recognition, one consisting of publication abstracts and the other clinical observations. RESULTS The best run, using in-context learning, achieved 0.58 document-level F1 score on publication abstracts and 0.75 document-level F1 score on clinical observations, as well as a mention-level F1 score of 0.7, which surpasses the current best in class tool. Without in-context learning, however, performance is significantly below the existing approaches. CONCLUSION Our experiments show that gpt-4.0 surpasses the state of the art performance if the task is constrained to a subset of the target ontology where there is prior knowledge of the terms that are expected to be matched. While the results are promising, the non-deterministic nature of the outcomes, the high cost and the lack of concordance between different runs using the same prompt and input make the use of these LLMs challenging for this particular task.
Collapse
Affiliation(s)
- Tudor Groza
- Rare Care Centre, Perth Children's Hospital, 15 Hospital Avenue, Nedlands, WA, 6009, Australia.
- Telethon Kids Institute, 15 Hospital Avenue, Nedlands, WA, 6009, Australia.
- School of Electrical Engineering, Computing and Mathematical Sciences, Curtin University, Kent St, Bentley, WA, 6102, Australia.
- SingHealth Duke-NUS Institute of Precision Medicine, 5 Hospital Drive Level 9, Singapore, 169609, Singapore.
| | - Harry Caufield
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Dylan Gration
- Western Australian Register of Developmental Anomalies, King Edward Memorial Hospital, 374 Bagot Road, Subiaco, WA, 6008, Australia
| | - Gareth Baynam
- Rare Care Centre, Perth Children's Hospital, 15 Hospital Avenue, Nedlands, WA, 6009, Australia
- Telethon Kids Institute, 15 Hospital Avenue, Nedlands, WA, 6009, Australia
- Western Australian Register of Developmental Anomalies, King Edward Memorial Hospital, 374 Bagot Road, Subiaco, WA, 6008, Australia
- Faculty of Health and Medical Sciences, University of Western Australia, 35 Stirling Hwy, Crawley, WA, 6009, Australia
| | - Melissa A Haendel
- University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA
| | - Peter N Robinson
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06032, USA
- Institute for Systems Genomics, University of Connecticut, Farmington, CT, 06032, USA
| | - Christopher J Mungall
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | - Justin T Reese
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| |
Collapse
|
18
|
Hasselgren C, Oprea TI. Artificial Intelligence for Drug Discovery: Are We There Yet? Annu Rev Pharmacol Toxicol 2024; 64:527-550. [PMID: 37738505 DOI: 10.1146/annurev-pharmtox-040323-040828] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/24/2023]
Abstract
Drug discovery is adapting to novel technologies such as data science, informatics, and artificial intelligence (AI) to accelerate effective treatment development while reducing costs and animal experiments. AI is transforming drug discovery, as indicated by increasing interest from investors, industrial and academic scientists, and legislators. Successful drug discovery requires optimizing properties related to pharmacodynamics, pharmacokinetics, and clinical outcomes. This review discusses the use of AI in the three pillars of drug discovery: diseases, targets, and therapeutic modalities, with a focus on small-molecule drugs. AI technologies, such as generative chemistry, machine learning, and multiproperty optimization, have enabled several compounds to enter clinical trials. The scientific community must carefully vet known information to address the reproducibility crisis. The full potential of AI in drug discovery can only be realized with sufficient ground truth and appropriate human intervention at later pipeline stages.
Collapse
Affiliation(s)
- Catrin Hasselgren
- Safety Assessment, Genentech, Inc., South San Francisco, California, USA
| | - Tudor I Oprea
- Expert Systems Inc., San Diego, California, USA;
- Department of Internal Medicine, University of New Mexico Health Sciences Center, Albuquerque, New Mexico, USA
| |
Collapse
|
19
|
Yang J, Liu C, Deng W, Wu D, Weng C, Zhou Y, Wang K. Enhancing phenotype recognition in clinical notes using large language models: PhenoBCBERT and PhenoGPT. PATTERNS (NEW YORK, N.Y.) 2024; 5:100887. [PMID: 38264716 PMCID: PMC10801236 DOI: 10.1016/j.patter.2023.100887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 10/25/2023] [Accepted: 11/06/2023] [Indexed: 01/25/2024]
Abstract
To enhance phenotype recognition in clinical notes of genetic diseases, we developed two models-PhenoBCBERT and PhenoGPT-for expanding the vocabularies of Human Phenotype Ontology (HPO) terms. While HPO offers a standardized vocabulary for phenotypes, existing tools often fail to capture the full scope of phenotypes due to limitations from traditional heuristic or rule-based approaches. Our models leverage large language models to automate the detection of phenotype terms, including those not in the current HPO. We compare these models with PhenoTagger, another HPO recognition tool, and found that our models identify a wider range of phenotype concepts, including previously uncharacterized ones. Our models also show strong performance in case studies on biomedical literature. We evaluate the strengths and weaknesses of BERT- and GPT-based models in aspects such as architecture and accuracy. Overall, our models enhance automated phenotype detection from clinical texts, improving downstream analyses on human diseases.
Collapse
Affiliation(s)
- Jingye Yang
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Mathematics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Cong Liu
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, USA
| | - Wendy Deng
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Da Wu
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Chunhua Weng
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, USA
| | - Yunyun Zhou
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Biostatistics and Bioinformatics Facility, Fox Chase Cancer Center, Philadelphia, PA 19111, USA
| | - Kai Wang
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
20
|
Gargano MA, Matentzoglu N, Coleman B, Addo-Lartey EB, Anagnostopoulos A, Anderton J, Avillach P, Bagley AM, Bakštein E, Balhoff JP, Baynam G, Bello SM, Berk M, Bertram H, Bishop S, Blau H, Bodenstein DF, Botas P, Boztug K, Čady J, Callahan TJ, Cameron R, Carbon S, Castellanos F, Caufield JH, Chan LE, Chute C, Cruz-Rojo J, Dahan-Oliel N, Davids JR, de Dieuleveult M, de Souza V, de Vries BBA, de Vries E, DePaulo JR, Derfalvi B, Dhombres F, Diaz-Byrd C, Dingemans AJM, Donadille B, Duyzend M, Elfeky R, Essaid S, Fabrizzi C, Fico G, Firth HV, Freudenberg-Hua Y, Fullerton JM, Gabriel DL, Gilmour K, Giordano J, Goes FS, Moses RG, Green I, Griese M, Groza T, Gu W, Guthrie J, Gyori B, Hamosh A, Hanauer M, Hanušová K, He Y(O, Hegde H, Helbig I, Holasová K, Hoyt CT, Huang S, Hurwitz E, Jacobsen JOB, Jiang X, Joseph L, Keramatian K, King B, Knoflach K, Koolen DA, Kraus M, Kroll C, Kusters M, Ladewig MS, Lagorce D, Lai MC, Lapunzina P, Laraway B, Lewis-Smith D, Li X, Lucano C, Majd M, Marazita ML, Martinez-Glez V, McHenry TH, McInnis MG, McMurry JA, Mihulová M, Millett CE, Mitchell PB, Moslerová V, Narutomi K, Nematollahi S, Nevado J, Nierenberg AA, Čajbiková NN, Nurnberger JI, Ogishima S, Olson D, Ortiz A, Pachajoa H, Perez de Nanclares G, Peters A, Putman T, Rapp CK, Rath A, Reese J, Rekerle L, Roberts A, Roy S, Sanders SJ, Schuetz C, Schulte EC, Schulze TG, Schwarz M, Scott K, Seelow D, Seitz B, Shen Y, Similuk MN, Simon ES, Singh B, Smedley D, Smith CL, Smolinsky JT, Sperry S, Stafford E, Stefancsik R, Steinhaus R, Strawbridge R, Sundaramurthi JC, Talapova P, Tenorio Castano JA, Tesner P, Thomas RH, Thurm A, Turnovec M, van Gijn ME, Vasilevsky NA, Vlčková M, Walden A, Wang K, Wapner R, Ware JS, Wiafe AA, Wiafe SA, Wiggins LD, Williams AE, Wu C, Wyrwoll MJ, Xiong H, Yalin N, Yamamoto Y, Yatham LN, Yocum AK, Young AH, Yüksel Z, Zandi PP, Zankl A, Zarante I, Zvolský M, Toro S, Carmody LC, Harris NL, Munoz-Torres MC, Danis D, Mungall CJ, Köhler S, Haendel MA, Robinson PN. The Human Phenotype Ontology in 2024: phenotypes around the world. Nucleic Acids Res 2024; 52:D1333-D1346. [PMID: 37953324 PMCID: PMC10767975 DOI: 10.1093/nar/gkad1005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 10/12/2023] [Accepted: 10/19/2023] [Indexed: 11/14/2023] Open
Abstract
The Human Phenotype Ontology (HPO) is a widely used resource that comprehensively organizes and defines the phenotypic features of human disease, enabling computational inference and supporting genomic and phenotypic analyses through semantic similarity and machine learning algorithms. The HPO has widespread applications in clinical diagnostics and translational research, including genomic diagnostics, gene-disease discovery, and cohort analytics. In recent years, groups around the world have developed translations of the HPO from English to other languages, and the HPO browser has been internationalized, allowing users to view HPO term labels and in many cases synonyms and definitions in ten languages in addition to English. Since our last report, a total of 2239 new HPO terms and 49235 new HPO annotations were developed, many in collaboration with external groups in the fields of psychiatry, arthrogryposis, immunology and cardiology. The Medical Action Ontology (MAxO) is a new effort to model treatments and other measures taken for clinical management. Finally, the HPO consortium is contributing to efforts to integrate the HPO and the GA4GH Phenopacket Schema into electronic health records (EHRs) with the goal of more standardized and computable integration of rare disease data in EHRs.
Collapse
Affiliation(s)
| | | | - Ben Coleman
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | | | - Joel Anderton
- Center for Craniofacial and Dental Genetics, Department of Oral and Craniofacial Sciences, School of Dental Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | | | - Anita M Bagley
- Shriners Children's Northern California, Sacramento, CA, USA
| | - Eduard Bakštein
- National Institute of Mental Health, Klecany, Czech Republic
| | - James P Balhoff
- Renaissance Computing Institute, University of North Carolina, Chapel Hill, NC 27517, USA
| | - Gareth Baynam
- Rare Care Centre, Perth Children's Hospital, Perth, Australia
| | | | - Michael Berk
- Deakin University, IMPACT - the Institute for Mental and Physical Health and Clinical Translation, School of Medicine, Barwon Health, Geelong, Australia
| | - Holli Bertram
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | - Somer Bishop
- Department of Psychiatry and Behavioral Sciences, UCSF Weil Institute for Neuroscience, San Francisco, CA, USA
| | - Hannah Blau
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - David F Bodenstein
- Department of Pharmacology and Toxicology, University of Toronto, Toronto, ON, Canada
| | | | - Kaan Boztug
- St. Anna Children's Cancer Research Institute (CCRI), Vienna, Austria
| | - Jolana Čady
- Institute of Health Information and Statistics of the Czech Republic, Prague, Czech Republic
| | - Tiffany J Callahan
- Department of Biomedical Informatics, Columbia University Irving Medical Center, NY, NY, USA
| | | | - Seth J Carbon
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | | | - J Harry Caufield
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Lauren E Chan
- College of Public Health and Human Sciences, Oregon State University, Corvallis, OR 97331, USA
| | - Christopher G Chute
- Schools of Medicine, Public Health, and Nursing, Johns Hopkins University, Baltimore, MD 21287, USA
| | - Jaime Cruz-Rojo
- UDISGEN (Dysmorphology and Genetics Unit), 12 de Octubre Hospital, Madrid, Spain
| | - Noémi Dahan-Oliel
- Department of Clinical Research, Shriners Hospitals for Children, Montreal, Quebec, Canada
| | - Jon R Davids
- Shriners Children's Northern California, Sacramento, CA, USA
| | - Maud de Dieuleveult
- Département I&D, AP-HP, Banque Nationale de Données Maladies Rares, Paris, France
| | - Vinicius de Souza
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Bert B A de Vries
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, Netherlands
| | | | - J Raymond DePaulo
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Beata Derfalvi
- Department of Pediatrics, Dalhousie University, Halifax, NS, Canada
| | - Ferdinand Dhombres
- Fetal Medicine Department, Armand Trousseau Hospital, Sorbonne University, GRC26, INSERM, Limics, Paris, France
| | - Claudia Diaz-Byrd
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | - Alexander J M Dingemans
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, Netherlands
| | - Bruno Donadille
- St Antoine Hospital, Reference Center for Rare Growth Endocrine Disorders, Sorbonne University, AP-HP, INSERM, US14 - Orphanet, Plateforme Maladies Rares, Paris, France
| | | | - Reem Elfeky
- Department of Immunology, GOS Hospital for Children NHS Foundation Trust, University College London, London, UK
| | - Shahim Essaid
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | | | - Giovanna Fico
- Bipolar and Depressive Disorders Unit, Institute of Neuroscience, Hospital Clinic, University of Barcelona, IDIBAPS, CIBERSAM, Barcelona, Catalonia, Spain
| | - Helen V Firth
- Addenbrooke's Hospital, Cambridge University Hospitals, Cambridge, UK
| | - Yun Freudenberg-Hua
- Department of Psychiatry, Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA
| | | | - Davera L Gabriel
- School of Medicine, Johns Hopkins University, Baltimore, MD 21287, USA
| | | | - Jessica Giordano
- Department of Obstetrics and Gynecology, Columbia University Irving Medical Center, New York, NY, USA
| | - Fernando S Goes
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Rachel Gore Moses
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Ian Green
- SNOMED International, London W2 6BD, UK
| | - Matthias Griese
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, German center for Lung research (DZL), Munich, Germany
| | - Tudor Groza
- Rare Care Centre, Perth Children's Hospital, Perth, Australia
| | | | - Julia Guthrie
- Department of Structural and Computational Biology, University of Vienna; Max Perutz Labs, Vienna, Austria
| | - Benjamin Gyori
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Ada Hamosh
- Department of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Marc Hanauer
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, Paris, France
| | - Kateřina Hanušová
- Institute of Health Information and Statistics of the Czech Republic, Prague, Czech Republic
| | | | - Harshad Hegde
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Ingo Helbig
- Neurology, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Kateřina Holasová
- Institute of Health Information and Statistics of the Czech Republic, Prague, Czech Republic
| | - Charles Tapley Hoyt
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | | | - Eric Hurwitz
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Julius O B Jacobsen
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | | | - Lisa Joseph
- Neurodevelopmental and Behavioral Phenotyping Service, National Institute of Mental Health, Bethesda, MD, USA
| | - Kamyar Keramatian
- Department of Psychiatry, University of British Columbia, Vancouver, BC, Canada
| | - Bryan King
- Department of Psychiatry and Behavioral Sciences, UCSF Weil Institute for Neuroscience, San Francisco, CA, USA
| | - Katrin Knoflach
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, German center for Lung research (DZL), Munich, Germany
| | - David A Koolen
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, Netherlands
| | - Megan L Kraus
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Carlo Kroll
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | - Maaike Kusters
- Immunology, NIHR Great Ormond Street Hospital BRC, London, UK
| | - Markus S Ladewig
- Department of Ophthalmology, University Clinic Marburg - Campus Fulda, Fulda, Germany
| | - David Lagorce
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, Paris, France
| | - Meng-Chuan Lai
- Campbell Family Mental Health Research Institute, Centre for Addiction and Mental Health, Toronto, ON, Canada
| | - Pablo Lapunzina
- Institute of Medical and Molecular Genetics, Hospital Univ. La Paz, Madrid, Spain
| | - Bryan Laraway
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - David Lewis-Smith
- Translational and Clinical Research Institute, Henry Wellcome Building, Framlington Place, Newcastle University, Newcastle-Upon-Tyne NE14LP, UK
| | | | - Caterina Lucano
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, Paris, France
| | - Marzieh Majd
- Department of Psychiatry, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Mary L Marazita
- Center for Craniofacial and Dental Genetics, Department of Oral and Craniofacial Sciences, School of Dental Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | - Victor Martinez-Glez
- Center for Genomic Medicine, Parc Taulí Hospital Universitari, Institut d’Investigació i Innovació Parc Taulí (I3PT-CERCA), Sabadell, Spain
| | - Toby H McHenry
- Center for Craniofacial and Dental Genetics, Department of Oral and Craniofacial Sciences, School of Dental Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | - Melvin G McInnis
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | - Julie A McMurry
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Michaela Mihulová
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Caitlin E Millett
- Feinstein Institutes for Medical Research, Northwell Health, Manhasset, NY, USA
| | - Philip B Mitchell
- Discipline of Psychiatry & Mental Health, School of Clinical Medicine, Faculty of Medicine & Health, University of New South Wales, Sydney, NSW, Australia
| | - Veronika Moslerová
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Kenji Narutomi
- Okinawa Prefectural Nanbu Medical Center & Children's Medical Center
| | - Shahrzad Nematollahi
- School of Physical and Occupational Therapy, McGill University, Montreal, Quebec, Canada
| | - Julian Nevado
- Institute of Medical and Molecular Genetics, Hospital Univ. La Paz, Madrid, Spain
| | - Andrew A Nierenberg
- Dauten Family Center for Bipolar Treatment Innovation, Massachusetts General Hospital, Boston, MA, USA
| | - Nikola Novák Čajbiková
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - John I Nurnberger
- Stark Neurosciences Research Institute, Departments of Psychiatry and Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA
| | | | - Daniel Olson
- Data Collaboration Center, Data Science, Critical Path Institute, Tucson, AZ, USA
| | - Abigail Ortiz
- Department of Psychiatry, University of Toronto, Toronto, ON, Canada
| | - Harry Pachajoa
- Centro de Investigaciones en Anomalías Congénitas y Enfermedades Raras (CIACER), Universidad Icesi, Cali, Colombia
| | - Guiomar Perez de Nanclares
- Molecular (epi) genetics lab, Bioaraba Health Research Institute, Araba University Hospital, Vitoria-Gasteiz, Spain
| | - Amy Peters
- Department of Psychiatry, Massachusetts General Hospital, Boston, MA, USA
| | - Tim Putman
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Christina K Rapp
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, LMU Munich, German center for Lung research (DZL), Munich, Germany
| | - Ana Rath
- INSERM, US14 - Orphanet, Plateforme Maladies Rares, Paris, France
| | - Justin Reese
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Lauren Rekerle
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Angharad M Roberts
- National Heart & Lung Institute & MRC London Institute of Medical Sciences, Imperial College London, London W12 0HS, UK
| | - Suzy Roy
- SNOMED International, London W2 6BD, UK
| | - Stephan J Sanders
- Department of Paediatrics, Institute of Developmental and Regenerative Medicine, University of Oxford, Oxford, UK
| | - Catharina Schuetz
- Universitätsklinikum Carl Gustav Carus, Medizinische Fakultät, TU, Dresden, Germany
| | - Eva C Schulte
- Institute of Psychiatric Phenomics and Genomics (IPPG), LMU University Hospital, LMU Munich, Munich, Germany
| | - Thomas G Schulze
- Department of Psychiatry and Behavioral Sciences, SUNY Upstate Medical University, Syracuse, NY, USA
| | - Martin Schwarz
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Katie Scott
- Department of Psychiatry, Dalhousie University, Halifax, NS, Canada
| | - Dominik Seelow
- Exploratory Diagnostic Sciences, Berliner Institut für Gesundheitsforschung - Charité, Berlin, Germany
| | - Berthold Seitz
- Department of Ophthalmology, Saarland University Medical Center UKS, Homburg/Saar, Germany
| | | | - Morgan N Similuk
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892, USA
| | - Eric S Simon
- Eisenberg Family Depression Center, University of Michigan, Ann Arbor, MI, USA
| | - Balwinder Singh
- Department of Psychiatry and Psychology, Mayo Clinic, Rochester, MN, USA
| | - Damian Smedley
- William Harvey Research Institute, Queen Mary University of London, London, UK
| | | | - Jake T Smolinsky
- Human Genetics Institute of New Jersey, Rutgers University, Piscataway, NJ, USA
| | - Sarah Sperry
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | | | - Ray Stefancsik
- European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Robin Steinhaus
- Exploratory Diagnostic Sciences, Berliner Institut für Gesundheitsforschung - Charité, Berlin, Germany
| | - Rebecca Strawbridge
- Department of Psychological Medicine, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
| | | | - Polina Talapova
- Institute for Research and Health Policy Studies, Tufts Medicine, Boston, MA 2111, USA
| | | | - Pavel Tesner
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Rhys H Thomas
- Translational and Clinical Research Institute, Henry Wellcome Building, Framlington Place, Newcastle University, Newcastle-Upon-Tyne NE14LP, UK
| | - Audrey Thurm
- Neurodevelopmental and Behavioral Phenotyping Service, National Institute of Mental Health, Bethesda, MD, USA
| | - Marek Turnovec
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Marielle E van Gijn
- Department of Genetics, University Medical Center Groningen, Groningen, Netherlands
| | | | - Markéta Vlčková
- Department of Biology and Medical Genetics, 2nd Medical Faculty of Charles University and University Hospital Motol, Prague, Czech Republic
| | - Anita Walden
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Kai Wang
- Chinese HPO Consortium, Beijing, China
| | - Ron Wapner
- Department of Obstetrics and Gynecology, Columbia University Irving Medical Center, New York, NY, USA
| | - James S Ware
- National Heart & Lung Institute & MRC London Institute of Medical Sciences, Imperial College London, London W12 0HS, UK
| | | | | | - Lisa D Wiggins
- National Center on Birth Defects and Developmental Disabilities, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Andrew E Williams
- Institute for Research and Health Policy Studies, Tufts Medicine, Boston, MA 2111, USA
| | - Chen Wu
- Chinese HPO Consortium, Beijing, China
| | - Margot J Wyrwoll
- Centre for Regenerative Medicine, Institute for Regeneration and Repair, Institute for Stem Cell Research, University of Edinburgh, Edinburgh, UK
| | - Hui Xiong
- Chinese HPO Consortium, Beijing, China
| | - Nefize Yalin
- Department of Psychological Medicine, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK
| | - Yasunori Yamamoto
- Database Center for Life Science, Joint Support-Center for Data Science Research, Research Organization of Information and Systems, Japan
| | - Lakshmi N Yatham
- Department of Psychiatry, University of British Columbia, Vancouver, BC, Canada
| | - Anastasia K Yocum
- Department of Psychiatry, University of Michigan, Ann Arbor, MI, USA
| | - Allan H Young
- Psychological Medicine, Institute of Psychiatry, Psychology and Neuroscience, King's College London & South London and Maudsley NHS Foundation Trust, Bethlem Royal Hospital, Monks Orchard Road, Beckenham, Kent, London SE5 8AF, UK
| | - Zafer Yüksel
- Department of Human Genetics, Bioscientia Healthcare GmbH, Ingelheim, Germany
| | - Peter P Zandi
- Department of Psychiatry and Behavioral Sciences, Johns Hopkins University School of Medicine, Baltimore, MD 21287, USA
| | - Andreas Zankl
- Faculty of Medicine and Health, The University of Sydney, Camperdown, Australia
| | - Ignacio Zarante
- Institute of Human Genetics, Pontificia Universidad Javeriana, Bogotá, Colombia
| | - Miroslav Zvolský
- Institute of Health Information and Statistics of the Czech Republic, Prague, Czech Republic
| | - Sabrina Toro
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Leigh C Carmody
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Nomi L Harris
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | - Monica C Munoz-Torres
- Department of Biomedical Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Daniel Danis
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | - Christopher J Mungall
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
| | | | - Melissa A Haendel
- University of Colorado Anschutz Medical Campus, Aurora, CO 80045, USA
| | - Peter N Robinson
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| |
Collapse
|
21
|
Yu D, Huang CJ, Tucker HO. Established and Evolving Roles of the Multifunctional Non-POU Domain-Containing Octamer-Binding Protein (NonO) and Splicing Factor Proline- and Glutamine-Rich (SFPQ). J Dev Biol 2024; 12:3. [PMID: 38248868 PMCID: PMC10801543 DOI: 10.3390/jdb12010003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Revised: 12/25/2023] [Accepted: 12/28/2023] [Indexed: 01/23/2024] Open
Abstract
It has been more than three decades since the discovery of multifunctional factors, the Non-POU-Domain-Containing Octamer-Binding Protein, NonO, and the Splicing Factor Proline- and Glutamine-Rich, SFPQ. Some of their functions, including their participation in transcriptional and posttranscriptional regulation as well as their contribution to paraspeckle subnuclear body organization, have been well documented. In this review, we focus on several other established roles of NonO and SFPQ, including their participation in the cell cycle, nonhomologous end-joining (NHEJ), homologous recombination (HR), telomere stability, childhood birth defects and cancer. In each of these contexts, the absence or malfunction of either or both NonO and SFPQ leads to either genome instability, tumor development or mental impairment.
Collapse
Affiliation(s)
- Danyang Yu
- Department of Biology, New York University in Shanghai, Shanghai 200122, China;
| | - Ching-Jung Huang
- Department of Biology, New York University in Shanghai, Shanghai 200122, China;
| | - Haley O. Tucker
- Molecular Biosciences, Institute for Cellular and Molecular Biology, University of Texas at Austin, 1 University Station A5000, Austin, TX 78712, USA
| |
Collapse
|
22
|
Boschen KE, Dragicevich CJ, Fish EW, Hepperla AJ, Simon JM, Parnell SE. Gastrulation-stage alcohol exposure induces similar rates of craniofacial malformations in male and female C57BL/6J mice. Birth Defects Res 2024; 116:e2292. [PMID: 38116840 PMCID: PMC10872400 DOI: 10.1002/bdr2.2292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 11/18/2023] [Accepted: 12/06/2023] [Indexed: 12/21/2023]
Abstract
BACKGROUND Prenatal alcohol exposure during gastrulation (embryonic day [E] 7 in mice, ~3rd week of human pregnancy) impairs eye, facial, and cortical development, recapitulating birth defects characteristic of Fetal Alcohol Syndrome (FAS). However, it is not known whether the prevalence or severity of craniofacial features associated with FAS is affected by biological sex. METHODS The current study administered either alcohol (2.9 g/kg, two i.p. doses, 4 hr apart) or vehicle to pregnant C57BL/6J females on E7, prior to gonadal sex differentiation, and assessed fetal morphology at E17. RESULTS Whereas sex did not affect fetal size in controls, alcohol-exposed females were smaller than both control females and alcohol-treated males. Alcohol exposure increased the incidence of eye defects to a similar degree in males and females. Together, these data suggest that females might be more sensitive to the general developmental effects of alcohol, but not effects specific to the craniofacies. Whole transcriptomic analysis of untreated E7 embryos found 214 differentially expressed genes in females vs. males, including those in pathways related to cilia and mitochondria, histone demethylase activity, and pluripotency. CONCLUSION Gastrulation-stage alcohol induces craniofacial malformations in male and female mouse fetuses at similar rates and severity, though growth deficits are more prevalent females. These findings support the investigation of biological sex as a contributing factor in prenatal alcohol studies.
Collapse
Affiliation(s)
- Karen E. Boschen
- Bowles Center for Alcohol Studies, The University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Constance J. Dragicevich
- Bowles Center for Alcohol Studies, The University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Eric W. Fish
- Bowles Center for Alcohol Studies, The University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Austin J. Hepperla
- Carolina Institute for Developmental Disabilities, The University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Jeremy M. Simon
- Carolina Institute for Developmental Disabilities, The University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- UNC Neuroscience Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Department of Genetics, The University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Scott E. Parnell
- Bowles Center for Alcohol Studies, The University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Department of Cell Biology and Physiology, The University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| |
Collapse
|
23
|
Carmody LC, Gargano MA, Toro S, Vasilevsky NA, Adam MP, Blau H, Chan LE, Gomez-Andres D, Horvath R, Kraus ML, Ladewig MS, Lewis-Smith D, Lochmüller H, Matentzoglu NA, Munoz-Torres MC, Schuetz C, Seitz B, Similuk MN, Sparks TN, Strauss T, Swietlik EM, Thompson R, Zhang XA, Mungall CJ, Haendel MA, Robinson PN. The Medical Action Ontology: A tool for annotating and analyzing treatments and clinical management of human disease. MED 2023; 4:913-927.e3. [PMID: 37963467 PMCID: PMC10842845 DOI: 10.1016/j.medj.2023.10.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 08/31/2023] [Accepted: 10/14/2023] [Indexed: 11/16/2023]
Abstract
BACKGROUND Navigating the clinical literature to determine the optimal clinical management for rare diseases presents significant challenges. We introduce the Medical Action Ontology (MAxO), an ontology specifically designed to organize medical procedures, therapies, and interventions. METHODS MAxO incorporates logical structures that link MAxO terms to numerous other ontologies within the OBO Foundry. Term development involves a blend of manual and semi-automated processes. Additionally, we have generated annotations detailing diagnostic modalities for specific phenotypic abnormalities defined by the Human Phenotype Ontology (HPO). We introduce a web application, POET, that facilitates MAxO annotations for specific medical actions for diseases using the Mondo Disease Ontology. FINDINGS MAxO encompasses 1,757 terms spanning a wide range of biomedical domains, from human anatomy and investigations to the chemical and protein entities involved in biological processes. These terms annotate phenotypic features associated with specific disease (using HPO and Mondo). Presently, there are over 16,000 MAxO diagnostic annotations that target HPO terms. Through POET, we have created 413 MAxO annotations specifying treatments for 189 rare diseases. CONCLUSIONS MAxO offers a computational representation of treatments and other actions taken for the clinical management of patients. Its development is closely coupled to Mondo and HPO, broadening the scope of our computational modeling of diseases and phenotypic features. We invite the community to contribute disease annotations using POET (https://poet.jax.org/). MAxO is available under the open-source CC-BY 4.0 license (https://github.com/monarch-initiative/MAxO). FUNDING NHGRI 1U24HG011449-01A1 and NHGRI 5RM1HG010860-04.
Collapse
Affiliation(s)
- Leigh C Carmody
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | - Sabrina Toro
- University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | | | - Margaret P Adam
- University of Washington School of Medicine, Seattle, WA, USA
| | - Hannah Blau
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
| | | | - David Gomez-Andres
- Pediatric Neurology, Vall d'Hebron Institut de Recerca (VHIR), Hospital Universitari Vall d'Hebron, Vall d'Hebron Barcelona Hospital Campus, Passeig Vall d'Hebron 119-129, 08035 Barcelona, Spain
| | - Rita Horvath
- Department of Clinical Neurosciences, University of Cambridge, Robinson Way, Cambridge CB2 0PY, UK
| | - Megan L Kraus
- University of Colorado Anschutz Medical Campus, Aurora, CO, USA
| | - Markus S Ladewig
- Department of Ophthalmology, Klinikum Saarbrücken, Saarbrücken, Germany
| | - David Lewis-Smith
- Translational and Clinical Research Institute, Newcastle University, Newcastle upon Tyne NE2 4HH, UK
| | - Hanns Lochmüller
- Children's Hospital of Eastern Ontario Research Institute, Ottowa, Canada; Division of Neurology, Department of Medicine, The Ottawa Hospital, Ottawa, Canada; Brain and Mind Research Institute, University of Ottawa, Ottawa, Canada; Department of Neuropediatrics and Muscle Disorders, Medical Center - University of Freiburg, Faculty of Medicine, Freiburg, Germany; Centro Nacional de Análisis Genómico, Barcelona, Spain
| | | | | | - Catharina Schuetz
- Department of Pediatrics, Medizinische Fakultät Carl Gustav Carus, Technische Universität Dresden, 01307 Dresden, Germany
| | - Berthold Seitz
- Department of Ophthalmology, Saarland University Medical Center UKS, Homburg, Saar, Germany
| | - Morgan N Similuk
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, USA
| | - Teresa N Sparks
- Department of Obstetrics, Gynecology, & Reproductive Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Timmy Strauss
- Department of Pediatrics, Medizinische Fakultät Carl Gustav Carus, Technische Universität Dresden, 01307 Dresden, Germany
| | - Emilia M Swietlik
- Department of Medicine, University of Cambridge, Heart and Lung Research Institute, Cambridge CB2 0BB, UK
| | - Rachel Thompson
- Children's Hospital of Eastern Ontario Research Institute, Ottowa, Canada
| | | | | | | | - Peter N Robinson
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA.
| |
Collapse
|
24
|
Jankowska M, Soler MJ, Stevens KI, Torra R. Why do we keep ignoring sex in kidney disease? Clin Kidney J 2023; 16:2327-2335. [PMID: 38046033 PMCID: PMC10689162 DOI: 10.1093/ckj/sfad183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Indexed: 12/05/2023] Open
Abstract
Throughout the history of nephrology, little attention has been paid to the sex and gender differences in kidney disease. This lack of awareness prevents optimal diagnosis and management of kidney disease. In today's world of precision medicine, it is imperative to appreciate the differential factors regarding gender and kidney disease. This editorial summarizes the up-to-date literature regarding sex and gender differences in kidney disease and considers areas where knowledge is incomplete and where further research is needed. We address sex-specific effects on chronic kidney disease epidemiology; risks of dialysis underdosing and medication overdosing in women; unexplained loss of female sex advantage in life expectancy during dialysis, and impact of sex on diagnosis and management of genetic kidney disease. We also aim to highlight the impact of gender on kidney health and raise awareness of disparities that may be faced by women, and transgender and gender-diverse persons when a male-model approach is used by healthcare systems. By understanding the link between sex and kidney disease, kidney specialists can improve the care and outcomes of their patients. In addition, research on this topic can inform the development of targeted prevention and intervention strategies that address the specific needs and risk factors of different populations.
Collapse
Affiliation(s)
- Magdalena Jankowska
- Department of Nephrology, Transplantology and Internal Medicine, Faculty of Medicine, Medical University of Gdansk, Gdańsk, Poland
| | - María José Soler
- Department of Nephrology, Vall d'Hebron Hospital Universitari, Vall d'Hebron Barcelona Hospital Campus, Nephrology and Transplantation Group, Vall d'Hebron Institut de Recerca (VHIR), Barcelona, Spain
| | - Kate I Stevens
- The Renal and Transplant Unit, Queen Elizabeth University Hospital, Glasgow, UK
| | - Roser Torra
- Inherited Kidney Diseases, Nephrology Department, Fundació Puigvert, Institut d'Investigacions Biomèdiques Sant Pau (IIB-Sant Pau), Universitat Autónoma de Barcelona, Barcelona, Spain
| |
Collapse
|
25
|
Wilson LA, Macken WL, Perry LD, Record CJ, Schon KR, Frezatti RSS, Raga S, Naidu K, Köken ÖY, Polat I, Kapapa MM, Dominik N, Efthymiou S, Morsy H, Nel M, Fassad MR, Gao F, Patel K, Schoonen M, Bisschoff M, Vorster A, Jonvik H, Human R, Lubbe E, Nonyane M, Vengalil S, Nashi S, Srivastava K, Lemmers RJLF, Reyaz A, Mishra R, Töpf A, Trainor CI, Steyn EC, Mahungu AC, van der Vliet PJ, Ceylan AC, Hiz AS, Çavdarlı B, Semerci Gündüz CN, Ceylan GG, Nagappa M, Tallapaka KB, Govindaraj P, van der Maarel SM, Narayanappa G, Nandeesh BN, Wa Somwe S, Bearden DR, Kvalsund MP, Ramdharry GM, Oktay Y, Yiş U, Topaloğlu H, Sarkozy A, Bugiardini E, Henning F, Wilmshurst JM, Heckmann JM, McFarland R, Taylor RW, Smuts I, van der Westhuizen FH, Sobreira CFDR, Tomaselli PJ, Marques W, Bhatia R, Dalal A, Srivastava MVP, Yareeda S, Nalini A, Vishnu VY, Thangaraj K, Straub V, Horvath R, Chinnery PF, Pitceathly RDS, Muntoni F, Houlden H, Vandrovcova J, Reilly MM, Hanna MG. Neuromuscular disease genetics in under-represented populations: increasing data diversity. Brain 2023; 146:5098-5109. [PMID: 37516995 PMCID: PMC10690022 DOI: 10.1093/brain/awad254] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 07/04/2023] [Indexed: 08/01/2023] Open
Abstract
Neuromuscular diseases (NMDs) affect ∼15 million people globally. In high income settings DNA-based diagnosis has transformed care pathways and led to gene-specific therapies. However, most affected families are in low-to-middle income countries (LMICs) with limited access to DNA-based diagnosis. Most (86%) published genetic data is derived from European ancestry. This marked genetic data inequality hampers understanding of genetic diversity and hinders accurate genetic diagnosis in all income settings. We developed a cloud-based transcontinental partnership to build diverse, deeply-phenotyped and genetically characterized cohorts to improve genetic architecture knowledge, and potentially advance diagnosis and clinical management. We connected 18 centres in Brazil, India, South Africa, Turkey, Zambia, Netherlands and the UK. We co-developed a cloud-based data solution and trained 17 international neurology fellows in clinical genomic data interpretation. Single gene and whole exome data were analysed via a bespoke bioinformatics pipeline and reviewed alongside clinical and phenotypic data in global webinars to inform genetic outcome decisions. We recruited 6001 participants in the first 43 months. Initial genetic analyses 'solved' or 'possibly solved' ∼56% probands overall. In-depth genetic data review of the four commonest clinical categories (limb girdle muscular dystrophy, inherited peripheral neuropathies, congenital myopathy/muscular dystrophies and Duchenne/Becker muscular dystrophy) delivered a ∼59% 'solved' and ∼13% 'possibly solved' outcome. Almost 29% of disease causing variants were novel, increasing diverse pathogenic variant knowledge. Unsolved participants represent a new discovery cohort. The dataset provides a large resource from under-represented populations for genetic and translational research. In conclusion, we established a remote transcontinental partnership to assess genetic architecture of NMDs across diverse populations. It supported DNA-based diagnosis, potentially enabling genetic counselling, care pathways and eligibility for gene-specific trials. Similar virtual partnerships could be adopted by other areas of global genomic neurological practice to reduce genetic data inequality and benefit patients globally.
Collapse
Affiliation(s)
- Lindsay A Wilson
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London WC1N 3BG, UK
| | - William L Macken
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London WC1N 3BG, UK
| | - Luke D Perry
- Institute of Child Health and Centre for Neuromuscular Diseases, Neurosciences Unit, The Dubowitz Neuromuscular Centre, University College London, UCL Great Ormond Street, Great Ormond Street Hospital, London WC1N 3JH, UK
- NIHR Great Ormond Street Hospital Biomedical Research Centre, UCL Great Ormond Street Institute of Child Health, University College London, London WC1N 1EH, UK
| | - Christopher J Record
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London WC1N 3BG, UK
| | - Katherine R Schon
- Department of Clinical Neurosciences, University of Cambridge, Cambridge CB2 0QQ, UK
| | - Rodrigo S S Frezatti
- Department of Neurosciences, School of Medicine of Ribeirão Preto, University of São Paulo, São Paulo, Brazil
| | - Sharika Raga
- Neuroscience Institute, University of Cape Town, Cape Town, South Africa
- Division of Paediatric Neurology, Department of Paediatrics and Child Health, Red Cross War Memorial Children’s Hospital, Cape Town, South Africa
| | - Kireshnee Naidu
- Neurology Research Group, Division of Neurology, Department of Medicine, University of Cape Town, Cape Town, South Africa
- Division of Neurology, Department of Medicine, Stellenbosch University, Cape Town, South Africa
| | - Özlem Yayıcı Köken
- Faculty of Medicine, Department of Pediatric Neurology, Akdeniz University, Antalya, Turkey
| | - Ipek Polat
- Faculty of Medicine, Pediatric Neurology Department, Dokuz Eylül University, Izmir, Turkey
- Izmir International Biomedicine and Genome Institute, Dokuz Eylül University, Izmir, Turkey
| | - Musambo M Kapapa
- Department of Physiotherapy, University of Zambia School of Health Sciences & University Teaching Hospital Neurology Research Office, Lusaka, Zambia
| | - Natalia Dominik
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London WC1N 3BG, UK
| | - Stephanie Efthymiou
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London WC1N 3BG, UK
| | - Heba Morsy
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London WC1N 3BG, UK
| | - Melissa Nel
- Neuroscience Institute, University of Cape Town, Cape Town, South Africa
- Neurology Research Group, Division of Neurology, Department of Medicine, University of Cape Town, Cape Town, South Africa
| | - Mahmoud R Fassad
- Wellcome Centre for Mitochondrial Research, Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne NE2 4HH, UK
| | - Fei Gao
- Department of Clinical Neurosciences, University of Cambridge, Cambridge CB2 0QQ, UK
| | - Krutik Patel
- Wellcome Centre for Mitochondrial Research, Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne NE2 4HH, UK
| | - Maryke Schoonen
- Focus Area for Human Metabolomics, North-West University, Potchefstroom, South Africa
| | - Michelle Bisschoff
- Focus Area for Human Metabolomics, North-West University, Potchefstroom, South Africa
| | - Armand Vorster
- Focus Area for Human Metabolomics, North-West University, Potchefstroom, South Africa
| | - Hallgeir Jonvik
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London WC1N 3BG, UK
| | - Ronel Human
- Department of Paediatrics, Steve Biko Academic Hospital, University of Pretoria, Pretoria, South Africa
| | - Elsa Lubbe
- Department of Paediatrics, Steve Biko Academic Hospital, University of Pretoria, Pretoria, South Africa
| | - Malebo Nonyane
- Department of Paediatrics, Steve Biko Academic Hospital, University of Pretoria, Pretoria, South Africa
| | - Seena Vengalil
- Department of Neurology, National Institute of Mental Health and Neurosciences (NIMHANS), Bengaluru, India
| | - Saraswati Nashi
- Department of Neurology, National Institute of Mental Health and Neurosciences (NIMHANS), Bengaluru, India
| | - Kosha Srivastava
- Department of Neurology, National Institute of Mental Health and Neurosciences (NIMHANS), Bengaluru, India
| | - Richard J L F Lemmers
- Department of Human Genetics, Leiden University Medical Center (LUMC), Leiden, The Netherlands
| | - Alisha Reyaz
- Department of Neurology, All India Institute of Medical Sciences (AIIMS), Delhi, India
| | - Rinkle Mishra
- Department of Neurology, All India Institute of Medical Sciences (AIIMS), Delhi, India
| | - Ana Töpf
- John Walton Muscular Dystrophy Research Centre, Newcastle University Translational and Clinical Research Institute and Newcastle Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
| | - Christina I Trainor
- John Walton Muscular Dystrophy Research Centre, Newcastle University Translational and Clinical Research Institute and Newcastle Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
| | - Elizabeth C Steyn
- Neurology Research Group, Division of Neurology, Department of Medicine, University of Cape Town, Cape Town, South Africa
| | - Amokelani C Mahungu
- Neuroscience Institute, University of Cape Town, Cape Town, South Africa
- Neurology Research Group, Division of Neurology, Department of Medicine, University of Cape Town, Cape Town, South Africa
| | - Patrick J van der Vliet
- Department of Human Genetics, Leiden University Medical Center (LUMC), Leiden, The Netherlands
| | - Ahmet Cevdet Ceylan
- Department of Medical Genetics, Ankara Bilkent City Hospital, Ankara, Turkey
- Faculty of Medicine, Department of Medical Genetics, Ankara Yıldırım Beyazıt University, Ankara, Turkey
| | - A Semra Hiz
- Faculty of Medicine, Pediatric Neurology Department, Dokuz Eylül University, Izmir, Turkey
- Izmir Biomedicine and Genome Center (IBG), Izmir, Turkey
| | - Büşranur Çavdarlı
- Department of Medical Genetics, Ankara Bilkent City Hospital, Ankara, Turkey
| | - C Nur Semerci Gündüz
- Department of Medical Genetics, Ankara Bilkent City Hospital, Ankara, Turkey
- Faculty of Medicine, Department of Medical Genetics, Ankara Yıldırım Beyazıt University, Ankara, Turkey
| | - Gülay Güleç Ceylan
- Department of Medical Genetics, Ankara Bilkent City Hospital, Ankara, Turkey
- Faculty of Medicine, Department of Medical Genetics, Ankara Yıldırım Beyazıt University, Ankara, Turkey
| | - Madhu Nagappa
- Department of Neurology, National Institute of Mental Health and Neurosciences (NIMHANS), Bengaluru, India
| | - Karthik B Tallapaka
- CSIR—Centre for Cellular and Molecular Biology (CCMB), Hyderabad, Telangana, India
| | - Periyasamy Govindaraj
- Diagnostics Division, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, Telangana, India
| | | | - Gayathri Narayanappa
- Department of Neuropathology, National Institute of Mental Health and Neurosciences (NIMHANS), Bengaluru, India
| | - Bevinahalli N Nandeesh
- Department of Neuropathology, National Institute of Mental Health and Neurosciences (NIMHANS), Bengaluru, India
| | - Somwe Wa Somwe
- Department of Clinical Sciences, School of Medicine and Health Sciences, University of Lusaka, Lusaka, Zambia
| | - David R Bearden
- University of Zambia Department of Educational Psychology, Lusaka, Zambia
- Department of Neurology, School of Medicine and Dentistry, University of Rochester Medical Center, Rochester, NY 14642, USA
| | - Michelle P Kvalsund
- Department of Neurology, School of Medicine and Dentistry, University of Rochester Medical Center, Rochester, NY 14642, USA
- Department of Internal Medicine, University of Zambia School of Medicine, Lusaka, Zambia
| | - Gita M Ramdharry
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London WC1N 3BG, UK
| | - Yavuz Oktay
- Izmir International Biomedicine and Genome Institute, Dokuz Eylül University, Izmir, Turkey
- Izmir Biomedicine and Genome Center (IBG), Izmir, Turkey
| | - Uluç Yiş
- Faculty of Medicine, Pediatric Neurology Department, Dokuz Eylül University, Izmir, Turkey
| | | | - Anna Sarkozy
- NIHR Great Ormond Street Hospital Biomedical Research Centre, UCL Great Ormond Street Institute of Child Health, University College London, London WC1N 1EH, UK
| | - Enrico Bugiardini
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London WC1N 3BG, UK
| | - Franclo Henning
- Division of Neurology, Department of Medicine, Stellenbosch University, Cape Town, South Africa
| | - Jo M Wilmshurst
- Neuroscience Institute, University of Cape Town, Cape Town, South Africa
- Division of Paediatric Neurology, Department of Paediatrics and Child Health, Red Cross War Memorial Children’s Hospital, Cape Town, South Africa
| | - Jeannine M Heckmann
- Neuroscience Institute, University of Cape Town, Cape Town, South Africa
- Neurology Research Group, Division of Neurology, Department of Medicine, University of Cape Town, Cape Town, South Africa
| | - Robert McFarland
- Wellcome Centre for Mitochondrial Research, Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne NE2 4HH, UK
- NHS Highly Specialised Service for Rare Mitochondrial Disorders, Newcastle upon Tyne Hospitals NHS Foundation Trust, Newcastle upon Tyne NE1 4LP, UK
| | - Robert W Taylor
- Wellcome Centre for Mitochondrial Research, Translational and Clinical Research Institute, Faculty of Medical Sciences, Newcastle University, Newcastle upon Tyne NE2 4HH, UK
- NHS Highly Specialised Service for Rare Mitochondrial Disorders, Newcastle upon Tyne Hospitals NHS Foundation Trust, Newcastle upon Tyne NE1 4LP, UK
| | - Izelle Smuts
- Department of Paediatrics, Steve Biko Academic Hospital, University of Pretoria, Pretoria, South Africa
| | | | | | - Pedro J Tomaselli
- Department of Neurosciences, School of Medicine of Ribeirão Preto, University of São Paulo, São Paulo, Brazil
| | - Wilson Marques
- Department of Neurosciences, School of Medicine of Ribeirão Preto, University of São Paulo, São Paulo, Brazil
| | - Rohit Bhatia
- Department of Neurology, All India Institute of Medical Sciences (AIIMS), Delhi, India
| | - Ashwin Dalal
- Diagnostics Division, Centre for DNA Fingerprinting and Diagnostics, Hyderabad, Telangana, India
| | - M V Padma Srivastava
- Department of Neurology, All India Institute of Medical Sciences (AIIMS), Delhi, India
| | - Sireesha Yareeda
- Department of Neurology, Nizam’s Institute of Medical Sciences (NIMS), Hyderabad, Telangana, India
| | - Atchayaram Nalini
- Department of Neurology, National Institute of Mental Health and Neurosciences (NIMHANS), Bengaluru, India
| | - Venugopalan Y Vishnu
- Department of Neurology, All India Institute of Medical Sciences (AIIMS), Delhi, India
| | - Kumarasamy Thangaraj
- CSIR—Centre for Cellular and Molecular Biology (CCMB), Hyderabad, Telangana, India
| | - Volker Straub
- John Walton Muscular Dystrophy Research Centre, Newcastle University Translational and Clinical Research Institute and Newcastle Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
| | - Rita Horvath
- Department of Clinical Neurosciences, University of Cambridge, Cambridge CB2 0QQ, UK
| | - Patrick F Chinnery
- Department of Clinical Neurosciences, University of Cambridge, Cambridge CB2 0QQ, UK
| | - Robert D S Pitceathly
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London WC1N 3BG, UK
| | - Francesco Muntoni
- Institute of Child Health and Centre for Neuromuscular Diseases, Neurosciences Unit, The Dubowitz Neuromuscular Centre, University College London, UCL Great Ormond Street, Great Ormond Street Hospital, London WC1N 3JH, UK
- NIHR Great Ormond Street Hospital Biomedical Research Centre, UCL Great Ormond Street Institute of Child Health, University College London, London WC1N 1EH, UK
| | - Henry Houlden
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London WC1N 3BG, UK
| | - Jana Vandrovcova
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London WC1N 3BG, UK
| | - Mary M Reilly
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London WC1N 3BG, UK
| | - Michael G Hanna
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London WC1N 3BG, UK
| |
Collapse
|
26
|
Chen J. Evolutionarily new genes in humans with disease phenotypes reveal functional enrichment patterns shaped by adaptive innovation and sexual selection. RESEARCH SQUARE 2023:rs.3.rs-3632644. [PMID: 38045389 PMCID: PMC10690325 DOI: 10.21203/rs.3.rs-3632644/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
New genes (or young genes) are structural novelties pivotal in mammalian evolution. Their phenotypic impact on humans, however, remains elusive due to the technical and ethical complexities in functional studies. Through combining gene age dating with Mendelian disease phenotyping, our research reveals that new genes associated with disease phenotypes steadily integrate into the human genome at a rate of ~ 0.07% every million years over macroevolutionary timescales. Despite this stable pace, we observe distinct patterns in phenotypic enrichment, pleiotropy, and selective pressures between young and old genes. Notably, young genes show significant enrichment in the male reproductive system, indicating strong sexual selection. Young genes also exhibit functions in tissues and systems potentially linked to human phenotypic innovations, such as increased brain size, bipedal locomotion, and color vision. Our findings further reveal increasing levels of pleiotropy over evolutionary time, which accompanies stronger selective constraints. We propose a "pleiotropy-barrier" model that delineates different potentials for phenotypic innovation between young and older genes subject to natural selection. Our study demonstrates that evolutionary new genes are critical in influencing human reproductive evolution and adaptive phenotypic innovations driven by sexual and natural selection, with low pleiotropy as a selective advantage.
Collapse
|
27
|
Yang J, Liu C, Deng W, Wu D, Weng C, Zhou Y, Wang K. Enhancing Phenotype Recognition in Clinical Notes Using Large Language Models: PhenoBCBERT and PhenoGPT. ARXIV 2023:arXiv:2308.06294v2. [PMID: 37986722 PMCID: PMC10659449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/22/2023]
Abstract
To enhance phenotype recognition in clinical notes of genetic diseases, we developed two models - PhenoBCBERT and PhenoGPT - for expanding the vocabularies of Human Phenotype Ontology (HPO) terms. While HPO offers a standardized vocabulary for phenotypes, existing tools often fail to capture the full scope of phenotypes, due to limitations from traditional heuristic or rule-based approaches. Our models leverage large language models (LLMs) to automate the detection of phenotype terms, including those not in the current HPO. We compared these models to PhenoTagger, another HPO recognition tool, and found that our models identify a wider range of phenotype concepts, including previously uncharacterized ones. Our models also showed strong performance in case studies on biomedical literature. We evaluated the strengths and weaknesses of BERT-based and GPT-based models in aspects such as architecture and accuracy. Overall, our models enhance automated phenotype detection from clinical texts, improving downstream analyses on human diseases.
Collapse
Affiliation(s)
- Jingye Yang
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Mathematics, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Cong Liu
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, USA
| | - Wendy Deng
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Da Wu
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Chunhua Weng
- Department of Biomedical Informatics, Columbia University, New York, NY 10032, USA
| | - Yunyun Zhou
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Biostatistics and Bioinformatics facility, Fox Chase Cancer Center, Philadelphia, PA 19111, USA
| | - Kai Wang
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
28
|
Bosch E, Popp B, Güse E, Skinner C, van der Sluijs PJ, Maystadt I, Pinto AM, Renieri A, Bruno LP, Granata S, Marcelis C, Baysal Ö, Hartwich D, Holthöfer L, Isidor B, Cogne B, Wieczorek D, Capra V, Scala M, De Marco P, Ognibene M, Jamra RA, Platzer K, Carter LB, Kuismin O, van Haeringen A, Maroofian R, Valenzuela I, Cuscó I, Martinez-Agosto JA, Rabani AM, Mefford HC, Pereira EM, Close C, Anyane-Yeboa K, Wagner M, Hannibal MC, Zacher P, Thiffault I, Beunders G, Umair M, Bhola PT, McGinnis E, Millichap J, van de Kamp JM, Prijoles EJ, Dobson A, Shillington A, Graham BH, Garcia EJ, Galindo MK, Ropers FG, Nibbeling EAR, Hubbard G, Karimov C, Goj G, Bend R, Rath J, Morrow MM, Millan F, Salpietro V, Torella A, Nigro V, Kurki M, Stevenson RE, Santen GWE, Zweier M, Campeau PM, Severino M, Reis A, Accogli A, Vasileiou G. Elucidating the clinical and molecular spectrum of SMARCC2-associated NDD in a cohort of 65 affected individuals. Genet Med 2023; 25:100950. [PMID: 37551667 DOI: 10.1016/j.gim.2023.100950] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 08/01/2023] [Accepted: 08/01/2023] [Indexed: 08/09/2023] Open
Abstract
PURPOSE Coffin-Siris and Nicolaides-Baraitser syndromes are recognizable neurodevelopmental disorders caused by germline variants in BAF complex subunits. The SMARCC2 BAFopathy was recently reported. Herein, we present clinical and molecular data on a large cohort. METHODS Clinical symptoms for 41 novel and 24 previously published affected individuals were analyzed using the Human Phenotype Ontology. For genotype-phenotype correlations, molecular data were standardized and grouped into non-truncating and likely gene-disrupting (LGD) variants. Missense variant protein expression and BAF-subunit interactions were examined using 3D protein modeling, co-immunoprecipitation, and proximity-ligation assays. RESULTS Neurodevelopmental delay with intellectual disability, muscular hypotonia, and behavioral disorders were the major manifestations. Clinical hallmarks of BAFopathies were rare. Clinical presentation differed significantly, with LGD variants being predominantly inherited and associated with mildly reduced or normal cognitive development, whereas non-truncating variants were mostly de novo and presented with severe developmental delay. These distinct manifestations and non-truncating variant clustering in functional domains suggest different pathomechanisms. In vitro testing showed decreased protein expression for N-terminal missense variants similar to LGD. CONCLUSION This study improved SMARCC2 variant classification and identified discernible SMARCC2-associated phenotypes for LGD and non-truncating variants, which were distinct from other BAFopathies. The pathomechanism of most non-truncating variants has yet to be investigated.
Collapse
Affiliation(s)
- Elisabeth Bosch
- Institute of Human Genetics, Universitätsklinikum Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| | - Bernt Popp
- Berlin Institute of Health at Charitè, Universitätsklinikum Berlin, Centre of Functional Genomics, Berlin, Germany; Institute of Human Genetics, University of Leipzig Medical Center, Leipzig, Germany
| | - Esther Güse
- Institute of Human Genetics, Universitätsklinikum Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| | | | | | - Isabelle Maystadt
- Center for Human Genetics, Institute of Pathology and Genetics, Gosselies, Belgium
| | - Anna Maria Pinto
- Genetica Medica, Azienda Ospedaliera Universitaria Senese, Siena, Italy
| | - Alessandra Renieri
- Genetica Medica, Azienda Ospedaliera Universitaria Senese, Siena, Italy; Medical Genetics Unit, University of Siena, Siena, Italy
| | - Lucia Pia Bruno
- Telethon Institute of Genetics and Medicine (TIGEM), Naples, Italy
| | - Stefania Granata
- Genetica Medica, Azienda Ospedaliera Universitaria Senese, Siena, Italy; Medical Genetics Unit, University of Siena, Siena, Italy
| | - Carlo Marcelis
- Human Genetics department, Radboud university medical center, Nijmegen, The Netherlands
| | - Özlem Baysal
- Human Genetics department, Radboud university medical center, Nijmegen, The Netherlands
| | - Dewi Hartwich
- Institute of Human Genetics - University Medical Center of the Johannes Gutenberg University Mainz, Germany
| | - Laura Holthöfer
- Institute of Human Genetics - University Medical Center of the Johannes Gutenberg University Mainz, Germany
| | - Bertrand Isidor
- Nantes Université, CHU de Nantes, Service de Génétique médicale, Nantes, France; Nantes Université, CHU de Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | - Benjamin Cogne
- Nantes Université, CHU de Nantes, Service de Génétique médicale, Nantes, France; Nantes Université, CHU de Nantes, CNRS, INSERM, l'institut du thorax, Nantes, France
| | - Dagmar Wieczorek
- Institute of Human Genetics, Medical Faculty and University Hospital Düsseldorf, Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany
| | - Valeria Capra
- Genomics and Clinical Genetics Unit, IRCCS Istituto Giannina Gaslini, Genoa, Italy
| | - Marcello Scala
- Department of Neurosciences, Rehabilitation, Ophthalmology, Genetics, Maternal and Child Health, University of Genoa, Genoa, Italy; Medical Genetics Unit, IRCCS Istituto Giannina Gaslini, Genoa, Italy
| | - Patrizia De Marco
- Medical Genetics Unit, IRCCS Istituto Giannina Gaslini, Genoa, Italy
| | - Marzia Ognibene
- Medical Genetics Unit, IRCCS Istituto Giannina Gaslini, Genoa, Italy
| | - Rami Abou Jamra
- Institute of Human Genetics, University of Leipzig Medical Center, Leipzig, Germany
| | - Konrad Platzer
- Institute of Human Genetics, University of Leipzig Medical Center, Leipzig, Germany
| | - Lauren B Carter
- Department of Pediatrics, Division of Medical Genetics, Levine Children's Hospital, Atrium Health, Charlotte, NC
| | - Outi Kuismin
- Department of Clinical Genetics, Research Unit of Clinical Medicine, Medical Research Center Oulu, Oulu University Hospital and University of Oulu, Oulu, Finland
| | - Arie van Haeringen
- Department of Clinical Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Reza Maroofian
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology, University College London, London, United Kingdom
| | - Irene Valenzuela
- Department of Clinical and Molecular Genetics, University Hospital Vall d'Hebron, Barcelona, Spain; Medicine Genetics Group, Valle Hebron Research Institute, Barcelona, Spain
| | - Ivon Cuscó
- Department of Clinical and Molecular Genetics, University Hospital Vall d'Hebron, Barcelona, Spain; Medicine Genetics Group, Valle Hebron Research Institute, Barcelona, Spain
| | - Julian A Martinez-Agosto
- Departments of Human Genetics, Pediatrics, and Psychiatry, UCLA David Geffen School of Medicine, Los Angeles, CA
| | - Ahna M Rabani
- Department of Pediatrics & Institute for Precision Health, UCLA David Geffen School of Medicine, Los Angeles, CA
| | - Heather C Mefford
- Center for Pediatric Neurological Disease Research, St. Jude Children's Research Hospital, Memphis, TN
| | - Elaine M Pereira
- Division of Clinical Genetics, Department of Pediatrics, Columbia University Irving Medical Center, New York, NY
| | - Charlotte Close
- Division of Clinical Genetics, Department of Pediatrics, Columbia University Irving Medical Center, New York, NY
| | - Kwame Anyane-Yeboa
- Division of Clinical Genetics, Department of Pediatrics, Columbia University Irving Medical Center, New York, NY
| | - Mallory Wagner
- Division of Pediatric Genetics, Metabolism, and Genomic Medicine, Department of Pediatrics, University of Michigan Health System, University of Michigan, Ann Arbor, MI
| | - Mark C Hannibal
- Division of Pediatric Genetics, Metabolism, and Genomic Medicine, Department of Pediatrics, University of Michigan Health System, University of Michigan, Ann Arbor, MI
| | - Pia Zacher
- Epilepsy Center Kleinwachau, Radeberg, Germany
| | - Isabelle Thiffault
- Department of Pediatrics and Pathology, Genomic Medicine Center, Children's Mercy Kansas City and Children's Mercy Research Institute, Kansas City, MO
| | - Gea Beunders
- Department of Genetics, University Medical Center Groningen, Groningen, The Netherlands
| | - Muhammad Umair
- Medical Genomics Research Department, King Abdullah International Medical Research Center (KAIMRC), King Saud Bin Abdulaziz University for Health Sciences, King Abdulaziz Medical City, MNGHA, Riyadh, Saudi Arabia; Department of Life Sciences, School of Science, University of Management and Technology (UMT), Lahore, Pakistan
| | - Priya T Bhola
- Department of Genetics, Children's Hospital of Eastern Ontario (CHEO), Ottawa, Canada
| | - Erin McGinnis
- Division of Neurology, Department of Pediatrics, Ann & Robert H. Lurie Children's Hospital of Chicago, Chicago, IL
| | - John Millichap
- Department of Neurology, Northwestern University Feinberg School of Medicine, Chicago, IL
| | - Jiddeke M van de Kamp
- Department of Human Genetics, Amsterdam UMC, location VU Medical Center, Amsterdam, The Netherlands
| | | | | | - Amelle Shillington
- Department of Human Genetics, Cincinnati Children's Hospital Medical Center, Cincinnati, OH
| | - Brett H Graham
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN
| | - Evan-Jacob Garcia
- Department of Medical and Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN
| | | | - Fabienne G Ropers
- Willem-Alexander Children's Hospital, Department of Pediatrics, Leiden University Medical Center, The Netherlands
| | - Esther A R Nibbeling
- Department of Clinical Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Gail Hubbard
- Department of Medical Genetics, Children's Hospital Los Angeles, Keck School of Medicine of University of Southern California, Los Angeles, CA
| | - Catherine Karimov
- Department of Medical Genetics, Children's Hospital Los Angeles, Keck School of Medicine of University of Southern California, Los Angeles, CA
| | - Guido Goj
- Vestische Kinder- und Jugendklinik, Datteln, Germany
| | - Renee Bend
- PreventionGenetics, Part of Exact Sciences, Marshfield, WI
| | - Julie Rath
- PreventionGenetics, Part of Exact Sciences, Marshfield, WI
| | | | | | - Vincenzo Salpietro
- Department of Neuromuscular Disorders, Queen Square Institute of Neurology, University College London, London, United Kingdom; Department of Biotechnological and Applied Clinical Sciences, University of L'Aquila, L'Aquila, Italy
| | - Annalaura Torella
- Telethon Institute of Genetics and Medicine (TIGEM), Naples, Italy; Department of Precision Medicine, University of Campania "Luigi Vanvitelli," Naples, Italy
| | - Vincenzo Nigro
- Telethon Institute of Genetics and Medicine (TIGEM), Naples, Italy; Department of Precision Medicine, University of Campania "Luigi Vanvitelli," Naples, Italy
| | - Mitja Kurki
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland; Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA
| | | | - Gijs W E Santen
- Department of Clinical Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Markus Zweier
- Institute of Medical Genetics, University of Zürich, Schlieren-Zurich, Switzerland
| | - Philippe M Campeau
- Department of Pediatrics, CHU Sainte-Justine and University of Montreal, Montreal, QC, Canada
| | | | - André Reis
- Institute of Human Genetics, Universitätsklinikum Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany; Centre for Rare Diseases Erlangen (ZSEER), Erlangen, Germany
| | - Andrea Accogli
- Department of Specialized Medicine, Division of Medical Genetics, McGill University Health Centre; Department of Human Genetics, Faculty of Medicine, McGill University, Montreal, QC, Canada
| | - Georgia Vasileiou
- Institute of Human Genetics, Universitätsklinikum Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany; Centre for Rare Diseases Erlangen (ZSEER), Erlangen, Germany.
| |
Collapse
|
29
|
Ahmad N, Fazeli W, Schließke S, Lesca G, Gokce-Samar Z, Mekbib KY, Jin SC, Burton J, Hoganson G, Petersen A, Gracie S, Granger L, Bartels E, Oppermann H, Kundishora A, Till M, Milleret-Pignot C, Dangerfield S, Viskochil D, Anderson KJ, Palculict TB, Schnur RE, Wentzensen IM, Tiller GE, Kahle KT, Kunz WS, Burkart S, Simons M, Sticht H, Abou Jamra R, Neuser S. De Novo Variants in RAB11B Cause Various Degrees of Global Developmental Delay and Intellectual Disability in Children. Pediatr Neurol 2023; 148:164-171. [PMID: 37734130 DOI: 10.1016/j.pediatrneurol.2023.08.023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 07/20/2023] [Accepted: 08/15/2023] [Indexed: 09/23/2023]
Abstract
BACKGROUND RAB11B was described previously once with a severe form of intellectual disability. We aim at validation and delineation of the role of RAB11B in neurodevelopmental disorders. METHODS We present seven novel individuals with disease-associated variants in RAB11B when compared with the six cases described in the literature. We performed a cross-sectional analysis to identify the clinical spectrum and the core phenotype. Additionally, structural effects of the variants were assessed by molecular modeling. RESULTS Seven distinct de novo missense variants were identified, three of them recurrent (p.(Gly21Arg), p.(Val22Met), and p.(Ala68Thr)). Molecular modeling suggests that those variants either affect the nucleotide binding (at amino acid positions 21, 22, 33, 68) or the interaction with effector molecules (at positions 72 and 75). Our data confirmed the main manifestations as neurodevelopmental disorder with intellectual disability (85%), muscular hypotonia (83%), structural brain anomalies (77%), and visual impairment (70%). Combined analysis indicates a genotype-phenotype correlation; variants impacting the nucleotide binding cause a severe phenotype with intellectual disability, and variants outside the binding pocket lead to a milder phenotype with epilepsy. CONCLUSIONS We confirm that disease-associated missense variants in RAB11B cause a neurodevelopmental disorder and suggest a genotype-phenotype correlation based on the impact on nucleotide binding functionality of RAB11B.
Collapse
Affiliation(s)
- Natalie Ahmad
- Institute of Human Genetics, University of Leipzig Medical Center, Leipzig, Germany
| | - Walid Fazeli
- Department of Pediatric Neurology, University Hospital Bonn, Bonn, Germany
| | - Sophia Schließke
- Institute of Human Genetics, University of Leipzig Medical Center, Leipzig, Germany
| | - Gaetan Lesca
- Department of Medical Genetics, Lyon University Hospital, University of Lyon, UCB1, Lyon, France
| | | | - Kedous Y Mekbib
- Department of Neurosurgery, Yale University School of Medicine, New Haven, Connecticut; Department of Neurosurgery, Massachusetts General Hospital, Boston, Massachusetts
| | - Sheng Chih Jin
- Department of Genetics, Washington University School of Medicine, St. Louis, Missouri
| | - Jennifer Burton
- University of Illinois College of Medicine, Peoria, Illinois
| | - George Hoganson
- University of Illinois College of Medicine, Peoria, Illinois
| | - Andrea Petersen
- Department of Genetics and Metabolism, Randall Children's Hospital, Portland, Oregon
| | - Sara Gracie
- Department of Genetics and Metabolism, Randall Children's Hospital, Portland, Oregon
| | - Leslie Granger
- Department of Genetics and Metabolism, Randall Children's Hospital, Portland, Oregon
| | - Enrika Bartels
- Institute of Clinical Genetics and Tumor Genetics, Bonn, Germany
| | - Henry Oppermann
- Institute of Human Genetics, University of Leipzig Medical Center, Leipzig, Germany
| | - Adam Kundishora
- Department of Neurosurgery, Yale University School of Medicine, New Haven, Connecticut
| | - Marianne Till
- Department of Medical Genetics, Lyon University Hospital, University of Lyon, UCB1, Lyon, France
| | | | | | | | - Katherine J Anderson
- University of Utah, Salt Lake City, Utah; Department of Pediatrics, University of Vermont Medical Center, Burlington, Vermont
| | | | | | | | - George E Tiller
- Department of Genetics, Kaiser Permanente, Los Angeles, California
| | - Kristopher T Kahle
- Department of Neurosurgery, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts; Broad Institute of MIT and Harvard, Cambridge, Massachusetts
| | - Wolfram S Kunz
- Department of Epileptology, University Hospital Bonn, Bonn, Germany
| | - Sebastian Burkart
- Institute of Human Genetics, University Hospital Heidelberg, Heidelberg, Germany
| | - Matias Simons
- Institute of Human Genetics, University Hospital Heidelberg, Heidelberg, Germany
| | - Heinrich Sticht
- Institute of Biochemistry, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany
| | - Rami Abou Jamra
- Institute of Human Genetics, University of Leipzig Medical Center, Leipzig, Germany
| | - Sonja Neuser
- Institute of Human Genetics, University of Leipzig Medical Center, Leipzig, Germany.
| |
Collapse
|
30
|
Boulogne F, Claus LR, Wiersma H, Oelen R, Schukking F, de Klein N, Li S, Westra HJ, van der Zwaag B, van Reekum F, Sierks D, Schönauer R, Li Z, Bijlsma EK, Bos WJW, Halbritter J, Knoers NVAM, Besse W, Deelen P, Franke L, van Eerde AM. KidneyNetwork: using kidney-derived gene expression data to predict and prioritize novel genes involved in kidney disease. Eur J Hum Genet 2023; 31:1300-1308. [PMID: 36807342 PMCID: PMC10620423 DOI: 10.1038/s41431-023-01296-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2022] [Revised: 11/24/2022] [Accepted: 01/18/2023] [Indexed: 02/22/2023] Open
Abstract
Genetic testing in patients with suspected hereditary kidney disease may not reveal the genetic cause for the disorder as potentially pathogenic variants can reside in genes that are not yet known to be involved in kidney disease. We have developed KidneyNetwork, that utilizes tissue-specific expression to inform candidate gene prioritization specifically for kidney diseases. KidneyNetwork is a novel method constructed by integrating a kidney RNA-sequencing co-expression network of 878 samples with a multi-tissue network of 31,499 samples. It uses expression patterns and established gene-phenotype associations to predict which genes could be related to what (disease) phenotypes in an unbiased manner. We applied KidneyNetwork to rare variants in exome sequencing data from 13 kidney disease patients without a genetic diagnosis to prioritize candidate genes. KidneyNetwork can accurately predict kidney-specific gene functions and (kidney disease) phenotypes for disease-associated genes. The intersection of prioritized genes with genes carrying rare variants in a patient with kidney and liver cysts identified ALG6 as plausible candidate gene. We strengthen this plausibility by identifying ALG6 variants in several cystic kidney and liver disease cases without alternative genetic explanation. We present KidneyNetwork, a publicly available kidney-specific co-expression network with optimized gene-phenotype predictions for kidney disease phenotypes. We designed an easy-to-use online interface that allows clinicians and researchers to use gene expression and co-regulation data and gene-phenotype connections to accelerate advances in hereditary kidney disease diagnosis and research. TRANSLATIONAL STATEMENT: Genetic testing in patients with suspected hereditary kidney disease may not reveal the genetic cause for the patient's disorder. Potentially pathogenic variants can reside in genes not yet known to be involved in kidney disease, making it difficult to interpret the relevance of these variants. This reveals a clear need for methods to predict the phenotypic consequences of genetic variation in an unbiased manner. Here we describe KidneyNetwork, a tool that utilizes tissue-specific expression to predict kidney-specific gene functions. Applying KidneyNetwork to a group of undiagnosed cases identified ALG6 as a candidate gene in cystic kidney and liver disease. In summary, KidneyNetwork can aid the interpretation of genetic variants and can therefore be of value in translational nephrogenetics and help improve the diagnostic yield in kidney disease patients.
Collapse
Affiliation(s)
- Floranne Boulogne
- Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
- Oncode Institute, Utrecht, The Netherlands
| | - Laura R Claus
- Department of Genetics, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Henry Wiersma
- Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Roy Oelen
- Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
- Oncode Institute, Utrecht, The Netherlands
| | - Floor Schukking
- Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Niek de Klein
- Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Shuang Li
- Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
- Genomics Coordination Center, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
| | - Harm-Jan Westra
- Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
- Oncode Institute, Utrecht, The Netherlands
| | - Bert van der Zwaag
- Department of Genetics, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Franka van Reekum
- Department of Nephrology, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Dana Sierks
- Medical Department III - Endocrinology, Nephrology, Rheumatology Department of Internal Medicine, Division of Nephrology, University of Leipzig Medical Center, Leipzig, Germany
| | - Ria Schönauer
- Medical Department III - Endocrinology, Nephrology, Rheumatology Department of Internal Medicine, Division of Nephrology, University of Leipzig Medical Center, Leipzig, Germany
- Department of Nephrology and Medical Intensive Care, Charité-Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
| | - Zhigui Li
- Department of Internal Medicine (Nephrology), Yale School of Medicine, New Haven, CT, USA
| | - Emilia K Bijlsma
- Department of Clinical Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Willem Jan W Bos
- Department of Internal Medicine, St Antonius Hospital, Nieuwegein, The Netherlands
- Department of Internal Medicine, Leiden University Medical Center, Leiden, The Netherlands
| | - Jan Halbritter
- Medical Department III - Endocrinology, Nephrology, Rheumatology Department of Internal Medicine, Division of Nephrology, University of Leipzig Medical Center, Leipzig, Germany
- Department of Nephrology and Medical Intensive Care, Charité-Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany
| | - Nine V A M Knoers
- Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Whitney Besse
- Department of Internal Medicine (Nephrology), Yale School of Medicine, New Haven, CT, USA
| | - Patrick Deelen
- Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
- Oncode Institute, Utrecht, The Netherlands
- Department of Genetics, University Medical Center Utrecht, Utrecht, The Netherlands
| | - Lude Franke
- Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
- Oncode Institute, Utrecht, The Netherlands
| | - Albertien M van Eerde
- Department of Genetics, University Medical Center Utrecht, Utrecht, The Netherlands.
| |
Collapse
|
31
|
Alsentzer E, Finlayson SG, Li MM, Kobren SN, Kohane IS. Simulation of undiagnosed patients with novel genetic conditions. Nat Commun 2023; 14:6403. [PMID: 37828001 PMCID: PMC10570269 DOI: 10.1038/s41467-023-41980-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 09/26/2023] [Indexed: 10/14/2023] Open
Abstract
Rare Mendelian disorders pose a major diagnostic challenge and collectively affect 300-400 million patients worldwide. Many automated tools aim to uncover causal genes in patients with suspected genetic disorders, but evaluation of these tools is limited due to the lack of comprehensive benchmark datasets that include previously unpublished conditions. Here, we present a computational pipeline that simulates realistic clinical datasets to address this deficit. Our framework jointly simulates complex phenotypes and challenging candidate genes and produces patients with novel genetic conditions. We demonstrate the similarity of our simulated patients to real patients from the Undiagnosed Diseases Network and evaluate common gene prioritization methods on the simulated cohort. These prioritization methods recover known gene-disease associations but perform poorly on diagnosing patients with novel genetic disorders. Our publicly-available dataset and codebase can be utilized by medical genetics researchers to evaluate, compare, and improve tools that aid in the diagnostic process.
Collapse
Grants
- U01 HG007690 NHGRI NIH HHS
- U54 NS108251 NINDS NIH HHS
- U01 HG010219 NHGRI NIH HHS
- U01 HG007672 NHGRI NIH HHS
- U01 HG010233 NHGRI NIH HHS
- U01 HG010230 NHGRI NIH HHS
- U01 HG007943 NHGRI NIH HHS
- U01 HG010217 NHGRI NIH HHS
- U01 HG007942 NHGRI NIH HHS
- U01 HG010215 NHGRI NIH HHS
- U01 HG007708 NHGRI NIH HHS
- T32 HG002295 NHGRI NIH HHS
- T32 GM007753 NIGMS NIH HHS
- U01 HG007674 NHGRI NIH HHS
- U01 TR001395 NCATS NIH HHS
- U01 HG007709 NHGRI NIH HHS
- U54 NS093793 NINDS NIH HHS
- U01 HG007530 NHGRI NIH HHS
- U01 TR002471 NCATS NIH HHS
- U01 HG007703 NHGRI NIH HHS
- UDN research reported in this manuscript was supported by the NIH Common Fund, through the Office of Strategic Coordination/Office of the NIH Director under Award Number(s) U01HG007709, U01HG010219, U01HG010230, U01HG010217, U01HG010233, U01HG010215, U01HG007672, U01HG007690, U01HG007708, U01HG007703, U01HG007674, U01HG007530, U01HG007942, U01HG007943, U01TR001395, U01TR002471, U54NS108251, and U54NS093793.
- E.A. is supported by a Microsoft Research PhD Fellowship.
- S.F. is supported by award Number T32GM007753 from the National Institute of General Medical Sciences.
- M.L. is supported by T32HG002295 from the National Human Genome Research Institute and a National Science Foundation Graduate Research Fellowship.
Collapse
Affiliation(s)
- Emily Alsentzer
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
- Program in Health Sciences and Technology, MIT, Cambridge, MA, 02139, USA
| | - Samuel G Finlayson
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
- Program in Health Sciences and Technology, MIT, Cambridge, MA, 02139, USA
- Department of Pediatrics, Division of Genetic Medicine, Seattle Children's Hospital, Seattle, WA, 98105, USA
- Division of Medical Genetics, Department of Medicine, University of Washington, Seattle, WA, 98105, USA
| | - Michelle M Li
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA
- Bioinformatics and Integrative Genomics, Harvard Medical School, Boston, MA, 02115, USA
| | - Shilpa N Kobren
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA.
| | - Isaac S Kohane
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, 02115, USA.
| |
Collapse
|
32
|
Lv Q, Chen G, He H, Yang Z, Zhao L, Chen HY, Chen CYC. TCMBank: bridges between the largest herbal medicines, chemical ingredients, target proteins, and associated diseases with intelligence text mining. Chem Sci 2023; 14:10684-10701. [PMID: 37829020 PMCID: PMC10566508 DOI: 10.1039/d3sc02139d] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Accepted: 07/30/2023] [Indexed: 10/14/2023] Open
Abstract
Traditional Chinese Medicine (TCM) has long been viewed as a precious source of modern drug discovery. AI-assisted drug discovery (AIDD) has been investigated extensively. However, there are still two challenges in applying AIDD to guide TCM drug discovery: the lack of a large amount of standardized TCM-related information and AIDD is prone to pathological failures in out-of-domain data. We have released TCM Database@Taiwan in 2011, and it has been widely disseminated and used. Now, we developed TCMBank, the largest systematic free TCM database, which is an extension of TCM Database@Taiwan. TCMBank contains 9192 herbs, 61 966 ingredients (unduplicated), 15 179 targets, 32 529 diseases, and their pairwise relationships. By integrating multiple data sources, TCMBank provides 3D structure information of ingredients and provides a standard list and detailed information on herbs, ingredients, targets and diseases. TCMBank has an intelligent document identification module that continuously adds TCM-related information retrieved from the literature in PubChem. In addition, driven by TCMBank big data, we developed an ensemble learning-based drug discovery protocol for identifying potential leads and drug repurposing. We take colorectal cancer and Alzheimer's disease as examples to demonstrate how to accelerate drug discovery by artificial intelligence. Using TCMBank, researchers can view literature-driven relationship mapping between herbs/ingredients and genes/diseases, allowing the understanding of molecular action mechanisms for ingredients and identification of new potentially effective treatments. TCMBank is available at https://TCMBank.CN/.
Collapse
Affiliation(s)
- Qiujie Lv
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University Shenzhen Guangdong 518107 P. R. China
| | - Guanxing Chen
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University Shenzhen Guangdong 518107 P. R. China
| | - Haohuai He
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University Shenzhen Guangdong 518107 P. R. China
| | - Ziduo Yang
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University Shenzhen Guangdong 518107 P. R. China
| | - Lu Zhao
- Department of Clinical Laboratory, The Sixth Affiliated Hospital, Sun Yat-sen University Guangzhou Guangdong 510655 P. R. China
- Biomedical Innovation Center, The Sixth Affiliated Hospital, Sun Yat-sen University Guangzhou Guangdong 510655 P. R. China
| | - Hsin-Yi Chen
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University Shenzhen Guangdong 518107 P. R. China
| | - Calvin Yu-Chian Chen
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University Shenzhen Guangdong 518107 P. R. China
- Department of Medical Research, China Medical University Hospital Taichung 40447 Taiwan
- Department of Bioinformatics and Medical Engineering, Asia University Taichung 41354 Taiwan
- Guangdong L-Med Medicine Biotechnology Co., Ltd Meizhou Guangdong 514699 P. R. China
| |
Collapse
|
33
|
Budu-Aggrey A, Kilanowski A, Sobczyk MK, Shringarpure SS, Mitchell R, Reis K, Reigo A, Mägi R, Nelis M, Tanaka N, Brumpton BM, Thomas LF, Sole-Navais P, Flatley C, Espuela-Ortiz A, Herrera-Luis E, Lominchar JVT, Bork-Jensen J, Marenholz I, Arnau-Soler A, Jeong A, Fawcett KA, Baurecht H, Rodriguez E, Alves AC, Kumar A, Sleiman PM, Chang X, Medina-Gomez C, Hu C, Xu CJ, Qi C, El-Heis S, Titcombe P, Antoun E, Fadista J, Wang CA, Thiering E, Wu B, Kress S, Kothalawala DM, Kadalayil L, Duan J, Zhang H, Hadebe S, Hoffmann T, Jorgenson E, Choquet H, Risch N, Njølstad P, Andreassen OA, Johansson S, Almqvist C, Gong T, Ullemar V, Karlsson R, Magnusson PKE, Szwajda A, Burchard EG, Thyssen JP, Hansen T, Kårhus LL, Dantoft TM, Jeanrenaud ACSN, Ghauri A, Arnold A, Homuth G, Lau S, Nöthen MM, Hübner N, Imboden M, Visconti A, Falchi M, Bataille V, Hysi P, Ballardini N, Boomsma DI, Hottenga JJ, Müller-Nurasyid M, Ahluwalia TS, Stokholm J, Chawes B, Schoos AMM, Esplugues A, Bustamante M, Raby B, Arshad S, German C, Esko T, Milani LA, Metspalu A, Terao C, Abuabara K, Løset M, Hveem K, Jacobsson B, Pino-Yanes M, Strachan DP, Grarup N, Linneberg A, Lee YA, Probst-Hensch N, Weidinger S, Jarvelin MR, Melén E, Hakonarson H, Irvine AD, Jarvis D, Nijsten T, Duijts L, Vonk JM, Koppelmann GH, Godfrey KM, Barton SJ, Feenstra B, Pennell CE, Sly PD, Holt PG, Williams LK, Bisgaard H, Bønnelykke K, Curtin J, Simpson A, Murray C, Schikowski T, Bunyavanich S, Weiss ST, Holloway JW, Min JL, Brown SJ, Standl M, Paternoster L. European and multi-ancestry genome-wide association meta-analysis of atopic dermatitis highlights importance of systemic immune regulation. Nat Commun 2023; 14:6172. [PMID: 37794016 PMCID: PMC10550990 DOI: 10.1038/s41467-023-41180-2] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Accepted: 08/24/2023] [Indexed: 10/06/2023] Open
Abstract
Atopic dermatitis (AD) is a common inflammatory skin condition and prior genome-wide association studies (GWAS) have identified 71 associated loci. In the current study we conducted the largest AD GWAS to date (discovery N = 1,086,394, replication N = 3,604,027), combining previously reported cohorts with additional available data. We identified 81 loci (29 novel) in the European-only analysis (which all replicated in a separate European analysis) and 10 additional loci in the multi-ancestry analysis (3 novel). Eight variants from the multi-ancestry analysis replicated in at least one of the populations tested (European, Latino or African), while two may be specific to individuals of Japanese ancestry. AD loci showed enrichment for DNAse I hypersensitivity and eQTL associations in blood. At each locus we prioritised candidate genes by integrating multi-omic data. The implicated genes are predominantly in immune pathways of relevance to atopic inflammation and some offer drug repurposing opportunities.
Collapse
Affiliation(s)
- Ashley Budu-Aggrey
- Medical Research Council Integrative Epidemiology Unit, Bristol Medical School, University of Bristol, Bristol, England
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, England
| | - Anna Kilanowski
- Institute of Epidemiology, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany
- Division of Metabolic and Nutritional Medicine, Dr. von Hauner Children's Hospital, University of Munich Medical Center, Munich, Germany
- Pettenkofer School of Public Health, Ludwig-Maximilians University Munich, Munich, Germany
| | - Maria K Sobczyk
- Medical Research Council Integrative Epidemiology Unit, Bristol Medical School, University of Bristol, Bristol, England
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, England
| | | | - Ruth Mitchell
- Medical Research Council Integrative Epidemiology Unit, Bristol Medical School, University of Bristol, Bristol, England
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, England
| | - Kadri Reis
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Anu Reigo
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Reedik Mägi
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Mari Nelis
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
- Core Facility of Genomics, University of Tartu, Tartu, Estonia
| | - Nao Tanaka
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Department of Rheumatology, Graduate School of Medical and Dental Sciences, Tokyo Medical and Dental University (TMDU), Tokyo, Japan
| | - Ben M Brumpton
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, 7030, Norway
- HUNT Research Centre, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Levanger, 7600, Norway
- Clinic of Medicine, St. Olavs Hospital, Trondheim University Hospital, Trondheim, 7030, Norway
| | - Laurent F Thomas
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, 7030, Norway
- Department of Clinical and Molecular Medicine, NTNU Norwegian University of Science and Technology, Trondheim, Norway
- BioCore - Bioinformatics Core Facility, NTNU, Norwegian University of Science and Technology, Trondheim, Norway
- Clinic of Laboratory Medicine, St. Olavs Hospital, Trondheim University Hospital, Trondheim, Norway
| | - Pol Sole-Navais
- Department of Obstetrics and Gynecology, Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
| | - Christopher Flatley
- Department of Obstetrics and Gynecology, Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
| | - Antonio Espuela-Ortiz
- Genomics and Health Group, Department of Biochemistry, Microbiology, Cell Biology and Genetics, Universidad de La Laguna, La Laguna, Tenerife, Spain
| | - Esther Herrera-Luis
- Genomics and Health Group, Department of Biochemistry, Microbiology, Cell Biology and Genetics, Universidad de La Laguna, La Laguna, Tenerife, Spain
| | - Jesus V T Lominchar
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, København, Denmark
| | - Jette Bork-Jensen
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, København, Denmark
| | - Ingo Marenholz
- Max-Delbrück-Center for Molecular Medicine, Berlin, Germany
- Clinic for Pediatric Allergy, Experimental and Clinical Research Center, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Aleix Arnau-Soler
- Max-Delbrück-Center for Molecular Medicine, Berlin, Germany
- Clinic for Pediatric Allergy, Experimental and Clinical Research Center, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Ayoung Jeong
- Swiss Tropical and Public Health Institute, CH-4123, Basel, Switzerland
- University of Basel, CH-4001, Basel, Switzerland
| | - Katherine A Fawcett
- Department of Health Sciences, University of Leicester, Leicester, LE1 7RH, UK
| | - Hansjorg Baurecht
- Department of Epidemiology and Preventive Medicine, University of Regensburg, Regensburg, Germany
| | - Elke Rodriguez
- Department of Dermatology and Allergy, University Hospital Schleswig-Holstein, Kiel, Germany
| | | | - Ashish Kumar
- Department of Clinical Science and Education Södersjukhuset, Karolinska Institutet, Solna, Sweden
| | - Patrick M Sleiman
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, USA
- Rhythm Pharmaceuticals, 222 Berkley Street, Boston, 02116, USA
| | - Xiao Chang
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA
| | - Carolina Medina-Gomez
- The Generation R Study Group, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
- Department of Internal Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Chen Hu
- The Generation R Study Group, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
- Department of Dermatology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Cheng-Jian Xu
- University of Groningen, University Medical Center Groningen, Department of Pediatric Pulmonology and Pediatric Allergy, Beatrix Children's Hospital, Groningen, The Netherlands
- University of Groningen, University Medical Center Groningen, GRIAC Research Institute, Groningen, The Netherlands
- Centre for Individualized Infection Medicine, CiiM, a joint venture between Hannover Medical School and the Helmholtz Centre for Infection Research, Hannover, Germany
- TWINCORE, Centre for Experimental and Clinical Infection Research, a joint venture between the Hannover Medical School and the Helmholtz Centre for Infection Research, Hannover, Germany
| | - Cancan Qi
- University of Groningen, University Medical Center Groningen, Department of Pediatric Pulmonology and Pediatric Allergy, Beatrix Children's Hospital, Groningen, The Netherlands
- University of Groningen, University Medical Center Groningen, GRIAC Research Institute, Groningen, The Netherlands
| | - Sarah El-Heis
- MRC Lifecourse Epidemiology Centre, University of Southampton, Southampton, UK
| | - Philip Titcombe
- MRC Lifecourse Epidemiology Centre, University of Southampton, Southampton, UK
| | - Elie Antoun
- Faculty of Medicine, University of Southampton, Southampton, UK
- Institute of Developmental Sciences, University of Southampton, Southampton, UK
| | - João Fadista
- Department of Bioinformatics & Data Mining, Måløv, Denmark
- Department of Epidemiology Research, Statens Serum Institut, Copenhagen, Denmark
- Department of Clinical Sciences, Lund University Diabetes Centre, Malmö, Sweden
- Institute for Molecular Medicine Finland (FIMM), University of Helsinki, Helsinki, Finland
| | - Carol A Wang
- School of Medicine and Public Health, University of Newcastle, Newcastle, NSW, Australia
- Hunter Medical Research Institute, Newcastle, NSW, Australia
| | - Elisabeth Thiering
- Institute of Epidemiology, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany
- Division of Metabolic and Nutritional Medicine, Dr. von Hauner Children's Hospital, University of Munich Medical Center, Munich, Germany
| | - Baojun Wu
- Center for Individualized and Genomic Medicine Research (CIGMA), Department of Medicine, Henry Ford Health, Detroit, MI, 48104, USA
| | - Sara Kress
- Environmental Epidemiology of Lung, Brain and Skin Aging, IUF - Leibniz Research Institute for Environmental Medicine, Düsseldorf, Germany
| | - Dilini M Kothalawala
- Human Development and Health, Faculty of Medicine, University of Southampton, Southampton, UK
- NIHR Southampton Biomedical Research Centre, University Hospital Southampton, Southampton, UK
| | - Latha Kadalayil
- Human Development and Health, Faculty of Medicine, University of Southampton, Southampton, UK
| | - Jiasong Duan
- Division of Epidemiology, Biostatistics, and Environmental Health, School of Public Health, University of Memphis, Memphis, TN, USA
| | - Hongmei Zhang
- Division of Epidemiology, Biostatistics, and Environmental Health, School of Public Health, University of Memphis, Memphis, TN, USA
| | - Sabelo Hadebe
- Division of Immunology, Department of Pathology, Faculty of Health Sciences, University of Cape Town, Cape Town, South Africa
| | - Thomas Hoffmann
- Institute for Human Genetics, UCSF, San Francisco, CA, 94143, USA
- Department of Epidemiology and Biostatistics, UCSF, San Francisco, CA, 94158, USA
| | | | - Hélène Choquet
- Division of Research, Kaiser Permanente Northern California, Oakland, CA, USA
| | - Neil Risch
- Institute for Human Genetics, UCSF, San Francisco, CA, 94143, USA
- Department of Epidemiology and Biostatistics, UCSF, San Francisco, CA, 94158, USA
| | - Pål Njølstad
- Center for Diabetes Research, Department of Clinical Science, University of Bergen, NO-5020, Bergen, Norway
- Children and Youth Clinic, Haukeland University Hospital, NO-5021, Bergen, Norway
| | - Ole A Andreassen
- NORMENT Centre, Institute of Clinical Medicine, University of Oslo, 0450, Oslo, Norway
- Division of Mental Health and Addiction, Oslo University Hospital, 0450, Oslo, Norway
| | - Stefan Johansson
- Center for Diabetes Research, Department of Clinical Science, University of Bergen, NO-5020, Bergen, Norway
- Department of Medical Genetics, Haukeland University Hospital, NO-5021, Bergen, Norway
| | - Catarina Almqvist
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
- Pediatric Lung and Allergy Unit, Astrid Lindgren Children's Hospital, Karolinska University Hospital, Stockholm, Sweden
| | - Tong Gong
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Vilhelmina Ullemar
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Robert Karlsson
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Patrik K E Magnusson
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Agnieszka Szwajda
- Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
| | - Esteban G Burchard
- Department of Medicine, University of California San Francisco, San Francisco, CA, USA
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Jacob P Thyssen
- Department of Dermatology, Bispebjerg Hospital, University of Copenhagen, Copenhagen, Denmark
| | - Torben Hansen
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, København, Denmark
| | - Line L Kårhus
- Center for Clinical Research and Prevention, Bispebjerg and Frederiksberg Hospital, Frederiksberg, Denmark
| | - Thomas M Dantoft
- Center for Clinical Research and Prevention, Bispebjerg and Frederiksberg Hospital, Frederiksberg, Denmark
| | - Alexander C S N Jeanrenaud
- Max-Delbrück-Center for Molecular Medicine, Berlin, Germany
- Clinic for Pediatric Allergy, Experimental and Clinical Research Center, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Ahla Ghauri
- Max-Delbrück-Center for Molecular Medicine, Berlin, Germany
- Clinic for Pediatric Allergy, Experimental and Clinical Research Center, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Andreas Arnold
- Clinic and Polyclinic of Dermatology, University Medicine Greifswald, Greifswald, Germany
| | - Georg Homuth
- Department of Functional Genomics, Interfaculty Institute for Genetics and Functional Genomics, University Medicine Greifswald, Greifswald, Germany
| | - Susanne Lau
- Department of Pediatric Respiratory Medicine, Immunology, and Critical Care Medicine, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Markus M Nöthen
- Institute of Human Genetics, University of Bonn, School of Medicine & University Hospital Bonn, Bonn, Germany
| | - Norbert Hübner
- Max-Delbrück-Center for Molecular Medicine, Berlin, Germany
- Charite-Universitätsmedizin Berlin, Berlin, Germany
| | - Medea Imboden
- Swiss Tropical and Public Health Institute, CH-4123, Basel, Switzerland
- University of Basel, CH-4001, Basel, Switzerland
| | - Alessia Visconti
- Department of Twin Research & Genetics Epidemiology, Kings College London, London, UK
| | - Mario Falchi
- Department of Twin Research & Genetics Epidemiology, Kings College London, London, UK
| | - Veronique Bataille
- Department of Twin Research & Genetics Epidemiology, Kings College London, London, UK
- Dermatology Department, West Herts NHS Trust, Watford, UK
| | - Pirro Hysi
- Department of Twin Research & Genetics Epidemiology, Kings College London, London, UK
| | - Natalia Ballardini
- Department of Clinical Science and Education Södersjukhuset, Karolinska Institutet, Solna, Sweden
| | - Dorret I Boomsma
- Dept Biological Psychology, Netherlands Twin Register, VU University, Amsterdam, the Netherlands
- Institute for Health and Care Research (EMGO), VU University, Amsterdam, the Netherlands
| | - Jouke J Hottenga
- Dept Biological Psychology, Netherlands Twin Register, VU University, Amsterdam, the Netherlands
| | - Martina Müller-Nurasyid
- Institute of Genetic Epidemiology, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany
- IBE, Faculty of Medicine, LMU Munich, Munich, Germany
- Institute of Medical Biostatistics, Epidemiology and Informatics (IMBEI), University Medical Center, Johannes Gutenberg University, Mainz, Germany
| | - Tarunveer S Ahluwalia
- COPSAC, Copenhagen Prospective Studies on Asthma in Childhood, Herlev and Gentofte Hospital, University of Copenhagen, Copenhagen, Denmark
- Steno Diabetes Center Copenhagen, Herlev, Denmark
- Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Jakob Stokholm
- COPSAC, Copenhagen Prospective Studies on Asthma in Childhood, Herlev and Gentofte Hospital, University of Copenhagen, Copenhagen, Denmark
- Department of Pediatrics, Slagelse Hospital, Slagelse, Denmark
| | - Bo Chawes
- COPSAC, Copenhagen Prospective Studies on Asthma in Childhood, Herlev and Gentofte Hospital, University of Copenhagen, Copenhagen, Denmark
| | - Ann-Marie M Schoos
- COPSAC, Copenhagen Prospective Studies on Asthma in Childhood, Herlev and Gentofte Hospital, University of Copenhagen, Copenhagen, Denmark
- Department of Pediatrics, Slagelse Hospital, Slagelse, Denmark
| | - Ana Esplugues
- Nursing School, University of Valencia, FISABIO-University Jaume I-University of Valencia, Valencia, Spain
- Joint Research Unit of Epidemiology and Environmental Health, CIBERESP, Valencia, Spain
| | - Mariona Bustamante
- ISGlobal, Institute for Global Health, Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- CIBER Epidemiología y Salud Pública, Madrid, Spain
| | - Benjamin Raby
- Channing Division of Network Medicine, Brigham & Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - Syed Arshad
- Clinical and Experimental Sciences, Faculty of Medicine, University of Southampton, Southampton, UK
- David Hide Asthma and Allergy Research Centre, Isle of Wight, UK
| | | | - Tõnu Esko
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Lili A Milani
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Andres Metspalu
- Estonian Genome Centre, Institute of Genomics, University of Tartu, Tartu, Estonia
| | - Chikashi Terao
- Laboratory for Statistical and Translational Genetics, RIKEN Center for Integrative Medical Sciences, Yokohama, Japan
- Clinical Research Center, Shizuoka General Hospital, Shizuoka, Japan
- Department of Applied Genetics, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka, Japan
| | - Katrina Abuabara
- Department of Dermatology, University of California San Francisco, San Francisco, CA, USA
| | - Mari Løset
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, 7030, Norway
- Department of Dermatology, Clinic of Orthopaedy, Rheumatology and Dermatology, St. Olavs Hospital, Trondheim University Hospital, Trondheim, Norway
| | - Kristian Hveem
- K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and Nursing, NTNU, Norwegian University of Science and Technology, Trondheim, 7030, Norway
- HUNT Research Centre, Department of Public Health and General Practice, Norwegian University of Science and Technology, Levanger, Norway
| | - Bo Jacobsson
- Department of Obstetrics and Gynecology, Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
- Department of Genetics and Bioinformatics, Norwegian Institute of Public Health, Oslo, Norway
| | - Maria Pino-Yanes
- Genomics and Health Group, Department of Biochemistry, Microbiology, Cell Biology and Genetics, Universidad de La Laguna, La Laguna, Tenerife, Spain
- CIBER de Enfermedades Respiratorias, Instituto de Salud Carlos III, Madrid, Spain
- Instituto de Tecnologías Biomédicas (ITB), Universidad de La Laguna, San Cristóbal de La Laguna, Santa Cruz de Tenerife, Spain
| | - David P Strachan
- Population Health Research Institute, St George's, University of London, Cranmer Terrace, London, SW17 0RE, UK
| | - Niels Grarup
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, København, Denmark
| | - Allan Linneberg
- Center for Clinical Research and Prevention, Bispebjerg and Frederiksberg Hospital, Frederiksberg, Denmark
- Department of Clinical Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Young-Ae Lee
- Max-Delbrück-Center for Molecular Medicine, Berlin, Germany
- Clinic for Pediatric Allergy, Experimental and Clinical Research Center, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Nicole Probst-Hensch
- Swiss Tropical and Public Health Institute, CH-4123, Basel, Switzerland
- University of Basel, CH-4001, Basel, Switzerland
| | - Stephan Weidinger
- Department of Dermatology, Allergology and Venereology, University Hospital Schleswig-Holstein, Kiel, Germany
| | - Marjo-Riitta Jarvelin
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment & Health, School of Public Health,Imperial College London, London, UK
- Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland
- Biocenter Oulu, University of Oulu, Oulu, Finland
| | - Erik Melén
- Department of Clinical Science and Education Södersjukhuset, Karolinska Institutet, Solna, Sweden
| | - Hakon Hakonarson
- Center for Applied Genomics, Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA
- Department of Pediatrics, Divisions of Human Genetics and Pulmonary Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, 19104, USA
- Faculty of Medicine, University of Iceland, 101, Reykjavík, Iceland
| | - Alan D Irvine
- Department of Clinical Medicine, Trinity College, Dublin, Ireland
| | - Deborah Jarvis
- Respiratory Epidemiology, Occupational Medicine and Public Health, National Heart and Lung Institute, Imperial College London, London, United Kingdom
- Medical Research Council and Public Health England Centre for Environment and Health, London, United Kingdom
| | - Tamar Nijsten
- Department of Dermatology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Liesbeth Duijts
- Department of Pediatrics, division of Respiratory Medicine and Allergology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
- Department of Pediatrics, division of Neonatology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, The Netherlands
| | - Judith M Vonk
- University of Groningen, University Medical Center Groningen, GRIAC Research Institute, Groningen, The Netherlands
- University of Groningen, University Medical Center Groningen, Department of Epidemiology, Groningen, The Netherlands
| | - Gerard H Koppelmann
- University of Groningen, University Medical Center Groningen, Department of Pediatric Pulmonology and Pediatric Allergy, Beatrix Children's Hospital, Groningen, The Netherlands
- University of Groningen, University Medical Center Groningen, GRIAC Research Institute, Groningen, The Netherlands
| | - Keith M Godfrey
- MRC Lifecourse Epidemiology Centre and NIHR Southampton Biomedical Research Centre, University of Southampton and University Hospital Southampton NHS Foundation Trust, Southampton, UK
| | - Sheila J Barton
- MRC Lifecourse Epidemiology Centre, University of Southampton, Southampton, UK
| | - Bjarke Feenstra
- Department of Epidemiology Research, Statens Serum Institut, Copenhagen, Denmark
| | - Craig E Pennell
- School of Medicine and Public Health, University of Newcastle, Newcastle, NSW, Australia
- Hunter Medical Research Institute, Newcastle, NSW, Australia
| | - Peter D Sly
- Children's Health and Environment Program, Child Health Research Centre, The University of Queensland, South Brisbane, 4101, Queensland, Australia
- Australian Infectious Diseases Research Centre, The University of Queensland, St Lucia, 4072, QLD, Australia
| | - Patrick G Holt
- Telethon Kids Institute, University of Western Australia, Perth, WA, Australia
| | - L Keoki Williams
- Center for Individualized and Genomic Medicine Research (CIGMA), Department of Medicine, Henry Ford Health, Detroit, MI, 48104, USA
| | - Hans Bisgaard
- COPSAC, Copenhagen Prospective Studies on Asthma in Childhood, Herlev and Gentofte Hospital, University of Copenhagen, Copenhagen, Denmark
| | - Klaus Bønnelykke
- COPSAC, Copenhagen Prospective Studies on Asthma in Childhood, Herlev and Gentofte Hospital, University of Copenhagen, Copenhagen, Denmark
| | - John Curtin
- Division of Immunology, Immunity to Infection and Respiratory Medicine, School of Biological Sciences, The University of Manchester, Manchester Academic Health Science Centre, and Manchester University NHS Foundation Trust, Manchester, England
| | - Angela Simpson
- Division of Immunology, Immunity to Infection and Respiratory Medicine, School of Biological Sciences, The University of Manchester, Manchester Academic Health Science Centre, and Manchester University NHS Foundation Trust, Manchester, England
| | - Clare Murray
- Division of Immunology, Immunity to Infection and Respiratory Medicine, School of Biological Sciences, The University of Manchester, Manchester Academic Health Science Centre, and Manchester University NHS Foundation Trust, Manchester, England
| | - Tamara Schikowski
- Environmental Epidemiology of Lung, Brain and Skin Aging, Leibniz Research Institute for Environmental Medicine, Düsseldorf, Germany
| | - Supinda Bunyavanich
- Division of Allergy and Immunology, Department of Pediatrics, and Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, USA
| | - Scott T Weiss
- Channing Division of Network Medicine, Brigham & Women's Hospital and Harvard Medical School, Boston, MA, USA
| | - John W Holloway
- Human Development and Health, Faculty of Medicine, University of Southampton, Southampton, UK
- Clinical and Experimental Sciences, Faculty of Medicine, University of Southampton, Southampton, UK
| | - Josine L Min
- Medical Research Council Integrative Epidemiology Unit, Bristol Medical School, University of Bristol, Bristol, England
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, England
| | - Sara J Brown
- Centre for Genomics and Experimental Medicine, Institute for Genetics and Cancer, University of Edinburgh, Crewe Road, Edinburgh, UK EH4 2XU, Scotland
| | - Marie Standl
- Institute of Epidemiology, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany
- German Center for Lung Research (DZL), Munich, Germany
| | - Lavinia Paternoster
- Medical Research Council Integrative Epidemiology Unit, Bristol Medical School, University of Bristol, Bristol, England.
- Population Health Sciences, Bristol Medical School, University of Bristol, Bristol, England.
| |
Collapse
|
34
|
Wang R, Helbig I, Edmondson AC, Lin L, Xing Y. Splicing defects in rare diseases: transcriptomics and machine learning strategies towards genetic diagnosis. Brief Bioinform 2023; 24:bbad284. [PMID: 37580177 PMCID: PMC10516351 DOI: 10.1093/bib/bbad284] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2023] [Revised: 07/10/2023] [Accepted: 07/20/2023] [Indexed: 08/16/2023] Open
Abstract
Genomic variants affecting pre-messenger RNA splicing and its regulation are known to underlie many rare genetic diseases. However, common workflows for genetic diagnosis and clinical variant interpretation frequently overlook splice-altering variants. To better serve patient populations and advance biomedical knowledge, it has become increasingly important to develop and refine approaches for detecting and interpreting pathogenic splicing variants. In this review, we will summarize a few recent developments and challenges in using RNA sequencing technologies for rare disease investigation. Moreover, we will discuss how recent computational splicing prediction tools have emerged as complementary approaches for revealing disease-causing variants underlying splicing defects. We speculate that continuous improvements to sequencing technologies and predictive modeling will not only expand our understanding of splicing regulation but also bring us closer to filling the diagnostic gap for rare disease patients.
Collapse
Affiliation(s)
- Robert Wang
- Center for Computational and Genomic Medicine, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Genomics and Computational Biology Graduate Program, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Ingo Helbig
- The Epilepsy NeuroGenetics Initiative, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Neurology, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Andrew C Edmondson
- Center for Computational and Genomic Medicine, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Pediatrics, Division of Human Genetics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Lan Lin
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Raymond G. Perelman Center for Cellular and Molecular Therapeutics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
| | - Yi Xing
- Center for Computational and Genomic Medicine, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Biomedical and Health Informatics, Children's Hospital of Philadelphia, Philadelphia, PA 19104, USA
- Department of Pathology and Laboratory Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
35
|
Xu W, Duan L, Zheng H, Li-Ling J, Jiang W, Zhang Y, Wang T, Qin R. An Integrative Disease Information Network Approach to Similar Disease Detection. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:2724-2735. [PMID: 34478379 DOI: 10.1109/tcbb.2021.3110127] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Disease similarity analysis impacts significantly in pathogenesis revealing, treatment recommending, and disease-causing genes predicting. Previous works study the disease similarity based on the semantics obtaining from biomedical ontologies (e.g., disease ontology) or the function of disease-causing molecules. However, such methods almost focus on a single perspective for obtaining disease features, which may lead to biased results for similar disease detection. To address this issue, we propose a disease information network-based integrative approach named MISSION for detecting similar diseases. By leveraging the associations between diseases and other biomedical entities, the disease information network is established first. Then, the disease similarity features extracted from the aspects of disease taxonomy, attributes, literature, and annotations are integrated into the disease information network. Finally, the top-k similar disease query is performed based on the integrative disease information. The experiments conducted on real-world datasets demonstrate that MISSION is effective and useful in similar disease detection.
Collapse
|
36
|
Dingemans AJM, Hinne M, Truijen KMG, Goltstein L, van Reeuwijk J, de Leeuw N, Schuurs-Hoeijmakers J, Pfundt R, Diets IJ, den Hoed J, de Boer E, Coenen-van der Spek J, Jansen S, van Bon BW, Jonis N, Ockeloen CW, Vulto-van Silfhout AT, Kleefstra T, Koolen DA, Campeau PM, Palmer EE, Van Esch H, Lyon GJ, Alkuraya FS, Rauch A, Marom R, Baralle D, van der Sluijs PJ, Santen GWE, Kooy RF, van Gerven MAJ, Vissers LELM, de Vries BBA. PhenoScore quantifies phenotypic variation for rare genetic diseases by combining facial analysis with other clinical features using a machine-learning framework. Nat Genet 2023; 55:1598-1607. [PMID: 37550531 DOI: 10.1038/s41588-023-01469-w] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 07/05/2023] [Indexed: 08/09/2023]
Abstract
Several molecular and phenotypic algorithms exist that establish genotype-phenotype correlations, including facial recognition tools. However, no unified framework that investigates both facial data and other phenotypic data directly from individuals exists. We developed PhenoScore: an open-source, artificial intelligence-based phenomics framework, combining facial recognition technology with Human Phenotype Ontology data analysis to quantify phenotypic similarity. Here we show PhenoScore's ability to recognize distinct phenotypic entities by establishing recognizable phenotypes for 37 of 40 investigated syndromes against clinical features observed in individuals with other neurodevelopmental disorders and show it is an improvement on existing approaches. PhenoScore provides predictions for individuals with variants of unknown significance and enables sophisticated genotype-phenotype studies by testing hypotheses on possible phenotypic (sub)groups. PhenoScore confirmed previously known phenotypic subgroups caused by variants in the same gene for SATB1, SETBP1 and DEAF1 and provides objective clinical evidence for two distinct ADNP-related phenotypes, already established functionally.
Collapse
Affiliation(s)
- Alexander J M Dingemans
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
- Department of Artificial Intelligence, Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, the Netherlands
| | - Max Hinne
- Department of Artificial Intelligence, Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, the Netherlands
| | - Kim M G Truijen
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Lia Goltstein
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Jeroen van Reeuwijk
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Nicole de Leeuw
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Janneke Schuurs-Hoeijmakers
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Rolph Pfundt
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Illja J Diets
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Joery den Hoed
- Language and Genetics Department, Max Planck Institute for Psycholinguistics, Nijmegen, the Netherlands
| | - Elke de Boer
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Jet Coenen-van der Spek
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Sandra Jansen
- Department of Human Genetics, Amsterdam UMC, University of Amsterdam, Amsterdam, the Netherlands
| | - Bregje W van Bon
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Noraly Jonis
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Charlotte W Ockeloen
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Anneke T Vulto-van Silfhout
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Tjitske Kleefstra
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - David A Koolen
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands
| | - Philippe M Campeau
- Department of Pediatrics, University of Montreal, Montreal, Quebec, Canada
| | - Elizabeth E Palmer
- Faculty of Medicine and Health, UNSW Sydney, Sydney, New South Wales, Australia
- Sydney Children's Hospitals Network, Sydney, New South Wales, Australia
| | - Hilde Van Esch
- Center for Human Genetics, University Hospitals Leuven, University of Leuven, Leuven, Belgium
| | - Gholson J Lyon
- Department of Human Genetics and George A. Jervis Clinic, Institute for Basic Research in Developmental Disabilities (IBR), Staten Island, NY, USA
- Biology PhD Program, The Graduate Center, The City University of New York, New York City, NY, USA
| | - Fowzan S Alkuraya
- Department of Translational Genomics, Center for Genomic Medicine, King Faisal Specialist Hospital and Research Center, Riyadh, Saudi Arabia
| | - Anita Rauch
- Institute of Medical Genetics, University of Zürich, Zürich, Switzerland
| | - Ronit Marom
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Diana Baralle
- Faculty of Medicine, University of Southampton, Southampton, UK
| | | | - Gijs W E Santen
- Department of Clinical Genetics, Leiden University Medical Center, Leiden, the Netherlands
| | - R Frank Kooy
- Department of Medical Genetics, University of Antwerp, Antwerp, Belgium
| | - Marcel A J van Gerven
- Department of Artificial Intelligence, Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, the Netherlands
| | - Lisenka E L M Vissers
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands.
| | - Bert B A de Vries
- Department of Human Genetics, Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, the Netherlands.
| |
Collapse
|
37
|
Zhu C, Xia X, Li N, Zhong F, Yang Z, Liu L. RDKG-115: Assisting drug repurposing and discovery for rare diseases by trimodal knowledge graph embedding. Comput Biol Med 2023; 164:107262. [PMID: 37481946 DOI: 10.1016/j.compbiomed.2023.107262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Revised: 07/07/2023] [Accepted: 07/16/2023] [Indexed: 07/25/2023]
Abstract
Rare diseases (RDs) may affect individuals in small numbers, but they have a significant impact on a global scale. Accurate diagnosis of RDs is challenging, and there is a severe lack of drugs available for treatment. Pharmaceutical companies have shown a preference for drug repurposing from existing drugs developed for other diseases due to the high investment, high risk, and long cycle involved in RD drug development. Compared to traditional approaches, knowledge graph embedding (KGE) based methods are more efficient and convenient, as they treat drug repurposing as a link prediction task. KGE models allow for the enrichment of existing knowledge by incorporating multimodal information from various sources. In this study, we constructed RDKG-115, a rare disease knowledge graph involving 115 RDs, composed of 35,643 entities, 25 relations, and 5,539,839 refined triplets, based on 372,384 high-quality literature and 4 biomedical datasets: DRKG, Pathway Commons, PharmKG, and PMapp. Subsequently, we developed a trimodal KGE model containing structure, category, and description embeddings using reverse-hyperplane projection. We utilized this model to infer 4199 reliable new inferred triplets from RDKG-115. Finally, we calculated potential drugs and small molecules for each of the 115 RDs, taking multiple sclerosis as a case study. This study provides a paradigm for large-scale screening of drug repurposing and discovery for RDs, which will speed up the drug development process and ultimately benefit patients with RDs. The source code and data are available at https://github.com/ZhuChaoY/RDKG-115.
Collapse
Affiliation(s)
- Chaoyu Zhu
- Intelligent Medicine Institute, Shanghai Medical College, Fudan University, Shanghai, 200032, China
| | - Xiaoqiong Xia
- Intelligent Medicine Institute, Shanghai Medical College, Fudan University, Shanghai, 200032, China
| | - Nan Li
- College of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, China
| | - Fan Zhong
- Intelligent Medicine Institute, Shanghai Medical College, Fudan University, Shanghai, 200032, China.
| | - Zhihao Yang
- College of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, China.
| | - Lei Liu
- Intelligent Medicine Institute, Shanghai Medical College, Fudan University, Shanghai, 200032, China; Shanghai Institute of Stem Cell Research and Clinical Translation, Shanghai, 200120, China.
| |
Collapse
|
38
|
Lubala TK, Kayembe-Kitenge T, Mubungu G, Lumaka A, Kanteng G, Savage S, Luboya O, Hagerman R, Devriendt K, Lukusa-Tshilobo P. Usefulness of automated image analysis for recognition of the fragile X syndrome gestalt in Congolese subjects. Eur J Med Genet 2023; 66:104819. [PMID: 37532084 DOI: 10.1016/j.ejmg.2023.104819] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Revised: 07/16/2023] [Accepted: 07/30/2023] [Indexed: 08/04/2023]
Abstract
BACKGROUND Computer-aided software such as the facial image diagnostic aid (FIDA) and Face2Gene has been developed to perform pattern recognition of facial features with promising clinical results. The aim of this pilot study was to test Face2Gene's recognition performance on Bantu Congolese subjects with Fragile X syndrome (FXS) as compared to Congolese subjects with intellectual disability but without FXS (non-FXS). METHOD Frontal facial photograph from 156 participants (14 patients with FXS and 142 controls) predominantly young-adults to adults, median age 18.9 age range 4-39yo, were uploaded. Automated face analysis was conducted by using the technology used in proprietary software tools called Face2Gene CLINIC and Face2Gene RESEARCH (version 17.6.2). To estimate the statistical power of the Face2Gene technology in distinguishing affected individuals from controls, a cross validation scheme was used. RESULTS The similarity seen in the upper facial region (of males and females) is greater than the similarity seen in other parts of the face. Binary comparison of subjects with FXS versus non-FXS and subjects with FXS versus subjects with Down syndrome reveal an area under the curve values of 0.955 (p = 0.002) and 0.986 (p = 0.003). CONCLUSION The Face2Gene algorithm is separating well between FXS and Non-FXS subjects.
Collapse
Affiliation(s)
- Toni Kasole Lubala
- Division of Dysmorphology & Birth Defects, Department of Pediatrics, University of Lubumbashi, Democratic Republic of the Congo
| | - Tony Kayembe-Kitenge
- Unit of Toxicology and Environment, School of Public Health, University of Lubumbashi, Democratic Republic of the Congo; Center for Environment and Health, Department of Public Health and Primary Care, KU Leuven, Leuven, Belgium; Higher Institute of Medical Techniques, Lubumbashi, Democratic Republic of the Congo.
| | - Gerrye Mubungu
- Department of Pediatrics, Faculty of Medicine, University of Kinshasa, Democratic Republic of the Congo; Center for Human Genetics, National Institute for Biomedical Research (NIBR), Democratic Republic of the Congo; Center for Human Genetics, University Hospital Leuven, University of Leuven, Belgium
| | - Aimé Lumaka
- Department of Pediatrics, Faculty of Medicine, University of Kinshasa, Democratic Republic of the Congo; Center for Human Genetics, National Institute for Biomedical Research (NIBR), Democratic Republic of the Congo; Center for Human Genetics, University Hospital Leuven, University of Leuven, Belgium; Department of Biomedical and Preclinical Sciences, GIGA-Rm Laboratory of Human Genetics, University of Liège, Liège, Belgium
| | - Gray Kanteng
- Division of Dysmorphology & Birth Defects, Department of Pediatrics, University of Lubumbashi, Democratic Republic of the Congo
| | | | - Oscar Luboya
- Division of Dysmorphology & Birth Defects, Department of Pediatrics, University of Lubumbashi, Democratic Republic of the Congo; Higher Institute of Medical Techniques, Lubumbashi, Democratic Republic of the Congo
| | - Randi Hagerman
- Department of Pediatrics, MIND Institute, University of California Davis Medical Center, Sacramento, CA, USA
| | - Koenraad Devriendt
- Center for Human Genetics, University Hospital Leuven, University of Leuven, Belgium
| | - Prosper Lukusa-Tshilobo
- Department of Pediatrics, Faculty of Medicine, University of Kinshasa, Democratic Republic of the Congo; Center for Human Genetics, National Institute for Biomedical Research (NIBR), Democratic Republic of the Congo; Center for Human Genetics, University Hospital Leuven, University of Leuven, Belgium
| |
Collapse
|
39
|
Alghamdi SM, Hoehndorf R. Improving the classification of cardinality phenotypes using collections. J Biomed Semantics 2023; 14:9. [PMID: 37550716 PMCID: PMC10405428 DOI: 10.1186/s13326-023-00290-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 07/07/2023] [Indexed: 08/09/2023] Open
Abstract
MOTIVATION Phenotypes are observable characteristics of an organism and they can be highly variable. Information about phenotypes is collected in a clinical context to characterize disease, and is also collected in model organisms and stored in model organism databases where they are used to understand gene functions. Phenotype data is also used in computational data analysis and machine learning methods to provide novel insights into disease mechanisms and support personalized diagnosis of disease. For mammalian organisms and in a clinical context, ontologies such as the Human Phenotype Ontology and the Mammalian Phenotype Ontology are widely used to formally and precisely describe phenotypes. We specifically analyze axioms pertaining to phenotypes of collections of entities within a body, and we find that some of the axioms in phenotype ontologies lead to inferences that may not accurately reflect the underlying biological phenomena. RESULTS We reformulate the phenotypes of collections of entities using an ontological theory of collections. By reformulating phenotypes of collections in phenotypes ontologies, we avoid potentially incorrect inferences pertaining to the cardinality of these collections. We apply our method to two phenotype ontologies and show that the reformulation not only removes some problematic inferences but also quantitatively improves biological data analysis.
Collapse
Affiliation(s)
- Sarah M Alghamdi
- Computational Bioscience Research Center (CBRC), Computer, Electrical, and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology, 4700 KAUST, 23955, Thuwal, Saudi Arabia.
- King Abdul-Aziz University, Faculty of Computing and Information Technology, 25732, Rabigh, Saudi Arabia.
| | - Robert Hoehndorf
- Computational Bioscience Research Center (CBRC), Computer, Electrical, and Mathematical Sciences & Engineering Division, King Abdullah University of Science and Technology, 4700 KAUST, 23955, Thuwal, Saudi Arabia.
| |
Collapse
|
40
|
Aspromonte MC, Conte AD, Zhu S, Tan W, Shen Y, Zhang Y, Li Q, Wang MH, Babbi G, Bovo S, Martelli PL, Casadio R, Althagafi A, Toonsi S, Kulmanov M, Hoehndorf R, Katsonis P, Williams A, Lichtarge O, Xian S, Surento W, Pejaver V, Mooney SD, Sunderam U, Srinivasan R, Murgia A, Piovesan D, Tosatto SCE, Leonardi E. CAGI6 ID-Challenge: Assessment of phenotype and variant predictions in 415 children with Neurodevelopmental Disorders (NDDs). RESEARCH SQUARE 2023:rs.3.rs-3209168. [PMID: 37577579 PMCID: PMC10418555 DOI: 10.21203/rs.3.rs-3209168/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]
Abstract
In the context of the Critical Assessment of the Genome Interpretation, 6th edition (CAGI6), the Genetics of Neurodevelopmental Disorders Lab in Padua proposed a new ID-challenge to give the opportunity of developing computational methods for predicting patient's phenotype and the causal variants. Eight research teams and 30 models had access to the phenotype details and real genetic data, based on the sequences of 74 genes (VCF format) in 415 pediatric patients affected by Neurodevelopmental Disorders (NDDs). NDDs are clinically and genetically heterogeneous conditions, with onset in infant age. In this study we evaluate the ability and accuracy of computational methods to predict comorbid phenotypes based on clinical features described in each patient and causal variants. Finally, we asked to develop a method to find new possible genetic causes for patients without a genetic diagnosis. As already done for the CAGI5, seven clinical features (ID, ASD, ataxia, epilepsy, microcephaly, macrocephaly, hypotonia), and variants (causative, putative pathogenic and contributing factors) were provided. Considering the overall clinical manifestation of our cohort, we give out the variant data and phenotypic traits of the 150 patients from CAGI5 ID-Challenge as training and validation for the prediction methods development.
Collapse
Affiliation(s)
| | | | - Shaowen Zhu
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843
| | - Wuwei Tan
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843
| | - Yang Shen
- Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843
| | | | - Qi Li
- CUHK Shenzhen Research Institute, Shenzhen
| | | | - Giulia Babbi
- Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna
| | - Samuele Bovo
- Department of Agricultural and Food Sciences, University of Bologna
| | - Pier Luigi Martelli
- Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna
| | - Rita Casadio
- Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna
| | - Azza Althagafi
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23
| | - Sumyyah Toonsi
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23
| | - Maxat Kulmanov
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23
| | - Robert Hoehndorf
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), King Abdullah University of Science and Technology (KAUST), Thuwal 23
| | - Panagiotis Katsonis
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030
| | - Amanda Williams
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030
| | - Olivier Lichtarge
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030
| | - Su Xian
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA 98195
| | - Wesley Surento
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA 98195
| | - Vikas Pejaver
- Institute for Genomic Health, Icahn School of Medicine at Mount Sinai, New York, NY 10029
| | - Sean D Mooney
- Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA 98195
| | - Uma Sunderam
- Innovation Labs, Tata Consultancy Services, Hyderabad
| | | | | | | | | | | |
Collapse
|
41
|
Serrano JG, O'Leary M, VanNoy GE, Mangilog BE, Holm IA, Fraiman YS, Rehm HL, O'Donnell-Luria A, Wojcik MH. Advancing Understanding of Inequities in Rare Disease Genomics. Clin Ther 2023; 45:745-753. [PMID: 37517917 PMCID: PMC10527807 DOI: 10.1016/j.clinthera.2023.06.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Revised: 05/07/2023] [Accepted: 06/02/2023] [Indexed: 08/01/2023]
Abstract
PURPOSE Advances in genomic research have facilitated rare disease diagnosis for thousands of individuals. Unfortunately, the benefits of advanced genetic diagnostic technology are not distributed equitably among the population, as has been seen in many other health care contexts. Quantifying and describing inequities in genetic diagnostic yield is inherently challenging due to barriers to both clinical and research genetic testing. We therefore present an implementation protocol developed to expand access to our rare disease genomic research study and to further understand existing inequities. METHODS AND FINDINGS The Rare Genomes Project (RGP) at the Broad Institute of MIT and Harvard offers research genome sequencing to individuals with rare disease who remain genetically undiagnosed through direct interaction with the individual or family. This presents an opportunity for diagnosis beyond the clinical context, thus eliminating many barriers to access. An initial goal of RGP was to equalize access to genomic sequencing by decoupling testing access from proximity to a major medical center and physician referral. However, study participants over the initial 3 years of this project were predominantly white and well resourced. To further understand and address the lack of diversity within RGP, we developed a novel protocol embedded within the larger RGP study, in an approach informed by an implementation science framework. The aims of this protocol were: (1) to diversify recruitment and enrollment within RGP; (2) understand the process and context of implementing genomic medicine for rare disease diagnosis; and (3) investigate the value of a diagnosis for underserved populations. IMPLICATIONS Improved understanding of existing inequities and potential strategies to address them are needed to advance equity in rare disease genetic diagnosis and research. In addition to the moral imperative of equity in genomic medicine, this approach is critical in order to fully understand the genomic underpinnings of rare disease.
Collapse
Affiliation(s)
- Jillian G Serrano
- Broad Center for Mendelian Genomics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Melanie O'Leary
- Broad Center for Mendelian Genomics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Grace E VanNoy
- Broad Center for Mendelian Genomics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Brian E Mangilog
- Broad Center for Mendelian Genomics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Ingrid A Holm
- Division of Genetics and Genomics, Department of Pediatrics, Boston Children's Hospital and Harvard Medical School, Boston, Massachusetts, USA
| | - Yarden S Fraiman
- Division of Newborn Medicine, Department of Pediatrics, Boston Children's Hospital and Harvard Medical School, Boston, Massachusetts, USA; Department of Neonatology, Beth Israel Deaconess Medical Center and Harvard Medical School, Boston, Massachusetts, USA
| | - Heidi L Rehm
- Broad Center for Mendelian Genomics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA; Center for Genomic Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, USA
| | - Anne O'Donnell-Luria
- Broad Center for Mendelian Genomics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA; Division of Genetics and Genomics, Department of Pediatrics, Boston Children's Hospital and Harvard Medical School, Boston, Massachusetts, USA; Center for Genomic Medicine, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, USA
| | - Monica H Wojcik
- Broad Center for Mendelian Genomics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA; Division of Genetics and Genomics, Department of Pediatrics, Boston Children's Hospital and Harvard Medical School, Boston, Massachusetts, USA; Division of Newborn Medicine, Department of Pediatrics, Boston Children's Hospital and Harvard Medical School, Boston, Massachusetts, USA.
| |
Collapse
|
42
|
Kafkas Ș, Abdelhakim M, Uludag M, Althagafi A, Alghamdi M, Hoehndorf R. Starvar: symptom-based tool for automatic ranking of variants using evidence from literature and genomes. BMC Bioinformatics 2023; 24:294. [PMID: 37479972 PMCID: PMC10362560 DOI: 10.1186/s12859-023-05406-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 07/10/2023] [Indexed: 07/23/2023] Open
Abstract
BACKGROUND Identifying variants associated with diseases is a challenging task in medical genetics research. Current studies that prioritize variants within individual genomes generally rely on known variants, evidence from literature and genomes, and patient symptoms and clinical signs. The functionalities of the existing tools, which rank variants based on given patient symptoms and clinical signs, are restricted to the coverage of ontologies such as the Human Phenotype Ontology (HPO). However, most clinicians do not limit themselves to HPO while describing patient symptoms/signs and their associated variants/genes. There is thus a need for an automated tool that can prioritize variants based on freely expressed patient symptoms and clinical signs. RESULTS STARVar is a Symptom-based Tool for Automatic Ranking of Variants using evidence from literature and genomes. STARVar uses patient symptoms and clinical signs, either linked to HPO or expressed in free text format. It returns a ranked list of variants based on a combined score from two classifiers utilizing evidence from genomics and literature. STARVar improves over related tools on a set of synthetic patients. In addition, we demonstrated its distinct contribution to the domain on another synthetic dataset covering publicly available clinical genotype-phenotype associations by using symptoms and clinical signs expressed in free text format. CONCLUSIONS STARVar stands as a unique and efficient tool that has the advantage of ranking variants with flexibly expressed patient symptoms in free-form text. Therefore, STARVar can be easily integrated into bioinformatics workflows designed to analyze disease-associated genomes. AVAILABILITY STARVar is freely available from https://github.com/bio-ontology-research-group/STARVar .
Collapse
Affiliation(s)
- Șenay Kafkas
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, 23955 Thuwal, Saudi Arabia
| | - Marwa Abdelhakim
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, 23955 Thuwal, Saudi Arabia
| | - Mahmut Uludag
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, 23955 Thuwal, Saudi Arabia
| | - Azza Althagafi
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, 23955 Thuwal, Saudi Arabia
- Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology, 23955 Thuwal, Saudi Arabia
- Computer Science Department, College of Computers and Information Technology, Taif University, 21655 Taif, Saudi Arabia
| | - Malak Alghamdi
- Medical Genetic Division, Department of Pediatrics, College of Medicine, King Saud University, 2925 Riyadh, Saudi Arabia
| | - Robert Hoehndorf
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, 23955 Thuwal, Saudi Arabia
- Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology, 23955 Thuwal, Saudi Arabia
| |
Collapse
|
43
|
Montanucci L, Lewis-Smith D, Collins RL, Niestroj LM, Parthasarathy S, Xian J, Ganesan S, Macnee M, Brünger T, Thomas RH, Talkowski M, Helbig I, Leu C, Lal D. Genome-wide identification and phenotypic characterization of seizure-associated copy number variations in 741,075 individuals. Nat Commun 2023; 14:4392. [PMID: 37474567 PMCID: PMC10359300 DOI: 10.1038/s41467-023-39539-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Accepted: 06/16/2023] [Indexed: 07/22/2023] Open
Abstract
Copy number variants (CNV) are established risk factors for neurodevelopmental disorders with seizures or epilepsy. With the hypothesis that seizure disorders share genetic risk factors, we pooled CNV data from 10,590 individuals with seizure disorders, 16,109 individuals with clinically validated epilepsy, and 492,324 population controls and identified 25 genome-wide significant loci, 22 of which are novel for seizure disorders, such as deletions at 1p36.33, 1q44, 2p21-p16.3, 3q29, 8p23.3-p23.2, 9p24.3, 10q26.3, 15q11.2, 15q12-q13.1, 16p12.2, 17q21.31, duplications at 2q13, 9q34.3, 16p13.3, 17q12, 19p13.3, 20q13.33, and reciprocal CNVs at 16p11.2, and 22q11.21. Using genetic data from additional 248,751 individuals with 23 neuropsychiatric phenotypes, we explored the pleiotropy of these 25 loci. Finally, in a subset of individuals with epilepsy and detailed clinical data available, we performed phenome-wide association analyses between individual CNVs and clinical annotations categorized through the Human Phenotype Ontology (HPO). For six CNVs, we identified 19 significant associations with specific HPO terms and generated, for all CNVs, phenotype signatures across 17 clinical categories relevant for epileptologists. This is the most comprehensive investigation of CNVs in epilepsy and related seizure disorders, with potential implications for clinical practice.
Collapse
Affiliation(s)
- Ludovica Montanucci
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, USA
| | - David Lewis-Smith
- Translational and Clinical Research Institute, Newcastle University, Newcastle upon Tyne, UK
- Clinical Neurosciences, Newcastle upon Tyne Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
- The Epilepsy NeuroGenetics Initiative, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Ryan L Collins
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, USA
- Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard, Cambridge, USA
| | | | - Shridhar Parthasarathy
- The Epilepsy NeuroGenetics Initiative, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Julie Xian
- The Epilepsy NeuroGenetics Initiative, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Shiva Ganesan
- The Epilepsy NeuroGenetics Initiative, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA
| | - Marie Macnee
- Cologne Center for Genomics, University of Cologne, Cologne, Germany
| | - Tobias Brünger
- Cologne Center for Genomics, University of Cologne, Cologne, Germany
| | - Rhys H Thomas
- Translational and Clinical Research Institute, Newcastle University, Newcastle upon Tyne, UK
- Clinical Neurosciences, Newcastle upon Tyne Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
| | - Michael Talkowski
- Center for Genomic Medicine, Massachusetts General Hospital, Boston, USA
- Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard, Cambridge, USA
| | - Ingo Helbig
- The Epilepsy NeuroGenetics Initiative, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Neurology, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, USA
| | - Costin Leu
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, USA.
- Department of Clinical and Experimental Epilepsy, Institute of Neurology, University College London, London, UK.
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and M.I.T, Cambridge, MA, USA.
- Epilepsy Center, Neurological Institute, Cleveland Clinic, Cleveland, OH, US.
| | - Dennis Lal
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, Cleveland, USA.
- Program in Medical and Population Genetics, Broad Institute of Massachusetts Institute of Technology (M.I.T.) and Harvard, Cambridge, USA.
- Stanley Center for Psychiatric Research, Broad Institute of Harvard and M.I.T, Cambridge, MA, USA.
- Epilepsy Center, Neurological Institute, Cleveland Clinic, Cleveland, OH, US.
| |
Collapse
|
44
|
Leggatt G, Cheng G, Narain S, Briseño-Roa L, Annereau JP, Gast C, Gilbert RD, Ennis S. A genotype-to-phenotype approach suggests under-reporting of single nucleotide variants in nephrocystin-1 (NPHP1) related disease (UK 100,000 Genomes Project). Sci Rep 2023; 13:9369. [PMID: 37296294 PMCID: PMC10256716 DOI: 10.1038/s41598-023-32169-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Accepted: 03/23/2023] [Indexed: 06/12/2023] Open
Abstract
Autosomal recessive whole gene deletions of nephrocystin-1 (NPHP1) result in abnormal structure and function of the primary cilia. These deletions can result in a tubulointerstitial kidney disease known as nephronophthisis and retinal (Senior-Løken syndrome) and neurological (Joubert syndrome) diseases. Nephronophthisis is a common cause of end-stage kidney disease (ESKD) in children and up to 1% of adult onset ESKD. Single nucleotide variants (SNVs) and small insertions and deletions (Indels) have been less well characterised. We used a gene pathogenicity scoring system (GenePy) and a genotype-to-phenotype approach on individuals recruited to the UK Genomics England (GEL) 100,000 Genomes Project (100kGP) (n = 78,050). This approach identified all participants with NPHP1-related diseases reported by NHS Genomics Medical Centres and an additional eight participants. Extreme NPHP1 gene scores, often underpinned by clear recessive inheritance, were observed in patients from diverse recruitment categories, including cancer, suggesting the possibility of a more widespread disease than previously appreciated. In total, ten participants had homozygous CNV deletions with eight homozygous or compound heterozygous with SNVs. Our data also reveals strong in-silico evidence that approximately 44% of NPHP1 related disease may be due to SNVs with AlphaFold structural modelling evidence for a significant impact on protein structure. This study suggests historical under-reporting of SNVS in NPHP1 related diseases compared with CNVs.
Collapse
Affiliation(s)
- Gary Leggatt
- University of Southampton, Duthie Building (MP 808), Southampton General Hospital, Tremona Road Shirley, Southampton, SO16 6YD, UK.
- Wessex Kidney Centre, Portsmouth Hospitals University NHS Trust, Southwick Hill Road, Cosham, Portsmouth, PO6 3LY, UK.
- University Hospital Southampton NHS Foundation Trust, Southampton General Hospital, Tremona Road Shirley, Southampton, SO16 6YD, UK.
| | - Guo Cheng
- University of Southampton, Duthie Building (MP 808), Southampton General Hospital, Tremona Road Shirley, Southampton, SO16 6YD, UK
| | - Sumit Narain
- University of Southampton, Duthie Building (MP 808), Southampton General Hospital, Tremona Road Shirley, Southampton, SO16 6YD, UK
| | - Luis Briseño-Roa
- Medetia, Imagine Institute for Genetic Diseases, 24 Boulevard du Montparnasse, 75015, Paris, France
| | - Jean-Philippe Annereau
- Medetia, Imagine Institute for Genetic Diseases, 24 Boulevard du Montparnasse, 75015, Paris, France
| | - Christine Gast
- University of Southampton, Duthie Building (MP 808), Southampton General Hospital, Tremona Road Shirley, Southampton, SO16 6YD, UK
- Wessex Kidney Centre, Portsmouth Hospitals University NHS Trust, Southwick Hill Road, Cosham, Portsmouth, PO6 3LY, UK
| | - Rodney D Gilbert
- University of Southampton, Duthie Building (MP 808), Southampton General Hospital, Tremona Road Shirley, Southampton, SO16 6YD, UK
- Southampton Children's Hospital, Southampton General Hospital, Tremona Road Shirley, Southampton, SO16 6YD, UK
| | - Sarah Ennis
- University of Southampton, Duthie Building (MP 808), Southampton General Hospital, Tremona Road Shirley, Southampton, SO16 6YD, UK
| |
Collapse
|
45
|
Geoffroy V, Lamouche JB, Guignard T, Nicaise S, Kress A, Scheidecker S, Le Béchec A, Muller J. The AnnotSV webserver in 2023: updated visualization and ranking. Nucleic Acids Res 2023:7175348. [PMID: 37216590 DOI: 10.1093/nar/gkad426] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2023] [Revised: 04/20/2023] [Accepted: 05/09/2023] [Indexed: 05/24/2023] Open
Abstract
Much of the human genetics variant repertoire is composed of single nucleotide variants (SNV) and small insertion/deletions (indel) but structural variants (SV) remain a major part of our modified DNA. SV detection has often been a complex question to answer either because of the necessity to use different technologies (array CGH, SNP array, Karyotype, Optical Genome Mapping…) to detect each category of SV or to get an appropriate resolution (Whole Genome Sequencing). Thanks to the deluge of pangenomic analysis, Human geneticists are accumulating SV and their interpretation remains time consuming and challenging. The AnnotSV webserver (https://www.lbgi.fr/AnnotSV/) aims at being an efficient tool to (i) annotate and interpret SV potential pathogenicity in the context of human diseases, (ii) recognize potential false positive variants from all the SV identified and (iii) visualize the patient variants repertoire. The most recent developments in the AnnotSV webserver are: (i) updated annotations sources and ranking, (ii) three novel output formats to allow diverse utilization (analysis, pipelines), as well as (iii) two novel user interfaces including an interactive circos view.
Collapse
Affiliation(s)
- Véronique Geoffroy
- Université de Brest, Inserm, EFS, UMR 1078, GGB, F-29200 Brest, France
- Laboratoire de Génétique Médicale, UMR 1112, INSERM, IGMA, Université de Strasbourg, Strasbourg, France
| | - Jean-Baptiste Lamouche
- Laboratoire de Génétique Médicale, UMR 1112, INSERM, IGMA, Université de Strasbourg, Strasbourg, France
- Unité Fonctionnelle de Bioinformatique Médicale appliquée au diagnostic (UF7363), Hôpitaux Universitaires de Strasbourg, Strasbourg, France
| | | | - Samuel Nicaise
- Unité Fonctionnelle de Bioinformatique Médicale appliquée au diagnostic (UF7363), Hôpitaux Universitaires de Strasbourg, Strasbourg, France
| | - Arnaud Kress
- Complex Systems and Translational Bioinformatics, ICube, UMR 7357, University of Strasbourg, CNRS, FMTS, Strasbourg, France
| | - Sophie Scheidecker
- Laboratoire de Génétique Médicale, UMR 1112, INSERM, IGMA, Université de Strasbourg, Strasbourg, France
- Laboratoires de Diagnostic Génétique, IGMA, Hôpitaux Universitaires de Strasbourg, Strasbourg, France
| | - Antony Le Béchec
- Unité Fonctionnelle de Bioinformatique Médicale appliquée au diagnostic (UF7363), Hôpitaux Universitaires de Strasbourg, Strasbourg, France
| | - Jean Muller
- Laboratoire de Génétique Médicale, UMR 1112, INSERM, IGMA, Université de Strasbourg, Strasbourg, France
- Unité Fonctionnelle de Bioinformatique Médicale appliquée au diagnostic (UF7363), Hôpitaux Universitaires de Strasbourg, Strasbourg, France
- Laboratoires de Diagnostic Génétique, IGMA, Hôpitaux Universitaires de Strasbourg, Strasbourg, France
| |
Collapse
|
46
|
Callahan TJ, Stefanski AL, Wyrwa JM, Zeng C, Ostropolets A, Banda JM, Baumgartner WA, Boyce RD, Casiraghi E, Coleman BD, Collins JH, Deakyne Davies SJ, Feinstein JA, Lin AY, Martin B, Matentzoglu NA, Meeker D, Reese J, Sinclair J, Taneja SB, Trinkley KE, Vasilevsky NA, Williams AE, Zhang XA, Denny JC, Ryan PB, Hripcsak G, Bennett TD, Haendel MA, Robinson PN, Hunter LE, Kahn MG. Ontologizing health systems data at scale: making translational discovery a reality. NPJ Digit Med 2023; 6:89. [PMID: 37208468 PMCID: PMC10196319 DOI: 10.1038/s41746-023-00830-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Accepted: 04/28/2023] [Indexed: 05/21/2023] Open
Abstract
Common data models solve many challenges of standardizing electronic health record (EHR) data but are unable to semantically integrate all of the resources needed for deep phenotyping. Open Biological and Biomedical Ontology (OBO) Foundry ontologies provide computable representations of biological knowledge and enable the integration of heterogeneous data. However, mapping EHR data to OBO ontologies requires significant manual curation and domain expertise. We introduce OMOP2OBO, an algorithm for mapping Observational Medical Outcomes Partnership (OMOP) vocabularies to OBO ontologies. Using OMOP2OBO, we produced mappings for 92,367 conditions, 8611 drug ingredients, and 10,673 measurement results, which covered 68-99% of concepts used in clinical practice when examined across 24 hospitals. When used to phenotype rare disease patients, the mappings helped systematically identify undiagnosed patients who might benefit from genetic testing. By aligning OMOP vocabularies to OBO ontologies our algorithm presents new opportunities to advance EHR-based deep phenotyping.
Collapse
Affiliation(s)
- Tiffany J Callahan
- Computational Bioscience Program, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA.
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, 10032, USA.
| | - Adrianne L Stefanski
- Computational Bioscience Program, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA
| | - Jordan M Wyrwa
- Department of Physical Medicine and Rehabilitation, School of Medicine, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA
| | - Chenjie Zeng
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Anna Ostropolets
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, 10032, USA
| | - Juan M Banda
- Department of Computer Science, Georgia State University, Atlanta, GA, 30303, USA
| | - William A Baumgartner
- Computational Bioscience Program, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA
| | - Richard D Boyce
- Department of Biomedical Informatics, University of Pittsburgh School of Medicine, Pittsburgh, PA, 15260, USA
| | - Elena Casiraghi
- Computer Science, Università degli Studi di Milano, Milan, Italy
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06032, USA
| | - Ben D Coleman
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06032, USA
| | - Janine H Collins
- Department of Haematology, University of Cambridge, Cambridge, UK
| | - Sara J Deakyne Davies
- Department of Research Informatics & Data Science, Analytics Resource Center, Children's Hospital Colorado, Aurora, CO, 80045, USA
| | - James A Feinstein
- Adult and Child Center for Health Outcomes Research and Delivery Science (ACCORDS), University of Colorado Anschutz School of Medicine, Aurora, CO, 80045, USA
| | - Asiyah Y Lin
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Blake Martin
- Departments of Biomedical Informatics and Pediatrics, University of Colorado School of Medicine, Aurora, CO, 80045, USA
| | | | | | - Justin Reese
- Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, 94720, USA
| | | | - Sanya B Taneja
- Intelligent Systems Program, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Katy E Trinkley
- Department of Family Medicine, University of Colorado Anschutz School of Medicine, Aurora, CO, 80045, USA
| | - Nicole A Vasilevsky
- Translational and Integrative Sciences Lab, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA
| | - Andrew E Williams
- Tufts Institute for Clinical Research and Health Policy Studies, Tufts University, Boston, MA, 02155, USA
| | - Xingmin A Zhang
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06032, USA
| | - Joshua C Denny
- National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, 20892, USA
| | - Patrick B Ryan
- Janssen Research and Development, Raritan, NJ, 08869, USA
| | - George Hripcsak
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, 10032, USA
| | - Tellen D Bennett
- Departments of Biomedical Informatics and Pediatrics, University of Colorado School of Medicine, Aurora, CO, 80045, USA
| | - Melissa A Haendel
- Departments of Biomedical Informatics and Pediatrics, University of Colorado School of Medicine, Aurora, CO, 80045, USA
| | - Peter N Robinson
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06032, USA
| | - Lawrence E Hunter
- Computational Bioscience Program, University of Colorado Anschutz Medical Campus, Aurora, CO, 80045, USA
- Department of Biomedical Informatics, University of Colorado School of Medicine, Aurora, CO, 80045, USA
| | - Michael G Kahn
- Department of Biomedical Informatics, University of Colorado School of Medicine, Aurora, CO, 80045, USA
| |
Collapse
|
47
|
Kwak SH, Srinivasan S, Chen L, Todd J, Mercader J, Jensen E, Divers J, Mottl A, Pihoker C, Gandica R, Laffel L, Isganaitis E, Haymond M, Levitsky L, Pollin T, Florez J, Flannick J. Insights from rare variants into the genetic architecture and biology of youth-onset type 2 diabetes. RESEARCH SQUARE 2023:rs.3.rs-2886343. [PMID: 37292813 PMCID: PMC10246295 DOI: 10.21203/rs.3.rs-2886343/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Youth-onset type 2 diabetes (T2D) is a growing public health concern. Its genetic basis and relationship to other forms of diabetes are largely unknown. To gain insight into the genetic architecture and biology of youth-onset T2D, we analyzed exome sequences of 3,005 youth-onset T2D cases and 9,777 ancestry matched adult controls. We identified (a) monogenic diabetes variants in 2.1% of individuals; (b) two exome-wide significant (P < 4.3×10-7) common coding variant associations (in WFS1 and SLC30A8); (c) three exome-wide significant (P < 2.5×10-6) rare variant gene-level associations (HNF1A, MC4R, ATX2NL); and (d) rare variant association enrichments within 25 gene sets broadly related to obesity, monogenic diabetes, and β-cell function. Many association signals were shared between youth-onset and adult-onset T2D but had larger effects for youth-onset T2D risk (1.18-fold increase for common variants and 2.86-fold increase for rare variants). Both common and rare variant associations contributed more to youth-onset T2D liability variance than they did to adult-onset T2D, but the relative increase was larger for rare variant associations (5.0-fold) than for common variant associations (3.4-fold). Youth-onset T2D cases showed phenotypic differences depending on whether their genetic risk was driven by common variants (primarily related to insulin resistance) or rare variants (primarily related to β-cell dysfunction). These data paint a picture of youth-onset T2D as a disease genetically similar to both monogenic diabetes and adult-onset T2D, in which genetic heterogeneity might be used to sub-classify patients for different treatment strategies.
Collapse
Affiliation(s)
| | | | - Ling Chen
- Diabetes Research Center (Diabetes Unit), Department of Medicine, Massachusetts General Hospital
| | | | | | | | | | | | | | | | | | | | | | | | | | | | - Jason Flannick
- Broad Institute of MIT and Harvard/Boston Children's Hospital/Harvard Medical School
| |
Collapse
|
48
|
Beers BJ, Similuk MN, Ghosh R, Seifert BA, Jamal L, Kamen M, Setzer MR, Jodarski C, Duncan R, Hunt D, Mixer M, Cao W, Bi W, Veltri D, Karlins E, Zhang L, Li Z, Oler AJ, Jevtich K, Yu Y, Hullfish H, Bielekova B, Frischmeyer-Guerrerio P, Dang Do A, Huryn LA, Olivier KN, Su HC, Lyons JJ, Zerbe CS, Rao VK, Keller MD, Freeman AF, Holland SM, Franco LM, Walkiewicz MA, Yan J. Chromosomal microarray analysis supplements exome sequencing to diagnose children with suspected inborn errors of immunity. Front Immunol 2023; 14:1172004. [PMID: 37215141 PMCID: PMC10196392 DOI: 10.3389/fimmu.2023.1172004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Accepted: 04/04/2023] [Indexed: 05/24/2023] Open
Abstract
Purpose Though copy number variants (CNVs) have been suggested to play a significant role in inborn errors of immunity (IEI), the precise nature of this role remains largely unexplored. We sought to determine the diagnostic contribution of CNVs using genome-wide chromosomal microarray analysis (CMA) in children with IEI. Methods We performed exome sequencing (ES) and CMA for 332 unrelated pediatric probands referred for evaluation of IEI. The analysis included primary, secondary, and incidental findings. Results Of the 332 probands, 134 (40.4%) received molecular diagnoses. Of these, 116/134 (86.6%) were diagnosed by ES alone. An additional 15/134 (11.2%) were diagnosed by CMA alone, including two likely de novo changes. Three (2.2%) participants had diagnostic molecular findings from both ES and CMA, including two compound heterozygotes and one participant with two distinct diagnoses. Half of the participants with CMA contribution to diagnosis had CNVs in at least one non-immune gene, highlighting the clinical complexity of these cases. Overall, CMA contributed to 18/134 diagnoses (13.4%), increasing the overall diagnostic yield by 15.5% beyond ES alone. Conclusion Pairing ES and CMA can provide a comprehensive evaluation to clarify the complex factors that contribute to both immune and non-immune phenotypes. Such a combined approach to genetic testing helps untangle complex phenotypes, not only by clarifying the differential diagnosis, but in some cases by identifying multiple diagnoses contributing to the overall clinical presentation.
Collapse
Affiliation(s)
- Breanna J. Beers
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Morgan N. Similuk
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Rajarshi Ghosh
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Bryce A. Seifert
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Leila Jamal
- National Cancer Institute, National Institutes of Health, Bethesda, MD, United States
| | - Michael Kamen
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Michael R. Setzer
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Colleen Jodarski
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Rylee Duncan
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Devin Hunt
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Madison Mixer
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Wenjia Cao
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Weimin Bi
- Baylor Genetics, Houston, TX, United States
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, United States
| | - Daniel Veltri
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Eric Karlins
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Lingwen Zhang
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Zhiwen Li
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Andrew J. Oler
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Kathleen Jevtich
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Yunting Yu
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Haley Hullfish
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Bibiana Bielekova
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Pamela Frischmeyer-Guerrerio
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - An Dang Do
- Eunice Kennedy Shriver National Institute of Child Health and Human Development, National Institutes of Health, Bethesda, MD, United States
| | - Laryssa A. Huryn
- National Eye Institute, National Institutes of Health, Bethesda, MD, United States
| | - Kenneth N. Olivier
- Division of Pulmonary Diseases and Critical Care Medicine, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Helen C. Su
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Jonathan J. Lyons
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Christa S. Zerbe
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - V. Koneti Rao
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Michael D. Keller
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Alexandra F. Freeman
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Steven M. Holland
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Luis M. Franco
- National Institute of Arthritis and Musculoskeletal and Skin Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Magdalena A. Walkiewicz
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| | - Jia Yan
- National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD, United States
| |
Collapse
|
49
|
Echeverry-Quiceno LM, Candelo E, Gómez E, Solís P, Ramírez D, Ortiz D, González A, Sevillano X, Cuéllar JC, Pachajoa H, Martínez-Abadías N. Population-specific facial traits and diagnosis accuracy of genetic and rare diseases in an admixed Colombian population. Sci Rep 2023; 13:6869. [PMID: 37106005 PMCID: PMC10140286 DOI: 10.1038/s41598-023-33374-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2022] [Accepted: 04/12/2023] [Indexed: 04/29/2023] Open
Abstract
Up to 40% of rare disorders (RD) present facial dysmorphologies, and visual assessment is commonly used for clinical diagnosis. Quantitative approaches are more objective, but mostly rely on European descent populations, disregarding diverse population ancestry. Here, we assessed the facial phenotypes of Down (DS), Morquio (MS), Noonan (NS) and Neurofibromatosis type 1 (NF1) syndromes in a Latino-American population, recording the coordinates of 18 landmarks in 2D images from 79 controls and 51 patients. We quantified facial differences using Euclidean Distance Matrix Analysis, and assessed the diagnostic accuracy of Face2Gene, an automatic deep-learning algorithm. Individuals diagnosed with DS and MS presented severe phenotypes, with 58.2% and 65.4% of significantly different facial traits. The phenotype was milder in NS (47.7%) and non-significant in NF1 (11.4%). Each syndrome presented a characteristic dysmorphology pattern, supporting the diagnostic potential of facial biomarkers. However, population-specific traits were detected in the Colombian population. Diagnostic accuracy was 100% in DS, moderate in NS (66.7%) but lower in comparison to a European population (100%), and below 10% in MS and NF1. Moreover, admixed individuals showed lower facial gestalt similarities. Our results underscore that incorporating populations with Amerindian, African and European ancestry is crucial to improve diagnostic methods of rare disorders.
Collapse
Affiliation(s)
- Luis M Echeverry-Quiceno
- Departament de Biologia Evolutiva, Ecologia i Ciències Ambientals (BEECA), Facultat de Biologia, Universitat de Barcelona (UB), Av. Diagonal, 643. Planta 2, 08028, Barcelona, Spain
| | - Estephania Candelo
- Centro de Investigaciones en Anomalías Congénitas y Enfermedades Raras (CIACER), Universidad ICESI, Cali, Colombia
- Servicio de Genética Clínica, Fundación Valle del Lili, Cali, Colombia
| | - Eidith Gómez
- Centro de Investigaciones en Anomalías Congénitas y Enfermedades Raras (CIACER), Universidad ICESI, Cali, Colombia
| | - Paula Solís
- Centro de Investigaciones en Anomalías Congénitas y Enfermedades Raras (CIACER), Universidad ICESI, Cali, Colombia
| | - Diana Ramírez
- Centro de Investigaciones en Anomalías Congénitas y Enfermedades Raras (CIACER), Universidad ICESI, Cali, Colombia
| | - Diana Ortiz
- Centro de Investigaciones en Anomalías Congénitas y Enfermedades Raras (CIACER), Universidad ICESI, Cali, Colombia
| | - Alejandro González
- HER - Human-Environment Research Group, La Salle - Universitat Ramon Llull, Barcelona, Spain
| | - Xavier Sevillano
- HER - Human-Environment Research Group, La Salle - Universitat Ramon Llull, Barcelona, Spain
| | | | - Harry Pachajoa
- Centro de Investigaciones en Anomalías Congénitas y Enfermedades Raras (CIACER), Universidad ICESI, Cali, Colombia
- Servicio de Genética Clínica, Fundación Valle del Lili, Cali, Colombia
| | - Neus Martínez-Abadías
- Departament de Biologia Evolutiva, Ecologia i Ciències Ambientals (BEECA), Facultat de Biologia, Universitat de Barcelona (UB), Av. Diagonal, 643. Planta 2, 08028, Barcelona, Spain.
| |
Collapse
|
50
|
Xiao T, Dong X, Lu Y, Zhou W. High-Resolution and Multidimensional Phenotypes Can Complement Genomics Data to Diagnose Diseases in the Neonatal Population. PHENOMICS (CHAM, SWITZERLAND) 2023; 3:204-215. [PMID: 37197647 PMCID: PMC10110825 DOI: 10.1007/s43657-022-00071-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Revised: 08/03/2022] [Accepted: 08/08/2022] [Indexed: 05/19/2023]
Abstract
Advances in genomic medicine have greatly improved our understanding of human diseases. However, phenome is not well understood. High-resolution and multidimensional phenotypes have shed light on the mechanisms underlying neonatal diseases in greater details and have the potential to optimize clinical strategies. In this review, we first highlight the value of analyzing traditional phenotypes using a data science approach in the neonatal population. We then discuss recent research on high-resolution, multidimensional, and structured phenotypes in neonatal critical diseases. Finally, we briefly introduce current technologies available for the analysis of multidimensional data and the value that can be provided by integrating these data into clinical practice. In summary, a time series of multidimensional phenome can improve our understanding of disease mechanisms and diagnostic decision-making, stratify patients, and provide clinicians with optimized strategies for therapeutic intervention; however, the available technologies for collecting multidimensional data and the best platform for connecting multiple modalities should be considered.
Collapse
Affiliation(s)
- Tiantian Xiao
- Division of Neonatology, Children’s Hospital of Fudan University, National Children’s Medical Center, 399 Wanyuan Road, Shanghai, 201102 China
- Department of Neonatology, Chengdu Women’s and Children’s Central Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, 610000 China
| | - Xinran Dong
- Center for Molecular Medicine, Pediatric Research Institute, Children’s Hospital of Fudan University, National Children’s Medical Center, Shanghai, 201102 China
| | - Yulan Lu
- Center for Molecular Medicine, Pediatric Research Institute, Children’s Hospital of Fudan University, National Children’s Medical Center, Shanghai, 201102 China
| | - Wenhao Zhou
- Division of Neonatology, Children’s Hospital of Fudan University, National Children’s Medical Center, 399 Wanyuan Road, Shanghai, 201102 China
- Center for Molecular Medicine, Pediatric Research Institute, Children’s Hospital of Fudan University, National Children’s Medical Center, Shanghai, 201102 China
| |
Collapse
|