1
|
Liu X, Liu H, Yang G, Jiang Z, Cui S, Zhang Z, Wang H, Tao L, Sun Y, Song Z, Hong T, Yang J, Gao T, Zhang J, Li X, Zhang J, Sang Y, Yang Z, Xue K, Wu S, Zhang P, Yang J, Song C, Wang G. A generalist medical language model for disease diagnosis assistance. Nat Med 2025:10.1038/s41591-024-03416-6. [PMID: 39779927 DOI: 10.1038/s41591-024-03416-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2024] [Accepted: 11/12/2024] [Indexed: 01/11/2025]
Abstract
The delivery of accurate diagnoses is crucial in healthcare and represents the gateway to appropriate and timely treatment. Although recent large language models (LLMs) have demonstrated impressive capabilities in few-shot or zero-shot learning, their effectiveness in clinical diagnosis remains unproven. Here we present MedFound, a generalist medical language model with 176 billion parameters, pre-trained on a large-scale corpus derived from diverse medical text and real-world clinical records. We further fine-tuned MedFound to learn physicians' inferential diagnosis with a self-bootstrapping strategy-based chain-of-thought approach and introduced a unified preference alignment framework to align it with standard clinical practice. Extensive experiments demonstrate that our medical LLM outperforms other baseline LLMs and specialized models in in-distribution (common diseases), out-of-distribution (external validation) and long-tailed distribution (rare diseases) scenarios across eight specialties. Further ablation studies indicate the effectiveness of key components in our medical LLM training approach. We conducted a comprehensive evaluation of the clinical applicability of LLMs for diagnosis involving artificial intelligence (AI) versus physician comparison, AI-assistance study and human evaluation framework. Our proposed framework incorporates eight clinical evaluation metrics, covering capabilities such as medical record summarization, diagnostic reasoning and risk management. Our findings demonstrate the model's feasibility in assisting physicians with disease diagnosis as part of the clinical workflow.
Collapse
Affiliation(s)
- Xiaohong Liu
- State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China
| | - Hao Liu
- Department of Orthopedics, Peking University Third Hospital & Beijing Key Laboratory of Spinal Disease & Engineering Research Center of Bone and Joint Precision Medicine, Beijing, China
| | - Guoxing Yang
- State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China
| | - Zeyu Jiang
- State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China
| | - Shuguang Cui
- School of Science and Engineering (SSE), Future Network of Intelligence Institute (FNii) and Guangdong Provincial Key Laboratory of Future Networks of Intelligence, Chinese University of Hong Kong, Shenzhen, China
| | - Zhaoze Zhang
- Department of Orthopedics, Peking University Third Hospital & Beijing Key Laboratory of Spinal Disease & Engineering Research Center of Bone and Joint Precision Medicine, Beijing, China
| | - Huan Wang
- Department of Orthopedics, Peking University Third Hospital & Beijing Key Laboratory of Spinal Disease & Engineering Research Center of Bone and Joint Precision Medicine, Beijing, China
| | - Liyuan Tao
- Research Center of Clinical Epidemiology, Peking University Third Hospital, Beijing, China
| | - Yongchang Sun
- Department of Respiratory and Critical Care Medicine, Peking University Third Hospital and Research Center for Chronic Airway Diseases, Peking University Health Science Center, Beijing, China
| | - Zhu Song
- Department of Respiratory and Critical Care Medicine, Peking University Third Hospital and Research Center for Chronic Airway Diseases, Peking University Health Science Center, Beijing, China
| | - Tianpei Hong
- Department of Endocrinology and Metabolism, Peking University Third Hospital, Beijing, China
| | - Jin Yang
- Department of Endocrinology and Metabolism, Peking University Third Hospital, Beijing, China
| | - Tianrun Gao
- State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China
| | - Jiangjiang Zhang
- State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China
| | - Xiaohu Li
- State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China
| | - Jing Zhang
- Department of Cardiology, The First College of Clinical Medical Science, China Three Gorges University and Yichang Central People's Hospital, Yichang, China
| | - Ye Sang
- Department of Cardiology, The First College of Clinical Medical Science, China Three Gorges University and Yichang Central People's Hospital, Yichang, China
| | - Zhao Yang
- Peking University First Hospital and Research Center of Public Policy, Peking University, Beijing, China
| | - Kanmin Xue
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK
| | - Song Wu
- South China Hospital, Medical School, Shenzhen University, Shenzhen, China
| | - Ping Zhang
- State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China
| | - Jian Yang
- Department of Cardiology, The First College of Clinical Medical Science, China Three Gorges University and Yichang Central People's Hospital, Yichang, China.
| | - Chunli Song
- Department of Orthopedics, Peking University Third Hospital & Beijing Key Laboratory of Spinal Disease & Engineering Research Center of Bone and Joint Precision Medicine, Beijing, China.
| | - Guangyu Wang
- State Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing, China.
| |
Collapse
|
2
|
Di Fede E, Grazioli P, Lettieri A, Parodi C, Castiglioni S, Taci E, Colombo EA, Ancona S, Priori A, Gervasini C, Massa V. Epigenetic disorders: Lessons from the animals–animal models in chromatinopathies. Front Cell Dev Biol 2022; 10:979512. [PMID: 36225316 PMCID: PMC9548571 DOI: 10.3389/fcell.2022.979512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 09/05/2022] [Indexed: 11/13/2022] Open
Abstract
Chromatinopathies are defined as genetic disorders caused by mutations in genes coding for protein involved in the chromatin state balance. So far 82 human conditions have been described belonging to this group of congenital disorders, sharing some molecular features and clinical signs. For almost all of these conditions, no specific treatment is available. For better understanding the molecular cascade caused by chromatin imbalance and for envisaging possible therapeutic strategies it is fundamental to combine clinical and basic research studies. To this end, animal modelling systems represent an invaluable tool to study chromatinopathies. In this review, we focused on available data in the literature of animal models mimicking the human genetic conditions. Importantly, affected organs and abnormalities are shared in the different animal models and most of these abnormalities are reported as clinical manifestation, underlying the parallelism between clinics and translational research.
Collapse
Affiliation(s)
- Elisabetta Di Fede
- Department of Health Sciences, Università Degli Studi di Milano, Milan, Italy
| | - Paolo Grazioli
- Department of Health Sciences, Università Degli Studi di Milano, Milan, Italy
| | - Antonella Lettieri
- Department of Health Sciences, Università Degli Studi di Milano, Milan, Italy
| | - Chiara Parodi
- Department of Health Sciences, Università Degli Studi di Milano, Milan, Italy
| | - Silvia Castiglioni
- Department of Health Sciences, Università Degli Studi di Milano, Milan, Italy
| | - Esi Taci
- Department of Health Sciences, Università Degli Studi di Milano, Milan, Italy
| | - Elisa Adele Colombo
- Department of Health Sciences, Università Degli Studi di Milano, Milan, Italy
| | - Silvia Ancona
- Department of Health Sciences, Università Degli Studi di Milano, Milan, Italy
| | - Alberto Priori
- Department of Health Sciences, Università Degli Studi di Milano, Milan, Italy
- “Aldo Ravelli” Center for Neurotechnology and Experimental Brain Therapeutics, Università Degli Studi di Milano, Milan, Italy
| | - Cristina Gervasini
- Department of Health Sciences, Università Degli Studi di Milano, Milan, Italy
- “Aldo Ravelli” Center for Neurotechnology and Experimental Brain Therapeutics, Università Degli Studi di Milano, Milan, Italy
| | - Valentina Massa
- Department of Health Sciences, Università Degli Studi di Milano, Milan, Italy
- “Aldo Ravelli” Center for Neurotechnology and Experimental Brain Therapeutics, Università Degli Studi di Milano, Milan, Italy
- *Correspondence: Valentina Massa,
| |
Collapse
|
3
|
Gayduk A, Vlasov Y, Smirnova D. Application of modern approaches in the screening and early diagnosis programs for the orphan diseases. Zh Nevrol Psikhiatr Im S S Korsakova 2022; 122:30-39. [DOI: 10.17116/jnevro202212206130] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
4
|
Park E, Lee K, Han T, Nam HS. Agreement and Reliability Analysis of Machine Learning Scaling and Wireless Monitoring in the Assessment of Acute Proximal Weakness by Experts and Non-Experts: A Feasibility Study. J Pers Med 2022; 12:jpm12010020. [PMID: 35055335 PMCID: PMC8780198 DOI: 10.3390/jpm12010020] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Accepted: 12/17/2021] [Indexed: 12/15/2022] Open
Abstract
Assessing the symptoms of proximal weakness caused by neurological deficits requires the knowledge and experience of neurologists. Recent advances in machine learning and the Internet of Things have resulted in the development of automated systems that emulate physicians’ assessments. The application of those systems requires not only accuracy in the classification but also reliability regardless of users’ proficiency in the real environment for the clinical point-of-care and the personalized health management. This study provides an agreement and reliability analysis of using a machine learning-based scaling of Medical Research Council (MRC) proximal scores to evaluate proximal weakness by experts and non-experts. The system trains an ensemble learning model using the signals from sensors attached to the limbs of patients in a neurological intensive care unit. For the agreement analysis, we investigated the percent agreement of MRC proximal scores and Bland-Altman plots of kinematic features between the expert- and non-expert scaling. We also analyzed the intra-class correlation coefficients (ICCs) of kinematic features and Krippendorff’s alpha of the observers’ scaling for the reliability analysis. The mean percent agreement between the expert- and the non-expert scaling was 0.542 for manual scaling and 0.708 for autonomous scaling. The ICCs of kinematic features measured using sensors ranged from 0.742 to 0.850, whereas the Krippendorff’s alpha of manual scaling for the three observers was 0.275. The autonomous assessment system can be utilized by the caregivers, paramedics, or other observers during an emergency to evaluate acute stroke patients.
Collapse
Affiliation(s)
- Eunjeong Park
- Integrative Research Center for Cerebrovascular and Cardiovascular Diseases, Yonsei University College of Medicine, Seoul 03722, Korea;
| | - Kijeong Lee
- Department of Neurology, National Health Insurance Service, Ilsan Hospital, Goyang 10444, Korea;
| | - Taehwa Han
- Health-IT Center, Yonsei University College of Medicine, Seoul 03722, Korea;
| | - Hyo Suk Nam
- Department of Neurology, Yonsei University College of Medicine, Seoul 03722, Korea
- Correspondence: ; Tel.: +82-2-2228-1617
| |
Collapse
|
5
|
Yates J, Gutiérrez-Sacristán A, Jouhet V, LeBlanc K, Esteves C, DeSain TN, Benik N, Stedman J, Palmer N, Mellon G, Kohane I, Avillach P. Finding commonalities in rare diseases through the undiagnosed diseases network. J Am Med Inform Assoc 2021; 28:1694-1702. [PMID: 34009343 PMCID: PMC8324228 DOI: 10.1093/jamia/ocab050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Accepted: 03/05/2021] [Indexed: 11/14/2022] Open
Abstract
Objective When studying any specific rare disease, heterogeneity and scarcity of affected individuals has historically hindered investigators from discerning on what to focus to understand and diagnose a disease. New nongenomic methodologies must be developed that identify similarities in seemingly dissimilar conditions. Materials and Methods This observational study analyzes 1042 patients from the Undiagnosed Diseases Network (2015-2019), a multicenter, nationwide research study using phenotypic data annotated by specialized staff using Human Phenotype Ontology terms. We used Louvain community detection to cluster patients linked by Jaccard pairwise similarity and 2 support vector classifier to assign new cases. We further validated the clusters’ most representative comorbidities using a national claims database (67 million patients). Results Patients were divided into 2 groups: those with symptom onset before 18 years of age (n = 810) and at 18 years of age or older (n = 232) (average symptom onset age: 10 [interquartile range, 0-14] years). For 810 pediatric patients, we identified 4 statistically significant clusters. Two clusters were characterized by growth disorders, and developmental delay enriched for hypotonia presented a higher likelihood of diagnosis. Support vector classifier showed 0.89 balanced accuracy (0.83 for Human Phenotype Ontology terms only) on test data. Discussions To set the framework for future discovery, we chose as our endpoint the successful grouping of patients by phenotypic similarity and provide a classification tool to assign new patients to those clusters. Conclusion This study shows that despite the scarcity and heterogeneity of patients, we can still find commonalities that can potentially be harnessed to uncover new insights and targets for therapy.
Collapse
Affiliation(s)
- Josephine Yates
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | | | - Vianney Jouhet
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Kimberly LeBlanc
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Cecilia Esteves
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | | | - Thomas N DeSain
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Nick Benik
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Jason Stedman
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Nathan Palmer
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Guillaume Mellon
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Isaac Kohane
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Paul Avillach
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
- Corresponding Author: Paul Avillach, MD, PhD, 10 Shattuck Street, 02115 Boston, MA, USA;
| |
Collapse
|
6
|
Seoighe C, Bracken AP, Buckley P, Doran P, Green R, Healy S, Kavanagh D, Kenny E, Lawler M, Lowery M, Morris D, Morrissey D, O'Byrne JJ, Shields D, Smith O, Steward CA, Sweeney B, Kolch W. The future of genomics in Ireland - focus on genomics for health. HRB Open Res 2020; 3:89. [PMID: 33855271 PMCID: PMC7993626 DOI: 10.12688/hrbopenres.13187.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/30/2020] [Indexed: 12/15/2022] Open
Abstract
Genomics is revolutionizing biomedical research, medicine and healthcare globally in academic, public and industry sectors alike. Concrete examples around the world show that huge benefits for patients, society and economy can be accrued through effective and responsible genomic research and clinical applications. Unfortunately, Ireland has fallen behind and needs to act now in order to catch up. Here, we identify key issues that have resulted in Ireland lagging behind, describe how genomics can benefit Ireland and its people and outline the measures needed to make genomics work for Ireland and Irish patients. There is now an urgent need for a national genomics strategy that enables an effective, collaborative, responsible, well-regulated, and patient centred environment where genome research and clinical genomics can thrive. We present eight recommendations that could be the pillars of a national genomics health strategy.
Collapse
Affiliation(s)
- Cathal Seoighe
- National University of Ireland Galway, Galway, H91 TK33, Ireland
| | | | | | - Peter Doran
- University College Dublin, Dublin, 4, Ireland
- Mater Misericordiae University Hospital, Dublin, 7, Ireland
| | - Robert Green
- Brigham Health, Broad Institute, Ariadne Labs, Harvard Medical School, Boston, MA, 02115, USA
| | - Sandra Healy
- National University of Ireland Galway, Galway, H91 TK33, Ireland
| | - David Kavanagh
- Genuity Science (Ireland) Ltd., Dublin, D18 K7W4, Ireland
| | - Elaine Kenny
- Trinity College Dublin, Dublin, 2, Ireland
- ELDA Biotech, Trinity Translational Medicine Institute, St James's Hospital, Dublin, D08 W9RT, Ireland
| | - Mark Lawler
- Queen's University Belfast, Belfast, Northern Ireland, BT7 1NN, Ireland
| | - Maeve Lowery
- Trinity College Dublin, Dublin, 2, Ireland
- Saint James' Hospital, Dublin, D08 NHY1, Ireland
| | - Derek Morris
- National University of Ireland Galway, Galway, H91 TK33, Ireland
| | - Darrin Morrissey
- National Institute for Bioprocessing Research and Training, Blackrock, A94 X099, Ireland
| | | | | | - Owen Smith
- University College Dublin, Dublin, 4, Ireland
- Children’s Health Ireland, Crumlin, Dublin, D12 N512, Ireland
| | | | | | - Walter Kolch
- National University of Ireland Galway, Galway, H91 TK33, Ireland
| |
Collapse
|
7
|
Yan T, Li J, Zhou X, Yang Z, Zhang Y, Zhang J, Xu N, Huang Y, Yang H. Genetic determinants of fracture non-union: A systematic review from the literature. Gene 2020; 751:144766. [PMID: 32413481 DOI: 10.1016/j.gene.2020.144766] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2020] [Accepted: 05/11/2020] [Indexed: 12/28/2022]
Abstract
Approximately 10-15% of fracture patients suffer impaired healing, which is either delayed or even results in non-union. We performed a Systematic Review, aiming to examine the types and frequency of specific genetic abnormalities in patients experiencing bone fracture and to ascertain whether a genetic association exists regarding the tendency for some patients to suffer fracture non-union or postoperative non-union events. GO and KEGG analyses were used to identify the likely function of the genes involved. Furthermore, we evaluated the functional significance of single nucleotide polymorphisms using RegulomeDB and GTEx. Seven eligible studies involving 29 genes and 89 SNPs were analyzed in this review. We found that the polymorphisms in gene NOS2, NOG, BMP4, CYR61, IL1β and FGFR1 apparently predisposed patients to fracture non-union, while the polymorphisms in gene MMP13, BMP6 and FAM5C appeared to provide protection from non-union. Bioinformatics analysis suggested that these genes were enriched in inflammatory pathways, suggesting that inflammation may be a potential factor involved in fracture non-union. Three SNPs (rs17563, rs3753793 and rs2853550) had smaller RegulomeDB scores, indicating significant biological function. In conclusion, we have identified a number of genes and their polymorphisms that might contribute to a genetic susceptibility to fracture non-union. Further studies with larger cohorts will enhance our understanding of fracture non-union and may inform and direct early interventions.
Collapse
Affiliation(s)
- Ting Yan
- Department of Nursing, The Affiliated Changzhou No.2 People's Hospital of Nanjing Medical University, Changzhou 213000, China
| | - Jin Li
- Department of Orthopedics Surgery, The Second Affiliated Hospital of Jiaxing University, Jiaxing 314000, China
| | - Xindie Zhou
- Department of Orthopedics, The Affiliated Changzhou No.2 People's Hospital of Nanjing Medical University, Changzhou 213000, China
| | - Zhicheng Yang
- Department of Orthopedics, The Affiliated Changzhou No.2 People's Hospital of Nanjing Medical University, Changzhou 213000, China
| | - Yi Zhang
- Department of Orthopedics, The Affiliated Changzhou No.2 People's Hospital of Nanjing Medical University, Changzhou 213000, China
| | - Junjie Zhang
- Department of Orthopedics, The Affiliated Changzhou No.2 People's Hospital of Nanjing Medical University, Changzhou 213000, China
| | - Nanwei Xu
- Department of Orthopedics, The Affiliated Changzhou No.2 People's Hospital of Nanjing Medical University, Changzhou 213000, China
| | - Yong Huang
- Department of Orthopedics, The Affiliated Changzhou No.2 People's Hospital of Nanjing Medical University, Changzhou 213000, China.
| | - Haoyu Yang
- Department of Orthopedics, Wuxi 9th People's Hospital Affiliated to Soochow University, Wuxi 214000, China.
| |
Collapse
|
8
|
Schwarz JM, Hombach D, Köhler S, Cooper DN, Schuelke M, Seelow D. RegulationSpotter: annotation and interpretation of extratranscriptic DNA variants. Nucleic Acids Res 2020; 47:W106-W113. [PMID: 31106382 PMCID: PMC6602480 DOI: 10.1093/nar/gkz327] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2019] [Revised: 04/17/2019] [Accepted: 05/09/2019] [Indexed: 02/07/2023] Open
Abstract
RegulationSpotter is a web-based tool for the user-friendly annotation and interpretation of DNA variants located outside of protein-coding transcripts (extratranscriptic variants). It is designed for clinicians and researchers who wish to assess the potential impact of the considerable number of non-coding variants found in Whole Genome Sequencing runs. It annotates individual variants with underlying regulatory features in an intuitive way by assessing over 100 genome-wide annotations. Additionally, it calculates a score, which reflects the regulatory potential of the variant region. Its dichotomous classifications, ‘functional’ or ‘non-functional’, and a human-readable presentation of the underlying evidence allow a biologically meaningful interpretation of the score. The output shows key aspects of every variant and allows rapid access to more detailed information about its possible role in gene regulation. RegulationSpotter can either analyse single variants or complete VCF files. Variants located within protein-coding transcripts are automatically assessed by MutationTaster as well as by RegulationSpotter to account for possible intragenic regulatory effects. RegulationSpotter offers the possibility of using phenotypic data to focus on known disease genes or genomic elements interacting with them. RegulationSpotter is freely available at https://www.regulationspotter.org.
Collapse
Affiliation(s)
- Jana Marie Schwarz
- Department of Neuropediatrics, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health (BIH), Berlin, Germany.,Centrum für Therapieforschung, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health (BIH), Berlin, Germany.,NeuroCure Cluster of Excellence and NeuroCure Clinical Research Center, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health (BIH), Berlin, Germany
| | - Daniela Hombach
- Centrum für Therapieforschung, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health (BIH), Berlin, Germany.,NeuroCure Cluster of Excellence and NeuroCure Clinical Research Center, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health (BIH), Berlin, Germany
| | - Sebastian Köhler
- Centrum für Therapieforschung, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health (BIH), Berlin, Germany.,Berlin Institute of Health (BIH), Berlin, Germany.,Einstein Center for Digital Future, Berlin, Germany
| | - David N Cooper
- Institute of Medical Genetics, Cardiff University, Cardiff, UK
| | - Markus Schuelke
- Department of Neuropediatrics, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health (BIH), Berlin, Germany.,NeuroCure Cluster of Excellence and NeuroCure Clinical Research Center, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health (BIH), Berlin, Germany
| | - Dominik Seelow
- Centrum für Therapieforschung, Charité - Universitätsmedizin Berlin, corporate member of Freie Universität Berlin, Humboldt-Universität zu Berlin, and Berlin Institute of Health (BIH), Berlin, Germany.,Berlin Institute of Health (BIH), Berlin, Germany
| |
Collapse
|
9
|
Abu Diab A, AlTalbishi A, Rosin B, Kanaan M, Kamal L, Swaroop A, Chowers I, Banin E, Sharon D, Khateb S. The combination of whole-exome sequencing and clinical analysis allows better diagnosis of rare syndromic retinal dystrophies. Acta Ophthalmol 2019; 97:e877-e886. [PMID: 30925032 PMCID: PMC11377105 DOI: 10.1111/aos.14095] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2018] [Accepted: 03/03/2019] [Indexed: 01/05/2023]
Abstract
PURPOSE To identify the accurate clinical diagnosis of rare syndromic inherited retinal diseases (IRDs) based on the combination of clinical and genetic analyses. METHODS Four unrelated families with various autosomal recessive syndromic inherited retinal diseases were genetically investigated using whole-exome sequencing (WES). RESULTS Two affected subjects in family MOL0760 presented with a distinctive combination of short stature, developmental delay, congenital mental retardation, microcephaly, facial dysmorphism and retinitis pigmentosa (RP). Subjects were clinically diagnosed with suspected Kabuki syndrome. WES revealed a homozygous nonsense mutation (c.5492dup, p.Asn1831Lysfs*8) in VPS13B that is known to cause Cohen syndrome. The index case of family MOL1514 presented with both RP and liver dysfunction, suspected initially to be related. WES identified a homozygous frameshift mutation (c.1787_1788del, p.His596Argfs*47) in AGBL5, associated with nonsyndromic RP. The MOL1592 family included three affected subjects with crystalline retinopathy, skin ichthyosis, short stature and congenital adrenal hypoplasia, and were found to harbour a homozygous nonsense mutation (c.682C>T, p.Arg228Cys) in ALDH3A2, reported to cause Sjögren-Larsson syndrome (SLS). In the fourth family, SJ002, two siblings presented with hypotony, psychomotor delay, dysmorphic facial features, pathologic myopia, progressive external ophthalmoplegia and diffuse retinal atrophy. Probands were suspected to have atypical Kearns-Sayre syndrome, but were diagnosed with combined oxidative phosphorylation deficiency-20 due to a novel suspected missense variant (c.1691C>T, p.Ala564Val) in VARS2. CONCLUSION Our findings emphasize the important complement of WES and thorough clinical investigation in establishing precise clinical diagnosis. This approach constitutes the basis for personalized medicine in rare IRDs.
Collapse
Affiliation(s)
- Alaa Abu Diab
- Department of Ophthalmology, Hadassah-Hebrew University Medical Center, Jerusalem, Israel
| | | | - Boris Rosin
- Department of Ophthalmology, Hadassah-Hebrew University Medical Center, Jerusalem, Israel
| | - Moien Kanaan
- Hereditary Research Lab, Bethlehem University, Jerusalem, Israel
| | - Lara Kamal
- Hereditary Research Lab, Bethlehem University, Jerusalem, Israel
| | - Anand Swaroop
- Neurobiology-Neurodegeneration & Repair Laboratory, National Eye Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Itay Chowers
- Department of Ophthalmology, Hadassah-Hebrew University Medical Center, Jerusalem, Israel
| | - Eyal Banin
- Department of Ophthalmology, Hadassah-Hebrew University Medical Center, Jerusalem, Israel
| | - Dror Sharon
- Department of Ophthalmology, Hadassah-Hebrew University Medical Center, Jerusalem, Israel
| | - Samer Khateb
- Department of Ophthalmology, Hadassah-Hebrew University Medical Center, Jerusalem, Israel
| |
Collapse
|
10
|
Mossotto E, Ashton JJ, O'Gorman L, Pengelly RJ, Beattie RM, MacArthur BD, Ennis S. GenePy - a score for estimating gene pathogenicity in individuals using next-generation sequencing data. BMC Bioinformatics 2019; 20:254. [PMID: 31096927 PMCID: PMC6524327 DOI: 10.1186/s12859-019-2877-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Accepted: 05/06/2019] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Next-generation sequencing is revolutionising diagnosis and treatment of rare diseases, however its application to understanding common disease aetiology is limited. Rare disease applications binarily attribute genetic change(s) at a single locus to a specific phenotype. In common diseases, where multiple genetic variants within and across genes contribute to disease, binary modelling cannot capture the burden of pathogenicity harboured by an individual across a given gene/pathway. We present GenePy, a novel gene-level scoring system for integration and analysis of next-generation sequencing data on a per-individual basis that transforms NGS data interpretation from variant-level to gene-level. This simple and flexible scoring system is intuitive and amenable to integration for machine learning, network and topological approaches, facilitating the investigation of complex phenotypes. RESULTS Whole-exome sequencing data from 508 individuals were used to generate GenePy scores. For each variant a score is calculated incorporating: i) population allele frequency estimates; ii) individual zygosity, determined through standard variant calling pipelines and; iii) any user defined deleteriousness metric to inform on functional impact. GenePy then combines scores generated for all variants observed into a single gene score for each individual. We generated a matrix of ~ 14,000 GenePy scores for all individuals for each of sixteen popular deleteriousness metrics. All per-gene scores are corrected for gene length. The majority of genes generate GenePy scores < 0.01 although individuals harbouring multiple rare highly deleterious mutations can accumulate extremely high GenePy scores. In the absence of a comparator metric, we examine GenePy performance in discriminating genes known to be associated with three common, complex diseases. A Mann-Whitney U test conducted on GenePy scores for this positive control gene in cases versus controls demonstrates markedly more significant results (p = 1.37 × 10- 4) compared to the most commonly applied association tool that combines common and rare variation (p = 0.003). CONCLUSIONS Per-gene per-individual GenePy scores are intuitive when assessing genetic variation in individual patients or comparing scores between groups. GenePy outperforms the currently accepted best practice tools for combining common and rare variation. GenePy scores are suitable for downstream data integration with transcriptomic and proteomic data that also report at the gene level.
Collapse
Affiliation(s)
- E Mossotto
- Department of Human Genetics and Genomic Medicine, University of Southampton, Southampton, UK.
- Institute for Life Sciences, University of Southampton, Southampton, UK.
| | - J J Ashton
- Department of Human Genetics and Genomic Medicine, University of Southampton, Southampton, UK
- Department of Paediatric Gastroenterology, Southampton Children's Hospital, Southampton, UK
| | - L O'Gorman
- Department of Human Genetics and Genomic Medicine, University of Southampton, Southampton, UK
| | - R J Pengelly
- Department of Human Genetics and Genomic Medicine, University of Southampton, Southampton, UK
- Institute for Life Sciences, University of Southampton, Southampton, UK
| | - R M Beattie
- Department of Paediatric Gastroenterology, Southampton Children's Hospital, Southampton, UK
| | - B D MacArthur
- Institute for Life Sciences, University of Southampton, Southampton, UK
| | - S Ennis
- Department of Human Genetics and Genomic Medicine, University of Southampton, Southampton, UK
| |
Collapse
|
11
|
Hong S, Wang L, Zhao D, Zhang Y, Chen Y, Tan J, Liang L, Zhu T. Clinical utility in infants with suspected monogenic conditions through next-generation sequencing. Mol Genet Genomic Med 2019; 7:e684. [PMID: 30968598 PMCID: PMC6565546 DOI: 10.1002/mgg3.684] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Revised: 02/06/2019] [Accepted: 03/04/2019] [Indexed: 12/14/2022] Open
Abstract
Background Rare diseases are complex disorders with huge variability in clinical manifestations. Decreasing cost of next‐generation sequencing (NGS) tests in recent years made it affordable. We witnessed the diagnostic yield and clinical use of different NGS strategies on a myriad of monogenic disorders in a pediatric setting. Methods Next‐generation sequencing tests are performed for 98 unrelated Chinese patients within their first year of life, who were admitted to Xin Hua Hospital, affiliated with Shanghai Jiao Tong University School of Medicine, during a 2‐year period. Results Clinical indications for NGS tests included a range of medical concerns. The mean age was 4.4 ± 4.2 months of age for infants undergoing targeting specific (known) disease‐causing genes (TRS) analysis, and 4.4 ± 4.3 months of age for whole‐exome sequencing (WES) (p > 0.05). A molecular diagnosis is done in 72 infants (73.47%), which finds a relatively high yield with phenotypes of metabolism/homeostasis abnormality (HP: 0001939) (odds ratio, 1.83; 95% CI, 0.56–6.04; p = 0.32) and a significantly low yield with atypical symptoms (without a definite HPO term) (odds ratio, 0.08; 95% CI, 0.01–0.73; p = 0.03). TRS analysis provides molecular yields higher than WES (p = 0.01). Ninety‐eight different mutations are discovered in 72 patients. Twenty‐seven of them have not been reported previously. Nearly half (43.06%, 31/72) of the patients are found to carry 11 common disorders, mostly being inborn errors of metabolism (IEM) and neurogenetic disorders and all of them are observed through TRS analysis. Eight positive cases are identified through WES, and all of them are sporadic, of highly variable phenotypes and severity. There are 26 patients with negative findings in this study. Conclusion This study provides evidence that NGS can yield high success rates in a tertiary pediatric setting, but suggests that the scope of known Mendelian conditions may be considerably broader than currently recognized.
Collapse
Affiliation(s)
- Sha Hong
- Department of Neonatal Medicine, Xin-Hua Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Li Wang
- Department of Neonatal Medicine, Xin-Hua Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Dongying Zhao
- Department of Neonatal Medicine, Xin-Hua Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Yonghong Zhang
- Department of Neonatal Medicine, Xin-Hua Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Yan Chen
- Department of Neonatal Medicine, Xin-Hua Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Jintong Tan
- Department of Neonatal Medicine, Xin-Hua Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Lili Liang
- Department of Endocrinology and Genetic Metabolism, Xin-Hua Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Tianwen Zhu
- Department of Neonatal Medicine, Xin-Hua Hospital Affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| |
Collapse
|
12
|
Adamson KI, Sheridan E, Grierson AJ. Use of zebrafish models to investigate rare human disease. J Med Genet 2018; 55:641-649. [PMID: 30065072 DOI: 10.1136/jmedgenet-2018-105358] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2018] [Revised: 07/02/2018] [Accepted: 07/04/2018] [Indexed: 01/07/2023]
Abstract
Rare diseases are collectively common and often extremely debilitating. Following the emergence of next-generation sequencing (NGS) technologies, the variants underpinning rare genetic disorders are being unearthed at an accelerating rate. However, many rare conditions lack effective treatments due to their poorly understood pathophysiology. There is therefore a growing demand for the development of novel experimental models of rare genetic diseases, so that potentially causative variants can be validated, pathogenic mechanisms can be investigated and therapeutic targets can be identified. Animal models of rare diseases need to be genetically and physiologically similar to humans, and well-suited to large-scale experimental manipulation, considering the vast number of novel variants that are being identified through NGS. The zebrafish has emerged as a popular model system for investigating these variants, combining conserved vertebrate characteristics with a capacity for large-scale phenotypic and therapeutic screening. In this review, we aim to highlight the unique advantages of the zebrafish over other in vivo model systems for the large-scale study of rare genetic variants. We will also consider the generation of zebrafish disease models from a practical standpoint, by discussing how genome editing technologies, particularly the recently developed clustered regularly interspaced repeat (CRISPR)/CRISPR-associated protein 9 system, can be used to model rare pathogenic variants in zebrafish. Finally, we will review examples in the literature where zebrafish models have played a pivotal role in confirming variant causality and revealing the underlying mechanisms of rare diseases, often with wider implications for our understanding of human biology.
Collapse
Affiliation(s)
- Kathryn Isabel Adamson
- Sheffield Institute for Translational Neuroscience, University of Sheffield, Sheffield, UK
| | | | - Andrew James Grierson
- Sheffield Institute for Translational Neuroscience, University of Sheffield, Sheffield, UK.,Department of Neuroscience, University of Sheffield, Sheffield, UK
| |
Collapse
|
13
|
Yubero D, Montero R, Santos-Ocaña C, Salviati L, Navas P, Artuch R. Molecular diagnosis of coenzyme Q 10 deficiency: an update. Expert Rev Mol Diagn 2018; 18:491-498. [PMID: 29781757 DOI: 10.1080/14737159.2018.1478290] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
INTRODUCTION Coenzyme Q10 (CoQ) deficiency syndromes comprise a growing number of genetic disorders. While primary CoQ deficiency syndromes are rare diseases, secondary deficiencies have been related to both genetic and environmental conditions, which are the main causes of biochemical CoQ deficiency. The diagnosis is the essential first step for planning future treatment strategies, as the potential treatability of CoQ deficiency is the most critical issue for the patients. Areas covered: While the quickest and most effective tool to define a CoQ-deficient status is its biochemical determination in biological fluids or tissues, this quantification does not provide a definite diagnosis of a CoQ-deficient status nor insight about the genetic etiology of the disease. The different laboratory tests to check for CoQ deficiency are evaluated in order to choose the best diagnostic pathway for the patient. Expert commentary: New insights are being discovered about the implication of new proteins in the intricate CoQ biosynthetic pathway. These insights reinforce the idea that next generation sequencing diagnostic strategies are the unique alternative in terms of rapid and accurate molecular diagnosis of CoQ deficiency.
Collapse
Affiliation(s)
- Delia Yubero
- a Department of Genetic and Clinical Biochemistry , Institut de Recerca Sant Joan de Déu, and CIBER de Enfermedades Raras (CIBERER) , Barcelona , Spain
| | - Raquel Montero
- a Department of Genetic and Clinical Biochemistry , Institut de Recerca Sant Joan de Déu, and CIBER de Enfermedades Raras (CIBERER) , Barcelona , Spain
| | - Carlos Santos-Ocaña
- b Centro Andaluz de Biología del Desarrollo , Universidad Pablo de Olavide and CIBERER , Sevilla , Spain
| | - Leonardo Salviati
- c Clinical Genetics Unit, Department of Pediatrics , University of Padova , Padova , Italy
| | - Placido Navas
- b Centro Andaluz de Biología del Desarrollo , Universidad Pablo de Olavide and CIBERER , Sevilla , Spain
| | - Rafael Artuch
- a Department of Genetic and Clinical Biochemistry , Institut de Recerca Sant Joan de Déu, and CIBER de Enfermedades Raras (CIBERER) , Barcelona , Spain
| |
Collapse
|
14
|
Blazejewski SM, Bennison SA, Smith TH, Toyo-Oka K. Neurodevelopmental Genetic Diseases Associated With Microdeletions and Microduplications of Chromosome 17p13.3. Front Genet 2018; 9:80. [PMID: 29628935 PMCID: PMC5876250 DOI: 10.3389/fgene.2018.00080] [Citation(s) in RCA: 46] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2017] [Accepted: 02/26/2018] [Indexed: 01/24/2023] Open
Abstract
Chromosome 17p13.3 is a region of genomic instability that is linked to different rare neurodevelopmental genetic diseases, depending on whether a deletion or duplication of the region has occurred. Chromosome microdeletions within 17p13.3 can result in either isolated lissencephaly sequence (ILS) or Miller-Dieker syndrome (MDS). Both conditions are associated with a smooth cerebral cortex, or lissencephaly, which leads to developmental delay, intellectual disability, and seizures. However, patients with MDS have larger deletions than patients with ILS, resulting in additional symptoms such as poor muscle tone, congenital anomalies, abnormal spasticity, and craniofacial dysmorphisms. In contrast to microdeletions in 17p13.3, recent studies have attracted considerable attention to a condition known as a 17p13.3 microduplication syndrome. Depending on the genes involved in their microduplication, patients with 17p13.3 microduplication syndrome may be categorized into either class I or class II. Individuals in class I have microduplications of the YWHAE gene encoding 14-3-3ε, as well as other genes in the region. However, the PAFAH1B1 gene encoding LIS1 is never duplicated in these patients. Class I microduplications generally result in learning disabilities, autism, and developmental delays, among other disorders. Individuals in class II always have microduplications of the PAFAH1B1 gene, which may include YWHAE and other genetic microduplications. Class II microduplications generally result in smaller body size, developmental delays, microcephaly, and other brain malformations. Here, we review the phenotypes associated with copy number variations (CNVs) of chromosome 17p13.3 and detail their developmental connection to particular microdeletions or microduplications. We also focus on existing single and double knockout mouse models that have been used to study human phenotypes, since the highly limited number of patients makes a study of these conditions difficult in humans. These models are also crucial for the study of brain development at a mechanistic level since this cannot be accomplished in humans. Finally, we emphasize the usefulness of the CRISPR/Cas9 system and next generation sequencing in the study of neurodevelopmental diseases.
Collapse
Affiliation(s)
- Sara M Blazejewski
- Department of Neurobiology and Anatomy, Drexel University College of Medicine, Philadelphia, PA, United States
| | - Sarah A Bennison
- Department of Neurobiology and Anatomy, Drexel University College of Medicine, Philadelphia, PA, United States
| | - Trevor H Smith
- Department of Neurobiology and Anatomy, Drexel University College of Medicine, Philadelphia, PA, United States
| | - Kazuhito Toyo-Oka
- Department of Neurobiology and Anatomy, Drexel University College of Medicine, Philadelphia, PA, United States
| |
Collapse
|
15
|
Cremers S, Aronson JK. Drugs for rare disorders. Br J Clin Pharmacol 2017; 83:1607-1613. [PMID: 28653488 DOI: 10.1111/bcp.13331] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2017] [Accepted: 05/10/2017] [Indexed: 02/06/2023] Open
Abstract
Estimates of the frequencies of rare disorders vary from country to country; the global average defined prevalence is 40 per 100 000 (0.04%). Some occur in only one or a few patients. However, collectively rare disorders are fairly common, affecting 6-8% of the US population, or about 30 million people, and a similar number in the European Union. Most of them affect children and most are genetically determined. Diagnosis can be difficult, partly because of variable presentations and partly because few clinicians have experience of individual rare disorders, although they may be assisted by searching databases. Relatively few rare disorders have specific pharmacological treatments (so-called orphan drugs), partly because of difficulties in designing trials large enough to determine benefits and harms alike. Incentives have been introduced to encourage the development of orphan drugs, including tax credits and research aids, simplification of marketing authorization procedures and exemption from fees, and extended market exclusivity. Consequently, the number of applications for orphan drugs has grown, as have the costs of using them, so much so that treatments may not be cost-effective. It has therefore been suggested that not-for-profit organizations that are socially motivated to reduce those costs should be tasked with producing them. A growing role for patient organizations, improved clinical and translational infrastructures, and developments in genetics have also contributed to successful drug development. The translational discipline of clinical pharmacology is an essential component in drug development, including orphan drugs. Clinical pharmacologists, skilled in basic pharmacology and its links to clinical medicine, can be involved at all stages. They can contribute to the delineation of genetic factors that determine clinical outcomes of pharmacological interventions, develop biomarkers, design and perform clinical trials, assist regulatory decision making, and conduct postmarketing surveillance and pharmacoepidemiological and pharmacoeconomic assessments.
Collapse
Affiliation(s)
- Serge Cremers
- Departments of Pathology & Cell Biology and Medicine, and Irving Institute for Clinical and Translational Research, Columbia University Medical Center, New York, NY, 10027, USA
| | - Jeffrey K Aronson
- Centre for Evidence Based Medicine, Nuffield Department of Primary Care Health Sciences, Radcliffe Infirmary, Woodstock Road, Oxford,, OX2 6GG, UK
| |
Collapse
|
16
|
Affiliation(s)
- Sean Ekins
- Collaborations Pharmaceuticals, Inc., 5616 Hilltop Needmore Road, Fuquay Varina, NC27526, USA
- Phoenix Nest, P.O. BOX 150057, Brooklyn NY 11215, USA
- Hereditary Neuropathy Foundation, 432 Park Avenue South – 4 floor, New York, NY 10016, USA
| |
Collapse
|
17
|
Whole-Genome Sequencing of a Canine Family Trio Reveals a FAM83G Variant Associated with Hereditary Footpad Hyperkeratosis. G3-GENES GENOMES GENETICS 2016; 6:521-7. [PMID: 26747202 PMCID: PMC4777115 DOI: 10.1534/g3.115.025643] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Over 250 Mendelian traits and disorders, caused by rare alleles have been mapped in the canine genome. Although each disease is rare in the dog as a species, they are collectively common and have major impact on canine health. With SNP-based genotyping arrays, genome-wide association studies (GWAS) have proven to be a powerful method to map the genomic region of interest when 10–20 cases and 10–20 controls are available. However, to identify the genetic variant in associated regions, fine-mapping and targeted resequencing is required. Here we present a new approach using whole-genome sequencing (WGS) of a family trio without prior GWAS. As a proof-of-concept, we chose an autosomal recessive disease known as hereditary footpad hyperkeratosis (HFH) in Kromfohrländer dogs. To our knowledge, this is the first time this family trio WGS-approach has been used successfully to identify a genetic variant that perfectly segregates with a canine disorder. The sequencing of three Kromfohrländer dogs from a family trio (an affected offspring and both its healthy parents) resulted in an average genome coverage of 9.2X per individual. After applying stringent filtering criteria for candidate causative coding variants, 527 single nucleotide variants (SNVs) and 15 indels were found to be homozygous in the affected offspring and heterozygous in the parents. Using the computer software packages ANNOVAR and SIFT to functionally annotate coding sequence differences, and to predict their functional effect, resulted in seven candidate variants located in six different genes. Of these, only FAM83G:c155G > C (p.R52P) was found to be concordant in eight additional cases, and 16 healthy Kromfohrländer dogs.
Collapse
|