1
|
REEV: review, evaluate and explain variants. Nucleic Acids Res 2024:gkae366. [PMID: 38769069 DOI: 10.1093/nar/gkae366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Revised: 04/07/2024] [Accepted: 05/03/2024] [Indexed: 05/22/2024] Open
Abstract
In the era of high throughput sequencing, special software is required for the clinical evaluation of genetic variants. We developed REEV (Review, Evaluate and Explain Variants), a user-friendly platform for clinicians and researchers in the field of rare disease genetics. Supporting data was aggregated from public data sources. We compared REEV with seven other tools for clinical variant evaluation. REEV (semi-)automatically fills individual ACMG criteria facilitating variant interpretation. REEV can store disease and phenotype data related to a case to use these for phenotype similarity measures. Users can create public permanent links for individual variants that can be saved as browser bookmarks and shared. REEV may help in the fast diagnostic assessment of genetic variants in a clinical as well as in a research context. REEV (https://reev.bihealth.org/) is free and open to all users and there is no login requirement.
Collapse
|
2
|
Loss-of-function variants affecting the STAGA complex component SUPT7L cause a developmental disorder with generalized lipodystrophy. Hum Genet 2024; 143:683-694. [PMID: 38592547 PMCID: PMC11098864 DOI: 10.1007/s00439-024-02669-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Accepted: 03/11/2024] [Indexed: 04/10/2024]
Abstract
Generalized lipodystrophy is a feature of various hereditary disorders, often leading to a progeroid appearance. In the present study we identified a missense and a frameshift variant in a compound heterozygous state in SUPT7L in a boy with intrauterine growth retardation, generalized lipodystrophy, and additional progeroid features. SUPT7L encodes a component of the transcriptional coactivator complex STAGA. By transcriptome sequencing, we showed the predicted missense variant to cause aberrant splicing, leading to exon truncation and thereby to a complete absence of SUPT7L in dermal fibroblasts. In addition, we found altered expression of genes encoding DNA repair pathway components. This pathway was further investigated and an increased rate of DNA damage was detected in proband-derived fibroblasts and genome-edited HeLa cells. Finally, we performed transient overexpression of wildtype SUPT7L in both cellular systems, which normalizes the number of DNA damage events. Our findings suggest SUPT7L as a novel disease gene and underline the link between genome instability and progeroid phenotypes.
Collapse
|
3
|
The Human Phenotype Ontology in 2024: phenotypes around the world. Nucleic Acids Res 2024; 52:D1333-D1346. [PMID: 37953324 PMCID: PMC10767975 DOI: 10.1093/nar/gkad1005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 10/12/2023] [Accepted: 10/19/2023] [Indexed: 11/14/2023] Open
Abstract
The Human Phenotype Ontology (HPO) is a widely used resource that comprehensively organizes and defines the phenotypic features of human disease, enabling computational inference and supporting genomic and phenotypic analyses through semantic similarity and machine learning algorithms. The HPO has widespread applications in clinical diagnostics and translational research, including genomic diagnostics, gene-disease discovery, and cohort analytics. In recent years, groups around the world have developed translations of the HPO from English to other languages, and the HPO browser has been internationalized, allowing users to view HPO term labels and in many cases synonyms and definitions in ten languages in addition to English. Since our last report, a total of 2239 new HPO terms and 49235 new HPO annotations were developed, many in collaboration with external groups in the fields of psychiatry, arthrogryposis, immunology and cardiology. The Medical Action Ontology (MAxO) is a new effort to model treatments and other measures taken for clinical management. Finally, the HPO consortium is contributing to efforts to integrate the HPO and the GA4GH Phenopacket Schema into electronic health records (EHRs) with the goal of more standardized and computable integration of rare disease data in EHRs.
Collapse
|
4
|
Discovery of a non-canonical GRHL1 binding site using deep convolutional and recurrent neural networks. BMC Genomics 2023; 24:736. [PMID: 38049725 PMCID: PMC10696883 DOI: 10.1186/s12864-023-09830-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 11/22/2023] [Indexed: 12/06/2023] Open
Abstract
BACKGROUND Transcription factors regulate gene expression by binding to transcription factor binding sites (TFBSs). Most models for predicting TFBSs are based on position weight matrices (PWMs), which require a specific motif to be present in the DNA sequence and do not consider interdependencies of nucleotides. Novel approaches such as Transcription Factor Flexible Models or recurrent neural networks consequently provide higher accuracies. However, it is unclear whether such approaches can uncover novel non-canonical, hitherto unexpected TFBSs relevant to human transcriptional regulation. RESULTS In this study, we trained a convolutional recurrent neural network with HT-SELEX data for GRHL1 binding and applied it to a set of GRHL1 binding sites obtained from ChIP-Seq experiments from human cells. We identified 46 non-canonical GRHL1 binding sites, which were not found by a conventional PWM approach. Unexpectedly, some of the newly predicted binding sequences lacked the CNNG core motif, so far considered obligatory for GRHL1 binding. Using isothermal titration calorimetry, we experimentally confirmed binding between the GRHL1-DNA binding domain and predicted GRHL1 binding sites, including a non-canonical GRHL1 binding site. Mutagenesis of individual nucleotides revealed a correlation between predicted binding strength and experimentally validated binding affinity across representative sequences. This correlation was neither observed with a PWM-based nor another deep learning approach. CONCLUSIONS Our results show that convolutional recurrent neural networks may uncover unanticipated binding sites and facilitate quantitative transcription factor binding predictions.
Collapse
|
5
|
Alternative splicing is coupled to gene expression in a subset of variably expressed genes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.13.544742. [PMID: 37398049 PMCID: PMC10312658 DOI: 10.1101/2023.06.13.544742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Numerous factors regulate alternative splicing of human genes at a co-transcriptional level. However, how alternative splicing depends on the regulation of gene expression is poorly understood. We leveraged data from the Genotype-Tissue Expression (GTEx) project to show a significant association of gene expression and splicing for 6874 (4.9%) of 141,043 exons in 1106 (13.3%) of 8314 genes with substantially variable expression in ten GTEx tissues. About half of these exons demonstrate higher inclusion with higher gene expression, and half demonstrate higher exclusion, with the observed direction of coupling being highly consistent across different tissues and in external datasets. The exons differ with respect to sequence characteristics, enriched sequence motifs, RNA polymerase II binding, and inferred transcription rate of downstream introns. The exons were enriched for hundreds of isoform-specific Gene Ontology annotations, suggesting that the coupling of expression and alternative splicing described here may provide an important gene regulatory mechanism that might be used in a variety of biological contexts. In particular, higher inclusion exons could play an important role during cell division.
Collapse
|
6
|
Broadening the phenotypic and molecular spectrum of FINCA syndrome: Biallelic NHLRC2 variants in 15 novel individuals. Eur J Hum Genet 2023; 31:905-917. [PMID: 37188825 PMCID: PMC10400545 DOI: 10.1038/s41431-023-01382-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Revised: 04/17/2023] [Accepted: 04/26/2023] [Indexed: 05/17/2023] Open
Abstract
FINCA syndrome [MIM: 618278] is an autosomal recessive multisystem disorder characterized by fibrosis, neurodegeneration and cerebral angiomatosis. To date, 13 patients from nine families with biallelic NHLRC2 variants have been published. In all of them, the recurrent missense variant p.(Asp148Tyr) was detected on at least one allele. Common manifestations included lung or muscle fibrosis, respiratory distress, developmental delay, neuromuscular symptoms and seizures often followed by early death due to rapid disease progression.Here, we present 15 individuals from 12 families with an overlapping phenotype associated with nine novel NHLRC2 variants identified by exome analysis. All patients described here presented with moderate to severe global developmental delay and variable disease progression. Seizures, truncal hypotonia and movement disorders were frequently observed. Notably, we also present the first eight cases in which the recurrent p.(Asp148Tyr) variant was not detected in either homozygous or compound heterozygous state.We cloned and expressed all novel and most previously published non-truncating variants in HEK293-cells. From the results of these functional studies, we propose a potential genotype-phenotype correlation, with a greater reduction in protein expression being associated with a more severe phenotype.Taken together, our findings broaden the known phenotypic and molecular spectrum and emphasize that NHLRC2-related disease should be considered in patients presenting with intellectual disability, movement disorders, neuroregression and epilepsy with or without pulmonary involvement.
Collapse
|
7
|
Editorial: the 21st annual Nucleic Acids Research Web Server Issue 2023. Nucleic Acids Res 2023:7195025. [PMID: 37309890 PMCID: PMC10320045 DOI: 10.1093/nar/gkad517] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 06/09/2023] [Indexed: 06/14/2023] Open
|
8
|
GA4GH Phenopackets: A Practical Introduction. ADVANCED GENETICS (HOBOKEN, N.J.) 2023; 4:2200016. [PMID: 36910590 PMCID: PMC10000265 DOI: 10.1002/ggn2.202200016] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 06/30/2022] [Indexed: 11/08/2022]
Abstract
The Global Alliance for Genomics and Health (GA4GH) is developing a suite of coordinated standards for genomics for healthcare. The Phenopacket is a new GA4GH standard for sharing disease and phenotype information that characterizes an individual person, linking that individual to detailed phenotypic descriptions, genetic information, diagnoses, and treatments. A detailed example is presented that illustrates how to use the schema to represent the clinical course of a patient with retinoblastoma, including demographic information, the clinical diagnosis, phenotypic features and clinical measurements, an examination of the extirpated tumor, therapies, and the results of genomic analysis. The Phenopacket Schema, together with other GA4GH data and technical standards, will enable data exchange and provide a foundation for the computational analysis of disease and phenotype information to improve our ability to diagnose and conduct research on all types of disorders, including cancer and rare diseases.
Collapse
|
9
|
Editorial: the 20th annual Nucleic Acids Research Web Server Issue 2022. Nucleic Acids Res 2022; 50:W1-W3. [PMID: 37012857 PMCID: PMC9252774 DOI: 10.1093/nar/gkac525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
|
10
|
RegEl corpus: identifying DNA regulatory elements in the scientific literature. Database (Oxford) 2022; 2022:6618549. [PMID: 35758881 PMCID: PMC9235371 DOI: 10.1093/database/baac043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Revised: 05/25/2022] [Accepted: 06/02/2022] [Indexed: 11/17/2022]
Abstract
High-throughput technologies led to the generation of a wealth of data on regulatory DNA elements in the human genome. However, results from disease-driven studies are primarily shared in textual form as scientific articles. Information extraction (IE) algorithms allow this information to be (semi-)automatically accessed. Their development, however, is dependent on the availability of annotated corpora. Therefore, we introduce RegEl (Regulatory Elements), the first freely available corpus annotated with regulatory DNA elements comprising 305 PubMed abstracts for a total of 2690 sentences. We focus on enhancers, promoters and transcription factor binding sites. Three annotators worked in two stages, achieving an overall 0.73 F1 inter-annotator agreement and 0.46 for regulatory elements. Depending on the entity type, IE baselines reach F1-scores of 0.48–0.91 for entity detection and 0.71–0.88 for entity normalization. Next, we apply our entity detection models to the entire PubMed collection and extract co-occurrences of genes or diseases with regulatory elements. This generates large collections of regulatory elements associated with 137 870 unique genes and 7420 diseases, which we make openly available. Database URL: https://zenodo.org/record/6418451#.YqcLHvexVqg
Collapse
|
11
|
FABIAN-variant: predicting the effects of DNA variants on transcription factor binding. Nucleic Acids Res 2022; 50:W322-W329. [PMID: 35639768 PMCID: PMC9252790 DOI: 10.1093/nar/gkac393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 04/22/2022] [Accepted: 05/06/2022] [Indexed: 12/03/2022] Open
Abstract
While great advances in predicting the effects of coding variants have been made, the assessment of non-coding variants remains challenging. This is especially problematic for variants within promoter regions which can lead to over-expression of a gene or reduce or even abolish its expression. The binding of transcription factors to the DNA can be predicted using position weight matrices (PWMs). More recently, transcription factor flexible models (TFFMs) have been introduced and shown to be more accurate than PWMs. TFFMs are based on hidden Markov models and can account for complex positional dependencies. Our new web-based application FABIAN-variant uses 1224 TFFMs and 3790 PWMs to predict whether and to which degree DNA variants affect the binding of 1387 different human transcription factors. For each variant and transcription factor, the software combines the results of different models for a final prediction of the resulting binding-affinity change. The software is written in C++ for speed but variants can be entered through a web interface. Alternatively, a VCF file can be uploaded to assess variants identified by high-throughput sequencing. The search can be restricted to variants in the vicinity of candidate genes. FABIAN-variant is available freely at https://www.genecascade.org/fabian/.
Collapse
|
12
|
Deep phenotyping: symptom annotation made simple with SAMS. Nucleic Acids Res 2022; 50:W677-W681. [PMID: 35524573 PMCID: PMC9252818 DOI: 10.1093/nar/gkac329] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 04/16/2022] [Accepted: 04/24/2022] [Indexed: 11/14/2022] Open
Abstract
Precision medicine needs precise phenotypes. The Human Phenotype Ontology (HPO) uses clinical signs instead of diagnoses and has become the standard annotation for patients’ phenotypes when describing single gene disorders. Use of the HPO beyond human genetics is however still limited. With SAMS (Symptom Annotation Made Simple), we want to bring sign-based phenotyping to routine clinical care, to hospital patients as well as to outpatients. Our web-based application provides access to three widely used annotation systems: HPO, OMIM, Orphanet. Whilst data can be stored in our database, phenotypes can also be imported and exported as Global Alliance for Genomics and Health (GA4GH) Phenopackets without using the database. The web interface can easily be integrated into local databases, e.g. clinical information systems. SAMS offers users to share their data with others, empowering patients to record their own signs and symptoms (or those of their children) and thus provide their doctors with additional information. We think that our approach will lead to better characterised patients which is not only helpful for finding disease mutations but also to better understand the pathophysiology of diseases and to recruit patients for studies and clinical trials. SAMS is freely available at https://www.genecascade.org/SAMS/.
Collapse
|
13
|
AutozygosityMapper: Identification of disease-mutations in consanguineous families. Nucleic Acids Res 2022; 50:W83-W89. [PMID: 35489060 PMCID: PMC9252840 DOI: 10.1093/nar/gkac280] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Revised: 04/06/2022] [Accepted: 04/12/2022] [Indexed: 11/14/2022] Open
Abstract
With the shift from SNP arrays to high-throughput sequencing, most researchers studying diseases in consanguineous families do not rely on linkage analysis any longer, but simply search for deleterious variants which are homozygous in all patients. AutozygosityMapper allows the fast and convenient identification of disease mutations in patients from consanguineous pedigrees by focussing on homozygous segments shared by all patients. Users can upload multi-sample VCF files, including WGS data, without any pre-processing. Genome-wide runs of homozygosity and the underlying genotypes are presented in graphical interfaces. AutozygosityMapper extends the functions of its predecessor, HomozygosityMapper, to the search for autozygous regions, in which all patients share the same homozygous genotype. We provide export of VCF files containing only the variants found in homozygous regions, this usually reduces the number of variants by two orders of magnitude. These regions can also directly be analysed with our disease mutation identification tool MutationDistiller. The application comes with simple and intuitive graphical interfaces for data upload, analysis, and results. We kept the structure of HomozygosityMapper so that previous users will find it easy to switch. With AutozygosityMapper, we provide a fast web-based way to identify disease mutations in consanguineous families. AutozygosityMapper is freely available at https://www.genecascade.org/AutozygosityMapper/.
Collapse
|
14
|
Novel sequencing technologies and bioinformatic tools for deciphering the non-coding genome. MED GENET-BERLIN 2021. [DOI: 10.1515/medgen-2021-2072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Abstract
High-throughput sequencing techniques have significantly increased the molecular diagnosis rate for patients with monogenic disorders. This is primarily due to a substantially increased identification rate of disease mutations in the coding sequence, primarily SNVs and indels. Further progress is hampered by difficulties in the detection of structural variants and the interpretation of variants outside the coding sequence. In this review, we provide an overview about how novel sequencing techniques and state-of-the-art algorithms can be used to discover small and structural variants across the whole genome and introduce bioinformatic tools for the prediction of effects variants may have in the non-coding part of the genome.
Collapse
|
15
|
Public data sources for regulatory genomic features. MED GENET-BERLIN 2021. [DOI: 10.1515/medgen-2021-2075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Abstract
High-throughput technologies have led to a continuously growing amount of information about regulatory features in the genome. A wealth of data generated by large international research consortia is available from online databases. Disease-driven studies provide details on specific DNA elements or epigenetic modifications regulating gene expression in specific cellular and developmental contexts, but these results are usually only published in scientific articles. All this information can be helpful in interpreting variants in the regulatory genome. This review describes a selection of high-profile data sources providing information on the non-coding genome, as well as pitfalls and techniques to search and capture information from the literature.
Collapse
|
16
|
Aviator: a web service for monitoring the availability of web services. Nucleic Acids Res 2021; 49:W46-W51. [PMID: 34038559 PMCID: PMC8262725 DOI: 10.1093/nar/gkab396] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Revised: 04/26/2021] [Accepted: 04/28/2021] [Indexed: 02/06/2023] Open
Abstract
With Aviator, we present a web service and repository that facilitates surveillance of online tools. Aviator consists of a user-friendly website and two modules, a literature-mining based general and a manually curated module. The general module currently checks 9417 websites twice a day with respect to their availability and stores many features (frontend and backend response time, required RAM and size of the web page, security certificates, analytic tools and trackers embedded in the webpage and others) in a data warehouse. Aviator is also equipped with an analysis functionality, for example authors can check and evaluate the availability of their own tools or those of their peers. Likewise, users can check the availability of a certain tool they intend to use in research or teaching to avoid including unstable tools. The curated section of Aviator offers additional services. We provide API snippets for common programming languages (Perl, PHP, Python, JavaScript) as well as an OpenAPI documentation for embedding in the backend of own web services for an automatic test of their function. We query the respective APIs twice a day and send automated notifications in case of an unexpected result. Naturally, the same analysis functionality as for the literature-based module is available for the curated section. Aviator can freely be used at https://www.ccb.uni-saarland.de/aviator.
Collapse
|
17
|
MutationTaster2021. Nucleic Acids Res 2021; 49:W446-W451. [PMID: 33893808 PMCID: PMC8262698 DOI: 10.1093/nar/gkab266] [Citation(s) in RCA: 103] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Revised: 03/26/2021] [Accepted: 04/01/2021] [Indexed: 01/13/2023] Open
Abstract
Here we present an update to MutationTaster, our DNA variant effect prediction tool. The new version uses a different prediction model and attains higher accuracy than its predecessor, especially for rare benign variants. In addition, we have integrated many sources of data that only became available after the last release (such as gnomAD and ExAC pLI scores) and changed the splice site prediction model. To more easily assess the relevance of detected known disease mutations to the clinical phenotype of the patient, MutationTaster now provides information on the diseases they cause. Further changes represent a major overhaul of the interfaces to increase user-friendliness whilst many changes under the hood have been designed to accelerate the processing of uploaded VCF files. We also offer an API for the rapid automated query of smaller numbers of variants from within other software. MutationTaster2021 integrates our disease mutation search engine, MutationDistiller, to prioritise variants from VCF files using the patient's clinical phenotype. The novel version is available at https://www.genecascade.org/MutationTaster2021/. This website is free and open to all users and there is no login requirement.
Collapse
|
18
|
Biallelic truncating variants in ATP9A cause a novel neurodevelopmental disorder involving postnatal microcephaly and failure to thrive. J Med Genet 2021; 59:662-668. [PMID: 34379057 PMCID: PMC9252857 DOI: 10.1136/jmedgenet-2021-107843] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Accepted: 05/20/2021] [Indexed: 12/04/2022]
Abstract
Background Genes implicated in the Golgi and endosomal trafficking machinery are crucial for brain development, and mutations in them are particularly associated with postnatal microcephaly (POM). Methods Exome sequencing was performed in three affected individuals from two unrelated consanguineous families presenting with delayed neurodevelopment, intellectual disability of variable degree, POM and failure to thrive. Patient-derived fibroblasts were tested for functional effects of the variants. Results We detected homozygous truncating variants in ATP9A. While the variant in family A is predicted to result in an early premature termination codon, the variant in family B affects a canonical splice site. Both variants lead to a substantial reduction of ATP9A mRNA expression. It has been shown previously that ATP9A localises to early and recycling endosomes, whereas its depletion leads to altered gene expression of components from this compartment. Consistent with previous findings, we also observed overexpression of ARPC3 and SNX3, genes strongly interacting with ATP9A. Conclusion In aggregate, our findings show that pathogenic variants in ATP9A cause a novel autosomal recessive neurodevelopmental disorder with POM. While the physiological function of endogenous ATP9A is still largely elusive, our results underline a crucial role of this gene in endosomal transport in brain tissue.
Collapse
|
19
|
VarFish: comprehensive DNA variant analysis for diagnostics and research. Nucleic Acids Res 2020; 48:W162-W169. [PMID: 32338743 PMCID: PMC7319464 DOI: 10.1093/nar/gkaa241] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2020] [Revised: 03/27/2020] [Accepted: 04/16/2020] [Indexed: 12/18/2022] Open
Abstract
VarFish is a user-friendly web application for the quality control, filtering, prioritization, analysis, and user-based annotation of DNA variant data with a focus on rare disease genetics. It is capable of processing variant call files with single or multiple samples. The variants are automatically annotated with population frequencies, molecular impact, and presence in databases such as ClinVar. Further, it provides support for pathogenicity scores including CADD, MutationTaster, and phenotypic similarity scores. Users can filter variants based on these annotations and presumed inheritance pattern and sort the results by these scores. Variants passing the filter are listed with their annotations and many useful link-outs to genome browsers, other gene/variant data portals, and external tools for variant assessment. VarFish allows users to create their own annotations including support for variant assessment following ACMG-AMP guidelines. In close collaboration with medical practitioners, VarFish was designed for variant analysis and prioritization in diagnostic and research settings as described in the software's extensive manual. The user interface has been optimized for supporting these protocols. Users can install VarFish on their own in-house servers where it provides additional lab notebook features for collaborative analysis and allows re-analysis of cases, e.g. after update of genotype or phenotype databases.
Collapse
|
20
|
Pervasive and CpG-dependent promoter-like characteristics of transcribed enhancers. Nucleic Acids Res 2020; 48:5306-5317. [PMID: 32338759 PMCID: PMC7261191 DOI: 10.1093/nar/gkaa223] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2019] [Revised: 03/23/2020] [Accepted: 03/25/2020] [Indexed: 12/17/2022] Open
Abstract
The temporal and spatial expression of genes is controlled by promoters and enhancers. Findings obtained over the last decade that not only promoters but also enhancers are characterized by bidirectional, divergent transcription have challenged the traditional notion that promoters and enhancers represent distinct classes of regulatory elements. Over half of human promoters are associated with CpG islands (CGIs), relatively CpG-rich stretches of generally several hundred nucleotides that are often associated with housekeeping genes. Only about 6% of transcribed enhancers defined by CAGE-tag analysis are associated with CGIs. Here, we present an analysis of enhancer and promoter characteristics and relate them to the presence or absence of CGIs. We show that transcribed enhancers share a number of CGI-dependent characteristics with promoters, including statistically significant local overrepresentation of core promoter elements. CGI-associated enhancers are longer, display higher directionality of transcription, greater expression, a lesser degree of tissue specificity, and a higher frequency of transcription-factor binding events than non-CGI-associated enhancers. Genes putatively regulated by CGI-associated enhancers are enriched for transcription regulator activity. Our findings show that CGI-associated transcribed enhancers display a series of characteristics related to sequence, expression and function that distinguish them from enhancers not associated with CGIs.
Collapse
|
21
|
An intronic splice site alteration in combination with a large deletion affecting VPS13B (COH1) causes Cohen syndrome. Eur J Med Genet 2020; 63:103973. [PMID: 32505691 DOI: 10.1016/j.ejmg.2020.103973] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Revised: 04/06/2020] [Accepted: 06/01/2020] [Indexed: 01/15/2023]
Abstract
Cohen syndrome (CS) is a rare, autosomal recessive disorder characterized by intellectual disability, postnatal microcephaly, facial abnormalities, abnormal truncal fat distribution, myopia, and pigmentary retinopathy. It is often considered an underdiagnosed condition, especially in children with developmental delay and intellectual disability. Here we report on four individuals from a large Jordanian family clinically diagnosed with CS. Using Trio Exome Sequencing (Trio-WES) and MLPA analyses we identified a maternally inherited novel intronic nucleotide substitution c.3446-23T>G leading to the activation of a cryptic splice site and a paternally inherited multi-exon deletion in VPS13B (previously termed COH1) in the index patient. Expression analysis showed a strong decrease of VPS13B mRNA levels and direct sequencing of cDNA confirmed splicing at a cryptic upstream splice acceptor site, resulting in the inclusion of 22 intronic bases. This extension results in a frameshift and a premature stop of translation (p.Gly1149Valfs*9). Segregation analysis revealed that three affected maternal cousins were homozygous for the intronic splice site variant. Our data show causality of both alterations and strongly suggest the expansion of the diagnostic strategy to search for intronic splice variants in molecularly unconfirmed patients affected by CS.
Collapse
|
22
|
MutationDistiller: user-driven identification of pathogenic DNA variants. Nucleic Acids Res 2020; 47:W114-W120. [PMID: 31106342 PMCID: PMC6602447 DOI: 10.1093/nar/gkz330] [Citation(s) in RCA: 31] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2019] [Revised: 04/12/2019] [Accepted: 05/10/2019] [Indexed: 01/10/2023] Open
Abstract
MutationDistiller is a freely available online tool for user-driven analyses of Whole Exome Sequencing data. It offers a user-friendly interface aimed at clinicians and researchers, who are not necessarily bioinformaticians. MutationDistiller combines MutationTaster's pathogenicity predictions with a phenotype-based approach. Phenotypic information is not limited to symptoms included in the Human Phenotype Ontology (HPO), but may also comprise clinical diagnoses and the suspected mode of inheritance. The search can be restricted to lists of candidate genes (e.g. virtual gene panels) and by tissue-specific gene expression. The inclusion of GeneOntology (GO) and metabolic pathways facilitates the discovery of hitherto unknown disease genes. In a novel approach, we trained MutationDistiller's HPO-based prioritization on authentic genotype-phenotype sets obtained from ClinVar and found it to match or outcompete current prioritization tools in terms of accuracy. In the output, the program provides a list of potential disease mutations ordered by the likelihood of the affected genes to cause the phenotype. MutationDistiller provides links to gene-related information from various resources. It has been extensively tested by clinicians and their suggestions have been valued in many iterative cycles of revisions. The tool, a comprehensive documentation and examples are freely available at https://www.mutationdistiller.org/.
Collapse
|
23
|
RegulationSpotter: annotation and interpretation of extratranscriptic DNA variants. Nucleic Acids Res 2020; 47:W106-W113. [PMID: 31106382 PMCID: PMC6602480 DOI: 10.1093/nar/gkz327] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2019] [Revised: 04/17/2019] [Accepted: 05/09/2019] [Indexed: 02/07/2023] Open
Abstract
RegulationSpotter is a web-based tool for the user-friendly annotation and interpretation of DNA variants located outside of protein-coding transcripts (extratranscriptic variants). It is designed for clinicians and researchers who wish to assess the potential impact of the considerable number of non-coding variants found in Whole Genome Sequencing runs. It annotates individual variants with underlying regulatory features in an intuitive way by assessing over 100 genome-wide annotations. Additionally, it calculates a score, which reflects the regulatory potential of the variant region. Its dichotomous classifications, ‘functional’ or ‘non-functional’, and a human-readable presentation of the underlying evidence allow a biologically meaningful interpretation of the score. The output shows key aspects of every variant and allows rapid access to more detailed information about its possible role in gene regulation. RegulationSpotter can either analyse single variants or complete VCF files. Variants located within protein-coding transcripts are automatically assessed by MutationTaster as well as by RegulationSpotter to account for possible intragenic regulatory effects. RegulationSpotter offers the possibility of using phenotypic data to focus on known disease genes or genomic elements interacting with them. RegulationSpotter is freely available at https://www.regulationspotter.org.
Collapse
|
24
|
Phenotero: Annotate as you write. Clin Genet 2018; 95:287-292. [PMID: 30417324 DOI: 10.1111/cge.13471] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2018] [Revised: 10/25/2018] [Accepted: 11/06/2018] [Indexed: 11/28/2022]
Abstract
In clinical genetics, the Human Phenotype Ontology as well as disease ontologies are often used for deep phenotyping of patients and coding of clinical diagnoses. However, assigning ontology classes to patient descriptions is often disconnected from writing patient reports or manuscripts in word processing software. This additional workload and the requirement to install dedicated software may discourage usage of ontologies for parts of the target audience. Here we present Phenotero, a freely available and simple solution to annotate patient phenotypes and diseases at the time of writing clinical reports or manuscripts. We adopt Zotero, a citation management software to create a tool which allows to reference classes from ontologies within text at the time of writing. We expect this approach to decrease the additional workload to a minimum while ensuring high quality associations with ontology classes. Standardized collection of phenotypic information at the time of describing the patient allows for streamlining the clinic workflow and efficient data entry. It will subsequently promote clinical and molecular diagnosis with the ultimate goal of better understanding genetic diseases. Thus, we believe that Phenotero eases the usage of ontologies and controlled vocabularies in the field of clinical genetics.
Collapse
|
25
|
De novo mutation in ELOVL1 causes ichthyosis, acanthosis nigricans, hypomyelination, spastic paraplegia, high frequency deafness and optic atrophy. J Med Genet 2018; 56:164-175. [PMID: 30487246 DOI: 10.1136/jmedgenet-2018-105711] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2018] [Revised: 11/07/2018] [Accepted: 11/13/2018] [Indexed: 12/21/2022]
Abstract
BACKGROUND Very long-chain fatty acids (VLCFAs) are essential for functioning of biological membranes. ELOVL fatty acid elongase 1 catalyses elongation of saturated and monounsaturated C22-C26-VLCFAs. We studied two patients with a dominant ELOVL1 mutation. Independently, Kutkowska-Kaźmierczak et al. had investigated the same patients and found the same mutation. We extended our study towards additional biochemical, functional, and therapeutic aspects. METHODS We did mutation screening by whole exome sequencing. RNA-sequencing was performed in patient and control fibroblasts. Ceramide and sphingomyelin levels were measured by LC-MS/MS. ELOVL1 activity was determined by a stable isotope-labelled [13C]malonyl-CoA elongation assay. ELOVL1 expression patterns were investigated by immunofluorescence, in situ hybridisation and RT-qPCR. As treatment option, we investigated VLCFA loading of fibroblasts. RESULTS Both patients carried an identical heterozygous de novo ELOVL1 mutation (c.494C>T, NM_001256399; p.S165F) not deriving from a founder allele. Patients suffered from epidermal hyperproliferation and increased keratinisation (ichthyosis). Hypomyelination of the central white matter explained spastic paraplegia and central nystagmus, while optic atrophy was causative for reduction of peripheral vision and visual acuity. The mutation abrogated ELOVL1 enzymatic activity and reduced ≥C24 ceramides and sphingomyelins in patient cells. Fibroblast loading with C22:0-VLCFAs increased C24:0-ceramides and sphingomyelins. We found competitive inhibition for ceramide and sphingomyelin synthesis between saturated and monounsaturated VLCFAs. Transcriptome analysis revealed upregulation of modules involved in epidermal development and keratinisation, and downregulation of genes for neurodevelopment, myelination, and synaptogenesis. Many regulated genes carried consensus proliferator-activated receptor (PPAR)α and PPARγ binding motifs in their 5'-regions. CONCLUSION A dominant ELOVL1 mutation causes a neuro-ichthyotic disorder possibly amenable to treatment with PPAR-modulating drugs.
Collapse
|
26
|
Harmonising phenomics information for a better interoperability in the rare disease field. Eur J Med Genet 2018; 61:706-714. [PMID: 29425702 DOI: 10.1016/j.ejmg.2018.01.013] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2017] [Revised: 11/30/2017] [Accepted: 01/27/2018] [Indexed: 01/30/2023]
Abstract
HIPBI-RD (Harmonising phenomics information for a better interoperability in the rare disease field) is a three-year project which started in 2016 funded via the E-Rare 3 ERA-NET program. This project builds on three resources largely adopted by the rare disease (RD) community: Orphanet, its ontology ORDO (the Orphanet Rare Disease Ontology), HPO (the Human Phenotype Ontology) as well as PhenoTips software for the capture and sharing of structured phenotypic data for RD patients. Our project is further supported by resources developed by the European Bioinformatics Institute and the Garvan Institute. HIPBI-RD aims to provide the community with an integrated, RD-specific bioinformatics ecosystem that will harmonise the way phenomics information is stored in databases and patient files worldwide, and thereby contribute to interoperability. This ecosystem will consist of a suite of tools and ontologies, optimized to work together, and made available through commonly used software repositories. The project workplan follows three main objectives: The HIPBI-RD ecosystem will contribute to the interpretation of variants identified through exome and full genome sequencing by harmonising the way phenotypic information is collected, thus improving diagnostics and delineation of RD. The ultimate goal of HIPBI-RD is to provide a resource that will contribute to bridging genome-scale biology and a disease-centered view on human pathobiology. Achievements in Year 1.
Collapse
|
27
|
Erratum to: A systematic, large-scale comparison of transcription factor binding site models. BMC Genomics 2016; 17:502. [PMID: 27440159 PMCID: PMC4952225 DOI: 10.1186/s12864-016-2818-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
|
28
|
A systematic, large-scale comparison of transcription factor binding site models. BMC Genomics 2016; 17:388. [PMID: 27209209 PMCID: PMC4875604 DOI: 10.1186/s12864-016-2729-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2016] [Accepted: 05/06/2016] [Indexed: 11/10/2022] Open
Abstract
Background The modelling of gene regulation is a major challenge in biomedical research. This process is dominated by transcription factors (TFs) and mutations in their binding sites (TFBSs) may cause the misregulation of genes, eventually leading to disease. The consequences of DNA variants on TF binding are modelled in silico using binding matrices, but it remains unclear whether these are capable of accurately representing in vivo binding. In this study, we present a systematic comparison of binding models for 82 human TFs from three freely available sources: JASPAR matrices, HT-SELEX-generated models and matrices derived from protein binding microarrays (PBMs). We determined their ability to detect experimentally verified “real” in vivo TFBSs derived from ENCODE ChIP-seq data. As negative controls we chose random downstream exonic sequences, which are unlikely to harbour TFBS. All models were assessed by receiver operating characteristics (ROC) analysis. Results While the area-under-curve was low for most of the tested models with only 47 % reaching a score of 0.7 or higher, we noticed strong differences between the various position-specific scoring matrices with JASPAR and HT-SELEX models showing higher success rates than PBM-derived models. In addition, we found that while TFBS sequences showed a higher degree of conservation than randomly chosen sequences, there was a high variability between individual TFBSs. Conclusions Our results show that only few of the matrix-based models used to predict potential TFBS are able to reliably detect experimentally confirmed TFBS. We compiled our findings in a freely accessible web application called ePOSSUM (http:/mutationtaster.charite.de/ePOSSUM/) which uses a Bayes classifier to assess the impact of genetic alterations on TF binding in user-defined sequences. Additionally, ePOSSUM provides information on the reliability of the prediction using our test set of experimentally confirmed binding sites. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2729-8) contains supplementary material, which is available to authorized users.
Collapse
|
29
|
Recessive REEP1 mutation is associated with congenital axonal neuropathy and diaphragmatic palsy. NEUROLOGY-GENETICS 2015; 1:e32. [PMID: 27066569 PMCID: PMC4811389 DOI: 10.1212/nxg.0000000000000032] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/11/2015] [Accepted: 09/08/2015] [Indexed: 12/22/2022]
Abstract
OBJECTIVE To identify the underlying genetic cause of a congenital neuropathy in a 5-year-old boy as part of a cohort of 32 patients from 23 families with genetically unresolved neuropathies. METHODS We used autozygosity mapping coupled with next-generation sequencing to investigate a consanguineous family from Lebanon with 1 affected and 2 healthy children. Variants were investigated for segregation in the family by Sanger sequencing. A splice site mutation was further evaluated on the messenger RNA level by quantitative reverse transcription PCR. Subsequently, a larger cohort was specifically screened for receptor expression-enhancing protein 1 (REEP1) gene mutations. RESULTS We detected a homozygous splice donor mutation in REEP1 (c.303+1-7GTAATAT>AC, p.F62Kfs23*; NM_022912) that cosegregated with the phenotype in the family, leading to complete skipping of exon 4 and a premature stop codon. The phenotype of the patient is similar to spinal muscular atrophy with respiratory distress type 1 (SMARD1) with additional distal arthrogryposis and involvement of the upper motor neuron manifested by pronounced hyperreflexia. CONCLUSION To date, only dominant REEP1 mutations have been reported to be associated with a slowly progressive hereditary spastic paraplegia. The findings from our patient expand the phenotypical spectrum and the mode of inheritance of REEP1-associated disorders. Recessive mutations in REEP1 should be considered in the molecular genetic workup of patients with a neuromuscular disorder resembling SMARD1, especially if additional signs of upper motor neuron involvement and distal arthrogryposis are present.
Collapse
|
30
|
Clinical application of whole exome sequencing reveals a novel compound heterozygous TK2-mutation in two brothers with rapidly progressive combined muscle-brain atrophy, axonal neuropathy, and status epilepticus. Mitochondrion 2014; 20:1-6. [PMID: 25446393 DOI: 10.1016/j.mito.2014.10.007] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2014] [Revised: 10/05/2014] [Accepted: 10/29/2014] [Indexed: 10/24/2022]
Abstract
Mutations in several genes cause mtDNA depletion associated with encephalomyopathy. Due to phenotypic overlap, it is difficult to conclude from clinical phenotype to genetic defect. Here we report on two brothers who presented with rapid fatty muscle degeneration, axonal neuropathy, rapid loss of supratentorial white and gray matter, and status epilepticus. Whole exome sequencing coupled with 'identity-by-state' (IBS) analysis revealed a compound heterozygous missense mutation (p.M117V, p.A139V) in the thymidine kinase 2 (TK2) gene that segregated with the phenotype. Both mutations were located in the thymidine binding pouch of the enzyme. Residual mtDNA copy numbers in muscle were 8.5%, but normal in blood and fibroblasts. Our results broaden the clinical phenotype spectrum of TK2 mutations and promote WES as a useful method in the clinical setting for mutation detection, even in untypical cases. If two or more affected siblings from a non-consanguineous family can be investigated, IBS-analysis provides a powerful tool to narrow the number of disease candidates, similarly to autozygosity mapping in consanguineous families.
Collapse
|
31
|
|
32
|
GrabBlur--a framework to facilitate the secure exchange of whole-exome and -genome SNV data using VCF files. BMC Genomics 2014; 15 Suppl 4:S8. [PMID: 25055742 PMCID: PMC4083413 DOI: 10.1186/1471-2164-15-s4-s8] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Background Next Generation Sequencing (NGS) of whole exomes or genomes is increasingly being used in human genetic research and diagnostics. Sharing NGS data with third parties can help physicians and researchers to identify causative or predisposing mutations for a specific sample of interest more efficiently. In many cases, however, the exchange of such data may collide with data privacy regulations. GrabBlur is a newly developed tool to aggregate and share NGS-derived single nucleotide variant (SNV) data in a public database, keeping individual samples unidentifiable. In contrast to other currently existing SNV databases, GrabBlur includes phenotypic information and contact details of the submitter of a given database entry. By means of GrabBlur human geneticists can securely and easily share SNV data from resequencing projects. GrabBlur can ease the interpretation of SNV data by offering basic annotations, genotype frequencies and in particular phenotypic information - given that this information was shared - for the SNV of interest. Tool description GrabBlur facilitates the combination of phenotypic and NGS data (VCF files) via a local interface or command line operations. Data submissions may include HPO (Human Phenotype Ontology) terms, other trait descriptions, NGS technology information and the identity of the submitter. Most of this information is optional and its provision at the discretion of the submitter. Upon initial intake, GrabBlur merges and aggregates all sample-specific data. If a certain SNV is rare, the sample-specific information is replaced with the submitter identity. Generally, all data in GrabBlur are highly aggregated so that they can be shared with others while ensuring maximum privacy. Thus, it is impossible to reconstruct complete exomes or genomes from the database or to re-identify single individuals. After the individual information has been sufficiently "blurred", the data can be uploaded into a publicly accessible domain where aggregated genotypes are provided alongside phenotypic information. A web interface allows querying the database and the extraction of gene-wise SNV information. If an interesting SNV is found, the interrogator can get in contact with the submitter to exchange further information on the carrier and clarify, for example, whether the latter's phenotype matches with phenotype of their own patient.
Collapse
|
33
|
Abstract
Numerous new disease-gene associations have been identified by whole-exome sequencing studies in the last few years. However, many cases remain unsolved due to the sheer number of candidate variants remaining after common filtering strategies such as removing low quality and common variants and those deemed unlikely to be pathogenic. The observation that each of our genomes contains about 100 genuine loss-of-function variants makes identification of the causative mutation problematic when using these strategies alone. We propose using the wealth of genotype to phenotype data that already exists from model organism studies to assess the potential impact of these exome variants. Here, we introduce PHenotypic Interpretation of Variants in Exomes (PHIVE), an algorithm that integrates the calculation of phenotype similarity between human diseases and genetically modified mouse models with evaluation of the variants according to allele frequency, pathogenicity, and mode of inheritance approaches in our Exomiser tool. Large-scale validation of PHIVE analysis using 100,000 exomes containing known mutations demonstrated a substantial improvement (up to 54.1-fold) over purely variant-based (frequency and pathogenicity) methods with the correct gene recalled as the top hit in up to 83% of samples, corresponding to an area under the ROC curve of >95%. We conclude that incorporation of phenotype data can play a vital role in translational bioinformatics and propose that exome sequencing projects should systematically capture clinical phenotypes to take advantage of the strategy presented here.
Collapse
|
34
|
Identification of a Ninein (NIN) mutation in a family with spondyloepimetaphyseal dysplasia with joint laxity (leptodactylic type)-like phenotype. Matrix Biol 2013; 32:387-92. [DOI: 10.1016/j.matbio.2013.05.001] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2013] [Revised: 04/30/2013] [Accepted: 05/01/2013] [Indexed: 12/29/2022]
|
35
|
CNVinspector: a web-based tool for the interactive evaluation of copy number variations in single patients and in cohorts. J Med Genet 2013; 50:529-33. [PMID: 23729504 DOI: 10.1136/jmedgenet-2012-101497] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
OBJECTIVES Many genetic disorders are caused by copy number variations (CNVs) in the human genome. However, the large number of benign CNV polymorphisms makes it difficult to delineate causative variants for a certain disease phenotype. Hence, we set out to create software that accumulates and visualises locus-specific knowledge and enables clinicians to study their own CNVs in the context of known polymorphisms and disease variants. METHODS CNV data from healthy cohorts (Database of Genomic Variants) and from disease-related databases (DECIPHER) were integrated into a joint resource. Data are presented in an interactive web-based application that allows inspection, evaluation and filtering of CNVs in single individuals or in entire cohorts. RESULTS CNVinspector provides simple interfaces to upload CNV data, compare them with own or published control data and visualise the results in graphical interfaces. Beyond choosing control data from different public studies, platforms and methods, dedicated filter options allow the detection of CNVs that are either enriched in patients or depleted in controls. Alternatively, a search can be restricted to those CNVs that appear in individuals of similar clinical phenotype. For each gene of interest within a CNV, we provide a link to NCBI, ENSEMBL and the GeneDistiller search engine to browse for potential disease-associated genes. CONCLUSIONS With its user-friendly handling, the integration of control data and the filtering options, CNVinspector will facilitate the daily work of clinical geneticists and accelerate the delineation of new syndromes and gene functions. CNVinspector is freely accessible under http://www.cnvinspector.org.
Collapse
|
36
|
Abstract
Zusammenfassung
Das mitoNET wurde als interdisziplinäres, deutschlandweites Netzwerk mit dem Ziel konzipiert, eine Verbesserung der Patientenversorgung auf dem Gebiet der mitochondrialen Erkrankungen zu erreichen. Das horizontale klinische Netzwerk des mitoNET umfasst 8 neurologische und 13 pädiatrische Kliniken, die für die Patientenrekrutierung, deren Phänotypisierung und die Erfassung des natürlichen Verlaufs im Rahmen von jährlichen Kontrolluntersuchungen zuständig sind. Die Speicherung der erhobenen Daten erfolgt in einer eigens entwickelten webbasierten Registerdatenbank. Das Netzwerk betreibt 2 Biobanken zur Asservierung von DNA, RNA, Plasma sowie von diagnostisch gewonnenen Fibro- und Myoblasten. Vier Forschungsprojekte zielen auf eine Verbesserung des diagnostischen Spektrums durch Etablierung neuer Methoden, und ein Teilprojekt beschäftigt sich mit der Überprüfung neuer Therapiemöglichkeiten. Das mitoNET wird seit Anfang 2009 vom Bundesministerium für Bildung und Forschung (BMBF, Förderkennzeichen 01GM1113A) gefördert.
Collapse
|
37
|
HomozygosityMapper2012--bridging the gap between homozygosity mapping and deep sequencing. Nucleic Acids Res 2012; 40:W516-20. [PMID: 22669902 PMCID: PMC3394249 DOI: 10.1093/nar/gks487] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Homozygosity mapping is a common method to map recessive traits in consanguineous families. To facilitate these analyses, we have developed HomozygosityMapper, a web-based approach to homozygosity mapping. HomozygosityMapper allows researchers to directly upload the genotype files produced by the major genotyping platforms as well as deep sequencing data. It detects stretches of homozygosity shared by the affected individuals and displays them graphically. Users can interactively inspect the underlying genotypes, manually refine these regions and eventually submit them to our candidate gene search engine GeneDistiller to identify the most promising candidate genes. Here, we present the new version of HomozygosityMapper. The most striking new feature is the support of Next Generation Sequencing *.vcf files as input. Upon users' requests, we have implemented the analysis of common experimental rodents as well as of important farm animals. Furthermore, we have extended the options for single families and loss of heterozygosity studies. Another new feature is the export of *.bed files for targeted enrichment of the potential disease regions for deep sequencing strategies. HomozygosityMapper also generates files for conventional linkage analyses which are already restricted to the possible disease regions, hence superseding CPU-intensive genome-wide analyses. HomozygosityMapper is freely available at http://www.homozygositymapper.org/.
Collapse
|
38
|
Systematic comparison of three methods for fragmentation of long-range PCR products for next generation sequencing. PLoS One 2011; 6:e28240. [PMID: 22140562 PMCID: PMC3227650 DOI: 10.1371/journal.pone.0028240] [Citation(s) in RCA: 82] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2011] [Accepted: 11/04/2011] [Indexed: 11/19/2022] Open
Abstract
Next Generation Sequencing (NGS) technologies are gaining importance in the routine clinical diagnostic setting. It is thus desirable to simplify the workflow for high-throughput diagnostics. Fragmentation of DNA is a crucial step for preparation of template libraries and various methods are currently known. Here we evaluated the performance of nebulization, sonication and random enzymatic digestion of long-range PCR products on the results of NGS. All three methods produced high-quality sequencing libraries for the 454 platform. However, if long-range PCR products of different length were pooled equimolarly, sequence coverage drastically dropped for fragments below 3,000 bp. All three methods performed equally well with regard to overall sequence quality (PHRED) and read length. Enzymatic fragmentation showed highest consistency between three library preparations but performed slightly worse than sonication and nebulization with regard to insertions/deletions in the raw sequence reads. After filtering for homopolymer errors, enzymatic fragmentation performed best if compared to the results of classic Sanger sequencing. As the overall performance of all three methods was equal with only minor differences, a fragmentation method can be chosen solely according to lab facilities, feasibility and experimental design.
Collapse
|
39
|
Faulty initiation of proteoglycan synthesis causes cardiac and joint defects. Am J Hum Genet 2011; 89:15-27. [PMID: 21763480 DOI: 10.1016/j.ajhg.2011.05.021] [Citation(s) in RCA: 78] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2011] [Revised: 04/14/2011] [Accepted: 05/16/2011] [Indexed: 02/08/2023] Open
Abstract
Proteoglycans are a major component of extracellular matrix and contribute to normal embryonic and postnatal development by ensuring tissue stability and signaling functions. We studied five patients with recessive joint dislocations and congenital heart defects, including bicuspid aortic valve (BAV) and aortic root dilatation. We identified linkage to chromosome 11 and detected a mutation (c.830G>A, p.Arg277Gln) in B3GAT3, the gene coding for glucuronosyltransferase-I (GlcAT-I). The enzyme catalyzes an initial step in the synthesis of glycosaminoglycan side chains of proteoglycans. Patients' cells as well as recombinant mutant protein showed reduced glucuronyltransferase activity. Patient fibroblasts demonstrated decreased levels of dermatan sulfate, chondroitin sulfate, and heparan sulfate proteoglycans, indicating that the defect in linker synthesis affected all three lines of O-glycanated proteoglycans. Further studies demonstrated that GlcAT-I resides in the cis and cis-medial Golgi apparatus and is expressed in the affected tissues, i.e., heart, aorta, and bone. The study shows that reduced GlcAT-I activity impairs skeletal as well as heart development and results in variable combinations of heart malformations, including mitral valve prolapse, ventricular septal defect, and bicuspid aortic valve. The described family constitutes a syndrome characterized by heart defects and joint dislocations resulting from altered initiation of proteoglycan synthesis (Larsen-like syndrome, B3GAT3 type).
Collapse
|
40
|
Generalized progressive retinal atrophy in the Irish Glen of Imaal Terrier is associated with a deletion in the ADAM9 gene. Mol Cell Probes 2010; 24:357-63. [DOI: 10.1016/j.mcp.2010.07.007] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2010] [Revised: 07/26/2010] [Accepted: 07/26/2010] [Indexed: 01/05/2023]
|
41
|
|
42
|
Individuals with mutations in XPNPEP3, which encodes a mitochondrial protein, develop a nephronophthisis-like nephropathy. J Clin Invest 2010. [DOI: 10.1172/jci40076c1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
|
43
|
Fatal cardiac arrhythmia and long-QT syndrome in a new form of congenital generalized lipodystrophy with muscle rippling (CGL4) due to PTRF-CAVIN mutations. PLoS Genet 2010; 6:e1000874. [PMID: 20300641 PMCID: PMC2837386 DOI: 10.1371/journal.pgen.1000874] [Citation(s) in RCA: 166] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2009] [Accepted: 02/05/2010] [Indexed: 11/19/2022] Open
Abstract
We investigated eight families with a novel subtype of congenital generalized lipodystrophy (CGL4) of whom five members had died from sudden cardiac death during their teenage years. ECG studies revealed features of long-QT syndrome, bradycardia, as well as supraventricular and ventricular tachycardias. Further symptoms comprised myopathy with muscle rippling, skeletal as well as smooth-muscle hypertrophy, leading to impaired gastrointestinal motility and hypertrophic pyloric stenosis in some children. Additionally, we found impaired bone formation with osteopenia, osteoporosis, and atlanto-axial instability. Homozygosity mapping located the gene within 2 Mbp on chromosome 17. Prioritization of 74 candidate genes with GeneDistiller for high expression in muscle and adipocytes suggested PTRF-CAVIN (Polymerase I and transcript release factor/Cavin) as the most probable candidate leading to the detection of homozygous mutations (c.160delG, c.362dupT). PTRF-CAVIN is essential for caveolae biogenesis. These cholesterol-rich plasmalemmal vesicles are involved in signal-transduction and vesicular trafficking and reside primarily on adipocytes, myocytes, and osteoblasts. Absence of PTRF-CAVIN did not influence abundance of its binding partner caveolin-1 and caveolin-3. In patient fibroblasts, however, caveolin-1 failed to localize toward the cell surface and electron microscopy revealed reduction of caveolae to less than 3%. Transfection of full-length PTRF-CAVIN reestablished the presence of caveolae. The loss of caveolae was confirmed by Atomic Force Microscopy (AFM) in combination with fluorescent imaging. PTRF-CAVIN deficiency thus presents the phenotypic spectrum caused by a quintessential lack of functional caveolae. Patients with generalized lipodystrophy have a marked lack of body fat. Several gene defects have been described that impede fat synthesis and maturation of fat cells. Here we report on mutations in a novel gene, called PTRF-CAVIN, causing congenital generalized lipodystrophy type 4 (CGL4) that is additionally associated with muscle disease. Patients' muscles are large but weak and show an involuntary, rolling contraction pattern called “rippling.” Further symptoms comprise life-threatening cardiac arrhythmias and a disorder of bone formation. We searched for shared segments in the genome of seven patients and found the responsible gene, called PTRF-CAVIN, on chromosome 17. This gene is crucial for caveolae (latin for “small caves”) formation. These small indentations of the cell membrane are found on the surface of muscle, bone, fat, and immune cells and facilitate cell-to-cell communication and the absorption of substances from the extracellular space. Patients lack more than 97% of caveolae and artificial insertion of the correct gene into patient skin cells led to the reappearance of caveolae. As cardiac arrhythmia is a severe and potentially life-threatening condition, patients with CGL4 should be closely monitored by ECG and, if necessary, fitted with an implanted pacemaker and cardioverter defibrillator (ICD) device.
Collapse
|
44
|
Individuals with mutations in XPNPEP3, which encodes a mitochondrial protein, develop a nephronophthisis-like nephropathy. J Clin Invest 2010; 120:791-802. [PMID: 20179356 DOI: 10.1172/jci40076] [Citation(s) in RCA: 85] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2009] [Accepted: 01/06/2010] [Indexed: 01/06/2023] Open
Abstract
The autosomal recessive kidney disease nephronophthisis (NPHP) constitutes the most frequent genetic cause of terminal renal failure in the first 3 decades of life. Ten causative genes (NPHP1-NPHP9 and NPHP11), whose products localize to the primary cilia-centrosome complex, support the unifying concept that cystic kidney diseases are "ciliopathies". Using genome-wide homozygosity mapping, we report here what we believe to be a new locus (NPHP-like 1 [NPHPL1]) for an NPHP-like nephropathy. In 2 families with an NPHP-like phenotype, we detected homozygous frameshift and splice-site mutations, respectively, in the X-prolyl aminopeptidase 3 (XPNPEP3) gene. In contrast to all known NPHP proteins, XPNPEP3 localizes to mitochondria of renal cells. However, in vivo analyses also revealed a likely cilia-related function; suppression of zebrafish xpnpep3 phenocopied the developmental phenotypes of ciliopathy morphants, and this effect was rescued by human XPNPEP3 that was devoid of a mitochondrial localization signal. Consistent with a role for XPNPEP3 in ciliary function, several ciliary cystogenic proteins were found to be XPNPEP3 substrates, for which resistance to N-terminal proline cleavage resulted in attenuated protein function in vivo in zebrafish. Our data highlight an emerging link between mitochondria and ciliary dysfunction, and suggest that further understanding the enzymatic activity and substrates of XPNPEP3 will illuminate novel cystogenic pathways.
Collapse
|
45
|
Abstract
Homozygosity mapping is a common method for mapping recessive traits in consanguineous families. In most studies, applications for multipoint linkage analyses are applied to determine the genomic region linked to the disease. Unfortunately, these are neither suited for very large families nor for the inclusion of tens of thousands of SNPs. Even if less than 10,000 markers are employed, such an analysis may easily last hours if not days. Here we present a web-based approach to homozygosity mapping. Our application stores marker data in a database into which users can directly upload their own SNP genotype files. Within a few minutes, the database analyses the data, detects homozygous stretches and provides an intuitive graphical interface to the results. The homozygosity in affected individuals is visualized genome-wide with the ability to zoom into single chromosomes and user-defined chromosomal regions. The software also displays the underlying genotypes in all samples. It is integrated with our candidate gene search engine, GeneDistiller, so that users can interactively determine the most promising gene. They can at any point restrict access to their data or make it public, allowing HomozygosityMapper to be used as a data repository for homozygosity-mapping studies. HomozygosityMapper is available at http://www.homozygositymapper.org/.
Collapse
|
46
|
A systematic approach to mapping recessive disease genes in individuals from outbred populations. PLoS Genet 2009; 5:e1000353. [PMID: 19165332 PMCID: PMC2621355 DOI: 10.1371/journal.pgen.1000353] [Citation(s) in RCA: 132] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2008] [Accepted: 12/22/2008] [Indexed: 12/29/2022] Open
Abstract
The identification of recessive disease-causing genes by homozygosity mapping is often restricted by lack of suitable consanguineous families. To overcome these limitations, we apply homozygosity mapping to single affected individuals from outbred populations. In 72 individuals of 54 kindred ascertained worldwide with known homozygous mutations in 13 different recessive disease genes, we performed total genome homozygosity mapping using 250,000 SNP arrays. Likelihood ratio Z-scores (ZLR) were plotted across the genome to detect ZLR peaks that reflect segments of homozygosity by descent, which may harbor the mutated gene. In 93% of cases, the causative gene was positioned within a consistent ZLR peak of homozygosity. The number of peaks reflected the degree of inbreeding. We demonstrate that disease-causing homozygous mutations can be detected in single cases from outbred populations within a single ZLR peak of homozygosity as short as 2 Mb, containing an average of only 16 candidate genes. As many specialty clinics have access to cohorts of individuals from outbred populations, and as our approach will result in smaller genetic candidate regions, the new strategy of homozygosity mapping in single outbred individuals will strongly accelerate the discovery of novel recessive disease genes. Many childhood diseases are caused by single-gene mutations of recessive genes, in which a child has inherited one mutated gene copy from each parent causing disease in the child, but not in the parents who are healthy heterozygous carriers. As the two mutations represent the disease cause, gene mapping helped understand disease mechanisms. “Homozygosity mapping” has been particularly useful. It assumes that the parents are related and that a disease-causing mutation together with a chromosomal segment of identical markers (i.e., homozygous markers) is transmitted to the affected child through the paternal and the maternal line from an ancestor common to both parents. Homozygosity mapping seeks out those homozygous regions to map the disease-causing gene. Homozygosity mapping requires families, in which the parents are knowingly related, and have multiple affected children. To overcome these limitations, we applied homozygosity mapping to single affected individuals from outbred populations. In 72 individuals with known homozygous mutations in 13 different recessive disease genes, we performed homozygosity mapping. In 93% we detected the causative gene in a segment of homozygosity. We demonstrate that disease-causing homozygous mutations can be detected in single cases from outbred populations. This will strongly accelerate the discovery of novel recessive disease genes.
Collapse
|
47
|
The Human Phenotype Ontology: a tool for annotating and analyzing human hereditary disease. Am J Hum Genet 2008; 83:610-5. [PMID: 18950739 DOI: 10.1016/j.ajhg.2008.09.017] [Citation(s) in RCA: 614] [Impact Index Per Article: 38.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2008] [Revised: 09/24/2008] [Accepted: 09/30/2008] [Indexed: 10/21/2022] Open
Abstract
There are many thousands of hereditary diseases in humans, each of which has a specific combination of phenotypic features, but computational analysis of phenotypic data has been hampered by lack of adequate computational data structures. Therefore, we have developed a Human Phenotype Ontology (HPO) with over 8000 terms representing individual phenotypic anomalies and have annotated all clinical entries in Online Mendelian Inheritance in Man with the terms of the HPO. We show that the HPO is able to capture phenotypic similarities between diseases in a useful and highly significant fashion.
Collapse
|
48
|
Acetylcholine receptor pathway mutations explain various fetal akinesia deformation sequence disorders. Am J Hum Genet 2008; 82:464-76. [PMID: 18252226 DOI: 10.1016/j.ajhg.2007.11.006] [Citation(s) in RCA: 83] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2007] [Revised: 11/21/2007] [Accepted: 11/21/2007] [Indexed: 01/21/2023] Open
Abstract
Impaired fetal movement causes malformations, summarized as fetal akinesia deformation sequence (FADS), and is triggered by environmental and genetic factors. Acetylcholine receptor (AChR) components are suspects because mutations in the fetally expressed gamma subunit (CHRNG) of AChR were found in two FADS disorders, lethal multiple pterygium syndrome (LMPS) and Escobar syndrome. Other AChR subunits alpha1, beta1, and delta (CHRNA1, CHRNB1, CHRND) as well as receptor-associated protein of the synapse (RAPSN) previously revealed missense or compound nonsense-missense mutations in viable congenital myasthenic syndrome; lethality of homozygous null mutations was predicted but never shown. We provide the first report to our knowledge of homozygous nonsense mutations in CHRNA1 and CHRND and show that they were lethal, whereas novel recessive missense mutations in RAPSN caused a severe but not necessarily lethal phenotype. To elucidate disease-associated malformations such as frequent abortions, fetal edema, cystic hygroma, or cardiac defects, we studied Chrna1, Chrnb1, Chrnd, Chrng, and Rapsn in mouse embryos and found expression in skeletal muscles but also in early somite development. This indicates that early developmental defects might be due to somite expression in addition to solely muscle-specific effects. We conclude that complete or severe functional disruption of fetal AChR causes lethal multiple pterygium syndrome whereas milder alterations result in fetal hypokinesia with inborn contractures or a myasthenic syndrome later in life.
Collapse
|
49
|
AssociationDB: web-based exploration of genomic association. Bioinformatics 2007; 23:2643-4. [PMID: 17660207 DOI: 10.1093/bioinformatics/btm376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Genome-wide association studies use hundreds of thousands of markers making it challenging to present and finally interpret the results. We developed a graphical, web-based solution for an interactive exploration of the results of case-control studies, with a tight integration of related gene information and tissue-specific expression data. Association results are presented as physical position-based vertical bars with known genes included as horizontal bars at their respective physical positions. The interface allows the specification of filtering criteria for the association data and highlights potentially interesting genes with user-specified terms occurring in their reports or with relevant expression patterns. Pop-up windows and hyperlinks provide drill-down capabilities and quick access to relevant data AssociationDB can either be used as a stand-alone solution or as a front-end joining association results obtained by other software with genomic information.
Collapse
|
50
|
Loss of GLIS2 causes nephronophthisis in humans and mice by increased apoptosis and fibrosis. Nat Genet 2007; 39:1018-24. [PMID: 17618285 DOI: 10.1038/ng2072] [Citation(s) in RCA: 183] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2006] [Accepted: 05/17/2007] [Indexed: 01/16/2023]
Abstract
Nephronophthisis (NPHP), an autosomal recessive kidney disease, is the most frequent genetic cause of end-stage renal failure in the first three decades of life. Positional cloning of the six known NPHP genes has linked its pathogenesis to primary cilia function. Here we identify mutation of GLIS2 as causing an NPHP-like phenotype in humans and mice, using positional cloning and mouse transgenics, respectively. Kidneys of Glis2 mutant mice show severe renal atrophy and fibrosis starting at 8 weeks of age. Differential gene expression studies on Glis2 mutant kidneys demonstrate that genes promoting epithelial-to-mesenchymal transition and fibrosis are upregulated in the absence of Glis2. Thus, we identify Glis2 as a transcription factor mutated in NPHP and demonstrate its essential role for the maintenance of renal tissue architecture through prevention of apoptosis and fibrosis.
Collapse
|