1
|
Andhika NS, Biswas S, Hardcastle C, Green DJ, Ramsden SC, Birney E, Black GC, Sergouniotis PI. Using computational approaches to enhance the interpretation of missense variants in the PAX6 gene. Eur J Hum Genet 2024; 32:1005-1013. [PMID: 38849599 PMCID: PMC11292026 DOI: 10.1038/s41431-024-01638-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Revised: 04/12/2024] [Accepted: 05/14/2024] [Indexed: 06/09/2024] Open
Abstract
The PAX6 gene encodes a highly-conserved transcription factor involved in eye development. Heterozygous loss-of-function variants in PAX6 can cause a range of ophthalmic disorders including aniridia. A key molecular diagnostic challenge is that many PAX6 missense changes are presently classified as variants of uncertain significance. While computational tools can be used to assess the effect of genetic alterations, the accuracy of their predictions varies. Here, we evaluated and optimised the performance of computational prediction tools in relation to PAX6 missense variants. Through inspection of publicly available resources (including HGMD, ClinVar, LOVD and gnomAD), we identified 241 PAX6 missense variants that were used for model training and evaluation. The performance of ten commonly used computational tools was assessed and a threshold optimization approach was utilized to determine optimal cut-off values. Validation studies were subsequently undertaken using PAX6 variants from a local database. AlphaMissense, SIFT4G and REVEL emerged as the best-performing predictors; the optimized thresholds of these tools were 0.967, 0.025, and 0.772, respectively. Combining the prediction from these top-three tools resulted in lower performance compared to using AlphaMissense alone. Tailoring the use of computational tools by employing optimized thresholds specific to PAX6 can enhance algorithmic performance. Our findings have implications for PAX6 variant interpretation in clinical settings.
Collapse
Affiliation(s)
- Nadya S Andhika
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Susmito Biswas
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
- Manchester Royal Eye Hospital, Manchester University NHS Foundation Trust, Manchester, UK
| | - Claire Hardcastle
- Manchester Centre for Genomic Medicine, Saint Mary's Hospital, Manchester University NHS Foundation Trust, Manchester, UK
| | - David J Green
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
| | - Simon C Ramsden
- Manchester Centre for Genomic Medicine, Saint Mary's Hospital, Manchester University NHS Foundation Trust, Manchester, UK
| | - Ewan Birney
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge, UK
| | - Graeme C Black
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
- Manchester Centre for Genomic Medicine, Saint Mary's Hospital, Manchester University NHS Foundation Trust, Manchester, UK
| | - Panagiotis I Sergouniotis
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK.
- Manchester Royal Eye Hospital, Manchester University NHS Foundation Trust, Manchester, UK.
- Manchester Centre for Genomic Medicine, Saint Mary's Hospital, Manchester University NHS Foundation Trust, Manchester, UK.
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Cambridge, UK.
| |
Collapse
|
2
|
Stephanou C, Petrou M, Kountouris P, Makariou C, Christou S, Hadjigavriel M, Kleanthous M, Papasavva T. Unravelling the Complexity of the +33 C>G [HBB:c.-18C>G] Variant in Beta Thalassemia. Biomedicines 2024; 12:296. [PMID: 38397898 PMCID: PMC10886608 DOI: 10.3390/biomedicines12020296] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 01/12/2024] [Accepted: 01/18/2024] [Indexed: 02/25/2024] Open
Abstract
The +33 C>G variant [NM_000518.5(HBB):c.-18C>G] in the 5' untranslated region (UTR) of the β-globin gene is described in the literature as both mild and silent, while it causes a phenotype of thalassemia intermedia in the presence of a severe β-thalassemia allele. Despite its potential clinical significance, the determination of its pathogenicity according to established standards requires a greater number of published cases and co-segregation evidence than what is currently available. The present study provides an extensive phenotypic characterization of +33 C>G using 26 heterozygous and 11 compound heterozygous novel cases detected in Cyprus and employs computational predictors (CADD, RegulomeDB) to better understand its impact on clinical severity. Genotype identification of globin gene variants, including α- and δ-thalassemia determinants, and rs7482144 (XmnI) was carried out using Sanger sequencing, gap-PCR, and restriction enzyme digestion methods. The heterozygous state of +33 C>G had a silent phenotype without apparent microcytosis or hypochromia, while compound heterozygosity with a β+ or β0 allele had a spectrum of clinical phenotypes. Awareness of the +33 C>G is required across Mediterranean populations where β-thalassemia is frequent, particularly in Cyprus, with significant relevance in population screening and fetal diagnostic applications.
Collapse
Affiliation(s)
- Coralea Stephanou
- Molecular Genetics Thalassemia Department, The Cyprus Institute of Neurology and Genetics, Nicosia 2371, Cyprus
| | - Miranda Petrou
- Molecular Genetics Thalassemia Department, The Cyprus Institute of Neurology and Genetics, Nicosia 2371, Cyprus
| | - Petros Kountouris
- Molecular Genetics Thalassemia Department, The Cyprus Institute of Neurology and Genetics, Nicosia 2371, Cyprus
| | - Christiana Makariou
- Thalassemia Clinic Nicosia, Archbishop Makarios III Hospital, Nicosia 2012, Cyprus
| | - Soteroula Christou
- Thalassemia Clinic Nicosia, Archbishop Makarios III Hospital, Nicosia 2012, Cyprus
| | | | - Marina Kleanthous
- Molecular Genetics Thalassemia Department, The Cyprus Institute of Neurology and Genetics, Nicosia 2371, Cyprus
| | - Thessalia Papasavva
- Molecular Genetics Thalassemia Department, The Cyprus Institute of Neurology and Genetics, Nicosia 2371, Cyprus
| |
Collapse
|
3
|
Walker LC, Hoya MDL, Wiggins GAR, Lindy A, Vincent LM, Parsons MT, Canson DM, Bis-Brewer D, Cass A, Tchourbanov A, Zimmermann H, Byrne AB, Pesaran T, Karam R, Harrison SM, Spurdle AB. Using the ACMG/AMP framework to capture evidence related to predicted and observed impact on splicing: Recommendations from the ClinGen SVI Splicing Subgroup. Am J Hum Genet 2023; 110:1046-1067. [PMID: 37352859 PMCID: PMC10357475 DOI: 10.1016/j.ajhg.2023.06.002] [Citation(s) in RCA: 73] [Impact Index Per Article: 36.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Revised: 06/01/2023] [Accepted: 06/02/2023] [Indexed: 06/25/2023] Open
Abstract
The American College of Medical Genetics and Genomics (ACMG)/Association for Molecular Pathology (AMP) framework for classifying variants uses six evidence categories related to the splicing potential of variants: PVS1, PS3, PP3, BS3, BP4, and BP7. However, the lack of guidance on how to apply such codes has contributed to variation in the specifications developed by different Clinical Genome Resource (ClinGen) Variant Curation Expert Panels. The ClinGen Sequence Variant Interpretation Splicing Subgroup was established to refine recommendations for applying ACMG/AMP codes relating to splicing data and computational predictions. We utilized empirically derived splicing evidence to (1) determine the evidence weighting of splicing-related data and appropriate criteria code selection for general use, (2) outline a process for integrating splicing-related considerations when developing a gene-specific PVS1 decision tree, and (3) exemplify methodology to calibrate splice prediction tools. We propose repurposing the PVS1_Strength code to capture splicing assay data that provide experimental evidence for variants resulting in RNA transcript(s) with loss of function. Conversely, BP7 may be used to capture RNA results demonstrating no splicing impact for intronic and synonymous variants. We propose that the PS3/BS3 codes are applied only for well-established assays that measure functional impact not directly captured by RNA-splicing assays. We recommend the application of PS1 based on similarity of predicted RNA-splicing effects for a variant under assessment in comparison with a known pathogenic variant. The recommendations and approaches for consideration and evaluation of RNA-assay evidence described aim to help standardize variant pathogenicity classification processes when interpreting splicing-based evidence.
Collapse
Affiliation(s)
- Logan C Walker
- Department of Pathology and Biomedical Science, University of Otago, Christchurch, New Zealand
| | - Miguel de la Hoya
- Molecular Oncology Laboratory, CIBERONC, Hospital Clinico San Carlos, IdISSC (Instituto de Investigación Sanitaria del Hospital Clínico San Carlos), Madrid, Spain
| | - George A R Wiggins
- Department of Pathology and Biomedical Science, University of Otago, Christchurch, New Zealand
| | | | | | - Michael T Parsons
- Population Health Program, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
| | - Daffodil M Canson
- Population Health Program, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia
| | | | | | | | | | - Alicia B Byrne
- Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | | | - Steven M Harrison
- Ambry Genetics, Aliso Viejo, CA, USA; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| | - Amanda B Spurdle
- Population Health Program, QIMR Berghofer Medical Research Institute, Brisbane, QLD, Australia; Faculty of Medicine, The University of Queensland, Brisbane, QLD, Australia
| |
Collapse
|
4
|
Xenophontos M, Minaidou A, Stephanou C, Tamana S, Kleanthous M, Kountouris P. IthaPhen: An Interactive Database of Genotype-Phenotype Data for Hemoglobinopathies. Hemasphere 2023; 7:e922. [PMID: 37359188 PMCID: PMC10289560 DOI: 10.1097/hs9.0000000000000922] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2022] [Accepted: 05/31/2023] [Indexed: 06/28/2023] Open
Affiliation(s)
- Maria Xenophontos
- Molecular Genetics Thalassaemia Department, The Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Anna Minaidou
- Molecular Genetics Thalassaemia Department, The Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Coralea Stephanou
- Molecular Genetics Thalassaemia Department, The Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Stella Tamana
- Molecular Genetics Thalassaemia Department, The Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Marina Kleanthous
- Molecular Genetics Thalassaemia Department, The Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| | - Petros Kountouris
- Molecular Genetics Thalassaemia Department, The Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus
| |
Collapse
|
5
|
Walker LC, de la Hoya M, Wiggins GA, Lindy A, Vincent LM, Parsons M, Canson DM, Bis-Brewer D, Cass A, Tchourbanov A, Zimmermann H, Byrne AB, Pesaran T, Karam R, Harrison SM, Spurdle AB. APPLICATION OF THE ACMG/AMP FRAMEWORK TO CAPTURE EVIDENCE RELEVANT TO PREDICTED AND OBSERVED IMPACT ON SPLICING: RECOMMENDATIONS FROM THE CLINGEN SVI SPLICING SUBGROUP. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.02.24.23286431. [PMID: 36865205 PMCID: PMC9980257 DOI: 10.1101/2023.02.24.23286431] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/01/2023]
Abstract
The American College of Medical Genetics and Genomics (ACMG) and the Association for Molecular Pathology (AMP) framework for classifying variants uses six evidence categories related to the splicing potential of variants: PVS1 (null variant in a gene where loss-of-function is the mechanism of disease), PS3 (functional assays show damaging effect on splicing), PP3 (computational evidence supports a splicing effect), BS3 (functional assays show no damaging effect on splicing), BP4 (computational evidence suggests no splicing impact), and BP7 (silent change with no predicted impact on splicing). However, the lack of guidance on how to apply such codes has contributed to variation in the specifications developed by different Clinical Genome Resource (ClinGen) Variant Curation Expert Panels. The ClinGen Sequence Variant Interpretation (SVI) Splicing Subgroup was established to refine recommendations for applying ACMG/AMP codes relating to splicing data and computational predictions. Our study utilised empirically derived splicing evidence to: 1) determine the evidence weighting of splicing-related data and appropriate criteria code selection for general use, 2) outline a process for integrating splicing-related considerations when developing a gene-specific PVS1 decision tree, and 3) exemplify methodology to calibrate bioinformatic splice prediction tools. We propose repurposing of the PVS1_Strength code to capture splicing assay data that provide experimental evidence for variants resulting in RNA transcript(s) with loss of function. Conversely BP7 may be used to capture RNA results demonstrating no impact on splicing for both intronic and synonymous variants, and for missense variants if protein functional impact has been excluded. Furthermore, we propose that the PS3 and BS3 codes are applied only for well-established assays that measure functional impact that is not directly captured by RNA splicing assays. We recommend the application of PS1 based on similarity of predicted RNA splicing effects for a variant under assessment in comparison to a known Pathogenic variant. The recommendations and approaches for consideration and evaluation of RNA assay evidence described aim to help standardise variant pathogenicity classification processes and result in greater consistency when interpreting splicing-based evidence.
Collapse
|
6
|
Tamana S, Xenophontos M, Minaidou A, Stephanou C, Harteveld CL, Bento C, Traeger-Synodinos J, Fylaktou I, Yasin NM, Abdul Hamid FS, Esa E, Halim-Fikri H, Zilfalil BA, Kakouri AC, Kleanthous M, Kountouris P. Evaluation of in silico predictors on short nucleotide variants in HBA1, HBA2, and HBB associated with haemoglobinopathies. eLife 2022; 11:79713. [PMID: 36453528 PMCID: PMC9731569 DOI: 10.7554/elife.79713] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2022] [Accepted: 10/31/2022] [Indexed: 12/03/2022] Open
Abstract
Haemoglobinopathies are the commonest monogenic diseases worldwide and are caused by variants in the globin gene clusters. With over 2400 variants detected to date, their interpretation using the American College of Medical Genetics and Genomics (ACMG)/Association for Molecular Pathology (AMP) guidelines is challenging and computational evidence can provide valuable input about their functional annotation. While many in silico predictors have already been developed, their performance varies for different genes and diseases. In this study, we evaluate 31 in silico predictors using a dataset of 1627 variants in HBA1, HBA2, and HBB. By varying the decision threshold for each tool, we analyse their performance (a) as binary classifiers of pathogenicity and (b) by using different non-overlapping pathogenic and benign thresholds for their optimal use in the ACMG/AMP framework. Our results show that CADD, Eigen-PC, and REVEL are the overall top performers, with the former reaching moderate strength level for pathogenic prediction. Eigen-PC and REVEL achieve the highest accuracies for missense variants, while CADD is also a reliable predictor of non-missense variants. Moreover, SpliceAI is the top performing splicing predictor, reaching strong level of evidence, while GERP++ and phyloP are the most accurate conservation tools. This study provides evidence about the optimal use of computational tools in globin gene clusters under the ACMG/AMP framework.
Collapse
Affiliation(s)
- Stella Tamana
- Molecular Genetics Thalassaemia Department, The Cyprus Institute of Neurology and GeneticsNicosiaCyprus
| | - Maria Xenophontos
- Molecular Genetics Thalassaemia Department, The Cyprus Institute of Neurology and GeneticsNicosiaCyprus
| | - Anna Minaidou
- Molecular Genetics Thalassaemia Department, The Cyprus Institute of Neurology and GeneticsNicosiaCyprus
| | - Coralea Stephanou
- Molecular Genetics Thalassaemia Department, The Cyprus Institute of Neurology and GeneticsNicosiaCyprus
| | - Cornelis L Harteveld
- Molecular Genetics Thalassaemia Department, The Cyprus Institute of Neurology and GeneticsNicosiaCyprus,Leiden University Medical CenterLeidenNetherlands
| | - Celeste Bento
- Centro Hospitalar e Universitário de CoimbraCoimbraPortugal
| | | | - Irene Fylaktou
- Division of Endocrinology, Metabolism and Diabetes, First Department of Pediatrics, National and Kapodistrian University of AthensAthensGreece
| | - Norafiza Mohd Yasin
- Haematology Unit, Cancer Research Centre, Institute for Medical Research, National Health of Institutes (NIH), Ministry of Health MalaysiaSelangorMalaysia
| | - Faidatul Syazlin Abdul Hamid
- Haematology Unit, Cancer Research Centre, Institute for Medical Research, National Health of Institutes (NIH), Ministry of Health MalaysiaSelangorMalaysia
| | - Ezalia Esa
- Haematology Unit, Cancer Research Centre, Institute for Medical Research, National Health of Institutes (NIH), Ministry of Health MalaysiaSelangorMalaysia
| | - Hashim Halim-Fikri
- Malaysian Node of the Human Variome Project, School of Medical Sciences, Health Campus, Universiti Sains MalaysiaKelantanMalaysia
| | - Bin Alwi Zilfalil
- Human Genome Centre, School of Medical Sciences, Health Campus, Universiti Sains MalaysiaKelantanMalaysia
| | - Andrea C Kakouri
- Molecular Genetics Thalassaemia Department, The Cyprus Institute of Neurology and GeneticsNicosiaCyprus
| | | | - Marina Kleanthous
- Molecular Genetics Thalassaemia Department, The Cyprus Institute of Neurology and GeneticsNicosiaCyprus
| | - Petros Kountouris
- Molecular Genetics Thalassaemia Department, The Cyprus Institute of Neurology and GeneticsNicosiaCyprus
| |
Collapse
|