1
|
Kamal MM, Islam MN, Rabby MG, Zahid MA, Hasan MM. In Silico Functional and Structural Analysis of Non-synonymous Single Nucleotide Polymorphisms (nsSNPs) in Human Paired Box 4 Gene. Biochem Genet 2024; 62:2975-2998. [PMID: 38062275 DOI: 10.1007/s10528-023-10589-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2023] [Accepted: 11/06/2023] [Indexed: 07/31/2024]
Abstract
In human genome, members of Paired box (PAX) transcription factor family are highly sequence-specific DNA-binding proteins. Among PAX gene family members, PAX4 gene has significant role in growth, proliferation, differentiation, and insulin secretion of pancreatic β-cells. Single nucleotide polymorphisms (SNPs) in PAX4 gene progress in the pathogenesis of various human diseases. Hence, the molecular mechanism of how these SNPs in PAX4 gene significantly progress diseases pathogenesis needs to be elucidated. For the reason, a series of bioinformatic analyzes were done to identify the SNPs of PAX4 gene that contribute in diseases pathogenesis. From the analyzes, 4145 SNPs (rsIDs) in PAX4 gene were obtained, where, 362 missense (8.73%), 169 synonymous (4.08%), and 2323 intron variants (56.04%). The rest SNPs were unspecified. Among the 362 missense variants, 118 nsSNPs were found as deleterious in SIFT analysis. Among those, 25 nsSNPs were most probably damaging and 23 were deleterious as observed in PolyPhen-2 and PROVEAN analyzes, respectively. Following all analyzes, 14 nsSNPs (rs149708455, rs115887120, rs147279315, rs35155575, rs370095957, rs373939873, rs145468905, rs121917718, rs2233580, rs3824004, rs372751660, rs369459316, rs375472849, rs372497946) were common and observed as deleterious, probably damaging, affective and diseases associated. Following structural analyzes, 11 nsSNPs guided proteins were found as most unstable and highly conserved. Among these, R20W, R39Q, R45Q, R60H, G65D, and A223D mutated proteins were highly harmful. Hence, the results from above-mentioned integrated comprehensive bioinformatic analyzes guide how different nsSNPs in PAX4 gene alter structural and functional characteristics of the protein that might progress diseases pathogenesis in human including type 2 diabetes.
Collapse
Affiliation(s)
- Md Mostafa Kamal
- Department of Nutrition and Food Technology, Jashore University of Science and Technology, Jashore, Bangladesh
| | - Md Numan Islam
- Department of Nutrition and Food Technology, Jashore University of Science and Technology, Jashore, Bangladesh
- Department of Food Engineering, North Pacific International University of Bangladesh, Dhaka, Bangladesh
| | - Md Golam Rabby
- Department of Nutrition and Food Technology, Jashore University of Science and Technology, Jashore, Bangladesh
| | - Md Ashrafuzzaman Zahid
- Department of Nutrition and Food Technology, Jashore University of Science and Technology, Jashore, Bangladesh
| | - Md Mahmudul Hasan
- Department of Nutrition and Food Technology, Jashore University of Science and Technology, Jashore, Bangladesh.
| |
Collapse
|
2
|
Dereli O, Kuru N, Akkoyun E, Bircan A, Tastan O, Adebali O. PHACTboost: A Phylogeny-Aware Pathogenicity Predictor for Missense Mutations via Boosting. Mol Biol Evol 2024; 41:msae136. [PMID: 38934805 PMCID: PMC11251492 DOI: 10.1093/molbev/msae136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Revised: 05/30/2024] [Accepted: 06/24/2024] [Indexed: 06/28/2024] Open
Abstract
Most algorithms that are used to predict the effects of variants rely on evolutionary conservation. However, a majority of such techniques compute evolutionary conservation by solely using the alignment of multiple sequences while overlooking the evolutionary context of substitution events. We had introduced PHACT, a scoring-based pathogenicity predictor for missense mutations that can leverage phylogenetic trees, in our previous study. By building on this foundation, we now propose PHACTboost, a gradient boosting tree-based classifier that combines PHACT scores with information from multiple sequence alignments, phylogenetic trees, and ancestral reconstruction. By learning from data, PHACTboost outperforms PHACT. Furthermore, the results of comprehensive experiments on carefully constructed sets of variants demonstrated that PHACTboost can outperform 40 prevalent pathogenicity predictors reported in the dbNSFP, including conventional tools, metapredictors, and deep learning-based approaches as well as more recent tools such as AlphaMissense, EVE, and CPT-1. The superiority of PHACTboost over these methods was particularly evident in case of hard variants for which different pathogenicity predictors offered conflicting results. We provide predictions of 215 million amino acid alterations over 20,191 proteins. PHACTboost is available at https://github.com/CompGenomeLab/PHACTboost. PHACTboost can improve our understanding of genetic diseases and facilitate more accurate diagnoses.
Collapse
Affiliation(s)
- Onur Dereli
- Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul 34956, Turkey
| | - Nurdan Kuru
- Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul 34956, Turkey
| | - Emrah Akkoyun
- Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul 34956, Turkey
- Network Technologies Department, TÜBİTAK-ULAKBİM Turkish Academic Network and Information Center, Ankara 06530, Turkey
| | - Aylin Bircan
- Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul 34956, Turkey
| | - Oznur Tastan
- Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul 34956, Turkey
| | - Ogün Adebali
- Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul 34956, Turkey
- Biological Sciences, TÜBİTAK Research Institute for Fundamental Sciences, Gebze 41470, Turkey
| |
Collapse
|
3
|
Kamal MM, Teeya ST, Rahman MM, Talukder MEK, Sarmin S, Wani TA, Hasan MM. Prediction and assessment of deleterious and disease causing nonsynonymous single nucleotide polymorphisms (nsSNPs) in human FOXP4 gene: An in - silico study. Heliyon 2024; 10:e32791. [PMID: 38994097 PMCID: PMC11237951 DOI: 10.1016/j.heliyon.2024.e32791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Accepted: 06/10/2024] [Indexed: 07/13/2024] Open
Abstract
In humans, FOXP gene family is involved in embryonic development and cancer progression. The FOXP4 (Forkhead box protein P4) gene belongs to this FOXP gene family. FOXP4 gene plays a crucial role in oncogenesis. Single nucleotide polymorphisms are biological markers and common determinants of human diseases. Mutations can largely affect the function of the corresponding protein. Therefore, the molecular mechanism of nsSNPs in the FOXP4 gene needs to be elucidated. Initially, the SNPs of the FOXP4 gene were extracted from the dbSNP database and a total of 23124 SNPs was found, where 555 nonsynonymous, 20525 intronic, 1114 noncoding transcript, 334 synonymous were obtained and the rest were unspecified. Then, a series of bioinformatics tools (SIFT, PolyPhen2, SNAP2, PhD SNP, PANTHER, I-Mutant2.0, MUpro, GOR IV, ConSurf, NetSurfP 2.0, HOPE, DynaMut2, GeneMANIA, STRING and Schrodinger) were used to explore the effect of nsSNPs on FOXP4 protein function and structural stability. First, 555 nsSNPs were analyzed using SIFT, of which 57 were found as deleterious. Following, PolyPhen2, SNAP2, PhD SNP and PANTHER analyses, 10 nsSNPs (rs372762294, rs141899153, rs142575732, rs376938850, rs367607523, rs112517943, rs140387832, rs373949416, rs373949416 and rs376160648) were common and observed as deleterious, damaging and diseases associated. Following that, using I-Mutant2.0 and MUpro servers, 7 nsSNPs were found to be the most unstable. GOR IV predicted that these seven nsSNPs affect protein structure by altering the protein contents of alpha helixes, extended strands, and random coils. Following DynaMut2, 5 nsSNPs showed a decrease in the ΔΔG value compared with the wild-type and were found to be responsible for destabilizing the corresponding protein. GeneMANIA and STRING network revealed interaction of FOXP4 with other genes. Finally, molecular dynamics simulation analysis revealed consistent fluctuation in RMSD and RMSF values, Rg and hydrogen bonds in the mutant proteins compared with WT, which might alter the functional and structural stability of the corresponding protein. As a result, the aforementioned integrated comprehensive bioinformatic analyses provide insight into how various nsSNPs of the FOXP4 gene change the structural and functional properties of the corresponding protein, potentially proceeding with the pathophysiology of human diseases.
Collapse
Affiliation(s)
- Md Mostafa Kamal
- Department of Nutrition and Food Technology, Jashore University of Science and Technology, Jashore, 7408, Bangladesh
- Laboratory of Computational Biology, Biological Solution Centre, Jashore, 7408, Bangladesh
| | - Shamiha Tabassum Teeya
- Department of Nutrition and Food Technology, Jashore University of Science and Technology, Jashore, 7408, Bangladesh
| | - Md Mahfuzur Rahman
- Department of Genetic Engineering & Biotechnology, Bangabandhu Sheikh Mujibur Rahman Maritime University, Dhaka, 1216, Bangladesh
| | - Md Enamul Kabir Talukder
- Department of Genetic Engineering & Biotechnology, Jashore University of Science and Technology, Jashore, 7408, Bangladesh
- Laboratory of Computational Biology, Biological Solution Centre, Jashore, 7408, Bangladesh
| | - Sonia Sarmin
- BIRTAN-Bangladesh Institute of Research and Training on Applied Nutrition, Jhenaidah, 7300, Bangladesh
| | - Tanveer A Wani
- Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, Riyadh, 11451, Saudi Arabia
| | - Md Mahmudul Hasan
- Department of Nutrition and Food Technology, Jashore University of Science and Technology, Jashore, 7408, Bangladesh
| |
Collapse
|
4
|
Iyer SR, Nusser K, Jones K, Shinde P, Keddy C, Beach CZ, Aguero E, Force J, Shinde U, Davare MA. Discovery of oncogenic ROS1 missense mutations with sensitivity to tyrosine kinase inhibitors. EMBO Mol Med 2023; 15:e17367. [PMID: 37587872 PMCID: PMC10565643 DOI: 10.15252/emmm.202217367] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2022] [Revised: 07/24/2023] [Accepted: 07/26/2023] [Indexed: 08/18/2023] Open
Abstract
ROS1 is the largest receptor tyrosine kinase in the human genome. Rearrangements of the ROS1 gene result in oncogenic ROS1 kinase fusion proteins that are currently the only validated biomarkers for targeted therapy with ROS1 TKIs in patients. While numerous somatic missense mutations in ROS1 exist in the cancer genome, their impact on catalytic activity and pathogenic potential is unknown. We interrogated the AACR Genie database and identified 34 missense mutations in the ROS1 tyrosine kinase domain for further analysis. Our experiments revealed that these mutations have varying effects on ROS1 kinase function, ranging from complete loss to significantly increased catalytic activity. Notably, Asn and Gly substitutions at Asp2113 in the ROS1 kinase domain were found to be TKI-sensitive oncogenic variants in cell-based model systems. In vivo experiments showed that ROS1 D2113N induced tumor formation that was sensitive to crizotinib and lorlatinib, FDA-approved ROS1-TKIs. Collectively, these findings highlight the tumorigenic potential of specific point mutations within the ROS1 kinase domain and their potential as therapeutic targets with FDA-approved ROS1-TKIs.
Collapse
Affiliation(s)
- Sudarshan R Iyer
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Papé Family Pediatric Research InstituteOregon Health and Sciences UniversityORPortlandUSA
| | - Kevin Nusser
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Papé Family Pediatric Research InstituteOregon Health and Sciences UniversityORPortlandUSA
| | - Kristen Jones
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Papé Family Pediatric Research InstituteOregon Health and Sciences UniversityORPortlandUSA
| | - Pushkar Shinde
- Department of Chemical PhysiologyOregon Health and Sciences UniversityORPortlandUSA
| | - Clare Keddy
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Papé Family Pediatric Research InstituteOregon Health and Sciences UniversityORPortlandUSA
| | - Catherine Z Beach
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Papé Family Pediatric Research InstituteOregon Health and Sciences UniversityORPortlandUSA
| | - Erin Aguero
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Papé Family Pediatric Research InstituteOregon Health and Sciences UniversityORPortlandUSA
| | - Jeremy Force
- Department of Medicine, Division of Medical Oncology, Duke Cancer InstituteDuke UniversityNCDurhamUSA
| | - Ujwal Shinde
- Department of Chemical PhysiologyOregon Health and Sciences UniversityORPortlandUSA
| | - Monika A Davare
- Division of Pediatric Hematology/Oncology, Department of Pediatrics, Papé Family Pediatric Research InstituteOregon Health and Sciences UniversityORPortlandUSA
| |
Collapse
|
5
|
Bouatrous E, Nouira S, Menif S, Ouragini H. Identification of High-Risk Single Nucleotide Polymorphisms in the Human CYB5R3 Gene Responsible for Recessive Congenital Methemoglobinemia: A Computational Approach. Mol Syndromol 2023; 14:375-393. [PMID: 37901856 PMCID: PMC10601824 DOI: 10.1159/000530173] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 03/10/2023] [Indexed: 10/31/2023] Open
Abstract
Introduction NADH-cytochrome b5 reductase deficiency due to pathogenic variants in the CYB5R3 gene causes recessive congenital methemoglobinemia (RCM) type I or type II. In type I, cyanosis from birth is the only major symptom, and the enzyme deficiency is restricted only to erythrocytes. Whereas in type II, cyanosis is associated with severe neurological manifestations, and the enzyme deficiency is generalized to all tissues. Methods In this study, several computational methods (SIFT, Polyphen-2, PROVEAN, Mutation Assessor, Panther, Phd-SNP, SNPs&GO, SNAP2, Align, GVGD, MutPred2, I-Mutant 2.0, MUpro, Duet, ConSurf and Netsurf-2.0 tools) were used to find the most deleterious nsSNPs in the CYB5R3 gene. Furthermore, structural analysis by Swiss-PDB viewer, protein-ligand docking using FTSite, and protein-protein interaction using STRING were carried out to evaluate the impact of these nsSNPs on the protein structure and function. Results Our in silico analysis suggested that out of 339 nsSNPs of the CYB5R3 gene, 17 (L47H, L47P, R61P, L73R G76D, G76C, P96H, G104C, S128P, G144D, P145S, L149P, Y151H, M177T, I178T, I216N, and G251V), are the most deleterious. Among them, two (P96H and S128P) were reported to be associated with the severe form RCM type II, six are related to RCM type I (G104C, G144D, P145S, L149P, M177T, and I178T), and the remaining nine high-risk nsSNPs have not yet been reported in RCM patients. Discussion This study highlighted the potential pathogenic nsSNPs of the CYB5R3 gene. To comprehend how these most harmful nsSNPs contribute to disease, it is crucial to experimentally validate their functional effects.
Collapse
Affiliation(s)
- Emna Bouatrous
- LR16IPT07, Laboratory of Molecular and Cellular Hematology, Pasteur Institute of Tunis, University of Tunis El Manar, Tunis, Tunisia
- Faculty of Sciences, University of Tunis El Manar, Tunis, Tunisia
| | - Sonia Nouira
- LR16IPT07, Laboratory of Molecular and Cellular Hematology, Pasteur Institute of Tunis, University of Tunis El Manar, Tunis, Tunisia
- Molecular Biology Cell and Biotechnology Department, Higher Institute of Biotechnology of Monastir, University of Monastir, Monastir, Tunisia
| | - Samia Menif
- LR16IPT07, Laboratory of Molecular and Cellular Hematology, Pasteur Institute of Tunis, University of Tunis El Manar, Tunis, Tunisia
| | - Houyem Ouragini
- LR16IPT07, Laboratory of Molecular and Cellular Hematology, Pasteur Institute of Tunis, University of Tunis El Manar, Tunis, Tunisia
| |
Collapse
|
6
|
Ahmad SU, Ali Y, Jan Z, Rasheed S, Nazir NUA, Khan A, Rukh Abbas S, Wadood A, Rehman AU. Computational screening and analysis of deleterious nsSNPs in human p14ARF ( CDKN2A gene) protein using molecular dynamic simulation approach. J Biomol Struct Dyn 2023; 41:3964-3975. [PMID: 35446184 DOI: 10.1080/07391102.2022.2059570] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2021] [Accepted: 03/24/2022] [Indexed: 10/18/2022]
Abstract
Cyclin-dependent kinase inhibitor 2 A (CDKN2A) gene belongs to the cyclin-dependent kinase family that code for two transcripts (p16INK4A and p14ARF), both work as tumor suppressors proteins. The mutation that occurs in the p14ARF protein can lead to different types of cancers. Single nucleotide polymorphisms (SNPs) are an important type of genetic alteration that can lead to different types of diseases. In this study, we applied the computational strategy on human p14ARF protein to identify the potential deleterious nsSNPs and check their impact on the structure, function, and protein stability. We applied more than ten prediction tools to screen the retrieved 288 nsSNPs, consequently extracting four deleterious nsSNPs i.e., rs139725688 (R10G), rs139725688 (R21W), rs374360796 (F23L) and rs747717236 (L124R). Homology modeling, conservation and conformational analysis of mutant models were performed to examine the divergence of these variants from the native p14ARF structure. All-atom molecular dynamics simulation revealed a significant impact of these mutations on protein stability, compactness, globularity, solvent accessibility and secondary structure elements. Protein-protein interactions indicated that p14ARF operates as a hub linking clusters of different proteins and that changes in p14ARF may result in the disassociation of numerous signal cascades. Our current study is the first survey of computational analysis on p14ARF protein that determines the association of these nsSNPs with the altered function of p14ARF protein and leads to the development of various types of cancers. This research proposes the described functional SNPs as possible targets for proteomic investigations, diagnostic procedures, and treatments.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Syed Umair Ahmad
- Department of Bioinformatics, Hazara University, Mansehra, Pakistan
| | - Yasir Ali
- National Center for Bioinformatics, Quaid-i- Azam University, Islamabad, Pakistan
| | - Zainab Jan
- Department of Bioinformatics, Hazara University, Mansehra, Pakistan
| | - Salman Rasheed
- National Center for Bioinformatics, Quaid-i- Azam University, Islamabad, Pakistan
| | - Noor Ul Ain Nazir
- Atta Ur Rahman School of Applied Biosciences, National University of Sciences and Technology, Islamabad, Pakistan
| | - Asif Khan
- Department of Botany, Abdul Wali Khan University, Mardan, KPK, Pakistan
| | - Shah Rukh Abbas
- Atta Ur Rahman School of Applied Biosciences, National University of Sciences and Technology, Islamabad, Pakistan
| | - Abdul Wadood
- Department of Biochemistry, Abdul Wali Khan University, Mardan, KPK, Pakistan
| | - Ashfaq Ur Rehman
- Department of Molecular Biology and Biochemistry, University of California, Irvine, CA, USA
| |
Collapse
|
7
|
Ruiz-De-La-Cruz G, Sifuentes-Rincón AM, Casas E, Paredes-Sánchez FA, Parra-Bracamonte GM, Riley DG, Perry GA, Welsh TH, Randel RD. Genetic Variants and Their Putative Effects on microRNA-Seed Sites: Characterization of the 3' Untranslated Region of Genes Associated with Temperament. Genes (Basel) 2023; 14:genes14051004. [PMID: 37239364 DOI: 10.3390/genes14051004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 04/21/2023] [Accepted: 04/25/2023] [Indexed: 05/28/2023] Open
Abstract
The 3' untranslated region has an important role in gene regulation through microRNAs, and it has been estimated that microRNAs regulate up to 50% of coding genes in mammals. With the aim of allelic variant identification of 3' untranslated region microRNA seed sites, the 3' untranslated region was searched for seed sites of four temperament-associated genes (CACNG4, EXOC4, NRXN3, and SLC9A4). The microRNA seed sites were predicted in the four genes, and the CACNG4 gene had the greatest number with 12 predictions. To search for variants affecting the predicted microRNA seed sites, the four 3' untranslated regions were re-sequenced in a Brahman cattle population. Eleven single nucleotide polymorphisms were identified in the CACNG4, and eleven in the SLC9A4. Rs522648682:T>G of the CACNG4 gene was located at the predicted seed site for bta-miR-191. Rs522648682:T>G evidenced an association with both exit velocity (p = 0.0054) and temperament score (p = 0.0097). The genotype TT had a lower mean exit velocity (2.93 ± 0.4 m/s) compared with the TG and GG genotypes (3.91 ± 0.46 m/s and 3.67 ± 0.46 m/s, respectively). The allele associated with the temperamental phenotype antagonizes the seed site, disrupting the bta-miR-191 recognition. The G allele of CACNG4-rs522648682 has the potential to influence bovine temperament through a mechanism associated with unspecific recognition of bta-miR-191.
Collapse
Affiliation(s)
- Gilberto Ruiz-De-La-Cruz
- Laboratorio de Biotecnología Animal, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, Reynosa 88710, Mexico
| | - Ana María Sifuentes-Rincón
- Laboratorio de Biotecnología Animal, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, Reynosa 88710, Mexico
| | - Eduardo Casas
- National Animal Disease Center, Agricultural Research Service, Unite States Department of Agriculture, Ames, IA 50010, USA
| | | | - Gaspar Manuel Parra-Bracamonte
- Laboratorio de Biotecnología Animal, Centro de Biotecnología Genómica, Instituto Politécnico Nacional, Reynosa 88710, Mexico
| | - David G Riley
- Department of Animal Science, Texas A&M University, College Station, TX 77843, USA
| | | | - Thomas H Welsh
- Department of Animal Science, Texas A&M University, College Station, TX 77843, USA
| | | |
Collapse
|
8
|
Montañez-Miranda C, Perszyk RE, Harbin NH, Okalova J, Ramineni S, Traynelis SF, Hepler JR. Functional Assessment of Cancer-Linked Mutations in Sensitive Regions of Regulators of G Protein Signaling Predicted by Three-Dimensional Missense Tolerance Ratio Analysis. Mol Pharmacol 2023; 103:21-37. [PMID: 36384958 PMCID: PMC10955721 DOI: 10.1124/molpharm.122.000614] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 10/04/2022] [Accepted: 10/18/2022] [Indexed: 11/18/2022] Open
Abstract
Regulators of G protein signaling (RGS) proteins modulate G protein-coupled receptor (GPCR) signaling by acting as negative regulators of G proteins. Genetic variants in RGS proteins are associated with many diseases, including cancers, although the impact of these mutations on protein function is uncertain. Here we analyze the RGS domains of 15 RGS protein family members using a novel bioinformatic tool that measures the missense tolerance ratio (MTR) using a three-dimensional (3D) structure (3DMTR). Subsequent permutation analysis can define the protein regions that are most significantly intolerant (P < 0.05) in each dataset. We further focused on RGS14, RGS10, and RGS4. RGS14 exhibited seven significantly tolerant and seven significantly intolerant residues, RGS10 had six intolerant residues, and RGS4 had eight tolerant and six intolerant residues. Intolerant and tolerant-control residues that overlap with pathogenic cancer mutations reported in the COSMIC cancer database were selected to define the functional phenotype. Using complimentary cellular and biochemical approaches, proteins were tested for effects on GPCR-Gα activation, Gα binding properties, and downstream cAMP levels. Identified intolerant residues with reported cancer-linked mutations RGS14-R173C/H and RGS4-K125Q/E126K, and tolerant RGS14-S127P and RGS10-S64T resulted in a loss-of-function phenotype in GPCR-G protein signaling activity. In downstream cAMP measurement, tolerant RGS14-D137Y and RGS10-S64T and intolerant RGS10-K89M resulted in change of function phenotypes. These findings show that 3DMTR identified intolerant residues that overlap with cancer-linked mutations cause phenotypic changes that negatively impact GPCR-G protein signaling and suggests that 3DMTR is a potentially useful bioinformatics tool for predicting functionally important protein residues. SIGNIFICANCE STATEMENT: Human genetic variant/mutation information has expanded rapidly in recent years, including cancer-linked mutations in regulator of G protein signaling (RGS) proteins. However, experimental testing of the impact of this vast catalogue of mutations on protein function is not feasible. We used the novel bioinformatics tool three-dimensional missense tolerance ratio (3DMTR) to define regions of genetic intolerance in RGS proteins and prioritize which cancer-linked mutants to test. We found that 3DMTR more accurately classifies loss-of-function mutations in RGS proteins than other databases thereby offering a valuable new research tool.
Collapse
Affiliation(s)
- Carolina Montañez-Miranda
- Department of Pharmacology and Chemical Biology (C.M.-M., R.E.P., N.H.H., S.R., S.F.T., J.R.H.) and Aflac Cancer and Blood Disorders Center, Department of Pediatrics (J.O.), Emory University School of Medicine, Atlanta, Georgia
| | - Riley E Perszyk
- Department of Pharmacology and Chemical Biology (C.M.-M., R.E.P., N.H.H., S.R., S.F.T., J.R.H.) and Aflac Cancer and Blood Disorders Center, Department of Pediatrics (J.O.), Emory University School of Medicine, Atlanta, Georgia
| | - Nicholas H Harbin
- Department of Pharmacology and Chemical Biology (C.M.-M., R.E.P., N.H.H., S.R., S.F.T., J.R.H.) and Aflac Cancer and Blood Disorders Center, Department of Pediatrics (J.O.), Emory University School of Medicine, Atlanta, Georgia
| | - Jennifer Okalova
- Department of Pharmacology and Chemical Biology (C.M.-M., R.E.P., N.H.H., S.R., S.F.T., J.R.H.) and Aflac Cancer and Blood Disorders Center, Department of Pediatrics (J.O.), Emory University School of Medicine, Atlanta, Georgia
| | - Suneela Ramineni
- Department of Pharmacology and Chemical Biology (C.M.-M., R.E.P., N.H.H., S.R., S.F.T., J.R.H.) and Aflac Cancer and Blood Disorders Center, Department of Pediatrics (J.O.), Emory University School of Medicine, Atlanta, Georgia
| | - Stephen F Traynelis
- Department of Pharmacology and Chemical Biology (C.M.-M., R.E.P., N.H.H., S.R., S.F.T., J.R.H.) and Aflac Cancer and Blood Disorders Center, Department of Pediatrics (J.O.), Emory University School of Medicine, Atlanta, Georgia
| | - John R Hepler
- Department of Pharmacology and Chemical Biology (C.M.-M., R.E.P., N.H.H., S.R., S.F.T., J.R.H.) and Aflac Cancer and Blood Disorders Center, Department of Pediatrics (J.O.), Emory University School of Medicine, Atlanta, Georgia
| |
Collapse
|
9
|
Dharmarajan A, Gopinath V, Keloth Nayanar S, Velandi Kunnummal S, Balasubramanian S, Roshan Valiyaparambil Gopi D. Genomic analysis of breast cancer patients from Kerala: A novel BRCA1 mutation detected. Breast Dis 2023; 42:341-347. [PMID: 37980640 DOI: 10.3233/bd-220002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2023]
Abstract
BACKGROUND Breast cancer is the most common cancer among females, with an incidence of 6,41,000 cases annually. The genetic makeup of the individuals, ethnicity, geographical location, lifestyle, and BMI are some well-described factors associated with breast cancer. It is well known that pathogenic variants in BRCA1 and BRCA2 are associated with a majority of hereditary breast cancer. Genome-wide association studies (GWAS) have identified more than 80 germline susceptibility loci responsible for hereditary breast cancer. METHODS In the present study, analysis of 94 genes associated with hereditary cancer was performed using next generation sequencing (NGS) in twelve patients having breast cancer and suspected with hereditary association. RESULTS Four out of twelve (33%) patients harbored pathogenic mutation of the BRCA1 gene. Two patients was identified p. E23Vfs*17 mutation in BRCA1, one patient had p.Glu1580Gln in BRCA1, and a novel frameshift variant p.T1456Ifs*9(c.4367Cdel) in one patient. CONCLUSION In the present study, out of four detected mutations in the BRCA1 gene, three were known and one was a novel BRCA1 mutation. It is advised to perform NGS-based genome sequencing to identify the genetic predisposition in breast cancer patients.
Collapse
Affiliation(s)
- Adarsh Dharmarajan
- Department of Surgical Oncology, Malabar Cancer Centre, Moozhikkara P.O., Thalassery, Kerala, India
| | - Vipin Gopinath
- Division of Genetics and Cytogenetics, Department of Clinical Lab Services and Translational Research, Malabar Cancer Centre, Moozhikkara P.O., Thalassery, Kerala, India
| | - Sangeetha Keloth Nayanar
- Division of Oncopathology, Department of Clinical Lab Services and Translational Research, Malabar Cancer Centre, Moozhikkara P.O., Thalassery, Kerala, India
| | | | | | - Deepak Roshan Valiyaparambil Gopi
- Division of Genetics and Cytogenetics, Department of Clinical Lab Services and Translational Research, Malabar Cancer Centre, Moozhikkara P.O., Thalassery, Kerala, India
| |
Collapse
|
10
|
Zhang C, Dalby PA. Assessing and Engineering Antibody Stability Using Experimental and Computational Methods. Methods Mol Biol 2023; 2552:165-197. [PMID: 36346592 DOI: 10.1007/978-1-0716-2609-2_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Engineering increased stability into antibodies can improve their developability. While a range of properties need to be optimized, thermal stability and aggregation are two key factors that affect the antibody yield, purity, and specificity throughout the development and manufacturing pipeline. Therefore, an ideal goal would be to apply protein engineering methods early-on, such as in parallel to affinity maturation, to screen out potential drug molecules with the desired conformational and colloidal stability. This chapter introduces our methods to computationally characterize an antibody Fab fragment, propose stabilizing variants, and then experimentally verify these predictions.
Collapse
Affiliation(s)
- Cheng Zhang
- Department of Biochemical Engineering, University College London, London, UK
| | - Paul Anthony Dalby
- Department of Biochemical Engineering, University College London, London, UK.
| |
Collapse
|
11
|
Thirumal Kumar D, Shaikh N, Bithia R, Karthick V, George Priya Doss C, Magesh R. Computational screening and structural analysis of Gly201Arg and Gly201Asp missense mutations in human cyclin-dependent kinase 4 protein. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2023; 135:57-96. [PMID: 37061341 DOI: 10.1016/bs.apcsb.2023.02.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/17/2023]
Abstract
The regulatory proteins, cyclins, and cyclin-dependent kinases (CDKs) control the cell cycle progression. CDK4 gene mutations are associated with certain cancers such as melanoma, breast cancer, and rhabdomyosarcoma. Therefore, understanding the mechanisms of cell cycle control and cell proliferation is essential in developing cancer treatment regimens. In this study, we obtained cancer-causing CDK4 mutations from the COSMIC database and subjected them to a series of in silico analyses to identify the most significant mutations. An overall of 238 mutations (119 missense mutations) retrieved from the COSMIC database were investigated for the pathogenic and destabilizing properties using the PredictSNP and iStable algorithms. Further, the amino acid position of the most pathogenic and destabilizing mutations were analyzed to understand the nature of amino acid conservation across the species during the evolution. We observed that the missense mutations G201R and G201D were more significant and the Glycine at position 201 was found to highly conserved. These significant mutations were subjected to molecular dynamics simulation analysis to understand the protein's structural changes. The results from molecular dynamics simulations revealed that both G201R and G201D of CDK4 are capable of altering the protein's native form. On comparison among the most significant mutations, G201R disrupted the protein structure higher than the protein with G201D.
Collapse
Affiliation(s)
- D Thirumal Kumar
- Faculty of Allied Health Sciences, Meenakshi Academy of Higher Education and Research, Chennai, Tamil Nadu, India
| | - Nishaat Shaikh
- Mahimkar Lab [Tobacco Carcinogenesis Lab], Cancer Research Institute, Advanced Centre for Treatment, Research and Education in Cancer [ACTREC], TATA Memorial Centre, Navi Mumbai, Maharashtra, India
| | - R Bithia
- Laboratory of Integrative Genomics, Department of Integrative Biology, School of Bio Sciences and Technology, Vellore Institute of Technology (VIT), Vellore, Tamil Nadu, India
| | - V Karthick
- Department of Biotechnology, School of Applied Sciences, REVA University, Bengaluru, Karnataka, India
| | - C George Priya Doss
- Laboratory of Integrative Genomics, Department of Integrative Biology, School of Bio Sciences and Technology, Vellore Institute of Technology (VIT), Vellore, Tamil Nadu, India.
| | - R Magesh
- Department of Biotechnology, FBMS&T, Sri Ramachandra Institute of Higher Education and Research (DU), Porur, Chennai, Tamil Nadu, India
| |
Collapse
|
12
|
Wu TH, Lin PC, Chou HH, Shen MR, Hsieh SY. Pathogenicity Prediction of Single Amino Acid Variants With Machine Learning Model Based on Protein Structural Energies. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:606-615. [PMID: 34962874 DOI: 10.1109/tcbb.2021.3139048] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
The most popular tools for predicting pathogenicity of single amino acid variants (SAVs) were developed based on sequence-based techniques. SAVs may change protein structure and function. In the context of van der Waals force and disulfide bridge calculations, no method directly predicts the impact of mutations on the energies of the protein structure. Here, we combined machine learning methods and energy scores of protein structures calculated by Rosetta Energy Function 2015 to predict SAV pathogenicity. The accuracy level of our model (0.76) is higher than that of six prediction tools. Further analyses revealed that the differential reference energies, attractive energies, and solvation of polar atoms between wildtype and mutant side-chains played essential roles in distinguishing benign from pathogenic variants. These features indicated the physicochemical properties of amino acids, which were observed in 3D structures instead of sequences. We added 16 features to Rhapsody (the prediction tool we used for our data set) and consequently improved its performance. The results indicated that these energy scores were more appropriate and more detailed representations of the pathogenicity of SAVs.
Collapse
|
13
|
Ahmad HI, Ijaz N, Afzal G, Asif AR, ur Rehman A, Rahman A, Ahmed I, Yousaf M, Elokil A, Muhammad SA, Albogami SM, Alotaibi SS. Computational Insights into the Structural and Functional Impacts of nsSNPs of Bone Morphogenetic Proteins. BIOMED RESEARCH INTERNATIONAL 2022; 2022:4013729. [PMID: 35832847 PMCID: PMC9273450 DOI: 10.1155/2022/4013729] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/23/2022] [Accepted: 06/15/2022] [Indexed: 12/12/2022]
Abstract
BMPs (bone morphogenetic proteins) are multipurpose (transforming growth factor)TGF-superfamily released cytokines. These glycoproteins, acting as disulfide-linked homo- or heterodimers, are highly potent regulators of bone and cartilage production and repair, cell proliferation throughout embryonic development, and bone homeostasis in the adults. Due to the fact that genetic variation might influence structural functions, this study is aimed to determine the pathogenic effect of nonsynonymous single-nucleotide polymorphisms (nsSNPs) in BMP genes. The implications of these variations, investigated using computational analysis and molecular models of the mature TGF-β domain, revealed the impact of modifications on the function of BMP protein. The three-dimensional (3D) structure analysis was performed on the nsSNP Y316S, V386G, E387G, C389G, and C391G nsSNP in the TGF-β domain of chicken BMP2 and H344P, S347P, V357A nsSNP in the TGF-β domain of chicken BMP4 protein that was anticipated to be harmful and of high risk. The ability of the proteins to perform variety of tasks interact with other molecules depends on their tertiary structural composition. The current analysis revealed the four most damaging variants (Y316S, V386G, E387G, C389G, and C391G), highly conserved and functional and are located in the TGF-beta domain of BMP2 and BMP4. The amino acid substitutions E387G, C389G, and C391G are discovered in the binding region. It was observed that the mutations in the TGF-beta domain caused significant changes in its structural organization including the substrate binding sites. Current findings will assist future research focused on the role of these variants in BMP function loss and their role in skeletal disorders, and this will possibly help to develop practical strategies for treating bone-related conditions.
Collapse
Affiliation(s)
- Hafiz Ishfaq Ahmad
- Department of Animal Breeding and Genetics, University of Veterinary and Animal Sciences, Lahore, Pakistan
| | - Nabeel Ijaz
- Department of Clinical Science, Faculty of Veterinary Sciences, Bahauddin Zakariya University Multan, Pakistan
| | - Gulnaz Afzal
- Department of Zoology, The Islamia University of Bahawalpur, Bahawalpur, Pakistan
| | - Akhtar Rasool Asif
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Huazhong Agricultural University, Wuhan, China
- University of Veterinary and Animal Sciences, Lahore, Sub-Campus Jhang, Pakistan
| | - Aziz ur Rehman
- Key Laboratory of Animal Genetics, Breeding and Reproduction, Huazhong Agricultural University, Wuhan, China
- University of Veterinary and Animal Sciences, Lahore, Sub-Campus Jhang, Pakistan
| | - Abdur Rahman
- University of Veterinary and Animal Sciences, Lahore, Sub-Campus Jhang, Pakistan
- Department of Animal Nutrition, Afyon Kocatepe University, Turkey
| | - Irfan Ahmed
- Department of Animal Nutrition, Faculty of Veterinary and Animal Sciences, The Islamia University of Bahawalpur, Pakistan
| | - Muhammad Yousaf
- Department of Animal Nutrition, Faculty of Veterinary and Animal Sciences, The Islamia University of Bahawalpur, Pakistan
| | - Abdelmotaleb Elokil
- Department of Animal Production, Faculty of Agriculture, Benha University, Moshtohor 13736, Egypt
| | - Sayyed Aun Muhammad
- University of Veterinary and Animal Sciences, Lahore, Sub-Campus Jhang, Pakistan
| | - Sarah M. Albogami
- Department of Biotechnology, College of Science, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia
| | - Saqer S. Alotaibi
- Department of Biotechnology, College of Science, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia
| |
Collapse
|
14
|
Kekenes-Huskey PM, Burgess DE, Sun B, Bartos DC, Rozmus ER, Anderson CL, January CT, Eckhardt LL, Delisle BP. Mutation-Specific Differences in Kv7.1 ( KCNQ1) and Kv11.1 ( KCNH2) Channel Dysfunction and Long QT Syndrome Phenotypes. Int J Mol Sci 2022; 23:7389. [PMID: 35806392 PMCID: PMC9266926 DOI: 10.3390/ijms23137389] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2022] [Revised: 06/22/2022] [Accepted: 06/24/2022] [Indexed: 11/16/2022] Open
Abstract
The electrocardiogram (ECG) empowered clinician scientists to measure the electrical activity of the heart noninvasively to identify arrhythmias and heart disease. Shortly after the standardization of the 12-lead ECG for the diagnosis of heart disease, several families with autosomal recessive (Jervell and Lange-Nielsen Syndrome) and dominant (Romano-Ward Syndrome) forms of long QT syndrome (LQTS) were identified. An abnormally long heart rate-corrected QT-interval was established as a biomarker for the risk of sudden cardiac death. Since then, the International LQTS Registry was established; a phenotypic scoring system to identify LQTS patients was developed; the major genes that associate with typical forms of LQTS were identified; and guidelines for the successful management of patients advanced. In this review, we discuss the molecular and cellular mechanisms for LQTS associated with missense variants in KCNQ1 (LQT1) and KCNH2 (LQT2). We move beyond the "benign" to a "pathogenic" binary classification scheme for different KCNQ1 and KCNH2 missense variants and discuss gene- and mutation-specific differences in K+ channel dysfunction, which can predispose people to distinct clinical phenotypes (e.g., concealed, pleiotropic, severe, etc.). We conclude by discussing the emerging computational structural modeling strategies that will distinguish between dysfunctional subtypes of KCNQ1 and KCNH2 variants, with the goal of realizing a layered precision medicine approach focused on individuals.
Collapse
Affiliation(s)
- Peter M. Kekenes-Huskey
- Department of Cell and Molecular Physiology, Stritch School of Medicine, Loyola University Chicago, Maywood, IL 60153, USA
| | - Don E. Burgess
- Department of Physiology, College of Medicine, University of Kentucky, Lexington, KY 40536, USA; (D.E.B.); (E.R.R.)
| | - Bin Sun
- Department of Pharmacology, Harbin Medical University, Harbin 150081, China;
| | | | - Ezekiel R. Rozmus
- Department of Physiology, College of Medicine, University of Kentucky, Lexington, KY 40536, USA; (D.E.B.); (E.R.R.)
| | - Corey L. Anderson
- Cellular and Molecular Arrythmias Program, Division of Cardiovascular Medicine, Department of Medicine, University of Wisconsin-Madison, Madison, WI 53705, USA; (C.L.A.); (C.T.J.); (L.L.E.)
| | - Craig T. January
- Cellular and Molecular Arrythmias Program, Division of Cardiovascular Medicine, Department of Medicine, University of Wisconsin-Madison, Madison, WI 53705, USA; (C.L.A.); (C.T.J.); (L.L.E.)
| | - Lee L. Eckhardt
- Cellular and Molecular Arrythmias Program, Division of Cardiovascular Medicine, Department of Medicine, University of Wisconsin-Madison, Madison, WI 53705, USA; (C.L.A.); (C.T.J.); (L.L.E.)
| | - Brian P. Delisle
- Department of Physiology, College of Medicine, University of Kentucky, Lexington, KY 40536, USA; (D.E.B.); (E.R.R.)
| |
Collapse
|
15
|
Comparative analysis of web-based programs for single amino acid substitutions in proteins. PLoS One 2022; 17:e0267084. [PMID: 35507592 PMCID: PMC9067658 DOI: 10.1371/journal.pone.0267084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Accepted: 04/02/2022] [Indexed: 11/19/2022] Open
Abstract
Single amino-acid substitution in a protein affects its structure and function. These changes are the primary reasons for the advent of many complex diseases. Analyzing single point mutations in a protein is crucial to see their impact and to understand the disease mechanism. This has given many biophysical resources, including databases and web-based tools to explore the effects of mutations on the structure and function of human proteins. For a given mutation, each tool provides a score-based outcomes which indicate deleterious probability. In recent years, developments in existing programs and the introduction of new prediction algorithms have transformed the state-of-the-art protein mutation analysis. In this study, we have performed a systematic study of the most commonly used mutational analysis programs (10 sequence-based and 5 structure-based) to compare their prediction efficiency. We have carried out extensive mutational analyses using these tools for previously known pathogenic single point mutations of five different proteins. These analyses suggested that sequence-based tools, PolyPhen2, PROVEAN, and PMut, and structure-based web tool, mCSM have a better prediction accuracy. This study indicates that the employment of more than one program based on different approaches should significantly improve the prediction power of the available methods.
Collapse
|
16
|
Li B, Jin B, Capra JA, Bush WS. Integration of Protein Structure and Population-Scale DNA Sequence Data for Disease Gene Discovery and Variant Interpretation. Annu Rev Biomed Data Sci 2022; 5:141-161. [PMID: 35508071 DOI: 10.1146/annurev-biodatasci-122220-112147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The experimental and computational techniques for capturing information about protein structures and genetic variation within the human genome have advanced dramatically in the past 20 years, generating extensive new data resources. In this review, we discuss these advances, along with new approaches for determining the impact a genetic variant has on protein function. We focus on the potential of new methods that integrate human genetic variation into protein structures to discover relationships to disease, including the discovery of mutational hotspots in cancer-related proteins, the localization of protein-altering variants within protein regions for common complex diseases, and the assessment of variants of unknown significance for Mendelian traits. We expect that approaches that integrate these data sources will play increasingly important roles in disease gene discovery and variant interpretation. Expected final online publication date for the Annual Review of Biomedical Data Science, Volume 5 is August 2022. Please see http://www.annualreviews.org/page/journal/pubdates for revised estimates.
Collapse
Affiliation(s)
- Bian Li
- Department of Biological Sciences and Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, USA
| | - Bowen Jin
- Graduate Program in Systems Biology and Bioinformatics, Department of Nutrition, School of Medicine, Case Western Reserve University, Cleveland, Ohio, USA
| | - John A Capra
- Bakar Computational Health Sciences Institute and Department of Epidemiology and Biostatistics, University of California, San Francisco, California, USA;
| | - William S Bush
- Cleveland Institute for Computational Biology, Department of Population and Quantitative Health Sciences, Case Western Reserve University, Cleveland, Ohio, USA;
| |
Collapse
|
17
|
Tiberti M, Terkelsen T, Degn K, Beltrame L, Cremers TC, da Piedade I, Di Marco M, Maiani E, Papaleo E. MutateX: an automated pipeline for in silico saturation mutagenesis of protein structures and structural ensembles. Brief Bioinform 2022; 23:6552273. [PMID: 35323860 DOI: 10.1093/bib/bbac074] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 01/28/2022] [Accepted: 02/16/2022] [Indexed: 12/26/2022] Open
Abstract
Mutations, which result in amino acid substitutions, influence the stability of proteins and their binding to biomolecules. A molecular understanding of the effects of protein mutations is both of biotechnological and medical relevance. Empirical free energy functions that quickly estimate the free energy change upon mutation (ΔΔG) can be exploited for systematic screenings of proteins and protein complexes. In silico saturation mutagenesis can guide the design of new experiments or rationalize the consequences of known mutations. Often software such as FoldX, while fast and reliable, lack the necessary automation features to apply them in a high-throughput manner. We introduce MutateX, a software to automate the prediction of ΔΔGs associated with the systematic mutation of each residue within a protein, or protein complex to all other possible residue types, using the FoldX energy function. MutateX also supports ΔΔG calculations over protein ensembles, upon post-translational modifications and in multimeric assemblies. At the heart of MutateX lies an automated pipeline engine that handles input preparation, parallelization and outputs publication-ready figures. We illustrate the MutateX protocol applied to different case studies. The results of the high-throughput scan provided by our tools can help in different applications, such as the analysis of disease-associated mutations, to complement experimental deep mutational scans, or assist the design of variants for industrial applications. MutateX is a collection of Python tools that relies on open-source libraries. It is available free of charge under the GNU General Public License from https://github.com/ELELAB/mutatex.
Collapse
Affiliation(s)
- Matteo Tiberti
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Thilde Terkelsen
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Kristine Degn
- Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, 2800, Lyngby, Denmark
| | - Ludovica Beltrame
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Tycho Canter Cremers
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Isabelle da Piedade
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Miriam Di Marco
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Emiliano Maiani
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark
| | - Elena Papaleo
- Cancer Structural Biology, Danish Cancer Society Research Center, 2100, Copenhagen, Denmark.,Cancer Systems Biology, Section for Bioinformatics, Department of Health and Technology, Technical University of Denmark, 2800, Lyngby, Denmark.,Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
18
|
Wang D, Li J, Wang Y, Wang E. A comparison on predicting functional impact of genomic variants. NAR Genom Bioinform 2022; 4:lqab122. [PMID: 35047814 PMCID: PMC8759571 DOI: 10.1093/nargab/lqab122] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2021] [Revised: 11/13/2021] [Accepted: 12/20/2021] [Indexed: 12/23/2022] Open
Abstract
Single-nucleotide polymorphism (SNPs) may cause the diverse functional impact on RNA or protein changing genotype and phenotype, which may lead to common or complex diseases like cancers. Accurate prediction of the functional impact of SNPs is crucial to discover the ‘influential’ (deleterious, pathogenic, disease-causing, and predisposing) variants from massive background polymorphisms in the human genome. Increasing computational methods have been developed to predict the functional impact of variants. However, predictive performances of these computational methods on massive genomic variants are still unclear. In this regard, we systematically evaluated 14 important computational methods including specific methods for one type of variant and general methods for multiple types of variants from several aspects; none of these methods achieved excellent (AUC ≥ 0.9) performance in both data sets. CADD and REVEL achieved excellent performance on multiple types of variants and missense variants, respectively. This comparison aims to assist researchers and clinicians to select appropriate methods or develop better predictive methods.
Collapse
Affiliation(s)
- Dong Wang
- School of Computer Science and Technology, Harbin Institute of Technology Harbin, Harbin, Heilongjiang, 150001,China
| | - Jie Li
- To whom correspondence should be addressed. Tel: +86 0451 86413309;
| | - Yadong Wang
- Correspondence may also be addressed to Yadong Wang. Tel: +86 0451 86413309;
| | - Edwin Wang
- Department of Medical Genetics, University of Calgary, Calgary, HSC 1185, Canada
| |
Collapse
|
19
|
Gonzalez A, Belmonte I, Nuñez A, Farago G, Barrecheguren M, Pons M, Orriols G, Gabriel-Medina P, Rodríguez-Frías F, Miravitlles M, Esquinas C. New variants of alpha-1-antitrypsin: structural simulations and clinical expression. Respir Res 2022; 23:339. [PMID: 36496391 PMCID: PMC9741788 DOI: 10.1186/s12931-022-02271-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Accepted: 12/01/2022] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Alpha-1 antitrypsin deficiency (AATD) is characterized by reduced serum levels of the AAT protein and predisposes to liver and lung disease. The characterization at structural level of novel pathogenic SERPINA1 mutants coding for circulating AAT could provide novel insights into the mechanisms of AAT misfolding. The present study aimed to provide a practical framework for the identification and analysis of new AAT mutations, combining structural simulations and clinical data. METHODS We analysed a total of five mutations (four not previously described) in a total of six subjects presenting moderate to severe AATD: Gly95Alafs*18, Val210Glu, Asn247Ser, Pi*S + Asp341His and Pi*S + Leu383Phe + Lys394Ile. Clinical data, genotyping and phenotyping assays, structural mapping, and conformational characterization through molecular dynamic (MD) simulations were developed and combined. RESULTS Newly discovered AAT missense variants were localized both on the interaction surface and the hydrophobic core of the protein. Distribution of mutations across the structure revealed Val210Glu at the solvent exposed s4C strand and close to the "Gate" region. Asn247Ser was located on the accessible surface, which is important for glycan attachment. On the other hand, Asp341His, Leu383Phe were mapped close to the "breach" and "shutter" regions. MD analysis revealed the reshaping of local interactions around the investigated substitutions that have varying effects on AAT conformational flexibility, hydrophobic packing, and electronic surface properties. The most severe structural changes were observed in the double- and triple-mutant (Pi*S + Asp341His and Pi*S + Leu383Phe + Lys394Ile) molecular models. The two carriers presented impaired lung function. CONCLUSIONS The results characterize five variants, four of them previously unknown, of the SERPINA1 gene, which define new alleles contributing to the deficiency of AAT. Rare variants might be more frequent than expected, and therefore, in discordant cases, standardized screening of the S and Z alleles needs complementation with gene sequencing and structural approaches. The utility of computational modelling for providing supporting evidence of the pathogenicity of rare single nucleotide variations is discussed.
Collapse
Affiliation(s)
- Angel Gonzalez
- grid.7080.f0000 0001 2296 0625Department of Computational Medicine, Statistic Unit, Medicine Faculty, Universitat Autònoma de Barcelona, Bellaterra, Barcelona, Spain
| | - Irene Belmonte
- grid.411083.f0000 0001 0675 8654Department of Clinical Biochemistry, Hospital Universitari Vall d’Hebron, Vall d’Hebron Barcelona Hospital Campus, Barcelona, Spain
| | - Alexa Nuñez
- grid.411083.f0000 0001 0675 8654Pneumology Department, Hospital Universitari Vall d’Hebron, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Barcelona Hospital Campus, P. Vall d’Hebron 119-129, 08035 Barcelona, Spain
| | - Georgina Farago
- grid.411083.f0000 0001 0675 8654Pneumology Department, Hospital Universitari Vall d’Hebron, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Barcelona Hospital Campus, P. Vall d’Hebron 119-129, 08035 Barcelona, Spain
| | - Miriam Barrecheguren
- grid.411083.f0000 0001 0675 8654Pneumology Department, Hospital Universitari Vall d’Hebron, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Barcelona Hospital Campus, P. Vall d’Hebron 119-129, 08035 Barcelona, Spain
| | - Mònica Pons
- grid.411083.f0000 0001 0675 8654Liver Unit, Department of Internal Medicine, Hospital Universitari Vall d’Hebron, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Barcelona Hospital Campus, Barcelona, Spain
| | - Gerard Orriols
- grid.411083.f0000 0001 0675 8654Department of Clinical Biochemistry, Hospital Universitari Vall d’Hebron, Vall d’Hebron Barcelona Hospital Campus, Barcelona, Spain
| | - Pablo Gabriel-Medina
- grid.411083.f0000 0001 0675 8654Department of Clinical Biochemistry, Hospital Universitari Vall d’Hebron, Vall d’Hebron Barcelona Hospital Campus, Barcelona, Spain
| | - Francisco Rodríguez-Frías
- grid.411083.f0000 0001 0675 8654Department of Clinical Biochemistry, Hospital Universitari Vall d’Hebron, Vall d’Hebron Barcelona Hospital Campus, Barcelona, Spain ,grid.7080.f0000 0001 2296 0625Facultat de Medicina, Universitat Autònoma de Barcelona, Bellaterra, Barcelona, Spain ,grid.452371.60000 0004 5930 4607Centro de Investigación Biomédica en Red de Enfermedades Hepáticas y Digestivas, (CIBEREHD), Barcelona, Spain ,grid.430994.30000 0004 1763 0287Clinical Biochemistry Research Group/Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Barcelona Hospital Campus, Barcelona, Spain
| | - Marc Miravitlles
- grid.411083.f0000 0001 0675 8654Pneumology Department, Hospital Universitari Vall d’Hebron, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Barcelona Hospital Campus, P. Vall d’Hebron 119-129, 08035 Barcelona, Spain ,grid.512891.6Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBERES), Barcelona, Spain
| | - Cristina Esquinas
- grid.411083.f0000 0001 0675 8654Pneumology Department, Hospital Universitari Vall d’Hebron, Vall d’Hebron Institut de Recerca (VHIR), Vall d’Hebron Barcelona Hospital Campus, P. Vall d’Hebron 119-129, 08035 Barcelona, Spain ,grid.5841.80000 0004 1937 0247Public Health, Mental, Maternal and Child Health Nursing Departament, Faculty of Medicine and Health Sciences, University of Barcelona, Barcelona, Spain
| |
Collapse
|
20
|
Hashemi ZS, Zarei M, Mubarak SMH, Hessami A, Mard-Soltani M, Khalesi B, Zakeri A, Rahbar MR, Jahangiri A, Pourzardosht N, Khalili S. Pierce into Structural Changes of Interactions Between Mutated Spike Glycoproteins and ACE2 to Evaluate Its Potential Biological and Therapeutic Consequences. Int J Pept Res Ther 2021; 28:33. [PMID: 34931119 PMCID: PMC8674523 DOI: 10.1007/s10989-021-10346-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/08/2021] [Indexed: 12/27/2022]
Abstract
The structural consequences of ongoing mutations on the SARS-CoV-2 spike-protein remains to be fully elucidated. These mutations could change the binding affinity between the virus and its target cell. Moreover, obtaining new mutations would also change the therapeutic efficacy of the designed drug candidates. To evaluate these consequences, 3D structure of a mutant spike protein was predicted and checked for stability, cavity sites, and residue depth. The docking analyses were performed between the 3D model of the mutated spike protein and the ACE2 protein and an engineered therapeutic ACE2 against COVID-19. The obtained results revealed that the N501Y substitution has altered the interaction orientation, augmented the number of interface bonds, and increased the affinity against the ACE2. On the other hand, the P681H mutation contributed to the increased cavity size and relatively higher residue depth. The binding affinity between the engineered therapeutic ACE2 and the mutant spike was significantly higher with a distinguished binding orientation. It could be concluded that the mutant spike protein increased the affinity, preserved the location, changed the orientation, and altered the interface amino acids of its interaction with both the ACE2 and its therapeutic engineered version. The obtained results corroborate the more aggressive nature of mutated SARS-CoV-2 due to their higher binding affinity. Moreover, designed ACe2-baased therapeutics would be still highly effective against covid-19, which could be the result of conserved nature of cellular ACE2. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1007/s10989-021-10346-1.
Collapse
Affiliation(s)
- Zahra Sadat Hashemi
- ATMP Department, Breast Cancer Research Center, Motamed Cancer Institute, ACECR, Tehran, Iran
| | - Mahboubeh Zarei
- Pharmaceutical Sciences Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Shaden M. H. Mubarak
- Department of Clinical Laboratory Science, Faculty of Pharmacy, University of Kufa, Najaf, Iraq
| | - Anahita Hessami
- School of Pharmacy, Shiraz University of medical sciences, Shiraz, Iran
| | - Maysam Mard-Soltani
- Department of Clinical Biochemistry, Faculty of Medical Sciences, Dezful University of Medical Sciences, Dezful, Iran
| | - Bahman Khalesi
- Department of Research and Production of Poultry Viral Vaccine, Razi Vaccine and Serum Research Institute, Agricultural Research Education and Extension Organization, Karaj, Iran
| | - Alireza Zakeri
- Department of Biology Sciences, Shahid Rajaee Teacher Training University, Tehran, Iran
| | - Mohammad Reza Rahbar
- Pharmaceutical Sciences Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Abolfazl Jahangiri
- Applied Microbiology Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran
| | - Navid Pourzardosht
- Biochemistry Department, Guilan University of Medical Sciences, Rasht, Iran
| | - Saeed Khalili
- Department of Biology Sciences, Shahid Rajaee Teacher Training University, Tehran, Iran
| |
Collapse
|
21
|
Hossain MS, Tonmoy MIQ, Fariha A, Islam MS, Roy AS, Islam MN, Kar K, Alam MR, Rahaman MM. Prediction of the Effects of Variants and Differential Expression of Key Host Genes ACE2, TMPRSS2, and FURIN in SARS-CoV-2 Pathogenesis: An In Silico Approach. Bioinform Biol Insights 2021; 15:11779322211054684. [PMID: 34720581 PMCID: PMC8554545 DOI: 10.1177/11779322211054684] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Accepted: 10/02/2021] [Indexed: 12/15/2022] Open
Abstract
A new strain of the beta coronavirus, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), is solely responsible for the ongoing coronavirus disease 2019 (COVID-19) pandemic. Although several studies suggest that the spike protein of this virus interacts with the cell surface receptor, angiotensin-converting enzyme 2 (ACE2), and is subsequently cleaved by TMPRSS2 and FURIN to enter into the host cell, conclusive insight about the interaction pattern of the variants of these proteins is still lacking. Thus, in this study, we analyzed the functional conjugation among the spike protein, ACE2, TMPRSS2, and FURIN in viral pathogenesis as well as the effects of the mutations of the proteins through the implementation of several bioinformatics approaches. Analysis of the intermolecular interactions revealed that T27A (ACE2), G476S (receptor-binding domain [RBD] of the spike protein), C297T (TMPRSS2), and P812S (cleavage site for TMPRSS2) coding variants may render resistance in viral infection, whereas Q493L (RBD), S477I (RBD), P681R (cleavage site for FURIN), and P683W (cleavage site for FURIN) may lead to increase viral infection. Genotype-specific expression analysis predicted several genetic variants of ACE2 (rs2158082, rs2106806, rs4830971, and rs4830972), TMPRSS2 (rs458213, rs468444, rs4290734, and rs6517666), and FURIN (rs78164913 and rs79742014) that significantly alter their normal expression which might affect the viral spread. Furthermore, we also found that ACE2, TMPRSS2, and FURIN proteins are functionally co-related with each other, and several genes are highly co-expressed with them, which might be involved in viral pathogenesis. This study will thus help in future genomics and proteomics studies of SARS-CoV-2 and will provide an opportunity to understand the underlying molecular mechanism during SARS-CoV-2 pathogenesis.
Collapse
Affiliation(s)
- Md. Shahadat Hossain
- Department of Biotechnology & Genetic Engineering, Noakhali Science and Technology University, Noakhali, Bangladesh
| | | | - Atqiya Fariha
- Department of Biotechnology & Genetic Engineering, Noakhali Science and Technology University, Noakhali, Bangladesh
| | - Md. Sajedul Islam
- Department of Biochemistry & Biotechnology, University of Barishal, Barishal, Bangladesh
| | - Arpita Singha Roy
- Department of Biotechnology & Genetic Engineering, Noakhali Science and Technology University, Noakhali, Bangladesh
| | - Md. Nur Islam
- Department of Biotechnology & Genetic Engineering, Noakhali Science and Technology University, Noakhali, Bangladesh
| | - Kumkum Kar
- Department of Biotechnology & Genetic Engineering, Noakhali Science and Technology University, Noakhali, Bangladesh
| | - Mohammad Rahanur Alam
- Department of Food Technology & Nutrition Science, Noakhali Science and Technology University, Noakhali, Bangladesh
| | | |
Collapse
|
22
|
Pacini L, Lesieur C. A computational methodology to diagnose sequence-variant dynamic perturbations by comparing atomic protein structures. Bioinformatics 2021; 38:703-709. [PMID: 34694373 PMCID: PMC8574318 DOI: 10.1093/bioinformatics/btab736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Revised: 09/29/2021] [Accepted: 10/21/2021] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION The objective is to diagnose dynamics perturbations caused by amino-acid mutations as prerequisite to assess protein functional health or drug failure, simply using network models of protein X-ray structures. RESULTS We find that the differences in the allocation of the atomic interactions of each amino acid to 1D, 2D, 3D, 4D structural levels between variants structurally robust, recover experimental dynamic perturbations. The allocation measure validated on two B-pentamers variants of AB5 toxins having 17 mutations, also distinguishes dynamic perturbations of pathogenic and non-pathogenic Transthyretin single-mutants. Finally, the main proteases of the coronaviruses SARS-CoV and SARS-CoV-2 exhibit changes in the allocation measure, raising the possibility of drug failure despite the main proteases structural similarity. AVAILABILITY AND IMPLEMENTATION The Python code used for the production of the results is available at github.com/lorpac/protein_partitioning_atomic_contacts. The authors will run the analysis on any PDB structures of protein variants upon request. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Lorenza Pacini
- AMPERE, CNRS, Université de Lyon, Lyon, 69622, France,Institut Rhônalpin des systèmes complexes (IXXI), École Normale Supérieure de Lyon, Lyon, 69007, France
| | - Claire Lesieur
- AMPERE, CNRS, Université de Lyon, Lyon, 69622, France,Institut Rhônalpin des systèmes complexes (IXXI), École Normale Supérieure de Lyon, Lyon, 69007, France,To whom correspondence should be addressed. E-mail:
| |
Collapse
|
23
|
Ganguly P, Madonsela L, Chao JT, Loewen CJR, O’Connor TP, Verheyen EM, Allan DW. A scalable Drosophila assay for clinical interpretation of human PTEN variants in suppression of PI3K/AKT induced cellular proliferation. PLoS Genet 2021; 17:e1009774. [PMID: 34492006 PMCID: PMC8448351 DOI: 10.1371/journal.pgen.1009774] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Revised: 09/17/2021] [Accepted: 08/10/2021] [Indexed: 12/28/2022] Open
Abstract
Gene variant discovery is becoming routine, but it remains difficult to usefully interpret the functional consequence or disease relevance of most variants. To fill this interpretation gap, experimental assays of variant function are becoming common place. Yet, it remains challenging to make these assays reproducible, scalable to high numbers of variants, and capable of assessing defined gene-disease mechanism for clinical interpretation aligned to the ClinGen Sequence Variant Interpretation (SVI) Working Group guidelines for 'well-established assays'. Drosophila melanogaster offers great potential as an assay platform, but was untested for high numbers of human variants adherent to these guidelines. Here, we wished to test the utility of Drosophila as a platform for scalable well-established assays. We took a genetic interaction approach to test the function of ~100 human PTEN variants in cancer-relevant suppression of PI3K/AKT signaling in cellular growth and proliferation. We validated the assay using biochemically characterized PTEN mutants as well as 23 total known pathogenic and benign PTEN variants, all of which the assay correctly assigned into predicted functional categories. Additionally, function calls for these variants correlated very well with our recent published data from a human cell line. Finally, using these pathogenic and benign variants to calibrate the assay, we could set readout thresholds for clinical interpretation of the pathogenicity of 70 other PTEN variants. Overall, we demonstrate that Drosophila offers a powerful assay platform for clinical variant interpretation, that can be used in conjunction with other well-established assays, to increase confidence in the accurate assessment of variant function and pathogenicity.
Collapse
Affiliation(s)
- Payel Ganguly
- Department of Cellular and Physiological Sciences, Life Sciences Institute, Djavad Mowafaghian Centre for Brain Health, University of British Columbia, Vancouver, British Columbia, Canada
| | - Landiso Madonsela
- Department of Molecular Biology and Biochemistry, Centre for Cell Biology, Development and Disease, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Jesse T. Chao
- Department of Cellular and Physiological Sciences, Life Sciences Institute, Djavad Mowafaghian Centre for Brain Health, University of British Columbia, Vancouver, British Columbia, Canada
| | - Christopher J. R. Loewen
- Department of Cellular and Physiological Sciences, Life Sciences Institute, Djavad Mowafaghian Centre for Brain Health, University of British Columbia, Vancouver, British Columbia, Canada
| | - Timothy P. O’Connor
- Department of Cellular and Physiological Sciences, Life Sciences Institute, Djavad Mowafaghian Centre for Brain Health, University of British Columbia, Vancouver, British Columbia, Canada
| | - Esther M. Verheyen
- Department of Molecular Biology and Biochemistry, Centre for Cell Biology, Development and Disease, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Douglas W. Allan
- Department of Cellular and Physiological Sciences, Life Sciences Institute, Djavad Mowafaghian Centre for Brain Health, University of British Columbia, Vancouver, British Columbia, Canada
| |
Collapse
|
24
|
Ghadie M, Xia Y. Mutation Edgotype Drives Fitness Effect in Human. FRONTIERS IN BIOINFORMATICS 2021; 1:690769. [PMID: 36303776 PMCID: PMC9581054 DOI: 10.3389/fbinf.2021.690769] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2021] [Accepted: 08/18/2021] [Indexed: 11/24/2022] Open
Abstract
Missense mutations are known to perturb protein-protein interaction networks (known as interactome networks) in different ways. However, it remains unknown how different interactome perturbation patterns (“edgotypes”) impact organismal fitness. Here, we estimate the fitness effect of missense mutations with different interactome perturbation patterns in human, by calculating the fractions of neutral and deleterious mutations that do not disrupt PPIs (“quasi-wild-type”), or disrupt PPIs either by disrupting the binding interface (“edgetic”) or by disrupting overall protein stability (“quasi-null”). We first map pathogenic mutations and common non-pathogenic mutations onto homology-based three-dimensional structural models of proteins and protein-protein interactions in human. Next, we perform structure-based calculations to classify each mutation as either quasi-wild-type, edgetic, or quasi-null. Using our predicted as well as experimentally determined interactome perturbation patterns, we estimate that >∼40% of quasi-wild-type mutations are effectively neutral and the remaining are mostly mildly deleterious, that >∼75% of edgetic mutations are only mildly deleterious, and that up to ∼75% of quasi-null mutations may be strongly detrimental. These estimates are the first such estimates of fitness effect for different network perturbation patterns in any interactome. Our results suggest that while mutations that do not disrupt the interactome tend to be effectively neutral, the majority of human PPIs are under strong purifying selection and the stability of most human proteins is essential to human life.
Collapse
|
25
|
Qureshi S, Bibi N, Ahmed J, Khan MJ. Computational screening of pathogenic non-synonymous SNPs of the human TEX11 gene and their structural and functional consequences. Meta Gene 2021. [DOI: 10.1016/j.mgene.2021.100874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
|
26
|
Chao JT, Roskelley CD, Loewen CJR. MAPS: machine-assisted phenotype scoring enables rapid functional assessment of genetic variants by high-content microscopy. BMC Bioinformatics 2021; 22:202. [PMID: 33879063 PMCID: PMC8056608 DOI: 10.1186/s12859-021-04117-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 04/02/2021] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND Genetic testing is widely used in evaluating a patient's predisposition to hereditary diseases. In the case of cancer, when a functionally impactful mutation (i.e. genetic variant) is identified in a disease-relevant gene, the patient is at elevated risk of developing a lesion in their lifetime. Unfortunately, as the rate and coverage of genetic testing has accelerated, our ability to assess the functional status of new variants has fallen behind. Therefore, there is an urgent need for more practical, streamlined and cost-effective methods for classifying variants. RESULTS To directly address this issue, we designed a new approach that uses alterations in protein subcellular localization as a key indicator of loss of function. Thus, new variants can be rapidly functionalized using high-content microscopy (HCM). To facilitate the analysis of the large amounts of imaging data, we developed a new software toolkit, named MAPS for machine-assisted phenotype scoring, that utilizes deep learning to extract and classify cell-level features. MAPS helps users leverage cloud-based deep learning services that are easy to train and deploy to fit their specific experimental conditions. Model training is code-free and can be done with limited training images. Thus, MAPS allows cell biologists to easily incorporate deep learning into their image analysis pipeline. We demonstrated an effective variant functionalization workflow that integrates HCM and MAPS to assess missense variants of PTEN, a tumor suppressor that is frequently mutated in hereditary and somatic cancers. CONCLUSIONS This paper presents a new way to rapidly assess variant function using cloud deep learning. Since most tumor suppressors have well-defined subcellular localizations, our approach could be widely applied to functionalize variants of uncertain significance and help improve the utility of genetic testing.
Collapse
Affiliation(s)
- Jesse T Chao
- Department of Cellular and Physiological Sciences, Life Sciences Institute, University of British Columbia, Vancouver, V6T1Z3, Canada.
| | - Calvin D Roskelley
- Department of Cellular and Physiological Sciences, Life Sciences Institute, University of British Columbia, Vancouver, V6T1Z3, Canada
| | - Christopher J R Loewen
- Department of Cellular and Physiological Sciences, Life Sciences Institute, University of British Columbia, Vancouver, V6T1Z3, Canada
| |
Collapse
|
27
|
Turina P, Fariselli P, Capriotti E. ThermoScan: Semi-automatic Identification of Protein Stability Data From PubMed. Front Mol Biosci 2021; 8:620475. [PMID: 33842537 PMCID: PMC8027235 DOI: 10.3389/fmolb.2021.620475] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Accepted: 02/18/2021] [Indexed: 11/13/2022] Open
Abstract
During the last years, the increasing number of DNA sequencing and protein mutagenesis studies has generated a large amount of variation data published in the biomedical literature. The collection of such data has been essential for the development and assessment of tools predicting the impact of protein variants at functional and structural levels. Nevertheless, the collection of manually curated data from literature is a highly time consuming and costly process that requires domain experts. In particular, the development of methods for predicting the effect of amino acid variants on protein stability relies on the thermodynamic data extracted from literature. In the past, such data were deposited in the ProTherm database, which however is no longer maintained since 2013. For facilitating the collection of protein thermodynamic data from literature, we developed the semi-automatic tool ThermoScan. ThermoScan is a text mining approach for the identification of relevant thermodynamic data on protein stability from full-text articles. The method relies on a regular expression searching for groups of words, including the most common conceptual words appearing in experimental studies on protein stability, several thermodynamic variables, and their units of measure. ThermoScan analyzes full-text articles from the PubMed Central Open Access subset and calculates an empiric score that allows the identification of manuscripts reporting thermodynamic data on protein stability. The method was optimized on a set of publications included in the ProTherm database, and tested on a new curated set of articles, manually selected for presence of thermodynamic data. The results show that ThermoScan returns accurate predictions and outperforms recently developed text-mining algorithms based on the analysis of publication abstracts. Availability: The ThermoScan server is freely accessible online at https://folding.biofold.org/thermoscan. The ThermoScan python code and the Google Chrome extension for submitting visualized PMC web pages to the ThermoScan server are available at https://github.com/biofold/ThermoScan.
Collapse
Affiliation(s)
- Paola Turina
- Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Bologna, Italy
| | - Piero Fariselli
- Department of Medical Sciences, University of Torino, Torino, Italy
| | - Emidio Capriotti
- Department of Pharmacy and Biotechnology (FaBiT), University of Bologna, Bologna, Italy
| |
Collapse
|
28
|
Pereira GRC, Vieira BDAA, De Mesquita JF. Comprehensive in silico analysis and molecular dynamics of the superoxide dismutase 1 (SOD1) variants related to amyotrophic lateral sclerosis. PLoS One 2021; 16:e0247841. [PMID: 33630959 PMCID: PMC7906464 DOI: 10.1371/journal.pone.0247841] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Accepted: 02/15/2021] [Indexed: 12/29/2022] Open
Abstract
Amyotrophic Lateral Sclerosis (ALS) is the most frequent motor neuron disorder, with a significant social and economic burden. ALS remains incurable, and the only drugs approved for its treatments confers a survival benefit of a few months for the patients. Missense mutations in superoxide dismutase 1 (SOD1), a major cytoplasmic antioxidant enzyme, has been associated with ALS development, accounting for 23% of its familial cases and 7% of all sporadic cases. This work aims to characterize in silico the structural and functional effects of SOD1 protein variants. Missense mutations in SOD1 were compiled from the literature and databases. Twelve algorithms were used to predict the functional and stability effects of these mutations. ConSurf was used to estimate the evolutionary conservation of SOD1 amino-acids. GROMACS was used to perform molecular dynamics (MD) simulations of SOD1 wild-type and variants A4V, D90A, H46R, and I113T, which account for approximately half of all ALS-SOD1 cases in the United States, Europe, Japan, and United Kingdom, respectively. 233 missense mutations in SOD1 protein were compiled from the databases and literature consulted. The predictive analyses pointed to an elevated rate of deleterious and destabilizing predictions for the analyzed variants, indicating their harmful effects. The ConSurf analysis suggested that mutations in SOD1 mainly affect conserved and possibly functionally essential amino acids. The MD analyses pointed to flexibility and essential dynamics alterations at the electrostatic and metal-binding loops of variants A4V, D90A, H46R, and I113T that could lead to aberrant interactions triggering toxic protein aggregation. These alterations may have harmful implications for SOD1 and explain their association with ALS. Understanding the effects of SOD1 mutations on protein structure and function facilitates the design of further experiments and provides relevant information on the molecular mechanism of pathology, which may contribute to improvements in existing treatments for ALS.
Collapse
Affiliation(s)
- Gabriel Rodrigues Coutinho Pereira
- Department of Genetics and Molecular Biology, Bioinformatics and Computational Biology Laboratory, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
| | | | - Joelma Freire De Mesquita
- Department of Genetics and Molecular Biology, Bioinformatics and Computational Biology Laboratory, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
- * E-mail:
| |
Collapse
|
29
|
Shulman C, Liang E, Kamura M, Udwan K, Yao T, Cattran D, Reich H, Hladunewich M, Pei Y, Savige J, Paterson AD, Suico MA, Kai H, Barua M. Type IV Collagen Variants in CKD: Performance of Computational Predictions for Identifying Pathogenic Variants. Kidney Med 2021; 3:257-266. [PMID: 33851121 PMCID: PMC8039416 DOI: 10.1016/j.xkme.2020.12.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
Rationale & Objective Pathogenic variants in type IV collagen have been reported to account for a significant proportion of chronic kidney disease. Accordingly, genetic testing is increasingly used to diagnose kidney diseases, but testing also may reveal rare missense variants that are of uncertain clinical significance. To aid in interpretation, computational prediction (called in silico) programs may be used to predict whether a variant is clinically important. We evaluate the performance of in silico programs for COL4A3/A4/A5 variants. Study Design, Setting, & Participants Rare missense variants in COL4A3/A4/A5 were identified in disease cohorts, including a local focal segmental glomerulosclerosis (FSGS) cohort and publicly available disease databases, in which they are categorized as pathogenic or benign based on clinical criteria. Tests Compared & Outcomes All rare missense variants identified in the 4 disease cohorts were subjected to in silico predictions using 12 different programs. Comparisons between the predictions were compared with: (1) variant classification (pathogenic or benign) in the cohorts and (2) functional characterization in a randomly selected smaller number (17) of pathogenic or uncertain significance variants obtained from the local FSGS cohort. Results In silico predictions correctly classified 75% to 97% of pathogenic and 57% to 100% of benign COL4A3/A4/A5 variants in public disease databases. The congruency of in silico predictions was similar for variants categorized as pathogenic and benign, with the exception of benign COL4A5 variants, in which disease effects were overestimated. By contrast, in silico predictions and functional characterization classified all 9 pathogenic COL4A3/A4/A5 variants correctly that were obtained from a local FSGS cohort. However, these programs also overestimated the effects of genomic variants of uncertain significance when compared with functional characterization. Each of the 12 in silico programs used yielded similar results. Limitations Overestimation of in silico program sensitivity given that they may have been used in the categorization of variants labeled as pathogenic in disease repositories. Conclusions Our results suggest that in silico predictions are sensitive but not specific to assign COL4A3/A4/A5 variant pathogenicity, with misclassification of benign variants and variants of uncertain significance. Thus, we do not recommend in silico programs but instead recommend pursuing more objective levels of evidence suggested by medical genetics guidelines.
Collapse
Affiliation(s)
- Cole Shulman
- Division of Nephrology, University Health Network, Toronto, Canada.,Toronto General Hospital Research Institute, Toronto General Hospital, Toronto, Canada
| | - Emerald Liang
- Division of Nephrology, University Health Network, Toronto, Canada.,Toronto General Hospital Research Institute, Toronto General Hospital, Toronto, Canada
| | - Misato Kamura
- Department of Molecular Medicine, Graduate School of Pharmaceutical Science, Kumamoto University, Kumamoto, Japan
| | - Khalil Udwan
- Division of Nephrology, University Health Network, Toronto, Canada.,Toronto General Hospital Research Institute, Toronto General Hospital, Toronto, Canada
| | - Tony Yao
- Division of Nephrology, University Health Network, Toronto, Canada.,Toronto General Hospital Research Institute, Toronto General Hospital, Toronto, Canada
| | - Daniel Cattran
- Division of Nephrology, University Health Network, Toronto, Canada.,Toronto General Hospital Research Institute, Toronto General Hospital, Toronto, Canada.,Institute of Medical Sciences, Toronto, Canada.,Department of Medicine, Toronto, Canada
| | - Heather Reich
- Division of Nephrology, University Health Network, Toronto, Canada.,Toronto General Hospital Research Institute, Toronto General Hospital, Toronto, Canada.,Institute of Medical Sciences, Toronto, Canada.,Department of Medicine, Toronto, Canada
| | - Michelle Hladunewich
- Division of Nephrology, University Health Network, Toronto, Canada.,Toronto General Hospital Research Institute, Toronto General Hospital, Toronto, Canada.,Institute of Medical Sciences, Toronto, Canada.,Department of Medicine, Toronto, Canada
| | - York Pei
- Division of Nephrology, University Health Network, Toronto, Canada.,Toronto General Hospital Research Institute, Toronto General Hospital, Toronto, Canada.,Institute of Medical Sciences, Toronto, Canada.,Department of Medicine, Toronto, Canada
| | - Judy Savige
- University of Melbourne, Melbourne, Australia
| | - Andrew D Paterson
- Division of Epidemiology and Biostatistics, Dalla Lana School of Public Health, Toronto, Canada.,Genetics and Genome Biology, Research Institute at Hospital for Sick Children, Toronto, Canada
| | - Mary Ann Suico
- Department of Molecular Medicine, Graduate School of Pharmaceutical Science, Kumamoto University, Kumamoto, Japan
| | - Hirofumi Kai
- Department of Molecular Medicine, Graduate School of Pharmaceutical Science, Kumamoto University, Kumamoto, Japan
| | - Moumita Barua
- Division of Nephrology, University Health Network, Toronto, Canada.,Toronto General Hospital Research Institute, Toronto General Hospital, Toronto, Canada.,Institute of Medical Sciences, Toronto, Canada.,Department of Medicine, Toronto, Canada
| |
Collapse
|
30
|
Villoutreix BO, Calvez V, Marcelin AG, Khatib AM. In Silico Investigation of the New UK (B.1.1.7) and South African (501Y.V2) SARS-CoV-2 Variants with a Focus at the ACE2-Spike RBD Interface. Int J Mol Sci 2021; 22:1695. [PMID: 33567580 PMCID: PMC7915722 DOI: 10.3390/ijms22041695] [Citation(s) in RCA: 62] [Impact Index Per Article: 20.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2021] [Revised: 02/03/2021] [Accepted: 02/04/2021] [Indexed: 12/24/2022] Open
Abstract
SARS-CoV-2 exploits angiotensin-converting enzyme 2 (ACE2) as a receptor to invade cells. It has been reported that the UK and South African strains may have higher transmission capabilities, eventually in part due to amino acid substitutions on the SARS-CoV-2 Spike protein. The pathogenicity seems modified but is still under investigation. Here we used the experimental structure of the Spike RBD domain co-crystallized with part of the ACE2 receptor, several in silico methods and numerous experimental data reported recently to analyze the possible impacts of three amino acid replacements (Spike K417N, E484K, N501Y) with regard to ACE2 binding. We found that the N501Y replacement in this region of the interface (present in both the UK and South African strains) should be favorable for the interaction with ACE2, while the K417N and E484K substitutions (South African strain) would seem neutral or even unfavorable. It is unclear if the N501Y substitution in the South African strain could counterbalance the K417N and E484K Spike replacements with regard to ACE2 binding. Our finding suggests that the UK strain should have higher affinity toward ACE2 and therefore likely increased transmissibility and possibly pathogenicity. If indeed the South African strain has a high transmission level, this could be due to the N501Y replacement and/or to substitutions in regions located outside the direct Spike-ACE2 interface but not so much to the K417N and E484K replacements. Yet, it should be noted that amino acid changes at Spike position 484 can lead to viral escape from neutralizing antibodies. Further, these amino acid substitutions do not seem to induce major structural changes in this region of the Spike protein. This structure-function study allows us to rationalize some observations made for the UK strain but raises questions for the South African strain.
Collapse
Affiliation(s)
- Bruno O. Villoutreix
- Integrative Computational Pharmacology and Data Mining, INSERM UMR 1141, NeuroDiderot, Robert-Debré Hospital, 75019 Paris, France
| | - Vincent Calvez
- Sorbonne Université, INSERM 1136, Institut Pierre Louis d’Epidémiologie et de Santé Publique, AP-HP, Hôpitaux Universitaires Pitié-Salpêtrière-Charles Foix, Laboratoire de Virologie, F75013 Paris, France; (V.C.); (A.-G.M.)
| | - Anne-Geneviève Marcelin
- Sorbonne Université, INSERM 1136, Institut Pierre Louis d’Epidémiologie et de Santé Publique, AP-HP, Hôpitaux Universitaires Pitié-Salpêtrière-Charles Foix, Laboratoire de Virologie, F75013 Paris, France; (V.C.); (A.-G.M.)
| | - Abdel-Majid Khatib
- Université de Bordeaux, INSERM, LAMC, U1029, F-33600 Pessac, France
- Institut Bergonié, 33000 Bordeaux, France
| |
Collapse
|
31
|
SAAFEC-SEQ: A Sequence-Based Method for Predicting the Effect of Single Point Mutations on Protein Thermodynamic Stability. Int J Mol Sci 2021; 22:ijms22020606. [PMID: 33435356 PMCID: PMC7827184 DOI: 10.3390/ijms22020606] [Citation(s) in RCA: 58] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2020] [Revised: 12/23/2020] [Accepted: 01/06/2021] [Indexed: 01/04/2023] Open
Abstract
Modeling the effect of mutations on protein thermodynamics stability is useful for protein engineering and understanding molecular mechanisms of disease-causing variants. Here, we report a new development of the SAAFEC method, the SAAFEC-SEQ, which is a gradient boosting decision tree machine learning method to predict the change of the folding free energy caused by amino acid substitutions. The method does not require the 3D structure of the corresponding protein, but only its sequence and, thus, can be applied on genome-scale investigations where structural information is very sparse. SAAFEC-SEQ uses physicochemical properties, sequence features, and evolutionary information features to make the predictions. It is shown to consistently outperform all existing state-of-the-art sequence-based methods in both the Pearson correlation coefficient and root-mean-squared-error parameters as benchmarked on several independent datasets. The SAAFEC-SEQ has been implemented into a web server and is available as stand-alone code that can be downloaded and embedded into other researchers’ code.
Collapse
|
32
|
Soltani I, Bahia W, Radhouani A, Mahdhi A, Ferchichi S, Almawi WY. Comprehensive in-silico analysis of damage associated SNPs in hOCT1 affecting Imatinib response in chronic myeloid leukemia. Genomics 2020; 113:755-766. [PMID: 33075481 DOI: 10.1016/j.ygeno.2020.10.007] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2020] [Revised: 09/18/2020] [Accepted: 10/05/2020] [Indexed: 12/22/2022]
Abstract
Non-synonymous single nucleotide polymorphisms (nsSNPs) in hOCT1 (encoded by SLC22A1 gene) are expected to affect Imatinib uptake in chronic myeloid leukemia (CML). In this study, sequence homology-based genetic analysis of a set of 270 coding SNPs identified 18 nsSNPs to be putatively damaging/deleterious using eight different algorithms. Subsequently, based on conservation of amino acid residues, stability analysis, posttranscriptional modifications, and solvent accessibility analysis, the possible structural-functional relationship was established for high-confidence nsSNPs. Furthermore, based on the modeling results, some dissimilarities of mutant type amino acids from wild-type amino acids such as size, charge, interaction and hydrophobicity were revealed. Three highly deleterious mutations consisting of P283L, G401S and R402G in SLC22A1 gene that may alter the protein structure, function and stability were identified. These results provide a filtered data to explore the effect of uncharacterized nsSNP and find their association with Imatinib resistance in CML.
Collapse
Affiliation(s)
- Ismael Soltani
- Molecular and Cellular Hematology Laboratory, Institut Pasteur de Tunis, Université Tunis El Manar, Tunis, Tunisia.
| | - Wael Bahia
- Research Unit of Clinical and Molecular Biology, Department of Biochemistry, Faculty of Pharmacy of Monastir, University of Monastir, Tunisia
| | - Assala Radhouani
- Faculty of Medicine, University of Tunis El Manar, Tunis, Tunisia
| | - Abdelkarim Mahdhi
- Laboratory of Analysis, Treatment and Valorization of Pollutants of the Environment and Products, Faculty of Pharmacy, University of Monastir, Tunisia
| | - Salima Ferchichi
- Research Unit of Clinical and Molecular Biology, Department of Biochemistry, Faculty of Pharmacy of Monastir, University of Monastir, Tunisia
| | - Wassim Y Almawi
- Faculty of Sciences, El Manar University, Tunis, Tunisia; College of Health Sciences, Abu Dhabi University, Abu Dhabi, United Arab Emirates
| |
Collapse
|
33
|
An Ensemble Approach to Predict the Pathogenicity of Synonymous Variants. Genes (Basel) 2020; 11:genes11091102. [PMID: 32967157 PMCID: PMC7565489 DOI: 10.3390/genes11091102] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Revised: 09/08/2020] [Accepted: 09/17/2020] [Indexed: 12/18/2022] Open
Abstract
Single-nucleotide variants (SNVs) are a major form of genetic variation in the human genome that contribute to various disorders. There are two types of SNVs, namely non-synonymous (missense) variants (nsSNVs) and synonymous variants (sSNVs), predominantly involved in RNA processing or gene regulation. sSNVs, unlike missense or nsSNVs, do not alter the amino acid sequences, thereby making challenging candidates for downstream functional studies. Numerous computational methods have been developed to evaluate the clinical impact of nsSNVs, but very few methods are available for understanding the effects of sSNVs. For this analysis, we have downloaded sSNVs from the ClinVar database with various features such as conservation, DNA-RNA, and splicing properties. We performed feature selection and implemented an ensemble random forest (RF) classification algorithm to build a classifier to predict the pathogenicity of the sSNVs. We demonstrate that the ensemble predictor with selected features (20 features) enhances the classification of sSNVs into two categories, pathogenic and benign, with high accuracy (87%), precision (79%), and recall (91%). Furthermore, we used this prediction model to reclassify sSNVs with unknown clinical significance. Finally, the method is very robust and can be used to predict the effect of other unknown sSNVs.
Collapse
|
34
|
Koohyanizadeh F, Karaji AG, Falahi S, Rezaiemanesh A, Salari F. In silico prediction of deleterious single nucleotide polymorphisms in human interleukin 27 (IL-27) gene. Meta Gene 2020. [DOI: 10.1016/j.mgene.2020.100710] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
|
35
|
Havranek B, Islam SM. Prediction and evaluation of deleterious and disease causing non-synonymous SNPs (nsSNPs) in human NF2 gene responsible for neurofibromatosis type 2 (NF2). J Biomol Struct Dyn 2020; 39:7044-7055. [PMID: 32787631 DOI: 10.1080/07391102.2020.1805018] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The majority of genetic variations in the human genome that lead to variety of different diseases are caused by non-synonymous single nucleotide polymorphisms (nsSNPs). Neurofibromatosis type 2 (NF2) is a deadly disease caused by nsSNPs in the NF2 gene that encodes for a protein called merlin. This study used various in silico methods, SIFT, Polyphen-2, PhD-SNP and MutPred, to investigate the pathogenic effect of 14 nsSNPs in the merlin FERM domain. The G197C and L234R mutations were found to be two deleterious and disease mutations associated with the mild and severe forms of NF2, respectively. Molecular dynamics (MD) simulations were conducted to understand the stability, structure and dynamics of these mutations. Both mutant structures experienced larger flexibility compared to the wildtype. The L234R mutant suffered from more prominent structural instability, which may help to explain why it is associated with the more severe form of NF2. The intramolecular hydrogen bonding in L234R mutation decreased from the wildtype, while intermolecular hydrogen bonding of L234R mutation with solvent greatly increased. The native contacts were also found to be important. Protein-protein docking revealed that L234R mutation decreased the binding complementarity and binding affinity of LATS2 to merlin, which may have an impact on merlin's ability to regulate the Hippo signaling pathway. The calculated binding affinity of the LATS2 to L234R mutant and wildtype merlin protein is found to be 21.73 and -11 kcal/mol, respectively. The binding affinity of the wildtype merlin agreed very well with the experimental value, -8 kcal/mol.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Brandon Havranek
- Department of Chemistry, University of Illinois at Chicago, Chicago, IL, USA
| | - Shahidul M Islam
- Department of Chemistry, University of Illinois at Chicago, Chicago, IL, USA
| |
Collapse
|
36
|
Chao JT, Hollman R, Meyers WM, Meili F, Matreyek KA, Dean P, Fowler DM, Haas K, Roskelley CD, Loewen CJR. A Premalignant Cell-Based Model for Functionalization and Classification of PTEN Variants. Cancer Res 2020; 80:2775-2789. [PMID: 32366478 DOI: 10.1158/0008-5472.can-19-3278] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Revised: 12/16/2019] [Accepted: 04/23/2020] [Indexed: 11/16/2022]
Abstract
As sequencing becomes more economical, we are identifying sequence variations in the population faster than ever. For disease-associated genes, it is imperative that we differentiate a sequence variant as either benign or pathogenic, such that the appropriate therapeutic interventions or surveillance can be implemented. PTEN is a frequently mutated tumor suppressor that has been linked to the PTEN hamartoma tumor syndrome. Although the domain structure of PTEN and the functional impact of a number of its most common tumor-linked mutations have been characterized, there is a lack of information about many recently identified clinical variants. To address this challenge, we developed a cell-based assay that utilizes a premalignant phenotype of normal mammary epithelial cells lacking PTEN. We measured the ability of PTEN variants to rescue the spheroid formation phenotype of PTEN-/- MCF10A cells maintained in suspension. As proof of concept, we functionalized 47 missense variants using this assay, only 19 of which have clear classifications in ClinVar. We utilized a machine learning model trained with annotated genotypic data to classify variants as benign or pathogenic based on our functional scores. Our model predicted with high accuracy that loss of PTEN function was indicative of pathogenicity. We also determined that the pathogenicity of certain variants may have arisen from reduced stability of the protein product. Overall, this assay outperformed computational predictions, was scalable, and had a short run time, serving as an ideal alternative for annotating the clinical significance of cancer-associated PTEN variants. SIGNIFICANCE: Combined three-dimensional tumor spheroid modeling and machine learning classifies PTEN missense variants, over 70% of which are currently listed as variants of uncertain significance. GRAPHICAL ABSTRACT: http://cancerres.aacrjournals.org/content/canres/80/13/2775/F1.large.jpg.
Collapse
Affiliation(s)
- Jesse T Chao
- Department of Cellular and Physiological Sciences, Life Sciences Institute, University of British Columbia, Vancouver, Canada
| | - Rocio Hollman
- Department of Cellular and Physiological Sciences, Life Sciences Institute, University of British Columbia, Vancouver, Canada
| | - Warren M Meyers
- Department of Cellular and Physiological Sciences, Life Sciences Institute, University of British Columbia, Vancouver, Canada
| | - Fabian Meili
- Department of Cellular and Physiological Sciences, Life Sciences Institute, University of British Columbia, Vancouver, Canada
| | - Kenneth A Matreyek
- Department of Pathology, Case Western Reserve University School of Medicine, Cleveland, Ohio.,Department of Genome Sciences, University of Washington, Seattle, Washington
| | - Pamela Dean
- Department of Cellular and Physiological Sciences, Life Sciences Institute, University of British Columbia, Vancouver, Canada
| | - Douglas M Fowler
- Department of Genome Sciences, University of Washington, Seattle, Washington.,Department of Bioengineering, University of Washington, Seattle, Washington
| | - Kurt Haas
- Department of Cellular and Physiological Sciences, Life Sciences Institute, University of British Columbia, Vancouver, Canada
| | - Calvin D Roskelley
- Department of Cellular and Physiological Sciences, Life Sciences Institute, University of British Columbia, Vancouver, Canada
| | - Christopher J R Loewen
- Department of Cellular and Physiological Sciences, Life Sciences Institute, University of British Columbia, Vancouver, Canada.
| |
Collapse
|
37
|
Gyulkhandanyan A, Rezaie AR, Roumenina L, Lagarde N, Fremeaux-Bacchi V, Miteva MA, Villoutreix BO. Analysis of protein missense alterations by combining sequence- and structure-based methods. Mol Genet Genomic Med 2020; 8:e1166. [PMID: 32096919 PMCID: PMC7196459 DOI: 10.1002/mgg3.1166] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2019] [Revised: 01/20/2020] [Accepted: 01/27/2020] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Different types of in silico approaches can be used to predict the phenotypic consequence of missense variants. Such algorithms are often categorized as sequence based or structure based, when they necessitate 3D structural information. In addition, many other in silico tools, not dedicated to the analysis of variants, can be used to gain additional insights about the possible mechanisms at play. METHODS Here we applied different computational approaches to a set of 20 known missense variants present on different proteins (CYP, complement factor B, antithrombin and blood coagulation factor VIII). The tools that were used include fast computational approaches and web servers such as PolyPhen-2, PopMusic, DUET, MaestroWeb, SAAFEC, Missense3D, VarSite, FlexPred, PredyFlexy, Clustal Omega, meta-PPISP, FTMap, ClusPro, pyDock, PPM, RING, Cytoscape, and ChannelsDB. RESULTS We observe some conflicting results among the methods but, most of the time, the combination of several engines helped to clarify the potential impacts of the amino acid substitutions. CONCLUSION Combining different computational approaches including some that were not developed to investigate missense variants help to predict the possible impact of the amino acid substitutions. Yet, when the modified residues are involved in a salt-bridge, the tools tend to fail, even when the analysis is performed in 3D. Thus, interactive structural analysis with molecular graphics packages such as Chimera or PyMol or others are still needed to clarify automatic prediction.
Collapse
Affiliation(s)
- Aram Gyulkhandanyan
- INSERM U973, Laboratory MTi, University Paris Diderot, Paris, France
- Laboratory SABNP, University of Evry, INSERM U1204, Université Paris-Saclay, Evry, France
| | - Alireza R Rezaie
- Cardiovascular Biology Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK, USA
- Department of Biochemistry and Molecular Biology, University of Oklahoma Health Sciences Center, Oklahoma City, OK, USA
| | - Lubka Roumenina
- INSERM, UMR_S 1138, Centre de Recherche des Cordeliers, Paris, France
- Sorbonne Universités, Paris, France
- Université Paris Descartes, Sorbonne Paris Cité, Paris, France
| | - Nathalie Lagarde
- INSERM U973, Laboratory MTi, University Paris Diderot, Paris, France
- Laboratoire GBCM, EA7528, Conservatoire national des arts et métiers, Hesam Université, Paris, France
| | - Veronique Fremeaux-Bacchi
- INSERM, UMR_S 1138, Centre de Recherche des Cordeliers, Paris, France
- Sorbonne Universités, Paris, France
- Université Paris Descartes, Sorbonne Paris Cité, Paris, France
- Assistance Publique-Hôpitaux de Paris, Service d'Immunologie Biologique, Hôpital Européen Georges Pompidou, Paris, France
| | - Maria A Miteva
- INSERM U973, Laboratory MTi, University Paris Diderot, Paris, France
- Inserm U1268 MCTR, CNRS UMR 8038 CiTCoM, Faculté de Pharmacie de Paris, Univ. De Paris, Paris, France
| | - Bruno O Villoutreix
- INSERM U973, Laboratory MTi, University Paris Diderot, Paris, France
- INSERM, Institut Pasteur de Lille, U1177-Drugs and Molecules for Living Systems, Université de Lille, Lille, France
| |
Collapse
|
38
|
Kumar R, Kumar R, Tanwar P, Rath GK, Kumar R, Kumar S, Dash N, Das P, Hussain S. Deciphering the impact of missense mutations on structure and dynamics of SMAD4 protein involved in pathogenesis of gall bladder cancer. J Biomol Struct Dyn 2020; 39:1940-1954. [PMID: 32151199 DOI: 10.1080/07391102.2020.1740789] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Gall bladder cancer (GBC) is the most common malignancy of biliary tract cancer associated with high mortality rate and poor prognosis due to lack of suitable biomarkers. In this study, we explored the structural and functional effects of different missense mutations occurs in SMAD4 that was associated with the development of GBC. We utilized in silico methods to predict the harmful effects of nonsynonymous missense mutations and monitored the stability of protein. We found that all mutations (D351N, G352E, R361C, R361H, E526Q) associated with SMAD4 were deleterious in nature resulting in the formation of deformed or unstable protein structure. Molecular dynamics simulation studies revealed how these mutations affect protein stability, structure, conformation and function. We observed, different mutants increase the compactness and rigidity of SMAD4 protein, alter secondary structure composition, decrease the surface area and protein-ligand interaction and affect its conformation. Findings of current work indicated that the analyzed mutations might affect the structure of protein and its caliber to interact with other molecules, which probably related to functional impairment of SMAD4 upon D351N, G352E, R361C, R361H, E526Q mutations and their involvement in cancer. Hence, the present study has significance of rational drug design and further increase our understanding of GBC development.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Rakesh Kumar
- Dr. B. R. A.-Institute Rotary Cancer Hospital, All India Institute of Medical Sciences, New Delhi, India
| | - Rahul Kumar
- Dr. B. R. A.-Institute Rotary Cancer Hospital, All India Institute of Medical Sciences, New Delhi, India
| | - Pranay Tanwar
- Dr. B. R. A.-Institute Rotary Cancer Hospital, All India Institute of Medical Sciences, New Delhi, India
| | - G K Rath
- Dr. B. R. A.-Institute Rotary Cancer Hospital, All India Institute of Medical Sciences, New Delhi, India
| | - Ritesh Kumar
- Dr. B. R. A.-Institute Rotary Cancer Hospital, All India Institute of Medical Sciences, New Delhi, India
| | - Sunil Kumar
- Dr. B. R. A.-Institute Rotary Cancer Hospital, All India Institute of Medical Sciences, New Delhi, India
| | - Nihar Dash
- Department of Gastrointestinal Surgery, All India Institute of Medical Sciences, New Delhi, India
| | - Prasenjit Das
- Department of Pathology, All India Institute of Medical Sciences, New Delhi, India
| | - Showket Hussain
- Division of Molecular Oncology, National Institute of Cancer Prevention and Research, Noida, India
| |
Collapse
|
39
|
Pereira GRC, Tavares GDB, de Freitas MC, De Mesquita JF. In silico analysis of the tryptophan hydroxylase 2 (TPH2) protein variants related to psychiatric disorders. PLoS One 2020; 15:e0229730. [PMID: 32119710 PMCID: PMC7051086 DOI: 10.1371/journal.pone.0229730] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Accepted: 02/12/2020] [Indexed: 11/19/2022] Open
Abstract
The tryptophan hydroxylase 2 (TPH2) enzyme catalyzes the first step of serotonin biosynthesis. Serotonin is known for its role in several homeostatic systems related to sleep, mood, and food intake. As the reaction catalyzed by TPH2 is the rate-limiting step of serotonin biosynthesis, mutations in TPH2 have been associated with several psychiatric disorders (PD). This work undertakes an in silico analysis of the effects of genetic mutations in the human TPH2 protein. Ten algorithms were used to predict the functional and stability effects of the TPH2 mutations. ConSurf was used to estimate the evolutionary conservation of TPH2 amino acids. GROMACS was used to perform molecular dynamics (MD) simulations of TPH2 WT and P260S, R303W, and R441H, which had already been associated with the development of PD. Forty-six TPH2 variants were compiled from the literature. Among the analyzed variants, those occurring at the catalytic domain were shown to be more damaging to protein structure and function. The ConSurf analysis indicated that the mutations affecting the catalytic domain were also more conserved throughout evolution. The variants S364K and S383F were predicted to be deleterious by all the functional algorithms used and occurred at conserved positions, suggesting that they might be deleterious. The MD analyses indicate that the mutations P206S, R303W, and R441H affect TPH2 flexibility and essential mobility at the catalytic and oligomerization domains. The variants P206S, R303W, and R441H also exhibited alterations in dimer binding affinity and stability throughout the simulations. Thus, these mutations may impair TPH2 functional interactions and, consequently, its function, leading to the development of PD. Furthermore, we developed a database, SNPMOL (http://www.snpmol.org/), containing the results presented in this paper. Understanding the effects of TPH2 mutations on protein structure and function may lead to improvements in existing treatments for PD and facilitate the design of further experiments.
Collapse
Affiliation(s)
- Gabriel Rodrigues Coutinho Pereira
- Bioinformatics and Computational Biology Laboratory, Department of Genetics and Molecular Biology, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
| | - Gustavo Duarte Bocayuva Tavares
- Bioinformatics and Computational Biology Laboratory, Department of Genetics and Molecular Biology, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
| | - Marta Costa de Freitas
- Bioinformatics and Computational Biology Laboratory, Department of Genetics and Molecular Biology, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
| | - Joelma Freire De Mesquita
- Bioinformatics and Computational Biology Laboratory, Department of Genetics and Molecular Biology, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
| |
Collapse
|
40
|
Angeli D, Salvi S, Tedaldi G. Genetic Predisposition to Breast and Ovarian Cancers: How Many and Which Genes to Test? Int J Mol Sci 2020; 21:E1128. [PMID: 32046255 PMCID: PMC7038038 DOI: 10.3390/ijms21031128] [Citation(s) in RCA: 56] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2019] [Revised: 02/03/2020] [Accepted: 02/05/2020] [Indexed: 12/19/2022] Open
Abstract
Breast and ovarian cancers are some of the most common tumors in females, and the genetic predisposition is emerging as one of the key risk factors in the development of these two malignancies. BRCA1 and BRCA2 are the best-known genes associated with hereditary breast and ovarian cancer. However, recent advances in molecular techniques, Next-Generation Sequencing in particular, have led to the identification of many new genes involved in the predisposition to breast and/or ovarian cancer, with different penetrance estimates. TP53, PTEN, STK11, and CDH1 have been identified as high penetrance genes for the risk of breast/ovarian cancers. Besides them, PALB2, BRIP1, ATM, CHEK2, BARD1, NBN, NF1, RAD51C, RAD51D and mismatch repair genes have been recognized as moderate and low penetrance genes, along with other genes encoding proteins involved in the same pathways, possibly associated with breast/ovarian cancer risk. In this review, we summarize the past and more recent findings in the field of cancer predisposition genes, with insights into the role of the encoded proteins and the associated genetic disorders. Furthermore, we discuss the possible clinical utility of genetic testing in terms of prevention protocols and therapeutic approaches.
Collapse
Affiliation(s)
- Davide Angeli
- Biostatistics and Clinical Trials Unit, Istituto Scientifico Romagnolo per lo Studio e la Cura dei Tumori (IRST) IRCCS, 47014 Meldola, Italy;
| | - Samanta Salvi
- Biosciences Laboratory, Istituto Scientifico Romagnolo per lo Studio e la Cura dei Tumori (IRST) IRCCS, 47014 Meldola, Italy;
| | - Gianluca Tedaldi
- Biosciences Laboratory, Istituto Scientifico Romagnolo per lo Studio e la Cura dei Tumori (IRST) IRCCS, 47014 Meldola, Italy;
| |
Collapse
|
41
|
Ganakammal SR, Alexov E. Evaluation of performance of leading algorithms for variant pathogenicity predictions and designing a combinatory predictor method: application to Rett syndrome variants. PeerJ 2019; 7:e8106. [PMID: 31799076 PMCID: PMC6884988 DOI: 10.7717/peerj.8106] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2019] [Accepted: 10/27/2019] [Indexed: 12/13/2022] Open
Abstract
Background Genomics diagnostic tests are done for a wide spectrum of complex genetics conditions such as autism and cancer. The growth of technology has not only aided in successfully decoding the genetic variants that causes or trigger these disorders. However, interpretation of these variants is not a trivial task even at a level of distinguish pathogenic vs benign variants. Methods We used the clinically significant variants from ClinVar database to evaluate the performance of 14 most popular in-silico predictors using supervised learning methods. We implemented a feature selection and random forest classification algorithm to identify the best combination of predictors to evaluate the pathogenicity of a variant. Finally, we have also utilized this combination of predictors to reclassify the variants of unknown significance in MeCP2 gene that are associated with the Rett syndrome. Results The results from analysis shows an optimized selection of prediction algorithm and developed a combinatory predictor method. Our combinatory approach of using both best performing independent and ensemble predictors reduces any algorithm biases in variant characterization. The reclassification of variants (such as VUS) in MECP2 gene associated with RETT syndrome suggest that the combinatory in-silico predictor approach had a higher success rate in categorizing their pathogenicity.
Collapse
Affiliation(s)
| | - Emil Alexov
- Department of Physics, Clemson University, Clemson, SC, USA
| |
Collapse
|
42
|
Pereira GRC, Tellini GHAS, De Mesquita JF. In silico analysis of PFN1 related to amyotrophic lateral sclerosis. PLoS One 2019; 14:e0215723. [PMID: 31216283 PMCID: PMC6583998 DOI: 10.1371/journal.pone.0215723] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2018] [Accepted: 04/09/2019] [Indexed: 12/11/2022] Open
Abstract
Profilin 1 (PFN1) protein plays key roles in neuronal growth and differentiation, membrane trafficking, and regulation of the actin cytoskeleton. Four natural variants of PFN1 were described as related to ALS, the most common adult-onset motor neuron disorder. However, the pathological mechanism of PFN1 in ALS is not yet completely understood. The goal of this work is to thoroughly analyze the effects of the ALS-related mutations on PFN1 structure and function using computational simulations. Here, PhD-SNP, PMUT, PolyPhen-2, SIFT, SNAP, SNPS&GO, SAAP, nsSNPAnalyzer, SNPeffect4.0 and I-Mutant2.0 were used to predict the functional and stability effects of PFN1 mutations. ConSurf was used for the evolutionary conservation analysis, and GROMACS was used to perform the MD simulations. The mutations C71G, M114T, and G118V, but not E117G, were predicted as deleterious by most of the functional prediction algorithms that were used. The stability prediction indicated that the ALS-related mutations could destabilize PFN1. The ConSurf analysis indicated that the mutation C71G, M114T, E117G, and G118V occur in highly conserved positions. The MD results indicated that the studied mutations could affect the PFN1 flexibility at the actin and PLP-binding domains, and consequently, their intermolecular interactions. It may be therefore related to the functional impairment of PFN1 upon C71G, M114T, E117G and G118V mutations, and their involvement in ALS development. We also developed a database, SNPMOL (http://www.snpmol.org/), containing the results presented on this paper for biologists and clinicians to exploit PFN1 and its natural variants.
Collapse
Affiliation(s)
- Gabriel Rodrigues Coutinho Pereira
- Department of Genetics and Molecular Biology, Bioinformatics and Computational Biology Laboratory, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
| | - Giovanni Henrique Almeida Silva Tellini
- Department of Genetics and Molecular Biology, Bioinformatics and Computational Biology Laboratory, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
| | - Joelma Freire De Mesquita
- Department of Genetics and Molecular Biology, Bioinformatics and Computational Biology Laboratory, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
- * E-mail:
| |
Collapse
|
43
|
Masoodi TA, Shaik NA, Burhan S, Hasan Q, Shafi G, Talluri VR. Structural prediction, whole exome sequencing and molecular dynamics simulation confirms p.G118D somatic mutation of PIK3CA as functionally important in breast cancer patients. Comput Biol Chem 2019; 80:472-479. [PMID: 31174159 DOI: 10.1016/j.compbiolchem.2019.05.012] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2017] [Revised: 06/16/2018] [Accepted: 05/28/2019] [Indexed: 12/13/2022]
Abstract
To understand the structural and functional importance of PIK3CA somatic mutations, whole exome sequencing, molecular dynamics simulation techniques in combination with in silico prediction algorithms such as SIFT, PolyPhen, Provean and CADD were employed. Twenty out of eighty missense somatic mutations in PIK3CA gene were found to be pathogenic by all the four algorithms. Most recurrent mutations found were known hotspot PIK3CA mutations with known clinical significance like p.E545 K, p.E545A, p.E545 G and p.C420R. A missense mutation p.G118D was found to be recurrently mutated in 5 cases. Interestingly, this mutation was observed in one of the patients who underwent whole exome sequencing and was completely absent from the controls. To see the effect of this mutation on the structure of PIK3CA protein, molecular dynamics simulation was performed. By molecular dynamics approach, we have shown that p.G118D mutation deviated from the native structure which was supported by the decrease in the number of hydrogen bonds, difference in hydrogen bond distance and angle, difference in root mean square deviation between the native and the mutant structures.
Collapse
Affiliation(s)
- Tariq Ahmad Masoodi
- Department of Biotechnology, K L University, Guntur, Andhra Pradesh, India; Department of Biotechnology, Hyderabad Science Society, Hyderabad, 500004, India.
| | - Noor Ahmad Shaik
- Department of Biotechnology, Hyderabad Science Society, Hyderabad, 500004, India; Department of Genetic Medicine, Princess Al-Jawhara Al-Brahim Centre of Excellene in Research of Hereditary Disorders, Faculty of Medicine, King Abdulaziz University, Jeddah, Saudi Arabia.
| | - Syed Burhan
- Department of Oncology, Sher-i-Kashmir Institute of Medical Sciences, Srinagar, India
| | - Qurratulain Hasan
- Department of Biotechnology, Hyderabad Science Society, Hyderabad, 500004, India
| | | | - Venkateswara Rao Talluri
- Department of Biotechnology, K L University, Guntur, Andhra Pradesh, India; Prof. TNA Innovation Center, Varsha Bioscience and Technology India Private Limited, Telangana, India.
| |
Collapse
|
44
|
Comparison of Predictive In Silico Tools on Missense Variants in GJB2, GJB6, and GJB3 Genes Associated with Autosomal Recessive Deafness 1A (DFNB1A). ScientificWorldJournal 2019; 2019:5198931. [PMID: 31015822 PMCID: PMC6446107 DOI: 10.1155/2019/5198931] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Revised: 01/25/2019] [Accepted: 02/03/2019] [Indexed: 01/30/2023] Open
Abstract
In silico predictive software allows assessing the effect of amino acid substitutions on the structure or function of a protein without conducting functional studies. The accuracy of in silico pathogenicity prediction tools has not been previously assessed for variants associated with autosomal recessive deafness 1A (DFNB1A). Here, we identify in silico tools with the most accurate clinical significance predictions for missense variants of the GJB2 (Cx26), GJB6 (Cx30), and GJB3 (Cx31) connexin genes associated with DFNB1A. To evaluate accuracy of selected in silico tools (SIFT, FATHMM, MutationAssessor, PolyPhen-2, CONDEL, MutationTaster, MutPred, Align GVGD, and PROVEAN), we tested nine missense variants with previously confirmed clinical significance in a large cohort of deaf patients and control groups from the Sakha Republic (Eastern Siberia, Russia): Сх26: p.Val27Ile, p.Met34Thr, p.Val37Ile, p.Leu90Pro, p.Glu114Gly, p.Thr123Asn, and p.Val153Ile; Cx30: p.Glu101Lys; Cx31: p.Ala194Thr. We compared the performance of the in silico tools (accuracy, sensitivity, and specificity) by using the missense variants in GJB2 (Cx26), GJB6 (Cx30), and GJB3 (Cx31) genes associated with DFNB1A. The correlation coefficient (r) and coefficient of the area under the Receiver Operating Characteristic (ROC) curve as alternative quality indicators of the tested programs were used. The resulting ROC curves demonstrated that the largest coefficient of the area under the curve was provided by three programs: SIFT (AUC = 0.833, p = 0.046), PROVEAN (AUC = 0.833, p = 0.046), and MutationAssessor (AUC = 0.833, p = 0.002). The most accurate predictions were given by two tested programs: SIFT and PROVEAN (Ac = 89%, Se = 67%, Sp = 100%, r = 0.75, AUC = 0.833). The results of this study may be applicable for analysis of novel missense variants of the GJB2 (Cx26), GJB6 (Cx30), and GJB3 (Cx31) connexin genes.
Collapse
|
45
|
Identification and structural characterization of deleterious non-synonymous single nucleotide polymorphisms in the human SKP2 gene. Comput Biol Chem 2019; 79:127-136. [PMID: 30802828 DOI: 10.1016/j.compbiolchem.2019.02.003] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2018] [Revised: 01/27/2019] [Accepted: 02/13/2019] [Indexed: 12/17/2022]
Abstract
In SCF (Skp, Cullin, F-box) ubiquitin-protein ligase complexes, S-phase kinase 2 (SKP2) is one of the major players of F-box family, that is responsible for the degradation of several important cell regulators and tumor suppressor proteins. Despite of having significant evidence for the role of SKP2 on tumorgenesis, there is a lack of available data regarding the effect of non-synonymous polymorphisms. In this communication, the structural and functional consequences of non-synonymous single nucleotide polymorphisms (nsSNPs) of SKP2 have been reported by employing various computational approaches and molecular dynamics simulation. Initially, several computational tools like SIFT, PolyPhen-2, PredictSNP, I-Mutant 2.0 and ConSurf have been implicated in this study to explore the damaging SNPs. In total of 172 nsSNPs, 5 nsSNPs were identified as deleterious and 3 of them were predicted to be decreased the stability of protein. Guided from ConSurf analysis, P101L (rs761253702) and Y346C (rs755010517) were categorized as the highly conserved and functional disrupting mutations. Therefore, these mutations were subjected to three dimensional model building and molecular dynamics simulation study for the detailed structural consequences upon the mutations. The study revealed that P101L and Y346C mutations increased the flexibility and changed the structural dynamics. As both these mutations are located in the most functional regions of SKP2 protein, these computational insights might be helpful to consider these nsSNPs for wet-lab confirmatory analysis as well as in rationalizing future population based studies and structure based drug design against SKP2.
Collapse
|
46
|
Tessier L, Côté O, Bienzle D. Sequence variant analysis of RNA sequences in severe equine asthma. PeerJ 2018; 6:e5759. [PMID: 30324028 PMCID: PMC6186407 DOI: 10.7717/peerj.5759] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2017] [Accepted: 09/15/2018] [Indexed: 12/13/2022] Open
Abstract
Background Severe equine asthma is a chronic inflammatory disease of the lung in horses similar to low-Th2 late-onset asthma in humans. This study aimed to determine the utility of RNA-Seq to call gene sequence variants, and to identify sequence variants of potential relevance to the pathogenesis of asthma. Methods RNA-Seq data were generated from endobronchial biopsies collected from six asthmatic and seven non-asthmatic horses before and after challenge (26 samples total). Sequences were aligned to the equine genome with Spliced Transcripts Alignment to Reference software. Read preparation for sequence variant calling was performed with Picard tools and Genome Analysis Toolkit (GATK). Sequence variants were called and filtered using GATK and Ensembl Variant Effect Predictor (VEP) tools, and two RNA-Seq predicted sequence variants were investigated with both PCR and Sanger sequencing. Supplementary analysis of novel sequence variant selection with VEP was based on a score of <0.01 predicted with Sorting Intolerant from Tolerant software, missense nature, location within the protein coding sequence and presence in all asthmatic individuals. For select variants, effect on protein function was assessed with Polymorphism Phenotyping 2 and screening for non-acceptable polymorphism 2 software. Sequences were aligned and 3D protein structures predicted with Geneious software. Difference in allele frequency between the groups was assessed using a Pearson’s Chi-squared test with Yates’ continuity correction, and difference in genotype frequency was calculated using the Fisher’s exact test for count data. Results RNA-Seq variant calling and filtering correctly identified substitution variants in PACRG and RTTN. Sanger sequencing confirmed that the PACRG substitution was appropriately identified in all 26 samples while the RTTN substitution was identified correctly in 24 of 26 samples. These variants of uncertain significance had substitutions that were predicted to result in loss of function and to be non-neutral. Amino acid substitutions projected no change of hydrophobicity and isoelectric point in PACRG, and a change in both for RTTN. For PACRG, no difference in allele frequency between the two groups was detected but a higher proportion of asthmatic horses had the altered RTTN allele compared to non-asthmatic animals. Discussion RNA-Seq was sensitive and specific for calling gene sequence variants in this disease model. Even moderate coverage (<10–20 counts per million) yielded correct identification in 92% of samples, suggesting RNA-Seq may be suitable to detect sequence variants in low coverage samples. The impact of amino acid alterations in PACRG and RTTN proteins, and possible association of the sequence variants with asthma, is of uncertain significance, but their role in ciliary function may be of future interest.
Collapse
Affiliation(s)
- Laurence Tessier
- Department of Pathobiology, University of Guelph, Guelph, ON, Canada.,BenchSci, Toronto, ON, Canada
| | - Olivier Côté
- Department of Pathobiology, University of Guelph, Guelph, ON, Canada.,BioAssay Works, Ijamsville, MD, USA
| | - Dorothee Bienzle
- Department of Pathobiology, University of Guelph, Guelph, ON, Canada
| |
Collapse
|
47
|
Yang Y, Xu C, Liu X, Xu C, Zhang Y, Shen L, Vihinen M, Shen B. NDDVD: an integrated and manually curated Neurodegenerative Diseases Variation Database. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018; 2018:4922736. [PMID: 29688368 PMCID: PMC5841369 DOI: 10.1093/database/bay018] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/20/2017] [Accepted: 01/31/2018] [Indexed: 12/21/2022]
Abstract
Neurodegenerative diseases (NDDs) are associated with genetic variations including point substitutions, copy number alterations, insertions and deletions. At present, a few genetic variation repositories for some individual NDDs have been created, however, these databases are needed to be integrated and expanded to all the NDDs for systems biological investigation. We here build a relational database termed as NDDVD to integrate all the variations of NDDs using Leiden Open Variation Database (LOVD) platform. The items in the NDDVD are collected manually from PubMed or extracted from the existed variation databases. The cross-disease database includes over 6374 genetic variations of 289 genes associated with 37 different NDDs. The patterns, conservations and biological functions for variations in different NDDs are statistically compared and a user-friendly interface is provided for NDDVD at: http://bioinf.suda.edu.cn/NDDvarbase/LOVDv.3.0. URL: http://bioinf.suda.edu.cn/NDDvarbase/LOVDv.3.0
Collapse
Affiliation(s)
- Yang Yang
- Center for Systems Biology, Soochow University, No1. Shizi Street, Suzhou, Jiangsu 215006, China.,School of Computer Science and Technology, Soochow University, No1. Shizi Street, Suzhou, Jiangsu 215006, China.,Department of Experimental Medical Science, Lund University, SE-221 84 Lund, Sweden and
| | - Chen Xu
- Center for Systems Biology, Soochow University, No1. Shizi Street, Suzhou, Jiangsu 215006, China
| | - Xingyun Liu
- Center for Systems Biology, Soochow University, No1. Shizi Street, Suzhou, Jiangsu 215006, China
| | - Chao Xu
- Center for Systems Biology, Soochow University, No1. Shizi Street, Suzhou, Jiangsu 215006, China
| | - Yuanyuan Zhang
- Center for Systems Biology, Soochow University, No1. Shizi Street, Suzhou, Jiangsu 215006, China
| | - Li Shen
- Center for Systems Biology, Soochow University, No1. Shizi Street, Suzhou, Jiangsu 215006, China.,Department of Genetics and Systems Biology Institute, Yale University School of Medicine, West Haven, CT 06516, USA
| | - Mauno Vihinen
- Department of Experimental Medical Science, Lund University, SE-221 84 Lund, Sweden and
| | - Bairong Shen
- Center for Systems Biology, Soochow University, No1. Shizi Street, Suzhou, Jiangsu 215006, China
| |
Collapse
|
48
|
In silico analysis of the functional non-synonymous single nucleotide polymorphisms in the human CYP27B1 gene. EGYPTIAN JOURNAL OF MEDICAL HUMAN GENETICS 2018. [DOI: 10.1016/j.ejmhg.2018.03.001] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
|
49
|
Pereira GRC, Da Silva ANR, Do Nascimento SS, De Mesquita JF. In silico analysis and molecular dynamics simulation of human superoxide dismutase 3 (SOD3) genetic variants. J Cell Biochem 2018; 120:3583-3598. [DOI: 10.1002/jcb.27636] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2018] [Accepted: 08/16/2018] [Indexed: 01/05/2023]
Affiliation(s)
- G. R. C. Pereira
- Department of Genetics and Molecular Biology Federal University of the State of Rio de Janeiro (UNIRIO) Rio de Janeiro Brazil
| | - A. N. R. Da Silva
- Department of Genetics and Molecular Biology Federal University of the State of Rio de Janeiro (UNIRIO) Rio de Janeiro Brazil
| | - S. S. Do Nascimento
- Department of Genetics and Molecular Biology Federal University of the State of Rio de Janeiro (UNIRIO) Rio de Janeiro Brazil
| | - J. F. De Mesquita
- Department of Genetics and Molecular Biology Federal University of the State of Rio de Janeiro (UNIRIO) Rio de Janeiro Brazil
| |
Collapse
|
50
|
Sedaghat A, Zamani M, Jahanshahi A, Ghaderian SB, Shariati G, Saberi A, Hamid M, Aminzadeh M, Galehdari H. Frequent novel mutations are causative for maple syrup urine disease from Southwest Iran. Meta Gene 2018. [DOI: 10.1016/j.mgene.2018.01.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022] Open
|