1
|
Campitelli P, Ross D, Swint-Kruse L, Ozkan SB. Dynamics-based protein network features accurately discriminate neutral and rheostat positions. Biophys J 2024:S0006-3495(24)00625-8. [PMID: 39277794 DOI: 10.1016/j.bpj.2024.09.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Revised: 07/03/2024] [Accepted: 09/11/2024] [Indexed: 09/17/2024] Open
Abstract
In some proteins, a unique class of nonconserved positions is characterized by their ability to generate diverse functional outcomes through single amino acid substitutions. Due to their ability to tune protein function, accurately identifying such "rheostat" positions is crucial for protein design, for understanding the impact of mutations observed in humans, and for predicting the evolution of pathogen drug resistance. However, identifying rheostat positions has been challenging, due-in part-to the absence of a clear structural relationship with binding sites. In this study, experimental data from our previous study of the Escherichia coli lactose repressor protein (LacI) was used to identify rheostat positions for which mutations tune in vivo EC50 for the allosteric ligand "IPTG." We next used the rheostat assignments to test the hypothesis that rheostat positions have unique dynamic features that will enable their identification. To that end, we integrated all-atom molecular dynamics simulations with perturbation residue response analysis. Results first revealed distinct dynamic behavior in IPTG-bound LacI compared with apo LacI, which was consistent with IPTG's role as an allosteric inducer. Next, we used a variety of dynamic features to build a classification model that discriminates experimentally characterized rheostat positions in LacI from positions with other types of substitution outcomes. In parallel, we built a second classifier model based on the 3D structural "static" network features of LacI. In comparative studies, the dynamic model better identified rheostat positions that were >8 Å from the binding site. In summary, our study provides insights into the dynamic characteristics of rheostat positions and suggests that models built on dynamic features may be useful for predicting the locations of rheostat positions in a wide range of proteins.
Collapse
Affiliation(s)
- P Campitelli
- Department of Physics, Center for Biological Physics, Arizona State University, Tempe, Arizona
| | - D Ross
- Biosystems and Biomaterials Division, Material Measurement Laboratory, National Institute of Standards and Technology, Gaithersburg, Maryland
| | - L Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas.
| | - S B Ozkan
- Department of Physics, Center for Biological Physics, Arizona State University, Tempe, Arizona.
| |
Collapse
|
2
|
Ahmad RM, Ali BR, Al-Jasmi F, Al Dhaheri N, Al Turki S, Kizhakkedath P, Mohamad MS. AI-derived comparative assessment of the performance of pathogenicity prediction tools on missense variants of breast cancer genes. Hum Genomics 2024; 18:99. [PMID: 39256852 PMCID: PMC11389290 DOI: 10.1186/s40246-024-00667-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Accepted: 08/22/2024] [Indexed: 09/12/2024] Open
Abstract
Single nucleotide variants (SNVs) can exert substantial and extremely variable impacts on various cellular functions, making accurate predictions of their consequences challenging, albeit crucial especially in clinical settings such as in oncology. Laboratory-based experimental methods for assessing these effects are time-consuming and often impractical, highlighting the importance of in-silico tools for variant impact prediction. However, the performance metrics of currently available tools on breast cancer missense variants from benchmarking databases have not been thoroughly investigated, creating a knowledge gap in the accurate prediction of pathogenicity. In this study, the benchmarking datasets ClinVar and HGMD were used to evaluate 21 Artificial Intelligence (AI)-derived in-silico tools. Missense variants in breast cancer genes were extracted from ClinVar and HGMD professional v2023.1. The HGMD dataset focused on pathogenic variants only, to ensure balance, benign variants for the same genes were included from the ClinVar database. Interestingly, our analysis of both datasets revealed variants across genes with varying penetrance levels like low and moderate in addition to high, reinforcing the value of disease-specific tools. The top-performing tools on ClinVar dataset identified were MutPred (Accuracy = 0.73), Meta-RNN (Accuracy = 0.72), ClinPred (Accuracy = 0.71), Meta-SVM, REVEL, and Fathmm-XF (Accuracy = 0.70). While on HGMD dataset they were ClinPred (Accuracy = 0.72), MetaRNN (Accuracy = 0.71), CADD (Accuracy = 0.69), Fathmm-MKL (Accuracy = 0.68), and Fathmm-XF (Accuracy = 0.67). These findings offer clinicians and researchers valuable insights for selecting, improving, and developing effective in-silico tools for breast cancer pathogenicity prediction. Bridging this knowledge gap contributes to advancing precision medicine and enhancing diagnostic and therapeutic approaches for breast cancer patients with potential implications for other conditions.
Collapse
Affiliation(s)
- Rahaf M Ahmad
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Tawam road, Al Maqam district, Al Ain, Abu Dhabi, United Arab Emirates
| | - Bassam R Ali
- Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Tawam road, Al Maqam district, Al Ain, Abu Dhabi, United Arab Emirates
| | - Fatma Al-Jasmi
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Tawam road, Al Maqam district, Al Ain, Abu Dhabi, United Arab Emirates
- Division of Metabolic Genetics, Department of Pediatrics, Tawam Hospital, Al Ain, United Arab Emirates
| | - Noura Al Dhaheri
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Tawam road, Al Maqam district, Al Ain, Abu Dhabi, United Arab Emirates
- Division of Metabolic Genetics, Department of Pediatrics, Tawam Hospital, Al Ain, United Arab Emirates
| | - Saeed Al Turki
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Tawam road, Al Maqam district, Al Ain, Abu Dhabi, United Arab Emirates
| | - Praseetha Kizhakkedath
- Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Tawam road, Al Maqam district, Al Ain, Abu Dhabi, United Arab Emirates
| | - Mohd Saberi Mohamad
- Health Data Science Lab, Department of Genetics and Genomics, College of Medical and Health Sciences, United Arab Emirates University, Tawam road, Al Maqam district, Al Ain, Abu Dhabi, United Arab Emirates.
- Center for Engineering Computational Intelligence, Faculty of Engineering and Technology, Multimedia University, Melaka, Malaysia.
| |
Collapse
|
3
|
Moldovean-Cioroianu NS. Reviewing the Structure-Function Paradigm in Polyglutamine Disorders: A Synergistic Perspective on Theoretical and Experimental Approaches. Int J Mol Sci 2024; 25:6789. [PMID: 38928495 PMCID: PMC11204371 DOI: 10.3390/ijms25126789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2024] [Revised: 06/13/2024] [Accepted: 06/17/2024] [Indexed: 06/28/2024] Open
Abstract
Polyglutamine (polyQ) disorders are a group of neurodegenerative diseases characterized by the excessive expansion of CAG (cytosine, adenine, guanine) repeats within host proteins. The quest to unravel the complex diseases mechanism has led researchers to adopt both theoretical and experimental methods, each offering unique insights into the underlying pathogenesis. This review emphasizes the significance of combining multiple approaches in the study of polyQ disorders, focusing on the structure-function correlations and the relevance of polyQ-related protein dynamics in neurodegeneration. By integrating computational/theoretical predictions with experimental observations, one can establish robust structure-function correlations, aiding in the identification of key molecular targets for therapeutic interventions. PolyQ proteins' dynamics, influenced by their length and interactions with other molecular partners, play a pivotal role in the polyQ-related pathogenic cascade. Moreover, conformational dynamics of polyQ proteins can trigger aggregation, leading to toxic assembles that hinder proper cellular homeostasis. Understanding these intricacies offers new avenues for therapeutic strategies by fine-tuning polyQ kinetics, in order to prevent and control disease progression. Last but not least, this review highlights the importance of integrating multidisciplinary efforts to advancing research in this field, bringing us closer to the ultimate goal of finding effective treatments against polyQ disorders.
Collapse
Affiliation(s)
- Nastasia Sanda Moldovean-Cioroianu
- Institute of Materials Science, Bioinspired Materials and Biosensor Technologies, Kiel University, Kaiserstraße 2, 24143 Kiel, Germany;
- Faculty of Physics, Babeș-Bolyai University, Kogălniceanu 1, RO-400084 Cluj-Napoca, Romania
| |
Collapse
|
4
|
Banerjee A, Mathew S, Naqvi MM, Yilmaz SZ, Zacharopoulou M, Doruker P, Kumita JR, Yang SH, Gur M, Itzhaki LS, Gordon R, Bahar I. Influence of point mutations on PR65 conformational adaptability: Insights from molecular simulations and nanoaperture optical tweezers. SCIENCE ADVANCES 2024; 10:eadn2208. [PMID: 38820156 PMCID: PMC11141623 DOI: 10.1126/sciadv.adn2208] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Accepted: 04/29/2024] [Indexed: 06/02/2024]
Abstract
PR65 is the HEAT repeat scaffold subunit of the heterotrimeric protein phosphatase 2A (PP2A) and an archetypal tandem repeat protein. Its conformational mechanics plays a crucial role in PP2A function by opening/closing substrate binding/catalysis interface. Using in silico saturation mutagenesis, we identified PR65 "hinge" residues whose substitutions could alter its conformational adaptability and thereby PP2A function, and selected six mutations that were verified to be expressed and soluble. Molecular simulations and nanoaperture optical tweezers revealed consistent results on the specific effects of the mutations on the structure and dynamics of PR65. Two mutants observed in simulations to stabilize extended/open conformations exhibited higher corner frequencies and lower translational scattering in experiments, indicating a shift toward extended conformations, whereas another displayed the opposite features, confirmed by both simulations and experiments. The study highlights the power of single-molecule nanoaperture-based tweezers integrated with in silico approaches for exploring the effect of mutations on protein structure and dynamics.
Collapse
Affiliation(s)
- Anupam Banerjee
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794, USA
| | - Samuel Mathew
- Department of Electrical and Computer Engineering, University of Victoria, Victoria V8P 5C2, Canada
| | - Mohsin M. Naqvi
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, UK
| | - Sema Z. Yilmaz
- Department of Mechanical Engineering, Istanbul Technical University, 34437 Istanbul, Turkey
| | - Maria Zacharopoulou
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, UK
| | - Pemra Doruker
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Janet R. Kumita
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, UK
| | - Shang-Hua Yang
- Department of Electrical Engineering, National Tsing Hua University, Hsinchu 30013, Taiwan
| | - Mert Gur
- Department of Mechanical Engineering, Istanbul Technical University, 34437 Istanbul, Turkey
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Laura S. Itzhaki
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, UK
| | - Reuven Gordon
- Department of Electrical and Computer Engineering, University of Victoria, Victoria V8P 5C2, Canada
| | - Ivet Bahar
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794, USA
- Department of Biochemistry and Cell Biology, School of Medicine, Stony Brook University, Stony Brook, NY 11794, USA
| |
Collapse
|
5
|
Zheng W. Predicting hotspots for disease-causing single nucleotide variants using sequences-based coevolution, network analysis, and machine learning. PLoS One 2024; 19:e0302504. [PMID: 38743747 PMCID: PMC11093321 DOI: 10.1371/journal.pone.0302504] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Accepted: 04/05/2024] [Indexed: 05/16/2024] Open
Abstract
To enable personalized medicine, it is important yet highly challenging to accurately predict disease-causing mutations in target proteins at high throughput. Previous computational methods have been developed using evolutionary information in combination with various biochemical and structural features of protein residues to discriminate neutral vs. deleterious mutations. However, the power of these methods is often limited because they either assume known protein structures or treat residues independently without fully considering their interactions. To address the above limitations, we build upon recent progress in machine learning, network analysis, and protein language models, and develop a sequences-based variant site prediction workflow based on the protein residue contact networks: 1. We employ and integrate various methods of building protein residue networks using state-of-the-art coevolution analysis tools (RaptorX, DeepMetaPSICOV, and SPOT-Contact) powered by deep learning. 2. We use machine learning algorithms (Random Forest, Gradient Boosting, and Extreme Gradient Boosting) to optimally combine 20 network centrality scores to jointly predict key residues as hot spots for disease mutations. 3. Using a dataset of 107 proteins rich in disease mutations, we rigorously evaluate the network scores individually and collectively (via machine learning). This work supports a promising strategy of combining an ensemble of network scores based on different coevolution analysis methods (and optionally predictive scores from other methods) via machine learning to predict hotspot sites of disease mutations, which will inform downstream applications of disease diagnosis and targeted drug design.
Collapse
Affiliation(s)
- Wenjun Zheng
- Department of Physics, State University of New York at Buffalo, Buffalo, NY, United States of America
| |
Collapse
|
6
|
Zhang G, Zhang C, Cai M, Luo C, Zhu F, Liang Z. FuncPhos-STR: An integrated deep neural network for functional phosphosite prediction based on AlphaFold protein structure and dynamics. Int J Biol Macromol 2024; 266:131180. [PMID: 38552697 DOI: 10.1016/j.ijbiomac.2024.131180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 03/19/2024] [Accepted: 03/26/2024] [Indexed: 04/01/2024]
Abstract
Phosphorylation modifications play important regulatory roles in most biological processes. However, the functional assignment for the vast majority of the identified phosphosites remains a major challenge. Here, we provide a deep learning framework named FuncPhos-STR as an online resource, for functional prediction and structural visualization of human proteome-level phosphosites. Based on our reported FuncPhos-SEQ framework, which was built by integrating phosphosite sequence evolution and protein-protein interaction (PPI) information, FuncPhos-STR was developed by further integrating the structural and dynamics information on AlphaFold protein structures. The characterized structural topology and dynamics features underlying functional phosphosites emphasized their molecular mechanism for regulating protein functions. By integrating the structural and dynamics, sequence evolutionary, and PPI network features from protein different dimensions, FuncPhos-STR has advantage over other reported models, with the best AUC value of 0.855. Using FuncPhos-STR, the phosphosites inside the pocket regions are accessible to higher functional scores, theoretically supporting their potential regulatory mechanism. Overall, FuncPhos-STR would accelerate the functional identification of huge unexplored phosphosites, and facilitate the elucidation of their allosteric regulation mechanisms. The web server of FuncPhos-STR is freely available at http://funcptm.jysw.suda.edu.cn/str.
Collapse
Affiliation(s)
- Guangyu Zhang
- School of Computer Science and Technology, Soochow University, Suzhou 215006, China
| | - Cai Zhang
- School of Computer Science and Technology, Soochow University, Suzhou 215006, China
| | - Mingyue Cai
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China
| | - Cheng Luo
- State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China
| | - Fei Zhu
- School of Computer Science and Technology, Soochow University, Suzhou 215006, China.
| | - Zhongjie Liang
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China; Jiangsu Province Engineering Research Center of Precision Diagnostics and Therapeutics Development, Soochow University, Suzhou 215123, China.
| |
Collapse
|
7
|
Lesgidou N, Vlassi M. Community analysis of large-scale molecular dynamics simulations elucidated dynamics-driven allostery in tyrosine kinase 2. Proteins 2024; 92:474-498. [PMID: 37950407 DOI: 10.1002/prot.26631] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Revised: 10/13/2023] [Accepted: 10/24/2023] [Indexed: 11/12/2023]
Abstract
TYK2 is a nonreceptor tyrosine kinase, member of the Janus kinases (JAK), with a central role in several diseases, including cancer. The JAKs' catalytic domains (KD) are highly conserved, yet the isolated TYK2-KD exhibits unique specificities. In a previous work, using molecular dynamics (MD) simulations of a catalytically impaired TYK2-KD variant (P1104A) we found that this amino acid change of its JAK-characteristic insert (αFG), acts at the dynamics level. Given that structural dynamics is key to the allosteric activation of protein kinases, in this study we applied a long-scale MD simulation and investigated an active TYK2-KD form in the presence of adenosine 5'-triphosphate and one magnesium ion that represents a dynamic and crucial step of the catalytic cycle, in other protein kinases. Community analysis of the MD trajectory shed light, for the first time, on the dynamic profile and dynamics-driven allosteric communications within the TYK2-KD during activation and revealed that αFG and amino acids P1104, P1105, and I1112 in particular, hold a pivotal role and act synergistically with a dynamically coupled communication network of amino acids serving intra-KD signaling for allosteric regulation of TYK2 activity. Corroborating our findings, most of the identified amino acids are associated with cancer-related missense/splice-site mutations of the Tyk2 gene. We propose that the conformational dynamics at this step of the catalytic cycle, coordinated by αFG, underlie TYK2-unique substrate recognition and account for its distinct specificity. In total, this work adds to knowledge towards an in-depth understanding of TYK2 activation and may be valuable towards a rational design of allosteric TYK2-specific inhibitors.
Collapse
Affiliation(s)
- Nastazia Lesgidou
- National Center for Scientific Research "Demokritos", Institute of Biosciences & Applications, Athens, Greece
| | - Metaxia Vlassi
- National Center for Scientific Research "Demokritos", Institute of Biosciences & Applications, Athens, Greece
| |
Collapse
|
8
|
Meller A, Kelly D, Smith LG, Bowman GR. Toward physics-based precision medicine: Exploiting protein dynamics to design new therapeutics and interpret variants. Protein Sci 2024; 33:e4902. [PMID: 38358129 PMCID: PMC10868452 DOI: 10.1002/pro.4902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 12/01/2023] [Accepted: 01/04/2024] [Indexed: 02/16/2024]
Abstract
The goal of precision medicine is to utilize our knowledge of the molecular causes of disease to better diagnose and treat patients. However, there is a substantial mismatch between the small number of food and drug administration (FDA)-approved drugs and annotated coding variants compared to the needs of precision medicine. This review introduces the concept of physics-based precision medicine, a scalable framework that promises to improve our understanding of sequence-function relationships and accelerate drug discovery. We show that accounting for the ensemble of structures a protein adopts in solution with computer simulations overcomes many of the limitations imposed by assuming a single protein structure. We highlight studies of protein dynamics and recent methods for the analysis of structural ensembles. These studies demonstrate that differences in conformational distributions predict functional differences within protein families and between variants. Thanks to new computational tools that are providing unprecedented access to protein structural ensembles, this insight may enable accurate predictions of variant pathogenicity for entire libraries of variants. We further show that explicitly accounting for protein ensembles, with methods like alchemical free energy calculations or docking to Markov state models, can uncover novel lead compounds. To conclude, we demonstrate that cryptic pockets, or cavities absent in experimental structures, provide an avenue to target proteins that are currently considered undruggable. Taken together, our review provides a roadmap for the field of protein science to accelerate precision medicine.
Collapse
Affiliation(s)
- Artur Meller
- Department of Biochemistry and Molecular BiophysicsWashington University in St. LouisSt. LouisMissouriUSA
- Medical Scientist Training ProgramWashington University in St. LouisSt. LouisMissouriUSA
- Departments of Biochemistry & Biophysics and BioengineeringUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | - Devin Kelly
- Departments of Biochemistry & Biophysics and BioengineeringUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | - Louis G. Smith
- Departments of Biochemistry & Biophysics and BioengineeringUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| | - Gregory R. Bowman
- Departments of Biochemistry & Biophysics and BioengineeringUniversity of PennsylvaniaPhiladelphiaPennsylvaniaUSA
| |
Collapse
|
9
|
Tam B, Qin Z, Zhao B, Sinha S, Lei CL, Wang SM. Classification of MLH1 Missense VUS Using Protein Structure-Based Deep Learning-Ramachandran Plot-Molecular Dynamics Simulations Method. Int J Mol Sci 2024; 25:850. [PMID: 38255924 PMCID: PMC10815254 DOI: 10.3390/ijms25020850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Revised: 01/04/2024] [Accepted: 01/05/2024] [Indexed: 01/24/2024] Open
Abstract
Pathogenic variation in DNA mismatch repair (MMR) gene MLH1 is associated with Lynch syndrome (LS), an autosomal dominant hereditary cancer. Of the 3798 MLH1 germline variants collected in the ClinVar database, 38.7% (1469) were missense variants, of which 81.6% (1199) were classified as Variants of Uncertain Significance (VUS) due to the lack of functional evidence. Further determination of the impact of VUS on MLH1 function is important for the VUS carriers to take preventive action. We recently developed a protein structure-based method named "Deep Learning-Ramachandran Plot-Molecular Dynamics Simulation (DL-RP-MDS)" to evaluate the deleteriousness of MLH1 missense VUS. The method extracts protein structural information by using the Ramachandran plot-molecular dynamics simulation (RP-MDS) method, then combines the variation data with an unsupervised learning model composed of auto-encoder and neural network classifier to identify the variants causing significant change in protein structure. In this report, we applied the method to classify 447 MLH1 missense VUS. We predicted 126/447 (28.2%) MLH1 missense VUS were deleterious. Our study demonstrates that DL-RP-MDS is able to classify the missense VUS based solely on their impact on protein structure.
Collapse
Affiliation(s)
- Benjamin Tam
- Ministry of Education Frontiers Science Center for Precision Oncology, Faculty of Health Sciences, University of Macau, Macau SAR, China
- Cancer Centre, Faculty of Health Sciences, University of Macau, Macau SAR, China
- Institute of Translational Medicine, Faculty of Health Sciences, University of Macau, Macau SAR, China
| | - Zixin Qin
- Ministry of Education Frontiers Science Center for Precision Oncology, Faculty of Health Sciences, University of Macau, Macau SAR, China
- Cancer Centre, Faculty of Health Sciences, University of Macau, Macau SAR, China
- Institute of Translational Medicine, Faculty of Health Sciences, University of Macau, Macau SAR, China
| | - Bojin Zhao
- Ministry of Education Frontiers Science Center for Precision Oncology, Faculty of Health Sciences, University of Macau, Macau SAR, China
- Cancer Centre, Faculty of Health Sciences, University of Macau, Macau SAR, China
- Institute of Translational Medicine, Faculty of Health Sciences, University of Macau, Macau SAR, China
| | - Siddharth Sinha
- Ministry of Education Frontiers Science Center for Precision Oncology, Faculty of Health Sciences, University of Macau, Macau SAR, China
- Cancer Centre, Faculty of Health Sciences, University of Macau, Macau SAR, China
- Institute of Translational Medicine, Faculty of Health Sciences, University of Macau, Macau SAR, China
| | - Chon Lok Lei
- Ministry of Education Frontiers Science Center for Precision Oncology, Faculty of Health Sciences, University of Macau, Macau SAR, China
- Cancer Centre, Faculty of Health Sciences, University of Macau, Macau SAR, China
- Institute of Translational Medicine, Faculty of Health Sciences, University of Macau, Macau SAR, China
| | - San Ming Wang
- Ministry of Education Frontiers Science Center for Precision Oncology, Faculty of Health Sciences, University of Macau, Macau SAR, China
- Cancer Centre, Faculty of Health Sciences, University of Macau, Macau SAR, China
- Institute of Translational Medicine, Faculty of Health Sciences, University of Macau, Macau SAR, China
| |
Collapse
|
10
|
Khazaal NM, Alghetaa HF, Al-Shuhaib MBS, Al-Thuwaini TM, Alkhammas AH. A novel deleterious oxytocin variant is associated with the lower twinning ratio in Awassi ewes. Anim Biotechnol 2023; 34:3404-3415. [PMID: 36449364 DOI: 10.1080/10495398.2022.2152038] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/05/2022]
Abstract
This study aimed to assess the possible association of oxytocin (OXT) gene with reproductive traits in two groups of Awassi ewes that differ in their reproductive potentials. Sheep were genotyped using PCR-single-stranded conformation polymorphism approach. Three genotypes were detected in exon 2, CC, CA, and AA, and a novel SNP was identified with a missense effect on oxytocin (c.188C > A → p.Arg55Leu). A significant (p < 0.01) association of p.Arg55Leu with the twinning rate was found as ewes with AA and CA genotypes exhibited, respectively a lower twinning ratio than those with the wild-type CC genotype. The deleterious impact of p.Arg55Leu was demonstrated by all in silico tools that were utilized to assess the effect of this variant on the structure, function, and stability of oxytocin. Molecular docking showed that p.Arg55Leu caused a dramatic alteration in the binding of oxytocin with its receptor and reduced the number of interacted amino acids between them. Our study suggests that ewes with AA and CA genotypes showed a lower reproductive performance due to the presence of p.Arg55Leu, which caused damaging impacts on oxytocin and is binding with the OXT receptor. The utilization of the p.Arg55Leu could be useful for improving Awassi reproductive potential.
Collapse
Affiliation(s)
- Neam M Khazaal
- Department of Physiology, Biochemistry and Pharmacology, College of Veterinary Medicine, University of Baghdad, Baghdad, Iraq
| | - Hasan F Alghetaa
- Department of Physiology, Biochemistry and Pharmacology, College of Veterinary Medicine, University of Baghdad, Baghdad, Iraq
| | | | - Tahreer M Al-Thuwaini
- Department of Animal Production, College of Agriculture, Al-Qasim Green University, Al-Qasim, Iraq
| | - Ahmed H Alkhammas
- Department of Animal Production, College of Agriculture, Al-Qasim Green University, Al-Qasim, Iraq
| |
Collapse
|
11
|
Bahar I, Banerjee A, Mathew S, Naqvi M, Yilmaz S, Zachoropoulou M, Doruker P, Kumita J, Yang SH, Gur M, Itzhaki L, Gordon R. Influence of Point Mutations on PR65 Conformational Adaptability: Insights from Nanoaperture Optical Tweezer Experiments and Molecular Simulations. RESEARCH SQUARE 2023:rs.3.rs-3599809. [PMID: 38014259 PMCID: PMC10680943 DOI: 10.21203/rs.3.rs-3599809/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
PR65 is the HEAT-repeat scaffold subunit of the heterotrimeric protein phosphatase 2A (PP2A) and an archetypal tandem-repeat protein, forming a spring-like architecture. PR65 conformational mechanics play a crucial role in PP2A function by opening/closing the substrate-binding/catalysis interface. Using in-silico saturation mutagenesis we identified "hinge" residues of PR65, whose substitutions are predicted to restrict its conformational adaptability and thereby disrupt PP2A function. Molecular simulations revealed that a subset of hinge mutations stabilized the extended/open conformation, whereas another had the opposite effect. By trapping in nanoaperture optical tweezer, we characterized PR65 motion and showed that the former mutants exhibited higher corner frequencies and lower translational scattering, indicating a shift towards extended conformations, whereas the latter showed the opposite behavior. Thus, experiments confirm the conformations predicted computationally. The study highlights the utility of nanoaperture-based tweezers for exploring structure and dynamics, and the power of integrating this single-molecule method with in silico approaches.
Collapse
|
12
|
Nguyen NH, Sarangi S, McChesney EM, Sheng S, Durrant JD, Porter AW, Kleyman TR, Pitluk ZW, Brodsky JL. Genome mining yields putative disease-associated ROMK variants with distinct defects. PLoS Genet 2023; 19:e1011051. [PMID: 37956218 PMCID: PMC10695394 DOI: 10.1371/journal.pgen.1011051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Revised: 12/04/2023] [Accepted: 11/04/2023] [Indexed: 11/15/2023] Open
Abstract
Bartter syndrome is a group of rare genetic disorders that compromise kidney function by impairing electrolyte reabsorption. Left untreated, the resulting hyponatremia, hypokalemia, and dehydration can be fatal, and there is currently no cure. Bartter syndrome type II specifically arises from mutations in KCNJ1, which encodes the renal outer medullary potassium channel, ROMK. Over 40 Bartter syndrome-associated mutations in KCNJ1 have been identified, yet their molecular defects are mostly uncharacterized. Nevertheless, a subset of disease-linked mutations compromise ROMK folding in the endoplasmic reticulum (ER), which in turn results in premature degradation via the ER associated degradation (ERAD) pathway. To identify uncharacterized human variants that might similarly lead to premature degradation and thus disease, we mined three genomic databases. First, phenotypic data in the UK Biobank were analyzed using a recently developed computational platform to identify individuals carrying KCNJ1 variants with clinical features consistent with Bartter syndrome type II. In parallel, we examined genomic data in both the NIH TOPMed and ClinVar databases with the aid of Rhapsody, a verified computational algorithm that predicts mutation pathogenicity and disease severity. Subsequent phenotypic studies using a yeast screen to assess ROMK function-and analyses of ROMK biogenesis in yeast and human cells-identified four previously uncharacterized mutations. Among these, one mutation uncovered from the two parallel approaches (G228E) destabilized ROMK and targeted it for ERAD, resulting in reduced cell surface expression. Another mutation (T300R) was ERAD-resistant, but defects in channel activity were apparent based on two-electrode voltage clamp measurements in X. laevis oocytes. Together, our results outline a new computational and experimental pipeline that can be applied to identify disease-associated alleles linked to a range of other potassium channels, and further our understanding of the ROMK structure-function relationship that may aid future therapeutic strategies to advance precision medicine.
Collapse
Affiliation(s)
- Nga H. Nguyen
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Srikant Sarangi
- Paradigm4, Inc., Waltham, Massachusetts, United States of America
| | - Erin M. McChesney
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Shaohu Sheng
- Renal-Electrolyte Division, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Jacob D. Durrant
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Aidan W. Porter
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Thomas R. Kleyman
- Renal-Electrolyte Division, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | | | - Jeffrey L. Brodsky
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| |
Collapse
|
13
|
Nussinov R, Liu Y, Zhang W, Jang H. Protein conformational ensembles in function: roles and mechanisms. RSC Chem Biol 2023; 4:850-864. [PMID: 37920394 PMCID: PMC10619138 DOI: 10.1039/d3cb00114h] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 09/02/2023] [Indexed: 11/04/2023] Open
Abstract
The sequence-structure-function paradigm has dominated twentieth century molecular biology. The paradigm tacitly stipulated that for each sequence there exists a single, well-organized protein structure. Yet, to sustain cell life, function requires (i) that there be more than a single structure, (ii) that there be switching between the structures, and (iii) that the structures be incompletely organized. These fundamental tenets called for an updated sequence-conformational ensemble-function paradigm. The powerful energy landscape idea, which is the foundation of modernized molecular biology, imported the conformational ensemble framework from physics and chemistry. This framework embraces the recognition that proteins are dynamic and are always interconverting between conformational states with varying energies. The more stable the conformation the more populated it is. The changes in the populations of the states are required for cell life. As an example, in vivo, under physiological conditions, wild type kinases commonly populate their more stable "closed", inactive, conformations. However, there are minor populations of the "open", ligand-free states. Upon their stabilization, e.g., by high affinity interactions or mutations, their ensembles shift to occupy the active states. Here we discuss the role of conformational propensities in function. We provide multiple examples of diverse systems, including protein kinases, lipid kinases, and Ras GTPases, discuss diverse conformational mechanisms, and provide a broad outlook on protein ensembles in the cell. We propose that the number of molecules in the active state (inactive for repressors), determine protein function, and that the dynamic, relative conformational propensities, rather than the rigid structures, are the hallmark of cell life.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research Frederick MD 21702 USA
- Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University Tel Aviv 69978 Israel
- Cancer Innovation Laboratory, National Cancer Institute Frederick MD 21702 USA
| | - Yonglan Liu
- Cancer Innovation Laboratory, National Cancer Institute Frederick MD 21702 USA
| | - Wengang Zhang
- Cancer Innovation Laboratory, National Cancer Institute Frederick MD 21702 USA
| | - Hyunbum Jang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research Frederick MD 21702 USA
- Cancer Innovation Laboratory, National Cancer Institute Frederick MD 21702 USA
| |
Collapse
|
14
|
Chi YI, Jorge SD, Jensen DR, Smith BC, Volkman BF, Mathison AJ, Lomberk G, Zimmermann MT, Urrutia R. A multi-layered computational structural genomics approach enhances domain-specific interpretation of Kleefstra syndrome variants in EHMT1. Comput Struct Biotechnol J 2023; 21:5249-5258. [PMID: 37954151 PMCID: PMC10632586 DOI: 10.1016/j.csbj.2023.10.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 10/06/2023] [Accepted: 10/12/2023] [Indexed: 11/14/2023] Open
Abstract
This study investigates the functional significance of assorted variants of uncertain significance (VUS) in euchromatic histone lysine methyltransferase 1 (EHMT1), which is critical for early development and normal physiology. EHMT1 mutations cause Kleefstra syndrome and are linked to various human cancers. However, accurate functional interpretations of these variants are yet to be made, limiting diagnoses and future research. To overcome this, we integrate conventional tools for variant calling with computational biophysics and biochemistry to conduct multi-layered mechanistic analyses of the SET catalytic domain of EHMT1, which is critical for this protein function. We use molecular mechanics and molecular dynamics (MD)-based metrics to analyze the SET domain structure and functional motions resulting from 97 Kleefstra syndrome missense variants within the domain. Our approach allows us to classify the variants in a mechanistic manner into SV (Structural Variant), DV (Dynamic Variant), SDV (Structural and Dynamic Variant), and VUS (Variant of Uncertain Significance). Our findings reveal that the damaging variants are mostly mapped around the active site, substrate binding site, and pre-SET regions. Overall, we report an improvement for this method over conventional tools for variant interpretation and simultaneously provide a molecular mechanism for variant dysfunction.
Collapse
Affiliation(s)
- Young-In Chi
- Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, USA
- Division of Research, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Salomão D. Jorge
- Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Davin R. Jensen
- Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, USA
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Brian C. Smith
- Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, USA
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Brian F. Volkman
- Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, USA
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Angela J. Mathison
- Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, USA
- Division of Research, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Gwen Lomberk
- Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, USA
- Division of Research, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, USA
- Department of Pharmacology and Toxicology, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Michael T. Zimmermann
- Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, USA
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
- Clinical and Translational Sciences Institute, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Raul Urrutia
- Linda T. and John A. Mellowes Center for Genomic Sciences and Precision Medicine, Medical College of Wisconsin, Milwaukee, WI, USA
- Division of Research, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, USA
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| |
Collapse
|
15
|
Chi YI, Jorge SD, Jensen DR, Smith BC, Volkman BF, Mathison AJ, Lomberk G, Zimmermann MT, Urrutia R. A Multi-Layered Computational Structural Genomics Approach Enhances Domain-Specific Interpretation of Kleefstra Syndrome Variants in EHMT1. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.06.556558. [PMID: 37786696 PMCID: PMC10541560 DOI: 10.1101/2023.09.06.556558] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/04/2023]
Abstract
This study investigates the functional significance of assorted variants of uncertain significance (VUS) in euchromatic histone lysine methyltransferase 1 (EHMT1), which is critical for early development and normal physiology. EHMT1 mutations cause Kleefstra syndrome and are linked to various human cancers. However, accurate functional interpretation of these variants are yet to be made, limiting diagnoses and future research. To overcome this, we integrate conventional tools for variant calling with computational biophysics and biochemistry to conduct multi-layered mechanistic analyses of the SET catalytic domain of EHMT1, which is critical for this protein function. We use molecular mechanics and molecular dynamics (MD)-based metrics to analyze the SET domain structure and functional motions resulting from 97 Kleefstra syndrome missense variants within this domain. Our approach allows us to classify the variants in a mechanistic manner into SV (Structural Variant), DV (Dynamic Variant), SDV (Structural and Dynamic Variant), and VUS (Variant of Uncertain Significance). Our findings reveal that the damaging variants are mostly mapped around the active site, substrate binding site, and pre-SET regions. Overall, we report an improvement for this method over conventional tools for variant interpretation and simultaneously provide a molecular mechanism of variant dysfunction.
Collapse
|
16
|
Wang B, Lei X, Tian W, Perez-Rathke A, Tseng YY, Liang J. Structure-based pathogenicity relationship identifier for predicting effects of single missense variants and discovery of higher-order cancer susceptibility clusters of mutations. Brief Bioinform 2023; 24:bbad206. [PMID: 37332013 PMCID: PMC10359089 DOI: 10.1093/bib/bbad206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 04/19/2023] [Accepted: 05/13/2023] [Indexed: 06/20/2023] Open
Abstract
We report the structure-based pathogenicity relationship identifier (SPRI), a novel computational tool for accurate evaluation of pathological effects of missense single mutations and prediction of higher-order spatially organized units of mutational clusters. SPRI can effectively extract properties determining pathogenicity encoded in protein structures, and can identify deleterious missense mutations of germ line origin associated with Mendelian diseases, as well as mutations of somatic origin associated with cancer drivers. It compares favorably to other methods in predicting deleterious mutations. Furthermore, SPRI can discover spatially organized pathogenic higher-order spatial clusters (patHOS) of deleterious mutations, including those of low recurrence, and can be used for discovery of candidate cancer driver genes and driver mutations. We further demonstrate that SPRI can take advantage of AlphaFold2 predicted structures and can be deployed for saturation mutation analysis of the whole human proteome.
Collapse
Affiliation(s)
- Boshen Wang
- Center for Bioinformatics and Quantitative Biology, Richard and Loan Hill, Department of Biomedical Engineering, University of Illinois at Chicago, W103 Suite, 820 S Wood St, 60612 IL, USA
| | - Xue Lei
- Center for Bioinformatics and Quantitative Biology, Richard and Loan Hill, Department of Biomedical Engineering, University of Illinois at Chicago, W103 Suite, 820 S Wood St, 60612 IL, USA
| | - Wei Tian
- Center for Bioinformatics and Quantitative Biology, Richard and Loan Hill, Department of Biomedical Engineering, University of Illinois at Chicago, W103 Suite, 820 S Wood St, 60612 IL, USA
| | - Alan Perez-Rathke
- Center for Bioinformatics and Quantitative Biology, Richard and Loan Hill, Department of Biomedical Engineering, University of Illinois at Chicago, W103 Suite, 820 S Wood St, 60612 IL, USA
| | - Yan-Yuan Tseng
- Center for Molecular Medicine and Genetics, Biochemistry and Molecular Biology Department, School of Medicine, Wayne State University, 540 E. Canfield Avenue, 48201MI, USA
| | - Jie Liang
- Center for Bioinformatics and Quantitative Biology, Richard and Loan Hill, Department of Biomedical Engineering, University of Illinois at Chicago, W103 Suite, 820 S Wood St, 60612 IL, USA
| |
Collapse
|
17
|
Banerjee A, Bahar I. Structural Dynamics Predominantly Determine the Adaptability of Proteins to Amino Acid Deletions. Int J Mol Sci 2023; 24:8450. [PMID: 37176156 PMCID: PMC10179678 DOI: 10.3390/ijms24098450] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Revised: 05/01/2023] [Accepted: 05/06/2023] [Indexed: 05/15/2023] Open
Abstract
The insertion or deletion (indel) of amino acids has a variety of effects on protein function, ranging from disease-forming changes to gaining new functions. Despite their importance, indels have not been systematically characterized towards protein engineering or modification goals. In the present work, we focus on deletions composed of multiple contiguous amino acids (mAA-dels) and their effects on the protein (mutant) folding ability. Our analysis reveals that the mutant retains the native fold when the mAA-del obeys well-defined structural dynamics properties: localization in intrinsically flexible regions, showing low resistance to mechanical stress, and separation from allosteric signaling paths. Motivated by the possibility of distinguishing the features that underlie the adaptability of proteins to mAA-dels, and by the rapid evaluation of these features using elastic network models, we developed a positive-unlabeled learning-based classifier that can be adopted for protein design purposes. Trained on a consolidated set of features, including those reflecting the intrinsic dynamics of the regions where the mAA-dels occur, the new classifier yields a high recall of 84.3% for identifying mAA-dels that are stably tolerated by the protein. The comparative examination of the relative contribution of different features to the prediction reveals the dominant role of structural dynamics in enabling the adaptation of the mutant to mAA-del without disrupting the native fold.
Collapse
Affiliation(s)
- Anupam Banerjee
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794, USA
| | - Ivet Bahar
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY 11794, USA
- Department of Biochemistry and Cell Biology, Stony Brook University, Stony Brook, NY 11794, USA
| |
Collapse
|
18
|
Nguyen NH, Sarangi S, McChesney EM, Sheng S, Porter AW, Kleyman TR, Pitluk ZW, Brodsky JL. Genome mining yields new disease-associated ROMK variants with distinct defects. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.05.539609. [PMID: 37214976 PMCID: PMC10197530 DOI: 10.1101/2023.05.05.539609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Bartter syndrome is a group of rare genetic disorders that compromise kidney function by impairing electrolyte reabsorption. Left untreated, the resulting hyponatremia, hypokalemia, and dehydration can be fatal. Although there is no cure for this disease, specific genes that lead to different Bartter syndrome subtypes have been identified. Bartter syndrome type II specifically arises from mutations in the KCNJ1 gene, which encodes the renal outer medullary potassium channel, ROMK. To date, over 40 Bartter syndrome-associated mutations in KCNJ1 have been identified. Yet, their molecular defects are mostly uncharacterized. Nevertheless, a subset of disease-linked mutations compromise ROMK folding in the endoplasmic reticulum (ER), which in turn results in premature degradation via the ER associated degradation (ERAD) pathway. To identify uncharacterized human variants that might similarly lead to premature degradation and thus disease, we mined three genomic databases. First, phenotypic data in the UK Biobank were analyzed using a recently developed computational platform to identify individuals carrying KCNJ1 variants with clinical features consistent with Bartter syndrome type II. In parallel, we examined ROMK genomic data in both the NIH TOPMed and ClinVar databases with the aid of a computational algorithm that predicts protein misfolding and disease severity. Subsequent phenotypic studies using a high throughput yeast screen to assess ROMK function-and analyses of ROMK biogenesis in yeast and human cells-identified four previously uncharacterized mutations. Among these, one mutation uncovered from the two parallel approaches (G228E) destabilized ROMK and targeted it for ERAD, resulting in reduced protein expression at the cell surface. Another ERAD-targeted ROMK mutant (L320P) was found in only one of the screens. In contrast, another mutation (T300R) was ERAD-resistant, but defects in ROMK activity were apparent after expression and two-electrode voltage clamp measurements in Xenopus oocytes. Together, our results outline a new computational and experimental pipeline that can be applied to identify disease-associated alleles linked to a range of other potassium channels, and further our understanding of the ROMK structure-function relationship that may aid future therapeutic strategies. Author Summary Bartter syndrome is a rare genetic disorder characterized by defective renal electrolyte handing, leading to debilitating symptoms and, in some patients, death in infancy. Currently, there is no cure for this disease. Bartter syndrome is divided into five types based on the causative gene. Bartter syndrome type II results from genetic variants in the gene encoding the ROMK protein, which is expressed in the kidney and assists in regulating sodium, potassium, and water homeostasis. Prior work established that some disease-associated ROMK mutants misfold and are destroyed soon after their synthesis in the endoplasmic reticulum (ER). Because a growing number of drugs have been identified that correct defective protein folding, we wished to identify an expanded cohort of similarly misshapen and unstable disease-associated ROMK variants. To this end, we developed a pipeline that employs computational analyses of human genome databases with genetic and biochemical assays. Next, we both confirmed the identity of known variants and uncovered previously uncharacterized ROMK variants associated with Bartter syndrome type II. Further analyses indicated that select mutants are targeted for ER-associated degradation, while another mutant compromises ROMK function. This work sets-the-stage for continued mining for ROMK loss of function alleles as well as other potassium channels, and positions select Bartter syndrome mutations for correction using emerging pharmaceuticals.
Collapse
|
19
|
Banerjee A, Saha S, Tvedt NC, Yang LW, Bahar I. Mutually beneficial confluence of structure-based modeling of protein dynamics and machine learning methods. Curr Opin Struct Biol 2023; 78:102517. [PMID: 36587424 PMCID: PMC10038760 DOI: 10.1016/j.sbi.2022.102517] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 11/19/2022] [Accepted: 11/22/2022] [Indexed: 12/31/2022]
Abstract
Proteins sample an ensemble of conformers under physiological conditions, having access to a spectrum of modes of motions, also called intrinsic dynamics. These motions ensure the adaptation to various interactions in the cell, and largely assist in, if not determine, viable mechanisms of biological function. In recent years, machine learning frameworks have proven uniquely useful in structural biology, and recent studies further provide evidence to the utility and/or necessity of considering intrinsic dynamics for increasing their predictive ability. Efficient quantification of dynamics-based attributes by recently developed physics-based theories and models such as elastic network models provides a unique opportunity to generate data on dynamics for training ML models towards inferring mechanisms of protein function, assessing pathogenicity, or estimating binding affinities.
Collapse
Affiliation(s)
- Anupam Banerjee
- Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh PA 15261, USA
| | - Satyaki Saha
- Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh PA 15261, USA
| | - Nathan C Tvedt
- Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh PA 15261, USA; Computational and Applied Mathematics and Statistics, The College of William and Mary, Williamsburg, VA 23185, USA
| | - Lee-Wei Yang
- Institute of Bioinformatics and Structural Biology, and PhD Program in Biomedical Artificial Intelligence, National Tsing Hua University, Hsinchu 300044, Taiwan; Physics Division, National Center for Theoretical Sciences, Taipei 106319, Taiwan
| | - Ivet Bahar
- Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh PA 15261, USA.
| |
Collapse
|
20
|
Wordom update 2: A user-friendly program for the analysis of molecular structures and conformational ensembles. Comput Struct Biotechnol J 2023; 21:1390-1402. [PMID: 36817953 PMCID: PMC9929209 DOI: 10.1016/j.csbj.2023.01.026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 01/19/2023] [Accepted: 01/20/2023] [Indexed: 01/29/2023] Open
Abstract
We present the second update of Wordom, a user-friendly and efficient program for manipulation and analysis of conformational ensembles from molecular simulations. The actual update expands some of the existing modules and adds 21 new modules to the update 1 published in 2011. The new adds can be divided into three sets that: 1) analyze atomic fluctuations and structural communication; 2) explore ion-channel conformational dynamics and ionic translocation; and 3) compute geometrical indices of structural deformation. Set 1 serves to compute correlations of motions, find geometrically stable domains, identify a dynamically invariant core, find changes in domain-domain separation and mutual orientation, perform wavelet analysis of large-scale simulations, process the output of principal component analysis of atomic fluctuations, perform functional mode analysis, infer regions of mechanical rigidity, analyze overall fluctuations, and perform the perturbation response scanning. Set 2 includes modules specific for ion channels, which serve to monitor the pore radius as well as water or ion fluxes, and measure functional collective motions like receptor twisting or tilting angles. Finally, set 3 includes tools to monitor structural deformations by computing angles, perimeter, area, volume, β-sheet curvature, radial distribution function, and center of mass. The ring perception module is also included, helpful to monitor supramolecular self-assemblies. This update places Wordom among the most suitable, complete, user-friendly, and efficient software for the analysis of biomolecular simulations. The source code of Wordom and the relative documentation are available under the GNU general public license at http://wordom.sf.net.
Collapse
|
21
|
Pacini L, Lesieur C. GCAT: A network model of mutational influences between amino acid positions in PSD95pdz3. Front Mol Biosci 2022; 9:1035248. [PMID: 36387271 PMCID: PMC9659846 DOI: 10.3389/fmolb.2022.1035248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2022] [Accepted: 10/13/2022] [Indexed: 12/05/2022] Open
Abstract
Proteins exist for more than 3 billion years: proof of a sustainable design. They have mechanisms coping with internal perturbations (e.g., amino acid mutations), which tie genetic backgrounds to diseases or drug therapy failure. One difficulty to grasp these mechanisms is the asymmetry of amino acid mutational impact: a mutation at position i in the sequence, which impact a position j does not imply that the mutation at position j impacts the position i. Thus, to distinguish the influence of the mutation of i on j from the influence of the mutation of j on i, position mutational influences must be represented with directions. Using the X ray structure of the third PDZ domain of PDS-95 (Protein Data Bank 1BE9) and in silico mutations, we build a directed network called GCAT that models position mutational influences. In the GCAT, a position is a node with edges that leave the node (out-edges) for the influences of the mutation of the position on other positions and edges that enter the position (in-edges) for the influences of the mutation of other positions on the position. 1BE9 positions split into four influence categories called G, C, A and T going from positions influencing on average less other positions and influenced on average by less other positions (category C) to positions influencing on average more others positions and influenced on average by more other positions (category T). The four categories depict position neighborhoods in the protein structure with different tolerance to mutations.
Collapse
Affiliation(s)
- Lorenza Pacini
- University Lyon, CNRS, INSA Lyon, Ecole Centrale de Lyon, UMR5005, Université Claude Bernard Lyon 1, Villeurbanne, France
- Institut Rhônalpin des Systèmes Complexes, IXXI-ENS-Lyon, Lyon, France
| | - Claire Lesieur
- University Lyon, CNRS, INSA Lyon, Ecole Centrale de Lyon, UMR5005, Université Claude Bernard Lyon 1, Villeurbanne, France
- Institut Rhônalpin des Systèmes Complexes, IXXI-ENS-Lyon, Lyon, France
- *Correspondence: Claire Lesieur,
| |
Collapse
|
22
|
The Cancermuts software package for the prioritization of missense cancer variants: a case study of AMBRA1 in melanoma. Cell Death Dis 2022; 13:872. [PMID: 36243772 PMCID: PMC9569343 DOI: 10.1038/s41419-022-05318-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Revised: 09/27/2022] [Accepted: 10/03/2022] [Indexed: 11/07/2022]
Abstract
Cancer genomics and cancer mutation databases have made an available wealth of information about missense mutations found in cancer patient samples. Contextualizing by means of annotation and predicting the effect of amino acid change help identify which ones are more likely to have a pathogenic impact. Those can be validated by means of experimental approaches that assess the impact of protein mutations on the cellular functions or their tumorigenic potential. Here, we propose the integrative bioinformatic approach Cancermuts, implemented as a Python package. Cancermuts is able to gather known missense cancer mutations from databases such as cBioPortal and COSMIC, and annotate them with the pathogenicity score REVEL as well as information on their source. It is also able to add annotations about the protein context these mutations are found in, such as post-translational modification sites, structured/unstructured regions, presence of short linear motifs, and more. We applied Cancermuts to the intrinsically disordered protein AMBRA1, a key regulator of many cellular processes frequently deregulated in cancer. By these means, we classified mutations of AMBRA1 in melanoma, where AMBRA1 is highly mutated and displays a tumor-suppressive role. Next, based on REVEL score, position along the sequence, and their local context, we applied cellular and molecular approaches to validate the predicted pathogenicity of a subset of mutations in an in vitro melanoma model. By doing so, we have identified two AMBRA1 mutations which show enhanced tumorigenic potential and are worth further investigation, highlighting the usefulness of the tool. Cancermuts can be used on any protein targets starting from minimal information, and it is available at https://www.github.com/ELELAB/cancermuts as free software.
Collapse
|
23
|
Nussinov R, Zhang M, Maloney R, Liu Y, Tsai CJ, Jang H. Allostery: Allosteric Cancer Drivers and Innovative Allosteric Drugs. J Mol Biol 2022; 434:167569. [PMID: 35378118 PMCID: PMC9398924 DOI: 10.1016/j.jmb.2022.167569] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Revised: 03/11/2022] [Accepted: 03/25/2022] [Indexed: 01/12/2023]
Abstract
Here, we discuss the principles of allosteric activating mutations, propagation downstream of the signals that they prompt, and allosteric drugs, with examples from the Ras signaling network. We focus on Abl kinase where mutations shift the landscape toward the active, imatinib binding-incompetent conformation, likely resulting in the high affinity ATP outcompeting drug binding. Recent pharmacological innovation extends to allosteric inhibitor (GNF-5)-linked PROTAC, targeting Bcr-Abl1 myristoylation site, and broadly, allosteric heterobifunctional degraders that destroy targets, rather than inhibiting them. Designed chemical linkers in bifunctional degraders can connect the allosteric ligand that binds the target protein and the E3 ubiquitin ligase warhead anchor. The physical properties and favored conformational state of the engineered linker can precisely coordinate the distance and orientation between the target and the recruited E3. Allosteric PROTACs, noncompetitive molecular glues, and bitopic ligands, with covalent links of allosteric ligands and orthosteric warheads, increase the effective local concentration of productively oriented and placed ligands. Through covalent chemical or peptide linkers, allosteric drugs can collaborate with competitive drugs, degrader anchors, or other molecules of choice, driving innovative drug discovery.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA; Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel.
| | - Mingzhen Zhang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA
| | - Ryan Maloney
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA
| | - Yonglan Liu
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA
| | - Chung-Jung Tsai
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA
| | - Hyunbum Jang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA
| |
Collapse
|
24
|
Woodard J, Iqbal S, Mashaghi A. Circuit topology predicts pathogenicity of missense mutations. Proteins 2022; 90:1634-1644. [PMID: 35394672 PMCID: PMC9543832 DOI: 10.1002/prot.26342] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 03/07/2022] [Accepted: 03/30/2022] [Indexed: 12/05/2022]
Abstract
The contact topology of a protein determines important aspects of the folding process. The topological measure of contact order has been shown to be predictive of the rate of folding. Circuit topology is emerging as another fundamental descriptor of biomolecular structure, with predicted effects on the folding rate. We analyze the residue‐based circuit topological environments of 21 K mutations labeled as pathogenic or benign. Multiple statistical lines of reasoning support the conclusion that the number of contacts in two specific circuit topological arrangements, namely inverse parallel and cross relations, with contacts involving the mutated residue have discriminatory value in determining the pathogenicity of human variants. We investigate how results vary with residue type and according to whether the gene is essential. We further explore the relationship to a number of structural features and find that circuit topology provides nonredundant information on protein structures and pathogenicity of mutations. Results may have implications for the polymer physics of protein folding and suggest that “local” topological information, including residue‐based circuit topology and residue contact order, could be useful in improving state‐of‐the‐art machine learning algorithms for pathogenicity prediction.
Collapse
Affiliation(s)
- Jaie Woodard
- Medical Systems Biophysics and Bioengineering, Leiden Academic Centre for Drug Research, Faculty of Science, Leiden University, Leiden, The Netherlands.,Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA
| | - Sumaiya Iqbal
- Center for the Development of Therapeutics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA.,Analytic and Translational Genetics Unit, Massachusetts General Hospital, Boston, Massachusetts, USA
| | - Alireza Mashaghi
- Medical Systems Biophysics and Bioengineering, Leiden Academic Centre for Drug Research, Faculty of Science, Leiden University, Leiden, The Netherlands.,Centre for Interdisciplinary Genome Research, Faculty of Science, Leiden University, Leiden, The Netherlands
| |
Collapse
|
25
|
Yazar M, Ozbek P. Assessment of 13 in silico pathogenicity methods on cancer-related variants. Comput Biol Med 2022; 145:105434. [DOI: 10.1016/j.compbiomed.2022.105434] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Revised: 03/04/2022] [Accepted: 03/20/2022] [Indexed: 11/03/2022]
|
26
|
Ose NJ, Butler BM, Kumar A, Kazan IC, Sanderford M, Kumar S, Ozkan SB. Dynamic coupling of residues within proteins as a mechanistic foundation of many enigmatic pathogenic missense variants. PLoS Comput Biol 2022; 18:e1010006. [PMID: 35389981 PMCID: PMC9017885 DOI: 10.1371/journal.pcbi.1010006] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Revised: 04/19/2022] [Accepted: 03/09/2022] [Indexed: 01/07/2023] Open
Abstract
Many pathogenic missense mutations are found in protein positions that are neither well-conserved nor fall in any known functional domains. Consequently, we lack any mechanistic underpinning of dysfunction caused by such mutations. We explored the disruption of allosteric dynamic coupling between these positions and the known functional sites as a possible mechanism for pathogenesis. In this study, we present an analysis of 591 pathogenic missense variants in 144 human enzymes that suggests that allosteric dynamic coupling of mutated positions with known active sites is a plausible biophysical mechanism and evidence of their functional importance. We illustrate this mechanism in a case study of β-Glucocerebrosidase (GCase) in which a vast majority of 94 sites harboring Gaucher disease-associated missense variants are located some distance away from the active site. An analysis of the conformational dynamics of GCase suggests that mutations on these distal sites cause changes in the flexibility of active site residues despite their distance, indicating a dynamic communication network throughout the protein. The disruption of the long-distance dynamic coupling caused by missense mutations may provide a plausible general mechanistic explanation for biological dysfunction and disease.
Collapse
Affiliation(s)
- Nicholas J. Ose
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona, United States of America
| | - Brandon M. Butler
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona, United States of America
| | - Avishek Kumar
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona, United States of America
| | - I. Can Kazan
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona, United States of America
| | - Maxwell Sanderford
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, Pennsylvania, United States of America
- Department of Biology, Temple University, Philadelphia, Pennsylvania, United States of America
| | - Sudhir Kumar
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, Pennsylvania, United States of America
- Department of Biology, Temple University, Philadelphia, Pennsylvania, United States of America
- Center for Genomic Medicine Research, King Abdulaziz University, Jeddah, Saudi Arabia
| | - S. Banu Ozkan
- Department of Physics and Center for Biological Physics, Arizona State University, Tempe, Arizona, United States of America
| |
Collapse
|
27
|
Structural Bioinformatics Enhances the Interpretation of Somatic Mutations in KDM6A Found in Human Cancers. Comput Struct Biotechnol J 2022; 20:2200-2211. [PMID: 35615018 PMCID: PMC9111933 DOI: 10.1016/j.csbj.2022.04.028] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2022] [Revised: 04/18/2022] [Accepted: 04/18/2022] [Indexed: 11/24/2022] Open
|
28
|
Xiao F, Zhou Z, Song X, Gan M, Long J, Verkhivker G, Hu G. Dissecting mutational allosteric effects in alkaline phosphatases associated with different Hypophosphatasia phenotypes: An integrative computational investigation. PLoS Comput Biol 2022; 18:e1010009. [PMID: 35320273 PMCID: PMC8979438 DOI: 10.1371/journal.pcbi.1010009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 04/04/2022] [Accepted: 03/10/2022] [Indexed: 11/18/2022] Open
Abstract
Hypophosphatasia (HPP) is a rare inherited disorder characterized by defective bone mineralization and is highly variable in its clinical phenotype. The disease occurs due to various loss-of-function mutations in ALPL, the gene encoding tissue-nonspecific alkaline phosphatase (TNSALP). In this work, a data-driven and biophysics-based approach is proposed for the large-scale analysis of ALPL mutations-from nonpathogenic to severe HPPs. By using a pipeline of synergistic approaches including sequence-structure analysis, network modeling, elastic network models and atomistic simulations, we characterized allosteric signatures and effects of the ALPL mutations on protein dynamics and function. Statistical analysis of molecular features computed for the ALPL mutations showed a significant difference between the control, mild and severe HPP phenotypes. Molecular dynamics simulations coupled with protein structure network analysis were employed to analyze the effect of single-residue variation on conformational dynamics of TNSALP dimers, and the developed machine learning model suggested that the topological network parameters could serve as a robust indicator of severe mutations. The results indicated that the severity of disease-associated mutations is often linked with mutation-induced modulation of allosteric communications in the protein. This study suggested that ALPL mutations associated with mild and more severe HPPs can exert markedly distinct effects on the protein stability and long-range network communications. By linking the disease phenotypes with dynamic and allosteric molecular signatures, the proposed integrative computational approach enabled to characterize and quantify the allosteric effects of ALPL mutations and role of allostery in the pathogenesis of HPPs.
Collapse
Affiliation(s)
- Fei Xiao
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Ziyun Zhou
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Xingyu Song
- Department of Chemistry, Multiscale Research Institute of Complex Systems and Institute of Biomedical Sciences, Fudan University, Shanghai, China
| | - Mi Gan
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Jie Long
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Gennady Verkhivker
- Department of Computational and Data Sciences, Chapman University, One University Drive, Orange, California, United States of America
- Department of Biomedical and Pharmaceutical Sciences, Chapman University Pharmacy School 9401 Jeronimo Rd, Irvine, California, United States of America
| | - Guang Hu
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
- * E-mail:
| |
Collapse
|
29
|
Zhang H, Shan G, Yang B. Optimized Elastic Network Models With Direct Characterization of Inter-Residue Cooperativity for Protein Dynamics. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:1064-1074. [PMID: 32915744 DOI: 10.1109/tcbb.2020.3023147] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
The elastic network models (ENMs)are known as representative coarse-grained models to capture essential dynamics of proteins. Due to simple designs of the force constants as a decay with spatial distances of residue pairs in many previous studies, there is still much room for the improvement of ENMs. In this article, we directly computed the force constants with the inverse covariance estimation using a ridge-type operater for the precision matrix estimation (ROPE)on a large-scale set of NMR ensembles. Distance-dependent statistical analyses on the force constants were further comprehensively performed in terms of several paired types of sequence and structural information, including secondary structure, relative solvent accessibility, sequence distance and terminal. Various distinguished distributions of the mean force constants highlight the structural and sequential characteristics coupled with the inter-residue cooperativity beyond the spatial distances. We finally integrated these structural and sequential characteristics to build novel ENM variations using the particle swarm optimization for the parameter estimation. The considerable improvements on the correlation coefficient of the mean-square fluctuation and the mode overlap were achieved by the proposed variations when compared with traditional ENMs. This study opens a novel way to develop more accurate elastic network models for protein dynamics.
Collapse
|
30
|
Rath SL, Padhi AK, Mandal N. Scanning the RBD-ACE2 molecular interactions in Omicron variant. Biochem Biophys Res Commun 2022; 592:18-23. [PMID: 35007846 PMCID: PMC8732900 DOI: 10.1016/j.bbrc.2022.01.006] [Citation(s) in RCA: 37] [Impact Index Per Article: 18.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 12/23/2021] [Accepted: 01/04/2022] [Indexed: 12/12/2022]
Abstract
The emergence of new SARS-CoV-2 variants poses a threat to the human population where it is difficult to assess the severity of a particular variant of the virus. Spike protein and specifically its receptor binding domain (RBD) which makes direct interaction with the ACE2 receptor of the human has shown prominent amino acid substitutions in most of the Variants of Concern. Here, by using all-atom molecular dynamics simulations we compare the interaction of Wild-type RBD/ACE2 receptor complex with that of the latest Omicron variant of the virus. We observed a very interesting diversification of the charge, dynamics and energetics of the protein complex formed upon mutations. These results would help us in understanding the molecular basis of binding of the Omicron variant with that of SARS-CoV-2 Wild-type.
Collapse
Affiliation(s)
- Soumya Lipsa Rath
- National Institute of Technology, Warangal, Telangana, 506004, India.
| | - Aditya K Padhi
- Laboratory for Structural Bioinformatics, Center for Biosystems Dynamics Research, RIKEN, 1-7-22 Suehiro, Tsurumi, Yokohama, Kanagawa, 230-0045, Japan
| | - Nabanita Mandal
- National Institute of Technology, Warangal, Telangana, 506004, India
| |
Collapse
|
31
|
Lai J, Yang J, Gamsiz Uzun ED, Rubenstein BM, Sarkar IN. LYRUS: a machine learning model for predicting the pathogenicity of missense variants. BIOINFORMATICS ADVANCES 2021; 2:vbab045. [PMID: 35036922 PMCID: PMC8754197 DOI: 10.1093/bioadv/vbab045] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Revised: 12/08/2021] [Accepted: 12/21/2021] [Indexed: 01/27/2023]
Abstract
SUMMARY Single amino acid variations (SAVs) are a primary contributor to variations in the human genome. Identifying pathogenic SAVs can provide insights to the genetic architecture of complex diseases. Most approaches for predicting the functional effects or pathogenicity of SAVs rely on either sequence or structural information. This study presents 〈Lai Yang Rubenstein Uzun Sarkar〉 (LYRUS), a machine learning method that uses an XGBoost classifier to predict the pathogenicity of SAVs. LYRUS incorporates five sequence-based, six structure-based and four dynamics-based features. Uniquely, LYRUS includes a newly proposed sequence co-evolution feature called the variation number. LYRUS was trained using a dataset that contains 4363 protein structures corresponding to 22 639 SAVs from the ClinVar database, and tested using the VariBench testing dataset. Performance analysis showed that LYRUS achieved comparable performance to current variant effect predictors. LYRUS's performance was also benchmarked against six Deep Mutational Scanning datasets for PTEN and TP53. AVAILABILITY AND IMPLEMENTATION LYRUS is freely available and the source code can be found at https://github.com/jiaying2508/LYRUS. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Jiaying Lai
- Center for Biomedical Informatics, Brown University, Providence, RI 02903, USA,Center for Computational Molecular Biology, Brown University, Providence, RI 02906, USA
| | - Jordan Yang
- Department of Chemistry, Brown University, Providence, RI 02906, USA
| | - Ece D Gamsiz Uzun
- Center for Computational Molecular Biology, Brown University, Providence, RI 02906, USA,Department of Pathology and Laboratory Medicine, Brown University Alpert Medical School, Providence, RI 02903, USA,Department of Pathology, Rhode Island Hospital, Providence, RI 02903, USA
| | - Brenda M Rubenstein
- Center for Computational Molecular Biology, Brown University, Providence, RI 02906, USA,Department of Chemistry, Brown University, Providence, RI 02906, USA,To whom correspondence should be addressed. and
| | - Indra Neil Sarkar
- Center for Biomedical Informatics, Brown University, Providence, RI 02903, USA,Rhode Island Quality Institute, Providence, RI 02908, USA,To whom correspondence should be addressed. and
| |
Collapse
|
32
|
Zhang H, He J, Hu G, Zhu F, Jiang H, Gao J, Zhou H, Lin H, Wang Y, Chen K, Meng F, Hao M, Zhao K, Luo C, Liang Z. Dynamics of Post-Translational Modification Inspires Drug Design in the Kinase Family. J Med Chem 2021; 64:15111-15125. [PMID: 34668699 DOI: 10.1021/acs.jmedchem.1c01076] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Post-translational modification (PTM) on protein plays important roles in the regulation of cellular function and disease pathogenesis. The systematic analysis of PTM dynamics presents great opportunities to enlarge the target space by PTM allosteric regulation. Here, we presented a framework by integrating the sequence, structural topology, and particular dynamics features to characterize the functional context and druggabilities of PTMs in the well-known kinase family. The machine learning models with these biophysical features could successfully predict PTMs. On the other hand, PTMs were identified to be significantly enriched in the reported allosteric pockets and the allosteric potential of PTM pockets were thus proposed through these biophysical features. In the end, the covalent inhibitor DC-Srci-6668 targeting the PTM pocket in c-Src kinase was identified, which inhibited the phosphorylation and locked c-Src in the inactive state. Our findings represent a crucial step toward PTM-inspired drug design in the kinase family.
Collapse
Affiliation(s)
- Huimin Zhang
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China.,Drug Discovery and Design Center, the Center for Chemical Biology, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China.,School of Life Science and Technology, Shanghai Tech University, 100 Haike Road, Shanghai 201210, China.,University of Chinese Academy of Sciences (UCAS), 19 Yuquan Road, Beijing 100049, China
| | - Jixiao He
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China
| | - Guang Hu
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China
| | - Fei Zhu
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China
| | - Hao Jiang
- Drug Discovery and Design Center, the Center for Chemical Biology, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China.,University of Chinese Academy of Sciences (UCAS), 19 Yuquan Road, Beijing 100049, China
| | - Jing Gao
- Drug Discovery and Design Center, the Center for Chemical Biology, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China.,University of Chinese Academy of Sciences (UCAS), 19 Yuquan Road, Beijing 100049, China
| | - Hu Zhou
- Drug Discovery and Design Center, the Center for Chemical Biology, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China.,University of Chinese Academy of Sciences (UCAS), 19 Yuquan Road, Beijing 100049, China
| | - Hua Lin
- Biomedical Research Center of South China, College of Life Sciences, Fujian Normal University, 1 Keji Road, Fuzhou 350117, China
| | - Yingjuan Wang
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China
| | - Kaixian Chen
- Drug Discovery and Design Center, the Center for Chemical Biology, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China.,School of Life Science and Technology, Shanghai Tech University, 100 Haike Road, Shanghai 201210, China.,University of Chinese Academy of Sciences (UCAS), 19 Yuquan Road, Beijing 100049, China
| | - Fanwang Meng
- Department of Chemistry and Chemical Biology, McMaster University, Hamilton, ON L8S 4L8, Canada
| | - Minghong Hao
- Ensem Therapeutics, Inc., 200 Boston Avenue, Medford, Massachusetts 02155, United States
| | - Kehao Zhao
- School of Pharmacy, Key Laboratory of Molecular Pharmacology and Drug Evaluation (Yantai University), Ministry of Education, Collaborative Innovation Center of Advanced Drug Delivery System and Biotech Drugs in Universities of Shandong, Yantai University, Yantai 264005, China
| | - Cheng Luo
- Drug Discovery and Design Center, the Center for Chemical Biology, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China.,School of Life Science and Technology, Shanghai Tech University, 100 Haike Road, Shanghai 201210, China.,University of Chinese Academy of Sciences (UCAS), 19 Yuquan Road, Beijing 100049, China.,School of Pharmaceutical Science and Technology, Hangzhou Institute for Advanced Study, UCAS, Hangzhou 310024, China
| | - Zhongjie Liang
- Center for Systems Biology, Department of Bioinformatics, School of Biology and Basic Medical Sciences, Soochow University, Suzhou 215123, China
| |
Collapse
|
33
|
Pacini L, Dorantes-Gilardi R, Vuillon L, Lesieur C. Mapping Function from Dynamics: Future Challenges for Network-Based Models of Protein Structures. Front Mol Biosci 2021; 8:744646. [PMID: 34708077 PMCID: PMC8543124 DOI: 10.3389/fmolb.2021.744646] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Accepted: 08/19/2021] [Indexed: 11/25/2022] Open
Abstract
Proteins fulfill complex and diverse biological functions through the controlled atomic motions of their structures (functional dynamics). The protein composition is given by its amino-acid sequence, which was assumed to encode the function. However, the discovery of functional sequence variants proved that the functional encoding does not come down to the sequence, otherwise a change in the sequence would mean a change of function. Likewise, the discovery that function is fulfilled by a set of structures and not by a unique structure showed that the functional encoding does not come down to the structure either. That leaves us with the possibility that a set of atomic motions, achievable by different sequences and different structures, encodes a specific function. Thanks to the exponential growth in annual depositions in the Protein Data Bank of protein tridimensional structures at atomic resolutions, network models using the Cartesian coordinates of atoms of a protein structure as input have been used over 20 years to investigate protein features. Combining networks with experimental measures or with Molecular Dynamics (MD) simulations and using typical or ad-hoc network measures is well suited to decipher the link between protein dynamics and function. One perspective is to consider static structures alone as alternatives to address the question and find network measures relevant to dynamics that can be subsequently used for mining and classification of dynamic sequence changes functionally robust, adaptable or faulty. This way the set of dynamics that fulfill a function over a diversity of sequences and structures will be determined.
Collapse
Affiliation(s)
- Lorenza Pacini
- Ecole Centrale de Lyon, Ampère, UMR5005, Univ. Lyon, CNRS, INSA Lyon, Université Claude Bernard Lyon 1, Villeurbanne, France
- Institut Rhônalpin des Systèmes Complexes, IXXI-ENS-Lyon, Lyon, France
| | - Rodrigo Dorantes-Gilardi
- Institut Rhônalpin des Systèmes Complexes, IXXI-ENS-Lyon, Lyon, France
- USMB, CNRS, LAMA UMR5127, Le Bourget du Lac, France
| | | | - Claire Lesieur
- Ecole Centrale de Lyon, Ampère, UMR5005, Univ. Lyon, CNRS, INSA Lyon, Université Claude Bernard Lyon 1, Villeurbanne, France
- Institut Rhônalpin des Systèmes Complexes, IXXI-ENS-Lyon, Lyon, France
| |
Collapse
|
34
|
Woodard J, Zheng W, Zhang Y. Protein structural features predict responsiveness to pharmacological chaperone treatment for three lysosomal storage disorders. PLoS Comput Biol 2021; 17:e1009370. [PMID: 34529671 PMCID: PMC8478239 DOI: 10.1371/journal.pcbi.1009370] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 09/28/2021] [Accepted: 08/21/2021] [Indexed: 12/15/2022] Open
Abstract
Three-dimensional structures of proteins can provide important clues into the efficacy of personalized treatment. We perform a structural analysis of variants within three inherited lysosomal storage disorders, comparing variants responsive to pharmacological chaperone treatment to those unresponsive to such treatment. We find that predicted ΔΔG of mutation is higher on average for variants unresponsive to treatment, in the case of datasets for both Fabry disease and Pompe disease, in line with previous findings. Using both a single decision tree and an advanced machine learning approach based on the larger Fabry dataset, we correctly predict responsiveness of three Gaucher disease variants, and we provide predictions for untested variants. Many variants are predicted to be responsive to treatment, suggesting that drug-based treatments may be effective for a number of variants in Gaucher disease. In our analysis, we observe dependence on a topological feature reporting on contact arrangements which is likely connected to the order of folding of protein residues, and we provide a potential justification for this observation based on steady-state cellular kinetics. Pharmacological chaperones are small molecule drugs that bind to proteins to help stabilize the folded state. One set of diseases for which this treatment has been effective is the lysosomal storage disorders, which are caused by defective lysosomal enzymes. However, not all genotypes are equally responsive to treatment. For instance, missense mutants that are particularly destabilized relative to WT are less likely to respond. The availability of datasets containing responsiveness data for large numbers of mutants, along with crystal structures of the protein involved in each disease, make machine learning methods incorporating sequence-based and structural data feasible. We hypothesize that data from two diseases, Fabry and Pompe disease, may be useful for predicting responsiveness of variants in the related Gaucher disease. Results suggest that many rare variants in Gaucher disease could be amenable to existing drugs. Results also suggest that drug responsiveness depends on protein topology in such a way that mutations in early-to-fold residues are more likely to be non-responsive to pharmacological chaperone treatment, which is consistent with a simple kinetic model of stability rescue. This study provides an example of how machine learning can be used to inform further studies towards personalized treatment in medicine.
Collapse
Affiliation(s)
- Jaie Woodard
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Wei Zheng
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Biological Chemistry, University of Michigan, Ann Arbor, Michigan, United States of America
- * E-mail:
| |
Collapse
|
35
|
Tripathi S, Dsouza NR, Urrutia R, Zimmermann MT. Structural bioinformatics enhances mechanistic interpretation of genomic variation, demonstrated through the analyses of 935 distinct RAS family mutations. Bioinformatics 2021; 37:1367-1375. [PMID: 33226070 PMCID: PMC8208742 DOI: 10.1093/bioinformatics/btaa972] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Revised: 10/04/2020] [Accepted: 11/11/2020] [Indexed: 12/26/2022] Open
Abstract
MOTIVATION Protein-coding genetic alterations are frequently observed in Clinical Genetics, but the high yield of variants of uncertain significance remains a limitation in decision making. RAS-family GTPases are cancer drivers, but only 54 variants, across all family members, fall within well-known hotspots. However, extensive sequencing has identified 881 non-hotspot variants for which significance remains to be investigated. RESULTS Here, we evaluate 935 missense variants from seven RAS genes, observed in cancer, RASopathies and the healthy adult population. We characterized hotspot variants, previously studied experimentally, using 63 sequence- and 3D structure-based scores, chosen by their breadth of biophysical properties. Applying scores that display best correlation with experimental measures, we report new valuable mechanistic inferences for both hot-spot and non-hotspot variants. Moreover, we demonstrate that 3D scores have little-to-no correlation with those based on DNA sequence, which are commonly used in Clinical Genetics. Thus, combined, these new knowledge bear significant relevance. AVAILABILITY AND IMPLEMENTATION All genomic and 3D scores, and markdown for generating figures, are provided in our supplemental data. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Swarnendu Tripathi
- Bioinformatics Research and Development Laboratory, Genomic Sciences and Precision Medicine Center, Milwaukee, WI 53226, USA.,Precision Medicine Simulation Unit, Genomic Sciences and Precision Medicine Center, Milwaukee, WI 53226, USA
| | - Nikita R Dsouza
- Bioinformatics Research and Development Laboratory, Genomic Sciences and Precision Medicine Center, Milwaukee, WI 53226, USA.,Precision Medicine Simulation Unit, Genomic Sciences and Precision Medicine Center, Milwaukee, WI 53226, USA
| | - Raul Urrutia
- Precision Medicine Simulation Unit, Genomic Sciences and Precision Medicine Center, Milwaukee, WI 53226, USA.,Department of Surgery, Genomic Sciences and Precision Medicine Center, Milwaukee, WI 53226, USA
| | - Michael T Zimmermann
- Bioinformatics Research and Development Laboratory, Genomic Sciences and Precision Medicine Center, Milwaukee, WI 53226, USA.,Precision Medicine Simulation Unit, Genomic Sciences and Precision Medicine Center, Milwaukee, WI 53226, USA.,Clinical and Translational Sciences Institute, Genomic Sciences and Precision Medicine Center, Milwaukee, WI 53226, USA.,Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI 53226, USA
| |
Collapse
|
36
|
The structure-based cancer-related single amino acid variation prediction. Sci Rep 2021; 11:13599. [PMID: 34193921 PMCID: PMC8245468 DOI: 10.1038/s41598-021-92793-w] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Accepted: 06/16/2021] [Indexed: 11/09/2022] Open
Abstract
Single amino acid variation (SAV) is an amino acid substitution of the protein sequence that can potentially influence the entire protein structure or function, as well as its binding affinity. Protein destabilization is related to diseases, including several cancers, although using traditional experiments to clarify the relationship between SAVs and cancer uses much time and resources. Some SAV prediction methods use computational approaches, with most predicting SAV-induced changes in protein stability. In this investigation, all SAV characteristics generated from protein sequences, structures and the microenvironment were converted into feature vectors and fed into an integrated predicting system using a support vector machine and genetic algorithm. Critical features were used to estimate the relationship between their properties and cancers caused by SAVs. We describe how we developed a prediction system based on protein sequences and structure that is capable of distinguishing if the SAV is related to cancer or not. The five-fold cross-validation performance of our system is 89.73% for the accuracy, 0.74 for the Matthews correlation coefficient, and 0.81 for the F1 score. We have built an online prediction server, CanSavPre ( http://bioinfo.cmu.edu.tw/CanSavPre/ ), which is expected to become a useful, practical tool for cancer research and precision medicine.
Collapse
|
37
|
Mikulska-Ruminska K, Anthonymuthu TS, Levkina A, Shrivastava IH, Kapralov AA, Bayır H, Kagan VE, Bahar I. NO ● Represses the Oxygenation of Arachidonoyl PE by 15LOX/PEBP1: Mechanism and Role in Ferroptosis. Int J Mol Sci 2021; 22:ijms22105253. [PMID: 34067535 PMCID: PMC8156958 DOI: 10.3390/ijms22105253] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2021] [Revised: 05/06/2021] [Accepted: 05/12/2021] [Indexed: 12/18/2022] Open
Abstract
We recently discovered an anti-ferroptotic mechanism inherent to M1 macrophages whereby high levels of NO● suppressed ferroptosis via inhibition of hydroperoxy-eicosatetraenoyl-phosphatidylethanolamine (HpETE-PE) production by 15-lipoxygenase (15LOX) complexed with PE-binding protein 1 (PEBP1). However, the mechanism of NO● interference with 15LOX/PEBP1 activity remained unclear. Here, we use a biochemical model of recombinant 15LOX-2 complexed with PEBP1, LC-MS redox lipidomics, and structure-based modeling and simulations to uncover the mechanism through which NO● suppresses ETE-PE oxidation. Our study reveals that O2 and NO● use the same entry pores and channels connecting to 15LOX-2 catalytic site, resulting in a competition for the catalytic site. We identified residues that direct O2 and NO● to the catalytic site, as well as those stabilizing the esterified ETE-PE phospholipid tail. The functional significance of these residues is supported by in silico saturation mutagenesis. We detected nitrosylated PE species in a biochemical system consisting of 15LOX-2/PEBP1 and NO● donor and in RAW264.7 M2 macrophages treated with ferroptosis-inducer RSL3 in the presence of NO●, in further support of the ability of NO● to diffuse to, and react at, the 15LOX-2 catalytic site. The results provide first insights into the molecular mechanism of repression of the ferroptotic Hp-ETE-PE production by NO●.
Collapse
Affiliation(s)
- Karolina Mikulska-Ruminska
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15260, USA;
- Institute of Physics, Faculty of Physics, Astronomy and Informatics, Nicolaus Copernicus University in Toruń, Grudziadzka 5, 87-100 Toruń, Poland
- Correspondence: (K.M.-R.); (V.E.K.); (I.B.)
| | - Tamil S. Anthonymuthu
- Department of Critical Care Medicine, Safar Center for Resuscitation Research, Children’s Neuroscience Institute, Children’s Hospital of Pittsburgh, University of Pittsburgh, Pittsburgh, PA 15260, USA; (T.S.A.); (H.B.)
| | - Anastasia Levkina
- Department of Environmental and Occupational Health and Center for Free Radical and Antioxidant Health, University of Pittsburgh, Pittsburgh, PA 15260, USA; (A.L.); (A.A.K.)
- Institute of Translational Medicine, Pirogov Russian National Research Medical University, Ostrovityanova 1, 117997 Moscow, Russia
| | - Indira H. Shrivastava
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15260, USA;
- Department of Environmental and Occupational Health and Center for Free Radical and Antioxidant Health, University of Pittsburgh, Pittsburgh, PA 15260, USA; (A.L.); (A.A.K.)
| | - Alexandr A. Kapralov
- Department of Environmental and Occupational Health and Center for Free Radical and Antioxidant Health, University of Pittsburgh, Pittsburgh, PA 15260, USA; (A.L.); (A.A.K.)
| | - Hülya Bayır
- Department of Critical Care Medicine, Safar Center for Resuscitation Research, Children’s Neuroscience Institute, Children’s Hospital of Pittsburgh, University of Pittsburgh, Pittsburgh, PA 15260, USA; (T.S.A.); (H.B.)
- Department of Environmental and Occupational Health and Center for Free Radical and Antioxidant Health, University of Pittsburgh, Pittsburgh, PA 15260, USA; (A.L.); (A.A.K.)
| | - Valerian E. Kagan
- Department of Environmental and Occupational Health and Center for Free Radical and Antioxidant Health, University of Pittsburgh, Pittsburgh, PA 15260, USA; (A.L.); (A.A.K.)
- Department of Radiation Oncology, University of Pittsburgh, Pittsburgh, PA 15260, USA
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA 15260, USA
- Department of Pharmacology and Chemical Biology, University of Pittsburgh, Pittsburgh, PA 15260, USA
- Institute of Regenerative Medicine, IM Sechenov Moscow State Medical University, 119048 Moscow, Russia
- Correspondence: (K.M.-R.); (V.E.K.); (I.B.)
| | - Ivet Bahar
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15260, USA;
- Correspondence: (K.M.-R.); (V.E.K.); (I.B.)
| |
Collapse
|
38
|
Bucher M, Niebling S, Han Y, Molodenskiy D, Hassani Nia F, Kreienkamp HJ, Svergun D, Kim E, Kostyukova AS, Kreutz MR, Mikhaylova M. Autism-associated SHANK3 missense point mutations impact conformational fluctuations and protein turnover at synapses. eLife 2021; 10:66165. [PMID: 33945465 PMCID: PMC8169116 DOI: 10.7554/elife.66165] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2020] [Accepted: 05/01/2021] [Indexed: 12/18/2022] Open
Abstract
Members of the SH3- and ankyrin repeat (SHANK) protein family are considered as master scaffolds of the postsynaptic density of glutamatergic synapses. Several missense mutations within the canonical SHANK3 isoform have been proposed as causative for the development of autism spectrum disorders (ASDs). However, there is a surprising paucity of data linking missense mutation-induced changes in protein structure and dynamics to the occurrence of ASD-related synaptic phenotypes. In this proof-of-principle study, we focus on two ASD-associated point mutations, both located within the same domain of SHANK3 and demonstrate that both mutant proteins indeed show distinct changes in secondary and tertiary structure as well as higher conformational fluctuations. Local and distal structural disturbances result in altered synaptic targeting and changes of protein turnover at synaptic sites in rat primary hippocampal neurons.
Collapse
Affiliation(s)
- Michael Bucher
- AG Optobiology, Institute of Biology, Humboldt-University, Berlin, Germany.,DFG Emmy Noether Guest Group 'Neuronal Protein Transport', Institute for Molecular Neurogenetics, Center for Molecular Neurobiology (ZMNH), University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany.,RG Neuroplasticity, Leibniz-Institute for Neurobiology (LIN), Magdeburg, Germany
| | - Stephan Niebling
- Molecular Biophysics and High-Throughput Crystallization, European Molecular Biology Laboratory (EMBL), Hamburg, Germany
| | - Yuhao Han
- DFG Emmy Noether Guest Group 'Neuronal Protein Transport', Institute for Molecular Neurogenetics, Center for Molecular Neurobiology (ZMNH), University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany.,Structural Cell Biology of Viruses, Centre for Structural Systems Biology (CSSB) and Leibniz Institute for Experimental Virology, Hamburg, Germany
| | - Dmitry Molodenskiy
- European Molecular Biology Laboratory (EMBL) Hamburg Unit, DESY, Hamburg, Germany
| | - Fatemeh Hassani Nia
- Institute of Human Genetics, Center for Obstetrics and Pediatrics, University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
| | - Hans-Jürgen Kreienkamp
- Institute of Human Genetics, Center for Obstetrics and Pediatrics, University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
| | - Dmitri Svergun
- European Molecular Biology Laboratory (EMBL) Hamburg Unit, DESY, Hamburg, Germany
| | - Eunjoon Kim
- Center for Synaptic Brain Dysfunctions, Institute for Basic Science (IBS) and Department of Biological Sciences, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Republic of Korea
| | - Alla S Kostyukova
- DFG Emmy Noether Guest Group 'Neuronal Protein Transport', Institute for Molecular Neurogenetics, Center for Molecular Neurobiology (ZMNH), University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany.,The Gene and Linda Voiland School of Chemical Engineering and Bioengineering, Washington State University (WSU), Pullman, United States
| | - Michael R Kreutz
- RG Neuroplasticity, Leibniz-Institute for Neurobiology (LIN), Magdeburg, Germany.,Leibniz Group 'Dendritic Organelles and Synaptic Function', Center for Molecular Neurobiology (ZMNH), University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany.,German Center for Neurodegenerative Diseases, Magdeburg, Germany.,Center for Behavioral Brain Sciences, Magdeburg, Germany
| | - Marina Mikhaylova
- AG Optobiology, Institute of Biology, Humboldt-University, Berlin, Germany.,DFG Emmy Noether Guest Group 'Neuronal Protein Transport', Institute for Molecular Neurogenetics, Center for Molecular Neurobiology (ZMNH), University Medical Center Hamburg-Eppendorf (UKE), Hamburg, Germany
| |
Collapse
|
39
|
Echave J. Fast computational mutation-response scanning of proteins. PeerJ 2021; 9:e11330. [PMID: 33976988 PMCID: PMC8067912 DOI: 10.7717/peerj.11330] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Accepted: 03/31/2021] [Indexed: 12/21/2022] Open
Abstract
Studying the effect of perturbations on protein structure is a basic approach in protein research. Important problems, such as predicting pathological mutations and understanding patterns of structural evolution, have been addressed by computational simulations that model mutations using forces and predict the resulting deformations. In single mutation-response scanning simulations, a sensitivity matrix is obtained by averaging deformations over point mutations. In double mutation-response scanning simulations, a compensation matrix is obtained by minimizing deformations over pairs of mutations. These very useful simulation-based methods may be too slow to deal with large proteins, protein complexes, or large protein databases. To address this issue, I derived analytical closed formulas to calculate the sensitivity and compensation matrices directly, without simulations. Here, I present these derivations and show that the resulting analytical methods are much faster than their simulation counterparts.
Collapse
Affiliation(s)
- Julian Echave
- Instituto de Ciencias Físicas, Escuela de Ciencia y Tecnología, Universidad Nacional de San Martín, San Martín, Buenos Aires, Argentina
| |
Collapse
|
40
|
Sayılgan JF, Haliloğlu T, Gönen M. Protein dynamics analysis identifies candidate cancer driver genes and mutations in TCGA data. Proteins 2021; 89:721-730. [PMID: 33550612 DOI: 10.1002/prot.26054] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Revised: 01/04/2021] [Accepted: 01/31/2021] [Indexed: 11/09/2022]
Abstract
Recently, it has been showed that cancer missense mutations selectively target the neighborhood of hinge residues, which are key sites in protein dynamics. Here, we show that this approach can be extended to find previously unknown candidate mutations and genes. To this aim, we developed a computational pipeline to detect significantly enriched three-dimensional (3D) clustering of missense mutations around hinge residues. The hinge residues were detected by applying a Gaussian network model. By systematically analyzing the PanCancer compendium of somatic mutations in nearly 10 000 tumors from the Cancer Genome Atlas, we identified candidate genes and mutations in addition to well known ones. For instance, we found significantly enriched 3D clustering of missense mutations in known cancer genes including CDK4, CDKN2A, TCL1A, and MAPK1. Beside these known genes, we also identified significantly enriched 3D clustering of missense mutations around hinge residues in PLA2G4A, which may lead to excessive phosphorylation of the extracellular signal-regulated kinases. Furthermore, we demonstrated that hinge-based features improves pathogenicity prediction for missense mutations. Our results show that the consideration of clustering around hinge residues can help us explain the functional role of the mutations in known cancer genes and identify candidate genes.
Collapse
Affiliation(s)
- Jan Fehmi Sayılgan
- Graduate School of Sciences and Engineering, Koç University, İstanbul, Turkey
| | - Türkan Haliloğlu
- Department of Chemical Engineering, School of Engineering, Boğaziçi University, İstanbul, Turkey.,Polymer Research Center, Boğaziçi University, İstanbul, Turkey
| | - Mehmet Gönen
- Department of Industrial Engineering, College of Engineering, Koç University, İstanbul, Turkey.,School of Medicine, Koç University, İstanbul, Turkey
| |
Collapse
|
41
|
Chi YI, Stodola TJ, De Assuncao TM, Leverence EN, Tripathi S, Dsouza NR, Mathison AJ, Basel DG, Volkman BF, Smith BC, Lomberk G, Zimmermann MT, Urrutia R. Molecular mechanics and dynamic simulations of well-known Kabuki syndrome-associated KDM6A variants reveal putative mechanisms of dysfunction. Orphanet J Rare Dis 2021; 16:66. [PMID: 33546721 PMCID: PMC7866879 DOI: 10.1186/s13023-021-01692-w] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Accepted: 01/15/2021] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Kabuki syndrome is a genetic disorder that affects several body systems and presents with variations in symptoms and severity. The syndrome is named for a common phenotype of faces resembling stage makeup used in a Japanese traditional theatrical art named kabuki. The most frequent cause of this syndrome is mutations in the H3K4 family of histone methyltransferases while a smaller percentage results from genetic alterations affecting the histone demethylase, KDM6A. Because of the rare presentation of the latter form of the disease, little is known about how missense changes in the KDM6A protein sequence impact protein function. RESULTS In this study, we use molecular mechanic and molecular dynamic simulations to enhance the annotation and mechanistic interpretation of the potential impact of eleven KDM6A missense variants found in Kabuki syndrome patients. These variants (N910S, D980V, S1025G, C1153R, C1153Y, P1195L, L1200F, Q1212R, Q1248R, R1255W, and R1351Q) are predicted to be pathogenic, likely pathogenic or of uncertain significance by sequence-based analysis. Here, we demonstrate, for the first time, that although Kabuki syndrome missense variants are found outside the functionally critical regions, they could affect overall function by significantly disrupting global and local conformation (C1153R, C1153Y, P1195L, L1200F, Q1212R, Q1248R, R1255W and R1351Q), chemical environment (C1153R, C1153Y, P1195L, L1200F, Q1212R, Q1248R, R1255W and R1351Q), and/or molecular dynamics of the catalytic domain (all variants). In addition, our approaches predict that many mutations, in particular C1153R, could allosterically disrupt the key enzymatic interactions of KDM6A. CONCLUSIONS Our study demonstrates that the KDM6A Kabuki syndrome variants may impair histone demethylase function through various mechanisms that include altered protein integrity, local environment, molecular interactions and protein dynamics. Molecular dynamics simulations of the wild type and the variants are critical to gain a better understanding of molecular dysfunction. This type of comprehensive structure- and MD-based analyses should help develop improved impact scoring systems to interpret the damaging effects of variants in this protein and other related proteins as well as provide detailed mechanistic insight that is not currently predictable from sequence alone.
Collapse
Affiliation(s)
- Young-In Chi
- Genomic Sciences and Precision Medicine Center (GSPMC), Medical College of Wisconsin, Milwaukee, WI, USA.,Bioinformatics Research and Development Laboratory, and Precision Medicine Simulation Unit, GSPMC, Medical College of Wisconsin, Milwaukee, WI, USA.,Division of Research, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Timothy J Stodola
- Genomic Sciences and Precision Medicine Center (GSPMC), Medical College of Wisconsin, Milwaukee, WI, USA.,Bioinformatics Research and Development Laboratory, and Precision Medicine Simulation Unit, GSPMC, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Thiago M De Assuncao
- Genomic Sciences and Precision Medicine Center (GSPMC), Medical College of Wisconsin, Milwaukee, WI, USA.,Division of Research, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Elise N Leverence
- Genomic Sciences and Precision Medicine Center (GSPMC), Medical College of Wisconsin, Milwaukee, WI, USA
| | - Swarnendu Tripathi
- Genomic Sciences and Precision Medicine Center (GSPMC), Medical College of Wisconsin, Milwaukee, WI, USA.,Bioinformatics Research and Development Laboratory, and Precision Medicine Simulation Unit, GSPMC, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Nikita R Dsouza
- Genomic Sciences and Precision Medicine Center (GSPMC), Medical College of Wisconsin, Milwaukee, WI, USA.,Bioinformatics Research and Development Laboratory, and Precision Medicine Simulation Unit, GSPMC, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Angela J Mathison
- Genomic Sciences and Precision Medicine Center (GSPMC), Medical College of Wisconsin, Milwaukee, WI, USA.,Bioinformatics Research and Development Laboratory, and Precision Medicine Simulation Unit, GSPMC, Medical College of Wisconsin, Milwaukee, WI, USA.,Division of Research, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Donald G Basel
- Genomic Sciences and Precision Medicine Center (GSPMC), Medical College of Wisconsin, Milwaukee, WI, USA.,Division of Pediatric Genetics, Department of Pediatrics, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Brian F Volkman
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Brian C Smith
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Gwen Lomberk
- Genomic Sciences and Precision Medicine Center (GSPMC), Medical College of Wisconsin, Milwaukee, WI, USA.,Division of Research, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, USA.,Department of Pharmacology and Toxicology, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Michael T Zimmermann
- Genomic Sciences and Precision Medicine Center (GSPMC), Medical College of Wisconsin, Milwaukee, WI, USA.,Bioinformatics Research and Development Laboratory, and Precision Medicine Simulation Unit, GSPMC, Medical College of Wisconsin, Milwaukee, WI, USA.,Clinical and Translational Sciences Institute, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Raul Urrutia
- Genomic Sciences and Precision Medicine Center (GSPMC), Medical College of Wisconsin, Milwaukee, WI, USA. .,Bioinformatics Research and Development Laboratory, and Precision Medicine Simulation Unit, GSPMC, Medical College of Wisconsin, Milwaukee, WI, USA. .,Division of Research, Department of Surgery, Medical College of Wisconsin, Milwaukee, WI, USA. .,Division of Pediatric Genetics, Department of Pediatrics, Medical College of Wisconsin, Milwaukee, WI, USA.
| |
Collapse
|
42
|
ADDRESS: A Database of Disease-associated Human Variants Incorporating Protein Structure and Folding Stabilities. J Mol Biol 2021; 433:166840. [PMID: 33539887 DOI: 10.1016/j.jmb.2021.166840] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 01/17/2021] [Accepted: 01/20/2021] [Indexed: 11/22/2022]
Abstract
Numerous human diseases are caused by mutations in genomic sequences. Since amino acid changes affect protein function through mechanisms often predictable from protein structure, the integration of structural and sequence data enables us to estimate with greater accuracy whether and how a given mutation will lead to disease. Publicly available annotated databases enable hypothesis assessment and benchmarking of prediction tools. However, the results are often presented as summary statistics or black box predictors, without providing full descriptive information. We developed a new semi-manually curated human variant database presenting information on the protein contact-map, sequence-to-structure mapping, amino acid identity change, and stability prediction for the popular UniProt database. We found that the profiles of pathogenic and benign missense polymorphisms can be effectively deduced using decision trees and comparative analyses based on the presented dataset. The database is made publicly available through https://zhanglab.ccmb.med.umich.edu/ADDRESS.
Collapse
|
43
|
Sarkar A, Yang Y, Vihinen M. Variation benchmark datasets: update, criteria, quality and applications. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020; 2020:5710862. [PMID: 32016318 PMCID: PMC6997940 DOI: 10.1093/database/baz117] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Revised: 06/03/2019] [Accepted: 07/01/2019] [Indexed: 02/07/2023]
Abstract
Development of new computational methods and testing their performance has to be carried out using experimental data. Only in comparison to existing knowledge can method performance be assessed. For that purpose, benchmark datasets with known and verified outcome are needed. High-quality benchmark datasets are valuable and may be difficult, laborious and time consuming to generate. VariBench and VariSNP are the two existing databases for sharing variation benchmark datasets used mainly for variation interpretation. They have been used for training and benchmarking predictors for various types of variations and their effects. VariBench was updated with 419 new datasets from 109 papers containing altogether 329 014 152 variants; however, there is plenty of redundancy between the datasets. VariBench is freely available at http://structure.bmc.lu.se/VariBench/. The contents of the datasets vary depending on information in the original source. The available datasets have been categorized into 20 groups and subgroups. There are datasets for insertions and deletions, substitutions in coding and non-coding region, structure mapped, synonymous and benign variants. Effect-specific datasets include DNA regulatory elements, RNA splicing, and protein property for aggregation, binding free energy, disorder and stability. Then there are several datasets for molecule-specific and disease-specific applications, as well as one dataset for variation phenotype effects. Variants are often described at three molecular levels (DNA, RNA and protein) and sometimes also at the protein structural level including relevant cross references and variant descriptions. The updated VariBench facilitates development and testing of new methods and comparison of obtained performances to previously published methods. We compared the performance of the pathogenicity/tolerance predictor PON-P2 to several benchmark studies, and show that such comparisons are feasible and useful, however, there may be limitations due to lack of provided details and shared data. Database URL: http://structure.bmc.lu.se/VariBench
Collapse
Affiliation(s)
- Anasua Sarkar
- Department of Experimental Medical Science, BMC B13, Lund University, SE-22 184 Lund, Sweden
| | - Yang Yang
- School of Computer Science and Technology, Soochow University, No1. Shizi Street, Suzhou, 215006 Jiangsu, China.,Provincial Key Laboratory for Computer Information Processing Technology, No1. Shizi Street, Soochow University, Suzhou, 215006 Jiangsu, China
| | - Mauno Vihinen
- Department of Experimental Medical Science, BMC B13, Lund University, SE-22 184 Lund, Sweden
| |
Collapse
|
44
|
Gao S, Tao R, Tong X, Xu Q, Zhao J, Guo Y, Schinckel AP, Zhou B. Identification of Functional Single Nucleotide Polymorphisms in Porcine HSD17B14 Gene Associated with Estrus Behavior Difference between Large White and Mi Gilts. Biomolecules 2020; 10:biom10111545. [PMID: 33198360 PMCID: PMC7697482 DOI: 10.3390/biom10111545] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2020] [Revised: 11/09/2020] [Accepted: 11/11/2020] [Indexed: 11/16/2022] Open
Abstract
Steroid hormone levels are associated with estrous behavior, which affects timely mating and reproductive efficiency in pigs. 17β-hydroxysteroid dehydrogenase type 14 (HSD17B14) modulates steroid synthesis and metabolism. To identify the functional single nucleotide polymorphisms (SNPs) in the porcine HSD17B14 gene, ear tissues from Large White and Mi gilts were collected to extract genomic DNA. Variable lengths of truncated promoter of HSD17B14 gene were used to determine the promoter activity by a dual luciferase reporter system. The vector HSD17B14Phe or HSD17B14Val was transfected into porcine granulosa cells (GCs). The core promoter region was identified between -72bp and -218bp. Six of seven SNPs had significant differences of allele frequency between Large White and Mi gilts. The plasmids with the wild genotype AA of rs329427898 maintained a smaller fraction of promoter activity compared with the plasmids with the mutant genotype GG, while the plasmids with wild the genotype TT of rs319864566 had a greater promoter activity than the plasmids with the mutant genotype CC. A missense mutation (Phe73Val) caused changes in the structural dynamics and function of the HSD17B14 protein. The highly expressed HSD17B14Val degraded less estradiol into estrone, while the relatively lowly expressed HSD17B14Phe degraded more estradiol into estrone, suggesting the protein activity of HSD17B14Phe was greater than that of HSD17B14Val. Moreover, the HSD17B14Phe group has a greater apoptosis rate of porcine GCs. The HSD17B14 gene could been used as a candidate molecular marker for estrus behavior in pigs.
Collapse
Affiliation(s)
- Siyuan Gao
- College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (S.G.); (R.T.); (X.T.); (Q.X.); (J.Z.); (Y.G.)
| | - Ruixin Tao
- College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (S.G.); (R.T.); (X.T.); (Q.X.); (J.Z.); (Y.G.)
| | - Xian Tong
- College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (S.G.); (R.T.); (X.T.); (Q.X.); (J.Z.); (Y.G.)
| | - Qinglei Xu
- College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (S.G.); (R.T.); (X.T.); (Q.X.); (J.Z.); (Y.G.)
| | - Jing Zhao
- College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (S.G.); (R.T.); (X.T.); (Q.X.); (J.Z.); (Y.G.)
| | - Yanli Guo
- College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (S.G.); (R.T.); (X.T.); (Q.X.); (J.Z.); (Y.G.)
| | - Allan P. Schinckel
- Department of Animal Sciences, Purdue University, West Lafayette, IN 47907-2054, USA;
| | - Bo Zhou
- College of Animal Science and Technology, Nanjing Agricultural University, Nanjing 210095, China; (S.G.); (R.T.); (X.T.); (Q.X.); (J.Z.); (Y.G.)
- Correspondence: ; Tel.: +86-025-84395362
| |
Collapse
|
45
|
Verkhivker GM. Molecular Simulations and Network Modeling Reveal an Allosteric Signaling in the SARS-CoV-2 Spike Proteins. J Proteome Res 2020; 19:4587-4608. [PMID: 33006900 PMCID: PMC7640983 DOI: 10.1021/acs.jproteome.0c00654] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2020] [Indexed: 12/13/2022]
Abstract
The development of computational strategies for the quantitative characterization of the functional mechanisms of SARS-CoV-2 spike proteins is of paramount importance in efforts to accelerate the discovery of novel therapeutic agents and vaccines combating the COVID-19 pandemic. Structural and biophysical studies have recently characterized the conformational landscapes of the SARS-CoV-2 spike glycoproteins in the prefusion form, revealing a spectrum of stable and more dynamic states. By employing molecular simulations and network modeling approaches, this study systematically examined functional dynamics and identified the regulatory centers of allosteric interactions for distinct functional states of the wild-type and mutant variants of the SARS-CoV-2 prefusion spike trimer. This study presents evidence that the SARS-CoV-2 spike protein can function as an allosteric regulatory engine that fluctuates between dynamically distinct functional states. Perturbation-based modeling of the interaction networks revealed a key role of the cross-talk between the effector hotspots in the receptor binding domain and the fusion peptide proximal region of the SARS-CoV-2 spike protein. The results have shown that the allosteric hotspots of the interaction networks in the SARS-CoV-2 spike protein can control the dynamic switching between functional conformational states that are associated with virus entry to the host receptor. This study offers a useful and novel perspective on the underlying mechanisms of the SARS-CoV-2 spike protein through the lens of allosteric signaling as a regulatory apparatus of virus transmission that could open up opportunities for targeted allosteric drug discovery against SARS-CoV-2 proteins and contribute to the rapid response to the current and potential future pandemic scenarios.
Collapse
Affiliation(s)
- Gennady M. Verkhivker
- Graduate
Program in Computational and Data Sciences, Keck Center for Science
and Engineering, Schmid College of Science and Technology, Chapman University, One University Drive, Orange, California 92866, United States
- Department
of Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, Irvine, California 92618, United States
| |
Collapse
|
46
|
Ponzoni L, Peñaherrera DA, Oltvai ZN, Bahar I. Rhapsody: predicting the pathogenicity of human missense variants. Bioinformatics 2020; 36:3084-3092. [PMID: 32101277 PMCID: PMC7214033 DOI: 10.1093/bioinformatics/btaa127] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Revised: 12/27/2019] [Accepted: 02/21/2020] [Indexed: 12/22/2022] Open
Abstract
MOTIVATION The biological effects of human missense variants have been studied experimentally for decades but predicting their effects in clinical molecular diagnostics remains challenging. Available computational tools are usually based on the analysis of sequence conservation and structural properties of the mutant protein. We recently introduced a new machine learning method that demonstrated for the first time the significance of protein dynamics in determining the pathogenicity of missense variants. RESULTS Here, we present a new interface (Rhapsody) that enables fully automated assessment of pathogenicity, incorporating both sequence coevolution data and structure- and dynamics-based features. Benchmarked against a dataset of about 20 000 annotated variants, the methodology is shown to outperform well-established and/or advanced prediction tools. We illustrate the utility of Rhapsody by in silico saturation mutagenesis studies of human H-Ras, phosphatase and tensin homolog and thiopurine S-methyltransferase. AVAILABILITY AND IMPLEMENTATION The new tool is available both as an online webserver at http://rhapsody.csb.pitt.edu and as an open-source Python package (GitHub repository: https://github.com/prody/rhapsody; PyPI package installation: pip install prody-rhapsody). Links to additional resources, tutorials and package documentation are provided in the 'Python package' section of the website. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Luca Ponzoni
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Daniel A Peñaherrera
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | - Zoltán N Oltvai
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, USA.,Department of Pathology, University of Pittsburgh, Pittsburgh, PA 15261, USA.,Department of Laboratory Medicine and Pathology, University of Minnesota, Minneapolis, MN 55455, USA
| | - Ivet Bahar
- Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA 15260, USA
| |
Collapse
|
47
|
Krieger JM, Doruker P, Scott AL, Perahia D, Bahar I. Towards gaining sight of multiscale events: utilizing network models and normal modes in hybrid methods. Curr Opin Struct Biol 2020; 64:34-41. [PMID: 32622329 DOI: 10.1016/j.sbi.2020.05.013] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Revised: 05/13/2020] [Accepted: 05/20/2020] [Indexed: 11/28/2022]
Abstract
With the explosion of normal mode analyses (NMAs) based on elastic network models (ENMs) in the last decade, and the proven precision of MD simulations for visualizing interactions at atomic scale, many hybrid methods have been proposed in recent years. These aim at exploiting the best of both worlds: the atomic precision of MD that often fall short of exploring time and length scales of biological interest, and the capability of ENM-NMA to predict the cooperative and often functional rearrangements of large structures and assemblies, albeit at low resolution. We present an overview of recent progress in the field with examples of successful applications highlighting the utility of such hybrid methods and pointing to emerging future directions guided by advances in experimental characterization of biomolecular systems structure and dynamics.
Collapse
Affiliation(s)
- James M Krieger
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, 3501 Fifth Ave, Suite 3064 BST3, Pittsburgh, PA 15260, USA
| | - Pemra Doruker
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, 3501 Fifth Ave, Suite 3064 BST3, Pittsburgh, PA 15260, USA
| | - Ana Ligia Scott
- Laboratory of Bioinformatics and Computational Biology, Federal University of ABC, Santo André, SP, Brazil
| | - David Perahia
- Laboratoire de Biologie et de Pharmacologie Appliquée, Ecole Normale Superieure Paris-Saclay, UMR 8113, CNRS, 4 Avenue des Sciences, 91190 Gif-sur-Yvette, France
| | - Ivet Bahar
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, 3501 Fifth Ave, Suite 3064 BST3, Pittsburgh, PA 15260, USA.
| |
Collapse
|
48
|
Jalalypour F, Sensoy O, Atilgan C. Perturb-Scan-Pull: A Novel Method Facilitating Conformational Transitions in Proteins. J Chem Theory Comput 2020; 16:3825-3841. [PMID: 32324386 DOI: 10.1021/acs.jctc.9b01222] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Conformational transitions in proteins facilitate precise physiological functions. Therefore, it is crucial to understand the mechanisms underlying these processes to modulate protein function. Yet, studying structural and dynamical properties of proteins is notoriously challenging due to the complexity of the underlying potential energy surfaces (PES). We have previously developed the perturbation-response scanning (PRS) method to identify key residues that participate in the communication network responsible for specific conformational transitions. PRS is based on a residue-by-residue scan of the protein to determine the subset of residues/forces which provide the closest conformational change leading to a target conformational state, inasmuch as linear response theory applies to these motions. Here, we develop a novel method to further evaluate if conformational transitions may be triggered on the PES. We aim to study functionally relevant conformational transitions in proteins by using results obtained from PRS and feeding them as inputs to steered molecular dynamics simulations. The success and the transferability of the method are evaluated on three protein systems having different complexities of motion on the PES: calmodulin, adenylate kinase, and bacterial ferric binding protein. We find that the method captures the target conformation, while providing key residues and the optimum paths with relatively low free energy profiles.
Collapse
Affiliation(s)
- Farzaneh Jalalypour
- Faculty of Engineering and Natural Sciences, Sabanci University, 34956, Istanbul, Turkey
| | - Ozge Sensoy
- School of Engineering and Natural Sciences, Istanbul Medipol University, 34810, Istanbul, Turkey
| | - Canan Atilgan
- Faculty of Engineering and Natural Sciences, Sabanci University, 34956, Istanbul, Turkey.,Sabanci University Nanotechnology Research and Application Center, SUNUM, 34956, Istanbul, Turkey
| |
Collapse
|
49
|
Ponzoni L, Nguyen NH, Bahar I, Brodsky JL. Complementary computational and experimental evaluation of missense variants in the ROMK potassium channel. PLoS Comput Biol 2020; 16:e1007749. [PMID: 32251469 PMCID: PMC7162551 DOI: 10.1371/journal.pcbi.1007749] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2019] [Revised: 04/16/2020] [Accepted: 02/25/2020] [Indexed: 02/02/2023] Open
Abstract
The renal outer medullary potassium (ROMK) channel is essential for potassium transport in the kidney, and its dysfunction is associated with a salt-wasting disorder known as Bartter syndrome. Despite its physiological significance, we lack a mechanistic understanding of the molecular defects in ROMK underlying most Bartter syndrome-associated mutations. To this end, we employed a ROMK-dependent yeast growth assay and tested single amino acid variants selected by a series of computational tools representative of different approaches to predict each variants’ pathogenicity. In one approach, we used in silico saturation mutagenesis, i.e. the scanning of all possible single amino acid substitutions at all sequence positions to estimate their impact on function, and then employed a new machine learning classifier known as Rhapsody. We also used two additional tools, EVmutation and Polyphen-2, which permitted us to make consensus predictions on the pathogenicity of single amino acid variants in ROMK. Experimental tests performed for selected mutants in different classes validated the vast majority of our predictions and provided insights into variants implicated in ROMK dysfunction. On a broader scope, our analysis suggests that consolidation of data from complementary computational approaches provides an improved and facile method to predict the severity of an amino acid substitution and may help accelerate the identification of disease-causing mutations in any protein. As the number of sequenced human genomes rises, a major challenge is to identify which single amino acid variations in a protein affect function and predispose individuals to disease. While predictive algorithms are available for this purpose, a comparative analysis of recently developed algorithms has not been adequately performed, nor is it clear whether combining algorithms would improve predictive power. To this end, we compared the efficacy of three publicly available algorithms and applied the results to Bartter syndrome, a human disease for which numerous poorly-characterized single amino acid variants have been identified and for which there is no cure. In silico saturation mutagenesis, i.e., the computational prediction of pathogenesis for every possible amino acid substitution, allowed us to experimentally test predictions by measuring the activity of an ion channel linked to Bartter syndrome. Based on data from blinded experiments, we discovered that Rhapsody and EVmutation successfully predicted deleterious mutations. Moreover, Rhapsody—which takes into account evolutionary as well as structural and dynamic considerations—predicted that >90% of known Bartter syndrome mutations are deleterious. Overall, our data will aid investigators who wish to test single amino acid variants in any protein and aid biomedical researchers who wish to develop hypotheses on the potential severity of genetic variants uncovered from genome databases.
Collapse
Affiliation(s)
- Luca Ponzoni
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Nga H. Nguyen
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Ivet Bahar
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
- * E-mail: (IB); (JLB)
| | - Jeffrey L. Brodsky
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
- * E-mail: (IB); (JLB)
| |
Collapse
|
50
|
Campitelli P, Modi T, Kumar S, Ozkan SB. The Role of Conformational Dynamics and Allostery in Modulating Protein Evolution. Annu Rev Biophys 2020; 49:267-288. [PMID: 32075411 DOI: 10.1146/annurev-biophys-052118-115517] [Citation(s) in RCA: 89] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Advances in sequencing techniques and statistical methods have made it possible not only to predict sequences of ancestral proteins but also to identify thousands of mutations in the human exome, some of which are disease associated. These developments have motivated numerous theories and raised many questions regarding the fundamental principles behind protein evolution, which have been traditionally investigated horizontally using the tip of the phylogenetic tree through comparative studies of extant proteins within a family. In this article, we review a vertical comparison of the modern and resurrected ancestral proteins. We focus mainly on the dynamical properties responsible for a protein's ability to adapt new functions in response to environmental changes. Using the Dynamic Flexibility Index and the Dynamic Coupling Index to quantify the relative flexibility and dynamic coupling at a site-specific, single-amino-acid level, we provide evidence that the migration of hinges, which are often functionally critical rigid sites, is a mechanism through which proteins can rapidly evolve. Additionally, we show that disease-associated mutations in proteins often result in flexibility changes even at positions distal from mutational sites, particularly in the modulation of active site dynamics.
Collapse
Affiliation(s)
- Paul Campitelli
- Center for Biological Physics, Department of Physics, Arizona State University, Tempe, Arizona 85281, USA; , ,
| | - Tushar Modi
- Center for Biological Physics, Department of Physics, Arizona State University, Tempe, Arizona 85281, USA; , ,
| | - Sudhir Kumar
- Institute for Genomics and Evolutionary Medicine, Temple University, Philadelphia, Pennsylvania 19122, USA; .,Department of Biology, Temple University, Philadelphia, Pennsylvania 19122, USA.,Center for Excellence in Genome Medicine and Research, King Abdulaziz University, Jeddah 21589, Saudi Arabia
| | - S Banu Ozkan
- Center for Biological Physics, Department of Physics, Arizona State University, Tempe, Arizona 85281, USA; , ,
| |
Collapse
|