1
|
Biswas G, Mukherjee D, Basu S. Combining Complementarity and Binding Energetics in the Assessment of Protein Interactions: EnCPdock-A Practical Manual. J Comput Biol 2024; 31:769-781. [PMID: 38885081 DOI: 10.1089/cmb.2024.0554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/20/2024] Open
Abstract
The combined effect of shape and electrostatic complementarities (Sc, EC) at the interface of the interacting protein partners (PPI) serves as the physical basis for such associations and is a strong determinant of their binding energetics. EnCPdock (https://www.scinetmol.in/EnCPdock/) presents a comprehensive web platform for the direct conjoint comparative analyses of complementarity and binding energetics in PPIs. It elegantly interlinks the dual nature of local (Sc) and nonlocal complementarity (EC) in PPIs using the complementarity plot. It further derives an AI-based ΔGbinding with a prediction accuracy comparable to the state of the art. This book chapter presents a practical manual to conceptualize and implement EnCPdock with its various features and functionalities, collectively having the potential to serve as a valuable protein engineering tool in the design of novel protein interfaces.
Collapse
Affiliation(s)
- Gargi Biswas
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | | | - Sankar Basu
- Department of Microbiology, Asutosh College, University of Calcutta, Kolkata, India
| |
Collapse
|
2
|
Khan RT, Pokorna P, Stourac J, Borko S, Arefiev I, Planas-Iglesias J, Dobias A, Pinto G, Szotkowska V, Sterba J, Slaby O, Damborsky J, Mazurenko S, Bednar D. A computational workflow for analysis of missense mutations in precision oncology. J Cheminform 2024; 16:86. [PMID: 39075588 PMCID: PMC11285293 DOI: 10.1186/s13321-024-00876-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 06/26/2024] [Indexed: 07/31/2024] Open
Abstract
Every year, more than 19 million cancer cases are diagnosed, and this number continues to increase annually. Since standard treatment options have varying success rates for different types of cancer, understanding the biology of an individual's tumour becomes crucial, especially for cases that are difficult to treat. Personalised high-throughput profiling, using next-generation sequencing, allows for a comprehensive examination of biopsy specimens. Furthermore, the widespread use of this technology has generated a wealth of information on cancer-specific gene alterations. However, there exists a significant gap between identified alterations and their proven impact on protein function. Here, we present a bioinformatics pipeline that enables fast analysis of a missense mutation's effect on stability and function in known oncogenic proteins. This pipeline is coupled with a predictor that summarises the outputs of different tools used throughout the pipeline, providing a single probability score, achieving a balanced accuracy above 86%. The pipeline incorporates a virtual screening method to suggest potential FDA/EMA-approved drugs to be considered for treatment. We showcase three case studies to demonstrate the timely utility of this pipeline. To facilitate access and analysis of cancer-related mutations, we have packaged the pipeline as a web server, which is freely available at https://loschmidt.chemi.muni.cz/predictonco/ .Scientific contributionThis work presents a novel bioinformatics pipeline that integrates multiple computational tools to predict the effects of missense mutations on proteins of oncological interest. The pipeline uniquely combines fast protein modelling, stability prediction, and evolutionary analysis with virtual drug screening, while offering actionable insights for precision oncology. This comprehensive approach surpasses existing tools by automating the interpretation of mutations and suggesting potential treatments, thereby striving to bridge the gap between sequencing data and clinical application.
Collapse
Affiliation(s)
- Rayyan Tariq Khan
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Petra Pokorna
- Central European Institute of Technology, Masaryk University, Brno, Czech Republic
- Department of Biology, Faculty of Medicine, Masaryk University, Brno, Czech Republic
| | - Jan Stourac
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Simeon Borko
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
- IT4Innovations Centre of Excellence, Faculty of Information Technology, Brno University of Technology, Brno, Czech Republic
| | - Ihor Arefiev
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Joan Planas-Iglesias
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Adam Dobias
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Gaspar Pinto
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Veronika Szotkowska
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Jaroslav Sterba
- Department of Paediatric Oncology, University Hospital Brno and Faculty of Medicine, Masaryk University, Brno, Czech Republic
| | - Ondrej Slaby
- Central European Institute of Technology, Masaryk University, Brno, Czech Republic
- Department of Biology, Faculty of Medicine, Masaryk University, Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Stanislav Mazurenko
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic.
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic.
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic.
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic.
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic.
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic.
| |
Collapse
|
3
|
Stourac J, Borko S, Khan RT, Pokorna P, Dobias A, Planas-Iglesias J, Mazurenko S, Pinto G, Szotkowska V, Sterba J, Slaby O, Damborsky J, Bednar D. PredictONCO: a web tool supporting decision-making in precision oncology by extending the bioinformatics predictions with advanced computing and machine learning. Brief Bioinform 2023; 25:bbad441. [PMID: 38066711 PMCID: PMC10709543 DOI: 10.1093/bib/bbad441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 10/25/2023] [Accepted: 11/10/2023] [Indexed: 12/18/2023] Open
Abstract
PredictONCO 1.0 is a unique web server that analyzes effects of mutations on proteins frequently altered in various cancer types. The server can assess the impact of mutations on the protein sequential and structural properties and apply a virtual screening to identify potential inhibitors that could be used as a highly individualized therapeutic approach, possibly based on the drug repurposing. PredictONCO integrates predictive algorithms and state-of-the-art computational tools combined with information from established databases. The user interface was carefully designed for the target specialists in precision oncology, molecular pathology, clinical genetics and clinical sciences. The tool summarizes the effect of the mutation on protein stability and function and currently covers 44 common oncological targets. The binding affinities of Food and Drug Administration/ European Medicines Agency -approved drugs with the wild-type and mutant proteins are calculated to facilitate treatment decisions. The reliability of predictions was confirmed against 108 clinically validated mutations. The server provides a fast and compact output, ideal for the often time-sensitive decision-making process in oncology. Three use cases of missense mutations, (i) K22A in cyclin-dependent kinase 4 identified in melanoma, (ii) E1197K mutation in anaplastic lymphoma kinase 4 identified in lung carcinoma and (iii) V765A mutation in epidermal growth factor receptor in a patient with congenital mismatch repair deficiency highlight how the tool can increase levels of confidence regarding the pathogenicity of the variants and identify the most effective inhibitors. The server is available at https://loschmidt.chemi.muni.cz/predictonco.
Collapse
Affiliation(s)
- Jan Stourac
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Simeon Borko
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
- IT4Innovations Centre of Excellence, Faculty of Information Technology, Brno University of Technology, Brno, Czech Republic
| | - Rayyan T Khan
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Petra Pokorna
- Central European Institute of Technology, Masaryk University, Brno, Czech Republic
- Department of Biology, Faculty of Medicine, Masaryk University, Brno, Czech Republic
| | - Adam Dobias
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Joan Planas-Iglesias
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Stanislav Mazurenko
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Gaspar Pinto
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - Veronika Szotkowska
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
| | - Jaroslav Sterba
- Department of Paediatric Oncology, University Hospital Brno and Faculty of Medicine, Masaryk University, Brno, Czech Republic
| | - Ondrej Slaby
- Central European Institute of Technology, Masaryk University, Brno, Czech Republic
- Department of Biology, Faculty of Medicine, Masaryk University, Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czech Republic
- Loschmidt Laboratories, RECETOX, Faculty of Science, Masaryk University, Brno, Czech Republic
- International Clinical Research Center, St. Anne's University Hospital Brno, Brno, Czech Republic
| |
Collapse
|
4
|
Malbranke C, Rostain W, Depardieu F, Cocco S, Monasson R, Bikard D. Computational design of novel Cas9 PAM-interacting domains using evolution-based modelling and structural quality assessment. PLoS Comput Biol 2023; 19:e1011621. [PMID: 37976326 PMCID: PMC10729993 DOI: 10.1371/journal.pcbi.1011621] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 12/19/2023] [Accepted: 10/19/2023] [Indexed: 11/19/2023] Open
Abstract
We present here an approach to protein design that combines (i) scarce functional information such as experimental data (ii) evolutionary information learned from a natural sequence variants and (iii) physics-grounded modeling. Using a Restricted Boltzmann Machine (RBM), we learn a sequence model of a protein family. We use semi-supervision to leverage available functional information during the RBM training. We then propose a strategy to explore the protein representation space that can be informed by external models such as an empirical force-field method (FoldX). Our approach is applied to a domain of the Cas9 protein responsible for recognition of a short DNA motif. We experimentally assess the functionality of 71 variants generated to explore a range of RBM and FoldX energies. Sequences with as many as 50 differences (20% of the protein domain) to the wild-type retained functionality. Overall, 21/71 sequences designed with our method were functional. Interestingly, 6/71 sequences showed an improved activity in comparison with the original wild-type protein sequence. These results demonstrate the interest in further exploring the synergies between machine-learning of protein sequence representations and physics grounded modeling strategies informed by structural information.
Collapse
Affiliation(s)
- Cyril Malbranke
- Laboratory of Physics of the Ecole Normale Superieure, PSL Research, CNRS UMR 8023, Sorbonne Université, Paris, France
- Institut Pasteur, Université Paris Cité, CNRS UMR 6047, Synthetic Biology, Paris, France
| | - William Rostain
- Institut Pasteur, Université Paris Cité, CNRS UMR 6047, Synthetic Biology, Paris, France
| | - Florence Depardieu
- Institut Pasteur, Université Paris Cité, CNRS UMR 6047, Synthetic Biology, Paris, France
| | - Simona Cocco
- Laboratory of Physics of the Ecole Normale Superieure, PSL Research, CNRS UMR 8023, Sorbonne Université, Paris, France
| | - Rémi Monasson
- Laboratory of Physics of the Ecole Normale Superieure, PSL Research, CNRS UMR 8023, Sorbonne Université, Paris, France
| | - David Bikard
- Institut Pasteur, Université Paris Cité, CNRS UMR 6047, Synthetic Biology, Paris, France
| |
Collapse
|
5
|
Gerasimavicius L, Livesey BJ, Marsh JA. Correspondence between functional scores from deep mutational scans and predicted effects on protein stability. Protein Sci 2023; 32:e4688. [PMID: 37243972 PMCID: PMC10273344 DOI: 10.1002/pro.4688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2023] [Revised: 04/19/2023] [Accepted: 05/24/2023] [Indexed: 05/29/2023]
Abstract
Many methodologically diverse computational methods have been applied to the growing challenge of predicting and interpreting the effects of protein variants. As many pathogenic mutations have a perturbing effect on protein stability or intermolecular interactions, one highly interpretable approach is to use protein structural information to model the physical impacts of variants and predict their likely effects on protein stability and interactions. Previous efforts have assessed the accuracy of stability predictors in reproducing thermodynamically accurate values and evaluated their ability to distinguish between known pathogenic and benign mutations. Here, we take an alternate approach, and explore how well stability predictor scores correlate with functional impacts derived from deep mutational scanning (DMS) experiments. In this work, we compare the predictions of 9 protein stability-based tools against mutant protein fitness values from 49 independent DMS datasets, covering 170,940 unique single amino acid variants. We find that FoldX and Rosetta show the strongest correlations with DMS-based functional scores, similar to their previous top performance in distinguishing between pathogenic and benign variants. For both methods, performance is considerably improved when considering intermolecular interactions from protein complex structures, when available. Furthermore, using these two predictors, we derive a "Foldetta" consensus score, which improves upon the performance of both, and manages to match dedicated variant effect predictors in reflecting variant functional impacts. Finally, we also highlight that predicted stability effects show consistently higher correlations with certain DMS experimental phenotypes, particularly those based upon protein abundance, and, in certain cases, can significantly outcompete sequence-based variant effect prediction methodologies for predicting functional scores from DMS experiments.
Collapse
Affiliation(s)
- Lukas Gerasimavicius
- MRC Human Genetics Unit, Institute of Genetics & CancerUniversity of EdinburghEdinburghUK
| | - Benjamin J. Livesey
- MRC Human Genetics Unit, Institute of Genetics & CancerUniversity of EdinburghEdinburghUK
| | - Joseph A. Marsh
- MRC Human Genetics Unit, Institute of Genetics & CancerUniversity of EdinburghEdinburghUK
| |
Collapse
|
6
|
Montero‐Blay A, Blanco JD, Rodriguez‐Arce I, Lastrucci C, Piñero‐Lambea C, Lluch‐Senar M, Serrano L. Bacterial expression of a designed single-chain IL-10 prevents severe lung inflammation. Mol Syst Biol 2023; 19:e11037. [PMID: 36598022 PMCID: PMC9834763 DOI: 10.15252/msb.202211037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Revised: 09/27/2022] [Accepted: 10/05/2022] [Indexed: 01/05/2023] Open
Abstract
Interleukin-10 (IL-10) is an anti-inflammatory cytokine that is active as a swapped domain dimer and is used in bacterial therapy of gut inflammation. IL-10 can be used as treatment of a wide range of pulmonary diseases. Here we have developed a non-pathogenic chassis (CV8) of the human lung bacterium Mycoplasma pneumoniae (MPN) to treat lung diseases. We find that IL-10 expression by MPN has a limited impact on the lung inflammatory response in mice. To solve these issues, we rationally designed a single-chain IL-10 (SC-IL10) with or without surface mutations, using our protein design software (ModelX and FoldX). As compared to the IL-10 WT, the designed SC-IL10 molecules increase the effective expression in MPN four-fold, and the activity in mouse and human cell lines between 10 and 60 times, depending on the cell line. The SC-IL10 molecules expressed in the mouse lung by CV8 in vivo have a powerful anti-inflammatory effect on Pseudomonas aeruginosa lung infection. This rational design strategy could be used to other molecules with immunomodulatory properties used in bacterial therapy.
Collapse
Affiliation(s)
- Ariadna Montero‐Blay
- Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyBarcelonaSpain
| | - Javier Delgado Blanco
- Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyBarcelonaSpain
| | - Irene Rodriguez‐Arce
- Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyBarcelonaSpain
| | - Claire Lastrucci
- Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyBarcelonaSpain
| | - Carlos Piñero‐Lambea
- Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyBarcelonaSpain
| | - Maria Lluch‐Senar
- Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyBarcelonaSpain
| | - Luis Serrano
- Centre for Genomic Regulation (CRG)The Barcelona Institute of Science and TechnologyBarcelonaSpain
- Universitat Pompeu Fabra (UPF)BarcelonaSpain
- ICREABarcelonaSpain
| |
Collapse
|
7
|
Scaffa A, Tollefson GA, Yao H, Rizal S, Wallace J, Oulhen N, Carr JF, Hegarty K, Uzun A, Dennery PA. Identification of Heme Oxygenase-1 as a Putative DNA-Binding Protein. Antioxidants (Basel) 2022; 11:2135. [PMID: 36358506 PMCID: PMC9686683 DOI: 10.3390/antiox11112135] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 10/04/2022] [Accepted: 10/25/2022] [Indexed: 09/30/2023] Open
Abstract
Heme oxygenase-1 (HO-1) is a rate-limiting enzyme in degrading heme into biliverdin and iron. HO-1 can also enter the nucleus and regulate gene transcription independent of its enzymatic activity. Whether HO-1 can alter gene expression through direct binding to target DNA remains unclear. Here, we performed HO-1 CHIP-seq and then employed 3D structural modeling to reveal putative HO-1 DNA binding domains. We identified three probable DNA binding domains on HO-1. Using the Proteinarium, we identified several genes as the most highly connected nodes in the interactome among the HO-1 gene binding targets. We further demonstrated that HO-1 modulates the expression of these key genes using Hmox1 deficient cells. Finally, mutation of four conserved amino acids (E215, I211, E201, and Q27) within HO-1 DNA binding domain 1 significantly increased expression of Gtpbp3 and Eif1 genes that were identified within the top 10 binding hits normalized by gene length predicted to bind this domain. Based on these data, we conclude that HO-1 protein is a putative DNA binding protein, and regulates targeted gene expression. This provides the foundation for developing specific inhibitors or activators targeting HO-1 DNA binding domains to modulate targeted gene expression and corresponding cellular function.
Collapse
Affiliation(s)
- Alejandro Scaffa
- Department of Molecular Biology, Cell Biology & Biochemistry, Division of Biology and Medicine, Brown University, Providence, RI 02912, USA
| | - George A. Tollefson
- Department of Pathology and Laboratory Medicine, Rhode Island Hospital, Providence, RI 02903, USA
| | - Hongwei Yao
- Department of Molecular Biology, Cell Biology & Biochemistry, Division of Biology and Medicine, Brown University, Providence, RI 02912, USA
| | - Salu Rizal
- Department of Molecular Biology, Cell Biology & Biochemistry, Division of Biology and Medicine, Brown University, Providence, RI 02912, USA
| | - Joselynn Wallace
- Center for Computational Biology of Human Disease, and Center for Computation and Visualization, Brown University, Providence, RI 02906, USA
| | - Nathalie Oulhen
- Department of Molecular Biology, Cell Biology & Biochemistry, Division of Biology and Medicine, Brown University, Providence, RI 02912, USA
| | - Jennifer F. Carr
- Department of Molecular Biology, Cell Biology & Biochemistry, Division of Biology and Medicine, Brown University, Providence, RI 02912, USA
| | - Katy Hegarty
- Department of Molecular Biology, Cell Biology & Biochemistry, Division of Biology and Medicine, Brown University, Providence, RI 02912, USA
| | - Alper Uzun
- Department of Pediatrics, Warren Alpert Medical School of Brown University, Providence, RI 02903, USA
- Department of Pediatrics, Women and Infants Hospital, Providence, RI 02905, USA
- Center for Computational Molecular Biology, Brown University, Providence, RI 02906, USA
| | - Phyllis A. Dennery
- Department of Molecular Biology, Cell Biology & Biochemistry, Division of Biology and Medicine, Brown University, Providence, RI 02912, USA
- Department of Pediatrics, Warren Alpert Medical School of Brown University, Providence, RI 02903, USA
| |
Collapse
|
8
|
Lai J, Yang J, Gamsiz Uzun ED, Rubenstein BM, Sarkar IN. LYRUS: a machine learning model for predicting the pathogenicity of missense variants. BIOINFORMATICS ADVANCES 2021; 2:vbab045. [PMID: 35036922 PMCID: PMC8754197 DOI: 10.1093/bioadv/vbab045] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Revised: 12/08/2021] [Accepted: 12/21/2021] [Indexed: 01/27/2023]
Abstract
SUMMARY Single amino acid variations (SAVs) are a primary contributor to variations in the human genome. Identifying pathogenic SAVs can provide insights to the genetic architecture of complex diseases. Most approaches for predicting the functional effects or pathogenicity of SAVs rely on either sequence or structural information. This study presents 〈Lai Yang Rubenstein Uzun Sarkar〉 (LYRUS), a machine learning method that uses an XGBoost classifier to predict the pathogenicity of SAVs. LYRUS incorporates five sequence-based, six structure-based and four dynamics-based features. Uniquely, LYRUS includes a newly proposed sequence co-evolution feature called the variation number. LYRUS was trained using a dataset that contains 4363 protein structures corresponding to 22 639 SAVs from the ClinVar database, and tested using the VariBench testing dataset. Performance analysis showed that LYRUS achieved comparable performance to current variant effect predictors. LYRUS's performance was also benchmarked against six Deep Mutational Scanning datasets for PTEN and TP53. AVAILABILITY AND IMPLEMENTATION LYRUS is freely available and the source code can be found at https://github.com/jiaying2508/LYRUS. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Jiaying Lai
- Center for Biomedical Informatics, Brown University, Providence, RI 02903, USA,Center for Computational Molecular Biology, Brown University, Providence, RI 02906, USA
| | - Jordan Yang
- Department of Chemistry, Brown University, Providence, RI 02906, USA
| | - Ece D Gamsiz Uzun
- Center for Computational Molecular Biology, Brown University, Providence, RI 02906, USA,Department of Pathology and Laboratory Medicine, Brown University Alpert Medical School, Providence, RI 02903, USA,Department of Pathology, Rhode Island Hospital, Providence, RI 02903, USA
| | - Brenda M Rubenstein
- Center for Computational Molecular Biology, Brown University, Providence, RI 02906, USA,Department of Chemistry, Brown University, Providence, RI 02906, USA,To whom correspondence should be addressed. and
| | - Indra Neil Sarkar
- Center for Biomedical Informatics, Brown University, Providence, RI 02903, USA,Rhode Island Quality Institute, Providence, RI 02908, USA,To whom correspondence should be addressed. and
| |
Collapse
|
9
|
Gene Editing in Pluripotent Stem Cells and Their Derived Organoids. Stem Cells Int 2021; 2021:8130828. [PMID: 34887928 PMCID: PMC8651378 DOI: 10.1155/2021/8130828] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 11/22/2021] [Indexed: 12/26/2022] Open
Abstract
With the rapid rise in gene-editing technology, pluripotent stem cells (PSCs) and their derived organoids have increasingly broader and practical applications in regenerative medicine. Gene-editing technologies, from large-scale nucleic acid endonucleases to CRISPR, have ignited a global research and development boom with significant implications in regenerative medicine. The development of regenerative medicine technologies, regardless of whether it is PSCs or gene editing, is consistently met with controversy. Are the tools for rewriting the code of life a boon to humanity or a Pandora's box? These technologies raise concerns regarding ethical issues, unexpected mutations, viral infection, etc. These concerns remain even as new treatments emerge. However, the potential negatives cannot obscure the virtues of PSC gene editing, which have, and will continue to, benefit mankind at an unprecedented rate. Here, we briefly introduce current gene-editing technology and its application in PSCs and their derived organoids, while addressing ethical concerns and safety risks and discussing the latest progress in PSC gene editing. Gene editing in PSCs creates visualized in vitro models, providing opportunities for examining mechanisms of known and unknown mutations and offering new possibilities for the treatment of cancer, genetic diseases, and other serious or refractory disorders. From model construction to treatment exploration, the important role of PSCs combined with gene editing in basic and clinical medicine studies is illustrated. The applications, characteristics, and existing challenges are summarized in combination with our lab experiences in this field in an effort to help gene-editing technology better serve humans in a regulated manner. Current preclinical and clinical trials have demonstrated initial safety and efficacy of PSC gene editing; however, for better application in clinical settings, additional investigation is warranted.
Collapse
|
10
|
Wang Y, Li G, Meng T, Qi L, Yan H, Wang Z. Molecular insights into the selective binding mechanism targeting parallel human telomeric G-quadruplex. J Mol Graph Model 2021; 110:108058. [PMID: 34736054 DOI: 10.1016/j.jmgm.2021.108058] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Revised: 10/18/2021] [Accepted: 10/21/2021] [Indexed: 12/15/2022]
Abstract
Stabilizing human telomere DNA G-quadruplex (G4) proves a promising anti-cancer strategy. Though plenty of G4 stabilizing molecules have been reported, little is known about their selective binding mechanism among various G4s. Recently, a designed monohydrazone derivative (compound 15) was reported to display specific preference in binding and stabilizing parallel human telomeric G4. To reveal the selective binding mechanism, a comparative theoretical investigation was performed on two monohydrazone derivatives (compounds 1 and 15) and three telomeric G4s showing parallel, hybrid-I, and hybrid-II conformations. Two probable binding modes, i.e. the end-stacking binding and the groove binding, were predicted by molecular dockings for each monohydrazone in its binding with the telomeric G4s. Further long-timescale molecular dynamics simulations reveal the conversion from the groove binding to the end-stacking binding for both compounds, indicating the preference of the end-stacking binding mode. Structural analysis together with binding free energy calculations show that the van der Waals interaction plays a leading role in ranking the binding affinity. By forming extensive van der Waals interactions, the parallel G4-15 binding complex shows the highest binding affinity, and the corresponding compound 15 exhibits the strongest stabilizing effect to the telomeric G4. These findings agree well with the experimental observations. Through characterizing the selective binding between monohydrazones and telomeric G4s at the atomic level, the current study provides support to the design of novel selective stabilizers targeting telomeric G4s.
Collapse
Affiliation(s)
- Yue Wang
- School of Pharmaceutical Sciences, Liaocheng University, Liaocheng, Shandong Province, 252059, China
| | - Guo Li
- Department of Biochemistry and Molecular Biology, Hainan Medical College, Haikou, Hainan Province, 571199, China
| | - Tong Meng
- School of Pharmaceutical Sciences, Liaocheng University, Liaocheng, Shandong Province, 252059, China
| | - Lin Qi
- Railway Police College, Zhengzhou, Henan Province, 450053, China
| | - Hui Yan
- School of Pharmaceutical Sciences, Liaocheng University, Liaocheng, Shandong Province, 252059, China.
| | - Zhiguo Wang
- Institute of Ageing Research, School of Basic Medical Sciences, Hangzhou Normal University, Hangzhou, Zhejiang Province, 311121, China.
| |
Collapse
|
11
|
Mei LC, Wang YL, Wu FX, Wang F, Hao GF, Yang GF. HISNAPI: a bioinformatic tool for dynamic hot spot analysis in nucleic acid-protein interface with a case study. Brief Bioinform 2021; 22:bbaa373. [PMID: 33406224 PMCID: PMC7929440 DOI: 10.1093/bib/bbaa373] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 11/19/2020] [Accepted: 11/23/2020] [Indexed: 01/18/2023] Open
Abstract
Protein-nucleic acid interactions play essential roles in many biological processes, such as transcription, replication and translation. In protein-nucleic acid interfaces, hotspot residues contribute the majority of binding affinity toward molecular recognition. Hotspot residues are commonly regarded as potential binding sites for compound molecules in drug design projects. The dynamic property is a considerable factor that affects the binding of ligands. Computational approaches have been developed to expedite the prediction of hotspot residues on protein-nucleic acid interfaces. However, existing approaches overlook hotspot dynamics, despite their essential role in protein function. Here, we report a web server named Hotspots In silico Scanning on Nucleic Acid and Protein Interface (HISNAPI) to analyze hotspot residue dynamics by integrating molecular dynamics simulation and one-step free energy perturbation. HISNAPI is capable of not only predicting the hotspot residues in protein-nucleic acid interfaces but also providing insights into their intensity and correlation of dynamic motion. Protein dynamics have been recognized as a vital factor that has an effect on the interaction specificity and affinity of the binding partners. We applied HISNAPI to the case of SARS-CoV-2 RNA-dependent RNA polymerase, a vital target of the antiviral drug for the treatment of coronavirus disease 2019. We identified the hotspot residues and characterized their dynamic behaviors, which might provide insight into the target site for antiviral drug design. The web server is freely available via a user-friendly web interface at http://chemyang.ccnu.edu.cn/ccb/server/HISNAPI/ and http://agroda.gzu.edu.cn:9999/ccb/server/HISNAPI/.
Collapse
Affiliation(s)
- Long-Can Mei
- College of Chemistry, Central China Normal University
| | | | | | | | | | - Guang-Fu Yang
- Pesticide Science from Nankai University, Tianjin, China
| |
Collapse
|
12
|
Abstract
Biological processes are often mediated by complexes formed between proteins and various biomolecules. The 3D structures of such protein-biomolecule complexes provide insights into the molecular mechanism of their action. The structure of these complexes can be predicted by various computational methods. Choosing an appropriate method for modelling depends on the category of biomolecule that a protein interacts with and the availability of structural information about the protein and its interacting partner. We intend for the contents of this chapter to serve as a guide as to what software would be the most appropriate for the type of data at hand and the kind of 3D complex structure required. Particularly, we have dealt with protein-small molecule ligand, protein-peptide, protein-protein, and protein-nucleic acid interactions.Most, if not all, model building protocols perform some sampling and scoring. Typically, several alternate conformations and configurations of the interactors are sampled. Each such sample is then scored for optimization. To boost the confidence in these predicted models, their assessment using other independent scoring schemes besides the inbuilt/default ones would prove to be helpful. This chapter also lists such software and serves as a guide to gauge the fidelity of modelled structures of biomolecular complexes.
Collapse
|
13
|
Aditham AK, Markin CJ, Mokhtari DA, DelRosso N, Fordyce PM. High-Throughput Affinity Measurements of Transcription Factor and DNA Mutations Reveal Affinity and Specificity Determinants. Cell Syst 2020; 12:112-127.e11. [PMID: 33340452 DOI: 10.1016/j.cels.2020.11.012] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Revised: 08/08/2020] [Accepted: 11/24/2020] [Indexed: 01/28/2023]
Abstract
Transcription factors (TFs) bind regulatory DNA to control gene expression, and mutations to either TFs or DNA can alter binding affinities to rewire regulatory networks and drive phenotypic variation. While studies have profiled energetic effects of DNA mutations extensively, we lack similar information for TF variants. Here, we present STAMMP (simultaneous transcription factor affinity measurements via microfluidic protein arrays), a high-throughput microfluidic platform enabling quantitative characterization of hundreds of TF variants simultaneously. Measured affinities for ∼210 mutants of a model yeast TF (Pho4) interacting with 9 oligonucleotides (>1,800 Kds) reveal that many combinations of mutations to poorly conserved TF residues and nucleotides flanking the core binding site alter but preserve physiological binding, providing a mechanism by which combinations of mutations in cis and trans could modulate TF binding to tune occupancies during evolution. Moreover, biochemical double-mutant cycles across the TF-DNA interface reveal molecular mechanisms driving recognition, linking sequence to function. A record of this paper's Transparent Peer Review process is included in the Supplemental Information.
Collapse
Affiliation(s)
- Arjun K Aditham
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA; Stanford ChEM-H, Stanford University, Stanford, CA 94305, USA
| | - Craig J Markin
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA
| | - Daniel A Mokhtari
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA
| | - Nicole DelRosso
- Graduate Program in Biophysics, Stanford University, Stanford, CA 94305, USA
| | - Polly M Fordyce
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA; Stanford ChEM-H, Stanford University, Stanford, CA 94305, USA; Department of Genetics, Stanford University, Stanford, CA 94305, USA; Chan Zuckerberg Biohub, San Francisco, CA 94110, USA.
| |
Collapse
|
14
|
Yadav D, Kaur S, Banerjee D, Bhattacharyya R. Metformin and Rifampicin combination augments active to latent tuberculosis conversion: A computational study. Biotechnol Appl Biochem 2020; 68:1307-1312. [PMID: 33059386 DOI: 10.1002/bab.2052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Accepted: 10/07/2020] [Indexed: 11/10/2022]
Abstract
Tuberculosis, a global threat, is a highly infectious disease intensified by the emergence of drug-resistant strains. In tuberculosis disease spectrum, a typical situation is a dormant or latent phase where a person exposed to Mycobacterium tuberculosis has the reservoir of the disease that may or may not result in an active state. Existence of the dormant state is retarding the eradication of tuberculosis. Transcription of several genes helps M. tuberculosis to survive in nonreplicative mode. DosR transcription factor is the hallmark for this genesis. Diabetes mellitus is a predisposition factor leading to the development of tuberculosis and latent tuberculosis. High plasma insulin concentrations in the prediabetic state can increase the tuberculosis bacterium. On the other hand, antidiabetic drug metformin is known to reduce active tuberculosis disease when provided in combination with antitubercular therapy. However, the effect of the same on latent tuberculosis is still unknown. In the present work using tools of computational biology, we have tried to find the consequence of adding metformin in combination with rifampicin, a well-known antitubercular drug, on molecular mechanisms of latent tuberculosis. We have investigated whether metformin and rifampicin interact with DosR machinery or not. Our results indicate that if metformin-bound DosR-DNA complex binds with rifampicin, it will result in the conversion of active tuberculosis to latent tuberculosis.
Collapse
Affiliation(s)
- Deepak Yadav
- Department of Experimental Medicine and Biotechnology, Postgraduate Institute of Medical Education and Research, Chandigarh, India
| | - Sumanpreet Kaur
- Department of Experimental Medicine and Biotechnology, Postgraduate Institute of Medical Education and Research, Chandigarh, India
| | - Dibyajyoti Banerjee
- Department of Experimental Medicine and Biotechnology, Postgraduate Institute of Medical Education and Research, Chandigarh, India
| | - Rajasri Bhattacharyya
- Department of Experimental Medicine and Biotechnology, Postgraduate Institute of Medical Education and Research, Chandigarh, India
| |
Collapse
|
15
|
Aderinwale T, Christoffer CW, Sarkar D, Alnabati E, Kihara D. Computational structure modeling for diverse categories of macromolecular interactions. Curr Opin Struct Biol 2020; 64:1-8. [PMID: 32599506 PMCID: PMC7665979 DOI: 10.1016/j.sbi.2020.05.017] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 05/06/2020] [Accepted: 05/21/2020] [Indexed: 01/23/2023]
Abstract
Computational protein-protein docking is one of the most intensively studied topics in structural bioinformatics. The field has made substantial progress through over three decades of development. The development began with methods for rigid-body docking of two proteins, which have now been extended in different directions to cover the various macromolecular interactions observed in a cell. Here, we overview the recent developments of the variations of docking methods, including multiple protein docking, peptide-protein docking, and disordered protein docking methods.
Collapse
Affiliation(s)
- Tunde Aderinwale
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
| | | | - Daipayan Sarkar
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA
| | - Eman Alnabati
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA
| | - Daisuke Kihara
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA; Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA.
| |
Collapse
|
16
|
Cianferoni D, Radusky LG, Head SA, Serrano L, Delgado J. ProteinFishing: a protein complex generator within the ModelX toolsuite. Bioinformatics 2020; 36:4208-4210. [PMID: 32437555 PMCID: PMC7390992 DOI: 10.1093/bioinformatics/btaa533] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Revised: 05/11/2020] [Accepted: 05/18/2020] [Indexed: 11/26/2022] Open
Abstract
Summary Accurate 3D modelling of protein–protein interactions (PPI) is essential to compensate for the absence of experimentally determined complex structures. Here, we present a new set of commands within the ModelX toolsuite capable of generating atomic-level protein complexes suitable for interface design. Among these commands, the new tool ProteinFishing proposes known and/or putative alternative 3D PPI for a given protein complex. The algorithm exploits backbone compatibility of protein fragments to generate mutually exclusive protein interfaces that are quickly evaluated with a knowledge-based statistical force field. Using interleukin-10-R2 co-crystalized with interferon-lambda-3, and a database of X-ray structures containing interleukin-10, this algorithm was able to generate interleukin-10-R2/interleukin-10 structural models in agreement with experimental data. Availability and implementation ProteinFishing is a portable command-line tool included in the ModelX toolsuite, written in C++, that makes use of an SQL (tested for MySQL and MariaDB) relational database delivered with a template SQL dump called FishXDB. FishXDB contains the empty tables of ModelX fragments and the data used by the embedded statistical force field. ProteinFishing is compiled for Linux-64bit, MacOS-64bit and Windows-32bit operating systems. This software is a proprietary license and is distributed as an executable with its correspondent database dumps. It can be downloaded publicly at http://modelx.crg.es/. Licenses are freely available for academic users after registration on the website and are available under commercial license for for-profit organizations or companies. Contact javier.delgado@crg.eu or luis.serrano@crg.eu Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Damiano Cianferoni
- Systems Biology, Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Barcelona 08003, Spain
| | - Leandro G Radusky
- Systems Biology, Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Barcelona 08003, Spain
| | - Sarah A Head
- Systems Biology, Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Barcelona 08003, Spain
| | - Luis Serrano
- Systems Biology, Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Barcelona 08003, Spain.,Universitat Pompeu Fabra (UPF), Barcelona 08002, Spain.,ICREA, Barcelona 08010, Spain
| | - Javier Delgado
- Systems Biology, Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology, Barcelona 08003, Spain
| |
Collapse
|
17
|
Protein-assisted RNA fragment docking (RnaX) for modeling RNA-protein interactions using ModelX. Proc Natl Acad Sci U S A 2019; 116:24568-24573. [PMID: 31732673 PMCID: PMC6900601 DOI: 10.1073/pnas.1910999116] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Protein–RNA interactions, key in biological processes, remained refractory to prediction algorithms. Here we present a new extension of the ModelX tool suite designed for this purpose. RNA–protein complexes in the Protein Data Bank were decomposed into small peptide–oligonucleotide interacting fragment pairs and used as building blocks to assemble big scaffolds representing complex RNA–protein interactions. This method has already been successful for designing DNA–protein and protein–protein interfaces. Areas under the curve up to 0.86 were achieved on binding site prediction showing the accuracy and coverage of our approach over established and in-house benchmarking sets. Together with FoldX protein design tool suite we were able to engineer backbone- and side chain-compatible interfaces using naked protein structures as input. RNA–protein interactions are crucial for such key biological processes as regulation of transcription, splicing, translation, and gene silencing, among many others. Knowing where an RNA molecule interacts with a target protein and/or engineering an RNA molecule to specifically bind to a protein could allow for rational interference with these cellular processes and the design of novel therapies. Here we present a robust RNA–protein fragment pair-based method, termed RnaX, to predict RNA-binding sites. This methodology, which is integrated into the ModelX tool suite (http://modelx.crg.es), takes advantage of the structural information present in all released RNA–protein complexes. This information is used to create an exhaustive database for docking and a statistical forcefield for fast discrimination of true backbone-compatible interactions. RnaX, together with the protein design forcefield FoldX, enables us to predict RNA–protein interfaces and, when sufficient crystallographic information is available, to reengineer the interface at the sequence-specificity level by mimicking those conformational changes that occur on protein and RNA mutagenesis. These results, obtained at just a fraction of the computational cost of methods that simulate conformational dynamics, open up perspectives for the engineering of RNA–protein interfaces.
Collapse
|
18
|
Vanmeert M, Razzokov J, Mirza MU, Weeks SD, Schepers G, Bogaerts A, Rozenski J, Froeyen M, Herdewijn P, Pinheiro VB, Lescrinier E. Rational design of an XNA ligase through docking of unbound nucleic acids to toroidal proteins. Nucleic Acids Res 2019; 47:7130-7142. [PMID: 31334814 PMCID: PMC6649754 DOI: 10.1093/nar/gkz551] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2019] [Revised: 05/24/2019] [Accepted: 06/12/2019] [Indexed: 02/06/2023] Open
Abstract
Xenobiotic nucleic acids (XNA) are nucleic acid analogues not present in nature that can be used for the storage of genetic information. In vivo XNA applications could be developed into novel biocontainment strategies, but are currently limited by the challenge of developing XNA processing enzymes such as polymerases, ligases and nucleases. Here, we present a structure-guided modelling-based strategy for the rational design of those enzymes essential for the development of XNA molecular biology. Docking of protein domains to unbound double-stranded nucleic acids is used to generate a first approximation of the extensive interaction of nucleic acid processing enzymes with their substrate. Molecular dynamics is used to optimise that prediction allowing, for the first time, the accurate prediction of how proteins that form toroidal complexes with nucleic acids interact with their substrate. Using the Chlorella virus DNA ligase as a proof of principle, we recapitulate the ligase's substrate specificity and successfully predict how to convert it into an XNA-templated XNA ligase.
Collapse
Affiliation(s)
- Michiel Vanmeert
- Medicinal Chemistry, Rega Institute for Medical Research, KU Leuven, Herestraat 49, box 1041, 3000 Leuven, Belgium
| | - Jamoliddin Razzokov
- Research group PLASMANT, Department of Chemistry, University of Antwerp, Universiteitsplein 1, B-2610 Antwerp, Belgium
| | - Muhammad Usman Mirza
- Medicinal Chemistry, Rega Institute for Medical Research, KU Leuven, Herestraat 49, box 1041, 3000 Leuven, Belgium
- Centre for Research in Molecular Medicine (CRiMM), University of Lahore, Pakistan
| | - Stephen D Weeks
- Biocrystallography, KU Leuven, Herestraat 49, box 822, 3000 Leuven, Belgium
| | - Guy Schepers
- Medicinal Chemistry, Rega Institute for Medical Research, KU Leuven, Herestraat 49, box 1041, 3000 Leuven, Belgium
| | - Annemie Bogaerts
- Research group PLASMANT, Department of Chemistry, University of Antwerp, Universiteitsplein 1, B-2610 Antwerp, Belgium
| | - Jef Rozenski
- Medicinal Chemistry, Rega Institute for Medical Research, KU Leuven, Herestraat 49, box 1041, 3000 Leuven, Belgium
| | - Mathy Froeyen
- Medicinal Chemistry, Rega Institute for Medical Research, KU Leuven, Herestraat 49, box 1041, 3000 Leuven, Belgium
| | - Piet Herdewijn
- Medicinal Chemistry, Rega Institute for Medical Research, KU Leuven, Herestraat 49, box 1041, 3000 Leuven, Belgium
| | - Vitor B Pinheiro
- Medicinal Chemistry, Rega Institute for Medical Research, KU Leuven, Herestraat 49, box 1041, 3000 Leuven, Belgium
- University College London, Department of Structural and Molecular Biology, Gower Street, London, WC1E 6BT, UK
| | - Eveline Lescrinier
- Medicinal Chemistry, Rega Institute for Medical Research, KU Leuven, Herestraat 49, box 1041, 3000 Leuven, Belgium
| |
Collapse
|
19
|
Peng Y, Alexov E, Basu S. Structural Perspective on Revealing and Altering Molecular Functions of Genetic Variants Linked with Diseases. Int J Mol Sci 2019; 20:ijms20030548. [PMID: 30696058 PMCID: PMC6386852 DOI: 10.3390/ijms20030548] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2018] [Revised: 01/25/2019] [Accepted: 01/26/2019] [Indexed: 12/25/2022] Open
Abstract
Structural information of biological macromolecules is crucial and necessary to deliver predictions about the effects of mutations-whether polymorphic or deleterious (i.e., disease causing), wherein, thermodynamic parameters, namely, folding and binding free energies potentially serve as effective biomarkers. It may be emphasized that the effect of a mutation depends on various factors, including the type of protein (globular, membrane or intrinsically disordered protein) and the structural context in which it occurs. Such information may positively aid drug-design. Furthermore, due to the intrinsic plasticity of proteins, even mutations involving radical change of the structural and physico⁻chemical properties of the amino acids (native vs. mutant) can still have minimal effects on protein thermodynamics. However, if a mutation causes significant perturbation by either folding or binding free energies, it is quite likely to be deleterious. Mitigating such effects is a promising alternative to the traditional approaches of designing inhibitors. This can be done by structure-based in silico screening of small molecules for which binding to the dysfunctional protein restores its wild type thermodynamics. In this review we emphasize the effects of mutations on two important biophysical properties, stability and binding affinity, and how structures can be used for structure-based drug design to mitigate the effects of disease-causing variants on the above biophysical properties.
Collapse
Affiliation(s)
- Yunhui Peng
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA.
| | - Emil Alexov
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA.
| | - Sankar Basu
- Department of Physics and Astronomy, Clemson University, Clemson, SC 29634, USA.
| |
Collapse
|