1
Rodrigues CHM, Portelli S, Ascher DB. Exploring the effects of missense mutations on protein thermodynamics through structure-based approaches: findings from the CAGI6 challenges. Hum Genet 2024:10.1007/s00439-023-02623-4. [PMID: 38227011 DOI: 10.1007/s00439-023-02623-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/15/2023] [Accepted: 11/18/2023] [Indexed: 01/17/2024]
Abstract
Missense mutations are known contributors to diverse genetic disorders, due to their subtle, single amino acid changes imparted on the resultant protein. Because of this, understanding the impact of these mutations on protein stability and function is crucial for unravelling disease mechanisms and developing targeted therapies. The Critical Assessment of Genome Interpretation (CAGI) provides a valuable platform for benchmarking state-of-the-art computational methods in predicting the impact of disease-related mutations on protein thermodynamics. Here we report the performance of our comprehensive platform of structure-based computational approaches to evaluate mutations impacting protein structure and function on 3 challenges from CAGI6: Calmodulin, MAPK1 and MAPK3. Our stability predictors have achieved correlations of up to 0.74 and AUCs of 1 when predicting changes in ΔΔG for MAPK1 and MAPK3, respectively, and AUC of up to 0.75 in the Calmodulin challenge. Overall, our study highlights the importance of structure-based approaches in understanding the effects of missense mutations on protein thermodynamics. The results obtained from the CAGI6 challenges contribute to the ongoing efforts to enhance our understanding of disease mechanisms and facilitate the development of personalised medicine approaches.
Affiliation(s)
- Carlos H M Rodrigues
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, 3004, Australia
- School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia, QLD, 4072, Australia
- Stephanie Portelli
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, 3004, Australia
- School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia, QLD, 4072, Australia
- David B Ascher
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, 3004, Australia.
- School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia, QLD, 4072, Australia.
2
Abstract
The greatest challenge in drug discovery remains the high rate of attrition across the different phases of the process, which cost the industry billions of dollars every year. While all phases remain crucial to ensure pharmaceutical-level safety, quality, and efficacy of the end product, streamlining these efforts toward compounds with success potential is pivotal for a more efficient and cost-effective process. The use of artificial intelligence (AI) within the pharmaceutical industry aims at just this, and has applications in preclinical screening for biological activity, optimization of pharmacokinetic properties for improved drug formulation, early toxicity prediction which reduces attrition, and pre-emptively screening for genetic changes in the biological target to improve therapeutic longevity. Here, we present a series of in silico tools that address these applications in small molecule development and describe how they can be embedded within the current pharmaceutical development pipeline.
Affiliation(s)
- Adam Serghini
- School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia, QLD, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia
- Stephanie Portelli
- School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia, QLD, Australia.
- David B Ascher
- School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia, QLD, Australia.
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia.
3
Rana MM, Nguyen DD. Geometric Graph Learning to Predict Changes in Binding Free Energy and Protein Thermodynamic Stability upon Mutation. J Phys Chem Lett 2023; 14:10870-10879. [PMID: 38032742 DOI: 10.1021/acs.jpclett.3c02679] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/02/2023]
Abstract
Accurate prediction of binding free energy changes upon mutations is vital for optimizing drugs, designing proteins, understanding genetic diseases, and cost-effective virtual screening. While machine learning methods show promise in this domain, achieving accuracy and generalization across diverse data sets remains a challenge. This study introduces Geometric Graph Learning for Protein-Protein Interactions (GGL-PPI), a novel approach integrating geometric graph representation and machine learning to forecast mutation-induced binding free energy changes. GGL-PPI leverages atom-level graph coloring and multiscale weighted colored geometric subgraphs to capture structural features of biomolecules, demonstrating superior performance on three standard data sets, namely, AB-Bind, SKEMPI 1.0, and SKEMPI 2.0 data sets. The model's efficacy extends to predicting protein thermodynamic stability in a blind test set, providing unbiased predictions for both direct and reverse mutations and showcasing notable generalization. GGL-PPI's precision in predicting changes in binding free energy and stability due to mutations enhances our comprehension of protein complexes, offering valuable insights for drug design endeavors.
Affiliation(s)
- Md Masud Rana
- Department of Mathematics, University of Kentucky, Lexington, Kentucky 40506, United States
- Duc Duy Nguyen
- Department of Mathematics, University of Kentucky, Lexington, Kentucky 40506, United States
4
Hervin V, Roy V, Agrofoglio LA. Antibiotics and Antibiotic Resistance-Mur Ligases as an Antibacterial Target. Molecules 2023; 28:8076. [PMID: 38138566 PMCID: PMC10745416 DOI: 10.3390/molecules28248076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/29/2023] [Revised: 11/09/2023] [Accepted: 12/11/2023] [Indexed: 12/24/2023] Open
Abstract
The emergence of Multidrug Resistance (MDR) strains of bacteria has accelerated the search for new antibacterials. The specific bacterial peptidoglycan biosynthetic pathway represents opportunities for the development of novel antibacterial agents. Among the enzymes involved, Mur ligases, described herein, and especially the amide ligases MurC-F are key targets for the discovery of multi-inhibitors, as they share common active sites and structural features.
Affiliation(s)
- Vincent Roy
- ICOA UMR CNRS 7311, Université d’Orléans et CNRS, Rue de Chartres, 45067 Orléans, France;
- Luigi A. Agrofoglio
- ICOA UMR CNRS 7311, Université d’Orléans et CNRS, Rue de Chartres, 45067 Orléans, France;
5
Weissenow K, Rost B. Rendering protein mutation movies with MutAmore. BMC Bioinformatics 2023; 24:469. [PMID: 38087198 PMCID: PMC10714560 DOI: 10.1186/s12859-023-05610-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/17/2023] [Accepted: 12/08/2023] [Indexed: 12/18/2023] Open
Abstract
BACKGROUND The success of AlphaFold2 in reliable protein three-dimensional (3D) structure prediction, assists the move of structural biology toward studies of protein dynamics and mutational impact on structure and function. This transition needs tools that qualitatively assess alternative 3D conformations. RESULTS We introduce MutAmore, a bioinformatics tool that renders individual images of protein 3D structures for, e.g., sequence mutations into a visually intuitive movie format. MutAmore streamlines a pipeline casting single amino-acid variations (SAVs) into a dynamic 3D mutation movie providing a qualitative perspective on the mutational landscape of a protein. By default, the tool first generates all possible variants of the sequence reachable through SAVs (L*19 for proteins with L residues). Next, it predicts the structural conformation for all L*19 variants using state-of-the-art models. Finally, it visualizes the mutation matrix and produces a color-coded 3D animation. Alternatively, users can input other types of variants, e.g., from experimental structures. CONCLUSION MutAmore samples alternative protein configurations to study the dynamical space accessible from SAVs in the post-AlphaFold2 era of structural biology. As the field shifts towards the exploration of alternative conformations of proteins, MutAmore aids in the understanding of the structural impact of mutations by providing a flexible pipeline for the generation of protein mutation movies using current and future structure prediction models.
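The first pipeline step described in this abstract, generating every variant reachable through a single amino-acid variation, can be illustrated with a minimal sketch. This is not MutAmore's code: the function name `single_amino_acid_variants` and the `X<pos>Y` labels are hypothetical, and the sketch assumes the input contains only the 20 standard residues (otherwise a position would yield 20 variants rather than 19).

```python
# Illustrative sketch only; names and label format are assumptions, not MutAmore's API.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def single_amino_acid_variants(sequence):
    """Yield (label, variant_sequence) for every single amino-acid variation."""
    for pos, wild_type in enumerate(sequence):
        for mutant in AMINO_ACIDS:
            if mutant == wild_type:
                continue  # skip the unchanged residue, leaving 19 substitutions
            label = f"{wild_type}{pos + 1}{mutant}"  # 1-based position, e.g. M1A
            yield label, sequence[:pos] + mutant + sequence[pos + 1:]


# Toy three-residue sequence: L = 3, so L*19 = 57 variants.
variants = dict(single_amino_acid_variants("MKT"))
print(len(variants))
```

The count grows linearly with sequence length, so the structure prediction step that follows is what dominates the cost for long proteins.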
Affiliation(s)
- Konstantin Weissenow
- Department of Informatics, Bioinformatics and Computational Biology i12, TUM (Technical University of Munich), Boltzmannstr. 3, 85748, Garching, Munich, Germany.
- TUM Graduate School, Center of Doctoral Studies in Informatics and Its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany.
- Burkhard Rost
- Department of Informatics, Bioinformatics and Computational Biology i12, TUM (Technical University of Munich), Boltzmannstr. 3, 85748, Garching, Munich, Germany
- Institute for Advanced Study (TUM-IAS), Lichtenbergstr. 2a, 85748, Garching, Munich, Germany
- TUM School of Life Sciences Weihenstephan (WZW), Alte Akademie 8, Freising, Germany
6
Portelli S, Heaton R, Ascher DB. Identifying Innate Resistance Hotspots for SARS-CoV-2 Antivirals Using In Silico Protein Techniques. Genes (Basel) 2023; 14:1699. [PMID: 37761839 PMCID: PMC10531314 DOI: 10.3390/genes14091699] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 07/09/2023] [Revised: 08/02/2023] [Accepted: 08/22/2023] [Indexed: 09/29/2023] Open
Abstract
The development and approval of antivirals against SARS-CoV-2 has further equipped clinicians with treatment strategies against the COVID-19 pandemic, reducing deaths post-infection. Extensive clinical use of antivirals, however, can impart additional selective pressure, leading to the emergence of antiviral resistance. While we have previously characterized possible effects of circulating SARS-CoV-2 missense mutations on proteome function and stability, their direct effects on the novel antivirals remain unexplored. To address this, we have computationally calculated the consequences of mutations in the antiviral targets, RNA-dependent RNA polymerase and main protease, on target stability and interactions with their antiviral, nucleic acids, and other proteins. By analyzing circulating variants prior to antiviral approval, this work highlighted the inherent resistance potential of different genome regions. Namely, within the main protease binding site, missense mutations imparted a lower fitness cost, while the opposite was noted for the RNA-dependent RNA polymerase binding site. This suggests that resistance to nirmatrelvir/ritonavir combination treatment is more likely to occur and proliferate than that to molnupiravir. These insights are crucial both clinically in drug stewardship, and preclinically in the identification of less mutable targets for novel therapeutic design.
Affiliation(s)
- Stephanie Portelli
- School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, QLD 4072, Australia
- Baker Heart and Diabetes Institute, 75 Commercial Road, Melbourne, VIC 3004, Australia
- Ruby Heaton
- School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, QLD 4072, Australia
- David B. Ascher
- School of Chemistry and Molecular Biosciences, The University of Queensland, St Lucia, QLD 4072, Australia
- Baker Heart and Diabetes Institute, 75 Commercial Road, Melbourne, VIC 3004, Australia
7
Ascher DB, Kaminskas LM, Myung Y, Pires DEV. Using Graph-Based Signatures to Guide Rational Antibody Engineering. Methods Mol Biol 2023; 2552:375-397. [PMID: 36346604 DOI: 10.1007/978-1-0716-2609-2_21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/16/2023]
Abstract
Antibodies are essential experimental and diagnostic tools and as biotherapeutics have significantly advanced our ability to treat a range of diseases. With recent innovations in computational tools to guide protein engineering, we can now rationally design better antibodies with improved efficacy, stability, and pharmacokinetics. Here, we describe the use of the mCSM web-based in silico suite, which uses graph-based signatures to rapidly identify the structural and functional consequences of mutations, to guide rational antibody engineering to improve stability, affinity, and specificity.
Affiliation(s)
- David B Ascher
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia
- Department of Biochemistry, Cambridge University, Cambridge, UK
- School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia, Queensland, Australia
- Lisa M Kaminskas
- School of Biological Sciences, University of Queensland, St Lucia, QLD, Australia
- Yoochan Myung
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia
- School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia, Queensland, Australia
- Douglas E V Pires
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC, Australia.
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia.
- School of Computing and Information Systems, University of Melbourne, Parkville, VIC, Australia.
8
Sugawara-Mikami M, Tanigawa K, Kawashima A, Kiriya M, Nakamura Y, Fujiwara Y, Suzuki K. Pathogenicity and virulence of Mycobacterium leprae. Virulence 2022; 13:1985-2011. [PMID: 36326715 PMCID: PMC9635560 DOI: 10.1080/21505594.2022.2141987] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Indexed: 11/06/2022] Open
Abstract
Leprosy is caused by Mycobacterium leprae (M. leprae) and M. lepromatosis, obligate intracellular organisms, and over 200,000 new cases occur every year. M. leprae parasitizes histiocytes (skin macrophages) and Schwann cells in the peripheral nerves. Although leprosy can be treated by multidrug therapy, some patients relapse or have a prolonged clinical course and/or experience leprosy reactions. These varying outcomes depend on host factors such as immune responses against bacterial components that determine a range of symptoms. To understand these host responses, knowledge of the mechanisms by which M. leprae parasitizes host cells is important. This article describes the characteristics of leprosy through bacteriology, genetics, epidemiology, immunology, animal models, routes of infection, and clinical findings. It also discusses recent diagnostic methods, treatment, and measures according to the World Health Organization (WHO), including prevention. Recently, the antibacterial activities of anti-hyperlipidaemia agents against other pathogens, such as M. tuberculosis and Staphylococcus aureus, have been investigated. Our laboratory has been focused on the metabolism of lipids which constitute the cell wall of M. leprae. Our findings may be useful for the development of future treatments.
Affiliation(s)
- Mariko Sugawara-Mikami
- Department of Clinical Laboratory Science, Faculty of Medical Technology, Teikyo University, Tokyo, Japan; West Yokohama Sugawara Dermatology Clinic, Yokohama, Japan
- Kazunari Tanigawa
- Department of Molecular Pharmaceutics, Faculty of Pharma-Science, Teikyo University, Tokyo, Japan
- Akira Kawashima
- Department of Clinical Laboratory Science, Faculty of Medical Technology, Teikyo University, Tokyo, Japan
- Mitsuo Kiriya
- Department of Clinical Laboratory Science, Faculty of Medical Technology, Teikyo University, Tokyo, Japan
- Yasuhiro Nakamura
- Department of Molecular Pharmaceutics, Faculty of Pharma-Science, Teikyo University, Tokyo, Japan
- Yoko Fujiwara
- Department of Clinical Laboratory Science, Faculty of Medical Technology, Teikyo University, Tokyo, Japan
- Koichi Suzuki
- Department of Clinical Laboratory Science, Faculty of Medical Technology, Teikyo University, Tokyo, Japan
9
Wang C, Wu Z, Jiang H, Shi Y, Zhang W, Zhang M, Wang H. Global prevalence of resistance to rifampicin in Mycobacterium leprae: A meta-analysis. J Glob Antimicrob Resist 2022; 31:119-127. [PMID: 36055549 DOI: 10.1016/j.jgar.2022.08.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/31/2022] [Revised: 08/15/2022] [Accepted: 08/25/2022] [Indexed: 12/30/2022] Open
Abstract
OBJECTIVES As the only bactericidal drug in multidrug therapy is rifampicin, monitoring of antimicrobial resistance is important in leprosy patients. Therefore, we conducted a meta-analysis on the resistance of Mycobacterium leprae (M. leprae) to rifampicin and estimated drug resistance in different therapeutic states and regions. METHODS Embase, Medline, PubMed, and Web of Science were searched to identify studies between 1 January 1993 and 1 January 2022. Two independent reviewers extracted study data. Pooled cumulative incidences were computed using random-effects meta-analyses. RESULTS We included 32 papers describing the resistance of M. leprae to rifampicin (pooled cumulative incidences, 11% [95% confidence interval {CI}, 7% to 15%]). Therapeutic states and regional distribution were obtained for subgroup analyses. A total of 51 of 1135 new cases (pooled incidence, 10% [95% CI, 5% to 16%]) and 81 of 733 relapsed cases (pooled incidence, 20% [95% CI, 13% to 27%]) had rifampicin resistance. A total of 139 participants, including 11 patients with rifampicin resistance (pooled incidence, 42% [95% CI, -21% to 105%]), were nonresponsive and intractable cases. The incidence of rifampicin resistance was highest in the Western Pacific (pooled incidence, 21% [95% CI, 13% to 29%]) and lowest in the Americas (pooled incidence, 4% [95% CI, 1% to 7%]). CONCLUSIONS Drug resistance testing and a robust and rigorous surveillance system are recommended to detect the prevalence of drug resistance in leprosy.
Affiliation(s)
- Chen Wang
- Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, China; Institute of Dermatology, Chinese Academy of Medical Sciences and Peking Union Medical College, Nanjing, China; Jiangsu Key Laboratory of Molecular Biology for Skin Diseases and STIs, National Centre for Leprosy Control, China CDC, Nanjing, China
- Ziwei Wu
- Center for Global Health, School of Public Health, Nanjing Medical University; Institute of Dermatology, Chinese Academy of Medical Sciences and Peking Union Medical College, Nanjing, China; Jiangsu Key Laboratory of Molecular Biology for Skin Diseases and STIs, National Centre for Leprosy Control, China CDC, Nanjing, China
- Haiqin Jiang
- Institute of Dermatology, Chinese Academy of Medical Sciences and Peking Union Medical College, Nanjing, China; Jiangsu Key Laboratory of Molecular Biology for Skin Diseases and STIs, National Centre for Leprosy Control, China CDC, Nanjing, China
- Ying Shi
- Institute of Dermatology, Chinese Academy of Medical Sciences and Peking Union Medical College, Nanjing, China; Jiangsu Key Laboratory of Molecular Biology for Skin Diseases and STIs, National Centre for Leprosy Control, China CDC, Nanjing, China
- Wenyue Zhang
- Institute of Dermatology, Chinese Academy of Medical Sciences and Peking Union Medical College, Nanjing, China; Jiangsu Key Laboratory of Molecular Biology for Skin Diseases and STIs, National Centre for Leprosy Control, China CDC, Nanjing, China
- Mengyan Zhang
- Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, China; Institute of Dermatology, Chinese Academy of Medical Sciences and Peking Union Medical College, Nanjing, China; Jiangsu Key Laboratory of Molecular Biology for Skin Diseases and STIs, National Centre for Leprosy Control, China CDC, Nanjing, China
- Hongsheng Wang
- Department of Epidemiology and Biostatistics, School of Public Health, Nanjing Medical University, Nanjing, China; Center for Global Health, School of Public Health, Nanjing Medical University; Institute of Dermatology, Chinese Academy of Medical Sciences and Peking Union Medical College, Nanjing, China; Jiangsu Key Laboratory of Molecular Biology for Skin Diseases and STIs, National Centre for Leprosy Control, China CDC, Nanjing, China
10
Pan Q, Nguyen TB, Ascher DB, Pires DEV. Systematic evaluation of computational tools to predict the effects of mutations on protein stability in the absence of experimental structures. Brief Bioinform 2022; 23:bbac025. [PMID: 35189634 PMCID: PMC9155634 DOI: 10.1093/bib/bbac025] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Received: 07/19/2021] [Revised: 01/13/2022] [Accepted: 01/30/2022] [Indexed: 12/26/2022] Open
Abstract
Changes in protein sequence can have dramatic effects on how proteins fold, their stability and dynamics. Over the last 20 years, pioneering methods have been developed to try to estimate the effects of missense mutations on protein stability, leveraging growing availability of protein 3D structures. These, however, have been developed and validated using experimentally derived structures and biophysical measurements. A large proportion of protein structures remain to be experimentally elucidated and, while many studies have based their conclusions on predictions made using homology models, there has been no systematic evaluation of the reliability of these tools in the absence of experimental structural data. We have, therefore, systematically investigated the performance and robustness of ten widely used structural methods when presented with homology models built using templates at a range of sequence identity levels (from 15% to 95%) and contrasted performance with sequence-based tools, as a baseline. We found there is indeed performance deterioration on homology models built using templates with sequence identity below 40%, where sequence-based tools might become preferable. This was most marked for mutations in solvent exposed residues and stabilizing mutations. As structure prediction tools improve, the reliability of these predictors is expected to follow, however we strongly suggest that these factors should be taken into consideration when interpreting results from structure-based predictors of mutation effects on protein stability.
Affiliation(s)
- Qisheng Pan
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria 3004, Australia
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane City, Queensland 4072, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, 30 Flemington Rd, Parkville, Victoria 3052, Australia
- Thanh Binh Nguyen
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria 3004, Australia
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane City, Queensland 4072, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, 30 Flemington Rd, Parkville, Victoria 3052, Australia
- David B Ascher
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria 3004, Australia
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane City, Queensland 4072, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, 30 Flemington Rd, Parkville, Victoria 3052, Australia
- Department of Biochemistry, University of Cambridge, 80 Tennis Ct Rd, Cambridge CB2 1GA, UK
- Douglas E V Pires
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria 3004, Australia
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane City, Queensland 4072, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, 30 Flemington Rd, Parkville, Victoria 3052, Australia
- School of Computing and Information Systems, University of Melbourne, Melbourne, Victoria 3053, Australia
11
Nguyen TB, Myung Y, de Sá AGC, Pires DEV, Ascher DB. mmCSM-NA: accurately predicting effects of single and multiple mutations on protein-nucleic acid binding affinity. NAR Genom Bioinform 2021; 3:lqab109. [PMID: 34805992 PMCID: PMC8600011 DOI: 10.1093/nargab/lqab109] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 04/11/2021] [Revised: 09/20/2021] [Accepted: 10/27/2021] [Indexed: 02/02/2023] Open
Abstract
While protein-nucleic acid interactions are pivotal for many crucial biological processes, limited experimental data has made the development of computational approaches to characterise these interactions a challenge. Consequently, most approaches to understand the effects of missense mutations on protein-nucleic acid affinity have focused on single-point mutations and have presented a limited performance on independent data sets. To overcome this, we have curated the largest dataset of experimentally measured effects of mutations on nucleic acid binding affinity to date, encompassing 856 single-point mutations and 141 multiple-point mutations across 155 experimentally solved complexes. This was used in combination with an optimized version of our graph-based signatures to develop mmCSM-NA (http://biosig.unimelb.edu.au/mmcsm_na), the first scalable method capable of quantitatively and accurately predicting the effects of multiple-point mutations on nucleic acid binding affinities. mmCSM-NA obtained a Pearson's correlation of up to 0.67 (RMSE of 1.06 kcal/mol) on single-point mutations under cross-validation, and up to 0.65 on independent non-redundant datasets of multiple-point mutations (RMSE of 1.12 kcal/mol), outperforming similar tools. mmCSM-NA is freely available as an easy-to-use web-server and API. We believe it will be an invaluable tool to shed light on the role of mutations affecting protein-nucleic acid interactions in diseases.
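The two metrics quoted in this abstract, Pearson's correlation and RMSE between predicted and experimentally measured affinity changes, can be computed as in the following minimal sketch. It is not the authors' evaluation code: the function names are hypothetical and the toy values are invented purely for illustration, not taken from the paper.

```python
# Illustrative sketch only; function names and toy values are assumptions.
import math


def pearson_r(xs, ys):
    """Pearson's correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def rmse(predicted, measured):
    """Root-mean-square error, in the same units as the inputs (e.g. kcal/mol)."""
    squared = [(p - m) ** 2 for p, m in zip(predicted, measured)]
    return math.sqrt(sum(squared) / len(squared))


# Invented toy values standing in for predicted vs. measured changes in affinity.
measured = [-1.2, 0.3, -0.8, 1.5, -2.1]
predicted = [-0.9, 0.1, -1.1, 1.0, -1.6]
print(round(pearson_r(predicted, measured), 3), round(rmse(predicted, measured), 3))
```

The two numbers answer different questions: correlation measures whether predictions rank mutations correctly, while RMSE measures how far off the predicted magnitudes are, which is why abstracts of this kind usually report both.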
Affiliation(s)
- Thanh Binh Nguyen
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- Yoochan Myung
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- Alex G C de Sá
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- Douglas E V Pires
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- School of Computing and Information Systems, University of Melbourne, Melbourne, Victoria, Australia
- David B Ascher
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- Department of Biochemistry, University of Cambridge, Cambridge, UK
12
Swain SS, Sahoo G, Mahapatra PK, Panda SK. Disease burden and current therapeutical status of leprosy with special emphasis on phytochemicals. Curr Top Med Chem 2021; 22:1611-1625. [PMID: 34503409 DOI: 10.2174/1568026621666210909162435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/30/2021] [Revised: 08/05/2021] [Accepted: 08/28/2021] [Indexed: 11/22/2022]
Abstract
BACKGROUND Leprosy (Hansen's disease) is a neglected tropical disease affecting millions of people globally. The combined formulations of dapsone, rifampicin and clofazimine (multidrug therapy, MDT) are only supportive in the early stage of detection, while "reemergence" is a significant problem. There is still a need to develop newer antileprosy molecules of either natural or (semi)synthetic origin. OBJECTIVE The review intends to present the latest developments in disease prevalence, available therapeutic interventions and the possibility of identifying new molecules from phytoextracts. METHODS Literature on the use of plant extracts and their active components to treat leprosy was searched. Selected phytoconstituents were subjected to a molecular docking study on both wild and mutant types of Mycobacterium leprae. Since the structure of M. leprae dihydropteroate synthase (DHPS) is not available in the Protein Data Bank (PDB), it was modelled by homology modelling and validated with the Ramachandran plot along with other bioinformatics approaches. Two mutations were introduced at codons 53 (Thr to Ile) and 55 (Pro to Leu) for docking against twenty-five selected phytoconstituents reported from eight plants that recorded effective anti-leprosy activity. The chemical structures of the phytochemicals and the standard dapsone structure were retrieved from the PubChem database and prepared accordingly for the docking study with the virtual-screening platform of PyRx-AutoDock 4.1. RESULTS Based on the docking score (kcal/mol), most of the phytochemicals exhibited a higher docking score than dapsone. Asiaticoside, an active saponin (-11.3, -11.2 and -11.2 kcal/mol), proved to be the lead phytochemical against both wild and mutant types of DHPS. Some other useful phytoconstituents include echinocystic acid (-9.6, -9.5 and -9.5 kcal/mol), neobavaisoflavone (-9.2, -9.0 and -9.0 kcal/mol), boswellic acid (-8.90, -8.90 and -8.90 kcal/mol), asiatic acid (-8.9, -8.8 and -8.9 kcal/mol), corylifol A (-8.8, 8.0, and -8.0), etc. Overall, the computational predictions support the previously reported active phytoextracts of Centella asiatica (L.) Urban, Albizia amara (Roxb.) Boivin, Boswellia serrata Roxb. and Psoralea corylifolia L. to be effective against leprosy. CONCLUSION A very small percentage of well-known plants have been evaluated scientifically for antileprosy activity. Further in vivo experiments are essential to confirm the anti-leprosy properties of such useful phytochemicals.
Affiliation(s)
- Shasank Sekhar Swain
- Division of Microbiology & NCDs, ICMR-Regional Medical Research Centre, Bhubaneswar-751023, Odisha, India
- Gunanidhi Sahoo
- Department of Zoology, Utkal University, Vani Vihar, Bhubaneswar-751004, Odisha, India
- Sujogya Kumar Panda
- Department of Zoology, Utkal University, Vani Vihar, Bhubaneswar-751004, Odisha, India
13
Baseri N, Najar-Peerayeh S, Bakhshi B. Investigating the effect of an identified mutation within a critical site of PAS domain of WalK protein in a vancomycin-intermediate resistant Staphylococcus aureus by computational approaches. BMC Microbiol 2021; 21:240. [PMID: 34474665 PMCID: PMC8414773 DOI: 10.1186/s12866-021-02298-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 05/30/2021] [Accepted: 08/23/2021] [Indexed: 11/15/2022] Open
Abstract
Background Vancomycin-intermediate resistant Staphylococcus aureus (VISA) is becoming a common cause of nosocomial infections worldwide. VISA isolates develop by unclear molecular mechanisms via mutations in several genes, including walKR. Although studies have verified some of these mutations, few have paid attention to the importance of molecular modelling of mutations. Method For genomic and transcriptomic comparisons between a laboratory-derived VISA strain and its parental strain, Sanger sequencing and reverse transcriptase quantitative PCR (RT-qPCR) methods were used, respectively. After structural protein mapping of the detected mutation, mutation effects were analyzed using molecular computational approaches and crystal structures of related proteins. Results A WalK-H364R mutation occurred in a functional zinc ion coordinating residue within the PAS domain in the VISA strain. WalK-H364R was predicted to destabilize the protein and decrease WalK interactions with proteins and nucleic acids. The RT-qPCR method showed downregulation of walKR, WalKR-regulated autolysins, and the agr locus. Conclusion Overall, the WalK-H364R mutation within a critical metal-coordinating site was presumably related to VISA development. We assume that the WalK-H364R mutation resulted in deleterious effects on the protein, which was verified by walKR gene expression changes. Therefore, molecular modelling provides detailed insight into the molecular mechanism of VISA development, in particular where allelic replacement experiments are not readily available. Supplementary Information The online version contains supplementary material available at 10.1186/s12866-021-02298-9.
Affiliation(s)
- Neda Baseri
- Department of Bacteriology, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran
- Shahin Najar-Peerayeh
- Department of Bacteriology, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran
- Bita Bakhshi
- Department of Bacteriology, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran
14
Rodrigues CHM, Pires DEV, Ascher DB. mmCSM-PPI: predicting the effects of multiple point mutations on protein-protein interactions. Nucleic Acids Res 2021; 49:W417-W424. [PMID: 33893812 PMCID: PMC8262703 DOI: 10.1093/nar/gkab273] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Received: 01/27/2021] [Revised: 03/18/2021] [Accepted: 04/15/2021] [Indexed: 11/16/2022] Open
Abstract
Protein-protein interactions play a crucial role in all cellular functions and biological processes and mutations leading to their disruption are enriched in many diseases. While a number of computational methods to assess the effects of variants on protein-protein binding affinity have been proposed, they are in general limited to the analysis of single point mutations and have been shown to perform poorly on independent test sets. Here, we present mmCSM-PPI, a scalable and effective machine learning model for accurately assessing changes in protein-protein binding affinity caused by single and multiple missense mutations. We expanded our well-established graph-based signatures in order to capture physicochemical and geometrical properties of multiple wild-type residue environments and integrated them with substitution scores and dynamics terms from normal mode analysis. mmCSM-PPI was able to achieve a Pearson's correlation of up to 0.75 (RMSE = 1.64 kcal/mol) under 10-fold cross-validation and 0.70 (RMSE = 2.06 kcal/mol) on a non-redundant blind test, outperforming existing methods. Our method is freely available as a user-friendly and easy-to-use web server and API at http://biosig.unimelb.edu.au/mmcsm_ppi.
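The Pearson's correlation and RMSE values quoted in this abstract are standard regression metrics. As a minimal sketch, assuming paired lists of experimental and predicted changes in binding affinity in kcal/mol (the function names and the toy values are illustrative assumptions and are not taken from mmCSM-PPI), they could be computed like this:

```python
import math


def pearson_r(observed, predicted):
    """Pearson's correlation coefficient between two equal-length sequences."""
    n = len(observed)
    mean_obs = sum(observed) / n
    mean_pred = sum(predicted) / n
    # Covariance numerator and the two standard-deviation terms (unnormalised).
    cov = sum((o - mean_obs) * (p - mean_pred) for o, p in zip(observed, predicted))
    spread_obs = math.sqrt(sum((o - mean_obs) ** 2 for o in observed))
    spread_pred = math.sqrt(sum((p - mean_pred) ** 2 for p in predicted))
    return cov / (spread_obs * spread_pred)


def rmse(observed, predicted):
    """Root-mean-square error, in the same units as the inputs."""
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))


# Hypothetical toy values, for illustration only.
experimental = [1.0, 2.0, 3.0, 4.0]
predicted = [2.0, 4.0, 6.0, 8.0]

print(f"r = {pearson_r(experimental, predicted):.2f}")
print(f"RMSE = {rmse(experimental, predicted):.2f} kcal/mol")
```

The toy data also show why the two numbers are usually reported together: a perfectly linear but mis-scaled prediction still has a high correlation while its RMSE remains large.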
Affiliation(s)
- Carlos H M Rodrigues
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Pharmacology, University of Melbourne, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- Douglas E V Pires
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Pharmacology, University of Melbourne, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- School of Computing and Information Systems, University of Melbourne, Melbourne, Victoria, Australia
- David B Ascher
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Pharmacology, University of Melbourne, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- Department of Biochemistry, University of Cambridge, Cambridge, UK
15
Portelli S, Barr L, de Sá AG, Pires DE, Ascher DB. Distinguishing between PTEN clinical phenotypes through mutation analysis. Comput Struct Biotechnol J 2021; 19:3097-3109. [PMID: 34141133 PMCID: PMC8180946 DOI: 10.1016/j.csbj.2021.05.028] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Received: 01/31/2021] [Revised: 04/29/2021] [Accepted: 05/19/2021] [Indexed: 12/28/2022] Open
Abstract
Phosphatase and tensin homolog on chromosome ten (PTEN) germline mutations are associated with an overarching condition known as PTEN hamartoma tumor syndrome. Clinical phenotypes associated with this syndrome range from macrocephaly and autism spectrum disorder (ASD) to Cowden syndrome, which manifests as multiple noncancerous tumor-like growths (hamartomas) and an increased predisposition to certain cancers. The basis by which mutations might lead to these very diverse phenotypic outcomes is, however, unclear. Here we show that, by considering the molecular consequences of mutations in PTEN on protein structure and function, we can accurately distinguish PTEN mutations exhibiting different phenotypes. Changes in phosphatase activity, protein stability, and intramolecular interactions appeared to be major drivers of clinical phenotype, with cancer-associated variants leading to the most drastic changes, while ASD and non-pathogenic variants were associated with milder and neutral changes, respectively. Importantly, we show via saturation mutagenesis that more than half of variants of unknown significance could be associated with disease phenotypes, while over half of Cowden syndrome mutations likely lead to cancer. These insights can assist in exploring potentially important clinical outcomes delineated by PTEN variation.
Affiliation(s)
- Stephanie Portelli
- Structural Biology and Bioinformatics, Department of Biochemistry, University of Melbourne, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Lucy Barr
- Structural Biology and Bioinformatics, Department of Biochemistry, University of Melbourne, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Alex G.C. de Sá
- Structural Biology and Bioinformatics, Department of Biochemistry, University of Melbourne, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Baker Department of Cardiometabolic Health, Melbourne Medical School, University of Melbourne, Melbourne, Victoria, Australia
- Douglas E.V. Pires
- Structural Biology and Bioinformatics, Department of Biochemistry, University of Melbourne, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- School of Computing and Information Systems, University of Melbourne, Melbourne, Victoria, Australia
- David B. Ascher
- Structural Biology and Bioinformatics, Department of Biochemistry, University of Melbourne, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Baker Department of Cardiometabolic Health, Melbourne Medical School, University of Melbourne, Melbourne, Victoria, Australia
- Department of Biochemistry, University of Cambridge, 80 Tennis Ct Rd, Cambridge CB2 1GA, United Kingdom
16
Vedithi SC, Malhotra S, Acebrón-García-de-Eulate M, Matusevicius M, Torres PHM, Blundell TL. Structure-Guided Computational Approaches to Unravel Druggable Proteomic Landscape of Mycobacterium leprae. Front Mol Biosci 2021; 8:663301. [PMID: 34026836 PMCID: PMC8138464 DOI: 10.3389/fmolb.2021.663301] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 02/02/2021] [Accepted: 04/12/2021] [Indexed: 02/02/2023] Open
Abstract
Leprosy, caused by Mycobacterium leprae (M. leprae), is treated with a multidrug regimen comprising Dapsone, Rifampicin, and Clofazimine. These drugs exhibit bacteriostatic, bactericidal and anti-inflammatory properties, respectively, and control the dissemination of infection in the host. However, the current treatment is not cost-effective, does not favor patient compliance due to its long duration (12 months) and does not protect against the incumbent nerve damage, which is a severe leprosy complication. The chronic infectious peripheral neuropathy associated with the disease is primarily due to the bacterial components infiltrating the Schwann cells that protect neuronal axons, thereby inducing a demyelinating phenotype. There is a need to discover novel/repurposed drugs that can act as short-duration and effective alternatives to the existing treatment regimens, preventing nerve damage and consequent disability associated with the disease. Mycobacterium leprae is an obligate pathogen, which makes the bacillus experimentally intractable to cultivate in vitro and limits drug discovery efforts to repositioning screens in mouse footpad models. The dearth of knowledge related to structural proteomics of M. leprae, coupled with emerging antimicrobial resistance to all three drugs in the multidrug therapy, poses a need for concerted novel drug discovery efforts. A comprehensive understanding of the proteomic landscape of M. leprae is indispensable to unravel druggable targets that are essential for bacterial survival and predilection for human neuronal Schwann cells. Of the 1,614 protein-coding genes in the genome of M. leprae, only 17 protein structures are available in the Protein Data Bank. In this review, we discuss efforts made to model the proteome of M. leprae using a suite of software for protein modeling that has been developed in the Blundell laboratory. Precise template selection by employing sequence-structure homology recognition software, multi-template modeling of the monomeric models and accurate quality assessment are the hallmarks of the modeling process. Tools that map interfaces and enable building of homo-oligomers are discussed in the context of interface stability. Other software is described to determine the druggable proteome by using information related to the chokepoint analysis of the metabolic pathways, gene essentiality, homology to human proteins, functional sites, druggable pockets and fragment hotspot maps.
Affiliation(s)
- Sundeep Chaitanya Vedithi (corresponding author)
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
- Sony Malhotra
- Rutherford Appleton Laboratory, Science and Technology Facilities Council, Oxon, United Kingdom
- Pedro Henrique Monteiro Torres
- Laboratório de Modelagem e Dinâmica Molecular, Instituto de Biofísica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
- Tom L. Blundell (corresponding author)
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
17
Xavier JS, Nguyen TB, Karmarkar M, Portelli S, Rezende PM, Velloso JPL, Ascher DB, Pires DEV. ThermoMutDB: a thermodynamic database for missense mutations. Nucleic Acids Res 2021; 49:D475-D479. [PMID: 33095862 PMCID: PMC7778973 DOI: 10.1093/nar/gkaa925] [Citation(s) in RCA: 40] [Impact Index Per Article: 13.3] [Received: 08/15/2020] [Revised: 09/21/2020] [Accepted: 10/12/2020] [Indexed: 01/17/2023] Open
Abstract
Proteins are intricate, dynamic structures, and small changes in their amino acid sequences can lead to large effects on their folding, stability and dynamics. To facilitate the further development and evaluation of methods to predict these changes, we have developed ThermoMutDB, a manually curated database containing >14,669 experimental data points on thermodynamic parameters for wild-type and mutant proteins. This represents an increase of 83% in unique mutations over previous databases and includes thermodynamic information on 204 new proteins. During manual curation we have also corrected annotation errors in previously curated entries. For each entry, we have included information on the unfolding Gibbs free energy and melting temperature change, and have linked entries to available experimental structural information. ThermoMutDB allows users to contribute new data points and supports programmatic access to the database via a RESTful API. ThermoMutDB is freely available at: http://biosig.unimelb.edu.au/thermomutdb.
Affiliation(s)
- Joicymara S Xavier
- Institute of Agricultural Sciences, Universidade Federal dos Vales do Jequitinhonha e Mucuri; Instituto René Rachou, Fundação Oswaldo Cruz
- Malancha Karmarkar
- Bio 21 Institute, University of Melbourne; Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute
- Stephanie Portelli
- Bio 21 Institute, University of Melbourne; Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute
- David B Ascher
- Bio 21 Institute, University of Melbourne; Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute; Department of Biochemistry, University of Cambridge
- Douglas E V Pires
- Bio 21 Institute, University of Melbourne; Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute; School of Computing and Information Systems, University of Melbourne
18
HARP: a database of structural impacts of systematic missense mutations in drug targets of Mycobacterium leprae. Comput Struct Biotechnol J 2020; 18:3692-3704. [PMID: 33304465 PMCID: PMC7711215 DOI: 10.1016/j.csbj.2020.11.013] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Received: 09/07/2020] [Accepted: 11/08/2020] [Indexed: 12/20/2022] Open
Abstract
Computational Saturation Mutagenesis is an in-silico approach that employs systematic mutagenesis of each amino acid residue in the protein to all other amino acid types, and predicts changes in thermodynamic stability and affinity to the other subunits/protein counterparts, ligands and nucleic acid molecules. The data thus generated are useful in understanding the functional consequences of mutations in antimicrobial resistance phenotypes. In this study, we applied computational saturation mutagenesis to three important drug targets in Mycobacterium leprae (M. leprae) for the drugs dapsone, rifampin and ofloxacin, namely Dihydropteroate Synthase (DHPS), RNA Polymerase (RNAP) and DNA Gyrase (GYR), respectively. M. leprae causes leprosy and is an obligate intracellular bacillus with limited protein structural information associating mutations with phenotypic resistance outcomes in leprosy. Experimentally solved structures of DHPS, RNAP and GYR of M. leprae are not available in the Protein Data Bank. We therefore modelled the structures of these proteins using template-based comparative modelling and introduced systematic mutations in each model, generating 80,902 mutations and mutant structures for all three proteins. Impacts of mutations on stability and protein-subunit, protein-ligand and protein-nucleic acid affinities were computed using various in-house developed and other published protein stability and affinity prediction software. A consensus impact was estimated for each mutation using qualitative scoring metrics for physicochemical properties and by a categorical grouping of stability and affinity predictions. We developed a web database named HARP (a database of Hansen's Disease Antimicrobial Resistance Profiles), which is accessible at https://harp-leprosy.org and provides the details of each of these predictions.
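The enumeration step of computational saturation mutagenesis described in this abstract, where every residue is substituted by each of the other 19 standard amino acids, can be sketched as follows. This is only an illustration under stated assumptions: the function name, the residue numbering and the three-residue toy sequence are hypothetical, and no structure modelling or stability prediction is attempted.

```python
# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def saturation_mutations(sequence, start=1):
    """Yield every single-residue substitution, e.g. 'T2I'.

    `start` is the residue number assigned to the first position
    (an assumption made for this sketch).
    """
    for offset, wild_type in enumerate(sequence):
        if wild_type not in AMINO_ACIDS:
            raise ValueError(f"non-standard residue {wild_type!r} at position {start + offset}")
        for mutant in AMINO_ACIDS:
            if mutant != wild_type:  # skip the silent "mutation" to itself
                yield f"{wild_type}{start + offset}{mutant}"


# Hypothetical three-residue toy sequence, for illustration only.
toy_sequence = "MTP"
mutations = list(saturation_mutations(toy_sequence))

# Each of the 3 positions has 19 possible substitutions.
print(len(mutations))  # 57
print(mutations[:3])
```

In a full pipeline each generated label would then be passed to a stability or affinity predictor; that step is deliberately left out here.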
19
Tunstall T, Portelli S, Phelan J, Clark TG, Ascher DB, Furnham N. Combining structure and genomics to understand antimicrobial resistance. Comput Struct Biotechnol J 2020; 18:3377-3394. [PMID: 33294134 PMCID: PMC7683289 DOI: 10.1016/j.csbj.2020.10.017] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Received: 05/18/2020] [Revised: 10/15/2020] [Accepted: 10/17/2020] [Indexed: 02/07/2023] Open
Abstract
Antimicrobials against bacterial, viral and parasitic pathogens have transformed human and animal health. Nevertheless, their widespread use (and misuse) has led to the emergence of antimicrobial resistance (AMR), which poses a potentially catastrophic threat to public health and animal husbandry. There are several routes, both intrinsic and acquired, by which AMR can develop. One major route is through non-synonymous single nucleotide polymorphisms (nsSNPs) in coding regions. Large-scale genomic studies using high-throughput sequencing data have provided powerful new ways to rapidly detect and respond to such genetic mutations linked to AMR. However, these studies are limited in their mechanistic insight. Computational tools can rapidly and inexpensively evaluate the effect of mutations on protein function and evolution. Subsequent insights can then inform experimental studies, and direct existing or new computational methods. Here we review a range of sequence and structure-based computational tools, focussing on tools successfully used to investigate mutational effects on drug targets in clinically important pathogens, particularly Mycobacterium tuberculosis. Combining genomic results with the biophysical effects of mutations can help reveal the molecular basis and consequences of resistance development. Furthermore, we summarise how such a mechanistic understanding of drug resistance can be applied to limit the impact of AMR.
Affiliation(s)
- Tanushree Tunstall
- Department of Infection Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
- Stephanie Portelli
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Australia
- Jody Phelan
- Department of Infection Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
- Taane G. Clark
- Department of Infection Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
- Department of Infectious Disease Epidemiology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
- David B. Ascher
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Australia
- Nicholas Furnham
- Department of Infection Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London WC1E 7HT, UK
20
Portelli S, Myung Y, Furnham N, Vedithi SC, Pires DEV, Ascher DB. Prediction of rifampicin resistance beyond the RRDR using structure-based machine learning approaches. Sci Rep 2020; 10:18120. [PMID: 33093532 PMCID: PMC7581776 DOI: 10.1038/s41598-020-74648-y] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Received: 07/11/2020] [Accepted: 09/21/2020] [Indexed: 01/23/2023] Open
Abstract
Rifampicin resistance is a major therapeutic challenge, particularly in tuberculosis, leprosy, P. aeruginosa and S. aureus infections, where it develops via missense mutations in the rpoB gene. Previously we have highlighted that these mutations reduce protein affinities within the RNA polymerase complex, subsequently reducing nucleic acid affinity. Here, we have used these insights to develop a computational rifampicin resistance predictor capable of identifying resistant mutations even outside the well-defined rifampicin resistance determining region (RRDR), using clinical M. tuberculosis sequencing information. Our tool correctly classified up to 90.9% of M. tuberculosis rpoB variants, with sensitivity of 92.2%, specificity of 83.6% and MCC of 0.69, outperforming the current gold-standard GeneXpert-MTB/RIF. We show our model can be translated to other clinically relevant organisms: M. leprae, P. aeruginosa and S. aureus, despite weak sequence identity. Our method was implemented as an interactive tool, SUSPECT-RIF (StrUctural Susceptibility PrEdiCTion for RIFampicin), freely available at https://biosig.unimelb.edu.au/suspect_rif/.
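Sensitivity, specificity and the Matthews correlation coefficient (MCC) reported in this abstract are all derived from a binary confusion matrix. The sketch below shows the standard formulas; the function name and the counts are hypothetical and were chosen for illustration only, so they are not the SUSPECT-RIF results.

```python
import math


def binary_metrics(tp, fp, tn, fn):
    """Return accuracy, sensitivity, specificity and MCC from confusion counts."""
    sensitivity = tp / (tp + fn)  # fraction of resistant variants detected
    specificity = tn / (tn + fp)  # fraction of susceptible variants detected
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention MCC is set to 0 when any marginal total is zero.
    mcc = (tp * tn - fp * fn) / denominator if denominator else 0.0
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "mcc": mcc,
    }


# Hypothetical counts: 100 resistant and 100 susceptible variants.
metrics = binary_metrics(tp=90, fp=20, tn=80, fn=10)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Unlike accuracy, MCC uses all four cells of the confusion matrix, which is one reason it is often preferred when the resistant and susceptible classes are unbalanced.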
Affiliation(s)
- Stephanie Portelli
- Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Victoria, 3010, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, 3004, VIC, Australia
- Yoochan Myung
- Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Victoria, 3010, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, 3004, VIC, Australia
- Nicholas Furnham
- Department of Infection Biology, London School of Hygiene and Tropical Medicine, Keppel Street, London, WC1E 7HT, UK
- Douglas E V Pires
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, 3004, VIC, Australia
- School of Computing and Information Systems, University of Melbourne, Victoria, 3010, Australia
- David B Ascher
- Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Victoria, 3010, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, 3004, VIC, Australia
- Department of Biochemistry, University of Cambridge, Cambridge, UK
21
Pires DEV, Rodrigues CHM, Ascher DB. mCSM-membrane: predicting the effects of mutations on transmembrane proteins. Nucleic Acids Res 2020; 48:W147-W153. [PMID: 32469063 PMCID: PMC7319563 DOI: 10.1093/nar/gkaa416] [Citation(s) in RCA: 62] [Impact Index Per Article: 15.5] [Received: 02/21/2020] [Revised: 05/04/2020] [Accepted: 05/28/2020] [Indexed: 12/17/2022] Open
Abstract
Significant efforts have been invested into understanding and predicting the molecular consequences of mutations in protein coding regions; however, nearly all approaches have been developed using globular, soluble proteins. These methods have been shown to translate poorly to studying the effects of mutations in membrane proteins. To fill this gap, here we report mCSM-membrane, a user-friendly web server that can be used to analyse the impacts of mutations on membrane protein stability and the likelihood of them being disease associated. mCSM-membrane derives from our well-established mutation modelling approach that uses graph-based signatures to model protein geometry and physicochemical properties for supervised learning. Our stability predictor achieved correlations of up to 0.72 and 0.67 (on cross validation and blind tests, respectively), while our pathogenicity predictor achieved a Matthews Correlation Coefficient (MCC) of up to 0.77 and 0.73, outperforming previously described methods both in predicting changes in stability and in identifying pathogenic variants. mCSM-membrane will be an invaluable and dedicated resource for investigating the effects of single-point mutations on membrane proteins through a freely available, user-friendly web server at http://biosig.unimelb.edu.au/mcsm_membrane.
Affiliation(s)
- Douglas E V Pires
- Computational Biology and Clinical Informatics, Baker Institute, Melbourne, Victoria 3004, Australia; Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC, 3052, Australia; School of Computing and Information Systems, University of Melbourne, Parkville, VIC, 3052, Australia
- Carlos H M Rodrigues
- Computational Biology and Clinical Informatics, Baker Institute, Melbourne, Victoria 3004, Australia; Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC, 3052, Australia
- David B Ascher
- Computational Biology and Clinical Informatics, Baker Institute, Melbourne, Victoria 3004, Australia; Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC, 3052, Australia; Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK
22
Myung Y, Rodrigues CHM, Ascher DB, Pires DEV. mCSM-AB2: guiding rational antibody design using graph-based signatures. Bioinformatics 2020; 36:1453-1459. [PMID: 31665262 DOI: 10.1093/bioinformatics/btz779] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Received: 05/09/2019] [Revised: 10/07/2019] [Accepted: 10/23/2019] [Indexed: 12/11/2022] Open
Abstract
MOTIVATION A lack of accurate computational tools to guide rational mutagenesis has made affinity maturation a recurrent challenge in antibody (Ab) development. We previously showed that graph-based signatures can be used to predict the effects of mutations on Ab binding affinity. RESULTS Here we present an updated and refined version of this approach, mCSM-AB2, capable of accurately modelling the effects of mutations on Ab-antigen binding affinity, through the inclusion of evolutionary and energetic terms. Using a new and expanded database of over 1800 mutations with experimental binding measurements and structural information, mCSM-AB2 achieved a Pearson's correlation of 0.73 and 0.77 across training and blind tests, respectively, outperforming available methods currently used for rational Ab engineering. AVAILABILITY AND IMPLEMENTATION mCSM-AB2 is available as a user-friendly and freely accessible web server providing rapid analysis of either individual mutations or the entire binding interface to guide rational antibody affinity maturation at http://biosig.unimelb.edu.au/mcsm_ab2. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Yoochan Myung
- Department of Biochemistry and Molecular Biology; ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, VIC 3010, Australia; Structural Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia
- Carlos H M Rodrigues
- Department of Biochemistry and Molecular Biology; ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, VIC 3010, Australia; Structural Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia
- David B Ascher
- Department of Biochemistry and Molecular Biology; ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, VIC 3010, Australia; Structural Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia; Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK
- Douglas E V Pires
- Department of Biochemistry and Molecular Biology; ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, VIC 3010, Australia; Structural Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia; School of Computing and Information Systems, University of Melbourne, Melbourne, VIC 3010, Australia
23
Rodrigues CHM, Pires DEV, Ascher DB. DynaMut2: Assessing changes in stability and flexibility upon single and multiple point missense mutations. Protein Sci 2020; 30:60-69. [PMID: 32881105 PMCID: PMC7737773 DOI: 10.1002/pro.3942] [Citation(s) in RCA: 207] [Impact Index Per Article: 51.8] [Received: 06/30/2020] [Revised: 08/27/2020] [Accepted: 08/28/2020] [Indexed: 12/11/2022]
Abstract
Predicting the effect of missense variations on protein stability and dynamics is important for understanding their role in diseases, and the link between protein structure and function. Approaches to estimate these changes have been proposed, but most only consider single‐point missense variants and a static state of the protein, and those that incorporate dynamics are computationally expensive. Here we present DynaMut2, a web server that combines Normal Mode Analysis (NMA) methods to capture protein motion and our graph‐based signatures to represent the wildtype environment to investigate the effects of single and multiple point mutations on protein stability and dynamics. DynaMut2 was able to accurately predict the effects of missense mutations on protein stability, achieving Pearson's correlation of up to 0.72 (RMSE: 1.02 kcal/mol) on single‐point and 0.64 (RMSE: 1.80 kcal/mol) on multiple‐point missense mutations across 10‐fold cross‐validation and independent blind tests. For single‐point mutations, DynaMut2 achieved comparable performance with other methods when predicting variations in Gibbs Free Energy (ΔΔG) and in melting temperature (ΔTm). We anticipate our tool to be a valuable resource for protein flexibility analysis and the study of the role of variants in disease. DynaMut2 is freely available as a web server and API at http://biosig.unimelb.edu.au/dynamut2.
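The two metrics quoted in this abstract, Pearson's correlation and RMSE, can be computed as in the minimal sketch below. It is for illustration only: the function names and the ΔΔG values are hypothetical and are not taken from DynaMut2 or its datasets.

```python
import math


def pearson_r(xs, ys):
    """Pearson's correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def rmse(xs, ys):
    """Root-mean-square error between two equal-length lists."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(xs, ys)) / len(xs))


# Hypothetical experimental and predicted ddG values (kcal/mol), invented for this sketch.
experimental = [-1.2, 0.3, -2.5, 0.8, -0.4]
predicted = [-0.9, 0.1, -1.8, 0.5, -0.7]

print(f"Pearson r = {pearson_r(experimental, predicted):.2f}")
print(f"RMSE = {rmse(experimental, predicted):.2f} kcal/mol")
```

Reporting both values is useful because correlation measures how well the ranking and trend are captured, while RMSE measures the size of the errors in the original units.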
Affiliation(s)
- Carlos H M Rodrigues
- Structural Biology and Bioinformatics, Department of Biochemistry, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia; Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Douglas E V Pires
- Structural Biology and Bioinformatics, Department of Biochemistry, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia; Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia; School of Computing and Information Systems, University of Melbourne, Melbourne, Victoria, Australia
- David B Ascher
- Structural Biology and Bioinformatics, Department of Biochemistry, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia; Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia; Department of Biochemistry, University of Cambridge, Cambridge, UK
|
24
|
Munir A, Vedithi SC, Chaplin AK, Blundell TL. Genomics, Computational Biology and Drug Discovery for Mycobacterial Infections: Fighting the Emergence of Resistance. Front Genet 2020; 11:965. [PMID: 33101362 PMCID: PMC7498718 DOI: 10.3389/fgene.2020.00965] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 05/16/2020] [Accepted: 07/31/2020] [Indexed: 12/14/2022] Open
Abstract
Tuberculosis (TB) and leprosy are mycobacterial infections caused by Mycobacterium tuberculosis and Mycobacterium leprae respectively. These diseases continue to be endemic in developing countries where the cost of new medicines presents major challenges. The situation is further exacerbated by the emergence of resistance to many front-line antibiotics. A priority now is to design new antimycobacterials that are not only effective in combatting the diseases but are also less likely to give rise to resistance. In both these respects understanding the structure of drug targets in M. tuberculosis and M. leprae is crucial. In this review we describe structure-guided approaches to understanding the impacts of mutations that give rise to antimycobacterial resistance and the use of this information in the design of new medicines.
Affiliation(s)
- Asma Munir
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
- Amanda K Chaplin
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
- Tom L Blundell
- Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
|
25
|
Thomas SE, Whitehouse AJ, Brown K, Burbaud S, Belardinelli J, Sangen J, Lahiri R, Libardo M, Gupta P, Malhotra S, Boshoff HIM, Jackson M, Abell C, Coyne A, Blundell TL, Floto RA, Mendes V. Fragment-based discovery of a new class of inhibitors targeting mycobacterial tRNA modification. Nucleic Acids Res 2020; 48:8099-8112. [PMID: 32602532 PMCID: PMC7641325 DOI: 10.1093/nar/gkaa539] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Received: 03/13/2020] [Revised: 06/10/2020] [Accepted: 06/15/2020] [Indexed: 12/13/2022] Open
Abstract
Translational frameshift errors are often deleterious to the synthesis of functional proteins and could therefore be promoted therapeutically to kill bacteria. TrmD (tRNA-(N(1)G37) methyltransferase) is an essential tRNA modification enzyme in bacteria that prevents +1 errors in the reading frame during protein translation and represents an attractive potential target for the development of new antibiotics. Here, we describe the application of a structure-guided fragment-based drug discovery approach to the design of a new class of inhibitors against TrmD in Mycobacterium abscessus. Fragment library screening, followed by structure-guided chemical elaboration of hits, led to the rapid development of drug-like molecules with potent in vitro TrmD inhibitory activity. Several of these compounds exhibit activity against planktonic M. abscessus and M. tuberculosis as well as against intracellular M. abscessus and M. leprae, indicating their potential as the basis for a novel class of broad-spectrum mycobacterial drugs.
Affiliation(s)
- Sherine E Thomas
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, UK
- Andrew J Whitehouse
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK
- Karen Brown
- University of Cambridge Molecular Immunity Unit, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, UK
- Cambridge Centre for Lung Infection, Royal Papworth Hospital, Cambridge CB2 0AY, UK
- Sophie Burbaud
- University of Cambridge Molecular Immunity Unit, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, UK
- Juan M Belardinelli
- Mycobacteria Research Laboratories, Department of Microbiology, Immunology and Pathology, Colorado State University, Fort Collins, CO, USA
- Jasper Sangen
- University of Cambridge Molecular Immunity Unit, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, UK
- Ramanuj Lahiri
- National Hansen's Disease Program, Healthcare Systems Bureau, Health Resources and Services Administration, Department of Health and Human Services, Baton Rouge, LA, USA
- Mark Daben J Libardo
- Tuberculosis Research Section, Laboratory of Clinical Immunology and Microbiology, National Institute of Allergy and Infectious Disease, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892, USA
- Pooja Gupta
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, UK
- Sony Malhotra
- Birkbeck College, University of London, Malet Street WC1E7HX, UK
- Helena I M Boshoff
- Tuberculosis Research Section, Laboratory of Clinical Immunology and Microbiology, National Institute of Allergy and Infectious Disease, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892, USA
- Mary Jackson
- Mycobacteria Research Laboratories, Department of Microbiology, Immunology and Pathology, Colorado State University, Fort Collins, CO, USA
- Chris Abell
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK
- Anthony G Coyne
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK
- Tom L Blundell
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, UK
- Rodrigo Andres Floto
- University of Cambridge Molecular Immunity Unit, MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, UK
- Cambridge Centre for Lung Infection, Royal Papworth Hospital, Cambridge CB2 0AY, UK
- Vítor Mendes
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, UK
|
26
|
Abstract
Mutations in protein-coding regions can lead to large biological changes and are associated with genetic conditions, including cancers and Mendelian diseases, as well as drug resistance. Although whole genome and exome sequencing help to elucidate potential genotype-phenotype correlations, there is a large gap between the identification of new variants and deciphering their molecular consequences. A comprehensive understanding of these mechanistic consequences is crucial to better understand and treat diseases in a more personalized and effective way. This is particularly relevant considering estimates that over 80% of mutations associated with a disease are incorrectly assumed to be causative. A thorough analysis of potential effects of mutations is required to correctly identify the molecular mechanisms of disease and enable the distinction between disease-causing and non-disease-causing variation within a gene. Here we present an overview of our integrative mutation analysis platform, which focuses on refining the current genotype-phenotype correlation methods by using the wealth of protein structural information.
|
27
|
Myung Y, Pires DEV, Ascher DB. mmCSM-AB: guiding rational antibody engineering through multiple point mutations. Nucleic Acids Res 2020; 48:W125-W131. [PMID: 32432715 PMCID: PMC7319589 DOI: 10.1093/nar/gkaa389] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Received: 03/10/2020] [Revised: 04/18/2020] [Accepted: 05/16/2020] [Indexed: 12/15/2022] Open
Abstract
While antibodies are becoming an increasingly important therapeutic class, especially in personalized medicine, their development and optimization has been largely through experimental exploration. While there have been many efforts to develop computational tools to guide rational antibody engineering, most approaches are of limited accuracy when applied to antibody design, and have largely been limited to analysing a single point mutation at a time. To overcome this gap, we have curated a dataset of 242 experimentally determined changes in binding affinity upon multiple point mutations in antibody-target complexes (89 increasing and 153 decreasing binding affinity). Here, we have shown that by using our graph-based signatures and atomic interaction information, we can accurately analyse the consequence of multi-point mutations on antigen binding affinity. Our approach outperformed other available tools across cross-validation and two independent blind tests, achieving Pearson's correlations of up to 0.95. We have implemented our new approach, mmCSM-AB, as a web-server that can help guide the process of affinity maturation in antibody design. mmCSM-AB is freely available at http://biosig.unimelb.edu.au/mmcsm_ab/.
Affiliation(s)
- Yoochan Myung
- Computational Biology and Clinical Informatics, Baker Institute, Melbourne, VIC 3004, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC 3052, Australia
- Douglas E V Pires
- Computational Biology and Clinical Informatics, Baker Institute, Melbourne, VIC 3004, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC 3052, Australia
- School of Computing and Information Systems, University of Melbourne, Parkville, VIC 3052, Australia
- David B Ascher
- Computational Biology and Clinical Informatics, Baker Institute, Melbourne, VIC 3004, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC 3052, Australia
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK
|
28
|
George J. Metabolism and interactions of antileprosy drugs. Biochem Pharmacol 2020; 177:113993. [PMID: 32339493 DOI: 10.1016/j.bcp.2020.113993] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 03/09/2020] [Accepted: 04/21/2020] [Indexed: 01/29/2023]
Abstract
Leprosy is a chronic infectious disease caused by Mycobacterium leprae that primarily affects the peripheral nervous system and extremities and is prevalent in tropical countries. Treatment for leprosy with multidrug regimens is very effective compared to monotherapy, especially in multibacillary cases. The three major antileprosy drugs currently in use are 4,4'-diaminodiphenyl sulfone (DDS, dapsone), rifampicin, and clofazimine. During multidrug therapy, the potent antibiotic rifampicin induces the metabolism of dapsone, which results in decreased plasma half-life of dapsone and its metabolites. Furthermore, rifampicin induces its own metabolism and decreases its half-life during monotherapy. Rifampicin upregulates several hepatic microsomal drug-metabolizing enzymes, especially the cytochrome P450 (CYP) family, which in turn induces the metabolism of dapsone. Clofazimine lacks significant induction of any drug-metabolizing enzyme, including the CYP family, and does not interact with dapsone metabolism. Rifampicin does not induce clofazimine metabolism during combination treatment. Administration of dapsone in the acetylated form (acedapsone) can release the drug slowly into circulation for up to 75 days and could be useful for the effective treatment of paucibacillary cases along with rifampicin. This review summarizes the major aspects of antileprosy drug metabolism and drug interactions and the role of the cytochrome P450 family of drug-metabolizing enzymes, especially CYP3A4, during multidrug regimens for the treatment of leprosy.
Affiliation(s)
- Joseph George
- Department of Biochemistry, Central Leprosy Teaching and Research Institute, Chengalpattu 603001, Tamil Nadu, India.
|
29
|
A Comprehensive Computational Platform to Guide Drug Development Using Graph-Based Signature Methods. Methods Mol Biol 2020. [PMID: 32006280 DOI: 10.1007/978-1-0716-0270-6_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/08/2023]
Abstract
High-throughput computational techniques have become invaluable tools to help improve the overall success and process efficiency of drug development while reducing its associated costs. By designing ligands tailored to specific protein structures in a disease of interest, an understanding of molecular interactions and ways to optimize them can be achieved prior to chemical synthesis. This understanding can help direct crucial chemical and biological experiments by maximizing available resources on higher quality leads. Moreover, predicting molecular binding affinity within specific biological contexts, as well as ligand pharmacokinetics and toxicities, can aid in filtering out redundant leads early on within the process. We describe a set of computational tools which can aid in drug discovery at different stages, from hit identification (EasyVS) to lead optimization and candidate selection (CSM-lig, mCSM-lig, Arpeggio, pkCSM). Incorporating these tools along the drug development process can help ensure that candidate leads are chemically and biologically feasible to become successful and tractable drugs.
|
30
|
Vedithi SC, Rodrigues CHM, Portelli S, Skwark MJ, Das M, Ascher DB, Blundell TL, Malhotra S. Computational saturation mutagenesis to predict structural consequences of systematic mutations in the beta subunit of RNA polymerase in Mycobacterium leprae. Comput Struct Biotechnol J 2020; 18:271-286. [PMID: 32042379 PMCID: PMC7000446 DOI: 10.1016/j.csbj.2020.01.002] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Received: 07/18/2019] [Revised: 01/03/2020] [Accepted: 01/07/2020] [Indexed: 11/26/2022] Open
Abstract
Rifampin resistance in leprosy may remain undetected due to the lack of rapid and effective diagnostic tools. A quick and reliable method is essential to determine the impacts of emerging detrimental mutations in the drug targets. The functional consequences of missense mutations in the β-subunit of RNA polymerase (RNAP) in Mycobacterium leprae (M. leprae) contribute to phenotypic resistance to rifampin in leprosy. Here, we report in-silico saturation mutagenesis of all residues in the β-subunit of RNAP to all other 19 amino acid types (generating 21,394 mutations for 1126 residues) and predict their impacts on overall thermodynamic stability, on interactions at subunit interfaces, and on β-subunit-RNA and rifampin affinities (only for the rifampin binding site) using state-of-the-art structure, sequence and normal mode analysis-based methods. Mutations in the conserved residues that line the active-site cleft show largely destabilizing effects, resulting in increased relative solvent accessibility and a concomitant decrease in residue-depth (the extent to which a residue is buried in the protein structure space) of the mutant residues. The mutations at residue positions S437, G459, H451, P489, K884 and H1035 are identified as extremely detrimental as they induce highly destabilizing effects on the overall protein stability, and nucleic acid and rifampin affinities. Destabilizing effects were predicted for all the clinically/experimentally identified rifampin-resistant mutations in M. leprae, indicating that this model can be used as a surveillance tool to monitor emerging detrimental mutations that destabilise RNAP-rifampin interactions and confer rifampin resistance in leprosy.
Author summary: The emergence of primary and secondary drug resistance to rifampin in leprosy is a growing concern and poses a threat to leprosy control and elimination measures globally. In the absence of an effective in-vitro system to detect and monitor phenotypic resistance to rifampin in leprosy, diagnosis mainly relies on the presence of mutations in drug resistance determining regions of the rpoB gene that encodes the β-subunit of RNAP in M. leprae. Few labs in the world perform mouse foot pad propagation of M. leprae in the presence of drugs (rifampin) to determine growth patterns and confirm resistance; however, these methods take 8 to 12 months, making them impractical for diagnosis. Understanding molecular mechanisms of drug resistance is vital to associating mutations with clinically detected drug resistance in leprosy. Here we propose an in-silico saturation mutagenesis approach to comprehensively elucidate the structural implications of any mutations that exist or that can arise in the β-subunit of RNAP in M. leprae. Most of the predicted mutations may not occur in M. leprae due to fitness costs, but the information generated by this approach helps decipher the impacts of mutations across the structure and conversely enables identification of stable regions in the protein that are least impacted by mutations (mutation coolspots), which can be a potential choice for small molecule binding and structure-guided drug discovery.
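The mutation count quoted in this abstract follows from substituting each of the 1126 residues with the 19 other amino acid types (1126 × 19 = 21,394). The sketch below shows that enumeration step only, as an illustration; the function name and the four-residue toy sequence are hypothetical, and no stability or affinity prediction is performed.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def saturation_mutations(sequence, start=1):
    """Yield every single-point substitution, e.g. 'M1A', for a protein sequence.

    `start` is the residue number assigned to the first character of `sequence`.
    """
    for offset, wild_type in enumerate(sequence):
        position = start + offset
        for mutant in AMINO_ACIDS:
            if mutant != wild_type:
                yield f"{wild_type}{position}{mutant}"


# Toy four-residue sequence, invented for this sketch.
toy_sequence = "MSHG"
mutations = list(saturation_mutations(toy_sequence))
print(len(mutations))  # 4 residues x 19 substitutions = 76
print(mutations[:3])
```

In a full analysis, each generated mutation string would then be passed to the chosen predictors; that step depends on the tools used and is not sketched here.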
Affiliation(s)
- Carlos H M Rodrigues
- Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC 3052, Australia; Structural Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia
- Stephanie Portelli
- Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC 3052, Australia; Structural Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia
- Marcin J Skwark
- Department of Biochemistry, University of Cambridge, Tennis Court Rd., CB2 1GA, UK
- Madhusmita Das
- Molecular Biology Laboratory, Schieffelin Institute of Health-Research and Leprosy Center, Karigiri, Vellore, Tamil Nadu 632106, India
- David B Ascher
- Department of Biochemistry, University of Cambridge, Tennis Court Rd., CB2 1GA, UK; Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC 3052, Australia; Structural Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia
- Tom L Blundell
- Department of Biochemistry, University of Cambridge, Tennis Court Rd., CB2 1GA, UK
- Sony Malhotra
- Department of Biochemistry, University of Cambridge, Tennis Court Rd., CB2 1GA, UK
|
31
|
Pandurangan AP, Blundell TL. Prediction of impacts of mutations on protein structure and interactions: SDM, a statistical approach, and mCSM, using machine learning. Protein Sci 2020; 29:247-257. [PMID: 31693276 PMCID: PMC6933854 DOI: 10.1002/pro.3774] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Received: 08/30/2019] [Revised: 10/31/2019] [Accepted: 10/31/2019] [Indexed: 02/02/2023]
Abstract
Next-generation sequencing methods have not only allowed an understanding of genome sequence variation during the evolution of organisms but have also provided invaluable information about genetic variants in inherited disease and the emergence of resistance to drugs in cancers and infectious disease. A challenge is to distinguish mutations that are drivers of disease or drug resistance, from passengers that are neutral or even selectively advantageous to the organism. This requires an understanding of impacts of missense mutations in gene expression and regulation, and on the disruption of protein function by modulating protein stability or disturbing interactions with proteins, nucleic acids, small molecule ligands, and other biological molecules. Experimental approaches to understanding differences between wild-type and mutant proteins are most accurate but are also time-consuming and costly. Computational tools used to predict the impacts of mutations can provide useful information more quickly. Here, we focus on two widely used structure-based approaches, originally developed in the Blundell lab: site-directed mutator (SDM), a statistical approach to analyze amino acid substitutions, and mutation cutoff scanning matrix (mCSM), which uses graph-based signatures to represent the wild-type structural environment and machine learning to predict the effect of mutations on protein stability. Here, we describe DUET that uses machine learning to combine the two approaches. We discuss briefly the development of mCSM for understanding the impacts of mutations on interfaces with other proteins, nucleic acids, and ligands, and we exemplify the wide application of these approaches to understand human genetic disorders and drug resistance mutations relevant to cancer and mycobacterial infections. 
STATEMENT FOR A BROADER AUDIENCE: Genetic or somatic changes in genes can lead to mutations in human proteins, which give rise to genetic disorders or cancer, or to genes of pathogens leading to drug resistance. Computer software described here, using statistical approaches or machine learning, uses the information from genome sequencing of humans and pathogens, together with experimental or modeled 3D structures of gene products, the proteins, to predict impacts of mutations in genetic disease, cancer and drug resistance.
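The abstract describes mCSM as using graph-based signatures of the wild-type structural environment. As a rough intuition for what a distance "cutoff scan" can look like, the sketch below counts labelled atom pairs that fall within a series of increasing distance cutoffs. This is a deliberately simplified illustration and not the published mCSM implementation: the function name, atom labels, coordinates, and cutoff values are all assumptions made up for the example.

```python
import math
from itertools import combinations


def cutoff_scan_signature(atoms, cutoffs):
    """Count labelled atom pairs within each distance cutoff (cumulative counts).

    `atoms` is a list of (label, (x, y, z)) tuples. The result maps
    (label_a, label_b, cutoff) to the number of pairs at or below that cutoff.
    """
    pairs = []
    for (label_a, xyz_a), (label_b, xyz_b) in combinations(atoms, 2):
        pair_labels = tuple(sorted((label_a, label_b)))
        pairs.append((pair_labels, math.dist(xyz_a, xyz_b)))

    signature = {}
    for cutoff in cutoffs:
        for pair_labels in sorted({labels for labels, _ in pairs}):
            count = sum(1 for labels, d in pairs if labels == pair_labels and d <= cutoff)
            signature[(pair_labels[0], pair_labels[1], cutoff)] = count
    return signature


# Hypothetical atoms around a residue: coarse labels and coordinates in angstroms.
toy_atoms = [
    ("hydrophobic", (0.0, 0.0, 0.0)),
    ("polar", (3.0, 0.0, 0.0)),
    ("hydrophobic", (0.0, 4.0, 0.0)),
]
for key, count in cutoff_scan_signature(toy_atoms, [3.5, 4.5, 5.5]).items():
    print(key, count)
```

The resulting counts form a fixed-length feature vector for a given set of labels and cutoffs, which is the kind of input a supervised machine learning model can be trained on.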
Affiliation(s)
- Arun Prasad Pandurangan
- Department of Biochemistry, University of Cambridge, Cambridge, UK
- MRC Laboratory of Molecular Biology, Cambridge, UK
- Tom L. Blundell
- Department of Biochemistry, University of Cambridge, Cambridge, UK
|
32
|
Rodrigues CHM, Myung Y, Pires DEV, Ascher DB. mCSM-PPI2: predicting the effects of mutations on protein-protein interactions. Nucleic Acids Res 2019; 47:W338-W344. [PMID: 31114883 PMCID: PMC6602427 DOI: 10.1093/nar/gkz383] [Citation(s) in RCA: 200] [Impact Index Per Article: 40.0] [Received: 02/11/2019] [Revised: 04/30/2019] [Accepted: 05/20/2019] [Indexed: 12/13/2022] Open
Abstract
Protein-protein interactions are involved in most fundamental biological processes, with disease-causing mutations enriched at their interfaces. Here we present mCSM-PPI2, a novel machine learning computational tool designed to more accurately predict the effects of missense mutations on protein-protein interaction binding affinity. mCSM-PPI2 uses graph-based structural signatures to model effects of variations on the inter-residue interaction network, evolutionary information, complex network metrics and energetic terms to generate an optimised predictor. We demonstrate that our method outperforms previous methods, ranking first among 26 others on CAPRI blind tests. mCSM-PPI2 is freely available as a user-friendly web server at http://biosig.unimelb.edu.au/mcsm_ppi2/.
Affiliation(s)
- Carlos H M Rodrigues
- Department of Biochemistry and Molecular Biology, University of Melbourne, Melbourne, Australia
- ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, Australia
- Structural Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, Australia
- Yoochan Myung
- Department of Biochemistry and Molecular Biology, University of Melbourne, Melbourne, Australia
- ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, Australia
- Structural Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, Australia
- Douglas E V Pires
- Department of Biochemistry and Molecular Biology, University of Melbourne, Melbourne, Australia
- ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, Australia
- Structural Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, Australia
- David B Ascher
- Department of Biochemistry and Molecular Biology, University of Melbourne, Melbourne, Australia
- ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, Australia
- Structural Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, Australia
- Department of Biochemistry, University of Cambridge, Cambridge, UK
|
33
|
Empirical ways to identify novel Bedaquiline resistance mutations in AtpE. PLoS One 2019; 14:e0217169. [PMID: 31141524 PMCID: PMC6541270 DOI: 10.1371/journal.pone.0217169] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Received: 01/31/2019] [Accepted: 05/01/2019] [Indexed: 12/28/2022] Open
Abstract
Clinical resistance against Bedaquiline, the first new anti-tuberculosis compound with a novel mechanism of action in over 40 years, has already been detected in Mycobacterium tuberculosis. As a new drug, however, there is currently insufficient clinical data to facilitate reliable and timely identification of genomic determinants of resistance. Here we investigate the structural basis for M. tuberculosis associated bedaquiline resistance in the drug target, AtpE. Together with the 9 previously identified resistance-associated variants in AtpE, 54 non-resistance-associated mutations were identified through comparisons of bedaquiline susceptibility across 23 different mycobacterial species. Computational analysis of the structural and functional consequences of these variants revealed that resistance associated variants were mainly localized at the drug binding site, disrupting key interactions with bedaquiline leading to reduced binding affinity. This was used to train a supervised predictive algorithm, which accurately identified likely resistance mutations (93.3% accuracy). Application of this model to circulating variants present in the Asia-Pacific region suggests that current circulating variants are likely to be susceptible to bedaquiline. We have made this model freely available through a user-friendly web interface called SUSPECT-BDQ, StrUctural Susceptibility PrEdiCTion for bedaquiline (http://biosig.unimelb.edu.au/suspect_bdq/). This tool could be useful for the rapid characterization of novel clinical variants, to help guide the effective use of bedaquiline, and to minimize the spread of clinical resistance.
|
34
|
Synthesis and Structure-Activity relationship of 1-(5-isoquinolinesulfonyl)piperazine analogues as inhibitors of Mycobacterium tuberculosis IMPDH. Eur J Med Chem 2019; 174:309-329. [PMID: 31055147 PMCID: PMC6990405 DOI: 10.1016/j.ejmech.2019.04.027] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Received: 12/28/2018] [Revised: 04/11/2019] [Accepted: 04/11/2019] [Indexed: 02/06/2023]
Abstract
Tuberculosis (TB) is a major infectious disease associated increasingly with drug resistance. Thus, new anti-tubercular agents with novel mechanisms of action are urgently required for the treatment of drug-resistant TB. In prior work, we identified compound 1 (cyclohexyl(4-(isoquinolin-5-ylsulfonyl)piperazin-1-yl)methanone) and showed that its anti-tubercular activity is attributable to inhibition of inosine-5′-monophosphate dehydrogenase (IMPDH) in Mycobacterium tuberculosis. In the present study, we explored the structure–activity relationship around compound 1 by synthesizing and evaluating the inhibitory activity of analogues against M. tuberculosis IMPDH in biochemical and whole-cell assays. X-ray crystallography was performed to elucidate the mode of binding of selected analogues to IMPDH. We establish the importance of the cyclohexyl, piperazine and isoquinoline rings for activity, and report the identification of an analogue with IMPDH-selective activity against a mutant of M. tuberculosis that is highly resistant to compound 1. We also show that the nitrogen in urea analogues is required for anti-tubercular activity and identify benzylurea derivatives as promising inhibitors that warrant further investigation. Forty-eight analogues of 1-(5-isoquinolinesulfonyl)piperazine were synthesized. Biochemical, whole-cell, and X-ray studies were performed to elucidate the IMPDH inhibition. Piperazine and isoquinoline rings were essential for target-selective whole-cell activity. Compound 47 showed improved IC50 against the MtbIMPDH and maintained on-target whole-cell activity. Compound 21 showed activity against IMPDH in both wild type M. tuberculosis and a resistant mutant of compound 1.
35
Waman VP, Vedithi SC, Thomas SE, Bannerman BP, Munir A, Skwark MJ, Malhotra S, Blundell TL. Mycobacterial genomics and structural bioinformatics: opportunities and challenges in drug discovery. Emerg Microbes Infect 2019; 8:109-118. [PMID: 30866765 PMCID: PMC6334779 DOI: 10.1080/22221751.2018.1561158] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Received: 10/15/2018] [Revised: 12/03/2018] [Accepted: 12/09/2018] [Indexed: 01/08/2023]
Abstract
Of the more than 190 distinct species of the genus Mycobacterium, many are economically and clinically important pathogens of humans or animals. Among the mycobacteria that infect humans, three species, namely Mycobacterium tuberculosis (causative agent of tuberculosis), Mycobacterium leprae (causative agent of leprosy) and Mycobacterium abscessus (causative agent of chronic pulmonary infections), pose a concern to global public health. Although antibiotics have been successfully developed to combat each of these, the emergence of drug-resistant strains is an increasing challenge for treatment and drug discovery. Here we describe the impact of the rapid expansion of genome sequencing and genome/pathway annotations, which have greatly accelerated structure-guided drug discovery. We focus on the applications of comparative genomics, metabolomics, evolutionary bioinformatics and structural proteomics to identify potential drug targets. The opportunities and challenges for the design of drugs for M. tuberculosis, M. leprae and M. abscessus to combat resistance are discussed.
Affiliation(s)
- Asma Munir
- Department of Biochemistry, University of Cambridge, Cambridge, UK
- Marcin J. Skwark
- Department of Biochemistry, University of Cambridge, Cambridge, UK
- Sony Malhotra
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, University of London, London, UK
- Tom L. Blundell
- Department of Biochemistry, University of Cambridge, Cambridge, UK
36
Pires DEV, Rodrigues CHM, Albanaz ATS, Karmakar M, Myung Y, Xavier J, Michanetzi EM, Portelli S, Ascher DB. Exploring Protein Supersecondary Structure Through Changes in Protein Folding, Stability, and Flexibility. Methods Mol Biol 2019; 1958:173-185. [PMID: 30945219 DOI: 10.1007/978-1-4939-9161-7_9] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Indexed: 12/29/2022]
Abstract
The ability to predict how mutations affect protein structure, folding, and flexibility can elucidate the molecular mechanisms leading to disruption of supersecondary structures and the emergence of phenotypes, as well as guiding rational protein engineering. The advent of fast and accurate computational tools has enabled us to comprehensively explore the landscape of mutation effects on protein structures, prioritizing mutations for rational experimental validation. Here we describe the use of two complementary web-based in silico methods, DUET and DynaMut, developed to infer the effects of mutations on folding, stability, and flexibility, and how they can be used to explore and interpret these effects on protein supersecondary structures.
Affiliation(s)
- Douglas E V Pires
- Instituto René Rachou, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil; Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Melbourne, VIC, Australia
- Carlos H M Rodrigues
- Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Melbourne, VIC, Australia
- Malancha Karmakar
- Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Melbourne, VIC, Australia
- Yoochan Myung
- Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Melbourne, VIC, Australia
- Joicymara Xavier
- Instituto René Rachou, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil
- Eleni-Maria Michanetzi
- Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Melbourne, VIC, Australia
- Stephanie Portelli
- Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Melbourne, VIC, Australia
- David B Ascher
- Instituto René Rachou, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil; Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Melbourne, VIC, Australia; Department of Biochemistry, University of Cambridge, Cambridge, UK
37
Rodrigues CHM, Ascher DB, Pires DEV. Kinact: a computational approach for predicting activating missense mutations in protein kinases. Nucleic Acids Res 2018; 46:W127-W132. [PMID: 29788456 PMCID: PMC6031004 DOI: 10.1093/nar/gky375] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Received: 01/31/2018] [Revised: 04/15/2018] [Accepted: 04/28/2018] [Indexed: 12/31/2022]
Abstract
Protein phosphorylation is tightly regulated due to its vital role in many cellular processes. While gain of function mutations leading to constitutive activation of protein kinases are known to be driver events of many cancers, the identification of these mutations has proven challenging. Here we present Kinact, a novel machine learning approach for predicting kinase activating missense mutations using information from sequence and structure. By adapting our graph-based signatures, Kinact represents both structural and sequence information, which are used as evidence to train predictive models. We show the combination of structural and sequence features significantly improved the overall accuracy compared to considering either primary or tertiary structure alone, highlighting their complementarity. Kinact achieved a precision of 87% and 94% and Area Under ROC Curve of 0.89 and 0.92 on 10-fold cross-validation, and on blind tests, respectively, outperforming well established tools (P < 0.01). We further show that Kinact performs equally well on homology models built using templates with sequence identity as low as 33%. Kinact is freely available as a user-friendly web server at http://biosig.unimelb.edu.au/kinact/.
Affiliation(s)
- Carlos HM Rodrigues
- Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne
- David B Ascher
- Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne
- Department of Biochemistry, University of Cambridge
- Instituto René Rachou, Fundação Oswaldo Cruz