1
|
AlJarf R, Rodrigues CHM, Myung Y, Pires DEV, Ascher DB. piscesCSM: prediction of anticancer synergistic drug combinations. J Cheminform 2024; 16:81. [PMID: 39030592 PMCID: PMC11264925 DOI: 10.1186/s13321-024-00859-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2023] [Accepted: 05/12/2024] [Indexed: 07/21/2024] Open
Abstract
While drug combination therapies are of great importance, particularly in cancer treatment, identifying novel synergistic drug combinations has been a challenging venture. Computational methods have emerged in this context as a promising tool for prioritizing drug combinations for further evaluation, though they have presented limited performance, utility, and interpretability. Here, we propose a novel predictive tool, piscesCSM, that leverages graph-based representations to model small molecule chemical structures to accurately predict drug combinations with favourable anticancer synergistic effects against one or multiple cancer cell lines. Leveraging these insights, we developed a general supervised machine learning model to guide the prediction of anticancer synergistic drug combinations in over 30 cell lines. It achieved an area under the receiver operating characteristic curve (AUROC) of up to 0.89 on independent non-redundant blind tests, outperforming state-of-the-art approaches on both large-scale oncology screening data and an independent test set generated by AstraZeneca (with more than a 16% improvement in predictive accuracy). Moreover, by exploring the interpretability of our approach, we found that simple physicochemical properties and graph-based signatures are predictive of chemotherapy synergism. To provide a simple and integrated platform to rapidly screen potential candidate pairs with favourable synergistic anticancer effects, we made piscesCSM freely available online at https://biosig.lab.uq.edu.au/piscescsm/ as a web server and API. We believe that our predictive tool will provide a valuable resource for optimizing and augmenting combinatorial screening libraries to identify effective and safe synergistic anticancer drug combinations. SCIENTIFIC CONTRIBUTION: This work proposes piscesCSM, a machine-learning-based framework that relies on well-established graph-based representations of small molecules to identify and provide better predictive accuracy of syngenetic drug combinations. Our model, piscesCSM, shows that combining physiochemical properties with graph-based signatures can outperform current architectures on classification prediction tasks. Furthermore, implementing our tool as a web server offers a user-friendly platform for researchers to screen for potential synergistic drug combinations with favorable anticancer effects against one or multiple cancer cell lines.
Collapse
Affiliation(s)
- Raghad AlJarf
- Structural Biology and Bioinformatics, Department of Biochemistry and Pharmacology, University of Melbourne, Melbourne, VIC, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, VIC, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia
| | - Carlos H M Rodrigues
- Structural Biology and Bioinformatics, Department of Biochemistry and Pharmacology, University of Melbourne, Melbourne, VIC, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, VIC, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, QLD, Australia
| | - Yoochan Myung
- Structural Biology and Bioinformatics, Department of Biochemistry and Pharmacology, University of Melbourne, Melbourne, VIC, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, VIC, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, QLD, Australia
| | - Douglas E V Pires
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, VIC, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia
- School of Computing and Information Systems, University of Melbourne, Melbourne, VIC, Australia
| | - David B Ascher
- Structural Biology and Bioinformatics, Department of Biochemistry and Pharmacology, University of Melbourne, Melbourne, VIC, Australia.
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, VIC, Australia.
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia.
- School of Chemistry and Molecular Biosciences, University of Queensland, Brisbane, QLD, Australia.
| |
Collapse
|
2
|
Akula S, Mullaguri SC, Melton NM, Katta A, Naga VSGR, Kandula S, Pedada RK, Subramanian J, Kancha RK. Large-scale pathogenicity prediction analysis of cancer-associated kinase mutations reveals variability in sensitivity and specificity of computational methods. Cancer Med 2023; 12:17468-17474. [PMID: 37409618 PMCID: PMC10501281 DOI: 10.1002/cam4.6324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 05/26/2023] [Accepted: 06/27/2023] [Indexed: 07/07/2023] Open
Abstract
BACKGROUND Mutations in kinases are the most frequent genetic alterations in cancer; however, experimental evidence establishing their cancerous nature is available only for a small fraction of these mutants. AIMS Predicition analysis of kinome mutations is the primary aim of this study. Further objective is to compare the performance of various softwares in pathogenicity prediction of kinase mutations. MATERIALS AND METHODS We employed a set of computational tools to predict the pathogenicity of over forty-two thousand mutations and deposited the kinase-wise data in Mendeley database (Estimated Pathogenicity of Kinase Mutants [EPKiMu]). RESULTS Mutations are more likely to be drivers when being present in the kinase domain (vs. non-kinase domain) and belonging to hotspot residues (vs. non-hotspot residues). We identified that, while predictive tools have low specificity in general, PolyPhen-2 had the best accuracy. Further efforts to combine all four tools by consensus, voting, or other simple methods did not significantly improve accuracy. DISCUSSION The study provides a large dataset of kinase mutations along with their predicted pathogenicity that can be used as a training set for future studies. Furthermore, a comparative sensitivity and selectivity of commonly used computational tools is presented. CONCLUSION Primary-structure-based in silico tools identified more cancerous/deleterious mutations in the kinase domains and at the hot spot residues while having higher sensitivity than specificity in detecting deleterious mutations.
Collapse
Affiliation(s)
- Sravani Akula
- Molecular Medicine and Therapeutics Laboratory, CPMBOsmania UniversityHyderabadIndia
| | | | - Niklas Max Melton
- Thoracic Oncology, Inova Schar Cancer InstituteFairfaxVirginiaUSA
- Applied Computational Intelligence LabMissouri University of Science and TechnologyRollaMissouriUSA
| | - Archana Katta
- Molecular Medicine and Therapeutics Laboratory, CPMBOsmania UniversityHyderabadIndia
| | | | - Shyamson Kandula
- Molecular Medicine and Therapeutics Laboratory, CPMBOsmania UniversityHyderabadIndia
| | - Raj Kumar Pedada
- Molecular Medicine and Therapeutics Laboratory, CPMBOsmania UniversityHyderabadIndia
| | | | - Rama Krishna Kancha
- Molecular Medicine and Therapeutics Laboratory, CPMBOsmania UniversityHyderabadIndia
| |
Collapse
|
3
|
GÖKER BAGCA B, GÖDE S, TURHAL G, ÖZATEŞ NP, VERAL A, GÜNDÜZ C, AVCI ÇB. Nadir paranazal sinüs kanserlerinde yeni tanımlanan reseptör tirozin kinaz mutasyonları ve potansiyel fonksiyonel etkileri. EGE TIP DERGISI 2023. [DOI: 10.19161/etd.1262612] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/11/2023] Open
Abstract
Amaç: Paranazal sinüs kanserleri oldukça nadir görülen heterojen bir hastalık grubudur. Maksiler sinüs skuamoz hücreli karsinomu, paranazal sinüs kanserlerinin anatomik ve histolojik olarak en
yaygın alt tipidir. Bu kanserin genetik profiline dair bilginin sınırlı olması, hastaların hedefli tedavi seçeneklerinden yararlanamamasına neden olmaktadır. Çalışmamızda bu nadir kanserdeki reseptör tirozin kinaz mutasyonlarının tanımlanması ve mutasyonların olası fonksiyonel etkilerinin tahmin edilmesi amaçlanmıştır.
Gereç ve Yöntem: Bu amaçla 30 olgunun tümörüne ait FFPE dokulardan DNA izolasyonu gerçekleştirildi, olguların mutasyon profili yeni nesil sekanslama yöntemi ve biyoinformatik
değerlendirme ile belirlendi. Belirlenen patojenik/ olası patojenik varyantların fonksiyonel etkileri farklı in silico araçlar yardımıyla tahminlendi.
Bulgular: Olgularının tamamında en az bir adet patojenik/olası patojenik KIT, PDFGRA ve RETmutasyonu belirlendi. KIT geninin katalitik bölgesindeki mutasyonların kinaz aktivitesini arttıracağı
tahmin edildi. PDFGRA genindeki p.P567P ve p.D1074D mutasyonları, 30 olgunun tamamında ve SRA veritabanından elde edilen normal dokulara ait okumaların tümünde belirlendi.
Sonuç: Reseptör tirozin kinaz mutasyonlarının paranazal sinüs kanserlerinde de önemli rol oynayabileceğinin belirlenmiş olması özellikle artmış kinaz aktivitesini hedefleyen tedavi yaklaşımlarını
bu olguların erişimine sunma potansiyeli taşıması bakımından oldukça önemlidir.
Collapse
Affiliation(s)
- Bakiye GÖKER BAGCA
- Aydın Adnan Menderes Üniversitesi, Tıp Fakültesi, Tıbbi Biyoloji Anabilim Dalı, Aydın, Türkiye
| | - Sercan GÖDE
- Ege Üniversitesi, Tıp Fakültesi, Kulak Burun Boğaz Anabilim Dalı, İzmir, Türkiye
| | - Göksel TURHAL
- Ege Üniversitesi, Tıp Fakültesi, Kulak Burun Boğaz Anabilim Dalı, İzmir, Türkiye
| | - Neslihan Pınar ÖZATEŞ
- Harran Üniversitesi, Tıp Fakültesi, Tıbbi Biyoloji Anabilim Dalı, Şanlıurfa, Türkiye
| | - Ali VERAL
- Ege Üniversitesi, Tıp Fakültesi, Patoloji Anabilim Dalı, İzmir, Türkiye
| | - Cumhur GÜNDÜZ
- Ege Üniversitesi, Tıp Fakültesi, Tıbbi Biyoloji Anabilim Dalı, İzmir, Türkiye
| | - Çığır Biray AVCI
- Ege Üniversitesi, Tıp Fakültesi, Tıbbi Biyoloji Anabilim Dalı, İzmir, Türkiye
| |
Collapse
|
4
|
Carrillo-Rodriguez P, Selheim F, Hernandez-Valladares M. Mass Spectrometry-Based Proteomics Workflows in Cancer Research: The Relevance of Choosing the Right Steps. Cancers (Basel) 2023; 15:cancers15020555. [PMID: 36672506 PMCID: PMC9856946 DOI: 10.3390/cancers15020555] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Accepted: 01/12/2023] [Indexed: 01/19/2023] Open
Abstract
The qualitative and quantitative evaluation of proteome changes that condition cancer development can be achieved with liquid chromatography-mass spectrometry (LC-MS). LC-MS-based proteomics strategies are carried out according to predesigned workflows that comprise several steps such as sample selection, sample processing including labeling, MS acquisition methods, statistical treatment, and bioinformatics to understand the biological meaning of the findings and set predictive classifiers. As the choice of best options might not be straightforward, we herein review and assess past and current proteomics approaches for the discovery of new cancer biomarkers. Moreover, we review major bioinformatics tools for interpreting and visualizing proteomics results and suggest the most popular machine learning techniques for the selection of predictive biomarkers. Finally, we consider the approximation of proteomics strategies for clinical diagnosis and prognosis by discussing current barriers and proposals to circumvent them.
Collapse
Affiliation(s)
- Paula Carrillo-Rodriguez
- Proteomics Unit of University of Bergen (PROBE), University of Bergen, Jonas Lies vei 91, 5009 Bergen, Norway
- Vall d'Hebron Institute of Oncology (VHIO), 08035 Barcelona, Spain
| | - Frode Selheim
- Proteomics Unit of University of Bergen (PROBE), University of Bergen, Jonas Lies vei 91, 5009 Bergen, Norway
| | - Maria Hernandez-Valladares
- Proteomics Unit of University of Bergen (PROBE), University of Bergen, Jonas Lies vei 91, 5009 Bergen, Norway
- Department of Physical Chemistry, University of Granada, Avenida de la Fuente Nueva S/N, 18071 Granada, Spain
- Instituto de Investigación Biosanitaria ibs.GRANADA, 18012 Granada, Spain
| |
Collapse
|
5
|
Ascher DB, Kaminskas LM, Myung Y, Pires DEV. Using Graph-Based Signatures to Guide Rational Antibody Engineering. Methods Mol Biol 2023; 2552:375-397. [PMID: 36346604 DOI: 10.1007/978-1-0716-2609-2_21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Antibodies are essential experimental and diagnostic tools and as biotherapeutics have significantly advanced our ability to treat a range of diseases. With recent innovations in computational tools to guide protein engineering, we can now rationally design better antibodies with improved efficacy, stability, and pharmacokinetics. Here, we describe the use of the mCSM web-based in silico suite, which uses graph-based signatures to rapidly identify the structural and functional consequences of mutations, to guide rational antibody engineering to improve stability, affinity, and specificity.
Collapse
Affiliation(s)
- David B Ascher
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia
- Department of Biochemistry, Cambridge University, Cambridge, UK
- School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia, Queensland, Australia
| | - Lisa M Kaminskas
- School of Biological Sciences, University of Queensland, St Lucia, QLD, Australia
| | - Yoochan Myung
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC, Australia
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia
- School of Chemistry and Molecular Biosciences, University of Queensland, St Lucia, Queensland, Australia
| | - Douglas E V Pires
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC, Australia.
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC, Australia.
- School of Computing and Information Systems, University of Melbourne, Parkville, VIC, Australia.
| |
Collapse
|
6
|
Unni P, Friend J, Weinberg J, Okur V, Hochscherf J, Dominguez I. Predictive functional, statistical and structural analysis of CSNK2A1 and CSNK2B variants linked to neurodevelopmental diseases. Front Mol Biosci 2022; 9:851547. [PMID: 36310603 PMCID: PMC9608649 DOI: 10.3389/fmolb.2022.851547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2022] [Accepted: 06/29/2022] [Indexed: 12/02/2022] Open
Abstract
Okur-Chung Neurodevelopmental Syndrome (OCNDS) and Poirier-Bienvenu Neurodevelopmental Syndrome (POBINDS) were recently identified as rare neurodevelopmental disorders. OCNDS and POBINDS are associated with heterozygous mutations in the CSNK2A1 and CSNK2B genes which encode CK2α, a serine/threonine protein kinase, and CK2β, a regulatory protein, respectively, which together can form a tetrameric enzyme called protein kinase CK2. A challenge in OCNDS and POBINDS is to understand the genetic basis of these diseases and the effect of the various CK2⍺ and CK2β mutations. In this study we have collected all variants available to date in CSNK2A1 and CSNK2B, and identified hotspots. We have investigated CK2⍺ and CK2β missense mutations through prediction programs which consider the evolutionary conservation, functionality and structure or these two proteins, compared these results with published experimental data on CK2α and CK2β mutants, and suggested prediction programs that could help predict changes in functionality of CK2α mutants. We also investigated the potential effect of CK2α and CK2β mutations on the 3D structure of the proteins and in their binding to each other. These results indicate that there are functional and structural consequences of mutation of CK2α and CK2β, and provide a rationale for further study of OCNDS and POBINDS-associated mutations. These data contribute to understanding the genetic and functional basis of these diseases, which is needed to identify their underlying mechanisms.
Collapse
Affiliation(s)
- Prasida Unni
- Department of Medicine, Boston University School of Medicine and Boston Medical Center, Boston University, Boston, MA, United States
| | - Jack Friend
- Department of Medicine, Boston University School of Medicine and Boston Medical Center, Boston University, Boston, MA, United States
| | - Janice Weinberg
- Department of Biostatistics, Boston University School of Public Health, Boston University, Boston, MA, United States
| | - Volkan Okur
- New York Genome Center, New York, NY, United States
| | - Jennifer Hochscherf
- Department of Chemistry, Institute of Biochemistry, University of Cologne, Cologne, Germany
| | - Isabel Dominguez
- Department of Medicine, Boston University School of Medicine and Boston Medical Center, Boston University, Boston, MA, United States
- *Correspondence: Isabel Dominguez,
| |
Collapse
|
7
|
Nussinov R, Zhang M, Maloney R, Liu Y, Tsai CJ, Jang H. Allostery: Allosteric Cancer Drivers and Innovative Allosteric Drugs. J Mol Biol 2022; 434:167569. [PMID: 35378118 PMCID: PMC9398924 DOI: 10.1016/j.jmb.2022.167569] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Revised: 03/11/2022] [Accepted: 03/25/2022] [Indexed: 01/12/2023]
Abstract
Here, we discuss the principles of allosteric activating mutations, propagation downstream of the signals that they prompt, and allosteric drugs, with examples from the Ras signaling network. We focus on Abl kinase where mutations shift the landscape toward the active, imatinib binding-incompetent conformation, likely resulting in the high affinity ATP outcompeting drug binding. Recent pharmacological innovation extends to allosteric inhibitor (GNF-5)-linked PROTAC, targeting Bcr-Abl1 myristoylation site, and broadly, allosteric heterobifunctional degraders that destroy targets, rather than inhibiting them. Designed chemical linkers in bifunctional degraders can connect the allosteric ligand that binds the target protein and the E3 ubiquitin ligase warhead anchor. The physical properties and favored conformational state of the engineered linker can precisely coordinate the distance and orientation between the target and the recruited E3. Allosteric PROTACs, noncompetitive molecular glues, and bitopic ligands, with covalent links of allosteric ligands and orthosteric warheads, increase the effective local concentration of productively oriented and placed ligands. Through covalent chemical or peptide linkers, allosteric drugs can collaborate with competitive drugs, degrader anchors, or other molecules of choice, driving innovative drug discovery.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA; Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel.
| | - Mingzhen Zhang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA
| | - Ryan Maloney
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA
| | - Yonglan Liu
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA
| | - Chung-Jung Tsai
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA
| | - Hyunbum Jang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA
| |
Collapse
|
8
|
Rezende PM, Xavier JS, Ascher DB, Fernandes GR, Pires DEV. Evaluating hierarchical machine learning approaches to classify biological databases. Brief Bioinform 2022; 23:6611916. [PMID: 35724625 PMCID: PMC9310517 DOI: 10.1093/bib/bbac216] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2021] [Revised: 04/29/2022] [Accepted: 05/09/2022] [Indexed: 12/04/2022] Open
Abstract
The rate of biological data generation has increased dramatically in recent years, which has driven the importance of databases as a resource to guide innovation and the generation of biological insights. Given the complexity and scale of these databases, automatic data classification is often required. Biological data sets are often hierarchical in nature, with varying degrees of complexity, imposing different challenges to train, test and validate accurate and generalizable classification models. While some approaches to classify hierarchical data have been proposed, no guidelines regarding their utility, applicability and limitations have been explored or implemented. These include ‘Local’ approaches considering the hierarchy, building models per level or node, and ‘Global’ hierarchical classification, using a flat classification approach. To fill this gap, here we have systematically contrasted the performance of ‘Local per Level’ and ‘Local per Node’ approaches with a ‘Global’ approach applied to two different hierarchical datasets: BioLip and CATH. The results show how different components of hierarchical data sets, such as variation coefficient and prediction by depth, can guide the choice of appropriate classification schemes. Finally, we provide guidelines to support this process when embarking on a hierarchical classification task, which will help optimize computational resources and predictive performance.
Collapse
Affiliation(s)
- Pâmela M Rezende
- Universidade Federal de Minas Gerais.,Instituto René Rachou, Fundação Oswaldo Cruz.,Stilingue Inteligência Artificial
| | - Joicymara S Xavier
- Universidade Federal de Minas Gerais.,Instituto René Rachou, Fundação Oswaldo Cruz.,Institute of Agricultural Sciences, Universidade Federal dos Vales do Jequitinhonha e Mucuri
| | - David B Ascher
- School of Chemistry and Molecular Biosciences, University of Queensland.,Systems and Computational Biology, Bio 21 Institute, University of Melbourne.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute
| | | | - Douglas E V Pires
- Systems and Computational Biology, Bio 21 Institute, University of Melbourne.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute.,School of Computing and Information Systems, University of Melbourne
| |
Collapse
|
9
|
da Silva BM, Myung Y, Ascher DB, Pires DEV. epitope3D: a machine learning method for conformational B-cell epitope prediction. Brief Bioinform 2021; 23:6407730. [PMID: 34676398 DOI: 10.1093/bib/bbab423] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 08/25/2021] [Accepted: 09/14/2021] [Indexed: 11/13/2022] Open
Abstract
The ability to identify antigenic determinants of pathogens, or epitopes, is fundamental to guide rational vaccine development and immunotherapies, which are particularly relevant for rapid pandemic response. A range of computational tools has been developed over the past two decades to assist in epitope prediction; however, they have presented limited performance and generalization, particularly for the identification of conformational B-cell epitopes. Here, we present epitope3D, a novel scalable machine learning method capable of accurately identifying conformational epitopes trained and evaluated on the largest curated epitope data set to date. Our method uses the concept of graph-based signatures to model epitope and non-epitope regions as graphs and extract distance patterns that are used as evidence to train and test predictive models. We show epitope3D outperforms available alternative approaches, achieving Mathew's Correlation Coefficient and F1-scores of 0.55 and 0.57 on cross-validation and 0.45 and 0.36 during independent blind tests, respectively.
Collapse
Affiliation(s)
- Bruna Moreira da Silva
- Structural Biology and Bioinformatics, Department of Biochemistry, University of Melbourne, Melbourne, Victoria, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.,School of Computing and Information Systems, University of Melbourne, Melbourne, Victoria, Australia
| | - YooChan Myung
- Structural Biology and Bioinformatics, Department of Biochemistry, University of Melbourne, Melbourne, Victoria, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.,Baker Department of Cardiometabolic Health, University of Melbourne, Melbourne, Victoria, Australia
| | - David B Ascher
- Structural Biology and Bioinformatics, Department of Biochemistry, University of Melbourne, Melbourne, Victoria, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.,Baker Department of Cardiometabolic Health, University of Melbourne, Melbourne, Victoria, Australia.,Department of Biochemistry, University of Cambridge, 80 Tennis Ct Rd, Cambridge CB2 1GA, UK
| | - Douglas E V Pires
- Structural Biology and Bioinformatics, Department of Biochemistry, University of Melbourne, Melbourne, Victoria, Australia.,Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.,School of Computing and Information Systems, University of Melbourne, Melbourne, Victoria, Australia
| |
Collapse
|
10
|
Rodrigues CHM, Pires DEV, Ascher DB. mmCSM-PPI: predicting the effects of multiple point mutations on protein-protein interactions. Nucleic Acids Res 2021; 49:W417-W424. [PMID: 33893812 PMCID: PMC8262703 DOI: 10.1093/nar/gkab273] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2021] [Revised: 03/18/2021] [Accepted: 04/15/2021] [Indexed: 11/16/2022] Open
Abstract
Protein-protein interactions play a crucial role in all cellular functions and biological processes and mutations leading to their disruption are enriched in many diseases. While a number of computational methods to assess the effects of variants on protein-protein binding affinity have been proposed, they are in general limited to the analysis of single point mutations and have been shown to perform poorly on independent test sets. Here, we present mmCSM-PPI, a scalable and effective machine learning model for accurately assessing changes in protein-protein binding affinity caused by single and multiple missense mutations. We expanded our well-established graph-based signatures in order to capture physicochemical and geometrical properties of multiple wild-type residue environments and integrated them with substitution scores and dynamics terms from normal mode analysis. mmCSM-PPI was able to achieve a Pearson's correlation of up to 0.75 (RMSE = 1.64 kcal/mol) under 10-fold cross-validation and 0.70 (RMSE = 2.06 kcal/mol) on a non-redundant blind test, outperforming existing methods. Our method is freely available as a user-friendly and easy-to-use web server and API at http://biosig.unimelb.edu.au/mmcsm_ppi.
Collapse
Affiliation(s)
- Carlos H M Rodrigues
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Pharmacology, University of Melbourne, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
| | - Douglas E V Pires
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Pharmacology, University of Melbourne, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- School of Computing and Information Systems, University of Melbourne, Melbourne, Victoria, Australia
| | - David B Ascher
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Pharmacology, University of Melbourne, Melbourne, Victoria, Australia
- Systems and Computational Biology, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| |
Collapse
|
11
|
Vanova V, Mitrevska K, Milosavljevic V, Hynek D, Richtera L, Adam V. Peptide-based electrochemical biosensors utilized for protein detection. Biosens Bioelectron 2021; 180:113087. [PMID: 33662844 DOI: 10.1016/j.bios.2021.113087] [Citation(s) in RCA: 49] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2020] [Revised: 02/04/2021] [Accepted: 02/10/2021] [Indexed: 02/07/2023]
Abstract
Proteins are generally detected as biomarkers for tracing or determining various disorders in organisms. Biomarker proteins can be tracked in samples with various origins and in different concentrations, revealing whether an organism is in a healthy or unhealthy state. In regard to detection, electrochemical biosensors are a potential fusion of electronics, chemistry, and biology, allowing for fast and early point-of-care detection from a biological sample with the advantages of high sensitivity, simple construction, and easy operation. Peptides present a promising approach as a biorecognition element when connected with electrochemical biosensors. The benefits of short peptides lie mainly in their good stability and selective affinity to a target analyte. Therefore, peptide-based electrochemical biosensors (PBEBs) represent an alternative approach for the detection of different protein biomarkers. This review provides a summary of the past decade of recently proposed PBEBs designed for protein detection, dividing them according to different protein types: (i) enzyme detection, including proteases and kinases; (ii) antibody detection; and (iii) other protein detection. According to these protein types, different sensing mechanisms are discussed, such as the peptide cleavage by a proteases, phosphorylation by kinases, presence of antibodies, and exploiting of affinities; furthermore, measurements are obtained by different electrochemical methods. A discussion and comparison of various constructions, modifications, immobilization strategies and different sensing techniques in terms of high sensitivity, selectivity, repeatability, and potential for practical application are presented.
Collapse
Affiliation(s)
- Veronika Vanova
- Department of Chemistry and Biochemistry, Mendel University in Brno, Zemedelska 1, 613 00, Brno, Czech Republic
| | - Katerina Mitrevska
- Department of Chemistry and Biochemistry, Mendel University in Brno, Zemedelska 1, 613 00, Brno, Czech Republic
| | - Vedran Milosavljevic
- Department of Chemistry and Biochemistry, Mendel University in Brno, Zemedelska 1, 613 00, Brno, Czech Republic; Central European Institute of Technology, Brno University of Technology, Purkynova 123, 61 200, Brno, Czech Republic
| | - David Hynek
- Department of Chemistry and Biochemistry, Mendel University in Brno, Zemedelska 1, 613 00, Brno, Czech Republic; Central European Institute of Technology, Brno University of Technology, Purkynova 123, 61 200, Brno, Czech Republic
| | - Lukas Richtera
- Department of Chemistry and Biochemistry, Mendel University in Brno, Zemedelska 1, 613 00, Brno, Czech Republic; Central European Institute of Technology, Brno University of Technology, Purkynova 123, 61 200, Brno, Czech Republic
| | - Vojtech Adam
- Department of Chemistry and Biochemistry, Mendel University in Brno, Zemedelska 1, 613 00, Brno, Czech Republic; Central European Institute of Technology, Brno University of Technology, Purkynova 123, 61 200, Brno, Czech Republic.
| |
Collapse
|
12
|
Sarkar A, Yang Y, Vihinen M. Variation benchmark datasets: update, criteria, quality and applications. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020; 2020:5710862. [PMID: 32016318 PMCID: PMC6997940 DOI: 10.1093/database/baz117] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Revised: 06/03/2019] [Accepted: 07/01/2019] [Indexed: 02/07/2023]
Abstract
Development of new computational methods and testing their performance has to be carried out using experimental data. Only in comparison to existing knowledge can method performance be assessed. For that purpose, benchmark datasets with known and verified outcome are needed. High-quality benchmark datasets are valuable and may be difficult, laborious and time consuming to generate. VariBench and VariSNP are the two existing databases for sharing variation benchmark datasets used mainly for variation interpretation. They have been used for training and benchmarking predictors for various types of variations and their effects. VariBench was updated with 419 new datasets from 109 papers containing altogether 329 014 152 variants; however, there is plenty of redundancy between the datasets. VariBench is freely available at http://structure.bmc.lu.se/VariBench/. The contents of the datasets vary depending on information in the original source. The available datasets have been categorized into 20 groups and subgroups. There are datasets for insertions and deletions, substitutions in coding and non-coding region, structure mapped, synonymous and benign variants. Effect-specific datasets include DNA regulatory elements, RNA splicing, and protein property for aggregation, binding free energy, disorder and stability. Then there are several datasets for molecule-specific and disease-specific applications, as well as one dataset for variation phenotype effects. Variants are often described at three molecular levels (DNA, RNA and protein) and sometimes also at the protein structural level including relevant cross references and variant descriptions. The updated VariBench facilitates development and testing of new methods and comparison of obtained performances to previously published methods. We compared the performance of the pathogenicity/tolerance predictor PON-P2 to several benchmark studies, and show that such comparisons are feasible and useful, however, there may be limitations due to lack of provided details and shared data. Database URL: http://structure.bmc.lu.se/VariBench
Collapse
Affiliation(s)
- Anasua Sarkar
- Department of Experimental Medical Science, BMC B13, Lund University, SE-22 184 Lund, Sweden
| | - Yang Yang
- School of Computer Science and Technology, Soochow University, No1. Shizi Street, Suzhou, 215006 Jiangsu, China.,Provincial Key Laboratory for Computer Information Processing Technology, No1. Shizi Street, Soochow University, Suzhou, 215006 Jiangsu, China
| | - Mauno Vihinen
- Department of Experimental Medical Science, BMC B13, Lund University, SE-22 184 Lund, Sweden
| |
Collapse
|
13
|
Pires DEV, Rodrigues CHM, Ascher DB. mCSM-membrane: predicting the effects of mutations on transmembrane proteins. Nucleic Acids Res 2020; 48:W147-W153. [PMID: 32469063 PMCID: PMC7319563 DOI: 10.1093/nar/gkaa416] [Citation(s) in RCA: 62] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Revised: 05/04/2020] [Accepted: 05/28/2020] [Indexed: 12/17/2022] Open
Abstract
Significant efforts have been invested into understanding and predicting the molecular consequences of mutations in protein coding regions, however nearly all approaches have been developed using globular, soluble proteins. These methods have been shown to poorly translate to studying the effects of mutations in membrane proteins. To fill this gap, here we report, mCSM-membrane, a user-friendly web server that can be used to analyse the impacts of mutations on membrane protein stability and the likelihood of them being disease associated. mCSM-membrane derives from our well-established mutation modelling approach that uses graph-based signatures to model protein geometry and physicochemical properties for supervised learning. Our stability predictor achieved correlations of up to 0.72 and 0.67 (on cross validation and blind tests, respectively), while our pathogenicity predictor achieved a Matthew's Correlation Coefficient (MCC) of up to 0.77 and 0.73, outperforming previously described methods in both predicting changes in stability and in identifying pathogenic variants. mCSM-membrane will be an invaluable and dedicated resource for investigating the effects of single-point mutations on membrane proteins through a freely available, user friendly web server at http://biosig.unimelb.edu.au/mcsm_membrane.
Collapse
Affiliation(s)
- Douglas E V Pires
- Computational Biology and Clinical Informatics, Baker Institute, Melbourne, Victoria 3004, Australia.,Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC, 3052, Australia.,School of Computing and Information Systems, University of Melbourne, Parkville, VIC, 3052, Australia
| | - Carlos H M Rodrigues
- Computational Biology and Clinical Informatics, Baker Institute, Melbourne, Victoria 3004, Australia.,Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC, 3052, Australia
| | - David B Ascher
- Computational Biology and Clinical Informatics, Baker Institute, Melbourne, Victoria 3004, Australia.,Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC, 3052, Australia.,Department of Biochemistry, University of Cambridge, Cambridge, CB2 1GA, UK
| |
Collapse
|
14
|
Myung Y, Rodrigues CHM, Ascher DB, Pires DEV. mCSM-AB2: guiding rational antibody design using graph-based signatures. Bioinformatics 2020; 36:1453-1459. [PMID: 31665262 DOI: 10.1093/bioinformatics/btz779] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2019] [Revised: 10/07/2019] [Accepted: 10/23/2019] [Indexed: 12/11/2022] Open
Abstract
MOTIVATION A lack of accurate computational tools to guide rational mutagenesis has made affinity maturation a recurrent challenge in antibody (Ab) development. We previously showed that graph-based signatures can be used to predict the effects of mutations on Ab binding affinity. RESULTS Here we present an updated and refined version of this approach, mCSM-AB2, capable of accurately modelling the effects of mutations on Ab-antigen binding affinity, through the inclusion of evolutionary and energetic terms. Using a new and expanded database of over 1800 mutations with experimental binding measurements and structural information, mCSM-AB2 achieved a Pearson's correlation of 0.73 and 0.77 across training and blind tests, respectively, outperforming available methods currently used for rational Ab engineering. AVAILABILITY AND IMPLEMENTATION mCSM-AB2 is available as a user-friendly and freely accessible web server providing rapid analysis of both individual mutations or the entire binding interface to guide rational antibody affinity maturation at http://biosig.unimelb.edu.au/mcsm_ab2. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yoochan Myung
- Department of Biochemistry and Molecular Biology.,ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, VIC 3010, Australia.,Structural Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia
| | - Carlos H M Rodrigues
- Department of Biochemistry and Molecular Biology.,ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, VIC 3010, Australia.,Structural Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia
| | - David B Ascher
- Department of Biochemistry and Molecular Biology.,ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, VIC 3010, Australia.,Structural Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia.,Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK
| | - Douglas E V Pires
- Department of Biochemistry and Molecular Biology.,ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, VIC 3010, Australia.,Structural Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, VIC 3004, Australia.,School of Computing and Information Systems, University of Melbourne, Melbourne, VIC 3010, Australia
| |
Collapse
|
15
|
Rodrigues CHM, Pires DEV, Ascher DB. DynaMut2: Assessing changes in stability and flexibility upon single and multiple point missense mutations. Protein Sci 2020; 30:60-69. [PMID: 32881105 PMCID: PMC7737773 DOI: 10.1002/pro.3942] [Citation(s) in RCA: 207] [Impact Index Per Article: 51.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2020] [Revised: 08/27/2020] [Accepted: 08/28/2020] [Indexed: 12/11/2022]
Abstract
Predicting the effect of missense variations on protein stability and dynamics is important for understanding their role in diseases, and the link between protein structure and function. Approaches to estimate these changes have been proposed, but most only consider single‐point missense variants and a static state of the protein, with those that incorporate dynamics are computationally expensive. Here we present DynaMut2, a web server that combines Normal Mode Analysis (NMA) methods to capture protein motion and our graph‐based signatures to represent the wildtype environment to investigate the effects of single and multiple point mutations on protein stability and dynamics. DynaMut2 was able to accurately predict the effects of missense mutations on protein stability, achieving Pearson's correlation of up to 0.72 (RMSE: 1.02 kcal/mol) on a single point and 0.64 (RMSE: 1.80 kcal/mol) on multiple‐point missense mutations across 10‐fold cross‐validation and independent blind tests. For single‐point mutations, DynaMut2 achieved comparable performance with other methods when predicting variations in Gibbs Free Energy (ΔΔG) and in melting temperature (ΔTm). We anticipate our tool to be a valuable suite for the study of protein flexibility analysis and the study of the role of variants in disease. DynaMut2 is freely available as a web server and API at http://biosig.unimelb.edu.au/dynamut2.
Collapse
Affiliation(s)
- Carlos H M Rodrigues
- Structural Biology and Bioinformatics, Department of Biochemistry, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia
| | - Douglas E V Pires
- Structural Biology and Bioinformatics, Department of Biochemistry, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.,School of Computing and Information Systems, University of Melbourne, Melbourne, Victoria, Australia
| | - David B Ascher
- Structural Biology and Bioinformatics, Department of Biochemistry, Bio21 Institute, University of Melbourne, Melbourne, Victoria, Australia.,Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, Melbourne, Victoria, Australia.,Department of Biochemistry, University of Cambridge, Cambridge, UK
| |
Collapse
|
16
|
Abstract
Mutations in protein-coding regions can lead to large biological changes and are associated with genetic conditions, including cancers and Mendelian diseases, as well as drug resistance. Although whole genome and exome sequencing help to elucidate potential genotype-phenotype correlations, there is a large gap between the identification of new variants and deciphering their molecular consequences. A comprehensive understanding of these mechanistic consequences is crucial to better understand and treat diseases in a more personalized and effective way. This is particularly relevant considering estimates that over 80% of mutations associated with a disease are incorrectly assumed to be causative. A thorough analysis of potential effects of mutations is required to correctly identify the molecular mechanisms of disease and enable the distinction between disease-causing and non-disease-causing variation within a gene. Here we present an overview of our integrative mutation analysis platform, which focuses on refining the current genotype-phenotype correlation methods by using the wealth of protein structural information.
Collapse
|
17
|
Ward RA, Fawell S, Floc'h N, Flemington V, McKerrecher D, Smith PD. Challenges and Opportunities in Cancer Drug Resistance. Chem Rev 2020; 121:3297-3351. [PMID: 32692162 DOI: 10.1021/acs.chemrev.0c00383] [Citation(s) in RCA: 175] [Impact Index Per Article: 43.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
There has been huge progress in the discovery of targeted cancer therapies in recent years. However, even for the most successful and impactful cancer drugs which have been approved, both innate and acquired mechanisms of resistance are commonplace. These emerging mechanisms of resistance have been studied intensively, which has enabled drug discovery scientists to learn how it may be possible to overcome such resistance in subsequent generations of treatments. In some cases, novel drug candidates have been able to supersede previously approved agents; in other cases they have been used sequentially or in combinations with existing treatments. This review summarizes the current field in terms of the challenges and opportunities that cancer resistance presents to drug discovery scientists, with a focus on small molecule therapeutics. As part of this review, common themes and approaches have been identified which have been utilized to successfully target emerging mechanisms of resistance. This includes the increase in target potency and selectivity, alternative chemical scaffolds, change of mechanism of action (covalents, PROTACs), increases in blood-brain barrier permeability (BBBP), and the targeting of allosteric pockets. Finally, wider approaches are covered such as monoclonal antibodies (mAbs), bispecific antibodies, antibody drug conjugates (ADCs), and combination therapies.
Collapse
Affiliation(s)
- Richard A Ward
- Medicinal Chemistry, Oncology R&D, AstraZeneca, Cambridge CB4 0WG, U.K
| | - Stephen Fawell
- Oncology R&D, AstraZeneca, Waltham, Massachusetts 02451, United States
| | - Nicolas Floc'h
- Bioscience, Oncology R&D, AstraZeneca, Cambridge CB4 0WG, U.K
| | | | | | - Paul D Smith
- Bioscience, Oncology R&D, AstraZeneca, Cambridge CB4 0WG, U.K
| |
Collapse
|
18
|
Pires DEV, Ascher DB. mycoCSM: Using Graph-Based Signatures to Identify Safe Potent Hits against Mycobacteria. J Chem Inf Model 2020; 60:3450-3456. [PMID: 32615035 DOI: 10.1021/acs.jcim.0c00362] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Development of new potent, safe drugs to treat Mycobacteria has proven to be challenging, with limited hit rates of initial screens restricting subsequent development efforts. Despite significant efforts and the evolution of quantitative structure-activity relationship as well as machine learning-based models for computationally predicting molecule bioactivity, there is an unmet need for efficient and reliable methods for identifying biologically active compounds against Mycobacterium that are also safe for humans. Here we developed mycoCSM, a graph-based signature approach to rapidly identify compounds likely to be active against bacteria from the genus Mycobacterium, or against specific Mycobacteria species. mycoCSM was trained and validated on eight organism-specific and for the first time a general Mycobacteria data set, achieving correlation coefficients of up to 0.89 on cross-validation and 0.88 on independent blind tests, when predicting bioactivity in terms of minimum inhibitory concentration. In addition, we also developed a predictor to identify those compounds likely to penetrate in necrotic tuberculosis foci, which achieved a correlation coefficient of 0.75. Together with a built-in estimator of the maximum tolerated dose in humans, we believe this method will provide a valuable resource to enrich screening libraries with potent, safe molecules. To provide simple guidance in the selection of libraries with favorable anti-Mycobacteria properties, we made mycoCSM freely available online at http://biosig.unimelb.edu.au/myco_csm.
Collapse
Affiliation(s)
- Douglas E V Pires
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, 75 Commercial Road, Melbourne 3004, VIC, Australia.,Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, 30 Flemington Rd, Parkville 3052, VIC, Australia.,School of Computing and Information Systems, University of Melbourne, Parkville 3052, VIC, Australia
| | - David B Ascher
- Computational Biology and Clinical Informatics, Baker Heart and Diabetes Institute, 75 Commercial Road, Melbourne 3004, VIC, Australia.,Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, 30 Flemington Rd, Parkville 3052, VIC, Australia.,Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge CB2 1GA, England
| |
Collapse
|
19
|
Myung Y, Pires DEV, Ascher DB. mmCSM-AB: guiding rational antibody engineering through multiple point mutations. Nucleic Acids Res 2020; 48:W125-W131. [PMID: 32432715 PMCID: PMC7319589 DOI: 10.1093/nar/gkaa389] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2020] [Revised: 04/18/2020] [Accepted: 05/16/2020] [Indexed: 12/15/2022] Open
Abstract
While antibodies are becoming an increasingly important therapeutic class, especially in personalized medicine, their development and optimization has been largely through experimental exploration. While there have been many efforts to develop computational tools to guide rational antibody engineering, most approaches are of limited accuracy when applied to antibody design, and have largely been limited to analysing a single point mutation at a time. To overcome this gap, we have curated a dataset of 242 experimentally determined changes in binding affinity upon multiple point mutations in antibody-target complexes (89 increasing and 153 decreasing binding affinity). Here, we have shown that by using our graph-based signatures and atomic interaction information, we can accurately analyse the consequence of multi-point mutations on antigen binding affinity. Our approach outperformed other available tools across cross-validation and two independent blind tests, achieving Pearson's correlations of up to 0.95. We have implemented our new approach, mmCSM-AB, as a web-server that can help guide the process of affinity maturation in antibody design. mmCSM-AB is freely available at http://biosig.unimelb.edu.au/mmcsm_ab/.
Collapse
Affiliation(s)
- Yoochan Myung
- Computational Biology and Clinical Informatics, Baker Institute, Melbourne, VIC 3004, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC 3052, Australia
| | - Douglas E V Pires
- Computational Biology and Clinical Informatics, Baker Institute, Melbourne, VIC 3004, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC 3052, Australia
- School of Computing and Information Systems, University of Melbourne, Parkville, VIC 3052, Australia
| | - David B Ascher
- Computational Biology and Clinical Informatics, Baker Institute, Melbourne, VIC 3004, Australia
- Structural Biology and Bioinformatics, Department of Biochemistry and Molecular Biology, Bio21 Institute, University of Melbourne, Parkville, VIC 3052, Australia
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK
| |
Collapse
|
20
|
A Comprehensive Computational Platform to Guide Drug Development Using Graph-Based Signature Methods. Methods Mol Biol 2020. [PMID: 32006280 DOI: 10.1007/978-1-0716-0270-6_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]
Abstract
High-throughput computational techniques have become invaluable tools to help increase the overall success, process efficiency, and associated costs of drug development. By designing ligands tailored to specific protein structures in a disease of interest, an understanding of molecular interactions and ways to optimize them can be achieved prior to chemical synthesis. This understanding can help direct crucial chemical and biological experiments by maximizing available resources on higher quality leads. Moreover, predicting molecular binding affinity within specific biological contexts, as well as ligand pharmacokinetics and toxicities, can aid in filtering out redundant leads early on within the process. We describe a set of computational tools which can aid in drug discovery at different stages, from hit identification (EasyVS) to lead optimization and candidate selection (CSM-lig, mCSM-lig, Arpeggio, pkCSM). Incorporating these tools along the drug development process can help ensure that candidate leads are chemically and biologically feasible to become successful and tractable drugs.
Collapse
|
21
|
Nussinov R, Tsai C, Jang H. Autoinhibition can identify rare driver mutations and advise pharmacology. FASEB J 2019; 34:16-29. [DOI: 10.1096/fj.201901341r] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Revised: 09/18/2019] [Accepted: 10/09/2019] [Indexed: 12/16/2022]
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section Basic Science Program Frederick National Laboratory for Cancer Research Frederick MD USA
- Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine Tel Aviv University Tel Aviv Israel
| | - Chung‐Jung Tsai
- Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine Tel Aviv University Tel Aviv Israel
| | - Hyunbum Jang
- Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine Tel Aviv University Tel Aviv Israel
| |
Collapse
|
22
|
dendPoint: a web resource for dendrimer pharmacokinetics investigation and prediction. Sci Rep 2019; 9:15465. [PMID: 31664080 PMCID: PMC6820739 DOI: 10.1038/s41598-019-51789-3] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Accepted: 09/24/2019] [Indexed: 01/01/2023] Open
Abstract
Nanomedicine development currently suffers from a lack of efficient tools to predict pharmacokinetic behavior without relying upon testing in large numbers of animals, impacting success rates and development costs. This work presents dendPoint, the first in silico model to predict the intravenous pharmacokinetics of dendrimers, a commonly explored drug vector, based on physicochemical properties. We have manually curated the largest relational database of dendrimer pharmacokinetic parameters and their structural/physicochemical properties. This was used to develop a machine learning-based model capable of accurately predicting pharmacokinetic parameters, including half-life, clearance, volume of distribution and dose recovered in the liver and urine. dendPoint successfully predicts dendrimer pharmacokinetic properties, achieving correlations of up to r = 0.83 and Q2 up to 0.68. dendPoint is freely available as a user-friendly web-service and database at http://biosig.unimelb.edu.au/dendpoint. This platform is ultimately expected to be used to guide dendrimer construct design and refinement prior to embarking on more time consuming and expensive in vivo testing.
Collapse
|
23
|
Rodrigues CHM, Myung Y, Pires DEV, Ascher DB. mCSM-PPI2: predicting the effects of mutations on protein-protein interactions. Nucleic Acids Res 2019; 47:W338-W344. [PMID: 31114883 PMCID: PMC6602427 DOI: 10.1093/nar/gkz383] [Citation(s) in RCA: 200] [Impact Index Per Article: 40.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2019] [Revised: 04/30/2019] [Accepted: 05/20/2019] [Indexed: 12/13/2022] Open
Abstract
Protein-protein Interactions are involved in most fundamental biological processes, with disease causing mutations enriched at their interfaces. Here we present mCSM-PPI2, a novel machine learning computational tool designed to more accurately predict the effects of missense mutations on protein-protein interaction binding affinity. mCSM-PPI2 uses graph-based structural signatures to model effects of variations on the inter-residue interaction network, evolutionary information, complex network metrics and energetic terms to generate an optimised predictor. We demonstrate that our method outperforms previous methods, ranking first among 26 others on CAPRI blind tests. mCSM-PPI2 is freely available as a user friendly webserver at http://biosig.unimelb.edu.au/mcsm_ppi2/.
Collapse
Affiliation(s)
- Carlos H M Rodrigues
- Department of Biochemistry and Molecular Biology, University of Melbourne, Melbourne, Australia
- ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, Australia
- Structural Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, Australia
| | - Yoochan Myung
- Department of Biochemistry and Molecular Biology, University of Melbourne, Melbourne, Australia
- ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, Australia
- Structural Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, Australia
| | - Douglas E V Pires
- Department of Biochemistry and Molecular Biology, University of Melbourne, Melbourne, Australia
- ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, Australia
- Structural Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, Australia
| | - David B Ascher
- Department of Biochemistry and Molecular Biology, University of Melbourne, Melbourne, Australia
- ACRF Facility for Innovative Cancer Drug Discovery, Bio21 Institute, University of Melbourne, Melbourne, Australia
- Structural Biology and Bioinformatics, Baker Heart and Diabetes Institute, Melbourne, Australia
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| |
Collapse
|