1
|
Le SP, Krishna J, Gupta P, Dutta R, Li S, Chen J, Thayumanavan S. Polymers for Disrupting Protein-Protein Interactions: Where Are We and Where Should We Be? Biomacromolecules 2024. [PMID: 39254158 DOI: 10.1021/acs.biomac.4c00850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/11/2024]
Abstract
Protein-protein interactions (PPIs) are central to the cellular signaling and regulatory networks that underlie many physiological and pathophysiological processes. It is challenging to target PPIs using traditional small molecule or peptide-based approaches due to the frequent lack of well-defined binding pockets at the large and flat PPI interfaces. Synthetic polymers offer an opportunity to circumvent these challenges by providing unparalleled flexibility in tuning their physiochemical properties to achieve the desired binding properties. In this review, we summarize the current state of the field pertaining to polymer-protein interactions in solution, highlighting various polyelectrolyte systems, their tunable parameters, and their characterization. We provide an outlook on how these architectures can be improved by incorporating sequence control, foldability, and machine learning to mimic proteins at every structural level. Advances in these directions will enable the design of more specific protein-binding polymers and provide an effective strategy for targeting dynamic proteins, such as intrinsically disordered proteins.
Collapse
Affiliation(s)
- Stephanie P Le
- Department of Chemistry, University of Massachusetts, Amherst, Amherst, Massachusetts 01003, United States
- Center for Bioactive Delivery, Institute for Applied Life Sciences, University of Massachusetts, Amherst, Amherst, Massachusetts 01003, United States
| | - Jithu Krishna
- Department of Chemistry, University of Massachusetts, Amherst, Amherst, Massachusetts 01003, United States
- Center for Bioactive Delivery, Institute for Applied Life Sciences, University of Massachusetts, Amherst, Amherst, Massachusetts 01003, United States
| | - Prachi Gupta
- Department of Chemistry, University of Massachusetts, Amherst, Amherst, Massachusetts 01003, United States
- Center for Bioactive Delivery, Institute for Applied Life Sciences, University of Massachusetts, Amherst, Amherst, Massachusetts 01003, United States
| | - Ranit Dutta
- Department of Chemistry, University of Massachusetts, Amherst, Amherst, Massachusetts 01003, United States
- Center for Bioactive Delivery, Institute for Applied Life Sciences, University of Massachusetts, Amherst, Amherst, Massachusetts 01003, United States
| | - Shanlong Li
- Department of Chemistry, University of Massachusetts, Amherst, Amherst, Massachusetts 01003, United States
- Center for Bioactive Delivery, Institute for Applied Life Sciences, University of Massachusetts, Amherst, Amherst, Massachusetts 01003, United States
| | - Jianhan Chen
- Department of Chemistry, University of Massachusetts, Amherst, Amherst, Massachusetts 01003, United States
| | - S Thayumanavan
- Department of Chemistry, University of Massachusetts, Amherst, Amherst, Massachusetts 01003, United States
- Center for Bioactive Delivery, Institute for Applied Life Sciences, University of Massachusetts, Amherst, Amherst, Massachusetts 01003, United States
- Department of Biomedical Engineering, University of Massachusetts, Amherst, Amherst, Massachusetts 01003, United States
| |
Collapse
|
2
|
Nandi S, Bhaduri S, Das D, Ghosh P, Mandal M, Mitra P. Deciphering the Lexicon of Protein Targets: A Review on Multifaceted Drug Discovery in the Era of Artificial Intelligence. Mol Pharm 2024; 21:1563-1590. [PMID: 38466810 DOI: 10.1021/acs.molpharmaceut.3c01161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/13/2024]
Abstract
Understanding protein sequence and structure is essential for understanding protein-protein interactions (PPIs), which are essential for many biological processes and diseases. Targeting protein binding hot spots, which regulate signaling and growth, with rational drug design is promising. Rational drug design uses structural data and computational tools to study protein binding sites and protein interfaces to design inhibitors that can change these interactions, thereby potentially leading to therapeutic approaches. Artificial intelligence (AI), such as machine learning (ML) and deep learning (DL), has advanced drug discovery and design by providing computational resources and methods. Quantum chemistry is essential for drug reactivity, toxicology, drug screening, and quantitative structure-activity relationship (QSAR) properties. This review discusses the methodologies and challenges of identifying and characterizing hot spots and binding sites. It also explores the strategies and applications of artificial-intelligence-based rational drug design technologies that target proteins and protein-protein interaction (PPI) binding hot spots. It provides valuable insights for drug design with therapeutic implications. We have also demonstrated the pathological conditions of heat shock protein 27 (HSP27) and matrix metallopoproteinases (MMP2 and MMP9) and designed inhibitors of these proteins using the drug discovery paradigm in a case study on the discovery of drug molecules for cancer treatment. Additionally, the implications of benzothiazole derivatives for anticancer drug design and discovery are deliberated.
Collapse
Affiliation(s)
- Suvendu Nandi
- School of Medical Science and Technology, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal 721302, India
| | - Soumyadeep Bhaduri
- Centre for Computational and Data Sciences, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal 721302, India
| | - Debraj Das
- Centre for Computational and Data Sciences, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal 721302, India
| | - Priya Ghosh
- School of Medical Science and Technology, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal 721302, India
| | - Mahitosh Mandal
- School of Medical Science and Technology, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal 721302, India
| | - Pralay Mitra
- Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal 721302, India
| |
Collapse
|
3
|
Hernández González JE, de Araujo AS. Alchemical Calculation of Relative Free Energies for Charge-Changing Mutations at Protein-Protein Interfaces Considering Fixed and Variable Protonation States. J Chem Inf Model 2023; 63:6807-6822. [PMID: 37851531 DOI: 10.1021/acs.jcim.3c00972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2023]
Abstract
The calculation of relative free energies (ΔΔG) for charge-changing mutations at protein-protein interfaces through alchemical methods remains challenging due to variations in the system's net charge during charging steps, the possibility of mutated and contacting ionizable residues occurring in various protonation states, and undersampling issues. In this study, we present a set of strategies, collectively termed TIRST/TIRST-H+, to address some of these challenges. Our approaches combine thermodynamic integration (TI) with the prediction of pKa shifts to calculate ΔΔG values. Moreover, special sets of restraints are employed to keep the alchemically transformed molecules separated. The accuracy of the devised approaches was assessed on a large and diverse data set comprising 164 point mutations of charged residues (Asp, Glu, Lys, and Arg) to Ala at the protein-protein interfaces of complexes with known three-dimensional structures. Mean absolute and root-mean-square errors ranging from 1.38 to 1.66 and 1.89 to 2.44 kcal/mol, respectively, and Pearson correlation coefficients of ∼0.6 were obtained when testing the approaches on the selected data set using the GPU-TI module of Amber18 suite and the ff14SB force field. Furthermore, the inclusion of variable protonation states for the mutated acid residues improved the accuracy of the predicted ΔΔG values. Therefore, our results validate the use of TIRST/TIRST-H+ in prospective studies aimed at evaluating the impact of charge-changing mutations to Ala on the stability of protein-protein complexes.
Collapse
|
4
|
Hagg A, Kirschner KN. Open-Source Machine Learning in Computational Chemistry. J Chem Inf Model 2023; 63:4505-4532. [PMID: 37466636 PMCID: PMC10430767 DOI: 10.1021/acs.jcim.3c00643] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Indexed: 07/20/2023]
Abstract
The field of computational chemistry has seen a significant increase in the integration of machine learning concepts and algorithms. In this Perspective, we surveyed 179 open-source software projects, with corresponding peer-reviewed papers published within the last 5 years, to better understand the topics within the field being investigated by machine learning approaches. For each project, we provide a short description, the link to the code, the accompanying license type, and whether the training data and resulting models are made publicly available. Based on those deposited in GitHub repositories, the most popular employed Python libraries are identified. We hope that this survey will serve as a resource to learn about machine learning or specific architectures thereof by identifying accessible codes with accompanying papers on a topic basis. To this end, we also include computational chemistry open-source software for generating training data and fundamental Python libraries for machine learning. Based on our observations and considering the three pillars of collaborative machine learning work, open data, open source (code), and open models, we provide some suggestions to the community.
Collapse
Affiliation(s)
- Alexander Hagg
- Institute
of Technology, Resource and Energy-Efficient Engineering (TREE), University of Applied Sciences Bonn-Rhein-Sieg, 53757 Sankt Augustin, Germany
- Department
of Electrical Engineering, Mechanical Engineering and Technical Journalism, University of Applied Sciences Bonn-Rhein-Sieg, 53757 Sankt Augustin, Germany
| | - Karl N. Kirschner
- Institute
of Technology, Resource and Energy-Efficient Engineering (TREE), University of Applied Sciences Bonn-Rhein-Sieg, 53757 Sankt Augustin, Germany
- Department
of Computer Science, University of Applied
Sciences Bonn-Rhein-Sieg, 53757 Sankt Augustin, Germany
| |
Collapse
|
5
|
Kim Y, Yoon T, Park WB, Na S. Predicting mechanical properties of silk from its amino acid sequences via machine learning. J Mech Behav Biomed Mater 2023; 140:105739. [PMID: 36871478 DOI: 10.1016/j.jmbbm.2023.105739] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2022] [Revised: 02/12/2023] [Accepted: 02/21/2023] [Indexed: 02/25/2023]
Abstract
The silk fiber is increasingly being sought for its superior mechanical properties, biocompatibility, and eco-friendliness, making it promising as a base material for various applications. One of the characteristics of protein fibers, such as silk, is that their mechanical properties are significantly dependent on the amino acid sequence. Numerous studies have been conducted to determine the specific relationship between the amino acid sequence of silk and its mechanical properties. Still, the relationship between the amino acid sequence of silk and its mechanical properties is yet to be clarified. Other fields have adopted machine learning (ML) to establish a relationship between the inputs, such as the ratio of different input material compositions and the resulting mechanical properties. We have proposed a method to convert the amino acid sequence into numerical values for input and succeeded in predicting the mechanical properties of silk from its amino acid sequences. Our study sheds light on predicting mechanical properties of silk fiber from respective amino acid sequences.
Collapse
|
6
|
Tuameh A, Harding SE, Darton NJ. Methods for addressing host cell protein impurities in biopharmaceutical product development. Biotechnol J 2023; 18:e2200115. [PMID: 36427352 DOI: 10.1002/biot.202200115] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 11/18/2022] [Accepted: 11/22/2022] [Indexed: 11/27/2022]
Abstract
The high demand for monoclonal antibody (mAb) therapeutics in recent years has resulted in significant efforts to improve their costly manufacturing process. The high cost of manufacturing mAbs derives mainly from the purification process, which contributes to 50%-80% of the total manufacturing cost. One of the main challenges facing industry at the purification stage is the clearance of host cell proteins (HCPs) that are produced and often co-purified with the desired mAb product. One of the issues HCPs can cause is the degradation of the final mAb protein product. In this review, techniques are considered that can be used at different stages (upstream and downstream) of mAb manufacture to improve HCP clearance. In addition to established techniques, many new approaches for HCP removal are discussed that have the potential to replace current methods for improving HCP reduction and thereby the quality and stability of the final mAb product.
Collapse
Affiliation(s)
- Abdulrahman Tuameh
- National Centre for Macromolecular Hydrodynamics, School of Biosciences, University of Nottingham, Sutton Bonington, UK
| | - Stephen E Harding
- National Centre for Macromolecular Hydrodynamics, School of Biosciences, University of Nottingham, Sutton Bonington, UK
| | - Nicholas J Darton
- Dosage Form Design and Development, BioPharmaceuticals R&D, AstraZeneca, Cambridge, UK
| |
Collapse
|
7
|
Khan A, Cowen-Rivers AI, Grosnit A, Deik DGX, Robert PA, Greiff V, Smorodina E, Rawat P, Akbar R, Dreczkowski K, Tutunov R, Bou-Ammar D, Wang J, Storkey A, Bou-Ammar H. Toward real-world automated antibody design with combinatorial Bayesian optimization. CELL REPORTS METHODS 2023; 3:100374. [PMID: 36814835 PMCID: PMC9939385 DOI: 10.1016/j.crmeth.2022.100374] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2022] [Revised: 10/08/2022] [Accepted: 12/07/2022] [Indexed: 06/14/2023]
Abstract
Antibodies are multimeric proteins capable of highly specific molecular recognition. The complementarity determining region 3 of the antibody variable heavy chain (CDRH3) often dominates antigen-binding specificity. Hence, it is a priority to design optimal antigen-specific CDRH3 to develop therapeutic antibodies. The combinatorial structure of CDRH3 sequences makes it impossible to query binding-affinity oracles exhaustively. Moreover, antibodies are expected to have high target specificity and developability. Here, we present AntBO, a combinatorial Bayesian optimization framework utilizing a CDRH3 trust region for an in silico design of antibodies with favorable developability scores. The in silico experiments on 159 antigens demonstrate that AntBO is a step toward practically viable in vitro antibody design. In under 200 calls to the oracle, AntBO suggests antibodies outperforming the best binding sequence from 6.9 million experimentally obtained CDRH3s. Additionally, AntBO finds very-high-affinity CDRH3 in only 38 protein designs while requiring no domain knowledge.
Collapse
Affiliation(s)
- Asif Khan
- School of Informatics, University of Edinburgh, Edinburgh EH8 9YL, UK
| | | | | | | | - Philippe A. Robert
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo 0315, Norway
| | - Victor Greiff
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo 0315, Norway
| | - Eva Smorodina
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo 0315, Norway
| | - Puneet Rawat
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo 0315, Norway
| | - Rahmad Akbar
- Department of Immunology, University of Oslo and Oslo University Hospital, Oslo 0315, Norway
| | | | | | - Dany Bou-Ammar
- American University of Beirut Medical Centre, Beirut 11-0236, Lebanon
| | - Jun Wang
- Huawei Noah’s Ark Lab, London N1C 4AG, UK
- University College London, London WC1E 6BT, UK
| | - Amos Storkey
- School of Informatics, University of Edinburgh, Edinburgh EH8 9YL, UK
| | - Haitham Bou-Ammar
- Huawei Noah’s Ark Lab, London N1C 4AG, UK
- University College London, London WC1E 6BT, UK
| |
Collapse
|
8
|
Oliva F, Musiani F, Giorgetti A, De Rubeis S, Sorokina O, Armstrong DJ, Carloni P, Ruggerone P. Modelling eNvironment for Isoforms (MoNvIso): A general platform to predict structural determinants of protein isoforms in genetic diseases. Front Chem 2023; 10:1059593. [PMID: 36700074 PMCID: PMC9868658 DOI: 10.3389/fchem.2022.1059593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2022] [Accepted: 12/06/2022] [Indexed: 01/11/2023] Open
Abstract
The seamless integration of human disease-related mutation data into protein structures is an essential component of any attempt to correctly assess the impact of the mutation. The key step preliminary to any structural modelling is the identification of the isoforms onto which mutations should be mapped due to there being several functionally different protein isoforms from the same gene. To handle large sets of data coming from omics techniques, this challenging task needs to be automatized. Here we present the MoNvIso (Modelling eNvironment for Isoforms) code, which identifies the most useful isoform for computational modelling, balancing the coverage of mutations of interest and the availability of templates to build a structural model of both the wild-type isoform and the related variants.
Collapse
Affiliation(s)
- Francesco Oliva
- Department of Physics, University of Cagliari, Monserrato (CA), Italy,Institute of Neuroscience and Medicine INM-9, Institute for Advanced Simulations IAS-5, Forschungszentrum Jülich, Jülich, Germany
| | - Francesco Musiani
- Laboratory of Bioinorganic Chemistry, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy
| | - Alejandro Giorgetti
- Institute of Neuroscience and Medicine INM-9, Institute for Advanced Simulations IAS-5, Forschungszentrum Jülich, Jülich, Germany,Department of Biotechnology, University of Verona, Verona, Italy
| | - Silvia De Rubeis
- Seaver Autism Center for Research and Treatment, Icahn School of Medicine at Mount Sinai, New York, NY, United States,Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, United States,The Mindich Child Health and Development Institute, Icahn School of Medicine at Mount Sinai, New York, NY, United States,Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, United States
| | - Oksana Sorokina
- The School of Informatics, University of Edinburgh, Edinburgh, United Kingdom
| | - Douglas J. Armstrong
- Institute of Neuroscience and Medicine INM-9, Institute for Advanced Simulations IAS-5, Forschungszentrum Jülich, Jülich, Germany,The School of Informatics, University of Edinburgh, Edinburgh, United Kingdom,Simons Initiative for the Developing Brain, University of Edinburgh, Edinburgh, United Kingdom
| | - Paolo Carloni
- Institute of Neuroscience and Medicine INM-9, Institute for Advanced Simulations IAS-5, Forschungszentrum Jülich, Jülich, Germany,Department of Physics, RWTH Aachen University, Aachen, Germany,JARA-Institute: Molecular Neuroscience and Neuroimaging, Institute for Neuroscience and Medicine INM-11/JARA-BRAIN Institute JBI-2, Forschungszentrum Jülich GmbH, Jülich, Germany
| | - Paolo Ruggerone
- Department of Physics, University of Cagliari, Monserrato (CA), Italy,*Correspondence: Paolo Ruggerone,
| |
Collapse
|
9
|
Delaunay M, Ha-Duong T. Computational Tools and Strategies to Develop Peptide-Based Inhibitors of Protein-Protein Interactions. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2022; 2405:205-230. [PMID: 35298816 DOI: 10.1007/978-1-0716-1855-4_11] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Protein-protein interactions play crucial and subtle roles in many biological processes and modifications of their fine mechanisms generally result in severe diseases. Peptide derivatives are very promising therapeutic agents for modulating protein-protein associations with sizes and specificities between those of small compounds and antibodies. For the same reasons, rational design of peptide-based inhibitors naturally borrows and combines computational methods from both protein-ligand and protein-protein research fields. In this chapter, we aim to provide an overview of computational tools and approaches used for identifying and optimizing peptides that target protein-protein interfaces with high affinity and specificity. We hope that this review will help to implement appropriate in silico strategies for peptide-based drug design that builds on available information for the systems of interest.
Collapse
Affiliation(s)
| | - Tâp Ha-Duong
- Université Paris-Saclay, CNRS, BioCIS, Châtenay-Malabry, France.
| |
Collapse
|
10
|
Zheng P, Wang S, Wang X, Zeng X. Editorial: Artificial Intelligence in Bioinformatics and Drug Repurposing: Methods and Applications. Front Genet 2022; 13:870795. [PMID: 35368698 PMCID: PMC8969764 DOI: 10.3389/fgene.2022.870795] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 02/21/2022] [Indexed: 11/13/2022] Open
Affiliation(s)
- Pan Zheng
- Department of Accounting and Information Systems, University of Canterbury, Christchurch, New Zealand
- *Correspondence: Pan Zheng,
| | - Shudong Wang
- College of Computer Science and Technology, China University of Petroleum, Qingdao, China
| | - Xun Wang
- College of Computer Science and Technology, China University of Petroleum, Qingdao, China
| | - Xiangxiang Zeng
- Department of Computer Science, Hunan University, Changsha, China
| |
Collapse
|
11
|
Li S, Wu S, Wang L, Li F, Jiang H, Bai F. Recent advances in predicting protein-protein interactions with the aid of artificial intelligence algorithms. Curr Opin Struct Biol 2022; 73:102344. [PMID: 35219216 DOI: 10.1016/j.sbi.2022.102344] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Revised: 01/02/2022] [Accepted: 01/17/2022] [Indexed: 12/15/2022]
Abstract
Protein-protein interactions (PPIs) are essential in the regulation of biological functions and cell events, therefore understanding PPIs have become a key issue to understanding the molecular mechanism and investigating the design of drugs. Here we highlight the major developments in computational methods developed for predicting PPIs by using types of artificial intelligence algorithms. The first part introduces the source of experimental PPI data. The second part is devoted to the PPI prediction methods based on sequential information. The third part covers representative methods using structural information as the input feature. The last part is methods designed by combining different types of features. For each part, the state-of-the-art computational PPI prediction methods are reviewed in an inclusive view. Finally, we discuss the flaws existing in this area and future directions of next-generation algorithms.
Collapse
Affiliation(s)
- Shiwei Li
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai, China
| | - Sanan Wu
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai, China
| | - Lin Wang
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai, China
| | - Fenglei Li
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai, China; School of Information Science and Technology, ShanghaiTech University, Shanghai, China
| | - Hualiang Jiang
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai, China; Drug Discovery and Design Center, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Pudong, Shanghai, 201203, China
| | - Fang Bai
- Shanghai Institute for Advanced Immunochemical Studies and School of Life Science and Technology, ShanghaiTech University, Shanghai, China; School of Information Science and Technology, ShanghaiTech University, Shanghai, China.
| |
Collapse
|
12
|
Chen YC, Chen YH, Wright JD, Lim C. PPI-Hotspot DB: Database of Protein-Protein Interaction Hot Spots. J Chem Inf Model 2022; 62:1052-1060. [PMID: 35147037 DOI: 10.1021/acs.jcim.2c00025] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Single-point mutations of certain residues (so-called hot spots) impair/disrupt protein-protein interactions (PPIs), leading to pathogenesis and drug resistance. Conventionally, a PPI-hot spot is identified when its replacement decreased the binding free energy significantly, generally by ≥2 kcal/mol. The relatively few mutations with such a significant binding free energy drop limited the number of distinct PPI-hot spots. By defining PPI-hot spots based on mutations that have been manually curated in UniProtKB to significantly impair/disrupt PPIs in addition to binding free energy changes, we have greatly expanded the number of distinct PPI-hot spots by an order of magnitude. These experimentally determined PPI-hot spots along with available structures have been collected in a database called PPI-HotspotDB. We have applied the PPI-HotspotDB to create a nonredundant benchmark, PPI-Hotspot+PDBBM, for assessing methods to predict PPI-hot spots using the free structure as input. PPI-HotspotDB will benefit the design of mutagenesis experiments and development of PPI-hot spot prediction methods. The database and benchmark are freely available at https://ppihotspot.limlab.dnsalias.org.
Collapse
Affiliation(s)
- Yao Chi Chen
- Institute of Biomedical Sciences, Academia Sinica, Taipei 115, Taiwan
| | - Yu-Hsien Chen
- Institute of Biomedical Sciences, Academia Sinica, Taipei 115, Taiwan
| | - Jon D Wright
- Institute of Biomedical Sciences, Academia Sinica, Taipei 115, Taiwan
| | - Carmay Lim
- Institute of Biomedical Sciences, Academia Sinica, Taipei 115, Taiwan.,Department of Chemistry, National Tsing Hua University, Hsinchu 300, Taiwan
| |
Collapse
|
13
|
Khetan R, Curtis R, Deane CM, Hadsund JT, Kar U, Krawczyk K, Kuroda D, Robinson SA, Sormanni P, Tsumoto K, Warwicker J, Martin ACR. Current advances in biopharmaceutical informatics: guidelines, impact and challenges in the computational developability assessment of antibody therapeutics. MAbs 2022; 14:2020082. [PMID: 35104168 PMCID: PMC8812776 DOI: 10.1080/19420862.2021.2020082] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Therapeutic monoclonal antibodies and their derivatives are key components of clinical pipelines in the global biopharmaceutical industry. The availability of large datasets of antibody sequences, structures, and biophysical properties is increasingly enabling the development of predictive models and computational tools for the "developability assessment" of antibody drug candidates. Here, we provide an overview of the antibody informatics tools applicable to the prediction of developability issues such as stability, aggregation, immunogenicity, and chemical degradation. We further evaluate the opportunities and challenges of using biopharmaceutical informatics for drug discovery and optimization. Finally, we discuss the potential of developability guidelines based on in silico metrics that can be used for the assessment of antibody stability and manufacturability.
Collapse
Affiliation(s)
- Rahul Khetan
- Manchester Institute of Biotechnology, University of Manchester, Manchester, UK
| | - Robin Curtis
- Manchester Institute of Biotechnology, University of Manchester, Manchester, UK
| | | | | | - Uddipan Kar
- Department of Biological Engineering, Massachusetts Institute of Technology (MIT), Cambridge, MA, USA
| | | | - Daisuke Kuroda
- Department of Bioengineering, School of Engineering, The University of Tokyo, Tokyo, Japan.,Medical Device Development and Regulation Research Center, School of Engineering, The University of Tokyo, Tokyo, Japan.,Department of Chemistry and Biotechnology, School of Engineering, The University of Tokyo, Tokyo, Japan
| | | | - Pietro Sormanni
- Chemistry of Health, Yusuf Hamied Department of Chemistry, University of Cambridge
| | - Kouhei Tsumoto
- Department of Bioengineering, School of Engineering, The University of Tokyo, Tokyo, Japan.,Medical Device Development and Regulation Research Center, School of Engineering, The University of Tokyo, Tokyo, Japan.,Department of Chemistry and Biotechnology, School of Engineering, The University of Tokyo, Tokyo, Japan.,The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| | - Jim Warwicker
- Manchester Institute of Biotechnology, University of Manchester, Manchester, UK
| | - Andrew C R Martin
- Institute of Structural and Molecular Biology, Division of Biosciences, University College London, London, UK
| |
Collapse
|
14
|
Zou W, Huang C, Sun Q, Zhao K, Gao H, Su R, Li Y. A stepwise mutagenesis approach using histidine and acidic amino acid to engineer highly pH-dependent protein switches. 3 Biotech 2022; 12:21. [PMID: 34956814 PMCID: PMC8686790 DOI: 10.1007/s13205-021-03079-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2021] [Accepted: 11/26/2021] [Indexed: 10/26/2022] Open
Abstract
Antibody-based drugs can be highly toxic, because they target normal tissue as well as tumor tissue. The pH value of the extracellular microenvironments around tumor tissues is lower than that of normal tissues. Therefore, antibodies that engage in pH-dependent binding at slightly acidic pH are crucial for improving the safety of antibody-based drugs. Thus, we implemented a stepwise mutagenesis approach to engineering pH-dependent antibodies capable of selective binding in the acidic microenvironment in this study. The first step involved single-residue histidine scanning mutagenesis of the antibody's complementarity-determining regions to prescreen for pH-dependent mutants and identify ionizable sensitive hot-spot residues that could be substituted by acidic amino acids to obtain pH-dependent antibodies. The second step involved single-acidic amino acid residue substitutions of the identified residues and the assessment of pH-dependent binding. We identified six ionizable sensitive hot-spot residues using single-histidine scanning mutagenesis. Nine pH-dependent antibodies were isolated using single-acidic amino acid residue mutagenesis at the six hot-spot residue positions. Relative to wild-type anti-CEA chimera antibody, the binding selectivity of the best performing mutant was improved by approximately 32-fold according to ELISA and by tenfold according to FACS assay. The mutant had a high affinity in the pH range of 5.5-6.0. This study supports the development of pH-dependent protein switches and increases our understanding of the role of ionizable residues in protein interfaces. The stepwise mutagenesis approach is rapid, general, and robust and is expected to produce pH-sensitive protein affinity reagents for various applications.
Collapse
Affiliation(s)
- Wenjun Zou
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049 China
- Key Laboratory of Interdisciplinary Research, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101 China
| | - Chuncui Huang
- Key Laboratory of Interdisciplinary Research, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101 China
| | - Qing Sun
- Key Laboratory of Interdisciplinary Research, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101 China
| | - Keli Zhao
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049 China
- Key Laboratory of Interdisciplinary Research, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101 China
| | - Huanyu Gao
- Key Laboratory of Interdisciplinary Research, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101 China
| | - Rong Su
- Department of Clinical Laboratory, Foshan Hospital of Traditional Chinese Medicine, Foshan, 528000 Guangdong China
| | - Yan Li
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing, 100049 China
- Key Laboratory of Interdisciplinary Research, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101 China
| |
Collapse
|
15
|
Ovek D, Abali Z, Zeylan ME, Keskin O, Gursoy A, Tuncbag N. Artificial intelligence based methods for hot spot prediction. Curr Opin Struct Biol 2021; 72:209-218. [PMID: 34954608 DOI: 10.1016/j.sbi.2021.11.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Revised: 10/07/2021] [Accepted: 11/08/2021] [Indexed: 11/29/2022]
Abstract
Proteins interact through their interfaces to fulfill essential functions in the cell. They bind to their partners in a highly specific manner and form complexes that have a profound effect on understanding the biological pathways they are involved in. Any abnormal interactions may cause diseases. Therefore, the identification of small molecules which modulate protein interactions through their interfaces has high therapeutic potential. However, discovering such molecules is challenging. Most protein-protein binding affinity is attributed to a small set of amino acids found in protein interfaces known as hot spots. Recent studies demonstrate that drug-like small molecules specifically may bind to hot spots. Therefore, hot spot prediction is crucial. As experimental data accumulates, artificial intelligence begins to be used for computational hot spot prediction. First, we review machine learning and deep learning for computational hot spot prediction and then explain the significance of hot spots toward drug design.
Collapse
Affiliation(s)
- Damla Ovek
- College of Engineering, Koc University, 34450 Istanbul, Turkey
| | - Zeynep Abali
- College of Engineering, Koc University, 34450 Istanbul, Turkey
| | | | - Ozlem Keskin
- College of Engineering, Koc University, 34450 Istanbul, Turkey.
| | - Attila Gursoy
- College of Engineering, Koc University, 34450 Istanbul, Turkey.
| | - Nurcan Tuncbag
- College of Engineering, Koc University, 34450 Istanbul, Turkey; School of Medicine, Koc University, 34450 Istanbul, Turkey.
| |
Collapse
|
16
|
Lin HH, Zhang QR, Kong X, Zhang L, Zhang Y, Tang Y, Xu H. Machine learning prediction of antiviral-HPV protein interactions for anti-HPV pharmacotherapy. Sci Rep 2021; 11:24367. [PMID: 34934067 PMCID: PMC8692573 DOI: 10.1038/s41598-021-03000-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2021] [Accepted: 11/22/2021] [Indexed: 02/05/2023] Open
Abstract
Persistent infection with high-risk types Human Papillomavirus could cause diseases including cervical cancers and oropharyngeal cancers. Nonetheless, so far there is no effective pharmacotherapy for treating the infection from high-risk HPV types, and hence it remains to be a severe threat to the health of female. Based on drug repositioning strategy, we trained and benchmarked multiple machine learning models so as to predict potential effective antiviral drugs for HPV infection in this work. Through optimizing models, measuring models' predictive performance using 182 pairs of antiviral-target interaction dataset which were all approved by the United States Food and Drug Administration, and benchmarking different models' predictive performance, we identified the optimized Support Vector Machine and K-Nearest Neighbor classifier with high precision score were the best two predictors (0.80 and 0.85 respectively) amongst classifiers of Support Vector Machine, Random forest, Adaboost, Naïve Bayes, K-Nearest Neighbors, and Logistic regression classifier. We applied these two predictors together and successfully predicted 57 pairs of antiviral-HPV protein interactions from 864 pairs of antiviral-HPV protein associations. Our work provided good drug candidates for anti-HPV drug discovery. So far as we know, we are the first one to conduct such HPV-oriented computational drug repositioning study.
Collapse
Affiliation(s)
- Hui-Heng Lin
- Yuebei People's Hospital, Shantou University Medical College, No. 133 of Huimin South road, Wujiang District, Shaoguan City, 512025, China.
| | - Qian-Ru Zhang
- Key Lab of the Basic Pharmacology of the Ministry of Education, School of Pharmacy, Zunyi Medical University, Guizhou Province, 6 West Xue-Fu Road, Zunyi City, 563000, China
| | - Xiangjun Kong
- State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau Avenida de Universidade, Macau, 999078, Macau, China
| | - Liuping Zhang
- Department of Gynecology, Panyu Central Hospital, No. 8 of Fuyu East Road, Panyu District, Guangzhou, 511400, China
| | - Yong Zhang
- Interdisciplinary Research Center for Agriculture Green Development in Yangtze River Basin, Southwest University, Beibei District, No.1-2-1 Tiansheng Road, Chongqing, 400715, China
| | - Yanyan Tang
- Department of Neurology, The First Affiliated Hospital of Guangxi Medical University, No.6 Shuangyong Road, Nanning, 530021, Guangxi, China
| | - Hongyan Xu
- Yuebei People's Hospital, Shantou University Medical College, No. 133 of Huimin South road, Wujiang District, Shaoguan City, 512025, China.
- Department of Gynecology, Yuebei People's Hospital, Shantou University Medical College, No. 133 of Huimin South road, Wujiang District, Shaoguan City, 512025, China.
| |
Collapse
|
17
|
Liu J, Liu S, Liu C, Zhang Y, Pan Y, Wang Z, Wang J, Wen T, Deng L. Nabe: an energetic database of amino acid mutations in protein-nucleic acid binding interfaces. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2021; 2021:6352208. [PMID: 34389843 PMCID: PMC8363842 DOI: 10.1093/database/baab050] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Revised: 07/23/2021] [Accepted: 07/29/2021] [Indexed: 12/17/2022]
Abstract
Protein–nucleic acid complexes play essential roles in regulating transcription, translation, DNA replication, repair and recombination, RNA processing and translocation. Site-directed mutagenesis has been extremely useful in understanding the principles of protein–DNA and protein–RNA interactions, and experimentally determined mutagenesis data are prerequisites for designing effective algorithms for predicting the binding affinity change upon mutation. However, a vital challenge in this area is the lack of sufficient public experimentally recognized mutation data, which leads to difficulties in developing computational prediction methods. In this article, we present Nabe, an integrated database of amino acid mutations and their effects on the binding free energy in protein–DNA and protein–RNA interactions for which binding affinities have been experimentally determined. Compared with existing databases and data sets, Nabe is the largest protein–nucleic acid mutation database, containing 2506 mutations in 473 protein–DNA and protein–RNA complexes, and of that 1751 are alanine mutations in 405 protein–nucleic acid complexes. For researchers to conveniently utilize the data, Nabe assembles protein–DNA and protein–RNA benchmark databases by adopting the data-processing procedures in the majority of models. To further facilitate users to query data, Nabe provides a searchable and graphical web page. Database URL: http://nabe.denglab.org
Collapse
Affiliation(s)
- Junyi Liu
- School of Computer Science and Engineering, Central South University, 22 Shaoshan South Road, Changsha 410075, China.,Viterbi School of Engineering, University of Southern California, 3650 McClintock Ave. OHE 106, Los Angeles, CA 90089, USA
| | - Siyu Liu
- School of Computer Science and Engineering, Central South University, 22 Shaoshan South Road, Changsha 410075, China
| | - Chenzhe Liu
- School of Computer Science and Engineering, Central South University, 22 Shaoshan South Road, Changsha 410075, China
| | - Yaping Zhang
- School of Computer Science and Engineering, Central South University, 22 Shaoshan South Road, Changsha 410075, China
| | - Yuliang Pan
- School of Computer Science and Engineering, Central South University, 22 Shaoshan South Road, Changsha 410075, China
| | - Zixiang Wang
- School of Computer Science and Engineering, Central South University, 22 Shaoshan South Road, Changsha 410075, China
| | - Jiacheng Wang
- School of Computer Science and Engineering, Central South University, 22 Shaoshan South Road, Changsha 410075, China
| | - Ting Wen
- School of Computer Science and Engineering, Central South University, 22 Shaoshan South Road, Changsha 410075, China
| | - Lei Deng
- School of Computer Science and Engineering, Central South University, 22 Shaoshan South Road, Changsha 410075, China
| |
Collapse
|
18
|
Malhotra S, Joseph AP, Thiyagalingam J, Topf M. Assessment of protein-protein interfaces in cryo-EM derived assemblies. Nat Commun 2021; 12:3399. [PMID: 34099703 PMCID: PMC8184972 DOI: 10.1038/s41467-021-23692-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Accepted: 04/14/2021] [Indexed: 02/05/2023] Open
Abstract
Structures of macromolecular assemblies derived from cryo-EM maps often contain errors that become more abundant with decreasing resolution. Despite efforts in the cryo-EM community to develop metrics for map and atomistic model validation, thus far, no specific scoring metrics have been applied systematically to assess the interface between the assembly subunits. Here, we comprehensively assessed protein-protein interfaces in macromolecular assemblies derived by cryo-EM. To this end, we developed Protein Interface-score (PI-score), a density-independent machine learning-based metric, trained using the features of protein-protein interfaces in crystal structures. We evaluated 5873 interfaces in 1053 PDB-deposited cryo-EM models (including SARS-CoV-2 complexes), as well as the models submitted to CASP13 cryo-EM targets and the EM model challenge. We further inspected the interfaces associated with low-scores and found that some of those, especially in intermediate-to-low resolution (worse than 4 Å) structures, were not captured by density-based assessment scores. A combined score incorporating PI-score and fit-to-density score showed discriminatory power, allowing our method to provide a powerful complementary assessment tool for the ever-increasing number of complexes solved by cryo-EM.
Collapse
Affiliation(s)
- Sony Malhotra
- grid.4464.20000 0001 2161 2573Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, University of London, London, UK ,grid.14467.30Scientific Computing Department, Science and Technology Facilities Council, Didcot, UK
| | - Agnel Praveen Joseph
- grid.14467.30Scientific Computing Department, Science and Technology Facilities Council, Didcot, UK
| | - Jeyan Thiyagalingam
- grid.14467.30Scientific Computing Department, Science and Technology Facilities Council, Didcot, UK
| | - Maya Topf
- grid.4464.20000 0001 2161 2573Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, University of London, London, UK ,grid.13648.380000 0001 2180 3484Centre for Structural Systems Biology, Leibniz-Institut für Experimentelle Virologie and Universitätsklinikum Hamburg-Eppendorf (UKE), Hamburg, Germany
| |
Collapse
|
19
|
Schlick T, Portillo-Ledesma S, Myers CG, Beljak L, Chen J, Dakhel S, Darling D, Ghosh S, Hall J, Jan M, Liang E, Saju S, Vohr M, Wu C, Xu Y, Xue E. Biomolecular Modeling and Simulation: A Prospering Multidisciplinary Field. Annu Rev Biophys 2021; 50:267-301. [PMID: 33606945 PMCID: PMC8105287 DOI: 10.1146/annurev-biophys-091720-102019] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
We reassess progress in the field of biomolecular modeling and simulation, following up on our perspective published in 2011. By reviewing metrics for the field's productivity and providing examples of success, we underscore the productive phase of the field, whose short-term expectations were overestimated and long-term effects underestimated. Such successes include prediction of structures and mechanisms; generation of new insights into biomolecular activity; and thriving collaborations between modeling and experimentation, including experiments driven by modeling. We also discuss the impact of field exercises and web games on the field's progress. Overall, we note tremendous success by the biomolecular modeling community in utilization of computer power; improvement in force fields; and development and application of new algorithms, notably machine learning and artificial intelligence. The combined advances are enhancing the accuracy andscope of modeling and simulation, establishing an exemplary discipline where experiment and theory or simulations are full partners.
Collapse
Affiliation(s)
- Tamar Schlick
- Department of Chemistry, New York University, New York, New York 10003, USA;
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
- New York University-East China Normal University Center for Computational Chemistry, New York University Shanghai, Shanghai 200122, China
| | | | - Christopher G Myers
- Department of Chemistry, New York University, New York, New York 10003, USA;
| | - Lauren Beljak
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Justin Chen
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Sami Dakhel
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Daniel Darling
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Sayak Ghosh
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Joseph Hall
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Mikaeel Jan
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Emily Liang
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Sera Saju
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Mackenzie Vohr
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Chris Wu
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Yifan Xu
- College of Arts and Science, New York University, New York, New York 10003, USA
| | - Eva Xue
- College of Arts and Science, New York University, New York, New York 10003, USA
| |
Collapse
|
20
|
Structural Characterization of Receptor-Receptor Interactions in the Allosteric Modulation of G Protein-Coupled Receptor (GPCR) Dimers. Int J Mol Sci 2021; 22:ijms22063241. [PMID: 33810175 PMCID: PMC8005122 DOI: 10.3390/ijms22063241] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Revised: 03/17/2021] [Accepted: 03/20/2021] [Indexed: 01/07/2023] Open
Abstract
G protein-coupled receptor (GPCR) oligomerization, while contentious, continues to attract the attention of researchers. Numerous experimental investigations have validated the presence of GPCR dimers, and the relevance of dimerization in the effectuation of physiological functions intensifies the attractiveness of this concept as a potential therapeutic target. GPCRs, as a single entity, have been the main source of scrutiny for drug design objectives for multiple diseases such as cancer, inflammation, cardiac, and respiratory diseases. The existence of dimers broadens the research scope of GPCR functions, revealing new signaling pathways that can be targeted for disease pathogenesis that have not previously been reported when GPCRs were only viewed in their monomeric form. This review will highlight several aspects of GPCR dimerization, which include a summary of the structural elucidation of the allosteric modulation of class C GPCR activation offered through recent solutions to the three-dimensional, full-length structures of metabotropic glutamate receptor and γ-aminobutyric acid B receptor as well as the role of dimerization in the modification of GPCR function and allostery. With the growing influence of computational methods in the study of GPCRs, we will also be reviewing recent computational tools that have been utilized to map protein-protein interactions (PPI).
Collapse
|
21
|
Durairaj J, Akdel M, de Ridder D, van Dijk ADJ. Geometricus represents protein structures as shape-mers derived from moment invariants. Bioinformatics 2021; 36:i718-i725. [PMID: 33381814 DOI: 10.1093/bioinformatics/btaa839] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/15/2020] [Indexed: 01/28/2023] Open
Abstract
MOTIVATION As the number of experimentally solved protein structures rises, it becomes increasingly appealing to use structural information for predictive tasks involving proteins. Due to the large variation in protein sizes, folds and topologies, an attractive approach is to embed protein structures into fixed-length vectors, which can be used in machine learning algorithms aimed at predicting and understanding functional and physical properties. Many existing embedding approaches are alignment based, which is both time-consuming and ineffective for distantly related proteins. On the other hand, library- or model-based approaches depend on a small library of fragments or require the use of a trained model, both of which may not generalize well. RESULTS We present Geometricus, a novel and universally applicable approach to embedding proteins in a fixed-dimensional space. The approach is fast, accurate, and interpretable. Geometricus uses a set of 3D moment invariants to discretize fragments of protein structures into shape-mers, which are then counted to describe the full structure as a vector of counts. We demonstrate the applicability of this approach in various tasks, ranging from fast structure similarity search, unsupervised clustering and structure classification across proteins from different superfamilies as well as within the same family. AVAILABILITY AND IMPLEMENTATION Python code available at https://git.wur.nl/durai001/geometricus.
Collapse
Affiliation(s)
| | - Mehmet Akdel
- Bioinformatics Group, Department of Plant Sciences
| | | | - Aalt D J van Dijk
- Bioinformatics Group, Department of Plant Sciences.,Mathematical and Statistical Methods - Biometris, Department of Plant Sciences, Wageningen University and Research, Wageningen 6700AP, The Netherlands
| |
Collapse
|
22
|
Shahnazari M, Alemzadeh A, Zakipour Z, Razi H. Evolution and classification of Na/K ATPase α-subunit in Arthropoda and Nematoda. GENE REPORTS 2021. [DOI: 10.1016/j.genrep.2020.101015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
23
|
Sitani D, Giorgetti A, Alfonso-Prieto M, Carloni P. Robust principal component analysis-based prediction of protein-protein interaction hot spots. Proteins 2021; 89:639-647. [PMID: 33458895 DOI: 10.1002/prot.26047] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2020] [Revised: 12/28/2020] [Accepted: 12/31/2020] [Indexed: 12/21/2022]
Abstract
Proteins often exert their function by binding to other cellular partners. The hot spots are key residues for protein-protein binding. Their identification may shed light on the impact of disease associated mutations on protein complexes and help design protein-protein interaction inhibitors for therapy. Unfortunately, current machine learning methods to predict hot spots, suffer from limitations caused by gross errors in the data matrices. Here, we present a novel data pre-processing pipeline that overcomes this problem by recovering a low rank matrix with reduced noise using Robust Principal Component Analysis. Application to existing databases shows the predictive power of the method.
Collapse
Affiliation(s)
- Divya Sitani
- JARA-Institute: Molecular Neuroscience and Neuroimaging, Institute for Neuroscience and Medicine INM-11/JARA-BRAIN Institute JBI-2, Forschungszentrum Jülich GmbH, Jülich, Germany.,Department of Biology, RWTH Aachen University, Aachen, Germany
| | - Alejandro Giorgetti
- Institute for Advanced Simulations IAS-5 / Institute for Neuroscience and Medicine INM-9, Computational Biomedicine, Forschungszentrum Jülich GmbH, Jülich, Germany.,Department of Biotechnology, University of Verona, Verona, Italy
| | - Mercedes Alfonso-Prieto
- Institute for Advanced Simulations IAS-5 / Institute for Neuroscience and Medicine INM-9, Computational Biomedicine, Forschungszentrum Jülich GmbH, Jülich, Germany.,Cécile and Oskar Vogt Institute for Brain Research, University Hospital Düsseldorf, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany
| | - Paolo Carloni
- JARA-Institute: Molecular Neuroscience and Neuroimaging, Institute for Neuroscience and Medicine INM-11/JARA-BRAIN Institute JBI-2, Forschungszentrum Jülich GmbH, Jülich, Germany.,Institute for Advanced Simulations IAS-5 / Institute for Neuroscience and Medicine INM-9, Computational Biomedicine, Forschungszentrum Jülich GmbH, Jülich, Germany.,Department of Physics, RWTH Aachen University, Aachen, Germany.,JARA-HPC, IAS-5/INM-9 Computational Biomedicine, Forschungszentrum Jülich GmbH, Jülich, Germany
| |
Collapse
|
24
|
Das S, Chakrabarti S. Classification and prediction of protein-protein interaction interface using machine learning algorithm. Sci Rep 2021; 11:1761. [PMID: 33469042 PMCID: PMC7815773 DOI: 10.1038/s41598-020-80900-2] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Accepted: 12/15/2020] [Indexed: 01/29/2023] Open
Abstract
Structural insight of the protein-protein interaction (PPI) interface can provide knowledge about the kinetics, thermodynamics and molecular functions of the complex while elucidating its role in diseases and further enabling it as a potential therapeutic target. However, owing to experimental lag in solving protein-protein complex structures, three-dimensional (3D) knowledge of the PPI interfaces can be gained via computational approaches like molecular docking and post-docking analyses. Despite development of numerous docking tools and techniques, success in identification of native like interfaces based on docking score functions is limited. Hence, we employed an in-depth investigation of the structural features of the interface that might successfully delineate native complexes from non-native ones. We identify interface properties, which show statistically significant difference between native and non-native interfaces belonging to homo and hetero, protein-protein complexes. Utilizing these properties, a support vector machine (SVM) based classification scheme has been implemented to differentiate native and non-native like complexes generated using docking decoys. Benchmarking and comparative analyses suggest very good performance of our SVM classifiers. Further, protein interactions, which are proven via experimental findings but not resolved structurally, were subjected to this approach where 3D-models of the complexes were generated and most likely interfaces were predicted. A web server called Protein Complex Prediction by Interface Properties (PCPIP) is developed to predict whether interface of a given protein-protein dimer complex resembles known protein interfaces. The server is freely available at http://www.hpppi.iicb.res.in/pcpip/ .
Collapse
Affiliation(s)
- Subhrangshu Das
- grid.417635.20000 0001 2216 5074Structural Biology and Bioinformatics Division, CSIR-Indian Institute of Chemical Biology, Kolkata, WB India
| | - Saikat Chakrabarti
- grid.417635.20000 0001 2216 5074Structural Biology and Bioinformatics Division, CSIR-Indian Institute of Chemical Biology, Kolkata, WB India
| |
Collapse
|
25
|
Massoud TF, Paulmurugan R. Molecular Imaging of Protein–Protein Interactions and Protein Folding. Mol Imaging 2021. [DOI: 10.1016/b978-0-12-816386-3.00071-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
|
26
|
Bartocci A, Gillet N, Jiang T, Szczepaniak F, Dumont E. Molecular Dynamics Approach for Capturing Calixarene-Protein Interactions: The Case of Cytochrome C. J Phys Chem B 2020; 124:11371-11378. [PMID: 33270456 DOI: 10.1021/acs.jpcb.0c08482] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Functionalized supramolecular cages are of growing importance in biology and biochemistry. They have recently been proposed as efficient auxiliaries to obtain high-resolution cocrystallized proteins. Here, we propose a molecular dynamics investigation of the supramolecular association of sulfonated calix-[8]-arenes to cytochrome c starting from initially distant proteins and ligands. We characterize two main binding sites for the sulfonated calixarene on the cytochrome c surface which are in perfect agreement with the previous experiments with regard to the structure (comparison with the X-ray structure PDB 6GD8) and the binding free energies [comparison between the molecular mechanics Poisson-Boltzmann surface area analysis and the isothermal titration calorimetry measurements]. The per-residue decomposition of the interaction energies reveals the detailed picture of this electrostatically driven association and notably the role of arginine R13 as a bridging residue between the two main anchoring sites. In addition, the analysis of the residue behavior by means of a supervised machine learning protocol unveils the formation of a hydrogen bond network far from the binding sites, increasing the rigidity of the protein. This study paves the way toward an automated procedure to predict the supramolecular protein-cage association, with the possibility of a computational screening of new promising derivatives for controlled protein assembly and protein surface recognition processes.
Collapse
Affiliation(s)
- Alessio Bartocci
- Univ Lyon, ENS de Lyon, CNRS UMR 5182, Université Claude Bernard Lyon 1, Laboratoire de Chimie, F-69342 Lyon, France
| | - Natacha Gillet
- Univ Lyon, ENS de Lyon, CNRS UMR 5182, Université Claude Bernard Lyon 1, Laboratoire de Chimie, F-69342 Lyon, France
| | - Tao Jiang
- Univ Lyon, ENS de Lyon, CNRS UMR 5182, Université Claude Bernard Lyon 1, Laboratoire de Chimie, F-69342 Lyon, France
| | - Florence Szczepaniak
- Univ Lyon, ENS de Lyon, CNRS UMR 5182, Université Claude Bernard Lyon 1, Laboratoire de Chimie, F-69342 Lyon, France
| | - Elise Dumont
- Univ Lyon, ENS de Lyon, CNRS UMR 5182, Université Claude Bernard Lyon 1, Laboratoire de Chimie, F-69342 Lyon, France.,Institut Universitaire de France, 5 Rue Descartes, 75005 Paris, France
| |
Collapse
|
27
|
Smolikova G, Gorbach D, Lukasheva E, Mavropolo-Stolyarenko G, Bilova T, Soboleva A, Tsarev A, Romanovskaya E, Podolskaya E, Zhukov V, Tikhonovich I, Medvedev S, Hoehenwarter W, Frolov A. Bringing New Methods to the Seed Proteomics Platform: Challenges and Perspectives. Int J Mol Sci 2020; 21:E9162. [PMID: 33271881 PMCID: PMC7729594 DOI: 10.3390/ijms21239162] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Revised: 11/26/2020] [Accepted: 11/27/2020] [Indexed: 12/14/2022] Open
Abstract
For centuries, crop plants have represented the basis of the daily human diet. Among them, cereals and legumes, accumulating oils, proteins, and carbohydrates in their seeds, distinctly dominate modern agriculture, thus play an essential role in food industry and fuel production. Therefore, seeds of crop plants are intensively studied by food chemists, biologists, biochemists, and nutritional physiologists. Accordingly, seed development and germination as well as age- and stress-related alterations in seed vigor, longevity, nutritional value, and safety can be addressed by a broad panel of analytical, biochemical, and physiological methods. Currently, functional genomics is one of the most powerful tools, giving direct access to characteristic metabolic changes accompanying plant development, senescence, and response to biotic or abiotic stress. Among individual post-genomic methodological platforms, proteomics represents one of the most effective ones, giving access to cellular metabolism at the level of proteins. During the recent decades, multiple methodological advances were introduced in different branches of life science, although only some of them were established in seed proteomics so far. Therefore, here we discuss main methodological approaches already employed in seed proteomics, as well as those still waiting for implementation in this field of plant research, with a special emphasis on sample preparation, data acquisition, processing, and post-processing. Thereby, the overall goal of this review is to bring new methodologies emerging in different areas of proteomics research (clinical, food, ecological, microbial, and plant proteomics) to the broad society of seed biologists.
Collapse
Affiliation(s)
- Galina Smolikova
- Department of Plant Physiology and Biochemistry, St. Petersburg State University; 199034 St. Petersburg, Russia; (G.S.); (T.B.); (S.M.)
| | - Daria Gorbach
- Department of Biochemistry, St. Petersburg State University; 199178 St. Petersburg, Russia; (D.G.); (E.L.); (G.M.-S.); (A.S.); (A.T.); (E.R.)
| | - Elena Lukasheva
- Department of Biochemistry, St. Petersburg State University; 199178 St. Petersburg, Russia; (D.G.); (E.L.); (G.M.-S.); (A.S.); (A.T.); (E.R.)
| | - Gregory Mavropolo-Stolyarenko
- Department of Biochemistry, St. Petersburg State University; 199178 St. Petersburg, Russia; (D.G.); (E.L.); (G.M.-S.); (A.S.); (A.T.); (E.R.)
| | - Tatiana Bilova
- Department of Plant Physiology and Biochemistry, St. Petersburg State University; 199034 St. Petersburg, Russia; (G.S.); (T.B.); (S.M.)
- Department of Bioorganic Chemistry, Leibniz Institute of Plant Biochemistry; 06120 Halle (Saale), Germany
| | - Alena Soboleva
- Department of Biochemistry, St. Petersburg State University; 199178 St. Petersburg, Russia; (D.G.); (E.L.); (G.M.-S.); (A.S.); (A.T.); (E.R.)
- Department of Bioorganic Chemistry, Leibniz Institute of Plant Biochemistry; 06120 Halle (Saale), Germany
| | - Alexander Tsarev
- Department of Biochemistry, St. Petersburg State University; 199178 St. Petersburg, Russia; (D.G.); (E.L.); (G.M.-S.); (A.S.); (A.T.); (E.R.)
- Department of Bioorganic Chemistry, Leibniz Institute of Plant Biochemistry; 06120 Halle (Saale), Germany
| | - Ekaterina Romanovskaya
- Department of Biochemistry, St. Petersburg State University; 199178 St. Petersburg, Russia; (D.G.); (E.L.); (G.M.-S.); (A.S.); (A.T.); (E.R.)
| | - Ekaterina Podolskaya
- Institute of Analytical Instrumentation, Russian Academy of Science; 190103 St. Petersburg, Russia;
- Institute of Toxicology, Russian Federal Medical Agency; 192019 St. Petersburg, Russia
| | - Vladimir Zhukov
- All-Russia Research Institute for Agricultural Microbiology; 196608 St. Petersburg, Russia; (V.Z.); (I.T.)
| | - Igor Tikhonovich
- All-Russia Research Institute for Agricultural Microbiology; 196608 St. Petersburg, Russia; (V.Z.); (I.T.)
- Department of Genetics and Biotechnology, St. Petersburg State University; 199034 St. Petersburg, Russia
| | - Sergei Medvedev
- Department of Plant Physiology and Biochemistry, St. Petersburg State University; 199034 St. Petersburg, Russia; (G.S.); (T.B.); (S.M.)
| | - Wolfgang Hoehenwarter
- Proteome Analytics Research Group, Leibniz Institute of Plant Biochemistry, 06120 Halle (Saale), Germany;
| | - Andrej Frolov
- Department of Biochemistry, St. Petersburg State University; 199178 St. Petersburg, Russia; (D.G.); (E.L.); (G.M.-S.); (A.S.); (A.T.); (E.R.)
- Department of Bioorganic Chemistry, Leibniz Institute of Plant Biochemistry; 06120 Halle (Saale), Germany
| |
Collapse
|
28
|
Abdizadeh H, Jalalypour F, Atilgan AR, Atilgan C. A Coarse-Grained Methodology Identifies Intrinsic Mechanisms That Dissociate Interacting Protein Pairs. Front Mol Biosci 2020; 7:210. [PMID: 33195399 PMCID: PMC7477071 DOI: 10.3389/fmolb.2020.00210] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2020] [Accepted: 08/03/2020] [Indexed: 11/13/2022] Open
Abstract
We address the problem of triggering dissociation events between proteins that have formed a complex. We have collected a set of 25 non-redundant, functionally diverse protein complexes having high-resolution three-dimensional structures in both the unbound and bound forms. We unify elastic network models with perturbation response scanning (PRS) methodology as an efficient approach for predicting residues that have the propensity to trigger dissociation of an interacting protein pair, using the three-dimensional structures of the bound and unbound proteins as input. PRS reveals that while for a group of protein pairs, residues involved in the conformational shifts are confined to regions with large motions, there are others where they originate from parts of the protein unaffected structurally by binding. Strikingly, only a few of the complexes have interface residues responsible for dissociation. We find two main modes of response: In one mode, remote control of dissociation in which disruption of the electrostatic potential distribution along protein surfaces play the major role; in the alternative mode, mechanical control of dissociation by remote residues prevail. In the former, dissociation is triggered by changes in the local environment of the protein, e.g., pH or ionic strength, while in the latter, specific perturbations arriving at the controlling residues, e.g., via binding to a third interacting partner is required for decomplexation. We resolve the observations by relying on an electromechanical coupling model which reduces to the usual elastic network result in the limit of the lack of coupling. We validate the approach by illustrating the biological significance of top residues selected by PRS on select cases where we show that the residues whose perturbation leads to the observed conformational changes correspond to either functionally important or highly conserved residues in the complex.
Collapse
Affiliation(s)
- Haleh Abdizadeh
- Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, Netherlands
| | - Farzaneh Jalalypour
- Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul, Turkey
| | - Ali Rana Atilgan
- Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul, Turkey
| | - Canan Atilgan
- Faculty of Engineering and Natural Sciences, Sabanci University, Istanbul, Turkey
| |
Collapse
|
29
|
Dike PP, Bhowmick S, Eldesoky GE, Wabaidur SM, Patil PC, Islam MA. In silico identification of small molecule modulators for disruption of Hsp90-Cdc37 protein-protein interaction interface for cancer therapeutic application. J Biomol Struct Dyn 2020; 40:2082-2098. [PMID: 33095103 DOI: 10.1080/07391102.2020.1835714] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The protein-protein interactions (PPIs) in the biological systems are important to maintain a number of cellular processes. Several disorders including cancer may be developed due to dysfunction in the assembly of PPI networks. Hence, targeting intracellular PPIs can be considered as a crucial drug target for cancer therapy. Among the enormous and diverse group of cancer-enabling PPIs, the Hsp90-Cdc37 is prominent for cancer therapeutic development. The successful inhibition of Hsp90-Cdc37 PPI interface can be an important therapeutic option for cancer management. In the current study, a set of more than sixty thousand compounds belong to four databases were screened through a multi-steps molecular docking study in Glide against the Hsp90-Cdc37 interaction interface. The Glide-score and Prime-MM-GBSA based binding free energy of DCZ3112, standard Hsp90-Cdc37 inhibitor were found to be -6.96 and -40.46 kcal/mol, respectively. The above two parameters were used as cut-off score to reduce the chemical space from all successfully docked molecules. Furthermore, the in-silico pharmacokinetics parameters, common-feature pharmacophore analyses and the molecular binding interactions were used to wipe out the inactive molecules. Finally, four molecules were found to be important to modulate the Hsp90-Cdc37 interface. The potentiality of the final four molecules was checked through several drug-likeness characteristics. The molecular dynamics (MD) simulation study explained that all four molecules retained inside the interface of Hsp90-Cdc37. The binding free energy of each molecule obtained from the MD simulation trajectory was clearly explained the strong affection towards the Hsp90-Cdc37. Hence, the proposed molecule might be crucial for successful inhibition of the Hsp90-Cdc37 interface.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Prajakta Prakash Dike
- Department of Bioinformatics, Rajiv Gandhi Institute of IT and Biotechnology, Bharati Vidyapeeth Deemed University, Pune, India
| | - Shovonlal Bhowmick
- Department of Chemical Technology, University of Calcutta, Kolkata, India
| | - Gaber E Eldesoky
- Department of Chemistry, College of Science, King Saud University, Riyadh, Saudi Arabia
| | - Saikh M Wabaidur
- Department of Chemistry, College of Science, King Saud University, Riyadh, Saudi Arabia
| | - Preeti Chunarkar Patil
- Department of Bioinformatics, Rajiv Gandhi Institute of IT and Biotechnology, Bharati Vidyapeeth Deemed University, Pune, India
| | - Md Ataul Islam
- Division of Pharmacy and Optometry, School of Health Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK.,School of Health Sciences, University of Kwazulu-Natal, Durban, South Africa.,Department of Chemical Pathology, Faculty of Health Sciences, University of Pretoria and National Health Laboratory Service Tshwane Academic Division, Pretoria, South Africa
| |
Collapse
|
30
|
Wu R, Prabhu R, Ozkan A, Sitharam M. Rapid prediction of crucial hotspot interactions for icosahedral viral capsid self-assembly by energy landscape atlasing validated by mutagenesis. PLoS Comput Biol 2020; 16:e1008357. [PMID: 33079933 PMCID: PMC7598928 DOI: 10.1371/journal.pcbi.1008357] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Revised: 10/30/2020] [Accepted: 09/22/2020] [Indexed: 02/07/2023] Open
Abstract
Icosahedral viruses are under a micrometer in diameter, their infectious genome encapsulated by a shell assembled by a multiscale process, starting from an integer multiple of 60 viral capsid or coat protein (VP) monomers. We predict and validate inter-atomic hotspot interactions between VP monomers that are important for the assembly of 3 types of icosahedral viral capsids: Adeno Associated Virus serotype 2 (AAV2) and Minute Virus of Mice (MVM), both T = 1 single stranded DNA viruses, and Bromo Mosaic Virus (BMV), a T = 3 single stranded RNA virus. Experimental validation is by in-vitro, site-directed mutagenesis data found in literature. We combine ab-initio predictions at two scales: at the interface-scale, we predict the importance (cruciality) of an interaction for successful subassembly across each interface between symmetry-related VP monomers; and at the capsid-scale, we predict the cruciality of an interface for successful capsid assembly. At the interface-scale, we measure cruciality by changes in the capsid free-energy landscape partition function when an interaction is removed. The partition function computation uses atlases of interface subassembly landscapes, rapidly generated by a novel geometric method and curated opensource software EASAL (efficient atlasing and search of assembly landscapes). At the capsid-scale, cruciality of an interface for successful assembly of the capsid is based on combinatorial entropy. Our study goes all the way from resource-light, multiscale computational predictions of crucial hotspot inter-atomic interactions to validation using data on site-directed mutagenesis' effect on capsid assembly. By reliably and rapidly narrowing down target interactions, (no more than 1.5 hours per interface on a laptop with Intel Core i5-2500K @ 3.2 Ghz CPU and 8GB of RAM) our predictions can inform and reduce time-consuming in-vitro and in-vivo experiments, or more computationally intensive in-silico analyses.
Collapse
Affiliation(s)
- Ruijin Wu
- Department of Computer and Information Science and Engineering, University of Florida, Gainesville, Florida, United States of America
| | - Rahul Prabhu
- Department of Computer and Information Science and Engineering, University of Florida, Gainesville, Florida, United States of America
| | - Aysegul Ozkan
- Department of Computer and Information Science and Engineering, University of Florida, Gainesville, Florida, United States of America
| | - Meera Sitharam
- Department of Computer and Information Science and Engineering, University of Florida, Gainesville, Florida, United States of America
| |
Collapse
|
31
|
Oña Chuquimarca S, Ayala-Ruano S, Goossens J, Pauwels L, Goossens A, Leon-Reyes A, Ángel Méndez M. The Molecular Basis of JAZ-MYC Coupling, a Protein-Protein Interface Essential for Plant Response to Stressors. FRONTIERS IN PLANT SCIENCE 2020; 11:1139. [PMID: 32973821 PMCID: PMC7468482 DOI: 10.3389/fpls.2020.01139] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/12/2020] [Accepted: 07/14/2020] [Indexed: 05/29/2023]
Abstract
The jasmonic acid (JA) signaling pathway is one of the primary mechanisms that allow plants to respond to a variety of biotic and abiotic stressors. Within this pathway, the JAZ repressor proteins and the basic helix-loop-helix (bHLH) transcription factor MYC3 play a critical role. JA is a volatile organic compound with an essential role in plant immunity. The increase in the concentration of JA leads to the decoupling of the JAZ repressor proteins and the bHLH transcription factor MYC3 causing the induction of genes of interest. The primary goal of this study was to identify the molecular basis of JAZ-MYC coupling. For this purpose, we modeled and validated 12 JAZ-MYC3 3D in silico structures and developed a molecular dynamics/machine learning pipeline to obtain two outcomes. First, we calculated the average free binding energy of JAZ-MYC3 complexes, which was predicted to be -10.94 +/-2.67 kJ/mol. Second, we predicted which ones should be the interface residues that make the predominant contribution to the free energy of binding (molecular hotspots). The predicted protein hotspots matched a conserved linear motif SL••FL•••R, which may have a crucial role during MYC3 recognition of JAZ proteins. As a proof of concept, we tested, both in silico and in vitro, the importance of this motif on PEAPOD (PPD) proteins, which also belong to the TIFY protein family, like the JAZ proteins, but cannot bind to MYC3. By mutating these proteins to match the SL••FL•••R motif, we could force PPDs to bind the MYC3 transcription factor. Taken together, modeling protein-protein interactions and using machine learning will help to find essential motifs and molecular mechanisms in the JA pathway.
Collapse
Affiliation(s)
- Samara Oña Chuquimarca
- Grupo de Química Computacional y Teórica, Departamento de Ingeniería Química, Universidad San Francisco de Quito USFQ, Campus Cumbayá, Quito, Ecuador
- Instituto de Simulación Computacional (ISC-USFQ), Universidad San Francisco de Quito USFQ, Quito, Ecuador
| | - Sebastián Ayala-Ruano
- Grupo de Química Computacional y Teórica, Departamento de Ingeniería Química, Universidad San Francisco de Quito USFQ, Campus Cumbayá, Quito, Ecuador
- Instituto de Simulación Computacional (ISC-USFQ), Universidad San Francisco de Quito USFQ, Quito, Ecuador
| | - Jonas Goossens
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Laurens Pauwels
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Alain Goossens
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
| | - Antonio Leon-Reyes
- Laboratorio de Biotecnología Agrícola y de Alimentos, Ingeniería en Agronomía, Colegio de Ciencias e Ingenierías, Universidad San Francisco de Quito, Campus Cumbayá, Quito, Ecuador
- Colegio de Ciencias Biológicas y Ambientales COCIBA, Instituto de Microbiología, Universidad San Francisco de Quito USFQ, Campus Cumbayá, Quito, Ecuador
- Colegio de Ciencias Biológicas y Ambientales COCIBA, Instituto de Investigaciones Biológicas y Ambientales BIÓSFERA, Universidad San Francisco de Quito USFQ, Campus Cumbayá, Quito, Ecuador
- Department of Biology, The University of North Carolina at Chapel Hill, Chapel Hill, NC, United States
| | - Miguel Ángel Méndez
- Grupo de Química Computacional y Teórica, Departamento de Ingeniería Química, Universidad San Francisco de Quito USFQ, Campus Cumbayá, Quito, Ecuador
- Instituto de Simulación Computacional (ISC-USFQ), Universidad San Francisco de Quito USFQ, Quito, Ecuador
| |
Collapse
|
32
|
Cloutier TK, Sudrik C, Mody N, Sathish HA, Trout BL. Machine Learning Models of Antibody–Excipient Preferential Interactions for Use in Computational Formulation Design. Mol Pharm 2020; 17:3589-3599. [DOI: 10.1021/acs.molpharmaceut.0c00629] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Theresa K. Cloutier
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Chaitanya Sudrik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Neil Mody
- Dosage Form Design and Development, AstraZeneca, Gaithersburg, Maryland 20878, United States
| | - Hasige A. Sathish
- Dosage Form Design and Development, AstraZeneca, Gaithersburg, Maryland 20878, United States
| | - Bernhardt L. Trout
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
33
|
Li Q, Zhou W, Wang D, Wang S, Li Q. Prediction of Anticancer Peptides Using a Low-Dimensional Feature Model. Front Bioeng Biotechnol 2020; 8:892. [PMID: 32903381 PMCID: PMC7434836 DOI: 10.3389/fbioe.2020.00892] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2020] [Accepted: 07/10/2020] [Indexed: 01/09/2023] Open
Abstract
Cancer is still a severe health problem globally. The therapy of cancer traditionally involves the use of radiotherapy or anticancer drugs to kill cancer cells, but these methods are quite expensive and have side effects, which will cause great harm to patients. With the find of anticancer peptides (ACPs), significant progress has been achieved in the therapy of tumors. Therefore, it is invaluable to accurately identify anticancer peptides. Although biochemical experiments can solve this work, this method is expensive and time-consuming. To promote the application of anticancer peptides in cancer therapy, machine learning can be used to recognize anticancer peptides by extracting the feature vectors of anticancer peptides. Nevertheless, poor performance usually be found in training the machine learning model to utilizing high-dimensional features in practice. In order to solve the above job, this paper put forward a 19-dimensional feature model based on anticancer peptide sequences, which has lower dimensionality and better performance than some existing methods. In addition, this paper also separated a model with a low number of dimensions and acceptable performance. The few features identified in this study may represent the important features of anticancer peptides.
Collapse
Affiliation(s)
- Qingwen Li
- College of Animal Science and Technology, Northeast Agricultural University, Harbin, China
| | - Wenyang Zhou
- Center for Bioinformatics, School of Life Sciences and Technology, Harbin Institute of Technology, Harbin, China
| | - Donghua Wang
- Department of General Surgery, Heilongjiang Province Land Reclamation Headquarters General Hospital, Harbin, China
| | - Sui Wang
- Key Laboratory of Soybean Biology in Chinese Ministry of Education, Northeast Agricultural University, Harbin, China
- State Key Laboratory of Tree Genetics and Breeding, Northeast Forestry University, Harbin, China
| | - Qingyuan Li
- Forestry and Fruit Tree Research Institute, Wuhan Academy of Agricultural Sciences, Wuhan, China
| |
Collapse
|
34
|
Randhawa V, Pathania S. Advancing from protein interactomes and gene co-expression networks towards multi-omics-based composite networks: approaches for predicting and extracting biological knowledge. Brief Funct Genomics 2020; 19:364-376. [PMID: 32678894 DOI: 10.1093/bfgp/elaa015] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2020] [Revised: 05/31/2020] [Accepted: 06/15/2020] [Indexed: 01/17/2023] Open
Abstract
Prediction of biological interaction networks from single-omics data has been extensively implemented to understand various aspects of biological systems. However, more recently, there is a growing interest in integrating multi-omics datasets for the prediction of interactomes that provide a global view of biological systems with higher descriptive capability, as compared to single omics. In this review, we have discussed various computational approaches implemented to infer and analyze two of the most important and well studied interactomes: protein-protein interaction networks and gene co-expression networks. We have explicitly focused on recent methods and pipelines implemented to infer and extract biologically important information from these interactomes, starting from utilizing single-omics data and then progressing towards multi-omics data. Accordingly, recent examples and case studies are also briefly discussed. Overall, this review will provide a proper understanding of the latest developments in protein and gene network modelling and will also help in extracting practical knowledge from them.
Collapse
Affiliation(s)
- Vinay Randhawa
- Department of Biochemistry, Panjab University, Chandigarh, 160014, India
| | - Shivalika Pathania
- Department of Biotechnology, Panjab University, Chandigarh, 160014, India
| |
Collapse
|
35
|
Poot Velez AH, Fontove F, Del Rio G. Protein-Protein Interactions Efficiently Modeled by Residue Cluster Classes. Int J Mol Sci 2020; 21:E4787. [PMID: 32640745 PMCID: PMC7370293 DOI: 10.3390/ijms21134787] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2020] [Revised: 06/20/2020] [Accepted: 06/28/2020] [Indexed: 01/22/2023] Open
Abstract
Predicting protein-protein interactions (PPI) represents an important challenge in structural bioinformatics. Current computational methods display different degrees of accuracy when predicting these interactions. Different factors were proposed to help improve these predictions, including choosing the proper descriptors of proteins to represent these interactions, among others. In the current work, we provide a representative protein structure that is amenable to PPI classification using machine learning approaches, referred to as residue cluster classes. Through sampling and optimization, we identified the best algorithm-parameter pair to classify PPI from more than 360 different training sets. We tested these classifiers against PPI datasets that were not included in the training set but shared sequence similarity with proteins in the training set to reproduce the situation of most proteins sharing sequence similarity with others. We identified a model with almost no PPI error (96-99% of correctly classified instances) and showed that residue cluster classes of protein pairs displayed a distinct pattern between positive and negative protein interactions. Our results indicated that residue cluster classes are structural features relevant to model PPI and provide a novel tool to mathematically model the protein structure/function relationship.
Collapse
Affiliation(s)
- Albros Hermes Poot Velez
- Department of biochemistry and structural biology, Instituto de fisiologia celular, UNAM Mexico City 04510, Mexico;
| | | | - Gabriel Del Rio
- Department of biochemistry and structural biology, Instituto de fisiologia celular, UNAM Mexico City 04510, Mexico;
| |
Collapse
|
36
|
Upfold N, Ross C, Tastan Bishop Ö, Knox C. The In Silico Prediction of Hotspot Residues that Contribute to the Structural Stability of Subunit Interfaces of a Picornavirus Capsid. Viruses 2020; 12:v12040387. [PMID: 32244486 PMCID: PMC7232237 DOI: 10.3390/v12040387] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2020] [Revised: 03/26/2020] [Accepted: 03/28/2020] [Indexed: 11/16/2022] Open
Abstract
The assembly of picornavirus capsids proceeds through the stepwise oligomerization of capsid protein subunits and depends on interactions between critical residues known as hotspots. Few studies have described the identification of hotspot residues at the protein subunit interfaces of the picornavirus capsid, some of which could represent novel drug targets. Using a combination of accessible web servers for hotspot prediction, we performed a comprehensive bioinformatic analysis of the hotspot residues at the intraprotomer, interprotomer and interpentamer interfaces of the Theiler’s murine encephalomyelitis virus (TMEV) capsid. Significantly, many of the predicted hotspot residues were found to be conserved in representative viruses from different genera, suggesting that the molecular determinants of capsid assembly are conserved across the family. The analysis presented here can be applied to any icosahedral structure and provides a platform for in vitro mutagenesis studies to further investigate the significance of these hotspots in critical stages of the virus life cycle with a view to identify potential targets for antiviral drug design.
Collapse
Affiliation(s)
- Nicole Upfold
- Department of Biochemistry and Microbiology, Rhodes University, Grahamstown 6140, South Africa;
- Correspondence:
| | - Caroline Ross
- Research Unit in Bioinformatics (RUBi), Department of Biochemistry and Microbiology, Rhodes University, Grahamstown 6140, South Africa; (C.R.); (Ö.T.B.)
| | - Özlem Tastan Bishop
- Research Unit in Bioinformatics (RUBi), Department of Biochemistry and Microbiology, Rhodes University, Grahamstown 6140, South Africa; (C.R.); (Ö.T.B.)
| | - Caroline Knox
- Department of Biochemistry and Microbiology, Rhodes University, Grahamstown 6140, South Africa;
| |
Collapse
|
37
|
Li J, Wei L, Guo F, Zou Q. EP3: an ensemble predictor that accurately identifies type III secreted effectors. Brief Bioinform 2020; 22:1918-1928. [PMID: 32043137 DOI: 10.1093/bib/bbaa008] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Revised: 12/25/2019] [Accepted: 01/10/2020] [Indexed: 01/09/2023] Open
Abstract
Type III secretion systems (T3SS) can be found in many pathogenic bacteria, such as Dysentery bacillus, Salmonella typhimurium, Vibrio cholera and pathogenic Escherichia coli. The routes of infection of these bacteria include the T3SS transferring a large number of type III secreted effectors (T3SE) into host cells, thereby blocking or adjusting the communication channels of the host cells. Therefore, the accurate identification of T3SEs is the precondition for the further study of pathogenic bacteria. In this article, a new T3SEs ensemble predictor was developed, which can accurately distinguish T3SEs from any unknown protein. In the course of the experiment, methods and models are strictly trained and tested. Compared with other methods, EP3 demonstrates better performance, including the absence of overfitting, strong robustness and powerful predictive ability. EP3 (an ensemble predictor that accurately identifies T3SEs) is designed to simplify the user's (especially nonprofessional users) access to T3SEs for further investigation, which will have a significant impact on understanding the progression of pathogenic bacterial infections. Based on the integrated model that we proposed, a web server had been established to distinguish T3SEs from non-T3SEs, where have EP3_1 and EP3_2. The users can choose the model according to the species of the samples to be tested. Our related tools and data can be accessed through the link http://lab.malab.cn/∼lijing/EP3.html.
Collapse
|
38
|
Srivastava PA, Hegg EL, Fox BG, Yennamalli RM. PreDSLpmo: A neural network-based prediction tool for functional annotation of lytic polysaccharide monooxygenases. J Biotechnol 2020; 308:148-155. [DOI: 10.1016/j.jbiotec.2019.12.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2019] [Revised: 11/25/2019] [Accepted: 12/08/2019] [Indexed: 11/30/2022]
|
39
|
Junaid M, Li CD, Shah M, Khan A, Guo H, Wei DQ. Extraction of molecular features for the drug discovery targeting protein-protein interaction of Helicobacter pylori CagA and tumor suppressor protein ASSP2. Proteins 2019; 87:837-849. [PMID: 31134671 DOI: 10.1002/prot.25748] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2019] [Revised: 05/04/2019] [Accepted: 05/22/2019] [Indexed: 12/13/2022]
Abstract
Half of the world population is infected by the Gram-negative bacterium Helicobacter pylori (H. pylori). It colonizes in the stomach and is associated with severe gastric pathologies including gastric cancer and peptic ulceration. The most virulent factor of H. pylori is the cytotoxin-associated gene A (CagA) that is injected into the host cell. CagA interacts with several host proteins and alters their function, thereby causing several diseases. The most well-known target of CagA is the tumor suppressor protein ASPP2. The subdomain I at the N-terminus of CagA interacts with the proline-rich motif of ASPP2. Here, in this study, we carried out alanine scanning mutagenesis and an extensive molecular dynamics simulation summing up to 3.8 μs to find out hot spot residues and discovered some new protein-protein interaction (PPI)-modulating molecules. Our findings are in line with previous biochemical studies and further suggested new residues that are crucial for binding. The alanine scanning showed that mutation of Y207 and T211 residues to alanine decreased the binding affinity. Likewise, dynamics simulation and molecular mechanics with generalized Born surface area (MMGBSA) analysis also showed the importance of these two residues at the interface. A four-feature pharmacophore model was developed based on these two residues, and top 10 molecules were filtered from ZINC, NCI, and ChEMBL databases. The good binding affinity of the CHEMBL17319 and CHEMBL1183979 molecules shows the reliability of our adopted protocol for binding hot spot residues. We believe that our study provides a new insight for using CagA as the therapeutic target for gastric cancer treatment and provides a platform for a future experimental study.
Collapse
Affiliation(s)
- Muhammad Junaid
- State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Cheng-Dong Li
- State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Masaud Shah
- Department of Molecular Science and Technology, Ajou University, Suwon, South Korea
| | - Abbas Khan
- State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Haoyue Guo
- Department of Physiology, McGill University, Montreal, Quebec, Canada
| | - Dong-Qing Wei
- State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| |
Collapse
|
40
|
Interface Characterization Between Polyethylene/ Silica in Engineered Cementitious Composites by Molecular Dynamics Simulation. Molecules 2019; 24:molecules24081497. [PMID: 30995822 PMCID: PMC6515107 DOI: 10.3390/molecules24081497] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2019] [Revised: 04/12/2019] [Accepted: 04/16/2019] [Indexed: 12/05/2022] Open
Abstract
Polyethylene is widely adopted in engineered cementitious composites to control the crack width. A clearer knowledge of the PE/concrete interfacial properties is important in developing engineered cementitious composites, which can lead to a limited crack width. Tensile failure and adhesion properties of the amorphous polyethylene/silica (PE/S) interface are investigated by molecular dynamics to interpret the PE/concrete interface. The influence of the PE chain length, the PE chain number and coupling agents applied on silica surface on the interfacial adhesion is studied. An increase of the adhesion strength of the modified silica surface by coupling agents compared with the unmodified silica is found. The failure process, density profile and potential energy evolutions of the PE/S interface are studied. The thermodynamic work of adhesion that quantifies the interfacial adhesion of the PE/S interface is evaluated. The present study helps to understand the interfacial adhesion behavior between ECC and PE, and is expected to contribute to restricting the crack width.
Collapse
|
41
|
Deng L, Sui Y, Zhang J. XGBPRH: Prediction of Binding Hot Spots at Protein⁻RNA Interfaces Utilizing Extreme Gradient Boosting. Genes (Basel) 2019; 10:genes10030242. [PMID: 30901953 PMCID: PMC6471955 DOI: 10.3390/genes10030242] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2019] [Revised: 03/14/2019] [Accepted: 03/15/2019] [Indexed: 01/24/2023] Open
Abstract
Hot spot residues at protein⁻RNA complexes are vitally important for investigating the underlying molecular recognition mechanism. Accurately identifying protein⁻RNA binding hot spots is critical for drug designing and protein engineering. Although some progress has been made by utilizing various available features and a series of machine learning approaches, these methods are still in the infant stage. In this paper, we present a new computational method named XGBPRH, which is based on an eXtreme Gradient Boosting (XGBoost) algorithm and can effectively predict hot spot residues in protein⁻RNA interfaces utilizing an optimal set of properties. Firstly, we download 47 protein⁻RNA complexes and calculate a total of 156 sequence, structure, exposure, and network features. Next, we adopt a two-step feature selection algorithm to extract a combination of 6 optimal features from the combination of these 156 features. Compared with the state-of-the-art approaches, XGBPRH achieves better performances with an area under the ROC curve (AUC) score of 0.817 and an F1-score of 0.802 on the independent test set. Meanwhile, we also apply XGBPRH to two case studies. The results demonstrate that the method can effectively identify novel energy hotspots.
Collapse
Affiliation(s)
- Lei Deng
- School of Computer Science and Engineering, Central South University, Changsha 410075, China.
| | - Yuanchao Sui
- School of Computer Science and Engineering, Central South University, Changsha 410075, China.
| | - Jingpu Zhang
- School of Computer and Data Science, Henan University of Urban Construction, Pingdingshan 467000, China.
| |
Collapse
|