1
|
Carpenter KA, Altman RB. Databases of ligand-binding pockets and protein-ligand interactions. Comput Struct Biotechnol J 2024; 23:1320-1338. [PMID: 38585646 PMCID: PMC10997877 DOI: 10.1016/j.csbj.2024.03.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Revised: 03/16/2024] [Accepted: 03/17/2024] [Indexed: 04/09/2024] Open
Abstract
Many research groups and institutions have created a variety of databases curating experimental and predicted data related to protein-ligand binding. The landscape of available databases is dynamic, with new databases emerging and established databases becoming defunct. Here, we review the current state of databases that contain binding pockets and protein-ligand binding interactions. We have compiled a list of such databases, fifty-three of which are currently available for use. We discuss variation in how binding pockets are defined and summarize pocket-finding methods. We organize the fifty-three databases into subgroups based on goals and contents, and describe standard use cases. We also illustrate that pockets within the same protein are characterized differently across different databases. Finally, we assess critical issues of sustainability, accessibility and redundancy.
Collapse
Affiliation(s)
- Kristy A. Carpenter
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA
| | - Russ B. Altman
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
- Department of Medicine, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
2
|
Machine Learning Models to Predict Protein-Protein Interaction Inhibitors. MOLECULES (BASEL, SWITZERLAND) 2022; 27:molecules27227986. [PMID: 36432086 PMCID: PMC9694076 DOI: 10.3390/molecules27227986] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 11/09/2022] [Accepted: 11/16/2022] [Indexed: 11/19/2022]
Abstract
Protein-protein interaction (PPI) inhibitors have an increasing role in drug discovery. It is hypothesized that machine learning (ML) algorithms can classify or identify PPI inhibitors. This work describes the performance of different algorithms and molecular fingerprints used in chemoinformatics to develop a classification model to identify PPI inhibitors making the codes freely available to the community, particularly the medicinal chemistry research groups working with PPI inhibitors. We found that classification algorithms have different performances according to various features employed in the training process. Random forest (RF) models with the extended connectivity fingerprint radius 2 (ECFP4) had the best classification abilities compared to those models trained with ECFP6 o MACCS keys (166-bits). In general, logistic regression (LR) models had lower performance metrics than RF models, but ECFP4 was the representation most appropriate for LR. ECFP4 also generated models with high-performance metrics with support vector machines (SVM). We also constructed ensemble models based on the top-performing models. As part of this work and to help non-computational experts, we developed a pipeline code freely available.
Collapse
|
3
|
Martino E, Chiarugi S, Margheriti F, Garau G. Mapping, Structure and Modulation of PPI. Front Chem 2021; 9:718405. [PMID: 34692637 PMCID: PMC8529325 DOI: 10.3389/fchem.2021.718405] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 09/21/2021] [Indexed: 11/13/2022] Open
Abstract
Because of the key relevance of protein–protein interactions (PPI) in diseases, the modulation of protein-protein complexes is of relevant clinical significance. The successful design of binding compounds modulating PPI requires a detailed knowledge of the involved protein-protein system at molecular level, and investigation of the structural motifs that drive the association of the proteins at the recognition interface. These elements represent hot spots of the protein binding free energy, define the complex lifetime and possible modulation strategies. Here, we review the advanced technologies used to map the PPI involved in human diseases, to investigate the structure-function features of protein complexes, and to discover effective ligands that modulate the PPI for therapeutic intervention.
Collapse
Affiliation(s)
- Elisa Martino
- Laboratorio NEST, Scuola Normale Superiore, Pisa, Italy
| | - Sara Chiarugi
- Laboratorio NEST, Scuola Normale Superiore, Pisa, Italy.,BioStructures Lab, Istituto Italiano di Tecnologia (IIT@NEST), Pisa, Italy
| | | | - Gianpiero Garau
- BioStructures Lab, Istituto Italiano di Tecnologia (IIT@NEST), Pisa, Italy
| |
Collapse
|
4
|
Fluorescence resonance energy transfer in revealing protein-protein interactions in living cells. Emerg Top Life Sci 2021; 5:49-59. [PMID: 33856021 DOI: 10.1042/etls20200337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2020] [Revised: 02/22/2021] [Accepted: 03/04/2021] [Indexed: 11/17/2022]
Abstract
Genes are expressed to proteins for a wide variety of fundamental biological processes at the cellular and organismal levels. However, a protein rarely functions alone, but rather acts through interactions with other proteins to maintain normal cellular and organismal functions. Therefore, it is important to analyze the protein-protein interactions to determine functional mechanisms of proteins, which can also guide to develop therapeutic targets for treatment of diseases caused by altered protein-protein interactions leading to cellular/organismal dysfunctions. There is a large number of methodologies to study protein interactions in vitro, in vivo and in silico, which led to the development of many protein interaction databases, and thus, have enriched our knowledge about protein-protein interactions and functions. However, many of these interactions were identified in vitro, but need to be verified/validated in living cells. Furthermore, it is unclear whether these interactions are direct or mediated via other proteins. Moreover, these interactions are representative of cell- and time-average, but not a single cell in real time. Therefore, it is crucial to detect direct protein-protein interactions in a single cell during biological processes in vivo, towards understanding the functional mechanisms of proteins in living cells. Importantly, a fluorescence resonance energy transfer (FRET)-based methodology has emerged as a powerful technique to decipher direct protein-protein interactions at a single cell resolution in living cells, which is briefly described in a limited available space in this mini-review.
Collapse
|
5
|
Waqas M, Haider A, Rehman A, Qasim M, Umar A, Sufyan M, Akram HN, Mir A, Razzaq R, Rasool D, Tahir RA, Sehgal SA. Immunoinformatics and Molecular Docking Studies Predicted Potential Multiepitope-Based Peptide Vaccine and Novel Compounds against Novel SARS-CoV-2 through Virtual Screening. BIOMED RESEARCH INTERNATIONAL 2021; 2021:1596834. [PMID: 33728324 PMCID: PMC7910514 DOI: 10.1155/2021/1596834] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Revised: 08/13/2020] [Accepted: 02/08/2021] [Indexed: 02/06/2023]
Abstract
BACKGROUND Coronaviruses (CoVs) are enveloped positive-strand RNA viruses which have club-like spikes at the surface with a unique replication process. Coronaviruses are categorized as major pathogenic viruses causing a variety of diseases in birds and mammals including humans (lethal respiratory dysfunctions). Nowadays, a new strain of coronaviruses is identified and named as SARS-CoV-2. Multiple cases of SARS-CoV-2 attacks are being reported all over the world. SARS-CoV-2 showed high death rate; however, no specific treatment is available against SARS-CoV-2. METHODS In the current study, immunoinformatics approaches were employed to predict the antigenic epitopes against SARS-CoV-2 for the development of the coronavirus vaccine. Cytotoxic T-lymphocyte and B-cell epitopes were predicted for SARS-CoV-2 coronavirus protein. Multiple sequence alignment of three genomes (SARS-CoV, MERS-CoV, and SARS-CoV-2) was used to conserved binding domain analysis. RESULTS The docking complexes of 4 CTL epitopes with antigenic sites were analyzed followed by binding affinity and binding interaction analyses of top-ranked predicted peptides with MHC-I HLA molecule. The molecular docking (Food and Drug Regulatory Authority library) was performed, and four compounds exhibiting least binding energy were identified. The designed epitopes lead to the molecular docking against MHC-I, and interactional analyses of the selected docked complexes were investigated. In conclusion, four CTL epitopes (GTDLEGNFY, TVNVLAWLY, GSVGFNIDY, and QTFSVLACY) and four FDA-scrutinized compounds exhibited potential targets as peptide vaccines and potential biomolecules against deadly SARS-CoV-2, respectively. A multiepitope vaccine was also designed from different epitopes of coronavirus proteins joined by linkers and led by an adjuvant. CONCLUSION Our investigations predicted epitopes and the reported molecules that may have the potential to inhibit the SARS-CoV-2 virus. These findings can be a step towards the development of a peptide-based vaccine or natural compound drug target against SARS-CoV-2.
Collapse
Affiliation(s)
- Muhammad Waqas
- Department of Bioinformatics and Biotechnology, Government College University, Faisalabad, Pakistan
| | - Ali Haider
- Department of Bioinformatics and Biotechnology, Government College University, Faisalabad, Pakistan
| | - Abdur Rehman
- Department of Bioinformatics and Biotechnology, Government College University, Faisalabad, Pakistan
| | - Muhammad Qasim
- Department of Bioinformatics and Biotechnology, Government College University, Faisalabad, Pakistan
| | - Ahitsham Umar
- Department of Bioinformatics and Biotechnology, Government College University, Faisalabad, Pakistan
| | - Muhammad Sufyan
- Department of Bioinformatics and Biotechnology, Government College University, Faisalabad, Pakistan
| | - Hafiza Nisha Akram
- Department of Environmental Sciences, Quaid-e-Azam University, Islamabad, Pakistan
| | - Asif Mir
- Department of Biological Sciences, International Islamic University, Islamabad, Pakistan
| | - Roha Razzaq
- Department of Bioinformatics and Biotechnology, Government College University, Faisalabad, Pakistan
| | - Danish Rasool
- Department of Bioinformatics and Biotechnology, Government College University, Faisalabad, Pakistan
| | - Rana Adnan Tahir
- Department of Biosciences, COMSATS University, Sahiwal Campus, Islamabad, Pakistan
| | - Sheikh Arslan Sehgal
- Department of Bioinformatics and Biotechnology, Government College University, Faisalabad, Pakistan
- Department of Bioinformatics, University of Okara, Okara, Pakistan
| |
Collapse
|
6
|
Abstract
The current global pandemic COVID-19 caused by the SARS-CoV-2 virus has already inflicted insurmountable damage both to the human lives and global economy. There is an immediate need for identification of effective drugs to contain the disastrous virus outbreak. Global efforts are already underway at a war footing to identify the best drug combination to address the disease. In this review, an attempt has been made to understand the SARS-CoV-2 life cycle, and based on this information potential druggable targets against SARS-CoV-2 are summarized. Also, the strategies for ongoing and future drug discovery against the SARS-CoV-2 virus are outlined. Given the urgency to find a definitive cure, ongoing drug repurposing efforts being carried out by various organizations are also described. The unprecedented crisis requires extraordinary efforts from the scientific community to effectively address the issue and prevent further loss of human lives and health.
Collapse
Affiliation(s)
- Ambrish Saxena
- Indian Institute of Technology Tirupati, Tirupati, India
| |
Collapse
|
7
|
Lorenzo-Diaz F, Moreno-Córdoba I, Espinosa M. Complete labelling of pneumococcal DNA-binding proteins with seleno-L-methionine. J Microbiol Methods 2019; 166:105720. [PMID: 31518592 DOI: 10.1016/j.mimet.2019.105720] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2019] [Revised: 09/06/2019] [Accepted: 09/09/2019] [Indexed: 10/26/2022]
Abstract
Streptococcus pneumoniae is a pathogenic and opportunistic Gram-positive bacterium that is the leading cause of community-acquired respiratory diseases, varying from mild- to deathly- infections. The appearance of antibiotic-resistant isolates has prompted the search for novel strategies and targets to tackle the bacterial resistances. One of the most promising approaches is the structure-based knowledge of possible targets in conjunction with rational design and docking of inhibitors of the chosen targets. A useful technique that helps to solve protein structures is to label them with an amino acid derivative like seleno-methionine that facilitates tracing of some of the amino acid residues. We have chosen two pneumococcal DNA-binding proteins, namely the relaxase domain of MobM protein from plasmid pMV158, and the RelB-RelE antitoxin-toxin protein complex. Through several changes that improve substantially a previous protocol (Budisa et al., 1995), we have used seleno-L-methionine to incorporate selenium into the amino acid sequence of the selected proteins. We have achieved 100% labelling of the proteins and could demonstrate that the labelled proteins retained full activity as judged from the relaxation of supercoiled plasmid DNA and from gel-retardation assays.
Collapse
Affiliation(s)
- Fabián Lorenzo-Diaz
- Departamento de Bioquímica, Microbiología, Biología Celular y Genética, Universidad de La Laguna, Santa Cruz de Tenerife, Spain
| | - Inmaculada Moreno-Córdoba
- Centro de Investigaciones Biológicas, Consejo Superior de Investigaciones Científicas, Ramiro de Maeztu, 9, 28040 Madrid, Spain
| | - Manuel Espinosa
- Centro de Investigaciones Biológicas, Consejo Superior de Investigaciones Científicas, Ramiro de Maeztu, 9, 28040 Madrid, Spain.
| |
Collapse
|
8
|
Gupta SP. Design and Development of Drugs Targeting Protein-Protein Interactions – Part-II. Curr Top Med Chem 2019; 19:500. [DOI: 10.2174/156802661907190531094029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- Satya P. Gupta
- Department of Pharmaceutical Technology Meerut Institute of Engineering and Technology Meerut, India
| |
Collapse
|