1
|
Bitencourt-Ferreira G, Villarreal MA, Quiroga R, Biziukova N, Poroikov V, Tarasova O, de Azevedo Junior WF. Exploring Scoring Function Space: Developing Computational Models for Drug Discovery. Curr Med Chem 2024; 31:2361-2377. [PMID: 36944627 DOI: 10.2174/0929867330666230321103731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 12/15/2022] [Accepted: 12/29/2022] [Indexed: 03/23/2023]
Abstract
BACKGROUND The idea of scoring function space established a systems-level approach to address the development of models to predict the affinity of drug molecules by those interested in drug discovery. OBJECTIVE Our goal here is to review the concept of scoring function space and how to explore it to develop machine learning models to address protein-ligand binding affinity. METHODS We searched the articles available in PubMed related to the scoring function space. We also utilized crystallographic structures found in the protein data bank (PDB) to represent the protein space. RESULTS The application of systems-level approaches to address receptor-drug interactions allows us to have a holistic view of the process of drug discovery. The scoring function space adds flexibility to the process since it makes it possible to see drug discovery as a relationship involving mathematical spaces. CONCLUSION The application of the concept of scoring function space has provided us with an integrated view of drug discovery methods. This concept is useful during drug discovery, where we see the process as a computational search of the scoring function space to find an adequate model to predict receptor-drug binding affinity.
Collapse
Affiliation(s)
| | - Marcos A Villarreal
- CONICET-Departamento de Matemática y Física, Instituto de Investigaciones en Fisicoquímica de Córdoba (INFIQC), Facultad de Ciencias Químicas, Universidad Nacional de Córdoba, Ciudad Universitaria, Córdoba, Argentina
| | - Rodrigo Quiroga
- CONICET-Departamento de Matemática y Física, Instituto de Investigaciones en Fisicoquímica de Córdoba (INFIQC), Facultad de Ciencias Químicas, Universidad Nacional de Córdoba, Ciudad Universitaria, Córdoba, Argentina
| | - Nadezhda Biziukova
- Institute of Biomedical Chemistry, Pogodinskaya Str., 10/8, Moscow, 119121, Russia
| | - Vladimir Poroikov
- Institute of Biomedical Chemistry, Pogodinskaya Str., 10/8, Moscow, 119121, Russia
| | - Olga Tarasova
- Institute of Biomedical Chemistry, Pogodinskaya Str., 10/8, Moscow, 119121, Russia
| | - Walter F de Azevedo Junior
- Pontifical Catholic University of Rio Grande do Sul - PUCRS, Porto Alegre-RS, Brazil
- Specialization Program in Bioinformatics, The Pontifical Catholic University of Rio Grande do Sul (PUCRS), Av. Ipiranga, 6681 Porto Alegre / RS 90619-900, Brazil
| |
Collapse
|
2
|
Wang Y, Jiao Q, Wang J, Cai X, Zhao W, Cui X. Prediction of protein-ligand binding affinity with deep learning. Comput Struct Biotechnol J 2023; 21:5796-5806. [PMID: 38213884 PMCID: PMC10782002 DOI: 10.1016/j.csbj.2023.11.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 11/03/2023] [Accepted: 11/03/2023] [Indexed: 01/13/2024] Open
Abstract
The prediction of binding affinities between target proteins and small molecule drugs is essential for speeding up the drug research and design process. To attain precise and effective affinity prediction, computer-aided methods are employed in the drug discovery pipeline. In the last decade, a variety of computational methods has been developed, with deep learning being the most commonly used approach. We have gathered several deep learning methods and classified them into convolutional neural networks (CNNs), graph neural networks (GNNs), and Transformers for analysis and discussion. Initially, we conducted an analysis of the different deep learning methods, focusing on their feature construction and model architecture. We discussed the advantages and disadvantages of each model. Subsequently, we conducted experiments using four deep learning methods on the PDBbind v.2016 core set. We evaluated their prediction capabilities in various affinity intervals and statistically and visually analyzed the samples of correct and incorrect predictions for each model. Through visual analysis, we attempted to combine the strengths of the four models to improve the Root Mean Square Error (RMSE) of predicted affinities by 1.6% (reducing the absolute value to 1.101) and the Pearson Correlation Coefficient (R) by 2.9% (increasing the absolute value to 0.894) compared to the current state-of-the-art method. Lastly, we discussed the challenges faced by current deep learning methods in affinity prediction and proposed potential solutions to address these issues.
Collapse
Affiliation(s)
- Yuxiao Wang
- School of Computer Science and Technology, Shandong University, Qingdao 266237, Shandong, China
| | - Qihong Jiao
- School of Computer Science and Technology, Shandong University, Qingdao 266237, Shandong, China
| | - Jingxuan Wang
- School of Computer Science and Technology, Shandong University, Qingdao 266237, Shandong, China
| | - Xiaojun Cai
- School of Computer Science and Technology, Shandong University, Qingdao 266237, Shandong, China
| | - Wei Zhao
- State Key Laboratory of Microbial Technology, Shandong University, Qingdao 266237, Shandong, China
| | - Xuefeng Cui
- School of Computer Science and Technology, Shandong University, Qingdao 266237, Shandong, China
| |
Collapse
|
3
|
Liu X, Zheng L, Qin C, Li Y, Zhang JZH, Sun Z. Screening Power of End-Point Free-Energy Calculations in Cucurbituril Host-Guest Systems. J Chem Inf Model 2023; 63:6938-6946. [PMID: 37908066 DOI: 10.1021/acs.jcim.3c01356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2023]
Abstract
End-point free-energy methods as an indispensable component in virtual screening are commonly recognized as a tool with a certain level of screening power in pharmaceutical research. While a huge number of records could be found for end-point applications in protein-ligand, protein-protein, and protein-DNA complexes from academic and industrial reports, up to now, there is no large-scale benchmark in host-guest complexes supporting the screening power of end-point free-energy techniques. A good benchmark requires a data set of sufficient coverage of pharmaceutically relevant chemical space, a long-time sampling length supporting the trajectory approximation of the ensemble average, and a sufficient sample size of receptor-acceptor pairs to stabilize the performance statistics. In this work, selecting a popular family of macrocyclic hosts named cucurbiturils, we construct a large data set containing 154 host-guest pairs, perform extensive end-point sampling of several hundred nanosecond lengths for each system, and extract the free-energy estimates with a variety of end-point free-energy techniques, including the advanced three-trajectory dielectric-constant-variable regime proposed in our recent work. The best-performing end-point protocol employs GAFF2 for solute descriptions, the three-trajectory end-point sampling regime, and the MM/GBSA Hamiltonian in free-energy extraction, achieving a high ranking metrics of Kendall τ > 0.6, a Pearlman predictive index of ∼0.8, and a high scoring power of Pearson r > 0.8. The current project as the first large-scale systematic benchmark of end-point methods in host-guest complexes in academic publications provides solid evidence of the applicability of end-point techniques and direct guidance of computational setups in practical host-guest systems.
Collapse
Affiliation(s)
- Xiao Liu
- School of Mathematics, Physics and Statistics, Shanghai University of Engineering Science, Shanghai 201620, China
| | - Lei Zheng
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Department of Chemistry, New York University, New York, New York 10003, United States
| | - Chu Qin
- School of Mathematics, Physics and Statistics, Shanghai University of Engineering Science, Shanghai 201620, China
| | - Yang Li
- College of Information Science and Engineering, Shandong Agricultural University, Tai'an 271018, China
| | - John Z H Zhang
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
- Department of Chemistry, New York University, New York, New York 10003, United States
- School of Chemistry and Molecular Engineering, East China Normal University, Shanghai 200062, China
- Faculty of Synthetic Biology and Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen Guangdong 518055, China
| | - Zhaoxi Sun
- Changping Laboratory, Beijing 102206, China
| |
Collapse
|
4
|
Lee M, Min K. AmorProt: Amino Acid Molecular Fingerprints Repurposing-Based Protein Fingerprint. Biochemistry 2023; 62:2700-2709. [PMID: 37622182 DOI: 10.1021/acs.biochem.3c00253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/26/2023]
Abstract
As protein therapeutics play an important role in almost all medical fields, numerous studies have been conducted on proteins using artificial intelligence. Artificial intelligence has enabled data-driven predictions without the need for expensive experiments. Nevertheless, unlike the various molecular fingerprint algorithms that have been developed, protein fingerprint algorithms have rarely been studied. In this study, we proposed the amino acid molecular fingerprints repurposing-based protein (AmorProt) fingerprint, a protein sequence representation method that effectively uses the molecular fingerprints corresponding to 20 amino acids. Subsequently, the performances of the tree-based machine learning and artificial neural network models were compared using (1) amyloid classification and (2) isoelectric point regression. Finally, the applicability and advantages of the developed platform were demonstrated through a case study and the following experiments: (3) comparison of dataset dependence with feature-based methods, (4) feature importance analysis, and (5) protein space analysis. Consequently, the significantly improved model performance and data-set-independent versatility of the AmorProt fingerprint were verified. The results revealed that the current protein representation method can be applied to various fields related to proteins, such as predicting their fundamental properties or interaction with ligands.
Collapse
Affiliation(s)
- Myeonghun Lee
- School of Systems Biomedical Science, Soongsil University, 369 Sangdo-ro, Dongjak-gu, Seoul 06978, Republic of Korea
| | - Kyoungmin Min
- School of Mechanical Engineering, Soongsil University, 369 Sangdo-ro, Dongjak-gu, Seoul 06978, Republic of Korea
| |
Collapse
|
5
|
Kumar A, Singh P, Kumar R, Yadav P, Jaiswal A, Kumar Tewari A. An Experimental and Theoretical Study of the Conformational Stability of Triazinone Fleximers: Quantitative Analysis for Intermolecular Interactions. ChemistrySelect 2023. [DOI: 10.1002/slct.202203862] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Affiliation(s)
- Akhilesh Kumar
- Department of Chemistry (Center of Advanced Studies) Institute of Science Banaras Hindu University Varanasi 221005
| | - Praveen Singh
- Department of Chemistry Dayanand Vedic College Orai Jaluan 285001
| | - Ranjeet Kumar
- Department of Chemistry C. M. P. Degree College Prayagraj 211002 India
| | - Priyanka Yadav
- Department of Chemistry (Center of Advanced Studies) Institute of Science Banaras Hindu University Varanasi 221005
| | - Amit Jaiswal
- Department of Chemistry C. M. P. Degree College Prayagraj 211002 India
| | - Ashish Kumar Tewari
- Department of Chemistry (Center of Advanced Studies) Institute of Science Banaras Hindu University Varanasi 221005
| |
Collapse
|
6
|
McNair D. Artificial Intelligence and Machine Learning for Lead-to-Candidate Decision-Making and Beyond. Annu Rev Pharmacol Toxicol 2023; 63:77-97. [PMID: 35679624 DOI: 10.1146/annurev-pharmtox-051921-023255] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
The use of artificial intelligence (AI) and machine learning (ML) in pharmaceutical research and development has to date focused on research: target identification; docking-, fragment-, and motif-based generation of compound libraries; modeling of synthesis feasibility; rank-ordering likely hits according to structural and chemometric similarity to compounds having known activity and affinity to the target(s); optimizing a smaller library for synthesis and high-throughput screening; and combining evidence from screening to support hit-to-lead decisions. Applying AI/ML methods to lead optimization and lead-to-candidate (L2C) decision-making has shown slower progress, especially regarding predicting absorption, distribution, metabolism, excretion, and toxicology properties. The present review surveys reasons why this is so, reports progress that has occurred in recent years, and summarizes some of the issues that remain. Effective AI/ML tools to derisk L2C and later phases of development are important to accelerate the pharmaceutical development process, ameliorate escalating development costs, and achieve greater success rates.
Collapse
Affiliation(s)
- Douglas McNair
- Global Health, Integrated Development, Bill & Melinda Gates Foundation, Seattle, Washington, USA;
| |
Collapse
|
7
|
Basnet S, Marahatha R, Shrestha A, Bhattarai S, Katuwal S, Sharma KR, Marasini BP, Dahal SR, Basnyat RC, Patching SG, Parajuli N. In Vitro and In Silico Studies for the Identification of Potent Metabolites of Some High-Altitude Medicinal Plants from Nepal Inhibiting SARS-CoV-2 Spike Protein. MOLECULES (BASEL, SWITZERLAND) 2022; 27:molecules27248957. [PMID: 36558090 PMCID: PMC9786757 DOI: 10.3390/molecules27248957] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 12/01/2022] [Accepted: 12/12/2022] [Indexed: 12/23/2022]
Abstract
Despite ongoing vaccination programs against COVID-19 around the world, cases of infection are still rising with new variants. This infers that an effective antiviral drug against COVID-19 is crucial along with vaccinations to decrease cases. A potential target of such antivirals could be the membrane components of the causative pathogen, SARS-CoV-2, for instance spike (S) protein. In our research, we have deployed in vitro screening of crude extracts of seven ethnomedicinal plants against the spike receptor-binding domain (S1-RBD) of SARS-CoV-2 using an enzyme-linked immunosorbent assay (ELISA). Following encouraging in vitro results for Tinospora cordifolia, in silico studies were conducted for the 14 reported antiviral secondary metabolites isolated from T. cordifolia-a species widely cultivated and used as an antiviral drug in the Himalayan country of Nepal-using Genetic Optimization for Ligand Docking (GOLD), Molecular Operating Environment (MOE), and BIOVIA Discovery Studio. The molecular docking and binding energy study revealed that cordifolioside-A had a higher binding affinity and was the most effective in binding to the competitive site of the spike protein. Molecular dynamics (MD) simulation studies using GROMACS 5.4.1 further assayed the interaction between the potent compound and binding sites of the spike protein. It revealed that cordifolioside-A demonstrated better binding affinity and stability, and resulted in a conformational change in S1-RBD, hence hindering the activities of the protein. In addition, ADMET analysis of the secondary metabolites from T. cordifolia revealed promising pharmacokinetic properties. Our study thus recommends that certain secondary metabolites of T. cordifolia are possible medicinal candidates against SARS-CoV-2.
Collapse
Affiliation(s)
- Saroj Basnet
- Center for Drug Design and Molecular Simulation Division, Kathmandu 44600, Nepal
| | - Rishab Marahatha
- Central Department of Chemistry, Tribhuvan University, Kathmandu 44618, Nepal
- Department of Chemistry, Oklahoma State University, Still Water, OK 74078, USA
| | - Asmita Shrestha
- Central Department of Chemistry, Tribhuvan University, Kathmandu 44618, Nepal
| | - Salyan Bhattarai
- Paraza Pharma, Inc., 2525 Avenue Marie-Curie, Montreal, QC H4S 2E1, Canada
| | - Saurav Katuwal
- Central Department of Chemistry, Tribhuvan University, Kathmandu 44618, Nepal
| | - Khaga Raj Sharma
- Central Department of Chemistry, Tribhuvan University, Kathmandu 44618, Nepal
| | | | - Salik Ram Dahal
- Department of Chemistry, Oklahoma State University, Still Water, OK 74078, USA
- Oakridge National Laboratory, Bethel Valley Rd, Oak Ridge, TN 37830, USA
| | - Ram Chandra Basnyat
- Central Department of Chemistry, Tribhuvan University, Kathmandu 44618, Nepal
| | - Simon G. Patching
- Independent Researcher, Leeds LS2 9JT, UK
- Correspondence: (S.G.P.); (N.P.)
| | - Niranjan Parajuli
- Central Department of Chemistry, Tribhuvan University, Kathmandu 44618, Nepal
- Correspondence: (S.G.P.); (N.P.)
| |
Collapse
|
8
|
Corbo T, Kalajdzic A, Delic D, Suleiman S, Pojskic N. In silico prediction suggests inhibitory effect of halogenated boroxine on human catalase and carbonic anhydrase. J Genet Eng Biotechnol 2022; 20:153. [DOI: 10.1186/s43141-022-00437-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Accepted: 10/23/2022] [Indexed: 11/06/2022]
Abstract
Abstract
Background
This research work included bioinformatics modeling of the dipotassium-trioxohydroxytetrafluorotriborate-halogenated boroxine molecule, as well as simulation and prediction of structural interactions between the halogenated boroxine molecule, human carbonic anhydrase, and human catalase structures. Using computational methods, we tried to confirm the inhibitory effect of halogenated boroxine on the active sites of these previously mentioned enzymes. The three-dimensional crystal structures of human catalase (PDB ID: 1DGB) and human carbonic anhydrase (PDB ID: 6FE2) were retrieved from RCSB Protein Data Bank and the protein preparation was performed using AutoDock Tools. ACD/ChemSketch and ChemDoodle were used for creating the three-dimensional structure of halogenated boroxine. Molecular docking was performed using AutoDock Vina, while the results were visualized using PyMOL.
Results
Results obtained in this research are showing evidence that there are interactions between the halogenated boroxine molecule and both previously mentioned proteins (human carbonic anhydrase and human catalase) in their active sites, which led us to the conclusion that the inhibitory function of halogenated boroxine has been confirmed.
Conclusion
These findings could be an important step in determining the exact mechanisms of inhibitory activity and will hopefully serve in further research purposes of complex pharmacogenomics studies.
Collapse
|
9
|
Jeckel AM, Beran F, Züst T, Younkin G, Petschenka G, Pokharel P, Dreisbach D, Ganal-Vonarburg SC, Robert CAM. Metabolization and sequestration of plant specialized metabolites in insect herbivores: Current and emerging approaches. Front Physiol 2022; 13:1001032. [PMID: 36237530 PMCID: PMC9552321 DOI: 10.3389/fphys.2022.1001032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Accepted: 08/22/2022] [Indexed: 11/13/2022] Open
Abstract
Herbivorous insects encounter diverse plant specialized metabolites (PSMs) in their diet, that have deterrent, anti-nutritional, or toxic properties. Understanding how they cope with PSMs is crucial to understand their biology, population dynamics, and evolution. This review summarizes current and emerging cutting-edge methods that can be used to characterize the metabolic fate of PSMs, from ingestion to excretion or sequestration. It further emphasizes a workflow that enables not only to study PSM metabolism at different scales, but also to tackle and validate the genetic and biochemical mechanisms involved in PSM resistance by herbivores. This review thus aims at facilitating research on PSM-mediated plant-herbivore interactions.
Collapse
Affiliation(s)
- Adriana Moriguchi Jeckel
- Laboratory of Chemical Ecology, Institute of Plant Sciences, University of Bern, Bern, Switzerland
| | - Franziska Beran
- Department of Insect Symbiosis, Max Planck Institute for Chemical Ecology, Jena, Germany
| | - Tobias Züst
- Department of Systematic and Evolutionary Botany, University of Zürich, Zürich, Switzerland
| | - Gordon Younkin
- Boyce Thompson Institute, Ithaca, NY, United States
- Plant Biology Section, School of Integrative Plant Science, Cornell University, Ithaca, NY, United States
| | - Georg Petschenka
- Department of Applied Entomology, Institute of Phytomedicine, University of Hohenheim, Stuttgart, Germany
| | - Prayan Pokharel
- Department of Applied Entomology, Institute of Phytomedicine, University of Hohenheim, Stuttgart, Germany
| | - Domenic Dreisbach
- Institute for Inorganic and Analytical Chemistry, Justus Liebig University Giessen, Giessen, Germany
| | - Stephanie Christine Ganal-Vonarburg
- Department of Visceral Surgery and Medicine, Bern University Hospital, University of Bern, Bern, Switzerland
- Department for BioMedical Research, Visceral Surgery and Medicine, University of Bern, Bern, Switzerland
| | - Christelle Aurélie Maud Robert
- Laboratory of Chemical Ecology, Institute of Plant Sciences, University of Bern, Bern, Switzerland
- *Correspondence: Christelle Aurélie Maud Robert,
| |
Collapse
|
10
|
Oh J, Ceong HT, Na D, Park C. A machine learning model for classifying G-protein-coupled receptors as agonists or antagonists. BMC Bioinformatics 2022; 23:346. [PMID: 35982407 PMCID: PMC9389651 DOI: 10.1186/s12859-022-04877-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 08/03/2022] [Indexed: 11/13/2022] Open
Abstract
Background G-protein coupled receptors (GPCRs) sense and transmit extracellular signals into the intracellular machinery by regulating G proteins. GPCR malfunctions are associated with a variety of signaling-related diseases, including cancer and diabetes; at least a third of the marketed drugs target GPCRs. Thus, characterization of their signaling and regulatory mechanisms is crucial for the development of effective drugs. Results In this study, we developed a machine learning model to identify GPCR agonists and antagonists. We designed two-step prediction models: the first model identified the ligands binding to GPCRs and the second model classified the ligands as agonists or antagonists. Using 990 selected subset features from 5270 molecular descriptors calculated from 4590 ligands deposited in two drug databases, our model classified non-ligands, agonists, and antagonists of GPCRs, and achieved an area under the ROC curve (AUC) of 0.795, sensitivity of 0.716, specificity of 0.744, and accuracy of 0.733. In addition, we verified that 70% (44 out of 63) of FDA-approved GPCR-targeting drugs were correctly classified into their respective groups. Conclusions Studies of ligand–GPCR interaction recognition are important for the characterization of drug action mechanisms. Our GPCR–ligand interaction prediction model can be employed in the pharmaceutical sciences for the efficient virtual screening of putative GPCR-binding agonists and antagonists. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04877-7.
Collapse
Affiliation(s)
- Jooseong Oh
- School of Biological Sciences and Technology, Chonnam National University, Gwangju, 61186, Republic of Korea
| | - Hyi-Thaek Ceong
- Department of Multimedia, Chonnam National University, Yeosu, 59626, Republic of Korea.
| | - Dokyun Na
- Department of Biomedical Engineering, Chung-Ang University, Seoul, 06974, Republic of Korea.
| | - Chungoo Park
- School of Biological Sciences and Technology, Chonnam National University, Gwangju, 61186, Republic of Korea.
| |
Collapse
|
11
|
Joshi M, Purohit M, Shah DP, Patel D, Depani P, Moryani P, Krishnakumar A. Pathogenomic in silico approach identifies NSP-A and Fe-IIISBP as possible drug targets in Neisseria Meningitidis MC58 and development of pharmacophores as novel therapeutic candidates. Mol Divers 2022:10.1007/s11030-022-10480-y. [PMID: 35879631 DOI: 10.1007/s11030-022-10480-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Accepted: 06/07/2022] [Indexed: 11/26/2022]
Abstract
Meningitis creates a life-threatening clinical crisis. Moreover, the administered antibiotics result into multi-drug resistance, thereby necessitating development of alternative therapeutic strategies. This study aimed at identifying novel-drug targets in Neisseria meningitidis and therapeutic molecules which can be exploited for the treatment of meningitis. Novel targets were identified by applying a pathogenomic approach involving protein data-set mining, subtractive channel analysis and subsequent qualitative analysis comprising of in silico pharmacokinetics, molecular docking and pharmacophore generation. Pathogenomic studies revealed Neisserial Surface Protein A (NSP-A) and Iron-III-Substrate Binding Protein (Fe-IIISBP) as potential targets. Two pharmacophore models comprising of 2-(biaryl) carbapenems, efavirenz, praziquantel and pyrimethamine for NSP-A and 2-(biaryl) carbapenems, trimipramine and pyrimethamine for Fe-IIISBP, showed successful docking, followed drug-likeness criteria and generated pharmacophore model with a score of 8.08 and 8.818, respectively, which had further been docked to the target stably. Thus, our study identifies NSP-A and Fe-IIISBP as novel targets in Neisseria meningitidis for which 2-(biaryl) carbapenems, efavirenz, praziquantel, trimipramine and pyrimethamine may be employed for effective treatment of meningitis.
Collapse
Affiliation(s)
- Madhavi Joshi
- Institute of Science, Nirma University, Sarkhej-Gandhinagar Highway, Ahmedabad, Gujarat, 382 481, India
| | - Maitree Purohit
- Institute of Science, Nirma University, Sarkhej-Gandhinagar Highway, Ahmedabad, Gujarat, 382 481, India
| | - Dhriti P Shah
- Institute of Science, Nirma University, Sarkhej-Gandhinagar Highway, Ahmedabad, Gujarat, 382 481, India
| | - Devanshi Patel
- Institute of Science, Nirma University, Sarkhej-Gandhinagar Highway, Ahmedabad, Gujarat, 382 481, India
| | - Preksha Depani
- Institute of Science, Nirma University, Sarkhej-Gandhinagar Highway, Ahmedabad, Gujarat, 382 481, India
| | - Premkumar Moryani
- Institute of Science, Nirma University, Sarkhej-Gandhinagar Highway, Ahmedabad, Gujarat, 382 481, India
| | - Amee Krishnakumar
- Institute of Science, Nirma University, Sarkhej-Gandhinagar Highway, Ahmedabad, Gujarat, 382 481, India.
| |
Collapse
|
12
|
Aja PM, Awoke JN, Agu PC, Adegboyega AE, Ezeh EM, Igwenyi IO, Orji OU, Ani OG, Ale BA, Ibiam UA. Hesperidin abrogates bisphenol A endocrine disruption through binding with fibroblast growth factor 21 (FGF-21), α-amylase and α-glucosidase: an in silico molecular study. J Genet Eng Biotechnol 2022; 20:84. [PMID: 35648239 PMCID: PMC9160168 DOI: 10.1186/s43141-022-00370-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Accepted: 05/20/2022] [Indexed: 12/03/2022]
Abstract
Background Fibroblast growth factor 21 (FGF-21), alpha-amylase, and alpha-glucosidase are key proteins implicated in metabolic dysregulations. Bisphenol A (BPA) is an environmental toxicant known to cause endocrine dysregulations. Hesperidin from citrus is an emerging flavonoid for metabolic diseases management. Through computational approach, we investigated the potentials of hesperidin in abrogating BPA interference in metabolism. The 3D crystal structure of the proteins (FGF-21, α-amylase, and α-glucosidase) and the ligands (BPA and hesperidin) were retrieved from the PDB and PubChem database respectively. Using Autodock plugin Pyrx, molecular docking of the ligands and individual proteins were performed to ascertain their binding affinities and their potentials to compete for the same binding site. Validation of the docking study was considered as the ability of the ligands to bind at the same site of each proteins. The docking poses were visualized using UCSF Chimera and Discovery Studio 2020, respectively to reveal each of the protein-ligands interactions within the binding pockets. Using SwissAdme and AdmeSar servers, we further investigated hesperidin’s ADMET profile. Hesperidin used was purchased commercially. Results Hesperidin and BPA competitively bound to the same site on each protein. Interestingly, hesperidin had greater binding affinities (Kcal/mol) − 5.80, − 9.60, and − 9.60 than BPA (Kcal/mol) − 4.40, − 7.20, − 7.10 for FGF-21, α-amylase, and α-glucosidase respectively. Visualizations of the binding poses showed that hesperidin interacted with stronger bonds than BPA within the proteins’ pockets. Although hesperidin violated Lipinski rule of five, this however can be optimized through structural modifications. Conclusions Hesperidin may be an emerging natural product with promising therapeutic potentials against metabolic and endocrine derangement.
Collapse
Affiliation(s)
- P M Aja
- Department of Biochemistry, Faculty of Science, Ebonyi State University, Abakaliki, Nigeria
| | - J N Awoke
- Department of Biochemistry, Faculty of Science, Ebonyi State University, Abakaliki, Nigeria. .,Interdisciplinary Biomedical Research Centre, School of Science and Technology, Nottingham Trent University, Nottingham, UK.
| | - P C Agu
- Department of Biochemistry, Faculty of Science, Ebonyi State University, Abakaliki, Nigeria
| | - A E Adegboyega
- Department of Biochemistry, Faculty of Medical Sciences, University of Jos/Jaris Computational Biology Centre, Jos, Nigeria
| | - E M Ezeh
- Department of Chemical Engineering, Nnamdi Azikiwe University, Awka, Nigeria
| | - I O Igwenyi
- Department of Biochemistry, Faculty of Science, Ebonyi State University, Abakaliki, Nigeria
| | - O U Orji
- Department of Biochemistry, Faculty of Science, Ebonyi State University, Abakaliki, Nigeria
| | - O G Ani
- Nutrition and Exercise Physiology, University of Missouri, Columbia, United States of America
| | - B A Ale
- Department of Biochemistry, University of Nigeria Nsukka, Nsukka, Nigeria
| | - U A Ibiam
- Department of Biochemistry, Faculty of Science, Ebonyi State University, Abakaliki, Nigeria
| |
Collapse
|
13
|
Asseri AH, Alam MJ, Alzahrani F, Khames A, Pathan MT, Abourehab MAS, Hosawi S, Ahmed R, Sultana SA, Alam NF, Alam NU, Alam R, Samad A, Pokhrel S, Kim JK, Ahammad F, Kim B, Tan SC. Toward the Identification of Natural Antiviral Drug Candidates against Merkel Cell Polyomavirus: Computational Drug Design Approaches. Pharmaceuticals (Basel) 2022; 15:ph15050501. [PMID: 35631328 PMCID: PMC9146542 DOI: 10.3390/ph15050501] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 04/05/2022] [Accepted: 04/12/2022] [Indexed: 12/21/2022] Open
Abstract
Merkel cell carcinoma (MCC) is a rare form of aggressive skin cancer mainly caused by Merkel cell polyomavirus (MCPyV). Most MCC tumors express MCPyV large T (LT) antigens and play an important role in the growth-promoting activities of oncoproteins. Truncated LT promotes tumorigenicity as well as host cell proliferation by activating the viral replication machinery, and inhibition of this protein in humans drastically lowers cellular growth linked to the corresponding cancer. Our study was designed with the aim of identifying small molecular-like natural antiviral candidates that are able to inhibit the proliferation of malignant tumors, especially those that are aggressive, by blocking the activity of viral LT protein. To identify potential compounds against the target protein, a computational drug design including molecular docking, ADME (absorption, distribution, metabolism, and excretion), toxicity, molecular dynamics (MD) simulation, and molecular mechanics generalized Born surface area (MM-GBSA) approaches were applied in this study. Initially, a total of 2190 phytochemicals isolated from 104 medicinal plants were screened using the molecular docking simulation method, resulting in the identification of the top five compounds having the highest binding energy, ranging between −6.5 and −7.6 kcal/mol. The effectiveness and safety of the selected compounds were evaluated based on ADME and toxicity features. A 250 ns MD simulation confirmed the stability of the selected compounds bind to the active site (AS) of the target protein. Additionally, MM-GBSA analysis was used to determine the high values of binding free energy (ΔG bind) of the compounds binding to the target protein. The five compounds identified by computational approaches, Paulownin (CID: 3084131), Actaealactone (CID: 11537736), Epigallocatechin 3-O-cinnamate (CID: 21629801), Cirsilineol (CID: 162464), and Lycoricidine (CID: 73065), can be used in therapy as lead compounds to combat MCPyV-related cancer. However, further wet laboratory investigations are required to evaluate the activity of the drugs against the virus.
Collapse
Affiliation(s)
- Amer H. Asseri
- Biochemistry Department, Faculty of Science, King Abdul-Aziz University, Jeddah 21589, Saudi Arabia; (A.H.A.); (F.A.); (S.H.)
- Centre for Artificial Intelligence in Precision Medicines, King Abdul-Aziz University, Jeddah 21589, Saudi Arabia
| | - Md. Jahidul Alam
- Department of Applied Chemistry and Chemical Engineering, Noakhali Science and Technology University, Noakhali 3814, Bangladesh;
| | - Faisal Alzahrani
- Biochemistry Department, Faculty of Science, King Abdul-Aziz University, Jeddah 21589, Saudi Arabia; (A.H.A.); (F.A.); (S.H.)
- King Fahd Medical Research Center, Embryonic Stem Cells Unit, Department of Biochemistry, Faculty of Science, King Abdul-Aziz University, Jeddah 21589, Saudi Arabia
| | - Ahmed Khames
- Department of Pharmaceutics and Industrial pharmacy, College of Pharmacy, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia;
| | - Mohammad Turhan Pathan
- Department of Biochemistry and Microbiology, North South University, Dhaka 1229, Bangladesh;
| | - Mohammed A. S. Abourehab
- Department of Pharmaceutics, Faculty of Pharmacy, Umm Al-Qura University, Makkah 21955, Saudi Arabia;
| | - Salman Hosawi
- Biochemistry Department, Faculty of Science, King Abdul-Aziz University, Jeddah 21589, Saudi Arabia; (A.H.A.); (F.A.); (S.H.)
- Centre for Artificial Intelligence in Precision Medicines, King Abdul-Aziz University, Jeddah 21589, Saudi Arabia
| | - Rubaiat Ahmed
- Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka 1000, Bangladesh; (R.A.); (N.F.A.)
| | - Sifat Ara Sultana
- Department of Pharmacy, Faculty of Pharmacy, University of Dhaka, Dhaka 1000, Bangladesh;
| | - Nazia Fairooz Alam
- Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka 1000, Bangladesh; (R.A.); (N.F.A.)
| | - Nafee-Ul Alam
- Department of Biotechnology, College of Life Science and Medicine, Zhejiang Sci-Tech University, Hangzhou 310018, China;
| | - Rahat Alam
- Department of Genetic Engineering and Biotechnology, Faculty of Biological Science and Technology, Jashore University of Science and Technology, Jashore 7408, Bangladesh; (R.A.); (A.S.)
- Laboratory of Computational Biology, Biological Solution Centre (BioSol Centre), Jashore 7408, Bangladesh
| | - Abdus Samad
- Department of Genetic Engineering and Biotechnology, Faculty of Biological Science and Technology, Jashore University of Science and Technology, Jashore 7408, Bangladesh; (R.A.); (A.S.)
- Laboratory of Computational Biology, Biological Solution Centre (BioSol Centre), Jashore 7408, Bangladesh
| | - Sushil Pokhrel
- Department of Biomedical Engineering, State University of New York (SUNY), Binghamton, NY 13902, USA;
| | - Jin Kyu Kim
- College of Korean Medicine, Kyung Hee University, Kyungheedae-ro 26, Dongdaemun-gu, Seoul 05254, Korea;
| | - Foysal Ahammad
- Laboratory of Computational Biology, Biological Solution Centre (BioSol Centre), Jashore 7408, Bangladesh
- Department of Biological Sciences, Faculty of Science, King Abdul-Aziz University (KAU), Jeddah 21589, Saudi Arabia
- Correspondence: (F.A.); (B.K.); (S.C.T.)
| | - Bonglee Kim
- College of Korean Medicine, Kyung Hee University, Kyungheedae-ro 26, Dongdaemun-gu, Seoul 05254, Korea;
- Correspondence: (F.A.); (B.K.); (S.C.T.)
| | - Shing Cheng Tan
- UKM Medical Molecular Biology Institute, Universiti Kebangsaan Malaysia, Kuala Lumpur 56000, Malaysia
- Correspondence: (F.A.); (B.K.); (S.C.T.)
| |
Collapse
|
14
|
Gorostiola González M, Janssen APA, IJzerman AP, Heitman LH, van Westen GJP. Oncological drug discovery: AI meets structure-based computational research. Drug Discov Today 2022; 27:1661-1670. [PMID: 35301149 DOI: 10.1016/j.drudis.2022.03.005] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Revised: 01/22/2022] [Accepted: 03/09/2022] [Indexed: 02/08/2023]
Abstract
The integration of machine learning and structure-based methods has proven valuable in the past as a way to prioritize targets and compounds in early drug discovery. In oncological research, these methods can be highly beneficial in addressing the diversity of neoplastic diseases portrayed by the different hallmarks of cancer. Here, we review six use case scenarios for integrated computational methods, namely driver prediction, computational mutagenesis, (off)-target prediction, binding site prediction, virtual screening, and allosteric modulation analysis. We address the heterogeneity of integration approaches and individual methods, while acknowledging their current limitations and highlighting their potential to bring drugs for personalized oncological therapies to the market faster.
Collapse
Affiliation(s)
- Marina Gorostiola González
- Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, The Netherlands; Oncode Institute, Utrecht, The Netherlands
| | - Antonius P A Janssen
- Oncode Institute, Utrecht, The Netherlands; Molecular Physiology, Leiden Institute of Chemistry, Leiden University, The Netherlands
| | - Adriaan P IJzerman
- Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, The Netherlands
| | - Laura H Heitman
- Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, The Netherlands; Oncode Institute, Utrecht, The Netherlands
| | - Gerard J P van Westen
- Division of Drug Discovery and Safety, Leiden Academic Centre for Drug Research, Leiden University, The Netherlands.
| |
Collapse
|
15
|
Nikolaienko T, Gurbych O, Druchok M. Complex machine learning model needs complex testing: Examining predictability of molecular binding affinity by a graph neural network. J Comput Chem 2022; 43:728-739. [PMID: 35201629 DOI: 10.1002/jcc.26831] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 01/04/2022] [Accepted: 02/09/2022] [Indexed: 12/12/2022]
Abstract
Drug discovery pipelines typically involve high-throughput screening of large amounts of compounds in a search of potential drugs candidates. As a chemical space of small organic molecules is huge, a "navigation" over it urges for fast and lightweight computational methods, thus promoting machine-learning approaches for processing huge pools of candidates. In this contribution, we present a graph-based deep neural network for prediction of protein-drug binding affinity and assess its predictive power under thorough testing conditions. Within the suggested approach, both protein and drug molecules are represented as graphs and passed to separate graph sub-networks, then concatenated and regressed towards a binding affinity. The neural network is trained on two binding affinity datasets-PDBbind and data imported from RCSB Protein Data Bank. In order to explore the generalization capabilities of the model we go beyond traditional random or leave-cluster-out techniques and demonstrate the need for more elaborate model performance assessment - six different strategies for test/train data partitioning (random, time- and property-arranged, protein- and ligand-clustered) with a k-fold cross-validation are engaged. Finally, we discuss the model performance in terms of a set of metrics for different split strategies and fold arrangement. Our code is available at https://github.com/SoftServeInc/affinity-by-GNN.
Collapse
Affiliation(s)
- Tymofii Nikolaienko
- SoftServe, Inc., Lviv, Ukraine.,Faculty of Physics, Taras Shevchenko National University of Kyiv, Kyiv, Ukraine
| | - Oleksandr Gurbych
- Blackthorn AI Ltd., London, UK.,Department of Artificial Intelligence Systems, Lviv Polytechnic National University, Lviv, Ukraine
| | - Maksym Druchok
- SoftServe, Inc., Lviv, Ukraine.,Institute for Condensed Matter Physics, NAS of Ukraine, Lviv, Ukraine
| |
Collapse
|
16
|
Carracedo-Reboredo P, Liñares-Blanco J, Rodríguez-Fernández N, Cedrón F, Novoa FJ, Carballal A, Maojo V, Pazos A, Fernandez-Lozano C. A review on machine learning approaches and trends in drug discovery. Comput Struct Biotechnol J 2021; 19:4538-4558. [PMID: 34471498 PMCID: PMC8387781 DOI: 10.1016/j.csbj.2021.08.011] [Citation(s) in RCA: 103] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2021] [Revised: 08/06/2021] [Accepted: 08/06/2021] [Indexed: 12/30/2022] Open
Abstract
Drug discovery aims at finding new compounds with specific chemical properties for the treatment of diseases. In the last years, the approach used in this search presents an important component in computer science with the skyrocketing of machine learning techniques due to its democratization. With the objectives set by the Precision Medicine initiative and the new challenges generated, it is necessary to establish robust, standard and reproducible computational methodologies to achieve the objectives set. Currently, predictive models based on Machine Learning have gained great importance in the step prior to preclinical studies. This stage manages to drastically reduce costs and research times in the discovery of new drugs. This review article focuses on how these new methodologies are being used in recent years of research. Analyzing the state of the art in this field will give us an idea of where cheminformatics will be developed in the short term, the limitations it presents and the positive results it has achieved. This review will focus mainly on the methods used to model the molecular data, as well as the biological problems addressed and the Machine Learning algorithms used for drug discovery in recent years.
Collapse
Key Words
- ADMET, Absorption, distribution, metabolism, elimination and toxicity
- ADR, Adverse Drug Reaction
- AI, Artificial Intelligence
- ANN, Artificial Neural Networks
- APFP, Atom Pairs 2d FingerPrint
- AUC, Area under the Curve
- BBB, Blood–Brain barrier
- CDK, Chemical Development Kit
- CNN, Convolutional Neural Networks
- CNS, Central Nervous System
- CPI, Compound-protein interaction
- CV, Cross Validation
- Cheminformatics
- DL, Deep Learning
- DNA, Deoxyribonucleic acid
- Deep Learning
- Drug Discovery
- ECFP, Extended Connectivity Fingerprints
- FDA, Food and Drug Administration
- FNN, Fully Connected Neural Networks
- FP, Fringerprints
- FS, Feature Selection
- GCN, Graph Convolutional Networks
- GEO, Gene Expression Omnibus
- GNN, Graph Neural Networks
- GO, Gene Ontology
- KEGG, Kyoto Encyclopedia of Genes and Genomes
- MACCS, Molecular ACCess System
- MCC, Matthews correlation coefficient
- MD, Molecular Descriptors
- MKL, Multiple Kernel Learning
- ML, Machine Learning
- Machine Learning
- Molecular Descriptors
- NB, Naive Bayes
- OOB, Out of Bag
- PCA, Principal Component Analyisis
- QSAR
- QSAR, Quantitative structure–activity relationship
- RF, Random Forest
- RNA, Ribonucleic Acid
- SMILES, simplified molecular-input line-entry system
- SVM, Support Vector Machines
- TCGA, The Cancer Genome Atlas
- WHO, World Health Organization
- t-SNE, t-Distributed Stochastic Neighbor Embedding
Collapse
Affiliation(s)
- Paula Carracedo-Reboredo
- Department of Computer Science and Information Technologies, Faculty of Computer Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
| | - Jose Liñares-Blanco
- Department of Computer Science and Information Technologies, Faculty of Computer Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
- CITIC-Research Center of Information and Communication Technologies, Universidade da Coruna, A Coruña 15071, Spain
| | - Nereida Rodríguez-Fernández
- CITIC-Research Center of Information and Communication Technologies, Universidade da Coruna, A Coruña 15071, Spain
- Department of Computer Science and Information Technologies, Faculty of Communication Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
| | - Francisco Cedrón
- Department of Computer Science and Information Technologies, Faculty of Computer Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
| | - Francisco J. Novoa
- Department of Computer Science and Information Technologies, Faculty of Computer Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
| | - Adrian Carballal
- Department of Computer Science and Information Technologies, Faculty of Computer Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
- CITIC-Research Center of Information and Communication Technologies, Universidade da Coruna, A Coruña 15071, Spain
- Department of Computer Science and Information Technologies, Faculty of Communication Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
| | - Victor Maojo
- Biomedical Informatics Group, Artificial Intelligence Department, Polytechnic University of Madrid, Calle de los Ciruelos, Boadilla del Monte, Madrid 28660, Spain
| | - Alejandro Pazos
- Department of Computer Science and Information Technologies, Faculty of Computer Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
- CITIC-Research Center of Information and Communication Technologies, Universidade da Coruna, A Coruña 15071, Spain
- Grupo de Redes de Neuronas Artificiales y Sistemas Adaptativos. Imagen Médica y Diagnóstico Radiológico (RNASA-IMEDIR), Complexo Hospitalario Universitario de A Coruña (CHUAC), SERGAS, Universidade da Coruña, Instituto de Investigación Biomédica de A Coruña (INIBIC), A Coruña, Spain
| | - Carlos Fernandez-Lozano
- Department of Computer Science and Information Technologies, Faculty of Computer Science, Universidade da Coruna, Campus Elviña s/n, A Coruña 15071, Spain
- CITIC-Research Center of Information and Communication Technologies, Universidade da Coruna, A Coruña 15071, Spain
- Grupo de Redes de Neuronas Artificiales y Sistemas Adaptativos. Imagen Médica y Diagnóstico Radiológico (RNASA-IMEDIR), Complexo Hospitalario Universitario de A Coruña (CHUAC), SERGAS, Universidade da Coruña, Instituto de Investigación Biomédica de A Coruña (INIBIC), A Coruña, Spain
| |
Collapse
|
17
|
Druchok M, Yarish D, Garkot S, Nikolaienko T, Gurbych O. Ensembling machine learning models to boost molecular affinity prediction. Comput Biol Chem 2021; 93:107529. [PMID: 34192653 DOI: 10.1016/j.compbiolchem.2021.107529] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2021] [Revised: 06/01/2021] [Accepted: 06/08/2021] [Indexed: 02/01/2023]
Abstract
This study unites six popular machine learning approaches to enhance the prediction of a molecular binding affinity between receptors (large protein molecules) and ligands (small organic molecules). Here we examine a scheme where affinity of ligands is predicted against a single receptor - human thrombin, thus, the models consider ligand features only. However, the suggested approach can be repurposed for other receptors. The methods include Support Vector Machine, Random Forest, CatBoost, feed-forward neural network, graph neural network, and Bidirectional Encoder Representations from Transformers. The first five methods use input features based on physico-chemical properties of molecules, while the last one is based on textual molecular representations. All approaches do not rely on atomic spatial coordinates, avoiding a potential bias from known structures, and are capable of generalizing for compounds with unknown conformations. Within each of the methods, we have trained two models that solve classification and regression tasks. Then, all models are grouped into a pipeline of two subsequent ensembles. The first ensemble aggregates six classification models which vote whether a ligand binds to a receptor or not. If a ligand is classified as active (i.e., binds), the second ensemble predicts its binding affinity in terms of the inhibition constant Ki.
Collapse
Affiliation(s)
- Maksym Druchok
- SoftServe, Inc., 2d Sadova Str., 79021 Lviv, Ukraine; Institute for Condensed Matter Physics, NAS of Ukraine, 1 Svientsitskii Str., 79011 Lviv, Ukraine.
| | | | - Sofiya Garkot
- SoftServe, Inc., 2d Sadova Str., 79021 Lviv, Ukraine; Ukrainian Catholic University, 17 Svientsitskii Str., 79011 Lviv, Ukraine
| | - Tymofii Nikolaienko
- SoftServe, Inc., 2d Sadova Str., 79021 Lviv, Ukraine; Taras Shevchenko National University of Kyiv, 64/13, Volodymyrska Str., 01601 Kyiv, Ukraine
| | - Oleksandr Gurbych
- SoftServe, Inc., 2d Sadova Str., 79021 Lviv, Ukraine; Lviv Polytechnic National University, 5 Kniazia Romana Str., 79005 Lviv, Ukraine
| |
Collapse
|
18
|
Srinivas R, Verma N, Kraka E, Larson EC. Deep Learning-Based Ligand Design Using Shared Latent Implicit Fingerprints from Collaborative Filtering. J Chem Inf Model 2021; 61:2159-2174. [PMID: 33899481 DOI: 10.1021/acs.jcim.0c01355] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
In their previous work, Srinivas et al. [ J. Cheminf. 2018, 10, 56] have shown that implicit fingerprints capture ligands and proteins in a shared latent space, typically for the purposes of virtual screening with collaborative filtering models applied on known bioactivity data. In this work, we extend these implicit fingerprints/descriptors using deep learning techniques to translate latent descriptors into discrete representations of molecules (SMILES), without explicitly optimizing for chemical properties. This allows the design of new compounds based upon the latent representation of nearby proteins, thereby encoding druglike properties including binding affinities to known proteins. The implicit descriptor method does not require any fingerprint similarity search, which makes the method free of any bias arising from the empirical nature of the fingerprint models [Srinivas, R.; J. Cheminf. 2018, 10, 56]. We evaluate the properties of the potentially novel drugs generated by our approach using physical properties of druglike molecules and chemical complexity. Additionally, we analyze the reliability of the biological activity of the new compounds generated using this method by employing models of protein-ligand interaction, which assists in assessing the potential binding affinity of the designed compounds. We find that the generated compounds exhibit properties of chemically feasible compounds and are predicted to be excellent binders to known proteins. Furthermore, we also analyze the diversity of compounds created using the Tanimoto distance and conclude that there is a wide diversity in the generated compounds.
Collapse
Affiliation(s)
- Raghuram Srinivas
- Department of Computer Science, Southern Methodist University, Dallas, Texas 75205, United States
| | - Niraj Verma
- Department of Chemistry, Southern Methodist University, Dallas, Texas 75205, United States
| | - Elfi Kraka
- Department of Chemistry, Southern Methodist University, Dallas, Texas 75205, United States
| | - Eric C Larson
- Department of Computer Science, Southern Methodist University, Dallas, Texas 75205, United States
| |
Collapse
|
19
|
Lim S, Lu Y, Cho CY, Sung I, Kim J, Kim Y, Park S, Kim S. A review on compound-protein interaction prediction methods: Data, format, representation and model. Comput Struct Biotechnol J 2021; 19:1541-1556. [PMID: 33841755 PMCID: PMC8008185 DOI: 10.1016/j.csbj.2021.03.004] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Revised: 02/28/2021] [Accepted: 03/01/2021] [Indexed: 01/27/2023] Open
Abstract
There has recently been a rapid progress in computational methods for determining protein targets of small molecule drugs, which will be termed as compound protein interaction (CPI). In this review, we comprehensively review topics related to computational prediction of CPI. Data for CPI has been accumulated and curated significantly both in quantity and quality. Computational methods have become powerful ever to analyze such complex the data. Thus, recent successes in the improved quality of CPI prediction are due to use of both sophisticated computational techniques and higher quality information in the databases. The goal of this article is to provide reviews of topics related to CPI, such as data, format, representation, to computational models, so that researchers can take full advantages of these resources to develop novel prediction methods. Chemical compounds and protein data from various resources were discussed in terms of data formats and encoding schemes. For the CPI methods, we grouped prediction methods into five categories from traditional machine learning techniques to state-of-the-art deep learning techniques. In closing, we discussed emerging machine learning topics to help both experimental and computational scientists leverage the current knowledge and strategies to develop more powerful and accurate CPI prediction methods.
Collapse
Affiliation(s)
- Sangsoo Lim
- Bioinformatics Institute, Seoul National University, Seoul, Republic of Korea
| | - Yijingxiu Lu
- Department of Computer Science and Engineering, College of Engineering, Seoul National University, Seoul, Republic of Korea
| | - Chang Yun Cho
- Institute of Engineering Research, Seoul National University, Seoul, Republic of Korea
| | - Inyoung Sung
- Institute of Engineering Research, Seoul National University, Seoul, Republic of Korea
| | - Jungwoo Kim
- Department of Computer Science and Engineering, College of Engineering, Seoul National University, Seoul, Republic of Korea
| | - Youngkuk Kim
- Department of Computer Science and Engineering, College of Engineering, Seoul National University, Seoul, Republic of Korea
| | - Sungjoon Park
- Department of Computer Science and Engineering, College of Engineering, Seoul National University, Seoul, Republic of Korea
| | - Sun Kim
- Bioinformatics Institute, Seoul National University, Seoul, Republic of Korea
- Department of Computer Science and Engineering, College of Engineering, Seoul National University, Seoul, Republic of Korea
- Institute of Engineering Research, Seoul National University, Seoul, Republic of Korea
- Interdisciplinary Program in Bioinformatics, College of Natural Sciences, Seoul National University, Seoul, Republic of Korea
| |
Collapse
|
20
|
Qiu W, Lv Z, Hong Y, Jia J, Xiao X. BOW-GBDT: A GBDT Classifier Combining With Artificial Neural Network for Identifying GPCR-Drug Interaction Based on Wordbook Learning From Sequences. Front Cell Dev Biol 2021; 8:623858. [PMID: 33598456 PMCID: PMC7882597 DOI: 10.3389/fcell.2020.623858] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Accepted: 12/15/2020] [Indexed: 12/28/2022] Open
Abstract
Background: As a class of membrane protein receptors, G protein-coupled receptors (GPCRs) are very important for cells to complete normal life function and have been proven to be a major drug target for widespread clinical application. Hence, it is of great significance to find GPCR targets that interact with drugs in the process of drug development. However, identifying the interaction of the GPCR–drug pairs by experimental methods is very expensive and time-consuming on a large scale. As more and more database about GPCR–drug pairs are opened, it is viable to develop machine learning models to accurately predict whether there is an interaction existing in a GPCR–drug pair. Methods: In this paper, the proposed model aims to improve the accuracy of predicting the interactions of GPCR–drug pairs. For GPCRs, the work extracts protein sequence features based on a novel bag-of-words (BOW) model improved with weighted Silhouette Coefficient and has been confirmed that it can extract more pattern information and limit the dimension of feature. For drug molecules, discrete wavelet transform (DWT) is used to extract features from the original molecular fingerprints. Subsequently, the above-mentioned two types of features are contacted, and SMOTE algorithm is selected to balance the training dataset. Then, artificial neural network is used to extract features further. Finally, a gradient boosting decision tree (GBDT) model is trained with the selected features. In this paper, the proposed model is named as BOW-GBDT. Results: D92M and Check390 are selected for testing BOW-GBDT. D92M is used for a cross-validation dataset which contains 635 interactive GPCR–drug pairs and 1,225 non-interactive pairs. Check390 is used for an independent test dataset which consists of 130 interactive GPCR–drug pairs and 260 non-interactive GPCR–drug pairs, and each element in Check390 cannot be found in D92M. According to the results, the proposed model has a better performance in generation ability compared with the existing machine learning models. Conclusion: The proposed predictor improves the accuracy of the interactions of GPCR–drug pairs. In order to facilitate more researchers to use the BOW-GBDT, the predictor has been settled into a brand-new server, which is available at http://www.jci-bioinfo.cn/bowgbdt.
Collapse
Affiliation(s)
- Wangren Qiu
- School of Information Engineering, Jingdezhen Ceramic Institute, Jingdezhen, China
| | - Zhe Lv
- School of Information Engineering, Jingdezhen Ceramic Institute, Jingdezhen, China
| | - Yaoqiu Hong
- School of Information Engineering, Jingdezhen University, Jingdezhen, China
| | - Jianhua Jia
- School of Information Engineering, Jingdezhen Ceramic Institute, Jingdezhen, China
| | - Xuan Xiao
- School of Information Engineering, Jingdezhen Ceramic Institute, Jingdezhen, China
| |
Collapse
|
21
|
SSnet: A Deep Learning Approach for Protein-Ligand Interaction Prediction. Int J Mol Sci 2021; 22:ijms22031392. [PMID: 33573266 PMCID: PMC7869013 DOI: 10.3390/ijms22031392] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2020] [Revised: 01/24/2021] [Accepted: 01/27/2021] [Indexed: 12/15/2022] Open
Abstract
Computational prediction of Protein-Ligand Interaction (PLI) is an important step in the modern drug discovery pipeline as it mitigates the cost, time, and resources required to screen novel therapeutics. Deep Neural Networks (DNN) have recently shown excellent performance in PLI prediction. However, the performance is highly dependent on protein and ligand features utilized for the DNN model. Moreover, in current models, the deciphering of how protein features determine the underlying principles that govern PLI is not trivial. In this work, we developed a DNN framework named SSnet that utilizes secondary structure information of proteins extracted as the curvature and torsion of the protein backbone to predict PLI. We demonstrate the performance of SSnet by comparing against a variety of currently popular machine and non-Machine Learning (ML) models using various metrics. We visualize the intermediate layers of SSnet to show a potential latent space for proteins, in particular to extract structural elements in a protein that the model finds influential for ligand binding, which is one of the key features of SSnet. We observed in our study that SSnet learns information about locations in a protein where a ligand can bind, including binding sites, allosteric sites and cryptic sites, regardless of the conformation used. We further observed that SSnet is not biased to any specific molecular interaction and extracts the protein fold information critical for PLI prediction. Our work forms an important gateway to the general exploration of secondary structure-based Deep Learning (DL), which is not just confined to protein-ligand interactions, and as such will have a large impact on protein research, while being readily accessible for de novo drug designers as a standalone package.
Collapse
|
22
|
Mehmood A, Kaushik AC, Wang Q, Li CD, Wei DQ. Bringing Structural Implications and Deep Learning-Based Drug Identification for KRAS Mutants. J Chem Inf Model 2021; 61:571-586. [PMID: 33513018 DOI: 10.1021/acs.jcim.0c00488] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Colorectal cancer is considered one of the leading causes of death that is linked with the Kirsten Rat Sarcoma (KRAS) harboring codons 13 and 61 mutations. The objective for this study is to search for clinically important codon 61 mutations and analyze how they affect the protein structural dynamics. Additionally, a deep-learning approach is used to carry out a similarity search for potential compounds that might have a comparatively better affinity. Public databases like The Cancer Genome Atlas and Genomic Data Commons were accessed for obtaining the data regarding mutations that are associated with colon cancer. Multiple analysis such as genomic alteration landscape, survival analysis, and systems biology-based kinetic simulations were carried out to predict dynamic changes for the selected mutations. Additionally, a molecular dynamics simulation of 100 ns for all the seven shortlisted codon 61 mutations have been conducted, which revealed noticeable deviations. Finally, the deep learning-based predicted compounds were docked with the KRAS 3D conformer, showing better affinity and good docking scores as compared to the already existing drugs. Taking together the outcomes of systems biology and molecular dynamics, it is observed that the reported mutations in the SII region are highly detrimental as they have an immense impact on the protein sensitive sites' native conformation and overall stability. The drugs reported in this study show increased performance and are encouraged to be used for further evaluation regarding the situation that ascends as a result of KRAS mutations.
Collapse
Affiliation(s)
- Aamir Mehmood
- Department of Bioinformatics and Biostatistics, State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Minhang District, Shanghai 200240, China.,Peng Cheng Laboratory, Vanke Cloud City Phase I Building 8, Xili Street, Nanshan District, Shenzhen, Guangdong, 518055, P. R. China
| | - Aman Chandra Kaushik
- Wuxi School of Medicine, Jiangnan University, Li Lake Avenue, Wuxi, Jiangsu 214122, China
| | - Qiankun Wang
- Department of Bioinformatics and Biostatistics, State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Minhang District, Shanghai 200240, China
| | - Cheng-Dong Li
- Department of Bioinformatics and Biostatistics, State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Minhang District, Shanghai 200240, China
| | - Dong-Qing Wei
- Department of Bioinformatics and Biostatistics, State Key Laboratory of Microbial Metabolism and School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Minhang District, Shanghai 200240, China.,Peng Cheng Laboratory, Vanke Cloud City Phase I Building 8, Xili Street, Nanshan District, Shenzhen, Guangdong, 518055, P. R. China
| |
Collapse
|
23
|
Xie L, Xu L, Kong R, Chang S, Xu X. Improvement of Prediction Performance With Conjoint Molecular Fingerprint in Deep Learning. Front Pharmacol 2021; 11:606668. [PMID: 33488387 PMCID: PMC7819282 DOI: 10.3389/fphar.2020.606668] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2020] [Accepted: 11/23/2020] [Indexed: 12/27/2022] Open
Abstract
The accurate predicting of physical properties and bioactivity of drug molecules in deep learning depends on how molecules are represented. Many types of molecular descriptors have been developed for quantitative structure-activity/property relationships quantitative structure-activity relationships (QSPR). However, each molecular descriptor is optimized for a specific application with encoding preference. Considering that standalone featurization methods may only cover parts of information of the chemical molecules, we proposed to build the conjoint fingerprint by combining two supplementary fingerprints. The impact of conjoint fingerprint and each standalone fingerprint on predicting performance was systematically evaluated in predicting the logarithm of the partition coefficient (logP) and binding affinity of protein-ligand by using machine learning/deep learning (ML/DL) methods, including random forest (RF), support vector regression (SVR), extreme gradient boosting (XGBoost), long short-term memory network (LSTM), and deep neural network (DNN). The results demonstrated that the conjoint fingerprint yielded improved predictive performance, even outperforming the consensus model using two standalone fingerprints among four out of five examined methods. Given that the conjoint fingerprint scheme shows easy extensibility and high applicability, we expect that the proposed conjoint scheme would create new opportunities for continuously improving predictive performance of deep learning by harnessing the complementarity of various types of fingerprints.
Collapse
Affiliation(s)
- Liangxu Xie
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China.,Jiangsu Sino-Israel Industrial Technology Research Institute, Changzhou, China
| | - Lei Xu
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| | - Ren Kong
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| | - Shan Chang
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| | - Xiaojun Xu
- Institute of Bioinformatics and Medical Engineering, School of Electrical and Information Engineering, Jiangsu University of Technology, Changzhou, China
| |
Collapse
|
24
|
Rui M, Pang H, Ji W, Wang S, Yu X, Wang L, Feng C. Development of simultaneous interaction prediction approach (SiPA) for the expansion of interaction network of traditional Chinese medicine. Chin Med 2020; 15:90. [PMID: 32863859 PMCID: PMC7448979 DOI: 10.1186/s13020-020-00369-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2020] [Accepted: 08/19/2020] [Indexed: 11/21/2022] Open
Abstract
Background Due to the lack of enough interaction data among compositions, targets and diseases, it is difficult to construct a complete network of Traditional Chinese Medicine (TCM) that comprehensively reflects active compositions and their synergistic network in terms of specific diseases. Therefore, mapping of the full spectrum of interaction between compounds and their targets is of central importance when we use network pharmacology approach to explore the therapeutic potential of the TCM. Methods To address this challenge, we developed a large-scale simultaneous interaction prediction approach (SiPA) integrated one interaction network based simple inference model (SIM), focusing on ‘logical relevance’ between compounds, proteins or diseases, and another compound-target correlation space based interaction prediction model (CTCS-IPM) that was built on the basis of the canonical correlation analysis (CCA) to estimate the position of compounds (or targets) in compound-protein correlated space. Then SiPA was applied to discover reliable multiple interactions for interaction network expansion of a TCM, compound Salvia miltiorrhiza. By means of network analysis, potential active compounds and their related network synergy underlying cardiovascular diseases were evaluated between expanded and original interaction networks. Part of new interactions were validated with existing experimental evidence and molecular docking. Results As evaluated with known test dataset, the established combination approach was proved to make highly accurate prediction, showing a well prediction performance for the SIM and a high recall rate of 85.2% for the CTCS-IPM. Then 710 pairs of new compound-target interactions, 24 pairs of new compound-cardiovascular disease interactions and 294 pairs of new cardiovascular disease-protein interactions were predicted for compound Salvia miltiorrhiza. Results of network analysis suggested the network expansion could dramatically improve the completeness and effectiveness of the network. Validation results of literature and molecular docking manifested that inferred interactions had good reliability. Conclusions We provided a practical and efficient way for large-scale inference of multiple interactions of TCM ingredients, which was not limited by the lack of negative samples, sample size and target 3D structures. SiPA could help researchers more accurately prioritize the effective compounds and more completely explore network synergy of TCM for treating specific diseases, indicating a potential way for effectively identifying candidate compound (or target) in drug discovery.
Collapse
Affiliation(s)
- Mengjie Rui
- School of Pharmacy, Jiangsu University, Zhenjiang, 212013 People's Republic of China
| | - Hui Pang
- School of Pharmacy, Jiangsu University, Zhenjiang, 212013 People's Republic of China
| | - Wei Ji
- School of Pharmacy, Jiangsu University, Zhenjiang, 212013 People's Republic of China
| | - Siqi Wang
- School of Pharmacy, Jiangsu University, Zhenjiang, 212013 People's Republic of China
| | - Xuefei Yu
- School of Pharmacy, Jiangsu University, Zhenjiang, 212013 People's Republic of China
| | - Lilong Wang
- School of Pharmacy, Jiangsu University, Zhenjiang, 212013 People's Republic of China
| | - Chunlai Feng
- School of Pharmacy, Jiangsu University, Zhenjiang, 212013 People's Republic of China
| |
Collapse
|
25
|
Wang P, Huang X, Qiu W, Xiao X. Identifying GPCR-drug interaction based on wordbook learning from sequences. BMC Bioinformatics 2020; 21:150. [PMID: 32312232 PMCID: PMC7171867 DOI: 10.1186/s12859-020-3488-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Accepted: 04/13/2020] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND G protein-coupled receptors (GPCRs) mediate a variety of important physiological functions, are closely related to many diseases, and constitute the most important target family of modern drugs. Therefore, the research of GPCR analysis and GPCR ligand screening is the hotspot of new drug development. Accurately identifying the GPCR-drug interaction is one of the key steps for designing GPCR-targeted drugs. However, it is prohibitively expensive to experimentally ascertain the interaction of GPCR-drug pairs on a large scale. Therefore, it is of great significance to predict the interaction of GPCR-drug pairs directly from the molecular sequences. With the accumulation of known GPCR-drug interaction data, it is feasible to develop sequence-based machine learning models for query GPCR-drug pairs. RESULTS In this paper, a new sequence-based method is proposed to identify GPCR-drug interactions. For GPCRs, we use a novel bag-of-words (BoW) model to extract sequence features, which can extract more pattern information from low-order to high-order and limit the feature space dimension. For drug molecules, we use discrete Fourier transform (DFT) to extract higher-order pattern information from the original molecular fingerprints. The feature vectors of two kinds of molecules are concatenated and input into a simple prediction engine distance-weighted K-nearest-neighbor (DWKNN). This basic method is easy to be enhanced through ensemble learning. Through testing on recently constructed GPCR-drug interaction datasets, it is found that the proposed methods are better than the existing sequence-based machine learning methods in generalization ability, even an unconventional method in which the prediction performance was further improved by post-processing procedure (PPP). CONCLUSIONS The proposed methods are effective for GPCR-drug interaction prediction, and may also be potential methods for other target-drug interaction prediction, or protein-protein interaction prediction. In addition, the new proposed feature extraction method for GPCR sequences is the modified version of the traditional BoW model and may be useful to solve problems of protein classification or attribute prediction. The source code of the proposed methods is freely available for academic research at https://github.com/wp3751/GPCR-Drug-Interaction.
Collapse
Affiliation(s)
- Pu Wang
- Computer School, Hubei University of Arts and Science, Xiangyang, 441053 China
| | - Xiaotong Huang
- Computer School, Hubei University of Arts and Science, Xiangyang, 441053 China
| | - Wangren Qiu
- Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, 333403 China
| | - Xuan Xiao
- Computer Department, Jingdezhen Ceramic Institute, Jingdezhen, 333403 China
| |
Collapse
|
26
|
Martins IC, Santos NC. Intrinsically disordered protein domains in flavivirus infection. Arch Biochem Biophys 2020; 683:108298. [PMID: 32045581 DOI: 10.1016/j.abb.2020.108298] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2019] [Revised: 02/03/2020] [Accepted: 02/05/2020] [Indexed: 12/30/2022]
Abstract
Intrinsically disordered protein regions are at the core of biological processes and involved in key protein-ligand interactions. The Flavivirus proteins, of viruses of great biomedical importance such as Zika and dengue viruses, exemplify this. Several proteins of these viruses have disordered regions that are of the utmost importance for biological activity. Disordered proteins can adopt several conformations, each able to interact with and/or bind to different ligands. In fact, such interactions can help stabilize a particular fold. Moreover, by being promiscuous in the number of target molecules they can bind to, these protein regions increase the number of functions that their small proteome (10 proteins) can achieve. A folding energy waterfall better describes the protein folding landscape of these proteins. A disordered protein can be thought as rolling down the folding energy cascade, in order "to fall, fold and function". This is the case of many viral protein regions, as seen in the flaviviruses proteome. Given their small size, flaviviruses are a good model system for understanding the role of intrinsically disordered protein regions in viral function. Finally, studying these viruses disordered protein regions will certainly contribute to the development of therapeutic approaches against such promising (yet challenging) targets.
Collapse
Affiliation(s)
- Ivo C Martins
- Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal.
| | - Nuno C Santos
- Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal.
| |
Collapse
|
27
|
Santana R, Zuluaga R, Gañán P, Arrasate S, Onieva E, González-Díaz H. Designing nanoparticle release systems for drug-vitamin cancer co-therapy with multiplicative perturbation-theory machine learning (PTML) models. NANOSCALE 2019; 11:21811-21823. [PMID: 31691701 DOI: 10.1039/c9nr05070a] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Nano-systems for cancer co-therapy including vitamins or vitamin derivatives have showed adequate results to continue with further research studies to better understand them. However, the number of different combinations of drugs, vitamins, nanoparticle types, coating agents, synthesis conditions, and system types (nanocapsules, micelles, etc.) to be tested is very large generating a high cost in experimentations. In this context, there are reports of large datasets of preclinical assays of compounds (like in the ChEMBL database) and increasing but yet limited reports of experimental measurements of nano-systems per se. On the other hand, Machine Learning is gaining momentum in Nanotechnology and Pharmaceutical Sciences as a tool for rational design of new drugs and drug-release nano-systems. In this work, we propose to combine Perturbation Theory principles and Machine Learning to develop a PTML model for rational selection of the components of cancer co-therapy drug-vitamin release nano-systems (DVRNs). In doing so, we apply information fusion techniques with 2 data sets: (1) a large ChEMBL dataset of >36 000 preclinical assays of vitamin derivatives and a new dataset of >1000 outcomes of DVRNs, collected herein from the literature for the first time. The ChEMBL dataset used covers a considerable number of assay conditions (cjvit) each one with multiple levels. These conditions included >504 biological activity parameters (c0vit), >340 types of proteins (c1vit), >650 types of cells (c2vit), >120 assay organisms (c3vit), >60 assay strains (c4vit). Regarding the DVRNs, there are 25 different types of nano-systems (njn), with up to 16 conditions (cjn) including also different levels such as 8 biological activity parameters (c0n), 9 raw nanomaterials (c4n), 15 assay cells (c11n), etc. In the first stage, we used Moving Average operators to quantify the perturbations (deviations) in all input variables with respect to the conditions. After that, we used multiplicative PT operators to carry out data fusion, and dimension reduction, and Linear Discriminant Analysis (LDA) to seek the PTML model. The best PTML model found showed values of specificity, sensitivity, and accuracy in the range of 83-88% in training and external validation series for >130 000 cases (DVRNs vs. ChEMBL data pairs) formed after data fusion. To the best of our knowledge, this is the first general purpose model for the rational design of DVRNs for cancer co-therapy.
Collapse
|
28
|
Machine learning for target discovery in drug development. Curr Opin Chem Biol 2019; 56:16-22. [PMID: 31734566 DOI: 10.1016/j.cbpa.2019.10.003] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2019] [Revised: 10/01/2019] [Accepted: 10/03/2019] [Indexed: 12/15/2022]
Abstract
The discovery of macromolecular targets for bioactive agents is currently a bottleneck for the informed design of chemical probes and drug leads. Typically, activity profiling against genetically manipulated cell lines or chemical proteomics is pursued to shed light on their biology and deconvolute drug-target networks. By taking advantage of the ever-growing wealth of publicly available bioactivity data, learning algorithms now provide an attractive means to generate statistically motivated research hypotheses and thereby prioritize biochemical screens. Here, we highlight recent successes in machine intelligence for target identification and discuss challenges and opportunities for drug discovery.
Collapse
|