1
|
Parveen A, Kumar A. Introduction to Integrated Proteogenomic Pipeline for Dealing with Pathogenic Missense SNPs. Methods Mol Biol 2025; 2859:93-107. [PMID: 39436598 DOI: 10.1007/978-1-0716-4152-1_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2024]
Abstract
Proteogenomics is a multi-omics setup combining mass spectrometry and next-generation sequencing (NGS) technologies (using genomics and/or transcriptomics) with main aims of improving genome annotation and facilitating characterization of proteo-isoforms. However, working with proteogenomic approach is a very challenging task as it is generating multi-omics data and integrating these data for interpretation of results for biological or clinical implications. There is an urgent need for the development of protocols for integrated proteogenomics approaches. Genome resequencing yields massive data for missense single-nucleotide polymorphisms (SNP), and SNPs are yet not fully covered for their pathogenic nature using proteogenomic approaches. In this chapter, we present such a protocol for dealing with pathogenic missense SNPs using an integrated proteogenomics pipeline combining several steps: DNA-Seq, RNA-Seq, mass spectroscopy (MS), making customized databases of produced datasets, and screening and filtering for useful MS spectrums. This protocol also provides users with tricks and tips for the modifications, based on the requirements of the projects.
Collapse
Affiliation(s)
- Alisha Parveen
- Manipal Academy of Higher Education (MAHE), Manipal & Institute of Bioinformatics, Bangalore, India
- , Manipal, India
| | - Abhishek Kumar
- Manipal Academy of Higher Education (MAHE), Manipal & Institute of Bioinformatics, Bangalore, India.
- , Manipal, India.
| |
Collapse
|
2
|
Rémondin C, Mignani S, Rochais C, Dallemagne P. Synthesis and interest in medicinal chemistry of β-phenylalanine derivatives (β-PAD): an update (2010-2022). Future Med Chem 2024; 16:1147-1162. [PMID: 38722231 PMCID: PMC11221601 DOI: 10.1080/17568919.2024.2347063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Accepted: 04/19/2024] [Indexed: 06/26/2024] Open
Abstract
β-Phenylalanine derivatives (β-PAD) represent a structural family of therapeutic interest, either as components of drugs or as starting materials for access to key compounds. As scaffolds for medicinal chemistry work, β-PAD offer the advantage of great diversity and modularity, a chiral pseudopeptidic character that opens up the capacity to be recognized by natural systems, and greater stability than natural α-amino acids. Nevertheless, their synthesis remains a challenge in drug discovery and numerous methods have been devoted to their preparation. This review is an update of the access routes to β-PAD and their various therapeutic applications.
Collapse
Affiliation(s)
| | - Serge Mignani
- Normandie Univ.,
UNICAEN, CERMN,
14000, Caen, France
- UMR 860, Laboratoire de Chimie et de Biochimie
Pharmacologiques et Toxicologique, Université Paris
Descartes, PRES Sorbonne Paris Cité,
CNRS, 45 rue des Saints Pères,
75006, Paris, France
- CQM – Centro de Química da
Madeira, MMRG, Universidad da
Madeira, Campus da Penteada,
9020-105, Funchal,
Portugal
| | | | | |
Collapse
|
3
|
Speck-Planche A, Kleandrova VV. Multi-Condition QSAR Model for the Virtual Design of Chemicals with Dual Pan-Antiviral and Anti-Cytokine Storm Profiles. ACS OMEGA 2022; 7:32119-32130. [PMID: 36120024 PMCID: PMC9476185 DOI: 10.1021/acsomega.2c03363] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 08/19/2022] [Indexed: 06/15/2023]
Abstract
Respiratory viruses are infectious agents, which can cause pandemics. Although nowadays the danger associated with respiratory viruses continues to be evidenced by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) as the virus responsible for the current COVID-19 pandemic, other viruses such as SARS-CoV-1, the influenza A and B viruses (IAV and IBV, respectively), and the respiratory syncytial virus (RSV) can lead to globally spread viral diseases. Also, from a biological point of view, most of these viruses can cause an organ-damaging hyperinflammatory response known as the cytokine storm (CS). Computational approaches constitute an essential component of modern drug development campaigns, and therefore, they have the potential to accelerate the discovery of chemicals able to simultaneously inhibit multiple molecular and nonmolecular targets. We report here the first multicondition model based on quantitative structure-activity relationships and an artificial neural network (mtc-QSAR-ANN) for the virtual design and prediction of molecules with dual pan-antiviral and anti-CS profiles. Our mtc-QSAR-ANN model exhibited an accuracy higher than 80%. By interpreting the different descriptors present in the mtc-QSAR-ANN model, we could retrieve several molecular fragments whose assembly led to new molecules with drug-like properties and predicted pan-antiviral and anti-CS activities.
Collapse
Affiliation(s)
- Alejandro Speck-Planche
- Grupo
de Química Computacional y Teórica (QCT-USFQ), Departamento
de Ingeniería Química, Universidad
San Francisco de Quito, Diego de Robles y vía Interoceánica, Quito 170901, Ecuador
| | - Valeria V. Kleandrova
- Laboratory
of Fundamental and Applied Research of Quality and Technology of Food
Production, Moscow State University of Food
Production, Volokolamskoe
shosse 11, 125080, Moscow, Russian Federation
| |
Collapse
|
4
|
Speck-Planche A, Kleandrova VV, Scotti MT. In Silico Drug Repurposing for Anti-Inflammatory Therapy: Virtual Search for Dual Inhibitors of Caspase-1 and TNF-Alpha. Biomolecules 2021; 11:biom11121832. [PMID: 34944476 PMCID: PMC8699067 DOI: 10.3390/biom11121832] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Revised: 11/15/2021] [Accepted: 12/02/2021] [Indexed: 12/27/2022] Open
Abstract
Inflammation involves a complex biological response of the body tissues to damaging stimuli. When dysregulated, inflammation led by biomolecular mediators such as caspase-1 and tumor necrosis factor-alpha (TNF-alpha) can play a detrimental role in the progression of different medical conditions such as cancer, neurological disorders, autoimmune diseases, and cytokine storms caused by viral infections such as COVID-19. Computational approaches can accelerate the search for dual-target drugs able to simultaneously inhibit the aforementioned proteins, enabling the discovery of wide-spectrum anti-inflammatory agents. This work reports the first multicondition model based on quantitative structure–activity relationships and a multilayer perceptron neural network (mtc-QSAR-MLP) for the virtual screening of agency-regulated chemicals as versatile anti-inflammatory therapeutics. The mtc-QSAR-MLP model displayed accuracy higher than 88%, and was interpreted from a physicochemical and structural point of view. When using the mtc-QSAR-MLP model as a virtual screening tool, we could identify several agency-regulated chemicals as dual inhibitors of caspase-1 and TNF-alpha, and the experimental information later retrieved from the scientific literature converged with our computational results. This study supports the capabilities of our mtc-QSAR-MLP model in anti-inflammatory therapy with direct applications to current health issues such as the COVID-19 pandemic.
Collapse
Affiliation(s)
- Alejandro Speck-Planche
- Postgraduate Program in Natural and Synthetic Bioactive Products, Federal University of Paraíba, João Pessoa 58051-900, Brazil;
- Correspondence:
| | - Valeria V. Kleandrova
- Laboratory of Fundamental and Applied Research of Quality and Technology of Food Production, Moscow State University of Food Production, Volokolamskoe shosse 11, 125080 Moscow, Russia;
| | - Marcus T. Scotti
- Postgraduate Program in Natural and Synthetic Bioactive Products, Federal University of Paraíba, João Pessoa 58051-900, Brazil;
| |
Collapse
|
5
|
Quevedo-Tumailli V, Ortega-Tenezaca B, González-Díaz H. IFPTML Mapping of Drug Graphs with Protein and Chromosome Structural Networks vs. Pre-Clinical Assay Information for Discovery of Antimalarial Compounds. Int J Mol Sci 2021; 22:13066. [PMID: 34884870 PMCID: PMC8657696 DOI: 10.3390/ijms222313066] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Revised: 11/23/2021] [Accepted: 11/24/2021] [Indexed: 11/16/2022] Open
Abstract
The parasite species of genus Plasmodium causes Malaria, which remains a major global health problem due to parasite resistance to available Antimalarial drugs and increasing treatment costs. Consequently, computational prediction of new Antimalarial compounds with novel targets in the proteome of Plasmodium sp. is a very important goal for the pharmaceutical industry. We can expect that the success of the pre-clinical assay depends on the conditions of assay per se, the chemical structure of the drug, the structure of the target protein to be targeted, as well as on factors governing the expression of this protein in the proteome such as genes (Deoxyribonucleic acid, DNA) sequence and/or chromosomes structure. However, there are no reports of computational models that consider all these factors simultaneously. Some of the difficulties for this kind of analysis are the dispersion of data in different datasets, the high heterogeneity of data, etc. In this work, we analyzed three databases ChEMBL (Chemical database of the European Molecular Biology Laboratory), UniProt (Universal Protein Resource), and NCBI-GDV (National Center for Biotechnology Information-Genome Data Viewer) to achieve this goal. The ChEMBL dataset contains outcomes for 17,758 unique assays of potential Antimalarial compounds including numeric descriptors (variables) for the structure of compounds as well as a huge amount of information about the conditions of assays. The NCBI-GDV and UniProt datasets include the sequence of genes, proteins, and their functions. In addition, we also created two partitions (cassayj = caj and cdataj = cdj) of categorical variables from theChEMBL dataset. These partitions contain variables that encode information about experimental conditions of preclinical assays (caj) or about the nature and quality of data (cdj). These categorical variables include information about 22 parameters of biological activity (ca0), 28 target proteins (ca1), and 9 organisms of assay (ca2), etc. We also created another partition of (cprotj = cpj) including categorical variables with biological information about the target proteins, genes, and chromosomes. These variables cover32 genes (cp0), 10 chromosomes (cp1), gene orientation (cp2), and 31 protein functions (cp3). We used a Perturbation-Theory Machine Learning Information Fusion (IFPTML) algorithm to map all this information (from three databases) into and train a predictive model. Shannon's entropy measure Shk (numerical variables) was used to quantify the information about the structure of drugs, protein sequences, gene sequences, and chromosomes in the same information scale. Perturbation Theory Operators (PTOs) with the form of Moving Average (MA) operators have been used to quantify perturbations (deviations) in the structural variables with respect to their expected values for different subsets (partitions) of categorical variables. We obtained three IFPTML models using General Discriminant Analysis (GDA), Classification Tree with Univariate Splits (CTUS), and Classification Tree with Linear Combinations (CTLC). The IFPTML-CTLC presented the better performance with Sensitivity Sn(%) = 83.6/85.1, and Specificity Sp(%) = 89.8/89.7 for training/validation sets, respectively. This model could become a useful tool for the optimization of preclinical assays of new Antimalarial compounds vs. different proteins in the proteome of Plasmodium.
Collapse
Affiliation(s)
- Viviana Quevedo-Tumailli
- Grupo RNASA-IMEDIR, Department of Computer Science, University of A Coruña, 15071 A Coruña, Spain; (V.Q.-T.); (B.O.-T.)
- Research Department, Puyo Campus, Universidad Estatal Amazónica, Puyo 160150, Ecuador
| | - Bernabe Ortega-Tenezaca
- Grupo RNASA-IMEDIR, Department of Computer Science, University of A Coruña, 15071 A Coruña, Spain; (V.Q.-T.); (B.O.-T.)
- Information and Communications Technology Management Department, Puyo Campus, Universidad Estatal Amazónica, Puyo 160150, Ecuador
| | - Humberto González-Díaz
- Department of Organic and Inorganic Chemistry, University of the Basque Country UPV/EHU, 48940 Leioa, Spain
- BIOFISIKA, Basque Centre for Biophysics, CSIC-UPV/EHU, 48940 Leioa, Spain
- IKERBASQUE, Basque Foundation for Science, 48011 Bilbao, Spain
| |
Collapse
|
6
|
Kleandrova VV, Speck-Planche A. The QSAR Paradigm in Fragment-Based Drug Discovery: From the Virtual Generation of Target Inhibitors to Multi-Scale Modeling. Mini Rev Med Chem 2021; 20:1357-1374. [PMID: 32013845 DOI: 10.2174/1389557520666200204123156] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Revised: 10/21/2019] [Accepted: 10/28/2019] [Indexed: 12/24/2022]
Abstract
Fragment-Based Drug Design (FBDD) has established itself as a promising approach in modern drug discovery, accelerating and improving lead optimization, while playing a crucial role in diminishing the high attrition rates at all stages in the drug development process. On the other hand, FBDD has benefited from the application of computational methodologies, where the models derived from the Quantitative Structure-Activity Relationships (QSAR) have become consolidated tools. This mini-review focuses on the evolution and main applications of the QSAR paradigm in the context of FBDD in the last five years. This report places particular emphasis on the QSAR models derived from fragment-based topological approaches to extract physicochemical and/or structural information, allowing to design potentially novel mono- or multi-target inhibitors from relatively large and heterogeneous databases. Here, we also discuss the need to apply multi-scale modeling, to exemplify how different datasets based on target inhibition can be simultaneously integrated and predicted together with other relevant endpoints such as the biological activity against non-biomolecular targets, as well as in vitro and in vivo toxicity and pharmacokinetic properties. In this context, seminal papers are briefly analyzed. As huge amounts of data continue to accumulate in the domains of the chemical, biological and biomedical sciences, it has become clear that drug discovery must be viewed as a multi-scale optimization process. An ideal multi-scale approach should integrate diverse chemical and biological data and also serve as a knowledge generator, enabling the design of potentially optimal chemicals that may become therapeutic agents.
Collapse
Affiliation(s)
- Valeria V Kleandrova
- Laboratory of Fundamental and Applied Research of Quality and Technology of Food Production, Moscow State University of Food Production, Volokolamskoe Shosse 11, 125080, Moscow, Russian Federation
| | - Alejandro Speck-Planche
- Department of Chemistry, Institute of Pharmacy, I.M. Sechenov First Moscow State Medical University, Trubetskaya Str., 8, b. 2, 119992, Moscow, Russian Federation
| |
Collapse
|
7
|
Xu L, Ru X, Song R. Application of Machine Learning for Drug-Target Interaction Prediction. Front Genet 2021; 12:680117. [PMID: 34234813 PMCID: PMC8255962 DOI: 10.3389/fgene.2021.680117] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2021] [Accepted: 05/28/2021] [Indexed: 11/13/2022] Open
Abstract
Exploring drug–target interactions by biomedical experiments requires a lot of human, financial, and material resources. To save time and cost to meet the needs of the present generation, machine learning methods have been introduced into the prediction of drug–target interactions. The large amount of available drug and target data in existing databases, the evolving and innovative computer technologies, and the inherent characteristics of various types of machine learning have made machine learning techniques the mainstream method for drug–target interaction prediction research. In this review, details of the specific applications of machine learning in drug–target interaction prediction are summarized, the characteristics of each algorithm are analyzed, and the issues that need to be further addressed and explored for future research are discussed. The aim of this review is to provide a sound basis for the construction of high-performance models.
Collapse
Affiliation(s)
- Lei Xu
- School of Electronic and Communication Engineering, Shenzhen Polytechnic, Shenzhen, China
| | - Xiaoqing Ru
- Department of Computer Science, University of Tsukuba, Tsukuba, Japan
| | - Rong Song
- School of Electronic and Communication Engineering, Shenzhen Polytechnic, Shenzhen, China
| |
Collapse
|
8
|
Hu S, Zuo H, Qi J, Hu Y, Yu B. Analysis of Effect of Schisandra in the Treatment of Myocardial Infarction Based on Three-Mode Gene Ontology Network. Front Pharmacol 2019; 10:232. [PMID: 30949047 PMCID: PMC6435518 DOI: 10.3389/fphar.2019.00232] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2018] [Accepted: 02/22/2019] [Indexed: 12/31/2022] Open
Abstract
Schisandra chinensis is a commonly used traditional Chinese medicine, which has been widely used in the treatment of acute myocardial infarction in China. However, it has been difficult to systematically clarify the major pharmacological effect of Schisandra, due to its multi-component complex mechanism. In order to solve this problem, a comprehensive network analysis method was established based-on “component–gene ontology–effect” interactions. Through the network analysis, reduction of cardiac preload and myocardial contractility was shown to be the major effect of Schisandra components, which was further experimentally validated. In addition, the expression of NCOR2 and NFAT in myocyte were experimentally confirmed to be associated with Schisandra in the treatment of AMI, which may be responsible for the preservation effect of myocardial contractility. In conclusion, the three-mode gene ontology network can be an effective network analysis workflow to evaluate the pharmacological effects of a multi-drug complex system.
Collapse
Affiliation(s)
- Siyao Hu
- Jiangsu Key Laboratory of Traditional Medicine and Translational Research, China Pharmaceutical University, Nanjing, China
| | - Huali Zuo
- State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau, Taipa, Macau
| | - Jin Qi
- Jiangsu Key Laboratory of Traditional Medicine and Translational Research, China Pharmaceutical University, Nanjing, China
| | - Yuanjia Hu
- State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau, Taipa, Macau
| | - Boyang Yu
- Jiangsu Key Laboratory of Traditional Medicine and Translational Research, China Pharmaceutical University, Nanjing, China
| |
Collapse
|
9
|
BET bromodomain inhibitors: fragment-based in silico design using multi-target QSAR models. Mol Divers 2018; 23:555-572. [PMID: 30421269 DOI: 10.1007/s11030-018-9890-8] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Accepted: 10/30/2018] [Indexed: 12/17/2022]
Abstract
Epigenetics has become a focus of interest in drug discovery. In this sense, bromodomain-containing proteins have emerged as potential epigenetic targets in cancer research and other therapeutic areas. Several computational approaches have been applied to the prediction of bromodomain inhibitors. Nevertheless, such approaches have several drawbacks such as the fact that they predict activity against only one bromodomain-containing protein, using structurally related compounds. Also, there are no reports focused on meaningfully analyzing the physicochemical/structural features that are necessary for the design of a bromodomain inhibitor. This work describes the development of two different multi-target models based on quantitative structure-activity relationships (mt-QSAR) for the prediction and in silico design of multi-target bromodomain inhibitors against the proteins BRD2, BRD3, and BRD4. The first model relied on linear discriminant analysis (LDA) while the second focused on artificial neural networks. Both models exhibited accuracies higher than 85% in the dataset. Several molecular fragments were extracted, and their contributions to the inhibitory activity against the three BET proteins were calculated by the LDA model. Six molecules were designed by assembling the fragments with positive contributions, and they were predicted as multi-target BET bromodomain inhibitors by the two mt-QSAR models. Molecular docking calculations converged with the predictions performed by the mt-QSAR models, suggesting that the designed molecules can exhibit potent activity against the three BET proteins. These molecules complied with the Lipinski's rule of five.
Collapse
|
10
|
Loza-Mejía MA, Salazar JR, Sánchez-Tejeda JF. In Silico Studies on Compounds Derived from Calceolaria: Phenylethanoid Glycosides as Potential Multitarget Inhibitors for the Development of Pesticides. Biomolecules 2018; 8:E121. [PMID: 30360548 PMCID: PMC6322355 DOI: 10.3390/biom8040121] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2018] [Revised: 10/18/2018] [Accepted: 10/19/2018] [Indexed: 11/25/2022] Open
Abstract
An increasing occurrence of resistance in insect pests and high mammal toxicity exhibited by common pesticides increase the need for new alternative molecules. Among these alternatives, bioinsecticides are considered to be environmentally friendly and safer than synthetic insecticides. Particularly, plant extracts have shown great potential in laboratory conditions. However, the lack of studies that confirm their mechanisms of action diminishes their potential applications on a large scale. Previously, we have reported the insect growth regulator and insecticidal activities of secondary metabolites isolated from plants of the Calceolaria genus. Herein, we report an in silico study of compounds isolated from Calceolaria against acetylcholinesterase, prophenoloxidase, and ecdysone receptor. The molecular docking results are consistent with the previously reported experimental results, which were obtained during the bioevaluation of Calceolaria extracts. Among the compounds, phenylethanoid glycosides, such as verbascoside, exhibited good theoretical affinity to all the analyzed targets. In light of these results, we developed an index to evaluate potential multitarget insecticides based on docking scores.
Collapse
|
11
|
Designing multi-targeted agents: An emerging anticancer drug discovery paradigm. Eur J Med Chem 2017; 136:195-211. [PMID: 28494256 DOI: 10.1016/j.ejmech.2017.05.016] [Citation(s) in RCA: 144] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2017] [Revised: 04/30/2017] [Accepted: 05/04/2017] [Indexed: 12/11/2022]
Abstract
The dominant paradigm in drug discovery is to design ligands with maximum selectivity to act on individual drug targets. With the target-based approach, many new chemical entities have been discovered, developed, and further approved as drugs. However, there are a large number of complex diseases such as cancer that cannot be effectively treated or cured only with one medicine to modulate the biological function of a single target. As simultaneous intervention of two (or multiple) cancer progression relevant targets has shown improved therapeutic efficacy, the innovation of multi-targeted drugs has become a promising and prevailing research topic and numerous multi-targeted anticancer agents are currently at various developmental stages. However, most multi-pharmacophore scaffolds are usually discovered by serendipity or screening, while rational design by combining existing pharmacophore scaffolds remains an enormous challenge. In this review, four types of multi-pharmacophore modes are discussed, and the examples from literature will be used to introduce attractive lead compounds with the capability of simultaneously interfering with different enzyme or signaling pathway of cancer progression, which will reveal the trends and insights to help the design of the next generation multi-targeted anticancer agents.
Collapse
|
12
|
Mladenović M, Patsilinakos A, Pirolli A, Sabatino M, Ragno R. Understanding the Molecular Determinant of Reversible Human Monoamine Oxidase B Inhibitors Containing 2H-Chromen-2-One Core: Structure-Based and Ligand-Based Derived Three-Dimensional Quantitative Structure–Activity Relationships Predictive Models. J Chem Inf Model 2017; 57:787-814. [DOI: 10.1021/acs.jcim.6b00608] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Affiliation(s)
- Milan Mladenović
- Kragujevac Center
for Computational Biochemistry, Faculty of Science, University of Kragujevac, Radoja Domanovića 12, 34000 Kragujevac, P.O. Box 60, Serbia
| | - Alexandros Patsilinakos
- Rome Center for Molecular Design, Department of Drug
Chemistry and Technologies, Faculty of Pharmacy and Medicine, Rome Sapienza University, P.le A. Moro 5, 00185, Rome Italy
- Alchemical Dynamics srl, 00125 Rome, Italy
| | - Adele Pirolli
- Rome Center for Molecular Design, Department of Drug
Chemistry and Technologies, Faculty of Pharmacy and Medicine, Rome Sapienza University, P.le A. Moro 5, 00185, Rome Italy
- Department of Information
Technology, IRBM Science Park, Via Pontina km 30, 600, 00071 Pomezia, Rome, Italy
| | - Manuela Sabatino
- Rome Center for Molecular Design, Department of Drug
Chemistry and Technologies, Faculty of Pharmacy and Medicine, Rome Sapienza University, P.le A. Moro 5, 00185, Rome Italy
| | - Rino Ragno
- Rome Center for Molecular Design, Department of Drug
Chemistry and Technologies, Faculty of Pharmacy and Medicine, Rome Sapienza University, P.le A. Moro 5, 00185, Rome Italy
- Alchemical Dynamics srl, 00125 Rome, Italy
| |
Collapse
|
13
|
The Poitiers School of Mathematical and Theoretical Biology: Besson-Gavaudan-Schützenberger's Conjectures on Genetic Code and RNA Structures. Acta Biotheor 2016; 64:403-426. [PMID: 27592342 DOI: 10.1007/s10441-016-9287-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2016] [Accepted: 08/23/2016] [Indexed: 02/08/2023]
Abstract
The French school of theoretical biology has been mainly initiated in Poitiers during the sixties by scientists like J. Besson, G. Bouligand, P. Gavaudan, M. P. Schützenberger and R. Thom, launching many new research domains on the fractal dimension, the combinatorial properties of the genetic code and related amino-acids as well as on the genetic regulation of the biological processes. Presently, the biological science knows that RNA molecules are often involved in the regulation of complex genetic networks as effectors, e.g., activators (small RNAs as transcription factors), inhibitors (micro-RNAs) or hybrids (circular RNAs). Examples of such networks will be given showing that (1) there exist RNA "relics" that have played an important role during evolution and have survived in many genomes, whose probability distribution of their sub-sequences is quantified by the Shannon entropy, and (2) the robustness of the dynamics of the networks they regulate can be characterized by the Kolmogorov-Sinaï dynamic entropy and attractor entropy.
Collapse
|
14
|
Kleandrova VV, Ruso JM, Speck-Planche A, Dias Soeiro Cordeiro MN. Enabling the Discovery and Virtual Screening of Potent and Safe Antimicrobial Peptides. Simultaneous Prediction of Antibacterial Activity and Cytotoxicity. ACS COMBINATORIAL SCIENCE 2016; 18:490-8. [PMID: 27280735 DOI: 10.1021/acscombsci.6b00063] [Citation(s) in RCA: 67] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Antimicrobial peptides (AMPs) represent promising alternatives to fight against bacterial pathogens. However, cellular toxicity remains one of the main concerns in the early development of peptide-based drugs. This work introduces the first multitasking (mtk) computational model focused on performing simultaneous predictions of antibacterial activities, and cytotoxicities of peptides. The model was created from a data set containing 3592 cases, and it displayed accuracy higher than 96% for classifying/predicting peptides in both training and prediction (test) sets. The technique known as alanine scanning was computationally applied to illustrate the calculation of the quantitative contributions of the amino acids (in their respective positions of the sequence) to the biological effects of a defined peptide. A small library formed by 10 peptides was generated, where peptides were designed by considering the interpretations of the different descriptors in the mtk-computational model. All the peptides were predicted to exhibit high antibacterial activities against multiple bacterial strains, and low cytotoxicity against various cell types. The present mtk-computational model can be considered a very useful tool to support high throughput research for the discovery of potent and safe AMPs.
Collapse
Affiliation(s)
- Valeria V. Kleandrova
- Faculty
of Technology and Production Management, Moscow State University of Food Production, Volokolamskoe shosse 11, Moscow, Russia
| | - Juan M. Ruso
- Department
of Applied Physics, University of Santiago de Compostela (USC), 15782 Santiago de Compostela, Spain
| | - Alejandro Speck-Planche
- Department
of Applied Physics, University of Santiago de Compostela (USC), 15782 Santiago de Compostela, Spain
- LAQV@REQUIMTE,
Department of Chemistry and Biochemistry, University of Porto, 4169-007 Porto, Portugal
| | | |
Collapse
|
15
|
Qiu T, Qiu J, Feng J, Wu D, Yang Y, Tang K, Cao Z, Zhu R. The recent progress in proteochemometric modelling: focusing on target descriptors, cross-term descriptors and application scope. Brief Bioinform 2016; 18:125-136. [PMID: 26873661 DOI: 10.1093/bib/bbw004] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2015] [Revised: 12/09/2015] [Indexed: 12/17/2022] Open
Abstract
As an extension of the conventional quantitative structure activity relationship models, proteochemometric (PCM) modelling is a computational method that can predict the bioactivity relations between multiple ligands and multiple targets. Traditional PCM modelling includes three essential elements: descriptors (including target descriptors, ligand descriptors and cross-term descriptors), bioactivity data and appropriate learning functions that link the descriptors to the bioactivity data. Since its appearance, PCM modelling has developed rapidly over the past decade by taking advantage of the progress of different descriptors and machine learning techniques, along with the increasing amounts of available bioactivity data. Specifically, the new emerging target descriptors and cross-term descriptors not only significantly increased the performance of PCM modelling but also expanded its application scope from traditional protein-ligand interaction to more abundant interactions, including protein-peptide, protein-DNA and even protein-protein interactions. In this review, target descriptors and cross-term descriptors, as well as the corresponding application scope, are intensively summarized. Additionally, we look forward to seeing PCM modelling extend into new application scopes, such as Target-Catalyst-Ligand systems, with the further development of descriptors, machine learning techniques and increasing amounts of available bioactivity data.
Collapse
|
16
|
Drug-symptom networking: Linking drug-likeness screening to drug discovery. Pharmacol Res 2015; 103:105-13. [PMID: 26615785 DOI: 10.1016/j.phrs.2015.11.015] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/24/2015] [Revised: 10/26/2015] [Accepted: 11/11/2015] [Indexed: 01/19/2023]
Abstract
Understanding the relationships between drugs and symptoms has broad medical consequences, yet a comprehensive description of the drug-symptom associations is currently lacking. Here, 1441 FDA-approved drugs were collected, and PCA was used to extract 122 descriptors which explained 91% of the variance. Then, a k-means++ method was employed to partition the drug dataset into 3 clusters, and 3 corresponding SVDD models (drug-likeness screening models) were constructed with an overall accuracy of up to 95.6%. Furthermore, 6878 herbal molecules from the TcmSP™ database were screened by the above 3 SVDD model to obtain 5309 candidate drug molecules with highly accept classification of 77.19%. To assess the accuracy of the SVDD models, 8559 herbal molecule-symptom co-occurrences were mined from Pubmed abstracts, involving 697 herbal molecules and 314 symptoms. Most of the 697 herbal molecules could be found in the accepted SVDD data (5309 molecules), showing the potential of the SVDD for the screening of drug candidates. Moreover, a herbal molecule-herbal molecule network and a herbal molecule-symptom were constructed. Overall, the results provided a new drug-likeness screening approach independent to abnormal training data, and the comprehensive collection of herbal molecule-symptom associations formed a new data resource for systematic characterization of the symptom-oriented medicines.
Collapse
|
17
|
Fernandez-Lozano C, Cuiñas RF, Seoane JA, Fernández-Blanco E, Dorado J, Munteanu CR. Classification of signaling proteins based on molecular star graph descriptors using Machine Learning models. J Theor Biol 2015; 384:50-8. [DOI: 10.1016/j.jtbi.2015.07.038] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2015] [Revised: 07/20/2015] [Accepted: 07/27/2015] [Indexed: 12/11/2022]
|
18
|
Martins F, Gonçalves R, Oliveira J, Cruz-Monteagudo M, Nieto-Villar JM, Paz-y-Miño C, Rebelo I, Tejera E. Unravelling the relationship between protein sequence and low-complexity regions entropies: Interactome implications. J Theor Biol 2015; 382:320-7. [PMID: 26164061 DOI: 10.1016/j.jtbi.2015.06.049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2015] [Revised: 06/12/2015] [Accepted: 06/28/2015] [Indexed: 10/23/2022]
Abstract
Low-complexity regions are sub-sequences of biased composition in a protein sequence. The influence of these regions over protein evolution, specific functions and highly interactive capacities is well known. Although protein sequence entropy has been largely studied, its relationship with low-complexity regions and the subsequent effects on protein function remains unclear. In this work we propose a theoretical and empirical model integrating the sequence entropy with local complexity parameters. Our results indicate that the protein sequence entropy is related with the protein length, the entropies inside and outside the low-complexity regions as well as their number and average size. We found a small but significant increment in the sequence entropy of hubs proteins. In agreement with our theoretical model, this increment is highly dependent of the balance between the increment of protein length and average size of the low-complexity regions. Finally, our models and proteins analysis provide evidence supporting that modifications in the average size is more relevant in hubs proteins than changes in the number of low-complexity regions.
Collapse
Affiliation(s)
- F Martins
- Department of Biochemistry, Faculty of Pharmacy, University of Porto, Portugal
| | - R Gonçalves
- Department of Biochemistry, Faculty of Pharmacy, University of Porto, Portugal
| | - J Oliveira
- Department of Biochemistry, Faculty of Pharmacy, University of Porto, Portugal
| | - M Cruz-Monteagudo
- Instituto de Investigaciones Biomédicas, Universidad de las Américas, Quito, Ecuador
| | - J M Nieto-Villar
- Dpto. de Química-Física, Fac. de Química, Universidad de La Habana, Cuba. Cátedra de Sistemas Complejos "H. Poincaré", Universidad de La Habana, Cuba
| | - C Paz-y-Miño
- Instituto de Investigaciones Biomédicas, Universidad de las Américas, Quito, Ecuador
| | - I Rebelo
- Department of Biochemistry, Faculty of Pharmacy, University of Porto, Portugal; UCIBIO@REQUIMTE, Portugal.
| | - E Tejera
- Instituto de Investigaciones Biomédicas, Universidad de las Américas, Quito, Ecuador
| |
Collapse
|
19
|
Speck-Planche A, Cordeiro MNDS. Multitasking models for quantitative structure–biological effect relationships: current status and future perspectives to speed up drug discovery. Expert Opin Drug Discov 2015; 10:245-56. [DOI: 10.1517/17460441.2015.1006195] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
|
20
|
Cao DS, Zhang LX, Tan GS, Xiang Z, Zeng WB, Xu QS, Chen AF. Computational Prediction of DrugTarget Interactions Using Chemical, Biological, and Network Features. Mol Inform 2014; 33:669-81. [PMID: 27485302 DOI: 10.1002/minf.201400009] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2014] [Accepted: 04/22/2014] [Indexed: 02/02/2023]
Abstract
Drugtarget interactions (DTIs) are central to current drug discovery processes. Efforts have been devoted to the development of methodology for predicting DTIs and drugtarget interaction networks. Most existing methods mainly focus on the application of information about drug or protein structure features. In the present work, we proposed a computational method for DTI prediction by combining the information from chemical, biological and network properties. The method was developed based on a learning algorithm-random forest (RF) combined with integrated features for predicting DTIs. Four classes of drugtarget interaction networks in humans involving enzymes, ion channels, G-protein-coupled receptors (GPCRs) and nuclear receptors, are independently used for establishing predictive models. The RF models gave prediction accuracy of 93.52 %, 94.84 %, 89.68 % and 84.72 % for four pharmaceutically useful datasets, respectively. The prediction ability of our approach is comparative to or even better than that of other DTI prediction methods. These comparative results demonstrated the relevance of the network topology as source of information for predicting DTIs. Further analysis confirmed that among our top ranked predictions of DTIs, several DTIs are supported by databases, while the others represent novel potential DTIs. We believe that our proposed approach can help to limit the search space of DTIs and provide a new way towards repositioning old drugs and identifying targets.
Collapse
Affiliation(s)
- Dong-Sheng Cao
- School of Pharmaceutical Sciences, Central South University, Changsha, 410013, P.R. China.
| | - Liu-Xia Zhang
- The 163rdHospital of The Chinese People's Liberation Army, Changsha 410003, P.R. China
| | - Gui-Shan Tan
- School of Pharmaceutical Sciences, Central South University, Changsha, 410013, P.R. China
| | - Zheng Xiang
- School of Pharmaceutical Sciences, Wenzhou Medical University, Wenzhou 325035, P.R. China
| | - Wen-Bin Zeng
- School of Pharmaceutical Sciences, Central South University, Changsha, 410013, P.R. China
| | - Qing-Song Xu
- School of Mathematics and Statistics, Central South University, Changsha 410083, P.R. China
| | - Alex F Chen
- School of Pharmaceutical Sciences, Central South University, Changsha, 410013, P.R. China.
| |
Collapse
|
21
|
González-Díaz H, Herrera-Ibatá DM, Duardo-Sánchez A, Munteanu CR, Orbegozo-Medina RA, Pazos A. ANN Multiscale Model of Anti-HIV Drugs Activity vs AIDS Prevalence in the US at County Level Based on Information Indices of Molecular Graphs and Social Networks. J Chem Inf Model 2014; 54:744-55. [DOI: 10.1021/ci400716y] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Humberto González-Díaz
- Department
of Organic Chemistry II, Faculty of Science and Technology, University of the Basque Country UPV/EHU, 48940, Leioa, Vizcaya, Spain
- IKERBASQUE, Basque
Foundation for Science, 48011, Bilbao, Vizcaya, Spain
| | - Diana María Herrera-Ibatá
- Department of Information and Communication Technologies, University of A Coruña UDC, 15071, A Coruña, A Coruña, Spain
| | - Aliuska Duardo-Sánchez
- Department of Information and Communication Technologies, University of A Coruña UDC, 15071, A Coruña, A Coruña, Spain
| | - Cristian R. Munteanu
- Department of Information and Communication Technologies, University of A Coruña UDC, 15071, A Coruña, A Coruña, Spain
| | - Ricardo Alfredo Orbegozo-Medina
- Department
of Microbiology and Parasitology, University of Santiago de Compostela (USC), 15782, Santiago de Compostela, A Coruña, Spain
| | - Alejandro Pazos
- Department of Information and Communication Technologies, University of A Coruña UDC, 15071, A Coruña, A Coruña, Spain
| |
Collapse
|
22
|
Speck-Planche A, Cordeiro MNDS. Simultaneous virtual prediction of anti-Escherichia coli activities and ADMET profiles: A chemoinformatic complementary approach for high-throughput screening. ACS COMBINATORIAL SCIENCE 2014; 16:78-84. [PMID: 24383958 DOI: 10.1021/co400115s] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Escherichia coli remains one of the principal pathogens that cause nosocomial infections, medical conditions that are increasingly common in healthcare facilities. E. coli is intrinsically resistant to many antibiotics, and multidrug-resistant strains have emerged recently. Chemoinformatics has been a great ally of experimental methodologies such as high-throughput screening, playing an important role in the discovery of effective antibacterial agents. However, there is no approach that can design safer anti-E. coli agents, because of the multifactorial nature and complexity of bacterial diseases and the lack of desirable ADMET (absorption, distribution, metabolism, elimination, and toxicity) profiles as a major cause of disapproval of drugs. In this work, we introduce the first multitasking model based on quantitative-structure biological effect relationships (mtk-QSBER) for simultaneous virtual prediction of anti-E. coli activities and ADMET properties of drugs and/or chemicals under many experimental conditions. The mtk-QSBER model was developed from a large and heterogeneous data set of more than 37800 cases, exhibiting overall accuracies of >95% in both training and prediction (validation) sets. The utility of our mtk-QSBER model was demonstrated by performing virtual prediction of properties for the investigational drug avarofloxacin (AVX) under 260 different experimental conditions. Results converged with the experimental evidence, confirming the remarkable anti-E. coli activities and safety of AVX. Predictions also showed that our mtk-QSBER model can be a promising computational tool for virtual screening of desirable anti-E. coli agents, and this chemoinformatic approach could be extended to the search for safer drugs with defined pharmacological activities.
Collapse
Affiliation(s)
- Alejandro Speck-Planche
- REQUIMTE/Department of Chemistry
and Biochemistry, University of Porto, 4169-007 Porto, Portugal
| | - M. N. D. S. Cordeiro
- REQUIMTE/Department of Chemistry
and Biochemistry, University of Porto, 4169-007 Porto, Portugal
| |
Collapse
|
23
|
Ziarek JJ, Liu Y, Smith E, Zhang G, Peterson FC, Chen J, Yu Y, Chen Y, Volkman BF, Li R. Fragment-based optimization of small molecule CXCL12 inhibitors for antagonizing the CXCL12/CXCR4 interaction. Curr Top Med Chem 2013; 12:2727-40. [PMID: 23368099 DOI: 10.2174/1568026611212240003] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2012] [Revised: 10/08/2012] [Accepted: 11/03/2012] [Indexed: 12/21/2022]
Abstract
The chemokine CXCL12 and its G protein-coupled receptor (GPCR) CXCR4 are high-priority clinical targets because of their involvement in metastatic cancers (also implicated in autoimmune disease and cardiovascular disease). Because chemokines interact with two distinct sites to bind and activate their receptors, both the GPCRs and chemokines are potential targets for small molecule inhibition. A number of chemokines have been validated as targets for drug development, but virtually all drug discovery efforts focus on the GPCRs. However, all CXCR4 receptor antagonists with the exception of MSX-122 have failed in clinical trials due to unmanageable toxicities, emphasizing the need for alternative strategies to interfere with CXCL12/CXCR4-guided metastatic homing. Although targeting the relatively featureless surface of CXCL12 was presumed to be challenging, focusing efforts at the sulfotyrosine (sY) binding pockets proved successful for procuring initial hits. Using a hybrid structure-based in silico/NMR screening strategy, we recently identified a ligand that occludes the receptor recognition site. From this initial hit, we designed a small fragment library containing only nine tetrazole derivatives using a fragment-based and bioisostere approach to target the sY binding sites of CXCL12. Compound binding modes and affinities were studied by 2D NMR spectroscopy, X-ray crystallography, molecular docking and cell-based functional assays. Our results demonstrate that the sY binding sites are conducive to the development of high affinity inhibitors with better ligand efficiency (LE) than typical protein-protein interaction inhibitors (LE ≤ 0.24). Our novel tetrazole-based fragment 18 was identified to bind the sY21 site with a K(d) of 24 μM (LE = 0.30). Optimization of 18 yielded compound 25 which specifically inhibits CXCL12-induced migration with an improvement in potency over the initial hit 9. The fragment from this library that exhibited the highest affinity and ligand efficiency (11: K(d) = 13 μM, LE = 0.33) may serve as a starting point for development of inhibitors targeting the sY12 site.
Collapse
Affiliation(s)
- Joshua J Ziarek
- Department of Biochemistry, Medical College of Wisconsin, 8701 Watertown Plank Road, Milwaukee, WI 53226, USA
| | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Garg G, Bernal D, Trelis M, Forment J, Ortiz J, Valero ML, Pedrola L, Martinez-Blanch J, Esteban JG, Ranganathan S, Toledo R, Marcilla A. The transcriptome of Echinostoma caproni adults: Further characterization of the secretome and identification of new potential drug targets. J Proteomics 2013; 89:202-14. [DOI: 10.1016/j.jprot.2013.06.017] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2013] [Revised: 06/04/2013] [Accepted: 06/09/2013] [Indexed: 02/01/2023]
|
25
|
Cao DS, Zhou GH, Liu S, Zhang LX, Xu QS, He M, Liang YZ. Large-scale prediction of human kinase-inhibitor interactions using protein sequences and molecular topological structures. Anal Chim Acta 2013; 792:10-8. [PMID: 23910962 DOI: 10.1016/j.aca.2013.07.003] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2013] [Revised: 07/03/2013] [Accepted: 07/04/2013] [Indexed: 01/28/2023]
Abstract
The kinase family is one of the largest target families in the human genome. The family's key function in signal transduction for all organisms makes it a very attractive target class for the therapeutic interventions in many diseases states such as cancer, diabetes, inflammation and arthritis. A first step toward accelerating kinase drug discovery process is to fast identify whether a chemical and a kinase interact or not. Experimentally, these interactions can be identified by in vitro binding assay - an expensive and laborious procedure that is not applicable on a large scale. Therefore, there is an urgent need to develop statistically efficient approaches for identifying kinase-inhibitor interactions. For the first time, the quantitative binding affinities of kinase-inhibitor pairs are differentiated as a measurement to define if an inhibitor interacts with a kinase, and then a chemogenomics framework using an unbiased set of general integrated features (drug descriptors and protein descriptors) and random forest (RF) is employed to construct a predictive model which can accurately classify kinase-inhibitor pairs. Our results show that RF with integrated features gave prediction accuracy of 93.76%, sensitivity of 92.26%, and specificity of 95.27%, respectively. The results are superior to those by only considering two separated spaces (chemical space and protein space), demonstrating that these integrated features contribute cooperatively. Based on the constructed model, we provided a high confidence list of drug-target associations for subsequent experimental investigation guidance at a low false discovery rate.
Collapse
Affiliation(s)
- Dong-Sheng Cao
- School of Pharmaceutical Sciences, Central South University, Changsha 410013, PR China.
| | | | | | | | | | | | | |
Collapse
|
26
|
Predicting Drug-Target Interactions for New Drug Compounds Using a Weighted Nearest Neighbor Profile. PLoS One 2013; 8:e66952. [PMID: 23840562 PMCID: PMC3694117 DOI: 10.1371/journal.pone.0066952] [Citation(s) in RCA: 161] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2013] [Accepted: 05/13/2013] [Indexed: 11/19/2022] Open
Abstract
In silico discovery of interactions between drug compounds and target proteins is of core importance for improving the efficiency of the laborious and costly experimental determination of drug-target interaction. Drug-target interaction data are available for many classes of pharmaceutically useful target proteins including enzymes, ion channels, GPCRs and nuclear receptors. However, current drug-target interaction databases contain a small number of drug-target pairs which are experimentally validated interactions. In particular, for some drug compounds (or targets) there is no available interaction. This motivates the need for developing methods that predict interacting pairs with high accuracy also for these 'new' drug compounds (or targets). We show that a simple weighted nearest neighbor procedure is highly effective for this task. We integrate this procedure into a recent machine learning method for drug-target interaction we developed in previous work. Results of experiments indicate that the resulting method predicts true interactions with high accuracy also for new drug compounds and achieves results comparable or better than those of recent state-of-the-art algorithms. Software is publicly available at http://cs.ru.nl/~tvanlaarhoven/drugtarget2013/.
Collapse
|
27
|
Chemoinformatics for rational discovery of safe antibacterial drugs: simultaneous predictions of biological activity against streptococci and toxicological profiles in laboratory animals. Bioorg Med Chem 2013; 21:2727-32. [PMID: 23582445 DOI: 10.1016/j.bmc.2013.03.015] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2012] [Revised: 03/03/2013] [Accepted: 03/04/2013] [Indexed: 11/21/2022]
Abstract
Streptococci are a group of Gram-positive bacteria which are responsible for causing many diverse diseases in humans and other animals worldwide. The high prevalence of resistance of these bacteria to current antibacterial drugs is an alarming problem for the scientific community. The battle against streptococci by using antimicrobial chemotherapies will depend on the design of new chemicals with high inhibitory activity, having also as low toxicity as possible. Multi-target approaches based on quantitative-structure activity relationships (mt-QSAR) have played a very important role, providing a better knowledge about the molecular patterns related with the appearance of different pharmacological profiles including antimicrobial activity. Until now, almost all mt-QSAR models have considered the study of biological activity or toxicity separately. In the present study, we develop by the first time, a unified multitasking (mtk) QSAR model for the simultaneous prediction of anti-streptococci activity and toxic effects against biological models like Mus musculus and Rattus norvegicus. The mtk-QSAR model was created by using artificial neural networks (ANN) analysis for the classification of compounds as positive (high biological activity and/or low toxicity) or negative (otherwise) under diverse sets of experimental conditions. Our mtk-QSAR model, correctly classified more than 97% of the cases in the whole database (more than 11,500 cases), serving as a promising tool for the virtual screening of potent and safe anti-streptococci drugs.
Collapse
|
28
|
Cao DS, Xu QS, Hu QN, Liang YZ. ChemoPy: freely available python package for computational biology and chemoinformatics. ACTA ACUST UNITED AC 2013; 29:1092-4. [PMID: 23493324 DOI: 10.1093/bioinformatics/btt105] [Citation(s) in RCA: 132] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
MOTIVATION Molecular representation for small molecules has been routinely used in QSAR/SAR, virtual screening, database search, ranking, drug ADME/T prediction and other drug discovery processes. To facilitate extensive studies of drug molecules, we developed a freely available, open-source python package called chemoinformatics in python (ChemoPy) for calculating the commonly used structural and physicochemical features. It computes 16 drug feature groups composed of 19 descriptors that include 1135 descriptor values. In addition, it provides seven types of molecular fingerprint systems for drug molecules, including topological fingerprints, electro-topological state (E-state) fingerprints, MACCS keys, FP4 keys, atom pairs fingerprints, topological torsion fingerprints and Morgan/circular fingerprints. By applying a semi-empirical quantum chemistry program MOPAC, ChemoPy can also compute a large number of 3D molecular descriptors conveniently. AVAILABILITY The python package, ChemoPy, is freely available via http://code.google.com/p/pychem/downloads/list, and it runs on Linux and MS-Windows. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Dong-Sheng Cao
- Research Center of Modernization of Traditional Chinese Medicines, Central South University, Changsha, P. R. China
| | | | | | | |
Collapse
|
29
|
Speck-Planche A, Kleandrova VV, Cordeiro MND. New insights toward the discovery of antibacterial agents: Multi-tasking QSBER model for the simultaneous prediction of anti-tuberculosis activity and toxicological profiles of drugs. Eur J Pharm Sci 2013; 48:812-8. [DOI: 10.1016/j.ejps.2013.01.011] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2012] [Revised: 01/05/2013] [Accepted: 01/23/2013] [Indexed: 01/11/2023]
|
30
|
Large-scale prediction of drug–target interactions using protein sequences and drug topological structures. Anal Chim Acta 2012; 752:1-10. [DOI: 10.1016/j.aca.2012.09.021] [Citation(s) in RCA: 74] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2012] [Revised: 09/10/2012] [Accepted: 09/14/2012] [Indexed: 11/19/2022]
|
31
|
Mor S, Pahal P, Narasimhan B. Synthesis, characterization, biological evaluation and QSAR studies of 11-p-substituted phenyl-12-phenyl-11a,12-dihydro-11H-indeno[2,1-c][1,5]benzothiazepines as potential antimicrobial agents. Eur J Med Chem 2012; 57:196-210. [DOI: 10.1016/j.ejmech.2012.09.003] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2012] [Revised: 08/31/2012] [Accepted: 09/03/2012] [Indexed: 10/27/2022]
|
32
|
Anusuya S, Natarajan J. Multi-targeted therapy for leprosy: insilico strategy to overcome multi drug resistance and to improve therapeutic efficacy. INFECTION GENETICS AND EVOLUTION 2012; 12:1899-910. [PMID: 22981928 DOI: 10.1016/j.meegid.2012.08.013] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2012] [Revised: 08/01/2012] [Accepted: 08/17/2012] [Indexed: 02/02/2023]
Abstract
Leprosy remains a major public health problem, since single and multi-drug resistance has been reported worldwide over the last two decades. In the present study, we report the novel multi-targeted therapy for leprosy to overcome multi drug resistance and to improve therapeutic efficacy. If multiple enzymes of an essential metabolic pathway of a bacterium were targeted, then the therapy would become more effective and can prevent the occurrence of drug resistance. The MurC, MurD, MurE and MurF enzymes of peptidoglycan biosynthetic pathway were selected for multi targeted therapy. The conserved or class specific active site residues important for function or stability were predicted using evolutionary trace analysis and site directed mutagenesis studies. Ten such residues which were present in at least any three of the four Mur enzymes (MurC, MurD, MurE and MurF) were identified. Among the ten residues G125, K126, T127 and G293 (numbered based on their position in MurC) were found to be conserved in all the four Mur enzymes of the entire bacterial kingdom. In addition K143, T144, T166, G168, H234 and Y329 (numbered based on their position in MurE) were significant in binding substrates and/co-factors needed for the functional events in any three of the Mur enzymes. These are the probable residues for designing newer anti-leprosy drugs in an attempt to reduce drug resistance.
Collapse
Affiliation(s)
- Shanmugam Anusuya
- Department of Bioinformatics, VMKV Engineering College, Vinayaka Missions University, Salem 636 308, India.
| | | |
Collapse
|
33
|
Information properties of naturally-occurring proteins: Fourier analysis and complexity phase plots. Protein J 2012; 31:550-63. [PMID: 22814572 DOI: 10.1007/s10930-012-9432-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
In previous work from this lab, the information in natural proteins was investigated with Ribonuclease A (RNase A) serving as the source. The signature traits were investigated at three structure levels: primary through tertiary. The present paper travels further by charting the primary structure information of about half a million molecules. This was feasible given abundant sequence archives for both living and viral systems. Notably, a method is presented for evaluating primary structure information, based on Fourier analysis and spectral complexity. Significantly, the results show certain complexity traits to be universal for living sources. Viruses, by contrast, encode protein collections which are case-specific and complexity-divergent. The results have ramifications for discriminating collections on the basis of sequence information. This discrimination offers new strategies for selecting drug targets.
Collapse
|
34
|
Gálvez J, Gálvez-Llompart M, García-Domenech R. Molecular topology as a novel approach for drug discovery. Expert Opin Drug Discov 2012; 7:133-53. [PMID: 22468915 DOI: 10.1517/17460441.2012.652083] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
INTRODUCTION Molecular topology (MT) has emerged in recent years as a powerful approach for the in silico generation of new drugs. One key part of MT is that, in the process of drug design/discovery, there is no need for an explicit knowledge of a drug's mechanism of action unlike other drug discovery methods. AREAS COVERED In this review, the authors introduce the topic by explaining briefly the most common methodology used today in drug design/discovery and address the most important concepts of MT and the methodology followed (QSAR equations, LDA, etc.). Furthermore, the significant results achieved, from this approach, are outlined and discussed. EXPERT OPINION The results outlined herein can be explained by considering that MT represents a new paradigm in the field of drug design. This means that it is not only an alternative method to the conventional methods, but it is also independent, that is, it represents a pathway to connect directly molecular structure with the experimental properties of the compounds (particularly drugs). Moreover, the process can be realized also in the reverse pathway, that is, designing new molecules from their topological pattern, what opens almost limitless expectations in new drugs development, given that the virtual universe of molecules is much greater than that of the existing ones.
Collapse
Affiliation(s)
- Jorge Gálvez
- University of Valencia Avd, Department of Physical Chemistry, Molecular Connectivity and Drug Design Research Unit, Valencia, Spain.
| | | | | |
Collapse
|
35
|
2D MI-DRAGON: A new predictor for protein–ligands interactions and theoretic-experimental studies of US FDA drug-target network, oxoisoaporphine inhibitors for MAO-A and human parasite proteins. Eur J Med Chem 2011; 46:5838-51. [DOI: 10.1016/j.ejmech.2011.09.045] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2011] [Revised: 09/22/2011] [Accepted: 09/26/2011] [Indexed: 11/20/2022]
|