1
|
Metsänen TT, Lexa KW, Santiago CB, Chung CK, Xu Y, Liu Z, Humphrey GR, Ruck RT, Sherer EC, Sigman MS. Combining traditional 2D and modern physical organic-derived descriptors to predict enhanced enantioselectivity for the key aza-Michael conjugate addition in the synthesis of Prevymis™ (letermovir). Chem Sci 2018; 9:6922-6927. [PMID: 30210766 PMCID: PMC6124913 DOI: 10.1039/c8sc02089b] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2018] [Accepted: 07/17/2018] [Indexed: 01/19/2023] Open
Abstract
Quantitative structure-activity relationships have an extensive history for optimizing drug candidates, yet they have only recently been applied in reaction development. In this report, the predictive power of multivariate parameterization has been explored toward the optimization of a catalyst promoting an aza-Michael conjugate addition for the asymmetric synthesis of letermovir. A hybrid approach combining 2D QSAR and modern 3D physical organic parameters performed better than either approach in isolation. Using these predictive models, a series of new catalysts were identified, which catalyzed the reaction to provide the desired product in improved enantioselectivity relative to the parent catalyst.
Collapse
Affiliation(s)
- Toni T Metsänen
- Department of Chemistry , University of Utah , 315 South 1400 East , Salt Lake City , Utah 84112 , USA .
| | - Katrina W Lexa
- Modeling and Informatics , MRL , Merck Sharp & Dohme , Rahway , New Jersey 07065 , USA .
| | - Celine B Santiago
- Department of Chemistry , University of Utah , 315 South 1400 East , Salt Lake City , Utah 84112 , USA .
| | - Cheol K Chung
- Process Research and Development , MRL , Merck Sharp & Dohme , Rahway , New Jersey 07065 , USA
| | - Yingju Xu
- Process Research and Development , MRL , Merck Sharp & Dohme , Rahway , New Jersey 07065 , USA
| | - Zhijian Liu
- Process Research and Development , MRL , Merck Sharp & Dohme , Rahway , New Jersey 07065 , USA
| | - Guy R Humphrey
- Process Research and Development , MRL , Merck Sharp & Dohme , Rahway , New Jersey 07065 , USA
| | - Rebecca T Ruck
- Process Research and Development , MRL , Merck Sharp & Dohme , Rahway , New Jersey 07065 , USA
| | - Edward C Sherer
- Modeling and Informatics , MRL , Merck Sharp & Dohme , Rahway , New Jersey 07065 , USA .
| | - Matthew S Sigman
- Department of Chemistry , University of Utah , 315 South 1400 East , Salt Lake City , Utah 84112 , USA .
| |
Collapse
|
2
|
Chemoinformatics for medicinal chemistry: in silico model to enable the discovery of potent and safer anti-cocci agents. Future Med Chem 2014; 6:2013-28. [DOI: 10.4155/fmc.14.136] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Background: Gram-positive cocci are increasingly antibiotic-resistant bacteria responsible for causing serious diseases. Chemoinformatics can help to rationalize the discovery of more potent and safer antibacterial drugs. We have developed a chemoinformatic model for simultaneous prediction of anti-cocci activities, and profiles involving absorption, distribution, metabolism, elimination and toxicity (ADMET). Results: A dataset containing 48,874 cases from many different chemicals assayed under dissimilar experimental conditions was created. The best model displayed accuracies around 93% in both training and prediction (test) sets. Quantitative contributions of several fragments to the biological effects were calculated and analyzed. Multiple biological effects of the investigational drug JNJ-Q2 were correctly predicted. Conclusion: Our chemoinformatic model can be used as powerful tool for virtual screening of promising anti-cocci agents.
Collapse
|
3
|
Model for vaccine design by prediction of B-epitopes of IEDB given perturbations in peptide sequence, in vivo process, experimental techniques, and source or host organisms. J Immunol Res 2014; 2014:768515. [PMID: 24741624 PMCID: PMC3987976 DOI: 10.1155/2014/768515] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2013] [Accepted: 11/17/2013] [Indexed: 11/17/2022] Open
Abstract
Perturbation methods add variation terms to a known experimental solution of one problem to approach a solution for a related problem without known exact solution. One problem of this type in immunology is the prediction of the possible action of epitope of one peptide after a perturbation or variation in the structure of a known peptide and/or other boundary conditions (host organism, biological process, and experimental assay). However, to the best of our knowledge, there are no reports of general-purpose perturbation models to solve this problem. In a recent work, we introduced a new quantitative structure-property relationship theory for the study of perturbations in complex biomolecular systems. In this work, we developed the first model able to classify more than 200,000 cases of perturbations with accuracy, sensitivity, and specificity >90% both in training and validation series. The perturbations include structural changes in >50000 peptides determined in experimental assays with boundary conditions involving >500 source organisms, >50 host organisms, >10 biological process, and >30 experimental techniques. The model may be useful for the prediction of new epitopes or the optimization of known peptides towards computational vaccine design.
Collapse
|
4
|
Duardo-Sánchez A, Munteanu CR, Riera-Fernández P, López-Díaz A, Pazos A, González-Díaz H. Modeling Complex Metabolic Reactions, Ecological Systems, and Financial and Legal Networks with MIANN Models Based on Markov-Wiener Node Descriptors. J Chem Inf Model 2013; 54:16-29. [DOI: 10.1021/ci400280n] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Affiliation(s)
- Aliuska Duardo-Sánchez
- Department
of Information and Communication Technologies, Computer Science Faculty, University of A Coruña, Campus de Elviña, 15071, A Coruña, A Coruña, Spain
- Department of Special Public Law, Financial
and Tributary Law Area, Faculty of Law, University of Santiago de Compostela (USC), 15782, Santiago de Compostela, A Coruña, Spain
| | - Cristian R. Munteanu
- Department
of Information and Communication Technologies, Computer Science Faculty, University of A Coruña, Campus de Elviña, 15071, A Coruña, A Coruña, Spain
| | - Pablo Riera-Fernández
- Department
of Information and Communication Technologies, Computer Science Faculty, University of A Coruña, Campus de Elviña, 15071, A Coruña, A Coruña, Spain
| | - Antonio López-Díaz
- Department of Special Public Law, Financial
and Tributary Law Area, Faculty of Law, University of Santiago de Compostela (USC), 15782, Santiago de Compostela, A Coruña, Spain
| | - Alejandro Pazos
- Department
of Information and Communication Technologies, Computer Science Faculty, University of A Coruña, Campus de Elviña, 15071, A Coruña, A Coruña, Spain
| | - Humberto González-Díaz
- Department of Organic Chemistry II, Faculty of Science and Technology, University of the Basque Country (UPV/EHU), 48940, Leioa, Bizkaia, Spain
- IKERBASQUE, Basque
Foundation for Science, 48011, Bilbao, Biscay, Spain
| |
Collapse
|
5
|
Exploring the adenylation domain repertoire of nonribosomal peptide synthetases using an ensemble of sequence-search methods. PLoS One 2013; 8:e65926. [PMID: 23874386 PMCID: PMC3712989 DOI: 10.1371/journal.pone.0065926] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2012] [Accepted: 05/01/2013] [Indexed: 11/24/2022] Open
Abstract
The introduction of two-dimension (2D) graphs and their numerical characterization for comparative analyses of DNA/RNA and protein sequences without the need of sequence alignments is an active yet recent research topic in bioinformatics. Here, we used a 2D artificial representation (four-color maps) with a simple numerical characterization through topological indices (TIs) to aid the discovering of remote homologous of Adenylation domains (A-domains) from the Nonribosomal Peptide Synthetases (NRPS) class in the proteome of the cyanobacteria Microcystis aeruginosa. Cyanobacteria are a rich source of structurally diverse oligopeptides that are predominantly synthesized by NPRS. Several A-domains share amino acid identities lower than 20 % being a possible source of remote homologous. Therefore, A-domains cannot be easily retrieved by BLASTp searches using a single template. To cope with the sequence diversity of the A-domains we have combined homology-search methods with an alignment-free tool that uses protein four-color-maps. TI2BioP (Topological Indices toBioPolymers) version 2.0, available at http://ti2biop.sourceforge.net/ allowed the calculation of simple TIs from the protein sequences (four-color maps). Such TIs were used as input predictors for the statistical estimations required to build the alignment-free models. We concluded that the use of graphical/numerical approaches in cooperation with other sequence search methods, like multi-templates BLASTp and profile HMM, can give the most complete exploration of the repertoire of highly diverse protein families.
Collapse
|
6
|
GUPTA MONIKA, MADAN AK. Diverse models for the prediction of CDK4 inhibitory activity of substituted 4-aminomethylene isoquinoline-1, 3-diones. J CHEM SCI 2013. [DOI: 10.1007/s12039-013-0410-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
7
|
The Rücker–Markov invariants of complex Bio-Systems: Applications in Parasitology and Neuroinformatics. Biosystems 2013; 111:199-207. [DOI: 10.1016/j.biosystems.2013.02.006] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2013] [Accepted: 02/11/2013] [Indexed: 11/23/2022]
|
8
|
Yuan J, Pu Y, Yin L. Predicting carcinogenicity and understanding the carcinogenic mechanism of N-nitroso compounds using a TOPS-MODE approach. Chem Res Toxicol 2011; 24:2269-79. [PMID: 22084901 DOI: 10.1021/tx2004097] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
A linear discriminant analysis (LDA) coupled with an enhanced replacement method (ERM) was used as an alternative method to predict the carcinogenicity of N-nitroso compounds (NOCs) in rats. This presented LDA based on the topological substructural molecular descriptors (TOPS-MODE) approach was developed to predict the carcinogenic and noncarcinogenic activity on a data set of 111 NOCs with a good classification value of 90.1%. The predictive power of the LDA model was validated through an external validation set (37 compounds) with a prediction accuracy of 94.6% and a leave-one-out cross-validation procedure (LOOCV) with a good prediction of 86.5%. This methodology showed that the TOPS-MODE descriptors weighted, respectively, by bond dipole moment and Abraham solute descriptor dipolarity/polarizability affected the NOC carcinogenicity. The contributions of certain bonds and fragments to carcinogenicity were used to assess biotransformation and carcinogenic mechanisms. The positive contribution of the carbon-nitrogen single bond (between the N-nitroso group and α-carbon to the N-nitroso group) indicated that the α-hydroxylation reaction could occur at the α-carbon or otherwise not occur. Similarly, the contributions from the molecular fragment could be applied to indicate whether the fragments generated an alkylating agent. These results suggested that this approach could discriminate between carcinogenic and noncarcinogenic NOCs, thereby providing insight into the structural features and chemical factors related to NOC carcinogenicity.
Collapse
Affiliation(s)
- Jintao Yuan
- Key Laboratory of Environmental Medicine Engineering, Ministry of Education, School of Public Health, Southeast University, Nanjing 210009, China
| | | | | |
Collapse
|
9
|
Agüero-Chapin G, Pérez-Machado G, Molina-Ruiz R, Pérez-Castillo Y, Morales-Helguera A, Vasconcelos V, Antunes A. TI2BioP: Topological Indices to BioPolymers. Its practical use to unravel cryptic bacteriocin-like domains. Amino Acids 2010; 40:431-42. [PMID: 20563611 DOI: 10.1007/s00726-010-0653-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2010] [Accepted: 06/02/2010] [Indexed: 02/04/2023]
Abstract
Bacteriocins are proteinaceous toxins produced and exported by both gram-negative and gram-positive bacteria as a defense mechanism. The bacteriocin protein family is highly diverse, which complicates the identification of bacteriocin-like sequences using alignment approaches. The use of topological indices (TIs) irrespective of sequence similarity can be a promising alternative to predict proteinaceous bacteriocins. Thus, we present Topological Indices to BioPolymers (TI2BioP) as an alignment-free approach inspired in both the Topological Substructural Molecular Design (TOPS-MODE) and Markov Chain Invariants for Network Selection and Design (MARCH-INSIDE) methodology. TI2BioP allows the calculation of the spectral moments as simple TIs to seek quantitative sequence-function relationships (QSFR) models. Since hydrophobicity and basicity are major criteria for the bactericide activity of bacteriocins, the spectral moments ((HP)μ(k)) were derived for the first time from protein artificial secondary structures based on amino acid clustering into a Cartesian system of hydrophobicity and polarity. Several orders of (HP)μ(k) characterized numerically 196 bacteriocin-like sequences and a control group made up of 200 representative CATH domains. Subsequently, they were used to develop an alignment-free QSFR model allowing a 76.92% discrimination of bacteriocin proteins from other domains, a relevant result considering the high sequence diversity among the members of both groups. The model showed a prediction overall performance of 72.16%, detecting specifically 66.7% of proteinaceous bacteriocins whereas the InterProScan retrieved just 60.2%. As a practical validation, the model also predicted successfully the cryptic bactericide function of the Cry 1Ab C-terminal domain from Bacillus thuringiensis's endotoxin, which has not been detected by classical alignment methods.
Collapse
Affiliation(s)
- Guillermín Agüero-Chapin
- CIMAR/CIIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Rua dos Bragas, 177, 4050-123, Porto, Portugal
| | | | | | | | | | | | | |
Collapse
|
10
|
Vilar S, Karpiak J, Costanzi S. Ligand and structure-based models for the prediction of ligand-receptor affinities and virtual screenings: Development and application to the beta(2)-adrenergic receptor. J Comput Chem 2010; 31:707-20. [PMID: 19569204 PMCID: PMC2818076 DOI: 10.1002/jcc.21346] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In this study, we evaluated the applicability of ligand-based and structure-based models to quantitative affinity predictions and virtual screenings for ligands of the beta(2)-adrenergic receptor, a G protein-coupled receptor (GPCR). We also devised and evaluated a number of consensus models obtained through partial least square regressions, to combine the strengths of the individual components. In all cases, the bioactive conformation of each ligand was derived from molecular docking at the crystal structure of the receptor. We identified the most effective models applicable to the different scenarios, in the presence or in the absence of a training set. For ranking the affinity of closely related analogs when a training set is available, a ligand-based consensus model (LI-CM) seems to be the best choice, while the structure-based MM-GBSA score seems the best alternative in the absence of a training set. For virtual screening purposes, the structure-based MM-GBSA score was found to be the method of choice. Consensus models consistently had performances superior or close to those of the best individual components, and were endowed with a significantly increased robustness. Given multiple models with no a priori knowledge of their predictive capabilities, constructing a consensus model ensures results very close to those that the best model alone would have yielded.
Collapse
Affiliation(s)
- Santiago Vilar
- Laboratory of Biological Modeling, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, DHHS, Bethesda, MD 20892, USA
| | - Joel Karpiak
- Laboratory of Biological Modeling, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, DHHS, Bethesda, MD 20892, USA
| | - Stefano Costanzi
- Laboratory of Biological Modeling, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, DHHS, Bethesda, MD 20892, USA
| |
Collapse
|
11
|
Semmar N. A New Mixture Design-Based Approach to Graphical Screening of Potential Interconnections and Variability Processes in Metabolic Systems. Chem Biol Drug Des 2010; 75:91-105. [DOI: 10.1111/j.1747-0285.2009.00912.x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
12
|
Castillo-Garit J, Marrero-Ponce Y, Torrens F, García-Domenech R, Rodríguez-Borges J. Applications of Bond-Based 3D-Chiral Quadratic Indices in QSAR Studies Related to Central Chirality Codification. ACTA ACUST UNITED AC 2009. [DOI: 10.1002/qsar.200960085] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
13
|
Multi-target spectral moment: QSAR for antiviral drugs vs. different viral species. Anal Chim Acta 2009; 651:159-64. [PMID: 19782806 DOI: 10.1016/j.aca.2009.08.022] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2009] [Revised: 08/05/2009] [Accepted: 08/18/2009] [Indexed: 11/23/2022]
Abstract
The antiviral QSAR models have an important limitation today. They predict the biological activity of drugs against only one viral species. This is determined by the fact that most of the current reported molecular descriptors encode only information about the molecular structure. As a result, predicting the probability with which a drug is active against different viral species with a single unifying model is a goal of major importance. In this work, we use Markov Chain theory to calculate new multi-target spectral moments to fit a QSAR model for drugs active against 40 viral species. The model is based on 500 drugs (including active and non-active compounds) tested as antiviral agents in the recent literature; not all drugs were predicted against all viruses, but only those with experimental values. The database also contains 207 well-known compounds (not as recent as the previous ones) reported in the Merck Index with other activities that do not include antiviral action against any virus species. We used Linear Discriminant Analysis (LDA) to classify all these drugs into two classes as active or non-active against the different viral species tested, whose data we processed. The model correctly classifies 5129 out of 5594 non-active compounds (91.69%) and 412 out of 422 active compounds (97.63%). Overall training predictability was 92.34%. The validation of the model was carried out by means of external predicting series, the model classifying, thus, 2568 out of 2779 non-active compounds and 224 out of 229 active compounds. Overall training predictability was 92.82%. The present work reports the first attempts to calculate within a unified framework the probabilities of antiviral drugs against different virus species based on a spectral moment analysis.
Collapse
|
14
|
García I, Munteanu CR, Fall Y, Gómez G, Uriarte E, González-Díaz H. QSAR and complex network study of the chiral HMGR inhibitor structural diversity. Bioorg Med Chem 2009; 17:165-75. [DOI: 10.1016/j.bmc.2008.11.007] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2008] [Revised: 10/31/2008] [Accepted: 11/06/2008] [Indexed: 10/21/2022]
|
15
|
Vilar S, González-Díaz H, Santana L, Uriarte E. QSAR model for alignment-free prediction of human breast cancer biomarkers based on electrostatic potentials of protein pseudofolding HP-lattice networks. J Comput Chem 2008; 29:2613-22. [PMID: 18478581 DOI: 10.1002/jcc.21016] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Network theory allows relationships to be established between numerical parameters that describe the molecular structure of genes and proteins and their biological properties. These models can be considered as quantitative structure-activity relationships (QSAR) for biopolymers. The work described here concerns the first QSAR model for 122 proteins that are associated with human breast cancer (HBC), as identified experimentally by Sjöblom et al. (Science 2006, 314, 268) from over 10,000 human proteins. In this study, the 122 proteins related to HBC (HBCp) and a control group of 200 proteins that are not related to HBC (non-HBCp) were forced to fold in an HP lattice network. From these networks a series of electrostatic potential parameters (xi(k)) was calculated to describe each protein numerically. The use of xi(k) as an entry point to linear discriminant analysis led to a QSAR model to discriminate between HBCp and non-HBCp, and this model could help to predict the involvement of a certain gene and/or protein in HBC. In addition, validation procedures were carried out on the model and these included an external prediction series and evaluation of an additional series of 1000 non-HBCp. In all cases good levels of classification were obtained with values above 80%. This study represents the first example of a QSAR model for the computational chemistry inspired search of potential HBC protein biomarkers.
Collapse
Affiliation(s)
- Santiago Vilar
- Unit of Bioinformatics and Connectivity Analysis, Institute of Industrial Pharmacy, and Department of Organic Chemistry, Faculty of Pharmacy, University of Santiago de Compostela, Santiago de Compostela 15782, Spain
| | | | | | | |
Collapse
|
16
|
Prado-Prado FJ, Martinez de la Vega O, Uriarte E, Ubeira FM, Chou KC, González-Díaz H. Unified QSAR approach to antimicrobials. 4. Multi-target QSAR modeling and comparative multi-distance study of the giant components of antiviral drug-drug complex networks. Bioorg Med Chem 2008; 17:569-75. [PMID: 19112024 DOI: 10.1016/j.bmc.2008.11.075] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2008] [Revised: 11/24/2008] [Accepted: 11/28/2008] [Indexed: 11/18/2022]
Abstract
One limitation of almost all antiviral Quantitative Structure-Activity Relationships (QSAR) models is that they predict the biological activity of drugs against only one species of virus. Consequently, the development of multi-tasking QSAR models (mt-QSAR) to predict drugs activity against different species of virus is of the major vitally important. These mt-QSARs offer also a good opportunity to construct drug-drug Complex Networks (CNs) that can be used to explore large and complex drug-viral species databases. It is known that in very large CNs we can use the Giant Component (GC) as a representative sub-set of nodes (drugs) and but the drug-drug similarity function selected may strongly determines the final network obtained. In the three previous works of the present series we reported mt-QSAR models to predict the antimicrobial activity against different fungi [Gonzalez-Diaz, H.; Prado-Prado, F. J.; Santana, L.; Uriarte, E. Bioorg.Med.Chem.2006, 14, 5973], bacteria [Prado-Prado, F. J.; Gonzalez-Diaz, H.; Santana, L.; Uriarte E. Bioorg.Med.Chem.2007, 15, 897] or parasite species [Prado-Prado, F.J.; González-Díaz, H.; Martinez de la Vega, O.; Ubeira, F.M.; Chou K.C. Bioorg.Med.Chem.2008, 16, 5871]. However, including these works, we do not found any report of mt-QSAR models for antivirals drug, or a comparative study of the different GC extracted from drug-drug CNs based on different similarity functions. In this work, we used Linear Discriminant Analysis (LDA) to fit a mt-QSAR model that classify 600 drugs as active or non-active against the 41 different tested species of virus. The model correctly classifies 143 of 169 active compounds (specificity=84.62%) and 119 of 139 non-active compounds (sensitivity=85.61%) and presents overall training accuracy of 85.1% (262 of 308 cases). Validation of the model was carried out by means of external predicting series, classifying the model 466 of 514, 90.7% of compounds. In order to illustrate the performance of the model in practice, we develop a virtual screening recognizing the model as active 92.7%, 102 of 110 antivirus compounds. These compounds were never use in training or predicting series. Next, we obtained and compared the topology of the CNs and their respective GCs based on Euclidean, Manhattan, Chebychey, Pearson and other similarity measures. The GC of the Manhattan network showed the more interesting features for drug-drug similarity search. We also give the procedure for the construction of Back-Projection Maps for the contribution of each drug sub-structure to the antiviral activity against different species.
Collapse
Affiliation(s)
- Francisco J Prado-Prado
- Department of Microbiology and Parasitology, Faculty of Pharmacy, University of Santiago de Compostela, Santiago de Compostela 15782, Spain
| | | | | | | | | | | |
Collapse
|
17
|
Cruz-Monteagudo M, Borges F, Cordeiro MNDS. Desirability-based multiobjective optimization for global QSAR studies: application to the design of novel NSAIDs with improved analgesic, antiinflammatory, and ulcerogenic profiles. J Comput Chem 2008; 29:2445-59. [PMID: 18452123 DOI: 10.1002/jcc.20994] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Up to now, very few reports have been published concerning the application of multiobjective optimization (MOOP) techniques to quantitative structure-activity relationship (QSAR) studies. However, none reports the optimization of objectives related directly to the desired pharmaceutical profile of the drug. In this work, for the first time, it is proposed a MOOP method based on Derringer's desirability function that allows conducting global QSAR studies considering simultaneously the pharmacological, pharmacokinetic and toxicological profile of a set of molecule candidates. The usefulness of the method is demonstrated by applying it to the simultaneous optimization of the analgesic, antiinflammatory, and ulcerogenic properties of a library of fifteen 3-(3-methylphenyl)-2-substituted amino-3H-quinazolin-4-one compounds. The levels of the predictor variables producing concurrently the best possible compromise between these properties is found and used to design a set of new optimized drug candidates. Our results also suggest the relevant role of the bulkiness of alkyl substituents on the C-2 position of the quinazoline ring over the ulcerogenic properties for this family of compounds. Finally, and most importantly, the desirability-based MOOP method proposed is a valuable tool and shall aid in the future rational design of novel successful drugs.
Collapse
Affiliation(s)
- Maykel Cruz-Monteagudo
- Physico-Chemical Molecular Research Unit, Department of Organic Chemistry, Faculty of Pharmacy, University of Porto, 4150-047 Porto, Portugal.
| | | | | |
Collapse
|
18
|
Perez-Bello A, Munteanu CR, Ubeira FM, De Magalhães AL, Uriarte E, González-Díaz H. Alignment-free prediction of mycobacterial DNA promoters based on pseudo-folding lattice network or star-graph topological indices. J Theor Biol 2008; 256:458-66. [PMID: 18992259 PMCID: PMC7126577 DOI: 10.1016/j.jtbi.2008.09.035] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2008] [Revised: 09/23/2008] [Accepted: 09/25/2008] [Indexed: 12/01/2022]
Abstract
The importance of the promoter sequences in the function regulation of several important mycobacterial pathogens creates the necessity to design simple and fast theoretical models that can predict them. This work proposes two DNA promoter QSAR models based on pseudo-folding lattice network (LN) and star-graphs (SG) topological indices. In addition, a comparative study with the previous RNA electrostatic parameters of thermodynamically-driven secondary structure folding representations has been carried out. The best model of this work was obtained with only two LN stochastic electrostatic potentials and it is characterized by accuracy, selectivity and specificity of 90.87%, 82.96% and 92.95%, respectively. In addition, we pointed out the SG result dependence on the DNA sequence codification and we proposed a QSAR model based on codons and only three SG spectral moments.
Collapse
Affiliation(s)
- Alcides Perez-Bello
- Department of Microbiology and Parasitology, University of Santiago de Compostela, Santiago de Compostela 15782, Spain.
| | | | | | | | | | | |
Collapse
|
19
|
Cruz-Monteagudo M, Borges F, Cordeiro MNDS, Cagide Fajin JL, Morell C, Ruiz RM, Cañizares-Carmenate Y, Dominguez ER. Desirability-based methods of multiobjective optimization and ranking for global QSAR studies. Filtering safe and potent drug candidates from combinatorial libraries. ACTA ACUST UNITED AC 2008; 10:897-913. [PMID: 18855460 DOI: 10.1021/cc800115y] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Up to now, very few applications of multiobjective optimization (MOOP) techniques to quantitative structure-activity relationship (QSAR) studies have been reported in the literature. However, none of them report the optimization of objectives related directly to the final pharmaceutical profile of a drug. In this paper, a MOOP method based on Derringer's desirability function that allows conducting global QSAR studies, simultaneously considering the potency, bioavailability, and safety of a set of drug candidates, is introduced. The results of the desirability-based MOOP (the levels of the predictor variables concurrently producing the best possible compromise between the properties determining an optimal drug candidate) are used for the implementation of a ranking method that is also based on the application of desirability functions. This method allows ranking drug candidates with unknown pharmaceutical properties from combinatorial libraries according to the degree of similarity with the previously determined optimal candidate. Application of this method will make it possible to filter the most promising drug candidates of a library (the best-ranked candidates), which should have the best pharmaceutical profile (the best compromise between potency, safety and bioavailability). In addition, a validation method of the ranking process, as well as a quantitative measure of the quality of a ranking, the ranking quality index (Psi), is proposed. The usefulness of the desirability-based methods of MOOP and ranking is demonstrated by its application to a library of 95 fluoroquinolones, reporting their gram-negative antibacterial activity and mammalian cell cytotoxicity. Finally, the combined use of the desirability-based methods of MOOP and ranking proposed here seems to be a valuable tool for rational drug discovery and development.
Collapse
Affiliation(s)
- Maykel Cruz-Monteagudo
- Physico-Chemical Molecular Research Unit, Department of Organic Chemistry, Faculty of Pharmacy, REQUIMTE, Department of Chemistry, and CIQ-UP, Faculty of Sciences, University of Porto, 4169-007 Porto, Portugal.
| | | | | | | | | | | | | | | |
Collapse
|
20
|
Dea-Ayuela MA, Pérez-Castillo Y, Meneses-Marcel A, Ubeira FM, Bolas-Fernández F, Chou KC, González-Díaz H. HP-Lattice QSAR for dynein proteins: experimental proteomics (2D-electrophoresis, mass spectrometry) and theoretic study of a Leishmania infantum sequence. Bioorg Med Chem 2008; 16:7770-6. [PMID: 18662882 DOI: 10.1016/j.bmc.2008.07.023] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2008] [Revised: 06/23/2008] [Accepted: 07/02/2008] [Indexed: 10/21/2022]
Abstract
The toxicity and inefficacy of actual organic drugs against Leishmaniosis justify research projects to find new molecular targets in Leishmania species including Leishmania infantum (L. infantum) and Leishmaniamajor (L. major), both important pathogens. In this sense, quantitative structure-activity relationship (QSAR) methods, which are very useful in Bioorganic and Medicinal Chemistry to discover small-sized drugs, may help to identify not only new drugs but also new drug targets, if we apply them to proteins. Dyneins are important proteins of these parasites governing fundamental processes such as cilia and flagella motion, nuclear migration, organization of the mitotic splinde, and chromosome separation during mitosis. However, despite the interest for them as potential drug targets, so far there has been no report whatsoever on dyneins with QSAR techniques. To the best of our knowledge, we report here the first QSAR for dynein proteins. We used as input the Spectral Moments of a Markov matrix associated to the HP-Lattice Network of the protein sequence. The data contain 411 protein sequences of different species selected by ClustalX to develop a QSAR that correctly discriminates on average between 92.75% and 92.51% of dyneins and other proteins in four different train and cross-validation datasets. We also report a combined experimental and theoretic study of a new dynein sequence in order to illustrate the utility of the model to search for potential drug targets with a practical example. First, we carried out a 2D-electrophoresis analysis of L. infantum biological samples. Next, we excised from 2D-E gels one spot of interest belonging to an unknown protein or protein fragment in the region M<20,200 and pI<4. We used MASCOT search engine to find proteins in the L. major data base with the highest similarity score to the MS of the protein isolated from L. infantum. We used the QSAR model to predict the new sequence as dynein with probability of 99.99% without relying upon alignment. In order to confirm the previous function annotation we predicted the sequences as dynein with BLAST and the omniBLAST tools (96% alignment similarity to dyneins of other species). Using this combined strategy, we have successfully identified L. infantum protein containing dynein heavy chain, and illustrated the potential use of the QSAR model as a complement to alignment tools.
Collapse
|
21
|
Estrada E. Quantum-Chemical Foundations of the Topological Substructural Molecular Design. J Phys Chem A 2008; 112:5208-17. [DOI: 10.1021/jp8010712] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Affiliation(s)
- Ernesto Estrada
- Complex Systems Research Group, RIAIDT & Department of Organic Chemistry, Faculty of Pharmacy, Edificio CACTUS, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain
| |
Collapse
|
22
|
Prado-Prado FJ, González-Díaz H, de la Vega OM, Ubeira FM, Chou KC. Unified QSAR approach to antimicrobials. Part 3: first multi-tasking QSAR model for input-coded prediction, structural back-projection, and complex networks clustering of antiprotozoal compounds. Bioorg Med Chem 2008; 16:5871-80. [PMID: 18485714 DOI: 10.1016/j.bmc.2008.04.068] [Citation(s) in RCA: 104] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2008] [Revised: 04/22/2008] [Accepted: 04/25/2008] [Indexed: 10/22/2022]
Abstract
Several pathogen parasite species show different susceptibilities to different antiparasite drugs. Unfortunately, almost all structure-based methods are one-task or one-target Quantitative Structure-Activity Relationships (ot-QSAR) that predict the biological activity of drugs against only one parasite species. Consequently, multi-tasking learning to predict drugs activity against different species by a single model (mt-QSAR) is vitally important. In the two previous works of the present series we reported two single mt-QSAR models in order to predict the antimicrobial activity against different fungal (Bioorg. Med. Chem.2006, 14, 5973-5980) or bacterial species (Bioorg. Med. Chem.2007, 15, 897-902). These mt-QSARs offer a good opportunity (unpractical with ot-QSAR) to construct drug-drug similarity Complex Networks and to map the contribution of sub-structures to function for multiple species. These possibilities were unattended in our previous works. In the present work, we continue this series toward other important direction of chemotherapy (antiparasite drugs) with the development of an mt-QSAR for more than 500 drugs tested in the literature against different parasites. The data were processed by Linear Discriminant Analysis (LDA) classifying drugs as active or non-active against the different tested parasite species. The model correctly classifies 212 out of 244 (87.0%) cases in training series and 207 out of 243 compounds (85.4%) in external validation series. In order to illustrate the performance of the QSAR for the selection of active drugs we carried out an additional virtual screening of antiparasite compounds not used in training or predicting series; the model recognized 97 out of 114 (85.1%) of them. We also give the procedures to construct back-projection maps and to calculate sub-structures contribution to the biological activity. Finally, we used the outputs of the QSAR to construct, by the first time, a multi-species Complex Networks of antiparasite drugs. The network predicted has 380 nodes (compounds), 634 edges (pairs of compounds with similar activity). This network allows us to cluster different compounds and identify on average three known compounds similar to a new query compound according to their profile of biological activity. This is the first attempt to calculate probabilities of antiparasitic action of drugs against different parasites.
Collapse
|
23
|
Agüero-Chapín G, González-Díaz H, de la Riva G, Rodríguez E, Sánchez-Rodríguez A, Podda G, Vazquez-Padrón RI. MMM-QSAR Recognition of Ribonucleases without Alignment: Comparison with an HMM Model and Isolation from Schizosaccharomyces pombe, Prediction, and Experimental Assay of a New Sequence. J Chem Inf Model 2008; 48:434-48. [DOI: 10.1021/ci7003225] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Guillermín Agüero-Chapín
- Dipartimento Farmaco Chimico Tecnologico, Universitá Degli Studi di Cagliari, Cagliari, 09124, Italy, CAP, Faculty of Chemistry and Pharmacy, IBP, and CBQ, UCLV, Santa Clara 54830, Cuba, Unit for Bioinformatics & Connectivity Analysis (UBICA), Institute of Industrial Pharmacy and Department of Organic Chemistry, Faculty of Pharmacy, USC, Santiago de Compostela 15782, Spain, CINVESTAV-LANGEBIO, Irapuato, Guanajuato 36821, México, Caribbean Vitroplants, Santo Domingo 1464, Dominican Republic, and Vascular
| | - Humberto González-Díaz
- Dipartimento Farmaco Chimico Tecnologico, Universitá Degli Studi di Cagliari, Cagliari, 09124, Italy, CAP, Faculty of Chemistry and Pharmacy, IBP, and CBQ, UCLV, Santa Clara 54830, Cuba, Unit for Bioinformatics & Connectivity Analysis (UBICA), Institute of Industrial Pharmacy and Department of Organic Chemistry, Faculty of Pharmacy, USC, Santiago de Compostela 15782, Spain, CINVESTAV-LANGEBIO, Irapuato, Guanajuato 36821, México, Caribbean Vitroplants, Santo Domingo 1464, Dominican Republic, and Vascular
| | - Gustavo de la Riva
- Dipartimento Farmaco Chimico Tecnologico, Universitá Degli Studi di Cagliari, Cagliari, 09124, Italy, CAP, Faculty of Chemistry and Pharmacy, IBP, and CBQ, UCLV, Santa Clara 54830, Cuba, Unit for Bioinformatics & Connectivity Analysis (UBICA), Institute of Industrial Pharmacy and Department of Organic Chemistry, Faculty of Pharmacy, USC, Santiago de Compostela 15782, Spain, CINVESTAV-LANGEBIO, Irapuato, Guanajuato 36821, México, Caribbean Vitroplants, Santo Domingo 1464, Dominican Republic, and Vascular
| | - Edrey Rodríguez
- Dipartimento Farmaco Chimico Tecnologico, Universitá Degli Studi di Cagliari, Cagliari, 09124, Italy, CAP, Faculty of Chemistry and Pharmacy, IBP, and CBQ, UCLV, Santa Clara 54830, Cuba, Unit for Bioinformatics & Connectivity Analysis (UBICA), Institute of Industrial Pharmacy and Department of Organic Chemistry, Faculty of Pharmacy, USC, Santiago de Compostela 15782, Spain, CINVESTAV-LANGEBIO, Irapuato, Guanajuato 36821, México, Caribbean Vitroplants, Santo Domingo 1464, Dominican Republic, and Vascular
| | - Aminael Sánchez-Rodríguez
- Dipartimento Farmaco Chimico Tecnologico, Universitá Degli Studi di Cagliari, Cagliari, 09124, Italy, CAP, Faculty of Chemistry and Pharmacy, IBP, and CBQ, UCLV, Santa Clara 54830, Cuba, Unit for Bioinformatics & Connectivity Analysis (UBICA), Institute of Industrial Pharmacy and Department of Organic Chemistry, Faculty of Pharmacy, USC, Santiago de Compostela 15782, Spain, CINVESTAV-LANGEBIO, Irapuato, Guanajuato 36821, México, Caribbean Vitroplants, Santo Domingo 1464, Dominican Republic, and Vascular
| | - Gianni Podda
- Dipartimento Farmaco Chimico Tecnologico, Universitá Degli Studi di Cagliari, Cagliari, 09124, Italy, CAP, Faculty of Chemistry and Pharmacy, IBP, and CBQ, UCLV, Santa Clara 54830, Cuba, Unit for Bioinformatics & Connectivity Analysis (UBICA), Institute of Industrial Pharmacy and Department of Organic Chemistry, Faculty of Pharmacy, USC, Santiago de Compostela 15782, Spain, CINVESTAV-LANGEBIO, Irapuato, Guanajuato 36821, México, Caribbean Vitroplants, Santo Domingo 1464, Dominican Republic, and Vascular
| | - Roberto I. Vazquez-Padrón
- Dipartimento Farmaco Chimico Tecnologico, Universitá Degli Studi di Cagliari, Cagliari, 09124, Italy, CAP, Faculty of Chemistry and Pharmacy, IBP, and CBQ, UCLV, Santa Clara 54830, Cuba, Unit for Bioinformatics & Connectivity Analysis (UBICA), Institute of Industrial Pharmacy and Department of Organic Chemistry, Faculty of Pharmacy, USC, Santiago de Compostela 15782, Spain, CINVESTAV-LANGEBIO, Irapuato, Guanajuato 36821, México, Caribbean Vitroplants, Santo Domingo 1464, Dominican Republic, and Vascular
| |
Collapse
|
24
|
Vilar S, Quezada E, Alcaide C, Orallo F, Santana L, Uriarte E. Quantitative Structure Vasodilatory Activity Relationship – Synthesis and “In Silico” and “In Vitro” Evaluation of Resveratrol-Coumarin Hybrids. ACTA ACUST UNITED AC 2007. [DOI: 10.1002/qsar.200630006] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
|
25
|
Hassan Khan MT, Ather A. Potentials of phenolic molecules of natural origin and their derivatives as anti-HIV agents. BIOTECHNOLOGY ANNUAL REVIEW 2007; 13:223-64. [PMID: 17875479 DOI: 10.1016/s1387-2656(07)13009-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Identification of phenolic compounds and their derivatives interfering the several steps of the viral life cycle of the human immunodeficiency virus type 1 (HIV-1) is focused for the development of novel molecules for the treatment of AIDS. Several phenolic compounds isolated and characterized from natural sources have been studied in detail and found to exhibit inhibitory effects against different steps of the HIV-1 life cycle, including virus-cell fusion and virus absorption, reverse transcription, integration (IN) and proteolytic cleavage. In the review, we are summarizing some strong evidences demonstrating several phenolic molecules and their derivatives from natural sources display promising anti-HIV-1 activities. The anti-HIV compounds have been organized in this review according to their mechanism of action in the life cycle of HIV. We also mentioned some findings using in silico approaches, like virtual screening, docking, neural network, etc., and even the chemogenomics and/or functional genomics approaches could be useful for the quick identifying promising new lead anti-HIV molecules without having any other unwanted pharmacological effects. Plants having large amount of phenolic compounds, can be considered as strong sources of molecules for the treatment of HIV-1. Despite the continuous advances made in antiretroviral combination therapy, AIDS has become the leading cause of death in Africa and the fourth worldwide. Today, many research groups are exploring the bio- and chemo-diversity of the plant kingdom to find new and better anti-HIV drugs with novel mechanisms of action.
Collapse
Affiliation(s)
- Mahmud Tareq Hassan Khan
- Pharmacology Research Laboratory, Faculty of Pharmaceutical Sciences, University of Science and Technology Chittagong, Chittagong, Bangladesh.
| | | |
Collapse
|
26
|
González-Díaz H, Olazábal E, Santana L, Uriarte E, González-Díaz Y, Castañedo N. QSAR study of anticoccidial activity for diverse chemical compounds: Prediction and experimental assay of trans-2-(2-nitrovinyl)furan. Bioorg Med Chem 2007; 15:962-8. [PMID: 17081758 DOI: 10.1016/j.bmc.2006.10.032] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2006] [Revised: 10/03/2006] [Accepted: 10/17/2006] [Indexed: 11/21/2022]
Abstract
In this work we report a QSAR model that discriminates between chemically heterogeneous classes of anticoccidial and non-anticoccidial compounds. For this purpose we used the Markovian Chemicals in silico Design (MARCH-INSIDE) approach J. Mol. Mod.2002, 8, 237-245; J. Mol. Mod.2003, 9, 395-407]. Linear discriminant analysis allowed us to fit the discriminant function. This function correctly classifies 86.67% of anticoccidial compounds and 96.23% of inactive compounds in the training series. Overall classification is 94.12%. We validated the model by means of an external predicting series, with 86.96% of global predictability. Remarkably, the present model is based on topological as well as configuration-dependent molecular descriptors. Therefore, the model performs timely calculations and allows discrimination between Z/E and chiral isomers. Finally, to exemplify the use of the model in practice we report the prediction and experimental assay of trans-2-(2-nitrovinyl)furan. It is notable that lesion control was 72.86% at mg/kg of body weight with respect to 60% at 125 mg/kg for amprolium (control drug). The back-projection map for this compound predicts a high level of importance for the double bond and for the nitro group in the trans position. We conclude that the MARCH-INSIDE approach enables the accurate fast track identification of anticoccidial hits. Moreover, trans-2-(2-nitrovinyl)furan seems to be a promising drug for the treatment of coccidiosis.
Collapse
Affiliation(s)
- Humberto González-Díaz
- Department of Organic Chemistry & Institute of Industrial Pharmacy, Faculty of Pharmacy, University of Santiago de Compostela, Santiago 15782, Spain.
| | | | | | | | | | | |
Collapse
|
27
|
Marrero-Ponce Y, Torrens F, Alvarado YJ, Rotondo R. Bond-based global and local (bond, group and bond-type) quadratic indices and their applications to computer-aided molecular design. 1. QSPR studies of diverse sets of organic chemicals. J Comput Aided Mol Des 2006; 20:685-701. [PMID: 17186417 DOI: 10.1007/s10822-006-9089-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2006] [Accepted: 10/18/2006] [Indexed: 11/26/2022]
Abstract
The concept of atom-based quadratic indices is extended to a series of molecular descriptors (MDs) (both total and local) based on adjacency between edges. The kth edge-adjacency matrix (E ( k )) denotes the matrix of bond-based quadratic indices (non-stochastic) with respect to the canonical basis set. The kth "stochastic" edge-adjacency matrix, ES ( k ), is here proposed as a new molecular representation easily calculated from E ( k ). Then, the kth stochastic bond-based quadratic indices are calculated using ES ( k ) as operators of quadratic transformations. The study of six representative physicochemical properties of octane isomers was used to compare the ability of both series of MDs to produce significant quantitative structure-property relationship (QSPR) models. Moreover, the general performance of the new MDs in this QSPR study has been evaluated with respect to other 2D/3D well-known sets of indices and the obtained results shown a quite satisfactory behavior of the present method. The novel bond-level MDs were also used for the description and prediction of the boiling point of 28 alkyl-alcohols and to the modeling of the specific rate constant (log k) of 34 derivatives of 2-furylethylenes. These models were statistically significant and showed very good stability to data variation in leave-one-out (LOO) cross-validation experiment. The comparison with other approaches (edge- and vertices-based connectivity indices, total and local spectral moments, and quantum chemical descriptors as well as E-state/biomolecular encounter parameters) expose a good behavior of our method in this QSPR studies. The approach described in this report appears to be a very promising structural invariant, useful for QSPR/QSAR studies, similarity/diversity analysis, and computer-aided "rational" molecular (drug) design.
Collapse
Affiliation(s)
- Yovani Marrero-Ponce
- Unit of Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatic Research (CAMD-BIR Unit), Faculty of Chemistry-Pharmacy, Central University of Las Villas, Santa Clara, Villa Clara, 54830, Cuba.
| | | | | | | |
Collapse
|
28
|
Vilar S, Santana L, Uriarte E. Probabilistic Neural Network Model for the In Silico Evaluation of Anti-HIV Activity and Mechanism of Action. J Med Chem 2006; 49:1118-24. [PMID: 16451076 DOI: 10.1021/jm050932j] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A theoretical model has been developed that discriminates between active and nonactive drugs against HIV-1 with four different mechanisms of action for the active drugs. The model was built up using a probabilistic neural network (PNN) algorithm and a database of 2720 compounds. The model showed an overall accuracy of 97.34% in the training series, 85.12% in the selection series, and 84.78% in an external prediction series. The model not only correctly classified a very heterogeneous series of organic compounds but also discriminated between very similar active/nonactive chemicals that belong to the same family of compounds. More specifically, the model recognized 96.02% of nonactive compounds, 94.24% of active compounds that inhibited reverse transcriptase, 97.24% of protease inhibitors, 97.14% of virus uncoating inhibitors, and 90.32% of integrase inhibitors. The results indicate that this approach may represent a powerful tool for modeling large databases in QSAR with applications in medicinal chemistry.
Collapse
Affiliation(s)
- Santiago Vilar
- Faculty of Pharmacy, Department of Organic Chemistry, University of Santiago de Compostela, Santiago de Compostela 15782, Spain.
| | | | | |
Collapse
|