1. Cañizares-Carmenate Y, Perera-Sardiña Y, Marrero-Ponce Y, Díaz-Amador R, Torrens F, Castillo-Garit JA. Ligand and structure-based discovery of phosphorus-containing compounds as potential metalloproteinase inhibitors. SAR QSAR Environ Res 2024; 35:219-240. [PMID: 38380444 DOI: 10.1080/1062936x.2024.2314103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/21/2023] [Accepted: 01/29/2024] [Indexed: 02/22/2024]
Abstract
In this study, we propose a methodology that combines ligand- and structure-based virtual screening tools to identify phosphorus-containing compounds as inhibitors of zinc metalloproteases. First, we use Dragon molecular descriptors to develop a Linear Discriminant Analysis classification model, which is extensively validated according to the OECD principles. This model is simple, robust, stable and has good discriminating power. Furthermore, it has a defined applicability domain and is used for virtual screening of the DrugBank database. Second, docking experiments are carried out on the identified compounds, which showed good binding energies to the enzyme thermolysin. Considering the potential toxicity of phosphorus-containing compounds, their toxicological profile is evaluated with ProTox-II. Of the five molecules evaluated, two show carcinogenic and mutagenic potential and a low LD50 and are therefore not recommended as drugs, while the other three are classified as non-toxic and could constitute a starting point for the development of new vasoactive metalloprotease inhibitor drugs. According to molecular dynamics simulations, two of them show stable interactions with the active site, maintaining coordination with the metal. The QSAR, docking and molecular dynamics results agree closely, demonstrating the potential of combining these tools.
Affiliation(s)
- Y Cañizares-Carmenate
  - Unit of Computer-Aided Molecular "Biosilico" Discovery and Bioinformatic Research (CAMD-BIR Unit), Departamento de Farmacia, Facultad de Química-Farmacia, Universidad Central "Marta Abreu" de Las Villas, Santa Clara, Cuba
- Y Perera-Sardiña
  - Departamento de Ciencias Básicas Biomédicas, Facultad de Ciencias de la Salud, Universidad de Talca, Talca, Chile
- Y Marrero-Ponce
  - Grupo de Medicina Molecular y Traslacional (MeM & T), Escuela de Medicina, Universidad San Francisco de Quito, Edificio de Especialidades Médicas, Quito, Ecuador
- R Díaz-Amador
  - Laboratorio de Bioinformática y Química Computacional, Escuela de Química y Farmacia, Facultad de Medicina, Universidad Católica de Maule, Maule, Chile
- F Torrens
  - Institut Universitari de Ciència Molecular, Universitat de València, Edifici d'Instituts de Paterna, València, Spain
- J A Castillo-Garit
  - Instituto Universitario de Investigación y Desarrollo Tecnológico (IDT), Universidad Tecnológica Metropolitana, Santiago, Chile
2. García-Jacas CR, Marrero-Ponce Y, Hernández-Ortega T, Martinez-Mayorga K, Cabrera-Leyva L, Ledesma-Romero JC, Aguilera-Fernández I, Rodríguez-León AR. Tensor algebra-based geometric methodology to codify central chirality on organic molecules. SAR QSAR Environ Res 2017; 28:541-556. [PMID: 28705027 DOI: 10.1080/1062936x.2017.1344729] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Received: 04/04/2017] [Accepted: 06/16/2017] [Indexed: 06/07/2023]
Abstract
A novel mathematical procedure to codify chiral features of organic molecules in the QuBiLS-MIDAS framework is introduced. This procedure generalizes the one commonly used to date, in which the values 1 and -1 (correction factor) are employed to weight the molecular vectors when each atom is labelled as R (rectus) or S (sinister) according to the Cahn-Ingold-Prelog rules. In the generalized procedure, values in the range [Formula: see text] with steps equal to 0.25 may be accounted for. The atoms labelled R or S can have negative and positive values assigned (e.g. -3 for an R atom and 1 for an S atom, or vice versa), opposed values (e.g. -3 for an R atom and 3 for an S atom, or vice versa), positive values (e.g. 3 for an R atom and 1 for an S atom) or negative values (e.g. -3 for an R atom and -1 for an S atom). These proposed Chiral QuBiLS-MIDAS 3D-MDs are real numbers, non-symmetric and reduce to 'classical' (non-chiral) QuBiLS-MIDAS 3D-MDs when symmetry is not codified (correction factor equal to zero). In this report, only the factors with opposed values were considered, with the purpose of demonstrating the feasibility of this proposal. From QSAR modelling carried out on four chemical datasets (Cramer's steroids, fenoterol stereoisomer derivatives, N-alkylated 3-(3-hydroxyphenyl)-piperidines, and perindoprilat stereoisomers), it was demonstrated that the use of several correction factors contributes to the building of models with greater robustness and predictive ability than those reported in the literature, as well as with respect to the models exclusively developed with QuBiLS-MIDAS 3D-MDs based on the factor 1 | -1. In conclusion, this novel strategy constitutes a suitable alternative to computed chirality-based descriptors, contributing to the development of good models to predict properties depending on symmetry.
Affiliation(s)
- C R García-Jacas
  - a Instituto de Química, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, México
  - b Escuela de Sistemas y Computación, Pontificia Universidad Católica del Ecuador Sede Esmeraldas (PUCESE), Esmeraldas, Ecuador
  - g Grupo de Investigación de Bioinformática, Universidad de las Ciencias Informáticas (UCI), La Habana, Cuba
- Y Marrero-Ponce
  - c Computer-Aided Molecular "Biosilico" Discovery and Bioinformatics Research International Network (CAMD-BIR IN), Quito, Ecuador
  - d Universidad San Francisco de Quito (USFQ), Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Quito, Pichincha, Ecuador
  - e Universidad San Francisco de Quito (USFQ), Instituto de Simulación Computacional (ISC-USFQ), Quito, Pichincha, Ecuador
  - f Grupo de Investigación Ambiental (GIA), Programas Ambientales, Facultad de Ingenierías, Fundación Universitaria Tecnológico de Comfenalco (COMFENALCO), Cartagena de Indias, Bolívar, Colombia
- T Hernández-Ortega
  - g Grupo de Investigación de Bioinformática, Universidad de las Ciencias Informáticas (UCI), La Habana, Cuba
- K Martinez-Mayorga
  - a Instituto de Química, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, México
- L Cabrera-Leyva
  - h Grupo de Investigación de Inteligencia Artificial (AIRES), Facultad de Informática, Universidad de Camagüey, Camagüey, Cuba
- J C Ledesma-Romero
  - g Grupo de Investigación de Bioinformática, Universidad de las Ciencias Informáticas (UCI), La Habana, Cuba
- I Aguilera-Fernández
  - g Grupo de Investigación de Bioinformática, Universidad de las Ciencias Informáticas (UCI), La Habana, Cuba
- A R Rodríguez-León
  - g Grupo de Investigación de Bioinformática, Universidad de las Ciencias Informáticas (UCI), La Habana, Cuba
3. Martínez-Santiago O, Marrero-Ponce Y, Vivas-Reyes R, Rivera-Borroto OM, Hurtado E, Treto-Suarez MA, Ramos Y, Vergara-Murillo F, Orozco-Ugarriza ME, Martínez-López Y. Exploring the QSAR's predictive truthfulness of the novel N-tuple discrete derivative indices on benchmark datasets. SAR QSAR Environ Res 2017; 28:367-389. [PMID: 28590848 DOI: 10.1080/1062936x.2017.1326403] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 03/06/2017] [Accepted: 04/27/2017] [Indexed: 06/07/2023]
Abstract
Graph derivative indices (GDIs) have recently been defined over N atoms (N = 2, 3 and 4) simultaneously. They are based on the concept of the derivative in discrete mathematics (finite differences), which is analogous to the derivative concept in classical mathematical analysis. These molecular descriptors (MDs) codify topo-chemical and topo-structural information based on the concept of the derivative of a molecular graph with respect to a given event (S) over duplex, triplex and quadruplex relations of atoms (vertices). These GDIs have been successfully applied in the description of physicochemical properties such as reactivity, solubility and chemical shift, among others, and in several comparative quantitative structure-activity/property relationship (QSAR/QSPR) studies. Although satisfactory results have been obtained in previous modelling studies with these indices, new, more rigorous analyses are needed to assess the true predictive performance of the novel structure codification. In the present paper, therefore, the performance of these novel approaches in QSAR studies is assessed and statistically validated, and compared with that of other QSAR procedures reported in the literature. To this end, QSARs were developed on eight chemical datasets widely used as benchmarks in the evaluation/validation of several QSAR methods and/or many different MDs (fundamentally 3D MDs). Three- to seven-variable QSAR models were built for each chemical dataset, according to the original division into training/test sets. The models were developed by using multiple linear regression (MLR) coupled with a genetic algorithm as the wrapper feature selection technique in the MobyDigs software. Each family of GDIs (for duplex, triplex and quadruplex relations) behaved similarly in all modelling, although there were some exceptions. However, when all families were used in combination, the results achieved were quantitatively higher than those reported by other authors in similar experiments. Comparisons with respect to external correlation coefficients (q2ext) revealed that the models based on GDIs possess superior predictive ability in seven of the eight datasets analysed, outperforming methodologies based on similar or more complex techniques and confirming the good predictive power of the obtained models. For the q2ext values, the non-parametric comparison revealed significantly different results to those reported so far, which demonstrated that the models based on DIVATI's indices presented the best global performance and yielded significantly better predictions than the 12 0-3D QSAR procedures used in the comparison. Therefore, GDIs are suitable for structure codification of the molecules and constitute a good alternative to build QSARs for the prediction of physicochemical, biological and environmental endpoints.
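As an illustration of the external validation statistic mentioned in this abstract, the following Python sketch computes one commonly used form of q2ext. The abstract does not state which definition the authors used, so the formula (total sum of squares taken about the training-set mean) and the function name `q2_ext` are assumptions, and the numbers are made-up toy values rather than data from the paper.

```python
from statistics import fmean


def q2_ext(y_train, y_test, y_pred):
    """External Q^2 = 1 - PRESS / SS.

    PRESS is the squared prediction error on the external test set;
    SS is the squared deviation of the test observations from the
    TRAINING-set mean (an assumed, commonly used convention).
    """
    if len(y_test) != len(y_pred):
        raise ValueError("y_test and y_pred must have the same length")
    y_bar_train = fmean(y_train)
    press = sum((obs - pred) ** 2 for obs, pred in zip(y_test, y_pred))
    ss = sum((obs - y_bar_train) ** 2 for obs in y_test)
    return 1.0 - press / ss


# Toy example (hypothetical activities, not taken from the paper).
train_obs = [1.0, 2.0, 3.0, 4.0]   # training-set mean is 2.5
test_obs = [1.5, 3.5]
test_pred = [1.0, 3.0]
print(q2_ext(train_obs, test_obs, test_pred))  # 0.75
```

Values close to 1 indicate that the external predictions track the observed values closely; a value at or below 0 means the model predicts no better than the training-set mean.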
Affiliation(s)
- O Martínez-Santiago
  - a Department of Chemical Sciences, Central University 'Martha Abreu' of Las Villas, Santa Clara, Cuba
  - b Unit of Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatics Research International Network (CAMD-BIR IN), Quito, Ecuador
  - c Group of Quantum and Theoretical Chemistry, University of Cartagena, Cartagena de Indias, Colombia
  - d Facultad de Ingeniería, Grupo CipTec, Fundación Universitaria Tecnológico Comfenalco, Cartagena de Indias, Colombia
- Y Marrero-Ponce
  - b Unit of Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatics Research International Network (CAMD-BIR IN), Quito, Ecuador
  - e Escuela de Medicina, Edificio de Especialidades Médicas, Universidad San Francisco de Quito (USFQ), Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Av. Interoceánica Km 12 ½, Cumbayá, Ecuador
  - f Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito, Ecuador
  - g Grupo de Investigación Ambiental (GIA), Fundación Universitaria Tecnológico de Comfenalco, Cartagena de Indias, Colombia
- R Vivas-Reyes
  - c Group of Quantum and Theoretical Chemistry, University of Cartagena, Cartagena de Indias, Colombia
  - d Facultad de Ingeniería, Grupo CipTec, Fundación Universitaria Tecnológico Comfenalco, Cartagena de Indias, Colombia
- O M Rivera-Borroto
  - b Unit of Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatics Research International Network (CAMD-BIR IN), Quito, Ecuador
  - h Departamento de Química Física Aplicada, Universidad Autónoma de Madrid (UAM), Madrid, España
- E Hurtado
  - b Unit of Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatics Research International Network (CAMD-BIR IN), Quito, Ecuador
- M A Treto-Suarez
  - i Center of Applied Nanosciences (CENAP), Andres Bello University, Chile
- Y Ramos
  - j Department of Economic Sciences, University of Camagüey, Camagüey, Cuba
- F Vergara-Murillo
  - c Group of Quantum and Theoretical Chemistry, University of Cartagena, Cartagena de Indias, Colombia
  - d Facultad de Ingeniería, Grupo CipTec, Fundación Universitaria Tecnológico Comfenalco, Cartagena de Indias, Colombia
- M E Orozco-Ugarriza
  - k Seccional Cartagena y Grupo de Investigación Traslacional en Biomedicina & Biotecnología - GITB&B, Universidad del Sinú - Elías Bechara Zainúm, Cartagena de Indias, Colombia
- Y Martínez-López
  - b Unit of Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatics Research International Network (CAMD-BIR IN), Quito, Ecuador
  - l Grupo de Investigación de Inteligencia Artificial (AIRES), Universidad de Camagüey, Camagüey, Cuba
4. Aguilera-Pesantes D, Robayo LE, Méndez PE, Mollocana D, Marrero-Ponce Y, Torres FJ, Méndez MA. Discovering key residues of dengue virus NS2b-NS3-protease: New binding sites for antiviral inhibitors design. Biochem Biophys Res Commun 2017; 492:631-642. [PMID: 28343993 DOI: 10.1016/j.bbrc.2017.03.107] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Received: 12/03/2016] [Revised: 03/01/2017] [Accepted: 03/19/2017] [Indexed: 12/24/2022]
Abstract
The NS2B-NS3 protease is essential for the Dengue virus (DENV) replication process. This complex is a target for efficient antiviral discovery because a drug could inhibit viral polyprotein processing. Furthermore, since the protease is highly conserved among the four Dengue virus serotypes, a drug would probably be equally effective against all of them. In this article, we report a strategy that allowed us to identify residues that influence the function of the Dengue NS2B-NS3 protease. Moreover, this strategy could be applied to virtually any protein to search for alternative influential residues and to develop non-competitive inhibitors. First, we incorporated several features derived from computational alanine scanning mutagenesis, sequence, structure conservation, and other structure-based characteristics. Second, these features were used as variables to obtain a multilayer perceptron model to identify defined groups (clusters) of key residues as possible candidate pockets for binding sites of new leads on the DENV protease. The identified residues included: i) amino acids close to the beta sheet-loop-beta sheet known to be important in its closed conformation for NS2B, ii) residues close to the active site, iii) several residues evenly spread on the NS2B-NS3 contact surface, and iv) some inner residues most likely related to the overall stability of the protease. In addition, our list of residues agrees with previously identified amino acids that form part of a highly conserved peptide studied for vaccine development.
Affiliation(s)
- D Aguilera-Pesantes
  - Universidad San Francisco de Quito, Grupo de Química Computacional y Teórica (QCT-USFQ), Diego de Robles sn y Vía Interoceánica, 17-1200-841, Quito, Ecuador; Universidad San Francisco de Quito, Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles sn y Vía Interoceánica, 17-1200-841, Quito, Ecuador
- L E Robayo
  - Universidad San Francisco de Quito, Grupo de Química Computacional y Teórica (QCT-USFQ), Diego de Robles sn y Vía Interoceánica, 17-1200-841, Quito, Ecuador; Universidad San Francisco de Quito, Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles sn y Vía Interoceánica, 17-1200-841, Quito, Ecuador
- P E Méndez
  - Universidad San Francisco de Quito, Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles sn y Vía Interoceánica, 17-1200-841, Quito, Ecuador
- D Mollocana
  - Universidad San Francisco de Quito, Grupo de Química Computacional y Teórica (QCT-USFQ), Diego de Robles sn y Vía Interoceánica, 17-1200-841, Quito, Ecuador; Universidad San Francisco de Quito, Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles sn y Vía Interoceánica, 17-1200-841, Quito, Ecuador
- Y Marrero-Ponce
  - Universidad San Francisco de Quito, Grupo de Química Computacional y Teórica (QCT-USFQ), Diego de Robles sn y Vía Interoceánica, 17-1200-841, Quito, Ecuador; Universidad San Francisco de Quito, Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles sn y Vía Interoceánica, 17-1200-841, Quito, Ecuador; Universidad San Francisco de Quito, Grupo de Medicina Molecular y Traslacional (MeM&T), Escuela de Medicina, Colegio de Ciencias de la Salud (COCSA), Av. Interoceánica Km 12 ½ y Av. Florencia, 17-1200-841, Cumbayá, Quito, Ecuador
- F J Torres
  - Universidad San Francisco de Quito, Grupo de Química Computacional y Teórica (QCT-USFQ), Diego de Robles sn y Vía Interoceánica, 17-1200-841, Quito, Ecuador; Universidad San Francisco de Quito, Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles sn y Vía Interoceánica, 17-1200-841, Quito, Ecuador
- M A Méndez
  - Universidad San Francisco de Quito, Grupo de Química Computacional y Teórica (QCT-USFQ), Diego de Robles sn y Vía Interoceánica, 17-1200-841, Quito, Ecuador; Universidad San Francisco de Quito, Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles sn y Vía Interoceánica, 17-1200-841, Quito, Ecuador; Universidad San Francisco de Quito, Grupo de Medicina Molecular y Traslacional (MeM&T), Escuela de Medicina, Colegio de Ciencias de la Salud (COCSA), Av. Interoceánica Km 12 ½ y Av. Florencia, 17-1200-841, Cumbayá, Quito, Ecuador
5. García-Jacas CR, Martinez-Mayorga K, Marrero-Ponce Y, Medina-Franco JL. Conformation-dependent QSAR approach for the prediction of inhibitory activity of bromodomain modulators. SAR QSAR Environ Res 2017; 28:41-58. [PMID: 28161994 DOI: 10.1080/1062936x.2017.1278616] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Received: 09/23/2016] [Accepted: 12/22/2016] [Indexed: 06/06/2023]
Abstract
Epigenetic drug discovery is a promising research field with growing interest in the scientific community, as evidenced by the number of publications and the large amount of structure-epigenetic activity information currently available in the public domain. Computational methods are valuable tools to analyse and understand the activity of large compound collections from their structural information. In this manuscript, QSAR models to predict the inhibitory activity of a diverse and heterogeneous set of 88 organic molecules against the bromodomains BRD2, BRD3 and BRD4 are presented. A conformation-dependent representation of the chemical structures was established using the RDKit software, and the compounds were divided into training and test sets. Several two-linear and three-linear QuBiLS-MIDAS molecular descriptors (www.tomocomd.com) were computed to extract the geometric structural features of the compounds studied. QuBiLS-MIDAS-based feature sets, to be used in the modelling, were selected using dimensionality reduction strategies. Multiple linear regression coupled with a genetic algorithm was employed to build the predictive models. Regression models containing between 6 and 9 variables were developed and assessed according to several internal and external validation methods. Analyses of outlier compounds and the applicability domain for each model were performed. As a result, the models against BRD2 and BRD3 with 8 variables and the model with 9 variables against BRD4 were those with the best overall performance according to the criteria considered. The results obtained suggest that the models proposed will be a good tool for studying the inhibitory activities of drug candidates against the bromodomains considered during epigenetic drug discovery.
Affiliation(s)
- C R García-Jacas
  - a Instituto de Química, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, México
  - b Escuela de Sistemas y Computación, Pontificia Universidad Católica del Ecuador Sede Esmeraldas (PUCESE), Esmeraldas, Ecuador
  - c Grupo de Investigación de Bioinformática, Universidad de las Ciencias Informáticas (UCI), La Habana, Cuba
- K Martinez-Mayorga
  - a Instituto de Química, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, México
- Y Marrero-Ponce
  - d Grupo de Medicina Molecular y Traslacional (MeM&T), Universidad San Francisco de Quito (USFQ), Quito, Ecuador
  - e Grupo de Investigación Ambiental (GIA), Fundación Universitaria Tecnológica de Comfenalco, Bolívar, Colombia
- J L Medina-Franco
  - f Departamento de Farmacia, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, México
6. García-Jacas CR, Marrero-Ponce Y, Barigye SJ, Hernández-Ortega T, Cabrera-Leyva L, Fernández-Castillo A. N-tuple topological/geometric cutoffs for 3D N-linear algebraic molecular codifications: variability, linear independence and QSAR analysis. SAR QSAR Environ Res 2016; 27:949-975. [PMID: 27707004 DOI: 10.1080/1062936x.2016.1231714] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Received: 06/07/2016] [Accepted: 08/30/2016] [Indexed: 06/06/2023]
Abstract
Novel N-tuple topological/geometric cutoffs to consider specific inter-atomic relations in the QuBiLS-MIDAS framework are introduced in this manuscript. These molecular cutoffs make it possible to take into account relations between more than two atoms by using (dis-)similarity multi-metrics and concepts related to topological and Euclidean-geometric distances. To this end, the kth two-, three- and four-tuple topological and geometric neighbourhood quotient (NQ) total (or local-fragment) spatial-(dis)similarity matrices are defined, to represent 3D information corresponding to the relations between two, three and four atoms of the molecular structures that satisfy certain cutoff criteria. First, an analysis of a diverse chemical space for the most common values of topological/Euclidean-geometric distances, bond/dihedral angles, triangle/quadrilateral perimeters, triangle area and volume was performed in order to determine the intervals to take into account in the cutoff procedures. A variability analysis based on Shannon's entropy reveals that better distribution patterns are attained with the descriptors based on the cutoffs proposed (QuBiLS-MIDAS NQ-MDs) than when all inter-atomic relations are considered (QuBiLS-MIDAS KA-MDs - 'Keep All'). A principal component analysis shows that the novel molecular cutoffs codify chemical information captured by the respective QuBiLS-MIDAS KA-MDs, as well as information not captured by the latter. Lastly, a QSAR study to obtain deeper knowledge of the contribution of the proposed methods was carried out, using four molecular datasets (steroids (STER), angiotensin converting enzyme (ACE), thermolysin inhibitors (THER) and thrombin inhibitors (THR)) widely used as benchmarks in the evaluation of several methodologies. One- to four-variable QSAR models based on multiple linear regression were developed for each compound dataset following the original division into training and test sets. The results obtained reveal that the novel cutoff procedures yield superior performances relative to those of the QuBiLS-MIDAS KA-MDs in the prediction of the biological activities considered. From the results achieved, it can be suggested that the proposed N-tuple topological/geometric cutoffs constitute relevant criteria for generating MDs codifying particular atomic relations, ultimately useful in enhancing the modelling capacity of the QuBiLS-MIDAS 3D-MDs.
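The Shannon-entropy variability analysis mentioned in this abstract can be sketched roughly as follows. The abstract does not describe the discretization scheme, so the equal-width binning, the number of bins, and the function name `descriptor_entropy` are all assumptions made for illustration; the descriptor values are invented.

```python
import math
from collections import Counter


def descriptor_entropy(values, n_bins):
    """Shannon entropy (in bits) of one descriptor over a set of molecules.

    The values are discretized into `n_bins` equal-width bins (an assumed
    scheme). A descriptor that takes the same value for every molecule
    carries no information and gets entropy 0; the maximum possible
    value is log2(n_bins).
    """
    lo, hi = min(values), max(values)
    if hi == lo:
        return 0.0
    width = (hi - lo) / n_bins
    # Clamp the largest value into the last bin.
    counts = Counter(min(int((v - lo) / width), n_bins - 1) for v in values)
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


# Hypothetical descriptor columns for the same ten molecules.
degenerate = [5.0] * 10                      # same value everywhere
spread = [0.0, 1.0, 2.0, 3.0]                # one value per bin with 4 bins
print(descriptor_entropy(degenerate, 4))     # 0.0
print(descriptor_entropy(spread, 4))         # 2.0
```

Under this convention, descriptors whose entropy is closer to the maximum distribute the molecules more evenly across the bins, which is one way to read "better distribution patterns".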
Affiliation(s)
- C R García-Jacas
  - a Escuela de Sistemas y Computación, Pontificia Universidad Católica del Ecuador Sede Esmeraldas (PUCESE), Esmeraldas, Ecuador
  - b Grupo de Investigación de Bioinformática, Instituto de Química, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, D.F, México
  - c Grupo de Investigacion de Bioinformatica, Universidad de las Ciencias Informaticas (UCI), La Habana, Cuba
- Y Marrero-Ponce
  - d Grupo de Medicina Molecular y Traslacional (MeM&T), Universidad San Francisco de Quito (USFQ), Quito, Ecuador
  - e Instituto de Simulación Computacional (ISC-USFQ), Universidad San Francisco de Quito (USFQ), Quito, Ecuador
- S J Barigye
  - g Department of Chemistry, McGill University, Montréal, Québec, Canada
- T Hernández-Ortega
  - c Grupo de Investigacion de Bioinformatica, Universidad de las Ciencias Informaticas (UCI), La Habana, Cuba
- L Cabrera-Leyva
  - f Grupo de Investigación de Inteligencia Artificial (AIRES), Universidad de Camagüey, Camagüey, Cuba
- A Fernández-Castillo
  - c Grupo de Investigacion de Bioinformatica, Universidad de las Ciencias Informaticas (UCI), La Habana, Cuba
7. Medina Marrero R, Marrero-Ponce Y, Barigye SJ, Echeverría Díaz Y, Acevedo-Barrios R, Casañola-Martín GM, García Bernal M, Torrens F, Pérez-Giménez F. QuBiLs-MAS method in early drug discovery and rational drug identification of antifungal agents. SAR QSAR Environ Res 2015; 26:943-58. [PMID: 26567876 DOI: 10.1080/1062936x.2015.1104517] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Indexed: 05/21/2023]
Abstract
The QuBiLs-MAS approach is used for the in silico modelling of the antifungal activity of organic molecules. To this effect, non-stochastic (NS) and simple-stochastic (SS) atom-based quadratic indices are used to codify chemical information for a comprehensive dataset of 2478 compounds having a great structural variability, with 1087 of them being antifungal agents, covering the broadest antifungal mechanisms of action known so far. The NS and SS index-based antifungal activity classification models obtained using linear discriminant analysis (LDA) yield correct classification percentages of 90.73% and 92.47%, respectively, for the training set. Additionally, these models are able to correctly classify 92.16% and 87.56% of 706 compounds in an external test set. A comparison of the statistical parameters of the QuBiLs-MAS LDA-based models with those for models reported in the literature reveals comparable to superior performance, although the latter were built over much smaller and less diverse datasets, representing fewer mechanisms of action. It may therefore be inferred that the QuBiLs-MAS method constitutes a valuable tool useful in the design and/or selection of new and broad spectrum agents against life-threatening fungal infections.
Affiliation(s)
- R Medina Marrero
  - a Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatic Research International Network (CAMD-BIR-IN), Cartagena de Indias, Bolivar, Colombia
  - b Department of Microbiology, Chemical Bioactive Center, Central University of Las Villas, Villa Clara, Cuba
- Y Marrero-Ponce
  - a Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatic Research International Network (CAMD-BIR-IN), Cartagena de Indias, Bolivar, Colombia
  - c Grupo de Investigación en Estudios Químicos y Biológicos, Facultad de Ciencias Básicas, Universidad Tecnológica de Bolívar, Cartagena de Indias, Bolívar, Colombia
  - d Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia, Universitat de València, Valencia, Spain
  - h Grupo de Investigación Microbiología y Ambiente (GIMA). Programa de Bacteriología, Facultad Ciencias de la Salud, Universidad de San Buenaventura, Calle Real de Ternera, 130010, Cartagena (Bolivar), Colombia
- S J Barigye
  - a Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatic Research International Network (CAMD-BIR-IN), Cartagena de Indias, Bolivar, Colombia
  - e Departamento de Química, Universidade Federal de Lavras, Lavras, MG, Brazil
- Y Echeverría Díaz
  - a Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatic Research International Network (CAMD-BIR-IN), Cartagena de Indias, Bolivar, Colombia
- R Acevedo-Barrios
  - c Grupo de Investigación en Estudios Químicos y Biológicos, Facultad de Ciencias Básicas, Universidad Tecnológica de Bolívar, Cartagena de Indias, Bolívar, Colombia
- G M Casañola-Martín
  - a Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatic Research International Network (CAMD-BIR-IN), Cartagena de Indias, Bolivar, Colombia
  - d Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia, Universitat de València, Valencia, Spain
  - f Facultad de Ingeniería Ambiental, Universidad Estatal Amazónica, Puyo, Ecuador
- M García Bernal
  - b Department of Microbiology, Chemical Bioactive Center, Central University of Las Villas, Villa Clara, Cuba
- F Torrens
  - g Institut Universitari de Ciència Molecular, Universitat de València, Valencia, Spain
- F Pérez-Giménez
  - d Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia, Universitat de València, Valencia, Spain
8. Brito-Sánchez Y, Castillo-Garit JA, Le-Thi-Thu H, González-Madariaga Y, Torrens F, Marrero-Ponce Y, Rodríguez-Borges JE. Comparative study to predict toxic modes of action of phenols from molecular structures. SAR QSAR Environ Res 2013; 24:235-251. [PMID: 23437773 DOI: 10.1080/1062936x.2013.766260] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Indexed: 06/01/2023]
Abstract
Quantitative structure-activity relationship models for the prediction of the mode of toxic action (MOA) of 221 phenols to the ciliated protozoan Tetrahymena pyriformis using atom-based quadratic indices are reported. The phenols represent a variety of MOAs including polar narcotics, weak acid respiratory uncouplers, pro-electrophiles and soft electrophiles. Linear discriminant analysis (LDA) and four machine learning (ML) techniques, namely k-nearest neighbours (k-NN), support vector machine (SVM), classification trees (CTs) and artificial neural networks (ANNs), have been used to develop several models with high accuracies and predictive capabilities for distinguishing between the four MOAs. Most of them showed a global accuracy of over 90%, and false alarm rate values were below 2.9% for the training set. Cross-validation, complementary subsets and external test set validation were performed, with good behaviour in all cases. Our models compare favourably with other previously published models, and in general the models obtained with ML techniques show better results than those developed with linear techniques. We developed unsupervised and supervised consensus models, and their results were better than those of our ML models, of the rule-based approach and of other previously published ensemble models. This investigation highlights the merits of ML-based techniques as an alternative to other more traditional methods for modelling MOA.
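The two training-set statistics quoted in this abstract, global accuracy and false alarm rate, can be illustrated with a short Python sketch. The abstract does not say how the false alarm rate was defined for a four-class problem, so the one-vs-rest definition FP / (FP + TN) used here is an assumption, as are the function names and the toy labels.

```python
def global_accuracy(observed, predicted):
    """Fraction of compounds assigned to their observed class."""
    hits = sum(o == p for o, p in zip(observed, predicted))
    return hits / len(observed)


def false_alarm_rate(observed, predicted, target):
    """One-vs-rest false alarm rate for class `target`: FP / (FP + TN).

    Only compounds whose observed class is NOT `target` are counted;
    a false alarm is such a compound predicted as `target`.
    """
    negatives = [p for o, p in zip(observed, predicted) if o != target]
    false_positives = sum(p == target for p in negatives)
    return false_positives / len(negatives)


# Hypothetical labels for eight phenols, two per mode of action.
observed = ["polar narcotic", "polar narcotic", "uncoupler", "uncoupler",
            "pro-electrophile", "pro-electrophile",
            "soft electrophile", "soft electrophile"]
predicted = ["polar narcotic", "polar narcotic", "uncoupler", "polar narcotic",
             "pro-electrophile", "pro-electrophile",
             "soft electrophile", "soft electrophile"]

print(global_accuracy(observed, predicted))   # 0.875
for moa in sorted(set(observed)):
    print(moa, false_alarm_rate(observed, predicted, moa))
```

In this toy case one uncoupler is misassigned as a polar narcotic, so only the polar narcotic class has a non-zero false alarm rate (1 of its 6 negatives).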
Affiliation(s)
- Y Brito-Sánchez
  - Unit of Computer-Aided Molecular Biosilico Discovery and Bioinformatic Research, Faculty of Chemistry-Pharmacy, Universidad Central Marta Abreu de Las Villas, Santa Clara, Cuba
9. Barigye SJ, Marrero-Ponce Y, Martínez López Y, Martínez Santiago O, Torrens F, García Domenech R, Galvez J. Event-based criteria in GT-STAF information indices: theory, exploratory diversity analysis and QSPR applications. SAR QSAR Environ Res 2013; 24:3-34. [PMID: 23066866 DOI: 10.1080/1062936x.2012.728994] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Indexed: 06/01/2023]
Abstract
Versatile event-based approaches for the definition of novel information theory-based indices (IFIs) are presented. An event in this context is the criterion followed in the "discovery" of molecular substructures, which in turn serve as the basis for the construction of the generalized incidence and relations frequency matrices, Q and F, respectively. From the resultant F, Shannon's, mutual, conditional and joint entropy-based IFIs are computed. In previous reports, an event named connected subgraphs was presented. The present study extends this notion by introducing other events, namely: terminal paths, vertex path incidence, quantum subgraphs, walks of length k, Sachs subgraphs, MACCS, E-state and substructure fingerprints and, finally, Ghose and Crippen atom-types for hydrophobicity and refractivity. Moreover, we define magnitude-based IFIs, introducing the use of the magnitude criterion in the definition of mutual, conditional and joint entropy-based IFIs. We also discuss the use of information-theoretic parameters as a measure of the dissimilarity of codified structural information of molecules. Finally, a comparison of the statistics for QSPR models obtained with the proposed IFIs and with DRAGON's molecular descriptors for two physicochemical properties, log P and log K, of 34 derivatives of 2-furylethylenes shows that the proposed IFIs have similar or better predictive ability than the latter.
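The entropy quantities named in this abstract (Shannon, joint, conditional and mutual) have standard textbook definitions, sketched below in Python for a generic table of counts. How the paper builds the matrix F and turns it into probabilities is not described in the abstract, so treating the input as a two-way contingency table is an assumption, and `entropy_measures` and the toy matrices are hypothetical.

```python
import math


def entropy(probs):
    """Shannon entropy in bits; zero-probability cells contribute nothing."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def entropy_measures(freq):
    """Textbook entropy measures for a two-way table of counts.

    Rows are outcomes of X, columns outcomes of Y (assumed layout).
    """
    total = sum(sum(row) for row in freq)
    joint = [[f / total for f in row] for row in freq]
    p_x = [sum(row) for row in joint]          # row marginals
    p_y = [sum(col) for col in zip(*joint)]    # column marginals
    h_x, h_y = entropy(p_x), entropy(p_y)
    h_xy = entropy(p for row in joint for p in row)
    return {
        "H(X)": h_x,
        "H(Y)": h_y,
        "H(X,Y)": h_xy,               # joint entropy
        "H(X|Y)": h_xy - h_y,         # conditional entropy
        "I(X;Y)": h_x + h_y - h_xy,   # mutual information
    }


# Perfectly dependent table: knowing Y determines X, so I(X;Y) = 1 bit.
print(entropy_measures([[2, 0], [0, 2]]))
# Independent table: X and Y share no information, so I(X;Y) = 0.
print(entropy_measures([[1, 1], [1, 1]]))
```

The two toy tables bracket the possible behaviour: full dependence gives zero conditional entropy, while independence gives zero mutual information.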
Affiliation(s)
- S J Barigye
  - Faculty of Chemistry-Pharmacy, Universidad Central Marta Abreu de Las Villas, Villa Clara, Cuba