1
|
Barigye SJ, Gómez-Ganau S, Serrano-Candelas E, Gozalbes R. PeptiDesCalculator: Software for computation of peptide descriptors. Definition, implementation and case studies for 9 bioactivity endpoints. Proteins 2020; 89:174-184. [PMID: 32881068 DOI: 10.1002/prot.26003] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 08/05/2020] [Accepted: 08/27/2020] [Indexed: 11/09/2022]
Abstract
We present a novel Java-based program denominated PeptiDesCalculator for computing peptide descriptors. These descriptors include: redefinitions of known protein parameters to suite the peptide domain, generalization schemes for the global descriptions of peptide characteristics, as well as empirical descriptors based on experimental evidence on peptide stability and interaction propensity. The PeptiDesCalculator software provides a user-friendly Graphical User Interface (GUI) and is parallelized to maximize the use of computational resources available in current work stations. The PeptiDesCalculator indices are employed in modeling 8 peptide bioactivity endpoints demonstrating satisfactory behavior. Moreover, we compare the performance of a support vector machine (SVM) classifier built using 15 PeptiDesCalculator indices with that of a recently reported deep neural network (DNN) antimicrobial activity classifier, demonstrating comparable test set performance notwithstanding the remarkably lower degree of freedom for the former. This software will facilitate the development of in silico models for the prediction of peptide properties.
Collapse
Affiliation(s)
- Stephen J Barigye
- ProtoQSAR SL, Centro Europeo de Empresas Innovadoras (CEEI), Parque Tecnológico de Valencia, Valencia, Spain.,MolDrug AI Systems SL, Valencia, Spain
| | - Sergi Gómez-Ganau
- ProtoQSAR SL, Centro Europeo de Empresas Innovadoras (CEEI), Parque Tecnológico de Valencia, Valencia, Spain.,Eurofins Agroscience Services Regulatory Spain SL, Valencia, Spain
| | - Eva Serrano-Candelas
- ProtoQSAR SL, Centro Europeo de Empresas Innovadoras (CEEI), Parque Tecnológico de Valencia, Valencia, Spain
| | - Rafael Gozalbes
- ProtoQSAR SL, Centro Europeo de Empresas Innovadoras (CEEI), Parque Tecnológico de Valencia, Valencia, Spain.,MolDrug AI Systems SL, Valencia, Spain
| |
Collapse
|
3
|
Martínez-Santiago O, Marrero-Ponce Y, Vivas-Reyes R, Rivera-Borroto OM, Hurtado E, Treto-Suarez MA, Ramos Y, Vergara-Murillo F, Orozco-Ugarriza ME, Martínez-López Y. Exploring the QSAR's predictive truthfulness of the novel N-tuple discrete derivative indices on benchmark datasets. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2017; 28:367-389. [PMID: 28590848 DOI: 10.1080/1062936x.2017.1326403] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/06/2017] [Accepted: 04/27/2017] [Indexed: 06/07/2023]
Abstract
Graph derivative indices (GDIs) have recently been defined over N-atoms (N = 2, 3 and 4) simultaneously, which are based on the concept of derivatives in discrete mathematics (finite difference), metaphorical to the derivative concept in classical mathematical analysis. These molecular descriptors (MDs) codify topo-chemical and topo-structural information based on the concept of the derivative of a molecular graph with respect to a given event (S) over duplex, triplex and quadruplex relations of atoms (vertices). These GDIs have been successfully applied in the description of physicochemical properties like reactivity, solubility and chemical shift, among others, and in several comparative quantitative structure activity/property relationship (QSAR/QSPR) studies. Although satisfactory results have been obtained in previous modelling studies with the aforementioned indices, it is necessary to develop new, more rigorous analysis to assess the true predictive performance of the novel structure codification. So, in the present paper, an assessment and statistical validation of the performance of these novel approaches in QSAR studies are executed, as well as a comparison with those of other QSAR procedures reported in the literature. To achieve the main aim of this research, QSARs were developed on eight chemical datasets widely used as benchmarks in the evaluation/validation of several QSAR methods and/or many different MDs (fundamentally 3D MDs). Three to seven variable QSAR models were built for each chemical dataset, according to the original dissection into training/test sets. The models were developed by using multiple linear regression (MLR) coupled with a genetic algorithm as the feature wrapper selection technique in the MobyDigs software. Each family of GDIs (for duplex, triplex and quadruplex) behaves similarly in all modelling, although there were some exceptions. However, when all families were used in combination, the results achieved were quantitatively higher than those reported by other authors in similar experiments. Comparisons with respect to external correlation coefficients (q2ext) revealed that the models based on GDIs possess superior predictive ability in seven of the eight datasets analysed, outperforming methodologies based on similar or more complex techniques and confirming the good predictive power of the obtained models. For the q2ext values, the non-parametric comparison revealed significantly different results to those reported so far, which demonstrated that the models based on DIVATI's indices presented the best global performance and yielded significantly better predictions than the 12 0-3D QSAR procedures used in the comparison. Therefore, GDIs are suitable for structure codification of the molecules and constitute a good alternative to build QSARs for the prediction of physicochemical, biological and environmental endpoints.
Collapse
Affiliation(s)
- O Martínez-Santiago
- a Department of Chemical Sciences , Central University 'Martha Abreu' of Las Villas , Santa Clara , Cuba
- b Unit of Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatics Research International Network (CAMD-BIR IN) , Quito , Ecuador
- c Group of Quantum and Theoretical Chemistry , University of Cartagena , Cartagena de Indias , Colombia
- d Facultad de Ingeniería , Grupo CipTec, Fundación Universitaria Tecnológico Comfenalco , Cartagena de Indias , Colombia
| | - Y Marrero-Ponce
- b Unit of Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatics Research International Network (CAMD-BIR IN) , Quito , Ecuador
- e Escuela de Medicina, Edificio de Especialidades Médicas , Universidad San Francisco de Quito (USFQ), Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA) , Av. Interoceánica Km 12 ½, Cumbayá , Ecuador
- f Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica , Quito , Ecuador
- g Grupo de Investigación Ambiental (GIA) , Fundación Universitaria Tecnológico de Comfenalco , Cartagena de Indias , Colombia
| | - R Vivas-Reyes
- c Group of Quantum and Theoretical Chemistry , University of Cartagena , Cartagena de Indias , Colombia
- d Facultad de Ingeniería , Grupo CipTec, Fundación Universitaria Tecnológico Comfenalco , Cartagena de Indias , Colombia
| | - O M Rivera-Borroto
- b Unit of Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatics Research International Network (CAMD-BIR IN) , Quito , Ecuador
- h Departamento de Química Física Aplicada , Universidad Autónoma de Madrid (UAM) , Madrid , España
| | - E Hurtado
- b Unit of Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatics Research International Network (CAMD-BIR IN) , Quito , Ecuador
| | - M A Treto-Suarez
- i Center of Applied Nanosciences (CENAP), Andres Bello University , Chile
| | - Y Ramos
- j Department of Economic Sciences , University of Camagüey , Camagüey , Cuba
| | - F Vergara-Murillo
- c Group of Quantum and Theoretical Chemistry , University of Cartagena , Cartagena de Indias , Colombia
- d Facultad de Ingeniería , Grupo CipTec, Fundación Universitaria Tecnológico Comfenalco , Cartagena de Indias , Colombia
| | - M E Orozco-Ugarriza
- k Seccional Cartagena y Grupo de Investigación Traslacional en Biomedicina & Biotecnología - GITB&B , Universidad del Sinú - Elías Bechara Zainúm , Cartagena de Indias , Colombia
| | - Y Martínez-López
- b Unit of Computer-Aided Molecular 'Biosilico' Discovery and Bioinformatics Research International Network (CAMD-BIR IN) , Quito , Ecuador
- l Grupo de Investigación de Inteligencia Artificial (AIRES) , Universidad de Camagüey , Camagüey , Cuba
| |
Collapse
|
4
|
Martínez-Santiago O, Marrero-Ponce Y, Barigye SJ, Le Thi Thu H, Torres FJ, Zambrano CH, Muñiz Olite JL, Cruz-Monteagudo M, Vivas-Reyes R, Vázquez Infante L, Artiles Martínez LM. Physico-Chemical and Structural Interpretation of Discrete Derivative Indices on N-Tuples Atoms. Int J Mol Sci 2016; 17:ijms17060812. [PMID: 27240357 PMCID: PMC4926346 DOI: 10.3390/ijms17060812] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2016] [Revised: 04/27/2016] [Accepted: 05/04/2016] [Indexed: 11/28/2022] Open
Abstract
This report examines the interpretation of the Graph Derivative Indices (GDIs) from three different perspectives (i.e., in structural, steric and electronic terms). It is found that the individual vertex frequencies may be expressed in terms of the geometrical and electronic reactivity of the atoms and bonds, respectively. On the other hand, it is demonstrated that the GDIs are sensitive to progressive structural modifications in terms of: size, ramifications, electronic richness, conjugation effects and molecular symmetry. Moreover, it is observed that the GDIs quantify the interaction capacity among molecules and codify information on the activation entropy. A structure property relationship study reveals that there exists a direct correspondence between the individual frequencies of atoms and Hückel’s Free Valence, as well as between the atomic GDIs and the chemical shift in NMR, which collectively validates the theory that these indices codify steric and electronic information of the atoms in a molecule. Taking in consideration the regularity and coherence found in experiments performed with the GDIs, it is possible to say that GDIs possess plausible interpretation in structural and physicochemical terms.
Collapse
Affiliation(s)
- Oscar Martínez-Santiago
- Computer-Aided Molecular "Biosilico" Discovery and Bioinformatic Research International Network (CAMD-BIR IN), Cumbayá-Tumbaco, Quito 170184, Ecuador.
- Department of Chemical Science, Faculty of Chemistry-Pharmacy, Universidad Central "Martha Abreu" de Las Villas, Santa Clara 54830, Villa Clara, Cuba.
| | - Yovani Marrero-Ponce
- Computer-Aided Molecular "Biosilico" Discovery and Bioinformatic Research International Network (CAMD-BIR IN), Cumbayá-Tumbaco, Quito 170184, Ecuador.
- Universidad San Francisco de Quito (USFQ), Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas, Hospital de los Valles, Av. Interoceánica Km 12 ½-Cumbayá, Quito 170157, Ecuador.
- Universidad San Francisco de Quito (USFQ), Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Ecuador.
- Grupo de Investigación Microbiología y Ambiente (GIMA), Programa de Bacteriología, Facultad Ciencias de la Salud, Universidad de San Buenaventura, Calle Real de Ternera, Cartagena de Indias, Bolívar 130010, Colombia.
| | - Stephen J Barigye
- Computer-Aided Molecular "Biosilico" Discovery and Bioinformatic Research International Network (CAMD-BIR IN), Cumbayá-Tumbaco, Quito 170184, Ecuador.
- Departamento de Química, Universidade Federal de Lavras (UFLA), Caixa Postal 3037, Lavras 37200-000, MG, Brazil.
| | - Huong Le Thi Thu
- School of Medicine and Pharmacy, Vietnam National University, Hanoi (VNU) 144 Xuan Thuy, Cau Giay, Hanoi 100000, Vietnam.
| | - F Javier Torres
- Universidad San Francisco de Quito (USFQ), Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Ecuador.
- Universidad San Francisco de Quito (USFQ), Grupo de Química Computacional y Teórica (QCT-USFQ), Departamento de Ingeniería Química, Diego de Robles y Vía Interoceánica, Quito 170157, Ecuador.
| | - Cesar H Zambrano
- Universidad San Francisco de Quito (USFQ), Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Ecuador.
- Universidad San Francisco de Quito (USFQ), Grupo de Química Computacional y Teórica (QCT-USFQ), Departamento de Ingeniería Química, Diego de Robles y Vía Interoceánica, Quito 170157, Ecuador.
| | - Jorge L Muñiz Olite
- Grupo de Investigación en Estudios Químicos y Biológicos, Facultad de Ciencias Básicas, Universidad Tecnológica de Bolívar (UTB), Parque Industrial y Tecnológico Carlos Vélez Pombo Km 1 vía Turbaco, Cartagena de Indias, Bolívar 130010, Colombia.
| | - Maykel Cruz-Monteagudo
- Instituto de Investigaciones Biomédicas (IIB), Universidad de Las Américas (UDLA), Quito 170513, Ecuador.
| | - Ricardo Vivas-Reyes
- Grupo de Química Cuántica y Teórica, Facultad de Ciencias, Universidad de Cartagena, Cartagena de Indias, Bolívar 130001, Colombia.
- Grupo CipTec, Fundación Universitaria Tecnológico de Comfenalco, Facultad de Ingenierías, Programa de Ingeniería de Procesos, Cartagena de Indias, Bolívar 130001, Colombia.
| | - Liliana Vázquez Infante
- Computer-Aided Molecular "Biosilico" Discovery and Bioinformatic Research International Network (CAMD-BIR IN), Cumbayá-Tumbaco, Quito 170184, Ecuador.
- Department of Chemical Science, Faculty of Chemistry-Pharmacy, Universidad Central "Martha Abreu" de Las Villas, Santa Clara 54830, Villa Clara, Cuba.
| | - Luis M Artiles Martínez
- Computer-Aided Molecular "Biosilico" Discovery and Bioinformatic Research International Network (CAMD-BIR IN), Cumbayá-Tumbaco, Quito 170184, Ecuador.
| |
Collapse
|