Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Ruiz-Blanco YB, Paz W, Green J, Marrero-Ponce Y. ProtDCal: A program to compute general-purpose-numerical descriptors for sequences and 3D-structures of proteins. BMC Bioinformatics 2015;16:162. [PMID: 25982853 PMCID: PMC4432771 DOI: 10.1186/s12859-015-0586-0] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2015] [Accepted: 04/22/2015] [Indexed: 11/10/2022] Open

For:	Ruiz-Blanco YB, Paz W, Green J, Marrero-Ponce Y. ProtDCal: A program to compute general-purpose-numerical descriptors for sequences and 3D-structures of proteins. BMC Bioinformatics 2015;16:162. [PMID: 25982853 PMCID: PMC4432771 DOI: 10.1186/s12859-015-0586-0] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2015] [Accepted: 04/22/2015] [Indexed: 11/10/2022] Open

Number

Cited by Other Article(s)

Bhattarai S, Tayara H, Chong KT. Advancing Peptide-Based Cancer Therapy with AI: In-Depth Analysis of State-of-the-Art AI Models. J Chem Inf Model 2024;64:4941-4957. [PMID: 38874445 DOI: 10.1021/acs.jcim.4c00295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2024]

Han Y, Zhang H, Zeng Z, Liu Z, Lu D, Liu Z. Descriptor-augmented machine learning for enzyme-chemical interaction predictions. Synth Syst Biotechnol 2024;9:259-268. [PMID: 38450325 PMCID: PMC10915406 DOI: 10.1016/j.synbio.2024.02.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Revised: 02/21/2024] [Accepted: 02/22/2024] [Indexed: 03/08/2024] Open

Abstract

Descriptors play a pivotal role in enzyme design for the greener synthesis of biochemicals, as they could characterize enzymes and chemicals from the physicochemical and evolutionary perspective. This study examined the effects of various descriptors on the performance of Random Forest model used for enzyme-chemical relationships prediction. We curated activity data of seven specific enzyme families from the literature and developed the pipeline for evaluation the machine learning model performance using 10-fold cross-validation. The influence of protein and chemical descriptors was assessed in three scenarios, which were predicting the activity of unknown relations between known enzymes and known chemicals (new relationship evaluation), predicting the activity of novel enzymes on known chemicals (new enzyme evaluation), and predicting the activity of new chemicals on known enzymes (new chemical evaluation). The results showed that protein descriptors significantly enhanced the classification performance of model on new enzyme evaluation in three out of the seven datasets with the greatest number of enzymes, whereas chemical descriptors appear no effect. A variety of sequence-based and structure-based protein descriptors were constructed, among which the esm-2 descriptor achieved the best results. Using enzyme families as labels showed that descriptors could cluster proteins well, which could explain the contributions of descriptors to the machine learning model. As a counterpart, in the new chemical evaluation, chemical descriptors made significant improvement in four out of the seven datasets, while protein descriptors appear no effect. We attempted to evaluate the generalization ability of the model by correlating the statistics of the datasets with the performance of the models. The results showed that datasets with higher sequence similarity were more likely to get better results in the new enzyme evaluation and datasets with more enzymes were more likely beneficial from the protein descriptor strategy. This work provides guidance for the development of machine learning models for specific enzyme families.

Collapse

Michalik I, Kuder KJ. Machine Learning Methods in Protein-Protein Docking. Methods Mol Biol 2024;2780:107-126. [PMID: 38987466 DOI: 10.1007/978-1-0716-3985-6_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]

Jarończyk M. Software for Predicting Binding Free Energy of Protein-Protein Complexes and Their Mutants. Methods Mol Biol 2024;2780:139-147. [PMID: 38987468 DOI: 10.1007/978-1-0716-3985-6_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]

Nath A, Chaube R. Mining Chemogenomic Spaces for Prediction of Drug-Target Interactions. Methods Mol Biol 2024;2714:155-169. [PMID: 37676598 DOI: 10.1007/978-1-0716-3441-7_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/08/2023]

Durairaj J, de Ridder D, van Dijk AD. Beyond sequence: Structure-based machine learning. Comput Struct Biotechnol J 2022;21:630-643. [PMID: 36659927 PMCID: PMC9826903 DOI: 10.1016/j.csbj.2022.12.039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 12/21/2022] [Accepted: 12/21/2022] [Indexed: 12/31/2022] Open

ABP-Finder: A Tool to Identify Antibacterial Peptides and the Gram-Staining Type of Targeted Bacteria. Antibiotics (Basel) 2022;11:antibiotics11121708. [PMID: 36551365 PMCID: PMC9774453 DOI: 10.3390/antibiotics11121708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 11/16/2022] [Accepted: 11/17/2022] [Indexed: 11/29/2022] Open

Yang Y, Zhao J, Zeng L, Vihinen M. ProTstab2 for Prediction of Protein Thermal Stabilities. Int J Mol Sci 2022;23:ijms231810798. [PMID: 36142711 PMCID: PMC9505338 DOI: 10.3390/ijms231810798] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Revised: 09/12/2022] [Accepted: 09/13/2022] [Indexed: 11/16/2022] Open

Agüero-Chapin G, Galpert-Cañizares D, Domínguez-Pérez D, Marrero-Ponce Y, Pérez-Machado G, Teijeira M, Antunes A. Emerging Computational Approaches for Antimicrobial Peptide Discovery. Antibiotics (Basel) 2022;11:antibiotics11070936. [PMID: 35884190 PMCID: PMC9311958 DOI: 10.3390/antibiotics11070936] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 07/01/2022] [Accepted: 07/08/2022] [Indexed: 02/05/2023] Open

Affiliation(s)

Guillermin Agüero-Chapin CIIMAR—Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal; Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 4169-007 Porto, Portugal Correspondence: (G.A.-C.); (A.A.); Tel.: +351-22-340-1813 (G.A.-C. & A.A.)
Deborah Galpert-Cañizares Departamento de Ciencia de la Computación, Universidad Central Marta Abreu de Las Villas (UCLV), Santa Clara 54830, Cuba;
Dany Domínguez-Pérez CIIMAR—Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal; Proquinorte, Unipessoal, Lda, Avenida 5 de Outubro, 124, 7º Piso, Avenidas Novas, 1050-061 Lisboa, Portugal
Yovani Marrero-Ponce Universidad San Francisco de Quito (USFQ), Grupo de Medicina Molecular y Translacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas and Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Ecuador;
Gisselle Pérez-Machado EpiDisease S.L—Spin-Off of Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER), 46980 Valencia, Spain;
Marta Teijeira Departamento de Química Orgánica, Facultade de Química, Universidade de Vigo, 36310 Vigo, Spain; Instituto de Investigación Sanitaria Galicia Sur, Hospital Álvaro Cunqueiro, 36213 Vigo, Spain
Agostinho Antunes CIIMAR—Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal; Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 4169-007 Porto, Portugal Correspondence: (G.A.-C.); (A.A.); Tel.: +351-22-340-1813 (G.A.-C. & A.A.)

Collapse

Romero-Molina S, Ruiz-Blanco YB, Mieres-Perez J, Harms M, Münch J, Ehrmann M, Sanchez-Garcia E. PPI-Affinity: A Web Tool for the Prediction and Optimization of Protein-Peptide and Protein-Protein Binding Affinity. J Proteome Res 2022;21:1829-1841. [PMID: 35654412 PMCID: PMC9361347 DOI: 10.1021/acs.jproteome.2c00020] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Quevedo-Tumailli V, Ortega-Tenezaca B, González-Díaz H. IFPTML Mapping of Drug Graphs with Protein and Chromosome Structural Networks vs. Pre-Clinical Assay Information for Discovery of Antimalarial Compounds. Int J Mol Sci 2021;22:ijms222313066. [PMID: 34884870 PMCID: PMC8657696 DOI: 10.3390/ijms222313066] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2021] [Revised: 11/23/2021] [Accepted: 11/24/2021] [Indexed: 11/16/2022] Open

Abstract

The parasite species of genus Plasmodium causes Malaria, which remains a major global health problem due to parasite resistance to available Antimalarial drugs and increasing treatment costs. Consequently, computational prediction of new Antimalarial compounds with novel targets in the proteome of Plasmodium sp. is a very important goal for the pharmaceutical industry. We can expect that the success of the pre-clinical assay depends on the conditions of assay per se, the chemical structure of the drug, the structure of the target protein to be targeted, as well as on factors governing the expression of this protein in the proteome such as genes (Deoxyribonucleic acid, DNA) sequence and/or chromosomes structure. However, there are no reports of computational models that consider all these factors simultaneously. Some of the difficulties for this kind of analysis are the dispersion of data in different datasets, the high heterogeneity of data, etc. In this work, we analyzed three databases ChEMBL (Chemical database of the European Molecular Biology Laboratory), UniProt (Universal Protein Resource), and NCBI-GDV (National Center for Biotechnology Information—Genome Data Viewer) to achieve this goal. The ChEMBL dataset contains outcomes for 17,758 unique assays of potential Antimalarial compounds including numeric descriptors (variables) for the structure of compounds as well as a huge amount of information about the conditions of assays. The NCBI-GDV and UniProt datasets include the sequence of genes, proteins, and their functions. In addition, we also created two partitions (c_assayj = c_aj and c_dataj = cd_j) of categorical variables from theChEMBL dataset. These partitions contain variables that encode information about experimental conditions of preclinical assays (c_aj) or about the nature and quality of data (c_dj). These categorical variables include information about 22 parameters of biological activity (c_a0), 28 target proteins (c_a1), and 9 organisms of assay (c_a2), etc. We also created another partition of (c_protj = c_pj) including categorical variables with biological information about the target proteins, genes, and chromosomes. These variables cover32 genes (c_p0), 10 chromosomes (c_p1), gene orientation (c_p2), and 31 protein functions (c_p3). We used a Perturbation-Theory Machine Learning Information Fusion (IFPTML) algorithm to map all this information (from three databases) into and train a predictive model. Shannon’s entropy measure Sh_k (numerical variables) was used to quantify the information about the structure of drugs, protein sequences, gene sequences, and chromosomes in the same information scale. Perturbation Theory Operators (PTOs) with the form of Moving Average (MA) operators have been used to quantify perturbations (deviations) in the structural variables with respect to their expected values for different subsets (partitions) of categorical variables. We obtained three IFPTML models using General Discriminant Analysis (GDA), Classification Tree with Univariate Splits (CTUS), and Classification Tree with Linear Combinations (CTLC). The IFPTML-CTLC presented the better performance with Sensitivity Sn(%) = 83.6/85.1, and Specificity Sp(%) = 89.8/89.7 for training/validation sets, respectively. This model could become a useful tool for the optimization of preclinical assays of new Antimalarial compounds vs. different proteins in the proteome of Plasmodium.

Collapse

PTML modeling for peptide discovery: in silico design of non-hemolytic peptides with antihypertensive activity. Mol Divers 2021;26:2523-2534. [PMID: 34802116 DOI: 10.1007/s11030-021-10350-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Accepted: 11/05/2021] [Indexed: 01/19/2023]

Proteome-wide Prediction of Lysine Methylation Leads to Identification of H2BK43 Methylation and Outlines the Potential Methyllysine Proteome. Cell Rep 2021;32:107896. [PMID: 32668242 DOI: 10.1016/j.celrep.2020.107896] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 04/29/2020] [Accepted: 06/22/2020] [Indexed: 12/15/2022] Open

Ruiz-Blanco YB, Ávila-Barrientos LP, Hernández-García E, Antunes A, Agüero-Chapin G, García-Hernández E. Engineering protein fragments via evolutionary and protein-protein interaction algorithms: de novo design of peptide inhibitors for F_O F₁ -ATP synthase. FEBS Lett 2020;595:183-194. [PMID: 33151544 DOI: 10.1002/1873-3468.13988] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 10/23/2020] [Accepted: 10/30/2020] [Indexed: 11/08/2022]

Kumar A, Dubey R, Singhai S, Konar AD, Basu A. Structural characterization with light scattering: A tool for rationally designing protein formulations. Anal Biochem 2020;609:113979. [PMID: 33035463 DOI: 10.1016/j.ab.2020.113979] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2020] [Revised: 09/22/2020] [Accepted: 09/28/2020] [Indexed: 11/15/2022]

Mou Z, Eakes J, Cooper CJ, Foster CM, Standaert RF, Podar M, Doktycz MJ, Parks JM. Machine learning‐based prediction of enzyme substrate scope: Application to bacterial nitrilases. Proteins 2020;89:336-347. [DOI: 10.1002/prot.26019] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Revised: 09/02/2020] [Accepted: 10/17/2020] [Indexed: 01/11/2023]

Karlberg M, de Souza JV, Fan L, Kizhedath A, Bronowska AK, Glassey J. QSAR Implementation for HIC Retention Time Prediction of mAbs Using Fab Structure: A Comparison between Structural Representations. Int J Mol Sci 2020;21:ijms21218037. [PMID: 33126648 PMCID: PMC7663183 DOI: 10.3390/ijms21218037] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Revised: 10/22/2020] [Accepted: 10/27/2020] [Indexed: 12/19/2022] Open

Aguilera-Mendoza L, Marrero-Ponce Y, García-Jacas CR, Chavez E, Beltran JA, Guillen-Ramirez HA, Brizuela CA. Automatic construction of molecular similarity networks for visual graph mining in chemical space of bioactive peptides: an unsupervised learning approach. Sci Rep 2020;10:18074. [PMID: 33093586 PMCID: PMC7583304 DOI: 10.1038/s41598-020-75029-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2020] [Accepted: 09/23/2020] [Indexed: 12/15/2022] Open

Abstract

The increasing interest in bioactive peptides with therapeutic potentials has been reflected in a large variety of biological databases published over the last years. However, the knowledge discovery process from these heterogeneous data sources is a nontrivial task, becoming the essence of our research endeavor. Therefore, we devise a unified data model based on molecular similarity networks for representing a chemical reference space of bioactive peptides, having an implicit knowledge that is currently not explicitly accessed in existing biological databases. Indeed, our main contribution is a novel workflow for the automatic construction of such similarity networks, enabling visual graph mining techniques to uncover new insights from the "ocean" of known bioactive peptides. The workflow presented here relies on the following sequential steps: (i) calculation of molecular descriptors by applying statistical and aggregation operators on amino acid property vectors; (ii) a two-stage unsupervised feature selection method to identify an optimized subset of descriptors using the concepts of entropy and mutual information; (iii) generation of sparse networks where nodes represent bioactive peptides, and edges between two nodes denote their pairwise similarity/distance relationships in the defined descriptor space; and (iv) exploratory analysis using visual inspection in combination with clustering and network science techniques. For practical purposes, the proposed workflow has been implemented in our visual analytics software tool ( http://mobiosd-hub.com/starpep/ ), to assist researchers in extracting useful information from an integrated collection of 45120 bioactive peptides, which is one of the largest and most diverse data in its field. Finally, we illustrate the applicability of the proposed workflow for discovering central nodes in molecular similarity networks that may represent a biologically relevant chemical space known to date.

Collapse

Poot Velez AH, Fontove F, Del Rio G. Protein-Protein Interactions Efficiently Modeled by Residue Cluster Classes. Int J Mol Sci 2020;21:E4787. [PMID: 32640745 PMCID: PMC7370293 DOI: 10.3390/ijms21134787] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2020] [Revised: 06/20/2020] [Accepted: 06/28/2020] [Indexed: 01/22/2023] Open

Youmans M, Spainhour JCG, Qiu P. Classification of Antibacterial Peptides Using Long Short-Term Memory Recurrent Neural Networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020;17:1134-1140. [PMID: 30843849 DOI: 10.1109/tcbb.2019.2903800] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Romero-Molina S, Ruiz-Blanco YB, Green JR, Sanchez-Garcia E. ProtDCal-Suite: A web server for the numerical codification and functional analysis of proteins. Protein Sci 2020;28:1734-1743. [PMID: 31271472 DOI: 10.1002/pro.3673] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2019] [Revised: 06/21/2019] [Accepted: 06/24/2019] [Indexed: 12/24/2022]

Contreras-Torres E, Marrero-Ponce Y, Terán JE, García-Jacas CR, Brizuela CA, Sánchez-Rodríguez JC. MuLiMs-MCoMPAs: A Novel Multiplatform Framework to Compute Tensor Algebra-Based Three-Dimensional Protein Descriptors. J Chem Inf Model 2020;60:1042-1059. [PMID: 31663741 DOI: 10.1021/acs.jcim.9b00629] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Abstract

This report introduces the MuLiMs-MCoMPAs software (acronym for Multi-Linear Maps based on N-Metric and Contact Matrices of 3D Protein and Amino-acid weightings), designed to compute tensor-based 3D protein structural descriptors by applying two- and three-linear algebraic forms. Moreover, these descriptors contemplate generalizing components such as novel 3D protein structural representations, (dis)similarity metrics, and multimetrics to extract geometrical related information between two and three amino acids, weighting schemes based on amino acid properties, matrix normalization procedures that consider simple-stochastic and mutual probability transformations, topological and geometrical cutoffs, amino acid, and group-based MD calculations, and aggregation operators for merging amino acidic and group MDs. The MuLiMs-MCoMPAs software, which belongs to the ToMoCoMD-CAMPS suite, was developed in Java (version 1.8) using the Chemistry Development Kit (CDK) (version 1.4.19) and the Jmol libraries. This software implemented a divide-and-conquer strategy to parallelize the computation of the indices as well as modules for data preprocessing and batch computing functionalities. Furthermore, it consists of two components: (i) a desktop-graphical user interface (GUI) and (ii) an API library. The relevance of this novel approach is demonstrated through two analyses that considered Shannon's entropy-based variability and a principal component analysis. These studies showed that the MuLiMs-MCoMPAs' three-linear descriptor family contains higher informational entropy than several other descriptors generated with available computation tools. Moreover, the MuLiMs-MCoMPAs indices capture additional orthogonal information to the one codified by the available calculation approaches. As a result, two sets of suggested theoretical configurations that contain 13648 two-linear indices and 20263 three-linear indices are available for download at tomocomd.com . Furthermore, as a demonstration of the applicability and easy integration of the MuLiMs library into a QSAR-based expert system, a software application (ProStAF) was generated to predict SCOP protein structural classes and folding rate. It can thus be anticipated that the MuLiMs-MCoMPAs framework will turn into a valuable contribution to the chem- and bioinformatics research fields.

Collapse

Affiliation(s)

Ernesto Contreras-Torres Computer-Aided Molecular "Biosilico" Discovery and Bioinformatics Research International Network (CAMD-BIR IN) , Cumbayá, Quito , Ecuador.,Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas; and Instituto de Simulación Computacional (ISC-USFQ) , Universidad San Francisco de Quito (USFQ) , Diego de Robles y vía Interoceánica , Quito 170157 , Pichincha , Ecuador
Yovani Marrero-Ponce Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas; and Instituto de Simulación Computacional (ISC-USFQ) , Universidad San Francisco de Quito (USFQ) , Diego de Robles y vía Interoceánica , Quito 170157 , Pichincha , Ecuador.,Grupo GINUMED, Facultad de Salud, Programa de Medicina , Corporacion Universitaria Rafal Nuñez , Cartagena , Colombia.,Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia , Universitat de València , 46010 Valéncia , Spain
Julio E Terán Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas; and Instituto de Simulación Computacional (ISC-USFQ) , Universidad San Francisco de Quito (USFQ) , Diego de Robles y vía Interoceánica , Quito 170157 , Pichincha , Ecuador.,Grupo de Química Computacional y Teórica, Departamento de Ingeniería Química , Universidad San Francisco de Quito (USFQ) , Diego de Robles y vía Interoceánica , Quito 170157 , Pichincha Ecuador
César R García-Jacas Cátedras Conacyt-Departamento de Ciencias de la Computación , Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE) , Ensenada , Baja California , México
Carlos A Brizuela Departamento de Ciencias de la Computación , Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE) , Ensenada , Baja California , México
Juan Carlos Sánchez-Rodríguez Dirección de Tecnología , Universidad de las Ciencias Informáticas (UCI) , La Habana , Cuba

Collapse

García-Jacas CR, Marrero-Ponce Y, Vivas-Reyes R, Suárez-Lezcano J, Martinez-Rios F, Terán JE, Aguilera-Mendoza L. Distributed and multicore QuBiLS-MIDAS software v2.0: Computing chiral, fuzzy, weighted and truncated geometrical molecular descriptors based on tensor algebra. J Comput Chem 2020;41:1209-1227. [PMID: 32058625 DOI: 10.1002/jcc.26167] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2019] [Revised: 01/22/2020] [Accepted: 01/26/2020] [Indexed: 12/12/2022]

Abstract

Advances to the distributed, multi-core and fully cross-platform QuBiLS-MIDAS software v2.0 (http://tomocomd.com/qubils-midas) are reported in this article since the v1.0 release. The QuBiLS-MIDAS software is the only one that computes atom-pair and alignment-free geometrical MDs (3D-MDs) from several distance metrics other than the Euclidean distance, as well as alignment-free 3D-MDs that codify structural information regarding the relations among three and four atoms of a molecule. The most recent features added to the QuBiLS-MIDAS software v2.0 are related (a) to the calculation of atomic weightings from indices based on the vertex-degree invariant (e.g., Alikhanidi index); (b) to consider central chirality during the molecular encoding; (c) to use measures based on clustering methods and statistical functions to codify structural information among more than two atoms; (d) to the use of a novel method based on fuzzy membership functions to spherically truncate inter-atomic relations; and (e) to the use of weighted and fuzzy aggregation operators to compute global 3D-MDs according to the importance and/or interrelation of the atoms of a molecule during the molecular encoding. Moreover, a novel module to compute QuBiLS-MIDAS 3D-MDs from their headings was also developed. This module can be used either by the graphical user interface or by means of the software library. By using the library, both the predictive models built with the QuBiLS-MIDAS 3D-MDs and the QuBiLS-MIDAS 3D-MDs calculation can be embedded in other tools. A set of predefined QuBiLS-MIDAS 3D-MDs with high information content and low redundancy on a set comprised of 20,469 compounds is also provided to be employed in further cheminformatics tasks. This set of predefined 3D-MDs evidenced better performance than all the universe of Dragon (v5.5) and PaDEL 0D-to-3D MDs in variability studies, whereas a linear independence study proved that these QuBiLS-MIDAS 3D-MDs codify chemical information orthogonal to the Dragon 0D-to-3D MDs. This set of predefined 3D-MDs would be periodically updated as long as new results be achieved. In general, this report highlights our continued efforts to provide a better tool for a most suitable characterization of compounds, and in this way, to contribute to obtaining better outcomes in future applications.

Collapse

Gentiluomo L, Svilenov HL, Augustijn D, El Bialy I, Greco ML, Kulakova A, Indrakumar S, Mahapatra S, Morales MM, Pohl C, Roche A, Tosstorff A, Curtis R, Derrick JP, Nørgaard A, Khan TA, Peters GHJ, Pluen A, Rinnan Å, Streicher WW, van der Walle CF, Uddin S, Winter G, Roessner D, Harris P, Frieß W. Advancing Therapeutic Protein Discovery and Development through Comprehensive Computational and Biophysical Characterization. Mol Pharm 2020;17:426-440. [PMID: 31790599 DOI: 10.1021/acs.molpharmaceut.9b00852] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]

Affiliation(s)

Lorenzo Gentiluomo Wyatt Technology Europe GmbH , Hochstrasse 18 , 56307 Dernbach , Germany.,Department of Pharmacy, Pharmaceutical Technology and Biopharmaceutics , Ludwig-Maximilians-Universitaet Muenchen , Butenandtstrasse 5 , 81377 Munich , Germany
Hristo L Svilenov Department of Pharmacy, Pharmaceutical Technology and Biopharmaceutics , Ludwig-Maximilians-Universitaet Muenchen , Butenandtstrasse 5 , 81377 Munich , Germany
Dillen Augustijn Department of Food Science, Faculty of Science , Copenhagen University , Rolighedsvej 26 , 1958 Frederiksberg , Denmark
Inas El Bialy Department of Pharmacy, Pharmaceutical Technology and Biopharmaceutics , Ludwig-Maximilians-Universitaet Muenchen , Butenandtstrasse 5 , 81377 Munich , Germany
Maria Laura Greco Dosage Form Design and Development , AstraZeneca , Sir Aaron Klug Building, Granta Park , Cambridge CB21 6GH , U.K
Alina Kulakova Department of Chemistry , Technical University of Denmark , Kemitorvet 207 , 2800 Kongens Lyngby , Denmark
Sowmya Indrakumar Department of Chemistry , Technical University of Denmark , Kemitorvet 207 , 2800 Kongens Lyngby , Denmark
Sujata Mahapatra Novozymes A/S , Krogshoejvej 36 , 2880 Bagsvaerd , Denmark
Marcello Martinez Morales Dosage Form Design and Development , AstraZeneca , Sir Aaron Klug Building, Granta Park , Cambridge CB21 6GH , U.K
Christin Pohl Novozymes A/S , Krogshoejvej 36 , 2880 Bagsvaerd , Denmark
Aisling Roche School of Chemical Engineering and Analytical Science, Manchester Institute of Biotechnology , The University of Manchester , 131 Princess Street , Manchester M1 7DN , U.K
Andreas Tosstorff Department of Pharmacy, Pharmaceutical Technology and Biopharmaceutics , Ludwig-Maximilians-Universitaet Muenchen , Butenandtstrasse 5 , 81377 Munich , Germany
Robin Curtis School of Chemical Engineering and Analytical Science, Manchester Institute of Biotechnology , The University of Manchester , 131 Princess Street , Manchester M1 7DN , U.K
Jeremy P Derrick School of Biological Sciences, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre , The University of Manchester , Oxford Road , Manchester M13 9PT , U.K
Allan Nørgaard Novozymes A/S , Krogshoejvej 36 , 2880 Bagsvaerd , Denmark
Tarik A Khan Pharmaceutical Development & Supplies, Pharma Technical Development Biologics Europe , F. Hoffmann-La Roche Ltd. , Grenzacherstrasse 124 , 4070 Basel , Switzerland
Günther H J Peters Department of Chemistry , Technical University of Denmark , Kemitorvet 207 , 2800 Kongens Lyngby , Denmark
Alain Pluen School of Chemical Engineering and Analytical Science, Manchester Institute of Biotechnology , The University of Manchester , 131 Princess Street , Manchester M1 7DN , U.K
Åsmund Rinnan Department of Food Science, Faculty of Science , Copenhagen University , Rolighedsvej 26 , 1958 Frederiksberg , Denmark
Werner W Streicher Novozymes A/S , Krogshoejvej 36 , 2880 Bagsvaerd , Denmark
Christopher F van der Walle Dosage Form Design and Development , AstraZeneca , Sir Aaron Klug Building, Granta Park , Cambridge CB21 6GH , U.K
Shahid Uddin Dosage Form Design and Development , AstraZeneca , Sir Aaron Klug Building, Granta Park , Cambridge CB21 6GH , U.K
Gerhard Winter Department of Pharmacy, Pharmaceutical Technology and Biopharmaceutics , Ludwig-Maximilians-Universitaet Muenchen , Butenandtstrasse 5 , 81377 Munich , Germany
Dierk Roessner Wyatt Technology Europe GmbH , Hochstrasse 18 , 56307 Dernbach , Germany
Pernille Harris Department of Chemistry , Technical University of Denmark , Kemitorvet 207 , 2800 Kongens Lyngby , Denmark
Wolfgang Frieß Department of Pharmacy, Pharmaceutical Technology and Biopharmaceutics , Ludwig-Maximilians-Universitaet Muenchen , Butenandtstrasse 5 , 81377 Munich , Germany

Collapse

Agüero-Chapin G, Galpert D, Molina-Ruiz R, Ancede-Gallardo E, Pérez-Machado G, De la Riva GA, Antunes A. Graph Theory-Based Sequence Descriptors as Remote Homology Predictors. Biomolecules 2019;10:E26. [PMID: 31878100 PMCID: PMC7022958 DOI: 10.3390/biom10010026] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2019] [Revised: 12/16/2019] [Accepted: 12/18/2019] [Indexed: 12/23/2022] Open

Mazurenko S, Prokop Z, Damborsky J. Machine Learning in Enzyme Engineering. ACS Catal 2019. [DOI: 10.1021/acscatal.9b04321] [Citation(s) in RCA: 134] [Impact Index Per Article: 26.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Yang Y, Ding X, Zhu G, Niroula A, Lv Q, Vihinen M. ProTstab - predictor for cellular protein stability. BMC Genomics 2019;20:804. [PMID: 31684883 PMCID: PMC6830000 DOI: 10.1186/s12864-019-6138-7] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Accepted: 09/24/2019] [Indexed: 01/10/2023] Open

Application of interpretable artificial neural networks to early monoclonal antibodies development. Eur J Pharm Biopharm 2019;141:81-89. [DOI: 10.1016/j.ejpb.2019.05.017] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2019] [Revised: 05/17/2019] [Accepted: 05/17/2019] [Indexed: 11/20/2022]

Kizhedath A, Karlberg M, Glassey J. Cross-Interaction Chromatography-Based QSAR Model for Early-Stage Screening to Facilitate Enhanced Developability of Monoclonal Antibody Therapeutics. Biotechnol J 2019;14:e1800696. [PMID: 30810283 DOI: 10.1002/biot.201800696] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2018] [Revised: 01/19/2019] [Indexed: 01/13/2023]

Sachdev K, Gupta MK. A comprehensive review of feature based methods for drug target interaction prediction. J Biomed Inform 2019;93:103159. [PMID: 30926470 DOI: 10.1016/j.jbi.2019.103159] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2018] [Revised: 03/25/2019] [Accepted: 03/26/2019] [Indexed: 12/22/2022]

Romero-Molina S, Ruiz-Blanco YB, Harms M, Münch J, Sanchez-Garcia E. PPI-Detect: A support vector machine model for sequence-based prediction of protein-protein interactions. J Comput Chem 2019;40:1233-1242. [PMID: 30768790 DOI: 10.1002/jcc.25780] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2018] [Revised: 11/29/2018] [Accepted: 12/29/2018] [Indexed: 12/18/2022]

Johnson DE. Biotherapeutics: Challenges and Opportunities for Predictive Toxicology of Monoclonal Antibodies. Int J Mol Sci 2018;19:E3685. [PMID: 30469350 PMCID: PMC6274697 DOI: 10.3390/ijms19113685] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2018] [Revised: 11/18/2018] [Accepted: 11/19/2018] [Indexed: 12/19/2022] Open

Contreras-Torres E. Predicting structural classes of proteins by incorporating their global and local physicochemical and conformational properties into general Chou's PseAAC. J Theor Biol 2018;454:139-145. [DOI: 10.1016/j.jtbi.2018.05.033] [Citation(s) in RCA: 50] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2018] [Revised: 05/23/2018] [Accepted: 05/28/2018] [Indexed: 11/24/2022]

García-Jacas CR, Cabrera-Leyva L, Marrero-Ponce Y, Suárez-Lezcano J, Cortés-Guzmán F, García-González LA. GOWAWA Aggregation Operator-based Global Molecular Characterizations: Weighting Atom/bond Contributions (LOVIs/LOEIs) According to their Influence in the Molecular Encoding. Mol Inform 2018;37:e1800039. [PMID: 30070434 DOI: 10.1002/minf.201800039] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2018] [Accepted: 07/13/2018] [Indexed: 11/11/2022]

Abstract

A different perspective to compute global weighted definitions of molecular descriptors from the contributions of each atom (LOVIs) or covalent bond (LOEIs) within a molecule is presented, using the generalized ordered weighted averaging - weighted averaging (GOWAWA) aggregation operator. This operator is rather different from the other norm-, mean- and statistic-based operators used up to date for the descriptors calculation from LOVIs/LOEIs. GOWAWA unifies the generalized ordered weighted averaging (GOWA) and the weighted generalized mean (WGM) functions and, in addition, it uses a smoothing parameter to assign different importance values to both functions depending on the problem under study. With the GOWAWA operator, diversity of novel global aggregations of molecular descriptors can be determined, where the influence that each atom (or covalent bond) has on the molecular characterization is taken into account. Therefore, this approach is completely different from the ones reported in the literature, where the values of LOVIs/LOEIs are considered equally important. To demonstrate the feasibility of using this operator, the QuBiLS-MIDAS descriptors (http://tomocomd.com/qubils-midas) were used and, as a result, a module was built into the corresponding software to compute them, being thus the only software reported in the literature that can be employed to determine weighted descriptors. Moreover, several modeling studies were performed on eight chemical datasets, which demonstrated that, with the GOWAWA aggregation operator, weighted QuBiLS-MIDAS descriptors that contribute to develop models with greater predictive power can be computed, if compared to the models based on the non-weighted descriptors calculated from the other operators used up to date. A non-parametric statistical assessment confirmed that the GOWAWA-based predictions are significantly superior to the others obtained. Therefore, all in all, it can be concluded that, from the results achieved, the GOWAWA operator constitutes a prominent alternative to codify relevant chemical information of the molecules, ultimately useful in improving the modeling ability of several old and recent descriptors whose definition is based on the LOVIs/LOEIs calculation.

Collapse

PON-tstab: Protein Variant Stability Predictor. Importance of Training Data Quality. Int J Mol Sci 2018;19:ijms19041009. [PMID: 29597263 PMCID: PMC5979465 DOI: 10.3390/ijms19041009] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2018] [Revised: 03/21/2018] [Accepted: 03/24/2018] [Indexed: 12/24/2022] Open

Dong J, Yao ZJ, Zhang L, Luo F, Lin Q, Lu AP, Chen AF, Cao DS. PyBioMed: a python library for various molecular representations of chemicals, proteins and DNAs and their interactions. J Cheminform 2018;10:16. [PMID: 29556758 PMCID: PMC5861255 DOI: 10.1186/s13321-018-0270-2] [Citation(s) in RCA: 70] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2017] [Accepted: 03/12/2018] [Indexed: 11/15/2022] Open

Abstract

Background

With the increasing development of biotechnology and informatics technology, publicly available data in chemistry and biology are undergoing explosive growth. Such wealthy information in these data needs to be extracted and transformed to useful knowledge by various data mining methods. Considering the amazing rate at which data are accumulated in chemistry and biology fields, new tools that process and interpret large and complex interaction data are increasingly important. So far, there are no suitable toolkits that can effectively link the chemical and biological space in view of molecular representation. To further explore these complex data, an integrated toolkit for various molecular representation is urgently needed which could be easily integrated with data mining algorithms to start a full data analysis pipeline.

Results

Herein, the python library PyBioMed is presented, which comprises functionalities for online download for various molecular objects by providing different IDs, the pretreatment of molecular structures, the computation of various molecular descriptors for chemicals, proteins, DNAs and their interactions. PyBioMed is a feature-rich and highly customized python library used for the characterization of various complex chemical and biological molecules and interaction samples. The current version of PyBioMed could calculate 775 chemical descriptors and 19 kinds of chemical fingerprints, 9920 protein descriptors based on protein sequences, more than 6000 DNA descriptors from nucleotide sequences, and interaction descriptors from pairwise samples using three different combining strategies. Several examples and five real-life applications were provided to clearly guide the users how to use PyBioMed as an integral part of data analysis projects. By using PyBioMed, users are able to start a full pipelining from getting molecular data, pretreating molecules, molecular representation to constructing machine learning models conveniently.

Conclusion

PyBioMed provides various user-friendly and highly customized APIs to calculate various features of biological molecules and complex interaction samples conveniently, which aims at building integrated analysis pipelines from data acquisition, data checking, and descriptor calculation to modeling. PyBioMed is freely available at http://projects.scbdd.com/pybiomed.html.

Collapse

Affiliation(s)

Jie Dong Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China.,College of Food Science and Engineering, National Engineering Laboratory for Deep Processing of Rice and Byproducts, Central South University of Forestry and Technology, Changsha, China
Zhi-Jiang Yao Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China
Lin Zhang College of Food Science and Engineering, National Engineering Laboratory for Deep Processing of Rice and Byproducts, Central South University of Forestry and Technology, Changsha, China
Feijun Luo College of Food Science and Engineering, National Engineering Laboratory for Deep Processing of Rice and Byproducts, Central South University of Forestry and Technology, Changsha, China
Qinlu Lin College of Food Science and Engineering, National Engineering Laboratory for Deep Processing of Rice and Byproducts, Central South University of Forestry and Technology, Changsha, China
Ai-Ping Lu Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong SAR, China
Alex F Chen Center for Vascular Disease and Translational Medicine, Third Xiangya Hospital, Central South University, Changsha, People's Republic of China
Dong-Sheng Cao Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China. .,Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong SAR, China. .,Center for Vascular Disease and Translational Medicine, Third Xiangya Hospital, Central South University, Changsha, People's Republic of China.

Collapse

Nath A, Kumari P, Chaube R. Prediction of Human Drug Targets and Their Interactions Using Machine Learning Methods: Current and Future Perspectives. Methods Mol Biol 2018;1762:21-30. [PMID: 29594765 DOI: 10.1007/978-1-4939-7756-7_2] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Systematic Identification of Machine-Learning Models Aimed to Classify Critical Residues for Protein Function from Protein Structure. Molecules 2017;22:molecules22101673. [PMID: 28991206 PMCID: PMC6151554 DOI: 10.3390/molecules22101673] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2017] [Revised: 09/24/2017] [Accepted: 09/24/2017] [Indexed: 12/14/2022] Open

Ruiz-Blanco YB, Agüero-Chapin G, García-Hernández E, Álvarez O, Antunes A, Green J. Exploring general-purpose protein features for distinguishing enzymes and non-enzymes within the twilight zone. BMC Bioinformatics 2017;18:349. [PMID: 28732462 PMCID: PMC5521120 DOI: 10.1186/s12859-017-1758-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2017] [Accepted: 07/13/2017] [Indexed: 11/10/2022] Open

Pan W, Chen DS, Lu YJ, Xu HW, Hao WT, Zhang YW, Qin SP, Zheng KY, Tang RX. Genetic diversity and phylogenetic analysis of EG95 sequences of Echinococcus granulosus: Implications for EG95 vaccine application. ASIAN PAC J TROP MED 2017. [PMID: 28647192 DOI: 10.1016/j.apjtm.2017.05.011] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open

Abstract

OBJECTIVE

To analyse the genetic variability of EG95 sequences and provide guidance for EG95 vaccine application against Echinococcus granulosus (E. granulosus).

METHODS

We analysed EG95 polymorphism by collecting total 97 different E. granulosus isolates from 12 different host species that originated from 10 different countries. Multiple sequence alignments and the homology were performed by Lasergene 1 (DNASTAR Inc., Madison, WI), and the phylogenetic analysis was performed by using MEGA5.1 (CEMI, Tempe, AZ, USA). In addition, linear and conformational epitopes were analysed, including secondary structure, NXT/S glycosylation, fibronectin type III (FnIII) domain and glycosylphosphatidylinositol anchor signal (GPI-anchor). The secondary structure was predicted by PSIPRED method.

RESULTS

Our results indicated that most isolates overall shared 72.6-100% identity in EG95 gene sequence with the published standard EG95 sequence, X90928. However, EG95 gene indeed has polymorphism in different isolates. Phylogenetic analysis showed that different isolates could be divided into three subgroups. Subgroup 1 contained 87 isolates while Subgroup 2 and Subgroup 3 consisted of 3 and 7 isolates, respectively. Four sequences cloned from oncosphere shared a high identity with the parental sequence of the current vaccine, X90928, and they belonged to Subgroup 1. However, in comparison to X90928, several amino acid mutations occurred in most isolates besides oncosphere, which potentially altered the immunodominant linear epitopes, glycosylation sites and secondary structures in EG95 genes. All these variations might change their previous antigenicity and thereby affecting the efficacy of current EG95 vaccine.

CONCLUSIONS

This study reveals the genetic variability of EG95 sequences in different E. granulosus isolates, and proposed that more vaccination trials would be needed to test the effectiveness of current EG95 vaccine against distinct isolates in different countries.

Collapse

Affiliation(s)

Wei Pan Jiangsu Key Laboratory of Immunity and Metabolism, Department of Pathogenic Biology and Immunology, Laboratory of Infection and Immunity, Xuzhou Medical University, Xuzhou, Jiangsu Province, 221004, PR China
De-Sheng Chen Jiangsu Key Laboratory of Immunity and Metabolism, Department of Pathogenic Biology and Immunology, Laboratory of Infection and Immunity, Xuzhou Medical University, Xuzhou, Jiangsu Province, 221004, PR China; Department of Clinical Medicine, Xuzhou Medical University, Xuzhou, Jiangsu Province, 221004, PR China
Yun-Juan Lu Jiangsu Key Laboratory of Immunity and Metabolism, Department of Pathogenic Biology and Immunology, Laboratory of Infection and Immunity, Xuzhou Medical University, Xuzhou, Jiangsu Province, 221004, PR China; Department of Clinical Medicine, Xuzhou Medical University, Xuzhou, Jiangsu Province, 221004, PR China
Hui-Wen Xu Jiangsu Key Laboratory of Immunity and Metabolism, Department of Pathogenic Biology and Immunology, Laboratory of Infection and Immunity, Xuzhou Medical University, Xuzhou, Jiangsu Province, 221004, PR China; Department of Clinical Medicine, Xuzhou Medical University, Xuzhou, Jiangsu Province, 221004, PR China
Wen-Ting Hao Jiangsu Key Laboratory of Immunity and Metabolism, Department of Pathogenic Biology and Immunology, Laboratory of Infection and Immunity, Xuzhou Medical University, Xuzhou, Jiangsu Province, 221004, PR China
Ya-Wen Zhang Jiangsu Key Laboratory of Immunity and Metabolism, Department of Pathogenic Biology and Immunology, Laboratory of Infection and Immunity, Xuzhou Medical University, Xuzhou, Jiangsu Province, 221004, PR China; Department of Clinical Medicine, Xuzhou Medical University, Xuzhou, Jiangsu Province, 221004, PR China
Su-Ping Qin Jiangsu Key Laboratory of Immunity and Metabolism, Department of Pathogenic Biology and Immunology, Laboratory of Infection and Immunity, Xuzhou Medical University, Xuzhou, Jiangsu Province, 221004, PR China
Kui-Yang Zheng Jiangsu Key Laboratory of Immunity and Metabolism, Department of Pathogenic Biology and Immunology, Laboratory of Infection and Immunity, Xuzhou Medical University, Xuzhou, Jiangsu Province, 221004, PR China
Ren-Xian Tang Jiangsu Key Laboratory of Immunity and Metabolism, Department of Pathogenic Biology and Immunology, Laboratory of Infection and Immunity, Xuzhou Medical University, Xuzhou, Jiangsu Province, 221004, PR China.

Collapse

Simeon S, Li H, Win TS, Malik AA, Kandhro AH, Piacham T, Shoombuatong W, Nuchnoi P, Wikberg JES, Gleeson MP, Nantasenamat C. PepBio: predicting the bioactivity of host defense peptides. RSC Adv 2017. [DOI: 10.1039/c7ra01388d] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open

Novel "extended sequons" of human N-glycosylation sites improve the precision of qualitative predictions: an alignment-free study of pattern recognition using ProtDCal protein features. Amino Acids 2016;49:317-325. [PMID: 27896447 DOI: 10.1007/s00726-016-2362-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2016] [Accepted: 11/05/2016] [Indexed: 10/20/2022]

Kizhedath A, Wilkinson S, Glassey J. Applicability of predictive toxicology methods for monoclonal antibody therapeutics: status Quo and scope. Arch Toxicol 2016;91:1595-1612. [PMID: 27766364 PMCID: PMC5364268 DOI: 10.1007/s00204-016-1876-7] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2016] [Accepted: 10/12/2016] [Indexed: 12/31/2022]

Abstract

Biopharmaceuticals, monoclonal antibody (mAb)-based therapeutics in particular, have positively impacted millions of lives. MAbs and related therapeutics are highly desirable from a biopharmaceutical perspective as they are highly target specific and well tolerated within the human system. Nevertheless, several mAbs have been discontinued or withdrawn based either on their inability to demonstrate efficacy and/or due to adverse effects. Approved monoclonal antibodies and derived therapeutics have been associated with adverse effects such as immunogenicity, cytokine release syndrome, progressive multifocal leukoencephalopathy, intravascular haemolysis, cardiac arrhythmias, abnormal liver function, gastrointestinal perforation, bronchospasm, intraocular inflammation, urticaria, nephritis, neuropathy, birth defects, fever and cough to name a few. The advances made in this field are also impeded by a lack of progress in bioprocess development strategies as well as increasing costs owing to attrition, wherein the lack of efficacy and safety accounts for nearly 60 % of all factors contributing to attrition. This reiterates the need for smarter preclinical development using quality by design-based approaches encompassing carefully designed predictive models during early stages of drug development. Different in vitro and in silico methods are extensively used for predicting biological activity as well as toxicity during small molecule drug development; however, their full potential has not been utilized for biological drug development. The scope of in vitro and in silico tools in early developmental stages of monoclonal antibody-based therapeutics production and how it contributes to lower attrition rates leading to faster development of potential drug candidates has been evaluated. The applicability of computational toxicology approaches in this context as well as the pitfalls and promises of extending such techniques to biopharmaceutical development has been highlighted.

Collapse

Huang BFF, Boutros PC. The parameter sensitivity of random forests. BMC Bioinformatics 2016;17:331. [PMID: 27586051 PMCID: PMC5009551 DOI: 10.1186/s12859-016-1228-x] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2015] [Accepted: 08/26/2016] [Indexed: 02/07/2023] Open

Kleandrova VV, Ruso JM, Speck-Planche A, Dias Soeiro Cordeiro MN. Enabling the Discovery and Virtual Screening of Potent and Safe Antimicrobial Peptides. Simultaneous Prediction of Antibacterial Activity and Cytotoxicity. ACS COMBINATORIAL SCIENCE 2016;18:490-8. [PMID: 27280735 DOI: 10.1021/acscombsci.6b00063] [Citation(s) in RCA: 67] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Speck-Planche A, Kleandrova VV, Ruso JM, Cordeiro MNDS. First Multitarget Chemo-Bioinformatic Model To Enable the Discovery of Antibacterial Peptides against Multiple Gram-Positive Pathogens. J Chem Inf Model 2016;56:588-98. [PMID: 26960000 DOI: 10.1021/acs.jcim.5b00630] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Global informatics and physical property selection in protein sequences. Proc Natl Acad Sci U S A 2016;113:1808-10. [PMID: 26831093 DOI: 10.1073/pnas.1525745113] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Efficient Multicriteria Protein Structure Comparison on Modern Processor Architectures. BIOMED RESEARCH INTERNATIONAL 2015;2015:563674. [PMID: 26605332 PMCID: PMC4641208 DOI: 10.1155/2015/563674] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/24/2015] [Revised: 10/04/2015] [Accepted: 10/05/2015] [Indexed: 11/18/2022]