1
|
Syryamina VN, Aisenbrey C, Kardash M, Dzuba SA, Bechinger B. Self-assembly of spin-labeled antimicrobial peptides magainin 2 and PGLa in lipid bilayers. Biophys Chem 2024; 310:107251. [PMID: 38678820 DOI: 10.1016/j.bpc.2024.107251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 04/15/2024] [Accepted: 04/21/2024] [Indexed: 05/01/2024]
Abstract
The cationic antimicrobial peptides PGLa and magainin 2 (Mag2) are known for their antimicrobial activity and synergistic enhancement in antimicrobial and membrane leakage assays. Further use of peptides in combinatory therapy requires knowledge of the mechanisms of action of both individual peptides and their mixtures. Here, electron paramagnetic resonance (EPR), double electron-electron resonance (DEER, also known as PELDOR) and electron spin echo envelope modulation (ESEEM) spectroscopies were applied to study self-assembly and localization of spin-labeled PGLa and Mag2 in POPE/POPG membranes with a wide range of peptide/lipid ratios (P/L) from ∼1/1500 to 1/50. EPR and DEER data showed that both peptides tend to organize in clusters, which occurs already at the lowest peptide/lipid molar ratio of 1/1500 (0.067 mol%). For individual peptides, these clusters are quite dense with intermolecular distances of the order of ∼2 nm. In the presence of a synergistic peptide partner, these homo-clusters are transformed into lipid-diluted hetero-clusters. These clusters are characterized by a local surface density that is several times higher than expected from a random distribution. ESEEM data indicate a slightly different insertion depth of peptides in hetero-clusters when compared to homo-clusters.
Collapse
Affiliation(s)
- Victoria N Syryamina
- Voevodsky Institute of Chemical Kinetics and Combustion, RAS, Novosibirsk 630090, Russian Federation
| | - Christopher Aisenbrey
- University of Strasbourg/CNRS, UMR7177, Strasbourg Institute of Chemistry, Membrane Biophysics and NMR, 67000 Strasbourg, France
| | - Maria Kardash
- University of Strasbourg/CNRS, UMR7177, Strasbourg Institute of Chemistry, Membrane Biophysics and NMR, 67000 Strasbourg, France
| | - Sergei A Dzuba
- Voevodsky Institute of Chemical Kinetics and Combustion, RAS, Novosibirsk 630090, Russian Federation; Department of Physics, Novosibirsk State University, 630090 Novosibirsk, Russian Federation.
| | - Burkhard Bechinger
- University of Strasbourg/CNRS, UMR7177, Strasbourg Institute of Chemistry, Membrane Biophysics and NMR, 67000 Strasbourg, France; Institut Universitaire de France, 75231 Paris, France.
| |
Collapse
|
2
|
Castillo-Mendieta K, Agüero-Chapin G, Marquez E, Perez-Castillo Y, Barigye SJ, Pérez-Cárdenas M, Peréz-Giménez F, Marrero-Ponce Y. Multiquery Similarity Searching Models: An Alternative Approach for Predicting Hemolytic Activity from Peptide Sequence. Chem Res Toxicol 2024; 37:580-589. [PMID: 38501392 DOI: 10.1021/acs.chemrestox.3c00408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/20/2024]
Abstract
The desirable pharmacological properties and a broad number of therapeutic activities have made peptides promising drugs over small organic molecules and antibody drugs. Nevertheless, toxic effects, such as hemolysis, have hampered the development of such promising drugs. Hence, a reliable computational tool to predict peptide hemolytic toxicity is enormously useful before synthesis and experimental evaluation. Currently, four web servers that predict hemolytic activity using machine learning (ML) algorithms are available; however, they exhibit some limitations, such as the need for a reliable negative set and limited application domain. Hence, we developed a robust model based on a novel theoretical approach that combines network science and a multiquery similarity searching (MQSS) method. A total of 1152 initial models were constructed from 144 scaffolds generated in a previous report. These were evaluated on external data sets, and the best models were fused and improved. Our best MQSS model I1 outperformed all state-of-the-art ML-based models and was used to characterize the prevalence of hemolytic toxicity on therapeutic peptides. Based on our model's estimation, the number of hemolytic peptides might be 3.9-fold higher than the reported.
Collapse
Affiliation(s)
- Kevin Castillo-Mendieta
- School of Biological Sciences and Engineering, Yachay Tech University, Hda. San José s/n y Proyecto Yachay, Urcuquí 100119, Ecuador
| | - Guillermin Agüero-Chapin
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, Terminal de Cruzeiros do Porto de Leixões, University of Porto, Av. General Norton de Matos s/n, 4450-208 Porto, Portugal
- Department of Biology, Faculty of Sciences, University of Porto, Rua do Campo Alegre, 4169-007 Porto, Portugal
| | - Edgar Marquez
- Grupo de Investigaciones en Química y Biología, Departamento de Química y Biología, Facultad de Ciencias Básicas, Universidad del Norte, Carrera 51B, Km 5, vía Puerto Colombia, Barranquilla 081007, Colombia
| | - Yunierkis Perez-Castillo
- Bio-Chemoinformatics Research Group and Escuela de Ciencias Físicas y Matemáticas. Universidad de Las Américas, Quito 170504, Ecuador
| | - Stephen J Barigye
- Departamento de Química Física Aplicada, Facultad de Ciencias, Universidad Autónoma de Madrid (UAM), 28049 Madrid, Spain
| | - Mariela Pérez-Cárdenas
- School of Biological Sciences and Engineering, Yachay Tech University, Hda. San José s/n y Proyecto Yachay, Urcuquí 100119, Ecuador
| | - Facundo Peréz-Giménez
- Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia, Universitat de València, Valencia 46100, Spain
| | - Yovani Marrero-Ponce
- Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia, Universitat de València, Valencia 46100, Spain
- Facultad de Ingeniería, Universidad Panamericana, Augusto Rodin No. 498, Insurgentes Mixcoac, Benito Juárez, CDMX, Mexico 03920, Mexico
- Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas; and Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Universidad San Francisco de Quito (USFQ), Quito, Pichincha 170157, Ecuador
| |
Collapse
|
3
|
Iwaniak A, Minkiewicz P, Darewicz M. Bioinformatics and bioactive peptides from foods: Do they work together? ADVANCES IN FOOD AND NUTRITION RESEARCH 2024; 108:35-111. [PMID: 38461003 DOI: 10.1016/bs.afnr.2023.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/11/2024]
Abstract
We live in the Big Data Era which affects many aspects of science, including research on bioactive peptides derived from foods, which during the last few decades have been a focus of interest for scientists. These two issues, i.e., the development of computer technologies and progress in the discovery of novel peptides with health-beneficial properties, are closely interrelated. This Chapter presents the example applications of bioinformatics for studying biopeptides, focusing on main aspects of peptide analysis as the starting point, including: (i) the role of peptide databases; (ii) aspects of bioactivity prediction; (iii) simulation of peptide release from proteins. Bioinformatics can also be used for predicting other features of peptides, including ADMET, QSAR, structure, and taste. To answer the question asked "bioinformatics and bioactive peptides from foods: do they work together?", currently it is almost impossible to find examples of peptide research with no bioinformatics involved. However, theoretical predictions are not equivalent to experimental work and always require critical scrutiny. The aspects of compatibility of in silico and in vitro results are also summarized herein.
Collapse
Affiliation(s)
- Anna Iwaniak
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn-Kortowo, Poland.
| | - Piotr Minkiewicz
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn-Kortowo, Poland
| | - Małgorzata Darewicz
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Olsztyn-Kortowo, Poland
| |
Collapse
|
4
|
Meng W, Pan H, Sha Y, Zhai X, Xing A, Lingampelly SS, Sripathi SR, Wang Y, Li K. Metabolic Connectome and Its Role in the Prediction, Diagnosis, and Treatment of Complex Diseases. Metabolites 2024; 14:93. [PMID: 38392985 PMCID: PMC10890086 DOI: 10.3390/metabo14020093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 01/17/2024] [Accepted: 01/25/2024] [Indexed: 02/25/2024] Open
Abstract
The interconnectivity of advanced biological systems is essential for their proper functioning. In modern connectomics, biological entities such as proteins, genes, RNA, DNA, and metabolites are often represented as nodes, while the physical, biochemical, or functional interactions between them are represented as edges. Among these entities, metabolites are particularly significant as they exhibit a closer relationship to an organism's phenotype compared to genes or proteins. Moreover, the metabolome has the ability to amplify small proteomic and transcriptomic changes, even those from minor genomic changes. Metabolic networks, which consist of complex systems comprising hundreds of metabolites and their interactions, play a critical role in biological research by mediating energy conversion and chemical reactions within cells. This review provides an introduction to common metabolic network models and their construction methods. It also explores the diverse applications of metabolic networks in elucidating disease mechanisms, predicting and diagnosing diseases, and facilitating drug development. Additionally, it discusses potential future directions for research in metabolic networks. Ultimately, this review serves as a valuable reference for researchers interested in metabolic network modeling, analysis, and their applications.
Collapse
Affiliation(s)
- Weiyu Meng
- Center for Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macau SAR 999078, China
| | - Hongxin Pan
- Center for Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macau SAR 999078, China
| | - Yuyang Sha
- Center for Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macau SAR 999078, China
| | - Xiaobing Zhai
- Center for Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macau SAR 999078, China
| | - Abao Xing
- Center for Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macau SAR 999078, China
| | | | - Srinivasa R Sripathi
- Henderson Ocular Stem Cell Laboratory, Retina Foundation of the Southwest, Dallas, TX 75231, USA
| | - Yuefei Wang
- National Key Laboratory of Chinese Medicine Modernization, State Key Laboratory of Component-Based Chinese Medicine, Tianjin University of Traditional Chinese Medicine, Tianjin 301617, China
- Haihe Laboratory of Modern Chinese Medicine, Tianjin 301617, China
| | - Kefeng Li
- Center for Artificial Intelligence Driven Drug Discovery, Faculty of Applied Sciences, Macao Polytechnic University, Macau SAR 999078, China
| |
Collapse
|
5
|
Yuan Y, Shi C, Zhao H. Machine Learning-Enabled Genome Mining and Bioactivity Prediction of Natural Products. ACS Synth Biol 2023; 12:2650-2662. [PMID: 37607352 PMCID: PMC10615616 DOI: 10.1021/acssynbio.3c00234] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
Natural products (NPs) produced by microorganisms and plants are a major source of drugs, herbicides, and fungicides. Thanks to recent advances in DNA sequencing, bioinformatics, and genome mining tools, a vast amount of data on NP biosynthesis has been generated over the years, which has been increasingly exploited to develop machine learning (ML) tools for NP discovery. In this review, we discuss the latest advances in developing and applying ML tools for exploring the potential NPs that can be encoded by genomic language and predicting the types of bioactivities of NPs. We also examine the technical challenges associated with the development and application of ML tools for NP research.
Collapse
Affiliation(s)
- Yujie Yuan
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Chengyou Shi
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | - Huimin Zhao
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
- Departments of Chemistry, Biochemistry, and Bioengineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| |
Collapse
|
6
|
Singh V, Singh SK. A separable temporal convolutional networks based deep learning technique for discovering antiviral medicines. Sci Rep 2023; 13:13722. [PMID: 37608092 PMCID: PMC10444765 DOI: 10.1038/s41598-023-40922-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2023] [Accepted: 08/18/2023] [Indexed: 08/24/2023] Open
Abstract
An alarming number of fatalities caused by the COVID-19 pandemic has forced the scientific community to accelerate the process of therapeutic drug discovery. In this regard, the collaboration between biomedical scientists and experts in artificial intelligence (AI) has led to a number of in silico tools being developed for the initial screening of therapeutic molecules. All living organisms produce antiviral peptides (AVPs) as a part of their first line of defense against invading viruses. The Deep-AVPiden model proposed in this paper and its corresponding web app, deployed at https://deep-avpiden.anvil.app , is an effort toward discovering novel AVPs in proteomes of living organisms. Apart from Deep-AVPiden, a computationally efficient model called Deep-AVPiden (DS) has also been developed using the same underlying network but with point-wise separable convolutions. The Deep-AVPiden and Deep-AVPiden (DS) models show an accuracy of 90% and 88%, respectively, and both have a precision of 90%. Also, the proposed models were statistically compared using the Student's t-test. On comparing the proposed models with the state-of-the-art classifiers, it was found that they are much better than them. To test the proposed model, we identified some AVPs in the natural defense proteins of plants, mammals, and fishes and found them to have appreciable sequence similarity with some experimentally validated antimicrobial peptides. These AVPs can be chemically synthesized and tested for their antiviral activity.
Collapse
Affiliation(s)
- Vishakha Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU) Varanasi, Varanasi, Uttar Pradesh, 221005, India.
| | - Sanjay Kumar Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU) Varanasi, Varanasi, Uttar Pradesh, 221005, India.
| |
Collapse
|
7
|
Aguilera-Mendoza L, Ayala-Ruano S, Martinez-Rios F, Chavez E, García-Jacas CR, Brizuela CA, Marrero-Ponce Y. StarPep Toolbox: an open-source software to assist chemical space analysis of bioactive peptides and their functions using complex networks. Bioinformatics 2023; 39:btad506. [PMID: 37603724 PMCID: PMC10469104 DOI: 10.1093/bioinformatics/btad506] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Revised: 07/24/2023] [Accepted: 08/18/2023] [Indexed: 08/23/2023] Open
Abstract
MOTIVATION Antimicrobial peptides (AMPs) are promising molecules to treat infectious diseases caused by multi-drug resistance pathogens, some types of cancer, and other conditions. Computer-aided strategies are efficient tools for the high-throughput screening of AMPs. RESULTS This report highlights StarPep Toolbox, an open-source and user-friendly software to study the bioactive chemical space of AMPs using complex network-based representations, clustering, and similarity-searching models. The novelty of this research lies in the combination of network science and similarity-searching techniques, distinguishing it from conventional methods based on machine learning and other computational approaches. The network-based representation of the AMP chemical space presents promising opportunities for peptide drug repurposing, development, and optimization. This approach could serve as a baseline for the discovery of a new generation of therapeutics peptides. AVAILABILITY AND IMPLEMENTATION All underlying code and installation files are accessible through GitHub (https://github.com/Grupo-Medicina-Molecular-y-Traslacional/StarPep) under the Apache 2.0 license.
Collapse
Affiliation(s)
- Longendri Aguilera-Mendoza
- Grupo de Medicina Molecular y Translacional (MeM&T), Facultad de Medicina, Universidad San Francisco de Quito (USFQ), Quito, Ecuador
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), Ensenada, Baja California 22860, México
| | - Sebastián Ayala-Ruano
- Grupo de Medicina Molecular y Translacional (MeM&T), Facultad de Medicina, Universidad San Francisco de Quito (USFQ), Quito, Ecuador
| | - Felix Martinez-Rios
- Facultad de Ingeniería, Universidad Panamericana, CDMX, Benito Juárez 03920, México
| | - Edgar Chavez
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), Ensenada, Baja California 22860, México
| | - César R García-Jacas
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), Ensenada, Baja California 22860, México
- Cátedras CONAHCYT - Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), Ensenada, Baja California 22860, México
| | - Carlos A Brizuela
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), Ensenada, Baja California 22860, México
| | - Yovani Marrero-Ponce
- Grupo de Medicina Molecular y Translacional (MeM&T), Facultad de Medicina, Universidad San Francisco de Quito (USFQ), Quito, Ecuador
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), Ensenada, Baja California 22860, México
| |
Collapse
|
8
|
Agüero-Chapin G, Antunes A, Mora JR, Pérez N, Contreras-Torres E, Valdes-Martini JR, Martinez-Rios F, Zambrano CH, Marrero-Ponce Y. Complex Networks Analyses of Antibiofilm Peptides: An Emerging Tool for Next-Generation Antimicrobials' Discovery. Antibiotics (Basel) 2023; 12:antibiotics12040747. [PMID: 37107109 PMCID: PMC10135022 DOI: 10.3390/antibiotics12040747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Revised: 04/04/2023] [Accepted: 04/11/2023] [Indexed: 04/29/2023] Open
Abstract
Microbial biofilms cause several environmental and industrial issues, even affecting human health. Although they have long represented a threat due to their resistance to antibiotics, there are currently no approved antibiofilm agents for clinical treatments. The multi-functionality of antimicrobial peptides (AMPs), including their antibiofilm activity and their potential to target multiple microbes, has motivated the synthesis of AMPs and their relatives for developing antibiofilm agents for clinical purposes. Antibiofilm peptides (ABFPs) have been organized in databases that have allowed the building of prediction tools which have assisted in the discovery/design of new antibiofilm agents. However, the complex network approach has not yet been explored as an assistant tool for this aim. Herein, a kind of similarity network called the half-space proximal network (HSPN) is applied to represent/analyze the chemical space of ABFPs, aiming to identify privileged scaffolds for the development of next-generation antimicrobials that are able to target both planktonic and biofilm microbial forms. Such analyses also considered the metadata associated with the ABFPs, such as origin, other activities, targets, etc., in which the relationships were projected by multilayer networks called metadata networks (METNs). From the complex networks' mining, a reduced but informative set of 66 ABFPs was extracted, representing the original antibiofilm space. This subset contained the most central to atypical ABFPs, some of them having the desired properties for developing next-generation antimicrobials. Therefore, this subset is advisable for assisting the search for/design of both new antibiofilms and antimicrobial agents. The provided ABFP motifs list, discovered within the HSPN communities, is also useful for the same purpose.
Collapse
Affiliation(s)
- Guillermin Agüero-Chapin
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, 4450-208 Porto, Portugal
- Department of Biology, Faculty of Sciences, University of Porto, 4169-007 Porto, Portugal
| | - Agostinho Antunes
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, 4450-208 Porto, Portugal
- Department of Biology, Faculty of Sciences, University of Porto, 4169-007 Porto, Portugal
| | - José R Mora
- Universidad San Francisco de Quito (USFQ), Colegio de Ciencias e Ingenierías "El Politécnico", Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Pichincha, Ecuador
| | - Noel Pérez
- Universidad San Francisco de Quito (USFQ), Colegio de Ciencias e Ingenierías "El Politécnico", Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Pichincha, Ecuador
| | - Ernesto Contreras-Torres
- Universidad San Francisco de Quito (USFQ), Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas and Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Pichincha, Ecuador
| | | | - Felix Martinez-Rios
- Facultad de Ingeniería, Universidad Panamericana, Augusto Rodin No. 498, Insurgentes Mixcoac, Benito Juárez, Ciudad de México 03920, Mexico
| | - Cesar H Zambrano
- Universidad San Francisco de Quito (USFQ), Colegio de Ciencias e Ingenierías "El Politécnico", Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Pichincha, Ecuador
| | - Yovani Marrero-Ponce
- Universidad San Francisco de Quito (USFQ), Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas and Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Pichincha, Ecuador
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), Ensenada 22860, Baja California, Mexico
| |
Collapse
|
9
|
Ayala-Ruano S, Marrero-Ponce Y, Aguilera-Mendoza L, Pérez N, Agüero-Chapin G, Antunes A, Aguilar AC. Network Science and Group Fusion Similarity-Based Searching to Explore the Chemical Space of Antiparasitic Peptides. ACS OMEGA 2022; 7:46012-46036. [PMID: 36570318 PMCID: PMC9773354 DOI: 10.1021/acsomega.2c03398] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Accepted: 11/21/2022] [Indexed: 05/13/2023]
Abstract
Antimicrobial peptides (AMPs) have appeared as promising compounds to treat a wide range of diseases. Their clinical potentialities reside in the wide range of mechanisms they can use for both killing microbes and modulating immune responses. However, the hugeness of the AMPs' chemical space (AMPCS), represented by more than 1065 unique sequences, has represented a big challenge for the discovery of new promising therapeutic peptides and for the identification of common structural motifs. Here, we introduce network science and a similarity searching approach to discover new promising AMPs, specifically antiparasitic peptides (APPs). We exploited the network-based representation of APPs' chemical space (APPCS) to retrieve valuable information by using three network types: chemical space (CSN), half-space proximal (HSPN), and metadata (METN). Some centrality measures were applied to identify in each network the most important and nonredundant peptides. Then, these central peptides were considered as queries (Qs) in group fusion similarity-based searches against a comprehensive collection of known AMPs, stored in the graph database StarPepDB, to propose new potential APPs. The performance of the resulting multiquery similarity-based search models (mQSSMs) was evaluated in five benchmarking data sets of APP/non-APPs. The predictions performed by the best mQSSM showed a strong-to-very-strong performance since their external Matthews correlation coefficient (MCC) values ranged from 0.834 to 0.965. Outstanding MCC values (>0.85) were attained by the mQSSM with 219 Qs from both networks CSN and HSPN with 0.5 as similarity threshold in external data sets. Then, the performance of our best mQSSM was compared with the APPs prediction servers AMPDiscover and AMPFun. The proposed model showed its relevance by outperforming state-of-the-art machine learning models to predict APPs. After applying the best mQSSM and additional filters on the non-APP space from StarPepDB, 95 AMPs were repurposed as potential APP hits. Due to the high sequence diversity of these peptides, different computational approaches were applied to identify relevant motifs for searching and designing new APPs. Lastly, we identified 11 promising APP lead candidates by using our best mQSSMs together with diversity-based network analyses, and 24 web servers for activity/toxicity and drug-like properties. These results support that network-based similarity searches can be an effective and reliable strategy to identify APPs. The proposed models and pipeline are freely available through the StarPep toolbox software at http://mobiosd-hub.com/starpep.
Collapse
Affiliation(s)
- Sebastián Ayala-Ruano
- Grupo
de Medicina Molecular y Traslacional (MeM&T), Escuela de Medicina,
Colegio de Ciencias de la Salud (COCSA), Universidad San Francisco de Quito, Av. Interoceánica Km 12 1/2 y Av. Florencia, Quito 17-1200-841, Ecuador
- Colegio
de Ciencias e Ingenierías “El Politécnico”, Universidad San Francisco de Quito (USFQ), Quito 170901, Ecuador
| | - Yovani Marrero-Ponce
- Grupo
de Medicina Molecular y Traslacional (MeM&T), Escuela de Medicina,
Colegio de Ciencias de la Salud (COCSA), Universidad San Francisco de Quito, Av. Interoceánica Km 12 1/2 y Av. Florencia, Quito 17-1200-841, Ecuador
- Computer-Aided
Molecular “Biosilico” Discovery and Bioinformatics Research
International Network (CAMD-BIR IN), Cumbayá, Quito 170901, Ecuador
- Universidad
San Francisco de Quito (USFQ), Instituto
de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Pichincha, Ecuador
- Departamento
de Ciencias de la Computación, Centro
de Investigación Científica y de Educación Superior
de Ensenada (CICESE), Baja California 22860, Mexico
- or . Phone: +593-2-297-1700 (ext. 4021). http://www.uv.es/yoma/ or http://ymponce.googlepages.com/home
| | - Longendri Aguilera-Mendoza
- Departamento
de Ciencias de la Computación, Centro
de Investigación Científica y de Educación Superior
de Ensenada (CICESE), Baja California 22860, Mexico
| | - Noel Pérez
- Colegio
de Ciencias e Ingenierías “El Politécnico”, Universidad San Francisco de Quito (USFQ), Quito 170901, Ecuador
| | - Guillermin Agüero-Chapin
- CIIMAR/CIMAR,
Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton
de Matos s/n, 4450-208 Porto, Portugal
- Department
of Biology, Faculty of Sciences, University
of Porto, Rua do Campo
Alegre, 4169-007 Porto, Portugal
| | - Agostinho Antunes
- CIIMAR/CIMAR,
Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton
de Matos s/n, 4450-208 Porto, Portugal
- Department
of Biology, Faculty of Sciences, University
of Porto, Rua do Campo
Alegre, 4169-007 Porto, Portugal
| | - Ana Cristina Aguilar
- Grupo
de Medicina Molecular y Traslacional (MeM&T), Escuela de Medicina,
Colegio de Ciencias de la Salud (COCSA), Universidad San Francisco de Quito, Av. Interoceánica Km 12 1/2 y Av. Florencia, Quito 17-1200-841, Ecuador
| |
Collapse
|
10
|
ABP-Finder: A Tool to Identify Antibacterial Peptides and the Gram-Staining Type of Targeted Bacteria. Antibiotics (Basel) 2022; 11:antibiotics11121708. [PMID: 36551365 PMCID: PMC9774453 DOI: 10.3390/antibiotics11121708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 11/16/2022] [Accepted: 11/17/2022] [Indexed: 11/29/2022] Open
Abstract
Multi-drug resistance in bacteria is a major health problem worldwide. To overcome this issue, new approaches allowing for the identification and development of antibacterial agents are urgently needed. Peptides, due to their binding specificity and low expected side effects, are promising candidates for a new generation of antibiotics. For over two decades, a large diversity of antimicrobial peptides (AMPs) has been discovered and annotated in public databases. The AMP family encompasses nearly 20 biological functions, thus representing a potentially valuable resource for data mining analyses. Nonetheless, despite the availability of machine learning-based approaches focused on AMPs, these tools lack evidence of successful application for AMPs' discovery, and many are not designed to predict a specific function for putative AMPs, such as antibacterial activity. Consequently, among the apparent variety of data mining methods to screen peptide sequences for antibacterial activity, only few tools can deal with such task consistently, although with limited precision and generally no information about the possible targets. Here, we addressed this gap by introducing a tool specifically designed to identify antibacterial peptides (ABPs) with an estimation of which type of bacteria is susceptible to the action of these peptides, according to their response to the Gram-staining assay. Our tool is freely available via a web server named ABP-Finder. This new method ranks within the top state-of-the-art ABP predictors, particularly in terms of precision. Importantly, we showed the successful application of ABP-Finder for the screening of a large peptide library from the human urine peptidome and the identification of an antibacterial peptide.
Collapse
|
11
|
García-Jacas CR, García-González LA, Martinez-Rios F, Tapia-Contreras IP, Brizuela CA. Handcrafted versus non-handcrafted (self-supervised) features for the classification of antimicrobial peptides: complementary or redundant? Brief Bioinform 2022; 23:6754757. [PMID: 36215083 DOI: 10.1093/bib/bbac428] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 08/28/2022] [Accepted: 09/02/2022] [Indexed: 12/14/2022] Open
Abstract
Antimicrobial peptides (AMPs) have received a great deal of attention given their potential to become a plausible option to fight multi-drug resistant bacteria as well as other pathogens. Quantitative sequence-activity models (QSAMs) have been helpful to discover new AMPs because they allow to explore a large universe of peptide sequences and help reduce the number of wet lab experiments. A main aspect in the building of QSAMs based on shallow learning is to determine an optimal set of protein descriptors (features) required to discriminate between sequences with different antimicrobial activities. These features are generally handcrafted from peptide sequence datasets that are labeled with specific antimicrobial activities. However, recent developments have shown that unsupervised approaches can be used to determine features that outperform human-engineered (handcrafted) features. Thus, knowing which of these two approaches contribute to a better classification of AMPs, it is a fundamental question in order to design more accurate models. Here, we present a systematic and rigorous study to compare both types of features. Experimental outcomes show that non-handcrafted features lead to achieve better performances than handcrafted features. However, the experiments also prove that an improvement in performance is achieved when both types of features are merged. A relevance analysis reveals that non-handcrafted features have higher information content than handcrafted features, while an interaction-based importance analysis reveals that handcrafted features are more important. These findings suggest that there is complementarity between both types of features. Comparisons regarding state-of-the-art deep models show that shallow models yield better performances both when fed with non-handcrafted features alone and when fed with non-handcrafted and handcrafted features together.
Collapse
Affiliation(s)
- César R García-Jacas
- Cátedras CONACYT - Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | - Luis A García-González
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | | | - Issac P Tapia-Contreras
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | - Carlos A Brizuela
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| |
Collapse
|
12
|
Rezaie H, Asadollahi-Baboli M, Hassaninejad-Darzi SK. Hybrid consensus and k-nearest neighbours (kNN) strategies to classify dual BRD4/PLK1 inhibitors. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2022; 33:779-792. [PMID: 36330747 DOI: 10.1080/1062936x.2022.2139292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Accepted: 10/17/2022] [Indexed: 06/16/2023]
Abstract
A novel decision-making procedure is proposed here for the first time to identify active/inactive and selective/non-selective dual inhibitors using consensus approaches and pools of k-nearest neighbours (kNN) classifications instead of individual models. Dual BRD4/PLK1 inhibition with adequate selectivity is a potential therapeutic strategy for targeting tumour cells in high-risk patients. We report the unique way to identify both active and selective dual BRD4/PLK1 inhibitors using consensus and kNN strategies together with two sources of receptor-based and ligand-based information which are the ranked binding energies of residues and important molecular features, respectively. The results of consensus approaches were compared with the results of individual kNN models. The chemical space similarity was measured using three different distance functions to increase the reliability. All activity and selectivity classification models were validated using cross-validation and y-randomization tests. The outcomes show that consensus approaches can increase the reliability and accuracy of active/inactive or selective/non-selective detections up to 90%. Consensus approaches also reached more balanced values of sensitivity and specificity compared to the individual kNN models because of the compensation in the integration of diverse sources of information.
Collapse
Affiliation(s)
- H Rezaie
- Department of Chemistry, Faculty of Science, Babol Noshirvani University of Technology, Babol, Iran
| | - M Asadollahi-Baboli
- Department of Chemistry, Faculty of Science, Babol Noshirvani University of Technology, Babol, Iran
| | - S K Hassaninejad-Darzi
- Department of Chemistry, Faculty of Science, Babol Noshirvani University of Technology, Babol, Iran
| |
Collapse
|
13
|
Agüero-Chapin G, Galpert-Cañizares D, Domínguez-Pérez D, Marrero-Ponce Y, Pérez-Machado G, Teijeira M, Antunes A. Emerging Computational Approaches for Antimicrobial Peptide Discovery. Antibiotics (Basel) 2022; 11:antibiotics11070936. [PMID: 35884190 PMCID: PMC9311958 DOI: 10.3390/antibiotics11070936] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2022] [Revised: 07/01/2022] [Accepted: 07/08/2022] [Indexed: 02/05/2023] Open
Abstract
In the last two decades many reports have addressed the application of artificial intelligence (AI) in the search and design of antimicrobial peptides (AMPs). AI has been represented by machine learning (ML) algorithms that use sequence-based features for the discovery of new peptidic scaffolds with promising biological activity. From AI perspective, evolutionary algorithms have been also applied to the rational generation of peptide libraries aimed at the optimization/design of AMPs. However, the literature has scarcely dedicated to other emerging non-conventional in silico approaches for the search/design of such bioactive peptides. Thus, the first motivation here is to bring up some non-standard peptide features that have been used to build classical ML predictive models. Secondly, it is valuable to highlight emerging ML algorithms and alternative computational tools to predict/design AMPs as well as to explore their chemical space. Another point worthy of mention is the recent application of evolutionary algorithms that actually simulate sequence evolution to both the generation of diversity-oriented peptide libraries and the optimization of hit peptides. Last but not least, included here some new considerations in proteogenomic analyses currently incorporated into the computational workflow for unravelling AMPs in natural sources.
Collapse
Affiliation(s)
- Guillermin Agüero-Chapin
- CIIMAR—Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal;
- Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 4169-007 Porto, Portugal
- Correspondence: (G.A.-C.); (A.A.); Tel.: +351-22-340-1813 (G.A.-C. & A.A.)
| | - Deborah Galpert-Cañizares
- Departamento de Ciencia de la Computación, Universidad Central Marta Abreu de Las Villas (UCLV), Santa Clara 54830, Cuba;
| | - Dany Domínguez-Pérez
- CIIMAR—Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal;
- Proquinorte, Unipessoal, Lda, Avenida 5 de Outubro, 124, 7º Piso, Avenidas Novas, 1050-061 Lisboa, Portugal
| | - Yovani Marrero-Ponce
- Universidad San Francisco de Quito (USFQ), Grupo de Medicina Molecular y Translacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas and Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Ecuador;
| | - Gisselle Pérez-Machado
- EpiDisease S.L—Spin-Off of Centro de Investigación Biomédica en Red de Enfermedades Raras (CIBERER), 46980 Valencia, Spain;
| | - Marta Teijeira
- Departamento de Química Orgánica, Facultade de Química, Universidade de Vigo, 36310 Vigo, Spain;
- Instituto de Investigación Sanitaria Galicia Sur, Hospital Álvaro Cunqueiro, 36213 Vigo, Spain
| | - Agostinho Antunes
- CIIMAR—Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Av. General Norton de Matos, s/n, 4450-208 Porto, Portugal;
- Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 4169-007 Porto, Portugal
- Correspondence: (G.A.-C.); (A.A.); Tel.: +351-22-340-1813 (G.A.-C. & A.A.)
| |
Collapse
|
14
|
A Novel Network Science and Similarity-Searching-Based Approach for Discovering Potential Tumor-Homing Peptides from Antimicrobials. Antibiotics (Basel) 2022; 11:antibiotics11030401. [PMID: 35326864 PMCID: PMC8944733 DOI: 10.3390/antibiotics11030401] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2022] [Revised: 03/13/2022] [Accepted: 03/15/2022] [Indexed: 02/01/2023] Open
Abstract
Peptide-based drugs are promising anticancer candidates due to their biocompatibility and low toxicity. In particular, tumor-homing peptides (THPs) have the ability to bind specifically to cancer cell receptors and tumor vasculature. Despite their potential to develop antitumor drugs, there are few available prediction tools to assist the discovery of new THPs. Two webservers based on machine learning models are currently active, the TumorHPD and the THPep, and more recently the SCMTHP. Herein, a novel method based on network science and similarity searching implemented in the starPep toolbox is presented for THP discovery. The approach leverages from exploring the structural space of THPs with Chemical Space Networks (CSNs) and from applying centrality measures to identify the most relevant and non-redundant THP sequences within the CSN. Such THPs were considered as queries (Qs) for multi-query similarity searches that apply a group fusion (MAX-SIM rule) model. The resulting multi-query similarity searching models (SSMs) were validated with three benchmarking datasets of THPs/non-THPs. The predictions achieved accuracies that ranged from 92.64 to 99.18% and Matthews Correlation Coefficients between 0.894–0.98, outperforming state-of-the-art predictors. The best model was applied to repurpose AMPs from the starPep database as THPs, which were subsequently optimized for the TH activity. Finally, 54 promising THP leads were discovered, and their sequences were analyzed to encounter novel motifs. These results demonstrate the potential of CSNs and multi-query similarity searching for the rapid and accurate identification of THPs.
Collapse
|
15
|
Singh V, Shrivastava S, Kumar Singh S, Kumar A, Saxena S. Accelerating the discovery of antifungal peptides using deep temporal convolutional networks. Brief Bioinform 2022; 23:6526725. [DOI: 10.1093/bib/bbac008] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Revised: 12/27/2021] [Accepted: 01/06/2022] [Indexed: 02/02/2023] Open
Abstract
Abstract
The application of machine intelligence in biological sciences has led to the development of several automated tools, thus enabling rapid drug discovery. Adding to this development is the ongoing COVID-19 pandemic, due to which researchers working in the field of artificial intelligence have acquired an active interest in finding machine learning-guided solutions for diseases like mucormycosis, which has emerged as an important post-COVID-19 fungal complication, especially in immunocompromised patients. On these lines, we have proposed a temporal convolutional network-based binary classification approach to discover new antifungal molecules in the proteome of plants and animals to accelerate the development of antifungal medications. Although these biomolecules, known as antifungal peptides (AFPs), are part of an organism’s intrinsic host defense mechanism, their identification and discovery by traditional biochemical procedures is arduous. Also, the absence of a large dataset on AFPs is also a considerable impediment in building a robust automated classifier. To this end, we have employed the transfer learning technique to pre-train our model on antibacterial peptides. Subsequently, we have built a classifier that predicts AFPs with accuracy and precision of 94%. Our classifier outperforms several state-of-the-art models by a considerable margin. The results of its performance were proven as statistically significant using the Kruskal–Wallis H test, followed by a post hoc analysis performed using the Tukey honestly significant difference (HSD) test. Furthermore, we identified potent AFPs in representative animal (Histatin) and plant (Snakin) proteins using our model. We also built and deployed a web app that is freely available at https://tcn-afppred.anvil.app/ for the identification of AFPs in protein sequences.
Collapse
Affiliation(s)
- Vishakha Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sameer Shrivastava
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| | - Sanjay Kumar Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Abhinav Kumar
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sonal Saxena
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| |
Collapse
|
16
|
Instance-based learning using the Half-Space Proximal Graph. Pattern Recognit Lett 2022. [DOI: 10.1016/j.patrec.2022.01.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
17
|
Singh V, Shrivastava S, Kumar Singh S, Kumar A, Saxena S. StaBle-ABPpred: a stacked ensemble predictor based on biLSTM and attention mechanism for accelerated discovery of antibacterial peptides. Brief Bioinform 2021; 23:6423526. [PMID: 34750606 DOI: 10.1093/bib/bbab439] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Revised: 09/06/2021] [Accepted: 09/24/2021] [Indexed: 01/29/2023] Open
Abstract
Due to the rapid emergence of multi-drug resistant (MDR) bacteria, existing antibiotics are becoming ineffective. So, researchers are looking for alternatives in the form of antibacterial peptides (ABPs) based medicines. The discovery of novel ABPs using wet-lab experiments is time-consuming and expensive. Many machine learning models have been proposed to search for new ABPs, but there is still scope to develop a robust model that has high accuracy and precision. In this work, we present StaBle-ABPpred, a stacked ensemble technique-based deep learning classifier that uses bidirectional long-short term memory (biLSTM) and attention mechanism at base-level and an ensemble of random forest, gradient boosting and logistic regression at meta-level to classify peptides as antibacterial or otherwise. The performance of our model has been compared with several state-of-the-art classifiers, and results were subjected to analysis of variance (ANOVA) test and its post hoc analysis, which proves that our model performs better than existing classifiers. Furthermore, a web app has been developed and deployed at https://stable-abppred.anvil.app to identify novel ABPs in protein sequences. Using this app, we identified novel ABPs in all the proteins of the Streptococcus phage T12 genome. These ABPs have shown amino acid similarities with experimentally tested antimicrobial peptides (AMPs) of other organisms. Hence, they could be chemically synthesized and experimentally validated for their activity against different bacteria. The model and app developed in this work can be further utilized to explore the protein diversity for identifying novel ABPs with broad-spectrum activity, especially against MDR bacterial pathogens.
Collapse
Affiliation(s)
- Vishakha Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sameer Shrivastava
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| | - Sanjay Kumar Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Abhinav Kumar
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sonal Saxena
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| |
Collapse
|
18
|
Sharma R, Shrivastava S, Kumar Singh S, Kumar A, Saxena S, Kumar Singh R. Deep-AFPpred: identifying novel antifungal peptides using pretrained embeddings from seq2vec with 1DCNN-BiLSTM. Brief Bioinform 2021; 23:6404058. [PMID: 34670278 DOI: 10.1093/bib/bbab422] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Revised: 09/03/2021] [Accepted: 09/12/2021] [Indexed: 01/04/2023] Open
Abstract
Fungal infections or mycosis cause a wide range of diseases in humans and animals. The incidences of community acquired; nosocomial fungal infections have increased dramatically after the emergence of COVID-19 pandemic. The increase in number of patients with immunodeficiency / immunosuppression related diseases, resistance to existing antifungal compounds and availability of limited therapeutic options has triggered the search for alternative antifungal molecules. In this direction, antifungal peptides (AFPs) have received a lot of interest as an alternative to currently available antifungal drugs. Although the AFPs are produced by diverse population of living organisms, identifying effective AFPs from natural sources is time-consuming and expensive. Therefore, there is a need to develop a robust in silico model capable of identifying novel AFPs in protein sequences. In this paper, we propose Deep-AFPpred, a deep learning classifier that can identify AFPs in protein sequences. We developed Deep-AFPpred using the concept of transfer learning with 1DCNN-BiLSTM deep learning algorithm. The findings reveal that Deep-AFPpred beats other state-of-the-art AFP classifiers by a wide margin and achieved approximately 96% and 94% precision on validation and test data, respectively. Based on the proposed approach, an online prediction server is created and made publicly available at https://afppred.anvil.app/. Using this server, one can identify novel AFPs in protein sequences and the results are provided as a report that includes predicted peptides, their physicochemical properties and motifs. By utilizing this model, we identified AFPs in different proteins, which can be chemically synthesized in lab and experimentally validated for their antifungal activity.
Collapse
Affiliation(s)
- Ritesh Sharma
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sameer Shrivastava
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| | - Sanjay Kumar Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Abhinav Kumar
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sonal Saxena
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| | - Raj Kumar Singh
- Former Director & Vice Chancellor, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| |
Collapse
|
19
|
Sharma R, Shrivastava S, Kumar Singh S, Kumar A, Saxena S, Kumar Singh R. AniAMPpred: artificial intelligence guided discovery of novel antimicrobial peptides in animal kingdom. Brief Bioinform 2021; 22:6320952. [PMID: 34259329 DOI: 10.1093/bib/bbab242] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2021] [Revised: 06/02/2021] [Accepted: 06/21/2021] [Indexed: 12/12/2022] Open
Abstract
With advancements in genomics, there has been substantial reduction in the cost and time of genome sequencing and has resulted in lot of data in genome databases. Antimicrobial host defense proteins provide protection against invading microbes. But confirming the antimicrobial function of host proteins by wet-lab experiments is expensive and time consuming. Therefore, there is a need to develop an in silico tool to identify the antimicrobial function of proteins. In the current study, we developed a model AniAMPpred by considering all the available antimicrobial peptides (AMPs) of length $\in $[10 200] from the animal kingdom. The model utilizes a support vector machine algorithm with deep learning-based features and identifies probable antimicrobial proteins (PAPs) in the genome of animals. The results show that our proposed model outperforms other state-of-the-art classifiers, has very high confidence in its predictions, is not biased and can classify both AMPs and non-AMPs for a diverse peptide length with high accuracy. By utilizing AniAMPpred, we identified 436 PAPs in the genome of Helobdella robusta. To further confirm the functional activity of PAPs, we performed BLAST analysis against known AMPs. On detailed analysis of five selected PAPs, we could observe their similarity with antimicrobial proteins of several animal species. Thus, our proposed model can help the researchers identify PAPs in the genome of animals and provide insight into the functional identity of different proteins. An online prediction server is also developed based on the proposed approach, which is freely accessible at https://aniamppred.anvil.app/.
Collapse
Affiliation(s)
- Ritesh Sharma
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sameer Shrivastava
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| | - Sanjay Kumar Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Abhinav Kumar
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sonal Saxena
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| | - Raj Kumar Singh
- Former Director & Vice Chancellor, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| |
Collapse
|
20
|
Pinacho-Castellanos SA, García-Jacas CR, Gilson MK, Brizuela CA. Alignment-Free Antimicrobial Peptide Predictors: Improving Performance by a Thorough Analysis of the Largest Available Data Set. J Chem Inf Model 2021; 61:3141-3157. [PMID: 34081438 DOI: 10.1021/acs.jcim.1c00251] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
In the last two decades, a large number of machine-learning-based predictors for the activities of antimicrobial peptides (AMPs) have been proposed. These predictors differ from one another in the learning method and in the training and testing data sets used. Unfortunately, the training data sets present several drawbacks, such as a low representativeness regarding the experimentally validated AMP space, and duplicated peptide sequences between negative and positive data sets. These limitations give a low confidence to most of the approaches to be used in prospective studies. To address these weaknesses, we propose novel modeling and assessing data sets from the largest experimentally validated nonredundant peptide data set reported to date. From these novel data sets, alignment-free quantitative sequence-activity models (AF-QSAMs) based on Random Forest are created to identify general AMPs and their antibacterial, antifungal, antiparasitic, and antiviral functional types. An applicability domain analysis is carried out to determine the reliability of the predictions obtained, which, to the best of our knowledge, is performed for the first time for AMP recognition. A benchmarking is undertaken between the models proposed and several models from the literature that are freely available in 13 programs (ClassAMP, iAMP-2L, ADAM, MLAMP, AMPScanner v2.0, AntiFP, AMPfun, PEPred-suite, AxPEP, CAMPR3, iAMPpred, APIN, and Meta-iAVP). The models proposed are those with the best performance in all of the endpoints modeled, while most of the methods from the literature have weak-to-random predictive agreements. The models proposed are also assessed through Y-scrambling and repeated k-fold cross-validation tests, demonstrating that the outcomes obtained by them are not given by chance. Three chemometric analyses also confirmed the relevance of the peptides descriptors used in the modeling. Therefore, it can be concluded that the models built by fixing the drawbacks existing in the literature contribute to identifying antibacterial, antifungal, antiparasitic, and antiviral peptides with high effectivity and reliability. Models are freely available via the AMPDiscover tool at https://biocom-ampdiscover.cicese.mx/.
Collapse
Affiliation(s)
- Sergio A Pinacho-Castellanos
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México.,Centro de Investigación y Desarrollo de Tecnología Digital (CITEDI), Instituto Politécnico Nacional (IPN), 22435 Tijuana, Baja California, México
| | - César R García-Jacas
- Cátedras CONACYT-Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| | - Michael K Gilson
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, California 92093, United States
| | - Carlos A Brizuela
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), 22860 Ensenada, Baja California, México
| |
Collapse
|
21
|
Zhang Y, Lin J, Zhao L, Zeng X, Liu X. A novel antibacterial peptide recognition algorithm based on BERT. Brief Bioinform 2021; 22:6284370. [PMID: 34037687 DOI: 10.1093/bib/bbab200] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2020] [Revised: 04/19/2021] [Accepted: 05/03/2021] [Indexed: 12/31/2022] Open
Abstract
As the best substitute for antibiotics, antimicrobial peptides (AMPs) have important research significance. Due to the high cost and difficulty of experimental methods for identifying AMPs, more and more researches are focused on using computational methods to solve this problem. Most of the existing calculation methods can identify AMPs through the sequence itself, but there is still room for improvement in recognition accuracy, and there is a problem that the constructed model cannot be universal in each dataset. The pre-training strategy has been applied to many tasks in natural language processing (NLP) and has achieved gratifying results. It also has great application prospects in the field of AMP recognition and prediction. In this paper, we apply the pre-training strategy to the model training of AMP classifiers and propose a novel recognition algorithm. Our model is constructed based on the BERT model, pre-trained with the protein data from UniProt, and then fine-tuned and evaluated on six AMP datasets with large differences. Our model is superior to the existing methods and achieves the goal of accurate identification of datasets with small sample size. We try different word segmentation methods for peptide chains and prove the influence of pre-training steps and balancing datasets on the recognition effect. We find that pre-training on a large number of diverse AMP data, followed by fine-tuning on new data, is beneficial for capturing both new data's specific features and common features between AMP sequences. Finally, we construct a new AMP dataset, on which we train a general AMP recognition model.
Collapse
Affiliation(s)
- Yue Zhang
- Xiamen University, Xiamen 361005, China
| | | | | | | | | |
Collapse
|
22
|
Sharma R, Shrivastava S, Kumar Singh S, Kumar A, Saxena S, Kumar Singh R. Deep-ABPpred: identifying antibacterial peptides in protein sequences using bidirectional LSTM with word2vec. Brief Bioinform 2021; 22:6204762. [PMID: 33784381 DOI: 10.1093/bib/bbab065] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2020] [Revised: 02/04/2021] [Indexed: 12/13/2022] Open
Abstract
The overuse of antibiotics has led to emergence of antimicrobial resistance, and as a result, antibacterial peptides (ABPs) are receiving significant attention as an alternative. Identification of effective ABPs in lab from natural sources is a cost-intensive and time-consuming process. Therefore, there is a need for the development of in silico models, which can identify novel ABPs in protein sequences for chemical synthesis and testing. In this study, we propose a deep learning classifier named Deep-ABPpred that can identify ABPs in protein sequences. We developed Deep-ABPpred using bidirectional long short-term memory algorithm with amino acid level features from word2vec. The results show that Deep-ABPpred outperforms other state-of-the-art ABP classifiers on both test and independent datasets. Our proposed model achieved the precision of approximately 97 and 94% on test dataset and independent dataset, respectively. The high precision suggests applicability of Deep-ABPpred in proposing novel ABPs for synthesis and experimentation. By utilizing Deep-ABPpred, we identified ABPs in the tail protein sequences of Streptococcus bacteriophages, chemically synthesized identified peptides in lab and tested their activity in vitro. These ABPs showed potent antibacterial activity against selected Gram-positive and Gram-negative bacteria, which confirms the capability of Deep-ABPpred in identifying novel ABPs in protein sequences. Based on the proposed approach, an online prediction server is also developed, which is freely accessible at https://abppred.anvil.app/. This web server takes the protein sequence as input and provides ABPs with high probability (>0.95) as output.
Collapse
Affiliation(s)
- Ritesh Sharma
- Department of Computer Science and Engineering at IIT (BHU), Varanasi, India
| | | | - Sanjay Kumar Singh
- Department of Computer Science and Engineering at IIT (BHU), Varanasi, India
| | - Abhinav Kumar
- Department of Computer Science and Engineering at IIT (BHU), Varanasi, India
| | - Sonal Saxena
- Division of Veterinary Biotechnology, IVRI, Izatnagar, India
| | | |
Collapse
|
23
|
Bechinger B, Juhl DW, Glattard E, Aisenbrey C. Revealing the Mechanisms of Synergistic Action of Two Magainin Antimicrobial Peptides. FRONTIERS IN MEDICAL TECHNOLOGY 2020; 2:615494. [PMID: 35047895 PMCID: PMC8757784 DOI: 10.3389/fmedt.2020.615494] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2020] [Accepted: 11/30/2020] [Indexed: 12/18/2022] Open
Abstract
The study of peptide-lipid and peptide-peptide interactions as well as their topology and dynamics using biophysical and structural approaches have changed our view how antimicrobial peptides work and function. It has become obvious that both the peptides and the lipids arrange in soft supramolecular arrangements which are highly dynamic and able to change and mutually adapt their conformation, membrane penetration, and detailed morphology. This can occur on a local and a global level. This review focuses on cationic amphipathic peptides of the magainin family which were studied extensively by biophysical approaches. They are found intercalated at the membrane interface where they cause membrane thinning and ultimately lysis. Interestingly, mixtures of two of those peptides namely magainin 2 and PGLa which occur naturally as a cocktail in the frog skin exhibit synergistic enhancement of antimicrobial activities when investigated together in antimicrobial assays but also in biophysical experiments with model membranes. Detailed dose-response curves, presented here for the first time, show a cooperative behavior for the individual peptides which is much increased when PGLa and magainin are added as equimolar mixture. This has important consequences for their bacterial killing activities and resistance development. In membranes that carry unsaturations both peptides align parallel to the membrane surface where they have been shown to arrange into mesophases involving the peptides and the lipids. This supramolecular structuration comes along with much-increased membrane affinities for the peptide mixture. Because this synergism is most pronounced in membranes representing the bacterial lipid composition it can potentially be used to increase the therapeutic window of pharmaceutical formulations.
Collapse
Affiliation(s)
- Burkhard Bechinger
- University of Strasbourg/CNRS, UMR7177, Institut de Chimie de Strasbourg, Strasbourg, France
- Institut Universitaire de France (IUF), Paris, France
| | - Dennis Wilkens Juhl
- University of Strasbourg/CNRS, UMR7177, Institut de Chimie de Strasbourg, Strasbourg, France
| | - Elise Glattard
- University of Strasbourg/CNRS, UMR7177, Institut de Chimie de Strasbourg, Strasbourg, France
| | - Christopher Aisenbrey
- University of Strasbourg/CNRS, UMR7177, Institut de Chimie de Strasbourg, Strasbourg, France
| |
Collapse
|