1
|
Rathore AS, Choudhury S, Arora A, Tijare P, Raghava GPS. ToxinPred 3.0: An improved method for predicting the toxicity of peptides. Comput Biol Med 2024; 179:108926. [PMID: 39038391 DOI: 10.1016/j.compbiomed.2024.108926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 05/17/2024] [Accepted: 07/17/2024] [Indexed: 07/24/2024]
Abstract
Toxicity emerges as a prominent challenge in the design of therapeutic peptides, causing the failure of numerous peptides during clinical trials. In 2013, our group developed ToxinPred, a computational method that has been extensively adopted by the scientific community for predicting peptide toxicity. In this paper, we propose a refined variant of ToxinPred that showcases improved reliability and accuracy in predicting peptide toxicity. Initially, we utilized a similarity/alignment-based approach employing BLAST to predict toxic peptides, which yielded satisfactory accuracy; however, the method suffered from inadequate coverage. Subsequently, we employed a motif-based approach using MERCI software to uncover specific patterns or motifs that are exclusively observed in toxic peptides. The search for these motifs in peptides allowed us to predict toxic peptides with a high level of specificity with poor sensitivity. To overcome the coverage limitations, we developed alignment-free methods using machine/deep learning techniques to balance sensitivity and specificity of prediction. Deep learning model (ANN - LSTM with fixed sequence length) developed using one-hot encoding achieved a maximum AUROC of 0.93 with MCC of 0.71 on an independent dataset. Machine learning model (extra tree) developed using compositional features of peptides achieved a maximum AUROC of 0.95 with MCC of 0.78. We also developed large language models and achieved maximum AUC of 0.93 using ESM2-t33. Finally, we developed hybrid or ensemble methods combining two or more methods to enhance performance. Our specific hybrid method, which combines a motif-based approach with a machine learning-based model, achieved a maximum AUROC of 0.98 with MCC 0.81 on an independent dataset. In this study, all models were trained and tested on 80 % of data using five-fold cross-validation and evaluated on the remaining 20 % of data called independent dataset. The evaluation of all methods on an independent dataset revealed that the method proposed in this study exhibited better performance than existing methods. To cater to the needs of the scientific community, we have developed a standalone software, pip package and web-based server ToxinPred3 (https://github.com/raghavagps/toxinpred3 and https://webs.iiitd.edu.in/raghava/toxinpred3/).
Collapse
Affiliation(s)
- Anand Singh Rathore
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India.
| | - Shubham Choudhury
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India.
| | - Akanksha Arora
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India.
| | - Purva Tijare
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India.
| | - Gajendra P S Raghava
- Department of Computational Biology, Indraprastha Institute of Information Technology, Okhla Phase 3, New Delhi, 110020, India.
| |
Collapse
|
2
|
Kang Y, Zhang H, Wang X, Yang Y, Jia Q. MMDB: Multimodal dual-branch model for multi-functional bioactive peptide prediction. Anal Biochem 2024; 690:115491. [PMID: 38460901 DOI: 10.1016/j.ab.2024.115491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2023] [Revised: 01/21/2024] [Accepted: 02/19/2024] [Indexed: 03/11/2024]
Abstract
Bioactive peptides can hinder oxidative processes and microbial spoilage in foodstuffs and play important roles in treating diverse diseases and disorders. While most of the methods focus on single-functional bioactive peptides and have obtained promising prediction performance, it is still a significant challenge to accurately detect complex and diverse functions simultaneously with the quick increase of multi-functional bioactive peptides. In contrast to previous research on multi-functional bioactive peptide prediction based solely on sequence, we propose a novel multimodal dual-branch (MMDB) lightweight deep learning model that designs two different branches to effectively capture the complementary information of peptide sequence and structural properties. Specifically, a multi-scale dilated convolution with Bi-LSTM branch is presented to effectively model the different scales sequence properties of peptides while a multi-layer convolution branch is proposed to capture structural information. To the best of our knowledge, this is the first effective extraction of peptide sequence features using multi-scale dilated convolution without parameter increase. Multimodal features from both branches are integrated via a fully connected layer for multi-label classification. Compared to state-of-the-art methods, our MMDB model exhibits competitive results across metrics, with a 9.1% Coverage increase and 5.3% and 3.5% improvements in Precision and Accuracy, respectively.
Collapse
Affiliation(s)
- Yan Kang
- National Pilot School of Software, Yunnan University, Kunming, 650091, Yunnan, China; Yunnan Key Laboratory of Software Engineering, China
| | - Huadong Zhang
- National Pilot School of Software, Yunnan University, Kunming, 650091, Yunnan, China
| | - Xinchao Wang
- National Pilot School of Software, Yunnan University, Kunming, 650091, Yunnan, China
| | - Yun Yang
- National Pilot School of Software, Yunnan University, Kunming, 650091, Yunnan, China; Yunnan Key Laboratory of Software Engineering, China.
| | - Qi Jia
- School of Information Science, Yunnan University, Kunming, 650091, Yunnan, China
| |
Collapse
|
3
|
Olcay B, Ozdemir GD, Ozdemir MA, Ercan UK, Guren O, Karaman O. Prediction of the synergistic effect of antimicrobial peptides and antimicrobial agents via supervised machine learning. BMC Biomed Eng 2024; 6:1. [PMID: 38233957 DOI: 10.1186/s42490-024-00075-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Accepted: 01/09/2024] [Indexed: 01/19/2024] Open
Abstract
BACKGROUND Infectious diseases not only cause severe health problems but also burden the healthcare system. Therefore, the effective treatment of those diseases is crucial. Both conventional approaches, such as antimicrobial agents, and novel approaches, like antimicrobial peptides (AMPs), are used to treat infections. However, due to the drawbacks of current approaches, new solutions are still being investigated. One recent approach is the use of AMPs and antimicrobial agents in combination, but determining synergism is with a huge variety of AMPs time-consuming and requires multiple experimental studies. Machine learning (ML) algorithms are widely used to predict biological outcomes, particularly in the field of AMPs, but no previous research reported on predicting the synergistic effects of AMPs and antimicrobial agents. RESULTS Several supervised ML models were implemented to accurately predict the synergistic effect of AMPs and antimicrobial agents. The results demonstrated that the hyperparameter-optimized Light Gradient Boosted Machine Classifier (oLGBMC) yielded the best test accuracy of 76.92% for predicting the synergistic effect. Besides, the feature importance analysis reveals that the target microbial species, the minimum inhibitory concentrations (MICs) of the AMP and the antimicrobial agents, and the used antimicrobial agent were the most important features for the prediction of synergistic effect, which aligns with recent experimental studies in the literature. CONCLUSION This study reveals that ML algorithms can predict the synergistic activity of two different antimicrobial agents without the need for complex and time-consuming experimental procedures. The implications support that the ML models may not only reduce the experimental cost but also provide validation of experimental procedures.
Collapse
Affiliation(s)
- Basak Olcay
- Department of Biomedical Engineering, Graduate School of Natural and Applied Sciences, Izmir Katip Celebi University, Izmir, Turkey
| | - Gizem D Ozdemir
- Department of Biomedical Engineering, Faculty of Engineering and Architecture, Izmir Katip Celebi University, Izmir, Turkey
| | - Mehmet A Ozdemir
- Department of Biomedical Engineering, Faculty of Engineering and Architecture, Izmir Katip Celebi University, Izmir, Turkey.
| | - Utku K Ercan
- Department of Biomedical Engineering, Faculty of Engineering and Architecture, Izmir Katip Celebi University, Izmir, Turkey
| | - Onan Guren
- Department of Biomedical Engineering, Faculty of Engineering and Architecture, Izmir Katip Celebi University, Izmir, Turkey
| | - Ozan Karaman
- Department of Biomedical Engineering, Faculty of Engineering and Architecture, Izmir Katip Celebi University, Izmir, Turkey
| |
Collapse
|
4
|
Yang Y, Wu H, Gao Y, Tong W, Li K. MFPPDB: a comprehensive multi-functional plant peptide database. FRONTIERS IN PLANT SCIENCE 2023; 14:1224394. [PMID: 37908832 PMCID: PMC10613858 DOI: 10.3389/fpls.2023.1224394] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Accepted: 09/29/2023] [Indexed: 11/02/2023]
Abstract
Plants produce a wide range of bioactive peptides as part of their innate defense mechanisms. With the explosive growth of plant-derived peptides, verifying the therapeutic function using traditional experimental methods are resources and time consuming. Therefore, it is necessary to predict the therapeutic function of plant-derived peptides more effectively and accurately with reduced waste of resources and thus expedite the development of plant peptides. We herein developed a repository of plant peptides predicted to have multiple therapeutic functions, named as MFPPDB (multi-functional plant peptide database). MFPPDB including 1,482,409 single or multiple functional plant origin therapeutic peptides derived from 121 fundamental plant species. The functional categories of these therapeutic peptides include 41 different features such as anti-bacterial, anti-fungal, anti-HIV, anti-viral, and anti-cancer. The detailed physicochemical information of these peptides was presented in functional search and physicochemical property search module, which can help users easily access the peptide information by the plant peptide species, ID, and functions, or by their peptide ID, isoelectric point, peptide sequence, and molecular weight through web-friendly interface. We further matched the predicted peptides to nine state-of-the-art curated functional peptide databases and found that at least 293,408 of the peptides possess functional potentials. Overall, MFPPDB integrated a massive number of plant peptides have single or multiple therapeutic functions, which will facilitate the comprehensive research in plant peptidomics. MFPPDB can be freely accessed through http://124.223.195.214:9188/mfppdb/index.
Collapse
Affiliation(s)
- Yaozu Yang
- School of Information and Computer, Anhui Agricultural University, Hefei, China
- Information Materials and Intelligent Sensing Laboratory of Anhui Province, Anhui University, Hefei, Anhui, China
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, Anhui, China
| | - Hongwei Wu
- School of Information and Computer, Anhui Agricultural University, Hefei, China
| | - Yu Gao
- School of Information and Computer, Anhui Agricultural University, Hefei, China
| | - Wei Tong
- State Key Laboratory of Tea Plant Biology and Utilization, Anhui Agricultural University, Hefei, Anhui, China
| | - Ke Li
- School of Information and Computer, Anhui Agricultural University, Hefei, China
- Information Materials and Intelligent Sensing Laboratory of Anhui Province, Anhui University, Hefei, Anhui, China
- Anhui Provincial Engineering Laboratory for Beidou Precision Agriculture Information, Anhui Agricultural University, Hefei, Anhui, China
| |
Collapse
|
5
|
Debroy R, Ramaiah S. Consolidated knowledge-guided computational pipeline for therapeutic intervention against bacterial biofilms - a review. BIOFOULING 2023; 39:928-947. [PMID: 38108207 DOI: 10.1080/08927014.2023.2294763] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 12/11/2023] [Indexed: 12/19/2023]
Abstract
Biofilm-associated bacterial infections attributed to multifactorial antimicrobial resistance have caused worldwide challenges in formulating successful treatment strategies. In search of accelerated yet cost-effective therapeutics, several researchers have opted for bioinformatics-based protocols to systemize targeted therapies against biofilm-producing strains. The present review investigated the up-to-date computational databases and servers dedicated to anti-biofilm research to design/screen novel biofilm inhibitors (antimicrobial peptides/phytocompounds/synthetic compounds) and predict their biofilm-inhibition efficacy. Scrutinizing the contemporary in silico methods, a consolidated approach has been highlighted, referred to as a knowledge-guided computational pipeline for biofilm-targeted therapy. The proposed pipeline has amalgamated prominently employed methodologies in genomics, transcriptomics, interactomics and proteomics to identify potential target proteins and their complementary anti-biofilm compounds for effective functional inhibition of biofilm-linked pathways. This review can pave the way for new portals to formulate successful therapeutic interventions against biofilm-producing pathogens.
Collapse
Affiliation(s)
- Reetika Debroy
- Medical and Biological Computing Laboratory, School of Bio-Sciences and Technology (SBST), Vellore Institute of Technology (VIT), Vellore, Tamil Nadu, India
- Department of Bio-Medical Sciences, School of Bio-Sciences and Technology (SBST), Vellore Institute of Technology (VIT), Vellore, Tamil Nadu, India
| | - Sudha Ramaiah
- Medical and Biological Computing Laboratory, School of Bio-Sciences and Technology (SBST), Vellore Institute of Technology (VIT), Vellore, Tamil Nadu, India
- Department of Bio-Sciences, School of Bio-Sciences and Technology (SBST), Vellore Institute of Technology (VIT), Vellore, Tamil Nadu, India
| |
Collapse
|
6
|
Liu T, Zhai Y, Jeong KC. Advancing understanding of microbial biofilms through machine learning-powered studies. Food Sci Biotechnol 2023; 32:1653-1664. [PMID: 37780593 PMCID: PMC10533454 DOI: 10.1007/s10068-023-01415-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 07/26/2023] [Accepted: 08/07/2023] [Indexed: 10/03/2023] Open
Abstract
Microbial biofilms are prevalent in various environments and pose significant challenges to food safety and public health. The biofilms formed by pathogens can cause food spoilage, foodborne illness, and infectious diseases, which are difficult to treat due to their enhanced antimicrobial resistance. While the composition and development of biofilms have been widely studied, their profound impact on food, the food industry, and public health has not been sufficiently recapitulated. This review aims to provide a comprehensive overview of microbial biofilms in the food industry and their implication on public health. It highlights the existence of biofilms along the food-producing chains and the underlying mechanisms of biofilm-associated diseases. Furthermore, this review thoroughly summarizes the enhanced understanding of microbial biofilms achieved through machine learning approaches in biofilm research. By consolidating existing knowledge, this review intends to facilitate developing effective strategies to combat biofilm-associated infections in both the food industry and public health.
Collapse
Affiliation(s)
- Ting Liu
- Emerging Pathogens Institute, University of Florida, 2055 Mowry Rd, Gainesville, FL 32610 USA
- Department of Animal Sciences, University of Florida, 2250 Shealy Dr, Gainesville, FL 32608 USA
| | - Yuting Zhai
- Emerging Pathogens Institute, University of Florida, 2055 Mowry Rd, Gainesville, FL 32610 USA
- Department of Animal Sciences, University of Florida, 2250 Shealy Dr, Gainesville, FL 32608 USA
| | - Kwangcheol Casey Jeong
- Emerging Pathogens Institute, University of Florida, 2055 Mowry Rd, Gainesville, FL 32610 USA
- Department of Animal Sciences, University of Florida, 2250 Shealy Dr, Gainesville, FL 32608 USA
| |
Collapse
|
7
|
Xu J, Li F, Li C, Guo X, Landersdorfer C, Shen HH, Peleg AY, Li J, Imoto S, Yao J, Akutsu T, Song J. iAMPCN: a deep-learning approach for identifying antimicrobial peptides and their functional activities. Brief Bioinform 2023; 24:bbad240. [PMID: 37369638 PMCID: PMC10359087 DOI: 10.1093/bib/bbad240] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 05/30/2023] [Accepted: 06/08/2023] [Indexed: 06/29/2023] Open
Abstract
Antimicrobial peptides (AMPs) are short peptides that play crucial roles in diverse biological processes and have various functional activities against target organisms. Due to the abuse of chemical antibiotics and microbial pathogens' increasing resistance to antibiotics, AMPs have the potential to be alternatives to antibiotics. As such, the identification of AMPs has become a widely discussed topic. A variety of computational approaches have been developed to identify AMPs based on machine learning algorithms. However, most of them are not capable of predicting the functional activities of AMPs, and those predictors that can specify activities only focus on a few of them. In this study, we first surveyed 10 predictors that can identify AMPs and their functional activities in terms of the features they employed and the algorithms they utilized. Then, we constructed comprehensive AMP datasets and proposed a new deep learning-based framework, iAMPCN (identification of AMPs based on CNNs), to identify AMPs and their related 22 functional activities. Our experiments demonstrate that iAMPCN significantly improved the prediction performance of AMPs and their corresponding functional activities based on four types of sequence features. Benchmarking experiments on the independent test datasets showed that iAMPCN outperformed a number of state-of-the-art approaches for predicting AMPs and their functional activities. Furthermore, we analyzed the amino acid preferences of different AMP activities and evaluated the model on datasets of varying sequence redundancy thresholds. To facilitate the community-wide identification of AMPs and their corresponding functional types, we have made the source codes of iAMPCN publicly available at https://github.com/joy50706/iAMPCN/tree/master. We anticipate that iAMPCN can be explored as a valuable tool for identifying potential AMPs with specific functional activities for further experimental validation.
Collapse
Affiliation(s)
- Jing Xu
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Fuyi Li
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- College of Information Engineering, Northwest A&F University, Shaanxi 712100, China
- The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, VIC 3800, Australia
| | - Chen Li
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Xudong Guo
- College of Information Engineering, Northwest A&F University, Shaanxi 712100, China
| | - Cornelia Landersdorfer
- Monash Institute of Pharmaceutical Sciences, Monash University, Melbourne, VIC 3800, Australia
| | - Hsin-Hui Shen
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Department of Materials Science and Engineering, Faculty of Engineering, Monash University, Clayton, VIC, 3800, Australia
| | - Anton Y Peleg
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Department of Infectious Diseases, Alfred Hospital, Alfred Health, Melbourne, Victoria, Australia
| | - Jian Li
- Monash Biomedicine Discovery Institute and Department of Microbiology, Monash University, Melbourne, VIC 3800, Australia
| | - Seiya Imoto
- Division of Health Medical Intelligence, Human Genome Center, Institute of Medical Science, The University of Tokyo, Minato-ku, Tokyo, Japan
- Collaborative Research Institute for Innovative Microbiology, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | | | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji 611-0011, Japan
| | - Jiangning Song
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji 611-0011, Japan
| |
Collapse
|
8
|
Rajput A, Bhamare KT, Thakur A, Kumar M. Anti-Biofilm: Machine Learning Assisted Prediction of IC 50 Activity of Chemicals Against Biofilms of Microbes Causing Antimicrobial Resistance and Implications in Drug Repurposing. J Mol Biol 2023; 435:168115. [PMID: 37356913 DOI: 10.1016/j.jmb.2023.168115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 04/06/2023] [Accepted: 04/14/2023] [Indexed: 06/27/2023]
Abstract
Biofilms are one of the leading causes of antibiotic resistance. It acts as a physical barrier against the human immune system and drugs. The use of anti-biofilm agents helps in tackling the menace of antibiotic resistance. The identification of efficient anti-biofilm chemicals remains a challenge. Therefore, in this study, we developed 'anti-Biofilm', a machine learning technique (MLT) based predictive algorithm for identifying and analyzing the biofilm inhibition of small molecules. The algorithm is developed using experimentally validated anti-biofilm compounds with half maximal inhibitory concentration (IC50) values extracted from aBiofilm resource. Out of the five MLTs, the Support Vector Machine performed best with Pearson's correlation coefficient of 0.75 on the training/testing data set. The robustness of the developed model was further checked using an independent validation dataset. While analyzing the chemical diversity of the anti-biofilm compounds, we observed that they occupy diverse chemical spaces with parent molecules like furanone, urea, phenolic acids, quinolines, and many more. Use of diverse chemicals as input further signifies the robustness of our predictive models. The three best-performing machine learning models were implemented as a user-friendly 'anti-Biofilm' web server (https://bioinfo.imtech.res.in/manojk/antibiofilm/) with different other modules which make 'anti-Biofilm' a comprehensive platform. Therefore, we hope that our initiative will be helpful for the scientific community engaged in identifying effective anti-biofilm agents to target the problem of antimicrobial resistance.
Collapse
Affiliation(s)
- Akanksha Rajput
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh 160036, India
| | - Kailash T Bhamare
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh 160036, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Anamika Thakur
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh 160036, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India
| | - Manoj Kumar
- Virology Unit and Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39A, Chandigarh 160036, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India.
| |
Collapse
|
9
|
Raya D, Peta V, Bomgni A, Du Do T, Kalimuthu J, Salem DR, Gadhamshetty V, Gnimpieba EZ, Dhiman SS. Classification of bacterial nanowire proteins using Machine Learning and Feature Engineering model. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.03.539336. [PMID: 37205598 PMCID: PMC10187271 DOI: 10.1101/2023.05.03.539336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Nanowires (NW) have been extensively studied for Shewanella spp. and Geobacter spp. and are mostly produced by Type IV pili or multiheme c-type cytochrome. Electron transfer via NW is the most studied mechanism in microbially induced corrosion, with recent interest in application in bioelectronics and biosensor. In this study, a machine learning (ML) based tool was developed to classify NW proteins. A manually curated 999 protein collection was developed as an NW protein dataset. Gene ontology analysis of the dataset revealed microbial NW is part of membranal proteins with metal ion binding motifs and plays a central role in electron transfer activity. Random Forest (RF), support vector machine (SVM), and extreme gradient boost (XGBoost) models were implemented in the prediction model and were observed to identify target proteins based on functional, structural, and physicochemical properties with 89.33%, 95.6%, and 99.99% accuracy. Dipetide amino acid composition, transition, and distribution protein features of NW are key important features aiding in the model's high performance.
Collapse
|
10
|
Agüero-Chapin G, Antunes A, Mora JR, Pérez N, Contreras-Torres E, Valdes-Martini JR, Martinez-Rios F, Zambrano CH, Marrero-Ponce Y. Complex Networks Analyses of Antibiofilm Peptides: An Emerging Tool for Next-Generation Antimicrobials' Discovery. Antibiotics (Basel) 2023; 12:antibiotics12040747. [PMID: 37107109 PMCID: PMC10135022 DOI: 10.3390/antibiotics12040747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Revised: 04/04/2023] [Accepted: 04/11/2023] [Indexed: 04/29/2023] Open
Abstract
Microbial biofilms cause several environmental and industrial issues, even affecting human health. Although they have long represented a threat due to their resistance to antibiotics, there are currently no approved antibiofilm agents for clinical treatments. The multi-functionality of antimicrobial peptides (AMPs), including their antibiofilm activity and their potential to target multiple microbes, has motivated the synthesis of AMPs and their relatives for developing antibiofilm agents for clinical purposes. Antibiofilm peptides (ABFPs) have been organized in databases that have allowed the building of prediction tools which have assisted in the discovery/design of new antibiofilm agents. However, the complex network approach has not yet been explored as an assistant tool for this aim. Herein, a kind of similarity network called the half-space proximal network (HSPN) is applied to represent/analyze the chemical space of ABFPs, aiming to identify privileged scaffolds for the development of next-generation antimicrobials that are able to target both planktonic and biofilm microbial forms. Such analyses also considered the metadata associated with the ABFPs, such as origin, other activities, targets, etc., in which the relationships were projected by multilayer networks called metadata networks (METNs). From the complex networks' mining, a reduced but informative set of 66 ABFPs was extracted, representing the original antibiofilm space. This subset contained the most central to atypical ABFPs, some of them having the desired properties for developing next-generation antimicrobials. Therefore, this subset is advisable for assisting the search for/design of both new antibiofilms and antimicrobial agents. The provided ABFP motifs list, discovered within the HSPN communities, is also useful for the same purpose.
Collapse
Affiliation(s)
- Guillermin Agüero-Chapin
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, 4450-208 Porto, Portugal
- Department of Biology, Faculty of Sciences, University of Porto, 4169-007 Porto, Portugal
| | - Agostinho Antunes
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, 4450-208 Porto, Portugal
- Department of Biology, Faculty of Sciences, University of Porto, 4169-007 Porto, Portugal
| | - José R Mora
- Universidad San Francisco de Quito (USFQ), Colegio de Ciencias e Ingenierías "El Politécnico", Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Pichincha, Ecuador
| | - Noel Pérez
- Universidad San Francisco de Quito (USFQ), Colegio de Ciencias e Ingenierías "El Politécnico", Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Pichincha, Ecuador
| | - Ernesto Contreras-Torres
- Universidad San Francisco de Quito (USFQ), Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas and Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Pichincha, Ecuador
| | | | - Felix Martinez-Rios
- Facultad de Ingeniería, Universidad Panamericana, Augusto Rodin No. 498, Insurgentes Mixcoac, Benito Juárez, Ciudad de México 03920, Mexico
| | - Cesar H Zambrano
- Universidad San Francisco de Quito (USFQ), Colegio de Ciencias e Ingenierías "El Politécnico", Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Pichincha, Ecuador
| | - Yovani Marrero-Ponce
- Universidad San Francisco de Quito (USFQ), Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas and Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, Quito 170157, Pichincha, Ecuador
- Departamento de Ciencias de la Computación, Centro de Investigación Científica y de Educación Superior de Ensenada (CICESE), Ensenada 22860, Baja California, Mexico
| |
Collapse
|
11
|
Kumar S, Balaya RDA, Kanekar S, Raju R, Prasad TSK, Kandasamy RK. Computational tools for exploring peptide-membrane interactions in gram-positive bacteria. Comput Struct Biotechnol J 2023; 21:1995-2008. [PMID: 36950221 PMCID: PMC10025024 DOI: 10.1016/j.csbj.2023.02.051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 02/27/2023] [Accepted: 02/27/2023] [Indexed: 03/05/2023] Open
Abstract
The vital cellular functions in Gram-positive bacteria are controlled by signaling molecules known as quorum sensing peptides (QSPs), considered promising therapeutic interventions for bacterial infections. In the bacterial system QSPs bind to membrane-coupled receptors, which then auto-phosphorylate and activate intracellular response regulators. These response regulators induce target gene expression in bacteria. One of the most reliable trends in drug discovery research for virulence-associated molecular targets is the use of peptide drugs or new functionalities. In this perspective, computational methods act as auxiliary aids for biologists, where methodologies based on machine learning and in silico analysis are developed as suitable tools for target peptide identification. Therefore, the development of quick and reliable computational resources to identify or predict these QSPs along with their receptors and inhibitors is receiving considerable attention. The databases such as Quorumpeps and Quorum Sensing of Human Gut Microbes (QSHGM) provide a detailed overview of the structures and functions of QSPs. The tools and algorithms such as QSPpred, QSPred-FL, iQSP, EnsembleQS and PEPred-Suite have been used for the generic prediction of QSPs and feature representation. The availability of compiled key resources for utilizing peptide features based on amino acid composition, positional preferences, and motifs as well as structural and physicochemical properties, including biofilm inhibitory peptides, can aid in elucidating the QSP and membrane receptor interactions in infectious Gram-positive pathogens. Herein, we present a comprehensive survey of diverse computational approaches that are suitable for detecting QSPs and QS interference molecules. This review highlights the utility of these methods for developing potential biomarkers against infectious Gram-positive pathogens.
Collapse
Key Words
- 3-HBA, 3–Hydroxybenzoic Acid
- AAC, Amino Acid Composition
- ABC, ATP-binding cassette
- ACD, Available Chemicals Database
- AIP, Autoinducing Peptide
- AMP, Anti-Microbial Peptide
- ATP, Adenosine Triphosphate
- Agr, Accessory gene regulator
- BFE, Binding Free Energy
- BIP Inhibitors
- BIP, Biofilm Inhibitory Peptides
- BLAST, Basic Local Alignment Search Tool
- BNB, Bernoulli Naïve-Bayes
- CADD, Computer-Aided Drug Design
- CSP, Competence Stimulating Peptide
- CTD, Composition-Transition-Distribution
- D, Aspartate
- DCH, 3,3′-(3,4-dichlorobenzylidene)-bis-(4-hydroxycoumarin)
- DT, Decision Tree
- FDA, Food and Drug Administration
- GBM, Gradient Boosting Machine
- GDC, g-gap Dipeptide
- GNB, Gaussian NB
- Gram-positive bacteria
- H, Histidine
- H-Kinase, Histidine Kinase
- H-phosphotransferase, Histidine Phosphotransferase
- HAM, Hamamelitannin
- HGM, Human Gut Microbiota
- HNP, Human Neutrophil Peptide
- IT, Information Theory Features
- In silico approaches
- KNN, K-Nearest Neighbors
- MCC, Mathew Co-relation Coefficient
- MD, Molecular Dynamics
- MDR, Multiple Drug Resistance
- ML, Machine Learning
- MRSA, Methicillin Resistant S. aureus
- MSL, Multiple Sequence Alignment
- OMR, Omargliptin
- OVP, Overlapping Property Features
- PCP, Physicochemical Properties
- PDB, Protein Data Bank
- PPIs, Protein-Protein Interactions
- PSM, Phenol-Soluble Modulin
- PTM, Post Translational Modification
- QS, Quorum Sensing
- QSCN, QS communication network
- QSHGM, Quorum Sensing of Human Gut Microbes
- QSI, QS Inhibitors
- QSIM, QS Interference Molecules
- QSP inhibitors
- QSP predictors
- QSP, QS Peptides
- QSPR, Quantitative Structure Property Relationship
- Quorum sensing peptides
- RAP, RNAIII-activating protein
- RF, Random Forest
- RIP, RNAIII-inhibiting peptide
- ROC, Receiver Operating Characteristic
- SAR, Structure-Activity Relationship
- SFS, Sequential Forward Search
- SIT, Sitagliptin
- SVM, Support Vector Machine
- TCS, Two-Component Sensory
- TRAP, Target of RAP
- TRG, Trelagliptin
- WHO, World Health Organization
- mRMR, minimum Redundancy and Maximum Relevance
Collapse
Affiliation(s)
- Shreya Kumar
- Centre for Integrative Omics Data Science, Yenepoya (Deemed to be University), Mangalore 575018, India
- Centre for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | | | - Saptami Kanekar
- Centre for Integrative Omics Data Science, Yenepoya (Deemed to be University), Mangalore 575018, India
| | - Rajesh Raju
- Centre for Integrative Omics Data Science, Yenepoya (Deemed to be University), Mangalore 575018, India
- Centre for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | | | - Richard K. Kandasamy
- Centre of Molecular Inflammation Research (CEMIR), and Department of Clinical and Molecular Medicine (IKOM), Norwegian University of Science and Technology, 7491 Trondheim, Norway
- Department of Laboratory Medicine and Pathology, Center for Individualized Medicine, Mayo Clinic, Rochester, MN 55905, USA
| |
Collapse
|
12
|
Yu H, Luo X. IPPF-FE: an integrated peptide and protein function prediction framework based on fused features and ensemble models. Brief Bioinform 2023; 24:6834141. [PMID: 36403184 DOI: 10.1093/bib/bbac476] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2022] [Revised: 09/23/2022] [Accepted: 10/05/2022] [Indexed: 11/21/2022] Open
Abstract
The prediction of peptide and protein function is important for research and industrial applications, and many machine learning methods have been developed for this purpose. The existing models have encountered many challenges, including the lack of effective and comprehensive features and the limited applicability of each model. Here, we introduce an Integrated Peptide and Protein function prediction Framework based on Fused features and Ensemble models (IPPF-FE), which can accurately capture the relationship between features and labels. The results indicated that IPPF-FE outperformed existing state-of-the-art (SOTA) models on more than 8 different categories of peptide and protein tasks. In addition, t-distributed Stochastic Neighbour Embedding demonstrated the advantages of IPPF-FE. We anticipate that our method will become a versatile tool for peptide and protein prediction tasks and shed light on the future development of related models. The model is open source and available in the GitHub repository https://github.com/Luo-SynBioLab/IPPF-FE.
Collapse
Affiliation(s)
- Han Yu
- Center for Synthetic Biochemistry, Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| | - Xiaozhou Luo
- Shenzhen Key Laboratory for the Intelligent Microbial Manufacturing of Medicines, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China.,University of Chinese Academy of Sciences, Beijing 100049, China.,CAS Key Laboratory of Quantitative Engineering Biology, Shenzhen Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China.,Center for Synthetic Biochemistry, Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518055, China
| |
Collapse
|
13
|
Ebrahimi Tarki F, Zarrabi M, Abdiali A, Sharbatdar M. Integration of Machine Learning and Structural Analysis for Predicting Peptide Antibiofilm Effects: Advancements in Drug Discovery for Biofilm-Related Infections. IRANIAN JOURNAL OF PHARMACEUTICAL RESEARCH : IJPR 2023; 22:e138704. [PMID: 38450220 PMCID: PMC10916117 DOI: 10.5812/ijpr-138704] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 08/22/2023] [Accepted: 08/26/2023] [Indexed: 03/08/2024]
Abstract
Background The rise of antibiotic resistance has become a major concern, signaling the end of the golden age of antibiotics. Bacterial biofilms, which exhibit high resistance to antibiotics, significantly contribute to the emergence of antibiotic resistance. Therefore, there is an urgent need to discover new therapeutic agents with specific characteristics to effectively combat biofilm-related infections. Studies have shown the promising potential of peptides as antimicrobial agents. Objectives This study aimed to establish a cost-effective and streamlined computational method for predicting the antibiofilm effects of peptides. This method can assist in addressing the intricate challenge of designing peptides with strong antibiofilm properties, a task that can be both challenging and costly. Methods A positive library, consisting of peptide sequences with antibiofilm activity exceeding 50%, was assembled, along with a negative library containing quorum-sensing peptides. For each peptide sequence, feature vectors were calculated, while considering the primary structure, the order of amino acids, their physicochemical properties, and their distributions. Multiple supervised learning algorithms were used to classify peptides with significant antibiofilm effects for subsequent experimental evaluations. Results The computational approach exhibited high accuracy in predicting the antibiofilm effects of peptides, with accuracy, precision, Matthew's correlation coefficient (MCC), and F1 score of 99%, 99%, 0.97, and 0.99, respectively. The performance level of this computational approach was comparable to that of previous methods. This study introduced a novel approach by combining the feature space with high antibiofilm activity. Conclusions In this study, a reliable and cost-effective method was developed for predicting the antibiofilm effects of peptides using a computational approach. This approach allows for the identification of peptide sequences with substantial antibiofilm activities for further experimental investigations. Accessible source codes and raw data of this study can be found online (hiABF), providing easy access and enabling future updates.
Collapse
Affiliation(s)
- Fatemeh Ebrahimi Tarki
- Department of Biotechnology, Faculty of Biological Sciences, Alzahra University, Tehran, Iran
| | - Mahboobeh Zarrabi
- Department of Biotechnology, Faculty of Biological Sciences, Alzahra University, Tehran, Iran
| | - Ahya Abdiali
- Department of Microbiology, Faculty of Biological Sciences, Alzahra University, Tehran, Iran
| | - Mahkame Sharbatdar
- Department of Mechanical Engineering, Khajeh Nasir Toosi University of Technology, Tehran, Iran
| |
Collapse
|
14
|
Metagenomic insights into taxonomic, functional diversity and inhibitors of microbial biofilms. Microbiol Res 2022; 265:127207. [DOI: 10.1016/j.micres.2022.127207] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Revised: 04/17/2022] [Accepted: 09/18/2022] [Indexed: 11/21/2022]
|
15
|
Anju VT, Busi S, Imchen M, Kumavath R, Mohan MS, Salim SA, Subhaswaraj P, Dyavaiah M. Polymicrobial Infections and Biofilms: Clinical Significance and Eradication Strategies. Antibiotics (Basel) 2022; 11:antibiotics11121731. [PMID: 36551388 PMCID: PMC9774821 DOI: 10.3390/antibiotics11121731] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2022] [Revised: 11/21/2022] [Accepted: 11/24/2022] [Indexed: 12/03/2022] Open
Abstract
Biofilms are population of cells growing in a coordinated manner and exhibiting resistance towards hostile environments. The infections associated with biofilms are difficult to control owing to the chronicity of infections and the emergence of antibiotic resistance. Most microbial infections are contributed by polymicrobial or mixed species interactions, such as those observed in chronic wound infections, otitis media, dental caries, and cystic fibrosis. This review focuses on the polymicrobial interactions among bacterial-bacterial, bacterial-fungal, and fungal-fungal aggregations based on in vitro and in vivo models and different therapeutic interventions available for polymicrobial biofilms. Deciphering the mechanisms of polymicrobial interactions and microbial diversity in chronic infections is very helpful in anti-microbial research. Together, we have discussed the role of metagenomic approaches in studying polymicrobial biofilms. The outstanding progress made in polymicrobial research, especially the model systems and application of metagenomics for detecting, preventing, and controlling infections, are reviewed.
Collapse
Affiliation(s)
- V T Anju
- Department of Biochemistry and Molecular Biology, School of Life Sciences, Pondicherry University, Puducherry 605014, India
| | - Siddhardha Busi
- Department of Microbiology, School of Life Sciences, Pondicherry University, Puducherry 605014, India
- Correspondence:
| | - Madangchanok Imchen
- Department of Microbiology, School of Life Sciences, Pondicherry University, Puducherry 605014, India
| | - Ranjith Kumavath
- Department of Genomic Science, School of Biological Sciences, Central University of Kerala, Kerala 671316, India
- Department of Biotechnology, School of Life Sciences, Pondicherry University, Puducherry 605014, India
| | - Mahima S. Mohan
- Department of Microbiology, School of Life Sciences, Pondicherry University, Puducherry 605014, India
| | - Simi Asma Salim
- Department of Microbiology, School of Life Sciences, Pondicherry University, Puducherry 605014, India
| | - Pattnaik Subhaswaraj
- Department of Biotechnology and Bioinformatics, Sambalpur University, Burla, Sambalpur 768019, India
| | - Madhu Dyavaiah
- Department of Biochemistry and Molecular Biology, School of Life Sciences, Pondicherry University, Puducherry 605014, India
| |
Collapse
|
16
|
Yan W, Tang W, Wang L, Bin Y, Xia J. PrMFTP: Multi-functional therapeutic peptides prediction based on multi-head self-attention mechanism and class weight optimization. PLoS Comput Biol 2022; 18:e1010511. [PMID: 36094961 PMCID: PMC9499272 DOI: 10.1371/journal.pcbi.1010511] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Revised: 09/22/2022] [Accepted: 08/24/2022] [Indexed: 11/18/2022] Open
Abstract
Prediction of therapeutic peptide is a significant step for the discovery of promising therapeutic drugs. Most of the existing studies have focused on the mono-functional therapeutic peptide prediction. However, the number of multi-functional therapeutic peptides (MFTP) is growing rapidly, which requires new computational schemes to be proposed to facilitate MFTP discovery. In this study, based on multi-head self-attention mechanism and class weight optimization algorithm, we propose a novel model called PrMFTP for MFTP prediction. PrMFTP exploits multi-scale convolutional neural network, bi-directional long short-term memory, and multi-head self-attention mechanisms to fully extract and learn informative features of peptide sequence to predict MFTP. In addition, we design a class weight optimization scheme to address the problem of label imbalanced data. Comprehensive evaluation demonstrate that PrMFTP is superior to other state-of-the-art computational methods for predicting MFTP. We provide a user-friendly web server of PrMFTP, which is available at http://bioinfo.ahu.edu.cn/PrMFTP.
Collapse
Affiliation(s)
- Wenhui Yan
- Information Materials and Intelligent Sensing Laboratory of Anhui Province and Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui, China
| | - Wending Tang
- Information Materials and Intelligent Sensing Laboratory of Anhui Province and Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui, China
| | - Lihua Wang
- Information Materials and Intelligent Sensing Laboratory of Anhui Province and Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui, China
| | - Yannan Bin
- Information Materials and Intelligent Sensing Laboratory of Anhui Province and Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui, China
- * E-mail: (YB); (JX)
| | - Junfeng Xia
- Information Materials and Intelligent Sensing Laboratory of Anhui Province and Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, Hefei, Anhui, China
- * E-mail: (YB); (JX)
| |
Collapse
|
17
|
Biofilm- i: A Platform for Predicting Biofilm Inhibitors Using Quantitative Structure-Relationship (QSAR) Based Regression Models to Curb Antibiotic Resistance. MOLECULES (BASEL, SWITZERLAND) 2022; 27:molecules27154861. [PMID: 35956807 PMCID: PMC9369795 DOI: 10.3390/molecules27154861] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 07/16/2022] [Accepted: 07/17/2022] [Indexed: 11/19/2022]
Abstract
Antibiotic drug resistance has emerged as a major public health threat globally. One of the leading causes of drug resistance is the colonization of microorganisms in biofilm mode. Hence, there is an urgent need to design novel and highly effective biofilm inhibitors that can work either synergistically with antibiotics or individually. Therefore, we have developed a recursive regression-based platform “Biofilm-i” employing a quantitative structure–activity relationship approach for making generalized predictions, along with group and species-specific predictions of biofilm inhibition efficiency of chemical(s). The platform encompasses eight predictors, three analysis tools, and data visualization modules. The experimentally validated biofilm inhibitors for model development were retrieved from the “aBiofilm” resource and processed using a 10-fold cross-validation approach using the support vector machine and andom forest machine learning techniques. The data was further sub-divided into training/testing and independent validation sets. From training/testing data sets the Pearson’s correlation coefficient of overall chemicals, Gram-positive bacteria, Gram-negative bacteria, fungus, Pseudomonas aeruginosa, Staphylococcus aureus, Candida albicans, and Escherichia coli was 0.60, 0.77, 0.62, 0.77, 0.73, 0.83, 0.70, and 0.71 respectively via Support Vector Machine. Further, all the QSAR models performed equally well on independent validation data sets. Additionally, we also checked the performance of the random forest machine learning technique for the above datasets. The integrated analysis tools can convert the chemical structure into different formats, search for a similar chemical in the aBiofilm database and design the analogs. Moreover, the data visualization modules check the distribution of experimentally validated biofilm inhibitors according to their common scaffolds. The Biofilm-i platform would be of immense help to researchers engaged in designing highly efficacious biofilm inhibitors for tackling the menace of antibiotic drug resistance.
Collapse
|
18
|
Antimicrobial peptides for tackling cystic fibrosis related bacterial infections: a review. Microbiol Res 2022; 263:127152. [DOI: 10.1016/j.micres.2022.127152] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 07/28/2022] [Accepted: 07/29/2022] [Indexed: 11/18/2022]
|
19
|
Fathi F, Ghobeh M, Tabarzad M. Anti-Microbial Peptides: Strategies of Design and Development and Their Promising Wound-Healing Activities. Mol Biol Rep 2022; 49:9001-9012. [DOI: 10.1007/s11033-022-07405-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2022] [Revised: 03/13/2022] [Accepted: 03/17/2022] [Indexed: 12/30/2022]
|
20
|
Bose B, Downey T, Ramasubramanian AK, Anastasiu DC. Identification of Distinct Characteristics of Antibiofilm Peptides and Prospection of Diverse Sources for Efficacious Sequences. Front Microbiol 2022; 12:783284. [PMID: 35185814 PMCID: PMC8856603 DOI: 10.3389/fmicb.2021.783284] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2021] [Accepted: 12/30/2021] [Indexed: 01/15/2023] Open
Abstract
A majority of microbial infections are associated with biofilms. Targeting biofilms is considered an effective strategy to limit microbial virulence while minimizing the development of antibiotic resistance. Toward this need, antibiofilm peptides are an attractive arsenal since they are bestowed with properties orthogonal to small molecule drugs. In this work, we developed machine learning models to identify the distinguishing characteristics of known antibiofilm peptides, and to mine peptide databases from diverse habitats to classify new peptides with potential antibiofilm activities. Additionally, we used the reported minimum inhibitory/eradication concentration (MBIC/MBEC) of the antibiofilm peptides to create a regression model on top of the classification model to predict the effectiveness of new antibiofilm peptides. We used a positive dataset containing 242 antibiofilm peptides, and a negative dataset which, unlike previous datasets, contains peptides that are likely to promote biofilm formation. Our model achieved a classification accuracy greater than 98% and harmonic mean of precision-recall (F1) and Matthews correlation coefficient (MCC) scores greater than 0.90; the regression model achieved an MCC score greater than 0.81. We utilized our classification-regression pipeline to evaluate 135,015 peptides from diverse sources for potential antibiofilm activity, and we identified 185 candidates that are likely to be effective against preformed biofilms at micromolar concentrations. Structural analysis of the top 37 hits revealed a larger distribution of helices and coils than sheets, and common functional motifs. Sequence alignment of these hits with known antibiofilm peptides revealed that, while some of the hits showed relatively high sequence similarity with known peptides, some others did not indicate the presence of antibiofilm activity in novel sources or sequences. Further, some of the hits had previously recognized therapeutic properties or host defense traits suggestive of drug repurposing applications. Taken together, this work demonstrates a new in silico approach to predicting antibiofilm efficacy, and identifies promising new candidates for biofilm eradication.
Collapse
Affiliation(s)
- Bipasa Bose
- Department of Biomedical Engineering, San Jose State University, San Jose, CA, United States
| | - Taylor Downey
- Department of Computer Science and Engineering, Santa Clara University, Santa Clara, CA, United States
| | - Anand K. Ramasubramanian
- Department of Chemical and Materials Engineering, San Jose State University, San Jose, CA, United States
- *Correspondence: Anand K. Ramasubramanian
| | - David C. Anastasiu
- Department of Computer Science and Engineering, Santa Clara University, Santa Clara, CA, United States
- David C. Anastasiu
| |
Collapse
|
21
|
Shahraki MF, Atanaki FF, Ariaeenejad S, Ghaffari MR, Norouzi‐Beirami MH, Maleki M, Salekdeh GH, Kavousi K. A computational learning paradigm to targeted discovery of biocatalysts from metagenomic data: a case study of lipase identification. Biotechnol Bioeng 2022; 119:1115-1128. [DOI: 10.1002/bit.28037] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Revised: 08/18/2021] [Accepted: 12/01/2021] [Indexed: 11/09/2022]
Affiliation(s)
- Mehdi Foroozandeh Shahraki
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Institute of Biochemistry and Biophysics (IBB), University of Tehran Tehran Iran
| | - Fereshteh Fallah Atanaki
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Institute of Biochemistry and Biophysics (IBB), University of Tehran Tehran Iran
| | - Shohreh Ariaeenejad
- Department of Systems and Synthetic Biology Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research Education and Extension Organization (AREEO) Karaj Iran
| | - Mohammad Reza Ghaffari
- Department of Systems and Synthetic Biology Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research Education and Extension Organization (AREEO) Karaj Iran
| | - Mohammad Hossein Norouzi‐Beirami
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Institute of Biochemistry and Biophysics (IBB), University of Tehran Tehran Iran
- Department of Computer Engineering Osku Branch, Islamic Azad University Osku Iran
| | - Morteza Maleki
- Department of Systems and Synthetic Biology Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research Education and Extension Organization (AREEO) Karaj Iran
| | - Ghasem Hosseini Salekdeh
- Department of Systems and Synthetic Biology Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research Education and Extension Organization (AREEO) Karaj Iran
- Department of Molecular Sciences Macquarie University Sydney NSW Australia
| | - Kaveh Kavousi
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Institute of Biochemistry and Biophysics (IBB), University of Tehran Tehran Iran
| |
Collapse
|
22
|
Abstract
Antibiotic resistance constitutes a global threat and could lead to a future pandemic. One strategy is to develop a new generation of antimicrobials. Naturally occurring antimicrobial peptides (AMPs) are recognized templates and some are already in clinical use. To accelerate the discovery of new antibiotics, it is useful to predict novel AMPs from the sequenced genomes of various organisms. The antimicrobial peptide database (APD) provided the first empirical peptide prediction program. It also facilitated the testing of the first machine-learning algorithms. This chapter provides an overview of machine-learning predictions of AMPs. Most of the predictors, such as AntiBP, CAMP, and iAMPpred, involve a single-label prediction of antimicrobial activity. This type of prediction has been expanded to antifungal, antiviral, antibiofilm, anti-TB, hemolytic, and anti-inflammatory peptides. The multiple functional roles of AMPs annotated in the APD also enabled multi-label predictions (iAMP-2L, MLAMP, and AMAP), which include antibacterial, antiviral, antifungal, antiparasitic, antibiofilm, anticancer, anti-HIV, antimalarial, insecticidal, antioxidant, chemotactic, spermicidal activities, and protease inhibiting activities. Also considered in predictions are peptide posttranslational modification, 3D structure, and microbial species-specific information. We compare important amino acids of AMPs implied from machine learning with the frequently occurring residues of the major classes of natural peptides. Finally, we discuss advances, limitations, and future directions of machine-learning predictions of antimicrobial peptides. Ultimately, we may assemble a pipeline of such predictions beyond antimicrobial activity to accelerate the discovery of novel AMP-based antimicrobials.
Collapse
Affiliation(s)
- Guangshun Wang
- Department of Pathology and Microbiology, College of Medicine, University of Nebraska Medical Center, 985900 Nebraska Medical Center, Omaha, NE 68198-5900, USA;,Corresponding to: Dr. Monique van Hoek: ; Dr. Iosif Vaisman: ; Dr. Guangshun Wang:
| | - Iosif I. Vaisman
- School of Systems Biology, George Mason University, 10920 George Mason Circle, Manassas, VA, 20110, USA.,Corresponding to: Dr. Monique van Hoek: ; Dr. Iosif Vaisman: ; Dr. Guangshun Wang:
| | - Monique L. van Hoek
- School of Systems Biology, George Mason University, 10920 George Mason Circle, Manassas, VA, 20110, USA.,Corresponding to: Dr. Monique van Hoek: ; Dr. Iosif Vaisman: ; Dr. Guangshun Wang:
| |
Collapse
|
23
|
Mhade S, Panse S, Tendulkar G, Awate R, Narasimhan Y, Kadam S, Yennamalli RM, Kaushik KS. AMPing Up the Search: A Structural and Functional Repository of Antimicrobial Peptides for Biofilm Studies, and a Case Study of Its Application to Corynebacterium striatum, an Emerging Pathogen. Front Cell Infect Microbiol 2021; 11:803774. [PMID: 34976872 PMCID: PMC8716830 DOI: 10.3389/fcimb.2021.803774] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Accepted: 11/24/2021] [Indexed: 11/13/2022] Open
Abstract
Antimicrobial peptides (AMPs) have been recognized for their ability to target processes important for biofilm formation. Given the vast array of AMPs, identifying potential anti-biofilm candidates remains a significant challenge, and prompts the need for preliminary in silico investigations prior to extensive in vitro and in vivo studies. We have developed Biofilm-AMP (B-AMP), a curated 3D structural and functional repository of AMPs relevant to biofilm studies. In its current version, B-AMP contains predicted 3D structural models of 5544 AMPs (from the DRAMP database) developed using a suite of molecular modeling tools. The repository supports a user-friendly search, using source, name, DRAMP ID, and PepID (unique to B-AMP). Further, AMPs are annotated to existing biofilm literature, consisting of a vast library of over 10,000 articles, enhancing the functional capabilities of B-AMP. To provide an example of the usability of B-AMP, we use the sortase C biofilm target of the emerging pathogen Corynebacterium striatum as a case study. For this, 100 structural AMP models from B-AMP were subject to in silico protein-peptide molecular docking against the catalytic site residues of the C. striatum sortase C protein. Based on docking scores and interacting residues, we suggest a preference scale using which candidate AMPs could be taken up for further in silico, in vitro and in vivo testing. The 3D protein-peptide interaction models and preference scale are available in B-AMP. B-AMP is a comprehensive structural and functional repository of AMPs, and will serve as a starting point for future studies exploring AMPs for biofilm studies. B-AMP is freely available to the community at https://b-amp.karishmakaushiklab.com and will be regularly updated with AMP structures, interaction models with potential biofilm targets, and annotations to biofilm literature.
Collapse
Affiliation(s)
- Shreeya Mhade
- Department of Bioinformatics, Guru Nanak Khalsa College of Arts, Science and Commerce (Autonomous), Mumbai, India
| | - Stutee Panse
- Huck Institutes of Life Sciences, The Pennsylvania State University, University Park, College State, PA, United States
| | - Gandhar Tendulkar
- Department of Bioinformatics, Sir Sitaram and Lady Shantabai Patkar College of Arts and Science and V P Varde College of Commerce and Economics (Autonomous), Mumbai, India
| | - Rohit Awate
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, United States
| | - Yatindrapravanan Narasimhan
- Department of Bioinformatics, School of Chemical and Biotechnology, Shanmugha Arts, Science, Technology & Research Academy (SASTRA), Deemed to Be University, Thanjavur, India
| | - Snehal Kadam
- Hull York Medical School, University of Hull, Hull, United Kingdom
| | - Ragothaman M. Yennamalli
- Department of Bioinformatics, School of Chemical and Biotechnology, Shanmugha Arts, Science, Technology & Research Academy (SASTRA), Deemed to Be University, Thanjavur, India
| | | |
Collapse
|
24
|
Singh V, Shrivastava S, Kumar Singh S, Kumar A, Saxena S. StaBle-ABPpred: a stacked ensemble predictor based on biLSTM and attention mechanism for accelerated discovery of antibacterial peptides. Brief Bioinform 2021; 23:6423526. [PMID: 34750606 DOI: 10.1093/bib/bbab439] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Revised: 09/06/2021] [Accepted: 09/24/2021] [Indexed: 01/29/2023] Open
Abstract
Due to the rapid emergence of multi-drug resistant (MDR) bacteria, existing antibiotics are becoming ineffective. So, researchers are looking for alternatives in the form of antibacterial peptides (ABPs) based medicines. The discovery of novel ABPs using wet-lab experiments is time-consuming and expensive. Many machine learning models have been proposed to search for new ABPs, but there is still scope to develop a robust model that has high accuracy and precision. In this work, we present StaBle-ABPpred, a stacked ensemble technique-based deep learning classifier that uses bidirectional long-short term memory (biLSTM) and attention mechanism at base-level and an ensemble of random forest, gradient boosting and logistic regression at meta-level to classify peptides as antibacterial or otherwise. The performance of our model has been compared with several state-of-the-art classifiers, and results were subjected to analysis of variance (ANOVA) test and its post hoc analysis, which proves that our model performs better than existing classifiers. Furthermore, a web app has been developed and deployed at https://stable-abppred.anvil.app to identify novel ABPs in protein sequences. Using this app, we identified novel ABPs in all the proteins of the Streptococcus phage T12 genome. These ABPs have shown amino acid similarities with experimentally tested antimicrobial peptides (AMPs) of other organisms. Hence, they could be chemically synthesized and experimentally validated for their activity against different bacteria. The model and app developed in this work can be further utilized to explore the protein diversity for identifying novel ABPs with broad-spectrum activity, especially against MDR bacterial pathogens.
Collapse
Affiliation(s)
- Vishakha Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sameer Shrivastava
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| | - Sanjay Kumar Singh
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Abhinav Kumar
- Department of Computer Science and Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, Uttar Pradesh, India
| | - Sonal Saxena
- Division of Veterinary Biotechnology, ICAR-Indian Veterinary Research Institute, Izatnagar, 243122, Uttar Pradesh, India
| |
Collapse
|
25
|
Nasiri F, Atanaki FF, Behrouzi S, Kavousi K, Bagheri M. CpACpP: In Silico Cell-Penetrating Anticancer Peptide Prediction Using a Novel Bioinformatics Framework. ACS OMEGA 2021; 6:19846-19859. [PMID: 34368571 PMCID: PMC8340416 DOI: 10.1021/acsomega.1c02569] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Accepted: 07/13/2021] [Indexed: 05/12/2023]
Abstract
Cell-penetrating anticancer peptides (Cp-ACPs) are considered promising candidates in solid tumor and hematologic cancer therapies. Current approaches for the design and discovery of Cp-ACPs trust the expensive high-throughput screenings that often give rise to multiple obstacles, including instrumentation adaptation and experimental handling. The application of machine learning (ML) tools developed for peptide activity prediction is importantly of growing interest. In this study, we applied the random forest (RF)-, support vector machine (SVM)-, and eXtreme gradient boosting (XGBoost)-based algorithms to predict the active Cp-ACPs using an experimentally validated data set. The model, CpACpP, was developed on the basis of two independent cell-penetrating peptide (CPP) and anticancer peptide (ACP) subpredictors. Various compositional and physiochemical-based features were combined or selected using the multilayered recursive feature elimination (RFE) method for both data sets. Our results showed that the ACP subclassifiers obtain a mean performance accuracy (ACC) of 0.98 with an area under curve (AUC) ≈ 0.98 vis-à-vis the CPP predictors displaying relevant values of ∼0.94 and ∼0.95 via the hybrid-based features and independent data sets, respectively. Also, the predicting evaluation of Cp-ACPs gave accuracies of ∼0.79 and 0.89 on a series of independent sequences by applying our CPP and ACP classifiers, respectively, which leaves the performance of our predictors better than the earlier reported ACPred, mACPpred, MLCPP, and CPPred-RF. The described consensus-based fusion method additionally reached an AUC of 0.94 for the prediction of Cp-ACP (http://cbb1.ut.ac.ir/CpACpP/Index).
Collapse
Affiliation(s)
- Farid Nasiri
- Peptide
Chemistry Laboratory, Department of Biochemistry, Institute of Biochemistry
and Biophysics (IBB), University of Tehran, Tehran 14176-14335, Iran
| | - Fereshteh Fallah Atanaki
- Laboratory
of Complex Biological Systems and Bioinformatics (CBB), Department
of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Saman Behrouzi
- Laboratory
of Complex Biological Systems and Bioinformatics (CBB), Department
of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Kaveh Kavousi
- Laboratory
of Complex Biological Systems and Bioinformatics (CBB), Department
of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Mojtaba Bagheri
- Peptide
Chemistry Laboratory, Department of Biochemistry, Institute of Biochemistry
and Biophysics (IBB), University of Tehran, Tehran 14176-14335, Iran
| |
Collapse
|
26
|
Therapeutic Potential of Antimicrobial Peptides in Polymicrobial Biofilm-Associated Infections. Int J Mol Sci 2021; 22:ijms22020482. [PMID: 33418930 PMCID: PMC7825036 DOI: 10.3390/ijms22020482] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2020] [Accepted: 01/03/2021] [Indexed: 01/10/2023] Open
Abstract
It is widely recognized that many chronic infections of the human body have a polymicrobial etiology. These include diabetic foot ulcer infections, lung infections in cystic fibrosis patients, periodontitis, otitis, urinary tract infections and even a proportion of systemic infections. The treatment of mixed infections poses serious challenges in the clinic. First, polymicrobial communities of microorganisms often organize themselves as biofilms that are notoriously recalcitrant to antimicrobial therapy and clearance by the host immune system. Secondly, a plethora of interactions among community members may affect the expression of virulence factors and the susceptibility to antimicrobials of individual species in the community. Therefore, new strategies able to target multiple pathogens in mixed populations need to be urgently developed and evaluated. In this regard, antimicrobial or host defense peptides (AMPs) deserve particular attention as they are endowed with many favorable features that may serve to this end. The aim of the present review is to offer a comprehensive and updated overview of studies addressing the therapeutic potential of AMPs in mixed infections, highlighting the opportunities offered by this class of antimicrobials in the fight against polymicrobial infections, but also the limits that may arise in their use for this type of application.
Collapse
|
27
|
Kavousi K, Bagheri M, Behrouzi S, Vafadar S, Atanaki FF, Lotfabadi BT, Ariaeenejad S, Shockravi A, Moosavi-Movahedi AA. IAMPE: NMR-Assisted Computational Prediction of Antimicrobial Peptides. J Chem Inf Model 2020; 60:4691-4701. [DOI: 10.1021/acs.jcim.0c00841] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Kaveh Kavousi
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Mojtaba Bagheri
- Peptide Chemistry Laboratory, Department of Biochemistry, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14335, Iran
| | - Saman Behrouzi
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Safar Vafadar
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Fereshteh Fallah Atanaki
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Bahareh Teimouri Lotfabadi
- Laboratory of Complex Biological Systems and Bioinformatics (CBB), Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Shohreh Ariaeenejad
- Department of Systems and Synthetic Biology, Agricultural Biotechnology Research Institute of Iran (ABRII) Agricultural Research Education and Extension Organization (AREEO), Karaj 14348-87714, Iran
| | - Abbas Shockravi
- Faculty of Chemistry, Kharazmi University, Tehran 14911-15719, Iran
| | - Ali Akbar Moosavi-Movahedi
- Department of Biophysics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| |
Collapse
|