1
|
Beltrán JF, Herrera-Belén L, Parraguez-Contreras F, Farías JG, Machuca-Sepúlveda J, Short S. MultiToxPred 1.0: a novel comprehensive tool for predicting 27 classes of protein toxins using an ensemble machine learning approach. BMC Bioinformatics 2024; 25:148. [PMID: 38609877 PMCID: PMC11010298 DOI: 10.1186/s12859-024-05748-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2023] [Accepted: 03/14/2024] [Indexed: 04/14/2024] Open
Abstract
Protein toxins are defense mechanisms and adaptations found in various organisms and microorganisms, and their use in scientific research as therapeutic candidates is gaining relevance due to their effectiveness and specificity against cellular targets. However, discovering these toxins is time-consuming and expensive. In silico tools, particularly those based on machine learning and deep learning, have emerged as valuable resources to address this challenge. Existing tools primarily focus on binary classification, determining whether a protein is a toxin or not, and occasionally identifying specific types of toxins. For the first time, we propose a novel approach capable of classifying protein toxins into 27 distinct categories based on their mode of action within cells. To accomplish this, we assessed multiple machine learning techniques and found that an ensemble model incorporating the Light Gradient Boosting Machine and Quadratic Discriminant Analysis algorithms exhibited the best performance. During the tenfold cross-validation on the training dataset, our model exhibited notable metrics: 0.840 accuracy, 0.827 F1 score, 0.836 precision, 0.840 sensitivity, and 0.989 AUC. In the testing stage, using an independent dataset, the model achieved 0.846 accuracy, 0.838 F1 score, 0.847 precision, 0.849 sensitivity, and 0.991 AUC. These results present a powerful next-generation tool called MultiToxPred 1.0, accessible through a web application. We believe that MultiToxPred 1.0 has the potential to become an indispensable resource for researchers, facilitating the efficient identification of protein toxins. By leveraging this tool, scientists can accelerate their search for these toxins and advance their understanding of their therapeutic potential.
Collapse
Affiliation(s)
- Jorge F Beltrán
- Department of Chemical Engineering, Faculty of Engineering and Science, Universidad de La Frontera, Ave. Francisco Salazar, 01145, Temuco, Chile.
| | - Lisandra Herrera-Belén
- Departamento de Ciencias Básicas, Facultad de Ciencias, Universidad Santo Tomas, Temuco, Chile
| | - Fernanda Parraguez-Contreras
- Department of Chemical Engineering, Faculty of Engineering and Science, Universidad de La Frontera, Ave. Francisco Salazar, 01145, Temuco, Chile
| | - Jorge G Farías
- Department of Chemical Engineering, Faculty of Engineering and Science, Universidad de La Frontera, Ave. Francisco Salazar, 01145, Temuco, Chile
| | - Jorge Machuca-Sepúlveda
- Department of Chemical Engineering, Faculty of Engineering and Science, Universidad de La Frontera, Ave. Francisco Salazar, 01145, Temuco, Chile
| | - Stefania Short
- Department of Chemical Engineering, Faculty of Engineering and Science, Universidad de La Frontera, Ave. Francisco Salazar, 01145, Temuco, Chile
| |
Collapse
|
2
|
Li J, Lyu B, Bi J, Shan R, Stanley D, Feng Q, Song Q. Partner of neuropeptide bursicon homodimer pburs mediates a novel antimicrobial peptide Ten3LP via Dif/Dorsal2 in Tribolium castaneum. Int J Biol Macromol 2023; 247:125840. [PMID: 37454995 DOI: 10.1016/j.ijbiomac.2023.125840] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 07/12/2023] [Accepted: 07/13/2023] [Indexed: 07/18/2023]
Abstract
Bursicon is a cystine knot family neuropeptide, composed of two subunits, bursicon (burs) and partner of burs (pburs). The subunits can form heterodimers to regulate cuticle tanning and wing maturation and homodimers to signal different biological functions in innate immunity, midgut stem cell proliferation and energy homeostasis, and reproductive physiology in the model insects Drosophila melanogaster or Tribolium castaneum. Here, we report on the role of the pburs homodimer in signaling innate immunity in T. castaneum larvae. Through transcriptome analysis we identified a set of immune-related genes that respond to pburs RNAi. Treating larvae with recombinant-pburs protein led to up-regulation of antimicrobial peptide (AMP) genes in vivo and in vitro. The upregulation of most AMP genes was dependent on the NF-κB transcription factor Relish. Most importantly, we identified a novel AMP, Tenecin 3-like peptide (Ten3LP), regulated by pburs via NF-κB transcription factor Dorsal-related immunity factor (Dif)/Dorsal2, but not Relish. We conducted Ten3LP RNAi, synthesized recombinant Ten3LP protein for microbial inhibition assays and functionally characterized Ten3LP as an AMP specific for fungi and Gram-positive bacteria. We demonstrate that expression of Ten3LP is activated by pburs via the Toll pathway. These findings identify new molecular targets for development of potential antibiotics for treating microbial infections and perhaps for RNAi based pest management technology.
Collapse
Affiliation(s)
- Jingjing Li
- Division of Plant Science and Technology, University of Missouri, Columbia, MO 65211, USA.
| | - Bo Lyu
- Division of Plant Science and Technology, University of Missouri, Columbia, MO 65211, USA.
| | - Jingxiu Bi
- Division of Plant Science and Technology, University of Missouri, Columbia, MO 65211, USA; Institution of Quality Standard and Testing Technology for Agro-product, Shandong Academy of Agricultural Science, Jinan, Shandong 250100, China.
| | - Ruiqi Shan
- Division of Plant Science and Technology, University of Missouri, Columbia, MO 65211, USA.
| | - David Stanley
- Division of Plant Science and Technology, University of Missouri, Columbia, MO 65211, USA; Biological Control of Insect Research Laboratory, United States Department of Agriculture-Agricultural Research Station (USDA/ARS), Columbia, MO 65203, USA.
| | - Qili Feng
- Guangzhou Key Laboratory of Insect Development Regulation and Application Research, Institute of Insect Science and Technology, School of Life Sciences, South China Normal University, Guangzhou 510631, China.
| | - Qisheng Song
- Division of Plant Science and Technology, University of Missouri, Columbia, MO 65211, USA.
| |
Collapse
|
3
|
Abstract
Antibiotic resistance constitutes a global threat and could lead to a future pandemic. One strategy is to develop a new generation of antimicrobials. Naturally occurring antimicrobial peptides (AMPs) are recognized templates and some are already in clinical use. To accelerate the discovery of new antibiotics, it is useful to predict novel AMPs from the sequenced genomes of various organisms. The antimicrobial peptide database (APD) provided the first empirical peptide prediction program. It also facilitated the testing of the first machine-learning algorithms. This chapter provides an overview of machine-learning predictions of AMPs. Most of the predictors, such as AntiBP, CAMP, and iAMPpred, involve a single-label prediction of antimicrobial activity. This type of prediction has been expanded to antifungal, antiviral, antibiofilm, anti-TB, hemolytic, and anti-inflammatory peptides. The multiple functional roles of AMPs annotated in the APD also enabled multi-label predictions (iAMP-2L, MLAMP, and AMAP), which include antibacterial, antiviral, antifungal, antiparasitic, antibiofilm, anticancer, anti-HIV, antimalarial, insecticidal, antioxidant, chemotactic, spermicidal activities, and protease inhibiting activities. Also considered in predictions are peptide posttranslational modification, 3D structure, and microbial species-specific information. We compare important amino acids of AMPs implied from machine learning with the frequently occurring residues of the major classes of natural peptides. Finally, we discuss advances, limitations, and future directions of machine-learning predictions of antimicrobial peptides. Ultimately, we may assemble a pipeline of such predictions beyond antimicrobial activity to accelerate the discovery of novel AMP-based antimicrobials.
Collapse
Affiliation(s)
- Guangshun Wang
- Department of Pathology and Microbiology, College of Medicine, University of Nebraska Medical Center, 985900 Nebraska Medical Center, Omaha, NE 68198-5900, USA;,Corresponding to: Dr. Monique van Hoek: ; Dr. Iosif Vaisman: ; Dr. Guangshun Wang:
| | - Iosif I. Vaisman
- School of Systems Biology, George Mason University, 10920 George Mason Circle, Manassas, VA, 20110, USA.,Corresponding to: Dr. Monique van Hoek: ; Dr. Iosif Vaisman: ; Dr. Guangshun Wang:
| | - Monique L. van Hoek
- School of Systems Biology, George Mason University, 10920 George Mason Circle, Manassas, VA, 20110, USA.,Corresponding to: Dr. Monique van Hoek: ; Dr. Iosif Vaisman: ; Dr. Guangshun Wang:
| |
Collapse
|
4
|
David L, Brata AM, Mogosan C, Pop C, Czako Z, Muresan L, Ismaiel A, Dumitrascu DI, Leucuta DC, Stanculete MF, Iaru I, Popa SL. Artificial Intelligence and Antibiotic Discovery. Antibiotics (Basel) 2021; 10:antibiotics10111376. [PMID: 34827314 PMCID: PMC8614913 DOI: 10.3390/antibiotics10111376] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Revised: 11/01/2021] [Accepted: 11/08/2021] [Indexed: 01/13/2023] Open
Abstract
Over recent decades, a new antibiotic crisis has been unfolding due to a decreased research in this domain, a low return of investment for the companies that developed the drug, a lengthy and difficult research process, a low success rate for candidate molecules, an increased use of antibiotics in farms and an overall inappropriate use of antibiotics. This has led to a series of pathogens developing antibiotic resistance, which poses severe threats to public health systems while also driving up the costs of hospitalization and treatment. Moreover, without proper action and collaboration between academic and health institutions, a catastrophic trend might develop, with the possibility of returning to a pre-antibiotic era. Nevertheless, new emerging AI-based technologies have started to enter the field of antibiotic and drug development, offering a new perspective to an ever-growing problem. Cheaper and faster research can be achieved through algorithms that identify hit compounds, thereby further accelerating the development of new antibiotics, which represents a vital step in solving the current antibiotic crisis. The aim of this review is to provide an extended overview of the current artificial intelligence-based technologies that are used for antibiotic discovery, together with their technological and economic impact on the industrial sector.
Collapse
Affiliation(s)
- Liliana David
- 2nd Medical Department, “Iuliu Hatieganu” University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania; (L.D.); (A.I.); (S.L.P.)
| | - Anca Monica Brata
- Faculty of Environmental Protection, University of Oradea, 410048 Oradea, Romania
- Correspondence:
| | - Cristina Mogosan
- Department of Pharmacology, Physiology and Pathophysiology, Faculty of Pharmacy, “Iuliu Hațieganu” University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania; (C.M.); (C.P.); (I.I.)
| | - Cristina Pop
- Department of Pharmacology, Physiology and Pathophysiology, Faculty of Pharmacy, “Iuliu Hațieganu” University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania; (C.M.); (C.P.); (I.I.)
| | - Zoltan Czako
- Department of Computer Science, Technical University of Cluj-Napoca, 400027 Cluj-Napoca, Romania;
| | - Lucian Muresan
- Department of Cardiology, “Emile Muller” Hospital, 68200 Mulhouse, France;
| | - Abdulrahman Ismaiel
- 2nd Medical Department, “Iuliu Hatieganu” University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania; (L.D.); (A.I.); (S.L.P.)
| | - Dinu Iuliu Dumitrascu
- Department of Anatomy, “Iuliu Hatieganu” University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania;
| | - Daniel Corneliu Leucuta
- Department of Medical Informatics and Biostatistics, “Iuliu Hatieganu” University of Medicine and Pharmacy, 400349 Cluj-Napoca, Romania;
| | - Mihaela Fadygas Stanculete
- Department of Neurosciences, Discipline of Psychiatry and Pediatric Psychiatry, “Iuliu Hatieganu“ University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania;
| | - Irina Iaru
- Department of Pharmacology, Physiology and Pathophysiology, Faculty of Pharmacy, “Iuliu Hațieganu” University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania; (C.M.); (C.P.); (I.I.)
| | - Stefan Lucian Popa
- 2nd Medical Department, “Iuliu Hatieganu” University of Medicine and Pharmacy, 400000 Cluj-Napoca, Romania; (L.D.); (A.I.); (S.L.P.)
| |
Collapse
|
5
|
Bin Hafeez A, Jiang X, Bergen PJ, Zhu Y. Antimicrobial Peptides: An Update on Classifications and Databases. Int J Mol Sci 2021; 22:11691. [PMID: 34769122 PMCID: PMC8583803 DOI: 10.3390/ijms222111691] [Citation(s) in RCA: 111] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 10/24/2021] [Accepted: 10/25/2021] [Indexed: 02/06/2023] Open
Abstract
Antimicrobial peptides (AMPs) are distributed across all kingdoms of life and are an indispensable component of host defenses. They consist of predominantly short cationic peptides with a wide variety of structures and targets. Given the ever-emerging resistance of various pathogens to existing antimicrobial therapies, AMPs have recently attracted extensive interest as potential therapeutic agents. As the discovery of new AMPs has increased, many databases specializing in AMPs have been developed to collect both fundamental and pharmacological information. In this review, we summarize the sources, structures, modes of action, and classifications of AMPs. Additionally, we examine current AMP databases, compare valuable computational tools used to predict antimicrobial activity and mechanisms of action, and highlight new machine learning approaches that can be employed to improve AMP activity to combat global antimicrobial resistance.
Collapse
Affiliation(s)
- Ahmer Bin Hafeez
- Centre of Biotechnology and Microbiology, University of Peshawar, Peshawar 25120, Pakistan;
| | - Xukai Jiang
- Infection and Immunity Program, Department of Microbiology, Biomedicine Discovery Institute, Monash University, Clayton, VIC 3800, Australia; (X.J.); (P.J.B.)
- National Glycoengineering Research Center, Shandong University, Qingdao 266237, China
| | - Phillip J. Bergen
- Infection and Immunity Program, Department of Microbiology, Biomedicine Discovery Institute, Monash University, Clayton, VIC 3800, Australia; (X.J.); (P.J.B.)
| | - Yan Zhu
- Infection and Immunity Program, Department of Microbiology, Biomedicine Discovery Institute, Monash University, Clayton, VIC 3800, Australia; (X.J.); (P.J.B.)
| |
Collapse
|
6
|
Liu T, Chen J, Zhang Q, Hippe K, Hunt C, Le T, Cao R, Tang H. The Development of Machine Learning Methods in discriminating Secretory Proteins of Malaria Parasite. Curr Med Chem 2021; 29:807-821. [PMID: 34636289 DOI: 10.2174/0929867328666211005140625] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Revised: 07/28/2021] [Accepted: 08/15/2021] [Indexed: 11/22/2022]
Abstract
Malaria caused by Plasmodium falciparum is one of the major infectious diseases in the world. It is essential to exploit an effective method to predict secretory proteins of malaria parasites to develop effective cures and treatment. Biochemical assays can provide details for accurate identification of the secretory proteins, but these methods are expensive and time-consuming. In this paper, we summarized the machine learning-based identification algorithms and compared the construction strategies between different computational methods. Also, we discussed the use of machine learning to improve the ability of algorithms to identify proteins secreted by malaria parasites.
Collapse
Affiliation(s)
- Ting Liu
- School of Basic Medical Sciences, Southwest Medical University, Luzhou. China
| | - Jiamao Chen
- School of Basic Medical Sciences, Southwest Medical University, Luzhou. China
| | - Qian Zhang
- School of Basic Medical Sciences, Southwest Medical University, Luzhou. China
| | - Kyle Hippe
- Department of Computer Science, Pacific Lutheran University. United States
| | - Cassandra Hunt
- Department of Computer Science, Pacific Lutheran University. United States
| | - Thu Le
- Department of Computer Science, Pacific Lutheran University. United States
| | - Renzhi Cao
- Department of Computer Science, Pacific Lutheran University. United States
| | - Hua Tang
- School of Basic Medical Sciences, Southwest Medical University, Luzhou. China
| |
Collapse
|
7
|
Herrera-Bravo J, Herrera Belén L, Farias JG, Beltrán JF. TAP 1.0: A robust immunoinformatic tool for the prediction of tumor T-cell antigens based on AAindex properties. Comput Biol Chem 2021; 91:107452. [PMID: 33592504 DOI: 10.1016/j.compbiolchem.2021.107452] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 02/04/2021] [Accepted: 02/04/2021] [Indexed: 01/28/2023]
Abstract
Immunotherapy is a research area with great potential in drug discovery for cancer treatment. Because of the capacity of tumor antigens to activate the immune response and promote the destruction of tumor cells, they are considered excellent immunotherapeutic drugs. In this work, we evaluated fifteen machine learning algorithms for the classification of tumor antigens. For this purpose, we build robust datasets, carefully selected from the TANTIGEN and IEDB databases. The feature computation of all antigens in this study was performed by developing a script written in Python 3.8, which allowed the calculation of 544 physicochemical and biochemical properties extracted from the AAindex database. All classifiers were subjected to the training, 10-fold cross-validation, and testing on an independent dataset. The results of this study showed that the quadratic discriminant classifier presented the best performance measures over the independent dataset, accuracy = 0.7384, AUC = 0.817, recall = 0.676, precision = 0.7857, F1 = 0.713, kappa = 0.4764, and Matthews correlation coefficient = 0.4834, outperforming common machine learning classifiers used in the bioinformatics area. We believe that our prediction model could be of great importance in the field of cancer immunotherapy for the search of potential tumor antigens. Taking all aspects mentioned before, we developed an immunoinformatic tool called TAP 1.0 with a friendly interface for tumor antigens prediction, available at https://tapredictor.herokuapp.com/.
Collapse
Affiliation(s)
- Jesús Herrera-Bravo
- Departamento de Ciencias Básicas, Facultad de Ciencias, Universidad Santo Tomas, Chile; Center of Molecular Biology and Pharmacogenetics, Scientific and Technological Bioresource Nucleus, Universidad de La Frontera, Chile
| | - Lisandra Herrera Belén
- Universidad de La Frontera, Department of Chemical Engineering, Faculty of Engineering and Science, Ave. Francisco Salazar 01145, Temuco, Chile
| | - Jorge G Farias
- Universidad de La Frontera, Department of Chemical Engineering, Faculty of Engineering and Science, Ave. Francisco Salazar 01145, Temuco, Chile
| | - Jorge F Beltrán
- Universidad de La Frontera, Department of Chemical Engineering, Faculty of Engineering and Science, Ave. Francisco Salazar 01145, Temuco, Chile.
| |
Collapse
|
8
|
Li H, Nantasenamat C. Toward insights on determining factors for high activity in antimicrobial peptides via machine learning. PeerJ 2019; 7:e8265. [PMID: 31875156 PMCID: PMC6927346 DOI: 10.7717/peerj.8265] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Accepted: 11/21/2019] [Indexed: 01/02/2023] Open
Abstract
The continued and general rise of antibiotic resistance in pathogenic microbes is a well-recognized global threat. Host defense peptides (HDPs), a component of the innate immune system have demonstrated promising potential to become a next generation antibiotic effective against a plethora of pathogens. While the effectiveness of antimicrobial HDPs has been extensively demonstrated in experimental studies, theoretical insights on the mechanism by which these peptides function is comparably limited. In particular, experimental studies of AMP mechanisms are limited in the number of different peptides investigated and the type of peptide parameters considered. This study makes use of the random forest algorithm for classifying the antimicrobial activity as well for identifying molecular descriptors underpinning the antimicrobial activity of investigated peptides. Subsequent manual interpretation of the identified important descriptors revealed that polarity-solubility are necessary for the membrane lytic antimicrobial activity of HDPs.
Collapse
Affiliation(s)
- Hao Li
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| | - Chanin Nantasenamat
- Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, Bangkok, Thailand
| |
Collapse
|