1
|
Imre A, Balogh B, Mándity I. GraphCPP: The new state-of-the-art method for cell-penetrating peptide prediction via graph neural networks. Br J Pharmacol 2025; 182:495-509. [PMID: 39568115 DOI: 10.1111/bph.17388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Revised: 08/07/2024] [Accepted: 10/07/2024] [Indexed: 11/22/2024] Open
Abstract
BACKGROUND AND PURPOSE Cell-penetrating peptides (CPPs) are short amino acid sequences that can penetrate cell membranes and deliver molecules into cells. Several models have been developed for their discovery, yet these models often face challenges in accurately predicting membrane penetration due to the complex nature of peptide-cell interactions. Hence, there is a need for innovative approaches that can enhance predictive performance. EXPERIMENTAL APPROACH In this study, we present the application GraphCPP, a novel graph neural network (GNN) for the prediction of membrane penetration capability of peptides. KEY RESULTS A new comprehensive dataset-dubbed CPP1708-was constructed resulting in the largest reliable database of CPPs to date. Comparative analyses with previous methods, such as MLCPP2, C2Pred, CellPPD and CellPPD-Mod, demonstrated the superior predictive performance of our model. Upon testing against other published methods, GraphCPP performs exceptionally, achieving 0.5787 Matthews correlation coefficient and 0.8459 area under the curve (AUC) values on one dataset. This means a 92.8% and 23.3% improvement in Matthews correlation coefficient and AUC measures respectively compared with the next best model. The capability of the model to effectively learn peptide representations was demonstrated through t-distributed stochastic neighbour embedding plots. Additionally, the uncertainty analysis revealed that GraphCPP maintains high confidence in predictions for peptides shorter than 40 amino acids. The source code is available at https://github.com/attilaimre99/GraphCPP. CONCLUSION AND IMPLICATIONS These findings indicate the potential of GNN-based models to improve CPP penetration prediction and it may contribute towards the development of more efficient drug delivery systems.
Collapse
Affiliation(s)
- Attila Imre
- Department of Organic Chemistry, Faculty of Pharmacy, Semmelweis University, Budapest, Hungary
- Center for Health Technology Assessment, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
| | - Balázs Balogh
- Department of Organic Chemistry, Faculty of Pharmacy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
| | - István Mándity
- Department of Organic Chemistry, Faculty of Pharmacy, Semmelweis University, Budapest, Hungary
- Center for Pharmacology and Drug Research & Development, Semmelweis University, Budapest, Hungary
- Artificial Transporters Research Group, Research Centre for Natural Sciences, Budapest, Hungary
| |
Collapse
|
2
|
Kumar N, Du Z, Li Y. pLM4CPPs: Protein Language Model-Based Predictor for Cell Penetrating Peptides. J Chem Inf Model 2025. [PMID: 39878455 DOI: 10.1021/acs.jcim.4c01338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2025]
Abstract
Cell-penetrating peptides (CPPs) are short peptides capable of penetrating cell membranes, making them valuable for drug delivery and intracellular targeting. Accurate prediction of CPPs can streamline experimental validation in the lab. This study aims to assess pretrained protein language models (pLMs) for their effectiveness in representing CPPs and develop a reliable model for CPP classification. We evaluated peptide embeddings generated from BEPLER, CPCProt, SeqVec, various ESM variants (ESM, ESM-2 with expanded feature set, ESM-1b, and ESM-1v), ProtT5-XL UniRef50, ProtT5-XL BFD, and ProtBERT. We developed pLM4CCPs, a novel deep learning architecture using convolutional neural networks (CNNs) as the classifier for binary classification of CPPs. pLM4CCPs demonstrated superior performance over existing state-of-the-art CPP prediction models, achieving improvements in accuracy (ACC) by 4.9-5.5%, Matthews correlation coefficient (MCC) by 9.3-10.2%, and sensitivity (Sn) by 14.1-19.6%. Among all the tested models, ESM-1280 and ProtT5-XL BFD demonstrated the highest overall performance on the kelm data set. ESM-1280 achieved an ACC of 0.896, an MCC of 0.796, a Sn of 0.844, and a specificity (Sp) of 0.978. ProtT5-XL BFD exhibited superior performance with an ACC of 0.901, an MCC of 0.802, an Sn of 0.885, and an Sp of 0.917. pLM4CCPs combine predictions from multiple models to provide a consensus on whether a given peptide sequence is classified as a CPP or non-CPP. This approach will enhance prediction reliability by leveraging the strengths of each individual model. A user-friendly web server for bioactivity predictions, along with data sets, is available at https://ry2acnp6ep.us-east-1.awsapprunner.com. The source code and protocol for adapting pLM4CPPs can be accessed on GitHub at https://github.com/drkumarnandan/pLM4CPPs. This platform aims to advance CPP prediction and peptide functionality modeling, aiding researchers in exploring peptide functionality effectively.
Collapse
Affiliation(s)
- Nandan Kumar
- Department of Grain Science and Industry, Kansas State University, Manhattan, Kansas 66506, United States
| | - Zhenjiao Du
- Department of Grain Science and Industry, Kansas State University, Manhattan, Kansas 66506, United States
| | - Yonghui Li
- Department of Grain Science and Industry, Kansas State University, Manhattan, Kansas 66506, United States
| |
Collapse
|
3
|
Ramasundaram M, Sohn H, Madhavan T. A bird's-eye view of the biological mechanism and machine learning prediction approaches for cell-penetrating peptides. Front Artif Intell 2025; 7:1497307. [PMID: 39839972 PMCID: PMC11747587 DOI: 10.3389/frai.2024.1497307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2024] [Accepted: 12/13/2024] [Indexed: 01/23/2025] Open
Abstract
Cell-penetrating peptides (CPPs) are highly effective at passing through eukaryotic membranes with various cargo molecules, like drugs, proteins, nucleic acids, and nanoparticles, without causing significant harm. Creating drug delivery systems with CPP is associated with cancer, genetic disorders, and diabetes due to their unique chemical properties. Wet lab experiments in drug discovery methodologies are time-consuming and expensive. Machine learning (ML) techniques can enhance and accelerate the drug discovery process with accurate and intricate data quality. ML classifiers, such as support vector machine (SVM), random forest (RF), gradient-boosted decision trees (GBDT), and different types of artificial neural networks (ANN), are commonly used for CPP prediction with cross-validation performance evaluation. Functional CPP prediction is improved by using these ML strategies by using CPP datasets produced by high-throughput sequencing and computational methods. This review focuses on several ML-based CPP prediction tools. We discussed the CPP mechanism to understand the basic functioning of CPPs through cells. A comparative analysis of diverse CPP prediction methods was conducted based on their algorithms, dataset size, feature encoding, software utilities, assessment metrics, and prediction scores. The performance of the CPP prediction was evaluated based on accuracy, sensitivity, specificity, and Matthews correlation coefficient (MCC) on independent datasets. In conclusion, this review will encourage the use of ML algorithms for finding effective CPPs, which will have a positive impact on future research on drug delivery and therapeutics.
Collapse
Affiliation(s)
- Maduravani Ramasundaram
- Department of Genetic Engineering, Computational Biology Lab, School of Bioengineering, SRM Institute of Science and Technology, SRM Nagar, Chennai, India
| | - Honglae Sohn
- Department of Chemistry and Department of Carbon Materials, Chosun University, Gwangju, Republic of Korea
| | - Thirumurthy Madhavan
- Department of Genetic Engineering, Computational Biology Lab, School of Bioengineering, SRM Institute of Science and Technology, SRM Nagar, Chennai, India
| |
Collapse
|
4
|
Moreno-Vargas LM, Prada-Gracia D. Exploring the Chemical Features and Biomedical Relevance of Cell-Penetrating Peptides. Int J Mol Sci 2024; 26:59. [PMID: 39795918 PMCID: PMC11720145 DOI: 10.3390/ijms26010059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2024] [Revised: 11/27/2024] [Accepted: 11/28/2024] [Indexed: 01/13/2025] Open
Abstract
Cell-penetrating peptides (CPPs) are a diverse group of peptides, typically composed of 4 to 40 amino acids, known for their unique ability to transport a wide range of substances-such as small molecules, plasmid DNA, small interfering RNA, proteins, viruses, and nanoparticles-across cellular membranes while preserving the integrity of the cargo. CPPs exhibit passive and non-selective behavior, often requiring functionalization or chemical modification to enhance their specificity and efficacy. The precise mechanisms governing the cellular uptake of CPPs remain ambiguous; however, electrostatic interactions between positively charged amino acids and negatively charged glycosaminoglycans on the membrane, particularly heparan sulfate proteoglycans, are considered the initial crucial step for CPP uptake. Clinical trials have highlighted the potential of CPPs in diagnosing and treating various diseases, including cancer, central nervous system disorders, eye disorders, and diabetes. This review provides a comprehensive overview of CPP classifications, potential applications, transduction mechanisms, and the most relevant algorithms to improve the accuracy and reliability of predictions in CPP development.
Collapse
|
5
|
Zhu L, Chen Z, Yang S. EnDM-CPP: A Multi-view Explainable Framework Based on Deep Learning and Machine Learning for Identifying Cell-Penetrating Peptides with Transformers and Analyzing Sequence Information. Interdiscip Sci 2024:10.1007/s12539-024-00673-4. [PMID: 39714579 DOI: 10.1007/s12539-024-00673-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2024] [Revised: 10/28/2024] [Accepted: 11/01/2024] [Indexed: 12/24/2024]
Abstract
Cell-Penetrating Peptides (CPPs) are a crucial carrier for drug delivery. Since the process of synthesizing new CPPs in the laboratory is both time- and resource-consuming, computational methods to predict potential CPPs can be used to find CPPs to enhance the development of CPPs in therapy. In this study, EnDM-CPP is proposed, which combines machine learning algorithms (SVM and CatBoost) with convolutional neural networks (CNN and TextCNN). For dataset construction, three previous CPP benchmark datasets, including CPPsite 2.0, MLCPP 2.0, and CPP924, are merged to improve the diversity and reduce homology. For feature generation, two language model-based features obtained from the Transformer architecture, including ProtT5 and ESM-2, are employed in CNN and TextCNN. Additionally, sequence features, such as CPRS, Hybrid PseAAC, KSC, etc., are input to SVM and CatBoost. Based on the result of each predictor, Logistic Regression (LR) is built to predict the final decision. The experiment results indicate that ProtT5 and ESM-2 fusion features significantly contribute to predicting CPP and that combining employed features and models demonstrates better association. On an independent test dataset comparison, EnDM-CPP achieved an accuracy of 0.9495 and a Matthews correlation coefficient of 0.9008 with an improvement of 2.23%-9.48% and 4.32%-19.02%, respectively, compared with other state-of-the-art methods. Code and data are available at https://github.com/tudou1231/EnDM-CPP.git .
Collapse
Affiliation(s)
- Lun Zhu
- School of Computer Science and Artificial Intelligence, Aliyun School of Big Data, School of Software, Changzhou University, Changzhou, 213164, China
| | - Zehua Chen
- School of Computer Science and Artificial Intelligence, Aliyun School of Big Data, School of Software, Changzhou University, Changzhou, 213164, China
| | - Sen Yang
- School of Computer Science and Artificial Intelligence, Aliyun School of Big Data, School of Software, Changzhou University, Changzhou, 213164, China.
- The Affiliated Changzhou No. 2 People's Hospital of Nanjing Medical University, Changzhou, 213164, China.
| |
Collapse
|
6
|
Weckbecker M, Anžel A, Yang Z, Hattab G. Interpretable molecular encodings and representations for machine learning tasks. Comput Struct Biotechnol J 2024; 23:2326-2336. [PMID: 38867722 PMCID: PMC11167246 DOI: 10.1016/j.csbj.2024.05.035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2024] [Revised: 05/13/2024] [Accepted: 05/19/2024] [Indexed: 06/14/2024] Open
Abstract
Molecular encodings and their usage in machine learning models have demonstrated significant breakthroughs in biomedical applications, particularly in the classification of peptides and proteins. To this end, we propose a new encoding method: Interpretable Carbon-based Array of Neighborhoods (iCAN). Designed to address machine learning models' need for more structured and less flexible input, it captures the neighborhoods of carbon atoms in a counting array and improves the utility of the resulting encodings for machine learning models. The iCAN method provides interpretable molecular encodings and representations, enabling the comparison of molecular neighborhoods, identification of repeating patterns, and visualization of relevance heat maps for a given data set. When reproducing a large biomedical peptide classification study, it outperforms its predecessor encoding. When extended to proteins, it outperforms a lead structure-based encoding on 71% of the data sets. Our method offers interpretable encodings that can be applied to all organic molecules, including exotic amino acids, cyclic peptides, and larger proteins, making it highly versatile across various domains and data sets. This work establishes a promising new direction for machine learning in peptide and protein classification in biomedicine and healthcare, potentially accelerating advances in drug discovery and disease diagnosis.
Collapse
Affiliation(s)
- Moritz Weckbecker
- Center for Artificial Intelligence in Public Health Research, (ZKI-PH), Robert Koch Institute, Nordufer 20, Berlin, 13353, Berlin, Germany
| | - Aleksandar Anžel
- Center for Artificial Intelligence in Public Health Research, (ZKI-PH), Robert Koch Institute, Nordufer 20, Berlin, 13353, Berlin, Germany
| | - Zewen Yang
- Center for Artificial Intelligence in Public Health Research, (ZKI-PH), Robert Koch Institute, Nordufer 20, Berlin, 13353, Berlin, Germany
| | - Georges Hattab
- Center for Artificial Intelligence in Public Health Research, (ZKI-PH), Robert Koch Institute, Nordufer 20, Berlin, 13353, Berlin, Germany
- Department of Mathematics and Computer science Freie Universität, Arnimallee 14, Berlin, 14195, Berlin, Germany
| |
Collapse
|
7
|
Li Y, Li XM, Wei LS, Ye JF. Advancements in mitochondrial-targeted nanotherapeutics: overcoming biological obstacles and optimizing drug delivery. Front Immunol 2024; 15:1451989. [PMID: 39483479 PMCID: PMC11524880 DOI: 10.3389/fimmu.2024.1451989] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2024] [Accepted: 09/19/2024] [Indexed: 11/03/2024] Open
Abstract
In recent decades, nanotechnology has significantly advanced drug delivery systems, particularly in targeting subcellular organelles, thus opening new avenues for disease treatment. Mitochondria, critical for cellular energy and health, when dysfunctional, contribute to cancer, neurodegenerative diseases, and metabolic disorders. This has propelled the development of nanomedicines aimed at precise mitochondrial targeting to modulate their function, marking a research hotspot. This review delves into the recent advancements in mitochondrial-targeted nanotherapeutics, with a comprehensive focus on targeting strategies, nanocarrier designs, and their therapeutic applications. It emphasizes nanotechnology's role in enhancing drug delivery by overcoming biological barriers and optimizing drug design for specific mitochondrial targeting. Strategies exploiting mitochondrial membrane potential differences and specific targeting ligands improve the delivery and mitochondrial accumulation of nanomedicines. The use of diverse nanocarriers, including liposomes, polymer nanoparticles, and inorganic nanoparticles, tailored for effective mitochondrial targeting, shows promise in anti-tumor and neurodegenerative treatments. The review addresses the challenges and future directions in mitochondrial targeting nanotherapy, highlighting the need for precision, reduced toxicity, and clinical validation. Mitochondrial targeting nanotherapy stands at the forefront of therapeutic strategies, offering innovative treatment perspectives. Ongoing innovation and research are crucial for developing more precise and effective treatment modalities.
Collapse
Affiliation(s)
- Yang Li
- General Surgery Center, First Hospital of Jilin University, Changchun, China
- Department of Rehabilitation, School of Nursing, Jilin University, Changchun, China
| | - Xiao-meng Li
- Department of Rehabilitation, School of Nursing, Jilin University, Changchun, China
| | - Li-si Wei
- Department of Rehabilitation, School of Nursing, Jilin University, Changchun, China
| | - Jun-feng Ye
- General Surgery Center, First Hospital of Jilin University, Changchun, China
| |
Collapse
|
8
|
Li H, Meng J, Wang Z, Luan Y. misORFPred: A Novel Method to Mine Translatable sORFs in Plant Pri-miRNAs Using Enhanced Scalable k-mer and Dynamic Ensemble Voting Strategy. Interdiscip Sci 2024:10.1007/s12539-024-00661-8. [PMID: 39397199 DOI: 10.1007/s12539-024-00661-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Revised: 09/18/2024] [Accepted: 09/22/2024] [Indexed: 10/15/2024]
Abstract
The primary microRNAs (pri-miRNAs) have been observed to contain translatable small open reading frames (sORFs) that can encode peptides as an independent element. Relevant studies have proven that those of sORFs are of significance in regulating the expression of biological traits. The existing methods for predicting the coding potential of sORFs frequently overlook this data or categorize them as negative samples, impeding the identification of additional translatable sORFs in pri-miRNAs. In light of this, a novel method named misORFPred has been proposed. Specifically, an enhanced scalable k-mer (ESKmer) that simultaneously integrates the composition information within a sequence and distance information between sequences is designed to extract the nucleotide sequence features. After feature selection, the optimal features and several machine learning classifiers are combined to construct the ensemble model, where a newly devised dynamic ensemble voting strategy (DEVS) is proposed to dynamically adjust the weights of base classifiers and adaptively select the optimal base classifiers for each unlabeled sample. Cross-validation results suggest that ESKmer and DEVS are essential for this classification task and could boost model performance. Independent testing results indicate that misORFPred outperforms the state-of-the-art methods. Furthermore, we execute misORFPerd on the genomes of various plant species and perform a thorough analysis of the predicted outcomes. Taken together, misORFPred is a powerful tool for identifying the translatable sORFs in plant pri-miRNAs and can provide highly trusted candidates for subsequent biological experiments.
Collapse
Affiliation(s)
- Haibin Li
- School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, China
| | - Jun Meng
- School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, China.
| | - Zhaowei Wang
- School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, China
| | - Yushi Luan
- School of Bioengineering, Dalian University of Technology, Dalian, 116024, China.
| |
Collapse
|
9
|
Ma H, Zhou X, Zhang Z, Weng Z, Li G, Zhou Y, Yao Y. AI-Driven Design of Cell-Penetrating Peptides for Therapeutic Biotechnology. Int J Pept Res Ther 2024; 30:69. [DOI: 10.1007/s10989-024-10654-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/22/2024] [Indexed: 01/05/2025]
|
10
|
Uddin I, Awan HH, Khalid M, Khan S, Akbar S, Sarker MR, Abdolrasol MGM, Alghamdi TAH. A hybrid residue based sequential encoding mechanism with XGBoost improved ensemble model for identifying 5-hydroxymethylcytosine modifications. Sci Rep 2024; 14:20819. [PMID: 39242695 PMCID: PMC11379919 DOI: 10.1038/s41598-024-71568-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Accepted: 08/29/2024] [Indexed: 09/09/2024] Open
Abstract
RNA modifications play an important role in actively controlling recently created formation in cellular regulation mechanisms, which link them to gene expression and protein. The RNA modifications have numerous alterations, presenting broad glimpses of RNA's operations and character. The modification process by the TET enzyme oxidation is the crucial change associated with cytosine hydroxymethylation. The effect of CR is an alteration in specific biochemical ways of the organism, such as gene expression and epigenetic alterations. Traditional laboratory systems that identify 5-hydroxymethylcytosine (5hmC) samples are expensive and time-consuming compared to other methods. To address this challenge, the paper proposed XGB5hmC, a machine learning algorithm based on a robust gradient boosting algorithm (XGBoost), with different residue based formulation methods to identify 5hmC samples. Their results were amalgamated, and six different frequency residue based encoding features were fused to form a hybrid vector in order to enhance model discrimination capabilities. In addition, the proposed model incorporates SHAP (Shapley Additive Explanations) based feature selection to demonstrate model interpretability by highlighting the high contributory features. Among the applied machine learning algorithms, the XGBoost ensemble model using the tenfold cross-validation test achieved improved results than existing state-of-the-art models. Our model reported an accuracy of 89.97%, sensitivity of 87.78%, specificity of 94.45%, F1-score of 0.8934%, and MCC of 0.8764%. This study highlights the potential to provide valuable insights for enhancing medical assessment and treatment protocols, representing a significant advancement in RNA modification analysis.
Collapse
Affiliation(s)
- Islam Uddin
- Department of Computer Science, Abdul Wali Khan University, Mardan, Pakistan
| | - Hamid Hussain Awan
- Department of Computer Science, Muslim Youth University, Islamabad, Pakistan
| | - Majdi Khalid
- Department of Computer Science and Artificial Intelligence, College of Computing, Umm Al-Qura University, Makkah, 21955, Saudi Arabia
| | - Salman Khan
- Department of Computer Science, Abdul Wali Khan University, Mardan, Pakistan
| | - Shahid Akbar
- Department of Computer Science, Abdul Wali Khan University, Mardan, Pakistan.
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, 610054, China.
| | - Mahidur R Sarker
- Institute of Visual Informatics, Universiti Kebangsaan Malaysia, Bangi, 43600, Selangor, Malaysia
- Universidad de Diseño, Innovación y Tecnología, UDIT, Av. Alfonso XIII, 97, 28016, Madrid, Spain
| | - Maher G M Abdolrasol
- Institute of Sustainable Energy, Universiti Tenaga Nasional, Kajang, 43000, Malaysia
| | - Thamer A H Alghamdi
- Wolfson Centre for Magnetics, School of Engineering, Cardiff University, Cardiff, CF24 3AA, UK.
- Electrical Engineering Department, Faculty of Engineering, Al-Baha University, Al-Baha, 65779, Saudi Arabia.
| |
Collapse
|
11
|
Cherene MB, Taveira GB, Almeida-Silva F, da Silva MS, Cavaco MC, da Silva-Ferreira AT, Perales JEA, de Oliveira Carvalho A, Venâncio TM, da Motta OV, Rodrigues R, Castanho MARB, Gomes VM. Structural and Biochemical Characterization of Three Antimicrobial Peptides from Capsicum annuum L. var. annuum Leaves for Anti-Candida Use. Probiotics Antimicrob Proteins 2024; 16:1270-1287. [PMID: 37365421 DOI: 10.1007/s12602-023-10112-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/14/2023] [Indexed: 06/28/2023]
Abstract
The emergence of resistant microorganisms has reduced the effectiveness of currently available antimicrobials, necessitating the development of new strategies. Plant antimicrobial peptides (AMPs) are promising candidates for novel drug development. In this study, we aimed to isolate, characterize, and evaluate the antimicrobial activities of AMPs isolated from Capsicum annuum. The antifungal potential was tested against Candida species. Three AMPs from C. annuum leaves were isolated and characterized: a protease inhibitor, a defensin-like protein, and a lipid transporter protein, respectively named CaCPin-II, CaCDef-like, and CaCLTP2. All three peptides had a molecular mass between 3.5 and 6.5 kDa and caused morphological and physiological changes in four different species of the genus Candida, such as pseudohyphae formation, cell swelling and agglutination, growth inhibition, reduced cell viability, oxidative stress, membrane permeabilization, and metacaspase activation. Except for CaCPin-II, the peptides showed low or no hemolytic activity at the concentrations used in the yeast assays. CaCPin-II inhibited α-amylase activity. Together, these results suggest that these peptides have the potential as antimicrobial agents against species of the genus Candida and can serve as scaffolds for the development of synthetic peptides for this purpose.
Collapse
Affiliation(s)
- Milena Bellei Cherene
- Laboratório de Fisiologia e Bioquímica de Microrganismos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
| | - Gabriel Bonan Taveira
- Laboratório de Fisiologia e Bioquímica de Microrganismos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
| | - Fabricio Almeida-Silva
- Laboratório de Química e Função de Proteínas e Peptídeos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
| | - Marciele Souza da Silva
- Laboratório de Fisiologia e Bioquímica de Microrganismos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
| | - Marco Calvinho Cavaco
- Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina da Universidade de Lisboa, Lisbon, Portugal
| | | | | | - André de Oliveira Carvalho
- Laboratório de Fisiologia e Bioquímica de Microrganismos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
| | - Thiago Motta Venâncio
- Laboratório de Química e Função de Proteínas e Peptídeos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
| | - Olney Vieira da Motta
- Laboratório de Sanidade Animal, Centro de Ciências e Tecnologias Agropecuárias, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
| | - Rosana Rodrigues
- Laboratório de Melhoramento e Genética Vegetal, Centro de Ciências e Tecnologias Agropecuárias, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
| | | | - Valdirene Moreira Gomes
- Laboratório de Fisiologia e Bioquímica de Microrganismos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil.
| |
Collapse
|
12
|
Zhao F, Qiu J, Xiang D, Jiao P, Cao Y, Xu Q, Qiao D, Xu H, Cao Y. deepAMPNet: a novel antimicrobial peptide predictor employing AlphaFold2 predicted structures and a bi-directional long short-term memory protein language model. PeerJ 2024; 12:e17729. [PMID: 39040937 PMCID: PMC11262304 DOI: 10.7717/peerj.17729] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Accepted: 06/20/2024] [Indexed: 07/24/2024] Open
Abstract
Background Global public health is seriously threatened by the escalating issue of antimicrobial resistance (AMR). Antimicrobial peptides (AMPs), pivotal components of the innate immune system, have emerged as a potent solution to AMR due to their therapeutic potential. Employing computational methodologies for the prompt recognition of these antimicrobial peptides indeed unlocks fresh perspectives, thereby potentially revolutionizing antimicrobial drug development. Methods In this study, we have developed a model named as deepAMPNet. This model, which leverages graph neural networks, excels at the swift identification of AMPs. It employs structures of antimicrobial peptides predicted by AlphaFold2, encodes residue-level features through a bi-directional long short-term memory (Bi-LSTM) protein language model, and constructs adjacency matrices anchored on amino acids' contact maps. Results In a comparative study with other state-of-the-art AMP predictors on two external independent test datasets, deepAMPNet outperformed in accuracy. Furthermore, in terms of commonly accepted evaluation matrices such as AUC, Mcc, sensitivity, and specificity, deepAMPNet achieved the highest or highly comparable performances against other predictors. Conclusion deepAMPNet interweaves both structural and sequence information of AMPs, stands as a high-performance identification model that propels the evolution and design in antimicrobial peptide pharmaceuticals. The data and code utilized in this study can be accessed at https://github.com/Iseeu233/deepAMPNet.
Collapse
Affiliation(s)
- Fei Zhao
- Microbiology and Metabolic Engineering Laboratory of Sichuan Province, College of Life Science, Sichuan University, Chengdu, Sichuan, China
| | - Junhui Qiu
- Microbiology and Metabolic Engineering Laboratory of Sichuan Province, College of Life Science, Sichuan University, Chengdu, Sichuan, China
| | - Dongyou Xiang
- Microbiology and Metabolic Engineering Laboratory of Sichuan Province, College of Life Science, Sichuan University, Chengdu, Sichuan, China
| | - Pengrui Jiao
- Microbiology and Metabolic Engineering Laboratory of Sichuan Province, College of Life Science, Sichuan University, Chengdu, Sichuan, China
| | - Yu Cao
- Microbiology and Metabolic Engineering Laboratory of Sichuan Province, College of Life Science, Sichuan University, Chengdu, Sichuan, China
| | - Qingrui Xu
- Microbiology and Metabolic Engineering Laboratory of Sichuan Province, College of Life Science, Sichuan University, Chengdu, Sichuan, China
| | - Dairong Qiao
- Microbiology and Metabolic Engineering Laboratory of Sichuan Province, College of Life Science, Sichuan University, Chengdu, Sichuan, China
| | - Hui Xu
- Microbiology and Metabolic Engineering Laboratory of Sichuan Province, College of Life Science, Sichuan University, Chengdu, Sichuan, China
| | - Yi Cao
- Microbiology and Metabolic Engineering Laboratory of Sichuan Province, College of Life Science, Sichuan University, Chengdu, Sichuan, China
| |
Collapse
|
13
|
Serebrennikova M, Grafskaia E, Maltsev D, Ivanova K, Bashkirov P, Kornilov F, Volynsky P, Efremov R, Bocharov E, Lazarev V. TriplEP-CPP: Algorithm for Predicting the Properties of Peptide Sequences. Int J Mol Sci 2024; 25:6869. [PMID: 38999985 PMCID: PMC11241344 DOI: 10.3390/ijms25136869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2024] [Revised: 06/18/2024] [Accepted: 06/20/2024] [Indexed: 07/14/2024] Open
Abstract
Advancements in medicine and pharmacology have led to the development of systems that deliver biologically active molecules inside cells, increasing drug concentrations at target sites. This improves effectiveness and duration of action and reduces side effects on healthy tissues. Cell-penetrating peptides (CPPs) show promise in this area. While traditional medicinal chemistry methods have been used to develop CPPs, machine learning techniques can speed up and reduce costs in the search for new peptides. A predictive algorithm based on machine learning models was created to identify novel CPP sequences using molecular descriptors using a combination of algorithms like k-nearest neighbors, gradient boosting, and random forest. Some potential CPPs were found and tested for cytotoxicity and penetrating ability. A new low-toxicity CPP was discovered from the Rhopilema esculentum venom proteome through this study.
Collapse
Affiliation(s)
- Maria Serebrennikova
- Laboratory of Genetic Engineering, Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow 119435, Russia; (M.S.); (K.I.); (V.L.)
- Moscow Center for Advanced Studies 20, Kulakova Str., Moscow 123592, Russia; (P.B.); (F.K.); (R.E.); (E.B.)
| | - Ekaterina Grafskaia
- Laboratory of Genetic Engineering, Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow 119435, Russia; (M.S.); (K.I.); (V.L.)
| | - Dmitriy Maltsev
- Federal Center of Brain Research and Neurotechnologies, Federal Medical Biological Agency, Moscow 117997, Russia;
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia;
- Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Pirogov Russian National Research Medical University, Moscow 117997, Russia
| | - Kseniya Ivanova
- Laboratory of Genetic Engineering, Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow 119435, Russia; (M.S.); (K.I.); (V.L.)
- Moscow Center for Advanced Studies 20, Kulakova Str., Moscow 123592, Russia; (P.B.); (F.K.); (R.E.); (E.B.)
- Research Institute for Systems Biology and Medicine, Moscow 117246, Russia
| | - Pavel Bashkirov
- Moscow Center for Advanced Studies 20, Kulakova Str., Moscow 123592, Russia; (P.B.); (F.K.); (R.E.); (E.B.)
- Research Institute for Systems Biology and Medicine, Moscow 117246, Russia
| | - Fedor Kornilov
- Moscow Center for Advanced Studies 20, Kulakova Str., Moscow 123592, Russia; (P.B.); (F.K.); (R.E.); (E.B.)
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia;
| | - Pavel Volynsky
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia;
- Institute of Cytology, Russian Academy of Sciences, St. Petersburg 194064, Russia
| | - Roman Efremov
- Moscow Center for Advanced Studies 20, Kulakova Str., Moscow 123592, Russia; (P.B.); (F.K.); (R.E.); (E.B.)
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia;
| | - Eduard Bocharov
- Moscow Center for Advanced Studies 20, Kulakova Str., Moscow 123592, Russia; (P.B.); (F.K.); (R.E.); (E.B.)
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia;
| | - Vassili Lazarev
- Laboratory of Genetic Engineering, Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow 119435, Russia; (M.S.); (K.I.); (V.L.)
- Moscow Center for Advanced Studies 20, Kulakova Str., Moscow 123592, Russia; (P.B.); (F.K.); (R.E.); (E.B.)
| |
Collapse
|
14
|
Li H, Meng J, Wang Z, Tang Y, Xia S, Wang Y, Qin Z, Luan Y. miPEPPred-FRL: A Novel Method for Predicting Plant MiRNA-Encoded Peptides Using Adaptive Feature Representation Learning. J Chem Inf Model 2024; 64:2889-2900. [PMID: 37733290 DOI: 10.1021/acs.jcim.3c01020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/22/2023]
Abstract
MicroRNAs (miRNAs) are an essential type of small molecule RNAs that play significant regulatory roles in organisms. Recent studies have demonstrated that small open reading frames (sORFs) harbored in primary miRNAs (pri-miRNAs) can encode small peptides, known as miPEPs. Plant miPEPs can increase the abundance and activity of cognate miRNAs by promoting the transcription of their corresponding pri-miRNAs, thereby modulating plant traits. Biological experiments are the most effective way to accurately identify miPEPs; however, they are time-consuming and expensive. Hence, an efficient computational method for the identification of miPEPs on a large scale is highly desirable. Up to now, there have been no specialized computational tools for identifying miPEPs. In this work, a novel predictor named miPEPPred-FRL based on an adaptive feature representation learning framework that consists of the feature transformation module and the cascade architecture has been proposed. The feature transformation module integrating a newly designed feature selection method and classifier selection rule is developed to convert sequence-based features into primary class and probabilistic features, which are then fed into the improved cascade architecture to obtain more stable and discriminative augmented features. Finally, the augmented features are utilized to construct the final predictor. Cross-validation experiments illustrate that the novel feature selection method and classifier selection rule contribute to boosting the feature representation ability of the framework. Furthermore, the high accuracy of miPEPPred-FRL on independent testing data suggests that it is a trustworthy and valuable tool for the identification of miPEPs.
Collapse
Affiliation(s)
- Haibin Li
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Jun Meng
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Zhaowei Wang
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Youwei Tang
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Shihao Xia
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Yu Wang
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Zhaojing Qin
- School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
| | - Yushi Luan
- School of Bioengineering, Dalian University of Technology, Dalian, Liaoning 116024, China
| |
Collapse
|
15
|
Datta S, Nabeel Asim M, Dengel A, Ahmed S. NTpred: a robust and precise machine learning framework for in silico identification of Tyrosine nitration sites in protein sequences. Brief Funct Genomics 2024; 23:163-179. [PMID: 37248673 DOI: 10.1093/bfgp/elad018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 04/12/2023] [Accepted: 05/02/2023] [Indexed: 05/31/2023] Open
Abstract
Post-translational modifications (PTMs) either enhance a protein's activity in various sub-cellular processes, or degrade their activity which leads toward failure of intracellular processes. Tyrosine nitration (NT) modification degrades protein's activity that initiates and propagates various diseases including neurodegenerative, cardiovascular, autoimmune diseases and carcinogenesis. Identification of NT modification supports development of novel therapies and drug discoveries for associated diseases. Identification of NT modification in biochemical labs is expensive, time consuming and error-prone. To supplement this process, several computational approaches have been proposed. However these approaches fail to precisely identify NT modification, due to the extraction of irrelevant, redundant and less discriminative features from protein sequences. This paper presents the NTpred framework that is competent in extracting comprehensive features from raw protein sequences using four different sequence encoders. To reap the benefits of different encoders, it generates four additional feature spaces by fusing different combinations of individual encodings. Furthermore, it eradicates irrelevant and redundant features from eight different feature spaces through a Recursive Feature Elimination process. Selected features of four individual encodings and four feature fusion vectors are used to train eight different Gradient Boosted Tree classifiers. The probability scores from the trained classifiers are utilized to generate a new probabilistic feature space, which is used to train a Logistic Regression classifier. On the BD1 benchmark dataset, the proposed framework outperforms the existing best-performing predictor in 5-fold cross validation and independent test evaluation with combined improvement of 13.7% in MCC and 20.1% in AUC. Similarly, on the BD2 benchmark dataset, the proposed framework outperforms the existing best-performing predictor with combined improvement of 5.3% in MCC and 1.0% in AUC. NTpred is publicly available for further experimentation and predictive use at: https://sds_genetic_analysis.opendfki.de/PredNTS/.
Collapse
Affiliation(s)
- Sourajyoti Datta
- Department of Computer Science, Rheinland Pfälzische Technische Universität, Kaiserslautern, 67663, Germany
| | - Muhammad Nabeel Asim
- German Research Center for Artificial Intelligence, Kaiserslautern, 67663, Germany
| | - Andreas Dengel
- Department of Computer Science, Rheinland Pfälzische Technische Universität, Kaiserslautern, 67663, Germany
- German Research Center for Artificial Intelligence, Kaiserslautern, 67663, Germany
| | - Sheraz Ahmed
- German Research Center for Artificial Intelligence, Kaiserslautern, 67663, Germany
| |
Collapse
|
16
|
Preto AJ, Caniceiro AB, Duarte F, Fernandes H, Ferreira L, Mourão J, Moreira IS. POSEIDON: Peptidic Objects SEquence-based Interaction with cellular DOmaiNs: a new database and predictor. J Cheminform 2024; 16:18. [PMID: 38365724 PMCID: PMC10874016 DOI: 10.1186/s13321-024-00810-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Accepted: 02/07/2024] [Indexed: 02/18/2024] Open
Abstract
Cell-penetrating peptides (CPPs) are short chains of amino acids that have shown remarkable potential to cross the cell membrane and deliver coupled therapeutic cargoes into cells. Designing and testing different CPPs to target specific cells or tissues is crucial to ensure high delivery efficiency and reduced toxicity. However, in vivo/in vitro testing of various CPPs can be both time-consuming and costly, which has led to interest in computational methodologies, such as Machine Learning (ML) approaches, as faster and cheaper methods for CPP design and uptake prediction. However, most ML models developed to date focus on classification rather than regression techniques, because of the lack of informative quantitative uptake values. To address these challenges, we developed POSEIDON, an open-access and up-to-date curated database that provides experimental quantitative uptake values for over 2,300 entries and physicochemical properties of 1,315 peptides. POSEIDON also offers physicochemical properties, such as cell line, cargo, and sequence, among others. By leveraging this database along with cell line genomic features, we processed a dataset of over 1,200 entries to develop an ML regression CPP uptake predictor. Our results demonstrated that POSEIDON accurately predicted peptide cell line uptake, achieving a Pearson correlation of 0.87, Spearman correlation of 0.88, and r2 score of 0.76, on an independent test set. With its comprehensive and novel dataset, along with its potent predictive capabilities, the POSEIDON database and its associated ML predictor signify a significant leap forward in CPP research and development. The POSEIDON database and ML Predictor are available for free and with a user-friendly interface at https://moreiralab.com/resources/poseidon/ , making them valuable resources for advancing research on CPP-related topics. Scientific Contribution Statement: Our research addresses the critical need for more efficient and cost-effective methodologies in Cell-Penetrating Peptide (CPP) research. We introduced POSEIDON, a comprehensive and freely accessible database that delivers quantitative uptake values for over 2,300 entries, along with detailed physicochemical profiles for 1,315 peptides. Recognizing the limitations of current Machine Learning (ML) models for CPP design, our work leveraged the rich dataset provided by POSEIDON to develop a highly accurate ML regression model for predicting CPP uptake.
Collapse
Affiliation(s)
- António J Preto
- Center for Neuroscience and Cell Biology, University of Coimbra, 3004-504, Coimbra, Portugal
- PhD Programme in Experimental Biology and Biomedicine, Institute for Interdisciplinary Research (IIIUC), University of Coimbra, Casa Costa Alemão, 3030-789, Coimbra, Portugal
| | - Ana B Caniceiro
- Center for Neuroscience and Cell Biology, University of Coimbra, 3004-504, Coimbra, Portugal
- Department of Life Sciences, University of Coimbra, Calçada Martim de Freitas, 3000-456, Coimbra, Portugal
| | - Francisco Duarte
- Center for Neuroscience and Cell Biology, University of Coimbra, 3004-504, Coimbra, Portugal
| | - Hugo Fernandes
- CNC - Center for Neuroscience and Cell Biology, CIBB - Centre for Innovative Biomedicine and Biotechnology, University of Coimbra, Coimbra, Portugal
- FMUC - Faculty of Medicine, University of Coimbra, Coimbra, Portugal
- MIA - Multidisciplinary Institute of Ageing, University of Coimbra, Coimbra, Portugal
| | - Lino Ferreira
- CNC - Center for Neuroscience and Cell Biology, CIBB - Centre for Innovative Biomedicine and Biotechnology, University of Coimbra, Coimbra, Portugal
- FMUC - Faculty of Medicine, University of Coimbra, Coimbra, Portugal
| | - Joana Mourão
- CNC - Center for Neuroscience and Cell Biology, CIBB - Centre for Innovative Biomedicine and Biotechnology, University of Coimbra, Coimbra, Portugal
| | - Irina S Moreira
- Department of Life Sciences, University of Coimbra, Calçada Martim de Freitas, 3000-456, Coimbra, Portugal.
- CNC - Center for Neuroscience and Cell Biology, CIBB - Centre for Innovative Biomedicine and Biotechnology, University of Coimbra, Coimbra, Portugal.
| |
Collapse
|
17
|
Malik A, Jayarathna DK, Fisher M, Barbhuiya TK, Gandhi NS, Batra J. Dynamics and recognition of homeodomain containing protein-DNA complex of IRX4. Proteins 2024; 92:282-301. [PMID: 37861198 DOI: 10.1002/prot.26604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2022] [Revised: 09/15/2023] [Accepted: 09/25/2023] [Indexed: 10/21/2023]
Abstract
Iroquois Homeobox 4 (IRX4) belongs to a family of homeobox TFs having roles in embryogenesis, cell specification, and organ development. Recently, large scale genome-wide association studies and epigenetic studies have highlighted the role of IRX4 and its associated variants in prostate cancer. No studies have investigated and characterized the structural aspect of the IRX4 homeodomain and its potential to bind to DNA. The current study uses sequence analysis, homology modeling, and molecular dynamics simulations to explore IRX4 homeodomain-DNA recognition mechanisms and the role of somatic mutations affecting these interactions. Using publicly available databases, gene expression of IRX4 was found in different tissues, including prostate, heart, skin, vagina, and the protein expression was found in cancer cell lines (HCT166, HEK293), B cells, ascitic fluid, and brain. Sequence conservation of the homeodomain shed light on the importance of N- and C-terminal residues involved in DNA binding. The specificity of IRX4 homodimer bound to consensus human DNA sequence was confirmed by molecular dynamics simulations, representing the role of conserved amino acids including R145, A194, N195, S190, R198, and R199 in binding to DNA. Additional N-terminal residues like T144 and G143 were also found to have specific interactions highlighting the importance of N-terminus of the homeodomain in DNA recognition. Additionally, the effects of somatic mutations, including the conserved Arginine (R145, R198, and R199) residues on DNA binding elucidated the importance of these residues in stabilizing the protein-DNA complex. Secondary structure and hydrogen bonding analysis showed the roles of specific residues (R145, T191, A194, N195, R198, and R199) in maintaining the homogeneity of the structure and its interaction with DNA. The differences in relative binding free energies of all the mutants shed light on the structural modularity of this protein and the dynamics behind protein-DNA interaction. We also have predicted that the C-terminal sequence of the IRX4 homeodomain could act as a potential cell-penetrating peptide, emphasizing the role these small peptides could play in targeting homeobox TFs.
Collapse
Affiliation(s)
- Adil Malik
- School of Biomedical Sciences, Faculty of Health, Queensland University of Technology, Brisbane, Queensland, Australia
- Translational Research Institute, Woolloongabba, Queensland, Australia
| | - Dulari K Jayarathna
- Centre for Genomics and Personalised Health, Queensland University of Technology, Brisbane, Queensland, Australia
- School of Chemistry and Physics, Faculty of Science, Queensland University of Technology, Brisbane, Queensland, Australia
| | - Mark Fisher
- School of Biomedical Sciences, Faculty of Health, Queensland University of Technology, Brisbane, Queensland, Australia
- Centre for Genomics and Personalised Health, Queensland University of Technology, Brisbane, Queensland, Australia
| | - Tabassum Khair Barbhuiya
- Centre for Genomics and Personalised Health, Queensland University of Technology, Brisbane, Queensland, Australia
- School of Chemistry and Physics, Faculty of Science, Queensland University of Technology, Brisbane, Queensland, Australia
| | - Neha S Gandhi
- Centre for Genomics and Personalised Health, Queensland University of Technology, Brisbane, Queensland, Australia
- School of Chemistry and Physics, Faculty of Science, Queensland University of Technology, Brisbane, Queensland, Australia
- Department of Computer Science and Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education, Udupi, Karnataka, India
| | - Jyotsna Batra
- School of Biomedical Sciences, Faculty of Health, Queensland University of Technology, Brisbane, Queensland, Australia
- Translational Research Institute, Woolloongabba, Queensland, Australia
- Centre for Genomics and Personalised Health, Queensland University of Technology, Brisbane, Queensland, Australia
| |
Collapse
|
18
|
Shi K, Xiong Y, Wang Y, Deng Y, Wang W, Jing B, Gao X. PractiCPP: a deep learning approach tailored for extremely imbalanced datasets in cell-penetrating peptide prediction. Bioinformatics 2024; 40:btae058. [PMID: 38305405 PMCID: PMC11212486 DOI: 10.1093/bioinformatics/btae058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 01/25/2024] [Accepted: 01/30/2024] [Indexed: 02/03/2024] Open
Abstract
MOTIVATION Effective drug delivery systems are paramount in enhancing pharmaceutical outcomes, particularly through the use of cell-penetrating peptides (CPPs). These peptides are gaining prominence due to their ability to penetrate eukaryotic cells efficiently without inflicting significant damage to the cellular membrane, thereby ensuring optimal drug delivery. However, the identification and characterization of CPPs remain a challenge due to the laborious and time-consuming nature of conventional methods, despite advances in proteomics. Current computational models, however, are predominantly tailored for balanced datasets, an approach that falls short in real-world applications characterized by a scarcity of known positive CPP instances. RESULTS To navigate this shortfall, we introduce PractiCPP, a novel deep-learning framework tailored for CPP prediction in highly imbalanced data scenarios. Uniquely designed with the integration of hard negative sampling and a sophisticated feature extraction and prediction module, PractiCPP facilitates an intricate understanding and learning from imbalanced data. Our extensive computational validations highlight PractiCPP's exceptional ability to outperform existing state-of-the-art methods, demonstrating remarkable accuracy, even in datasets with an extreme positive-to-negative ratio of 1:1000. Furthermore, through methodical embedding visualizations, we have established that models trained on balanced datasets are not conducive to practical, large-scale CPP identification, as they do not accurately reflect real-world complexities. In summary, PractiCPP potentially offers new perspectives in CPP prediction methodologies. Its design and validation, informed by real-world dataset constraints, suggest its utility as a valuable tool in supporting the acceleration of drug delivery advancements. AVAILABILITY AND IMPLEMENTATION The source code of PractiCPP is available on Figshare at https://doi.org/10.6084/m9.figshare.25053878.v1.
Collapse
Affiliation(s)
- Kexin Shi
- Syneron Technology, Guangzhou 510000, China
- Individualized Interdisciplinary Program (Data Science and Analytics), The Hong Kong University of Science and Technology, Hong Kong SAR, China
| | | | - Yu Wang
- Syneron Technology, Guangzhou 510000, China
| | - Yifan Deng
- Syneron Technology, Guangzhou 510000, China
| | - Wenjia Wang
- Data Science and Analytics Thrust, The Hong Kong University of Science and Technology (Guangzhou), Nansha, Guangzhou, 511400, Guangdong, China
| | - Bingyi Jing
- Department of Statistics and Data Science, Southern University of Science and Technology, Shenzhen 518000, China
| | - Xin Gao
- Syneron Technology, Guangzhou 510000, China
- Computer Science Program, Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia
- Computational Bioscience Research Center, King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia
| |
Collapse
|
19
|
Cui Z, Wang SG, He Y, Chen ZH, Zhang QH. DeepTPpred: A Deep Learning Approach With Matrix Factorization for Predicting Therapeutic Peptides by Integrating Length Information. IEEE J Biomed Health Inform 2023; 27:4611-4622. [PMID: 37368803 DOI: 10.1109/jbhi.2023.3290014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2023]
Abstract
The abuse of traditional antibiotics has led to increased resistance of bacteria and viruses. Efficient therapeutic peptide prediction is critical for peptide drug discovery. However, most of the existing methods only make effective predictions for one class of therapeutic peptides. It is worth noting that currently no predictive method considers sequence length information as a distinct feature of therapeutic peptides. In this article, a novel deep learning approach with matrix factorization for predicting therapeutic peptides (DeepTPpred) by integrating length information are proposed. The matrix factorization layer can learn the potential features of the encoded sequence through the mechanism of first compression and then restoration. And the length features of the sequence of therapeutic peptides are embedded with encoded amino acid sequences. To automatically learn therapeutic peptide predictions, these latent features are input into the neural networks with self-attention mechanism. On eight therapeutic peptide datasets, DeepTPpred achieved excellent prediction results. Based on these datasets, we first integrated eight datasets to obtain a full therapeutic peptide integration dataset. Then, we obtained two functional integration datasets based on the functional similarity of the peptides. Finally, we also conduct experiments on the latest versions of the ACP and CPP datasets. Overall, the experimental results show that our work is effective for the identification of therapeutic peptides.
Collapse
|
20
|
Chen S, Liao Y, Zhao J, Bin Y, Zheng C. PACVP: Prediction of Anti-Coronavirus Peptides Using a Stacking Learning Strategy With Effective Feature Representation. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:3106-3116. [PMID: 37022025 DOI: 10.1109/tcbb.2023.3238370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Due to the global outbreak of COVID-19 and its variants, antiviral peptides with anti-coronavirus activity (ACVPs) represent a promising new drug candidate for the treatment of coronavirus infection. At present, several computational tools have been developed to identify ACVPs, but the overall prediction performance is still not enough to meet the actual therapeutic application. In this study, we constructed an efficient and reliable prediction model PACVP (Prediction of Anti-CoronaVirus Peptides) for identifying ACVPs based on effective feature representation and a two-layer stacking learning framework. In the first layer, we use nine feature encoding methods with different feature representation angles to characterize the rich sequence information and fuse them into a feature matrix. Secondly, data normalization and unbalanced data processing are carried out. Next, 12 baseline models are constructed by combining three feature selection methods and four machine learning classification algorithms. In the second layer, we input the optimal probability features into the logistic regression algorithm (LR) to train the final model PACVP. The experiments show that PACVP achieves favorable prediction performance on independent test dataset, with ACC of 0.9208 and AUC of 0.9465. We hope that PACVP will become a useful method for identifying, annotating and characterizing novel ACVPs.
Collapse
|
21
|
Meng C, Pei Y, Zou Q, Yuan L. DP-AOP: A novel SVM-based antioxidant proteins identifier. Int J Biol Macromol 2023; 247:125499. [PMID: 37414318 DOI: 10.1016/j.ijbiomac.2023.125499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 06/01/2023] [Accepted: 06/19/2023] [Indexed: 07/08/2023]
Abstract
The identification of antioxidant proteins is a challenging yet meaningful task, as they can protect against the damage caused by some free radicals. In addition to time-consuming, laborious, and expensive experimental identification methods, efficient identification of antioxidant proteins through machine learning algorithms has become increasingly common. In recent years, researchers have proposed models for identifying antioxidant proteins; unfortunately, although the accuracy of models is already high, their sensitivity is too low, indicating the possibility of overfitting in the model. Therefore, we developed a new model called DP-AOP for the recognition of antioxidant proteins. We used the SMOTE algorithm to balance the dataset, selected Wei's proposed feature extraction algorithm to obtain 473 dimensional feature vectors, and based on the sorting function in MRMD, scored and ranked each feature to obtain a feature set with contribution values ranging from high to low. To effectively reduce the feature dimension, we combined the dynamic programming idea to make the local eight features the optimal subset. After obtaining the 36 dimensional feature vectors, we finally selected 17 features through experimental analysis. The SVM classification algorithm was used to implement the model through the libsvm tool. The model achieved satisfactory performance, with an accuracy rate of 91.076 %, SN of 96.4 %, SP of 85.8 %, MCC of 82.6 %, and F1 core of 91.5 %. Furthermore, we built a free web server to facilitate researchers' subsequent unfolding studies of antioxidant protein recognition. The website is http://112.124.26.17:8003/#/.
Collapse
Affiliation(s)
- Chaolu Meng
- College of Computer and Information Engineering, Inner Mongolia Agricultural University, Hohhot, China; Inner Mongolia Autonomous Region Key Laboratory of Big Data Research and Application of Agriculture and Animal Husbandry, China.
| | - Yue Pei
- College of Computer and Information Engineering, Inner Mongolia Agricultural University, Hohhot, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, China.
| | - Lei Yuan
- Department of Hepatobiliary Surgery, Quzhou People's Hospital, China.
| |
Collapse
|
22
|
Hsueh HT, Chou RT, Rai U, Liyanage W, Kim YC, Appell MB, Pejavar J, Leo KT, Davison C, Kolodziejski P, Mozzer A, Kwon H, Sista M, Anders NM, Hemingway A, Rompicharla SVK, Edwards M, Pitha I, Hanes J, Cummings MP, Ensign LM. Machine learning-driven multifunctional peptide engineering for sustained ocular drug delivery. Nat Commun 2023; 14:2509. [PMID: 37130851 PMCID: PMC10154330 DOI: 10.1038/s41467-023-38056-w] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 04/12/2023] [Indexed: 05/04/2023] Open
Abstract
Sustained drug delivery strategies have many potential benefits for treating a range of diseases, particularly chronic diseases that require treatment for years. For many chronic ocular diseases, patient adherence to eye drop dosing regimens and the need for frequent intraocular injections are significant barriers to effective disease management. Here, we utilize peptide engineering to impart melanin binding properties to peptide-drug conjugates to act as a sustained-release depot in the eye. We develop a super learning-based methodology to engineer multifunctional peptides that efficiently enter cells, bind to melanin, and have low cytotoxicity. When the lead multifunctional peptide (HR97) is conjugated to brimonidine, an intraocular pressure lowering drug that is prescribed for three times per day topical dosing, intraocular pressure reduction is observed for up to 18 days after a single intracameral injection in rabbits. Further, the cumulative intraocular pressure lowering effect increases ~17-fold compared to free brimonidine injection. Engineered multifunctional peptide-drug conjugates are a promising approach for providing sustained therapeutic delivery in the eye and beyond.
Collapse
Affiliation(s)
- Henry T Hsueh
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Renee Ti Chou
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, College Park, MD, USA
| | - Usha Rai
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Wathsala Liyanage
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Yoo Chun Kim
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Matthew B Appell
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD, USA
| | - Jahnavi Pejavar
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Kirby T Leo
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Charlotte Davison
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Patricia Kolodziejski
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Ann Mozzer
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - HyeYoung Kwon
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Maanasa Sista
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Biomedical Engineering, Case Western Reserve University, Cleveland, OH, USA
| | - Nicole M Anders
- The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA
| | - Avelina Hemingway
- The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA
| | - Sri Vishnu Kiran Rompicharla
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Malia Edwards
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Ian Pitha
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Justin Hanes
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
- The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA
| | - Michael P Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, College Park, MD, USA.
| | - Laura M Ensign
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA.
- Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA.
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA.
- Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD, USA.
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA.
- The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA.
| |
Collapse
|
23
|
Zhou W, Liu Y, Li Y, Kong S, Wang W, Ding B, Han J, Mou C, Gao X, Liu J. TriNet: A tri-fusion neural network for the prediction of anticancer and antimicrobial peptides. PATTERNS (NEW YORK, N.Y.) 2023; 4:100702. [PMID: 36960450 PMCID: PMC10028424 DOI: 10.1016/j.patter.2023.100702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 12/20/2022] [Accepted: 02/03/2023] [Indexed: 03/04/2023]
Abstract
The accurate identification of anticancer peptides (ACPs) and antimicrobial peptides (AMPs) remains a computational challenge. We propose a tri-fusion neural network termed TriNet for the accurate prediction of both ACPs and AMPs. The framework first defines three kinds of features to capture the peptide information contained in serial fingerprints, sequence evolutions, and physicochemical properties, which are then fed into three parallel modules: a convolutional neural network module enhanced by channel attention, a bidirectional long short-term memory module, and an encoder module for training and final classification. To achieve a better training effect, TriNet is trained via a training approach using iterative interactions between the samples in the training and validation datasets. TriNet is tested on multiple challenging ACP and AMP datasets and exhibits significant improvements over various state-of-the-art methods. The web server and source code of TriNet are respectively available at http://liulab.top/TriNet/server and https://github.com/wanyunzh/TriNet.
Collapse
Affiliation(s)
- Wanyun Zhou
- SDU-ANU Joint Science College, Shandong University (Weihai), Weihai 264209, China
| | - Yufei Liu
- SDU-ANU Joint Science College, Shandong University (Weihai), Weihai 264209, China
| | - Yingxin Li
- School of Mechanical, Electrical & Information Engineering, Shandong University (Weihai), Weihai 264209, China
| | - Siqi Kong
- SDU-ANU Joint Science College, Shandong University (Weihai), Weihai 264209, China
| | - Weilin Wang
- SDU-ANU Joint Science College, Shandong University (Weihai), Weihai 264209, China
| | - Boyun Ding
- SDU-ANU Joint Science College, Shandong University (Weihai), Weihai 264209, China
| | - Jiyun Han
- School of Mathematics and Statistics, Shandong University (Weihai), Weihai 264209, China
| | - Chaozhou Mou
- School of Mathematics and Statistics, Shandong University (Weihai), Weihai 264209, China
| | - Xin Gao
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal 23955, Saudi Arabia
| | - Juntao Liu
- School of Mathematics and Statistics, Shandong University (Weihai), Weihai 264209, China
| |
Collapse
|
24
|
Essus VA, Souza Júnior GSE, Nunes GHP, Oliveira JDS, de Faria BM, Romão LF, Cortines JR. Bacteriophage P22 Capsid as a Pluripotent Nanotechnology Tool. Viruses 2023; 15:516. [PMID: 36851730 PMCID: PMC9962691 DOI: 10.3390/v15020516] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2022] [Revised: 02/09/2023] [Accepted: 02/10/2023] [Indexed: 02/15/2023] Open
Abstract
The Salmonella enterica bacteriophage P22 is one of the most promising models for the development of virus-like particle (VLP) nanocages. It possesses an icosahedral T = 7 capsid, assembled by the combination of two structural proteins: the coat protein (gp5) and the scaffold protein (gp8). The P22 capsid has the remarkable capability of undergoing structural transition into three morphologies with differing diameters and wall-pore sizes. These varied morphologies can be explored for the design of nanoplatforms, such as for the development of cargo internalization strategies. The capsid proteic nature allows for the extensive modification of its structure, enabling the addition of non-native structures to alter the VLP properties or confer them to diverse ends. Various molecules were added to the P22 VLP through genetic, chemical, and other means to both the capsid and the scaffold protein, permitting the encapsulation or the presentation of cargo. This allows the particle to be exploited for numerous purposes-for example, as a nanocarrier, nanoreactor, and vaccine model, among other applications. Therefore, the present review intends to give an overview of the literature on this amazing particle.
Collapse
Affiliation(s)
- Victor Alejandro Essus
- Laboratório de Virologia e Espectrometria de Massas (LAVEM), Departamento de Virologia, Instituto de Microbiologia Paulo de Góes, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21590-902, Brazil
| | - Getúlio Silva e Souza Júnior
- Laboratório de Virologia e Espectrometria de Massas (LAVEM), Departamento de Virologia, Instituto de Microbiologia Paulo de Góes, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21590-902, Brazil
| | - Gabriel Henrique Pereira Nunes
- Laboratório de Virologia e Espectrometria de Massas (LAVEM), Departamento de Virologia, Instituto de Microbiologia Paulo de Góes, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21590-902, Brazil
| | - Juliana dos Santos Oliveira
- Laboratório de Virologia e Espectrometria de Massas (LAVEM), Departamento de Virologia, Instituto de Microbiologia Paulo de Góes, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21590-902, Brazil
| | - Bruna Mafra de Faria
- Instituto de Ciências Biomédicas, Universidade Federal do Rio de Janeiro, Av. Carlos Chagas Filho, 373, CCS, Bl. F026, Rio de Janeiro 21941-590, Brazil
| | - Luciana Ferreira Romão
- Instituto de Ciências Biomédicas, Universidade Federal do Rio de Janeiro, Av. Carlos Chagas Filho, 373, CCS, Bl. F026, Rio de Janeiro 21941-590, Brazil
| | - Juliana Reis Cortines
- Laboratório de Virologia e Espectrometria de Massas (LAVEM), Departamento de Virologia, Instituto de Microbiologia Paulo de Góes, Universidade Federal do Rio de Janeiro, Rio de Janeiro 21590-902, Brazil
| |
Collapse
|
25
|
Hasanzadeh A, Hamblin MR, Kiani J, Noori H, Hardie JM, Karimi M, Shafiee H. Could artificial intelligence revolutionize the development of nanovectors for gene therapy and mRNA vaccines? NANO TODAY 2022; 47:101665. [PMID: 37034382 PMCID: PMC10081506 DOI: 10.1016/j.nantod.2022.101665] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Gene therapy enables the introduction of nucleic acids like DNA and RNA into host cells, and is expected to revolutionize the treatment of a wide range of diseases. This growth has been further accelerated by the discovery of CRISPR/Cas technology, which allows accurate genomic editing in a broad range of cells and organisms in vitro and in vivo. Despite many advances in gene delivery and the development of various viral and non-viral gene delivery vectors, the lack of highly efficient non-viral systems with low cellular toxicity remains a challenge. The application of cutting-edge technologies such as artificial intelligence (AI) has great potential to find new paradigms to solve this issue. Herein, we review AI and its major subfields including machine learning (ML), neural networks (NNs), expert systems, deep learning (DL), computer vision and robotics. We discuss the potential of AI-based models and algorithms in the design of targeted gene delivery vehicles capable of crossing extracellular and intracellular barriers by viral mimicry strategies. We finally discuss the role of AI in improving the function of CRISPR/Cas systems, developing novel nanobots, and mRNA vaccine carriers.
Collapse
Affiliation(s)
- Akbar Hasanzadeh
- Cellular and Molecular Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran
- Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran 1449614535, Iran
| | - Michael R Hamblin
- Laser Research Centre, Faculty of Health Science, University of Johannesburg, Doornfontein 2028, South Africa
- Radiation Biology Research Center, Iran University of Medical Sciences, Tehran, Iran
| | - Jafar Kiani
- Oncopathology Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran
- Department of Molecular Medicine, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran, Iran
| | - Hamid Noori
- Cellular and Molecular Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran
- Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran 1449614535, Iran
| | - Joseph M. Hardie
- Division of Engineering in Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, 02139 USA
| | - Mahdi Karimi
- Cellular and Molecular Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran
- Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran 1449614535, Iran
- Oncopathology Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran
- Research Center for Science and Technology in Medicine, Tehran University of Medical Sciences, Tehran 141556559, Iran
- Applied Biotechnology Research Centre, Tehran Medical Science, Islamic Azad University, Tehran 1584743311, Iran
| | - Hadi Shafiee
- Division of Engineering in Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, 02139 USA
| |
Collapse
|
26
|
Antimicrobial peptides with cell-penetrating activity as prophylactic and treatment drugs. Biosci Rep 2022; 42:231731. [PMID: 36052730 PMCID: PMC9508529 DOI: 10.1042/bsr20221789] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Revised: 08/31/2022] [Accepted: 09/01/2022] [Indexed: 01/18/2023] Open
Abstract
Health is fundamental for the development of individuals and evolution of species. In that sense, for human societies is relevant to understand how the human body has developed molecular strategies to maintain health. In the present review, we summarize diverse evidence that support the role of peptides in this endeavor. Of particular interest to the present review are antimicrobial peptides (AMP) and cell-penetrating peptides (CPP). Different experimental evidence indicates that AMP/CPP are able to regulate autophagy, which in turn regulates the immune system response. AMP also assists in the establishment of the microbiota, which in turn is critical for different behavioral and health aspects of humans. Thus, AMP and CPP are multifunctional peptides that regulate two aspects of our bodies that are fundamental to our health: autophagy and microbiota. While it is now clear the multifunctional nature of these peptides, we are still in the early stages of the development of computational strategies aimed to assist experimentalists in identifying selective multifunctional AMP/CPP to control nonhealthy conditions. For instance, both AMP and CPP are computationally characterized as amphipatic and cationic, yet none of these features are relevant to differentiate these peptides from non-AMP or non-CPP. The present review aims to highlight current knowledge that may facilitate the development of AMP’s design tools for preventing or treating illness.
Collapse
|
27
|
Arif M, Kabir M, Ahmed S, Khan A, Ge F, Khelifi A, Yu DJ. DeepCPPred: A Deep Learning Framework for the Discrimination of Cell-Penetrating Peptides and Their Uptake Efficiencies. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2749-2759. [PMID: 34347603 DOI: 10.1109/tcbb.2021.3102133] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Cell-penetrating peptides (CPPs) are special peptides capable of carrying a variety of bioactive molecules, such as genetic materials, short interfering RNAs and nanoparticles, into cells. Recently, research on CPP has gained substantial interest from researchers, and the biological mechanisms of CPPS have been assessed in the context of safe drug delivery agents and therapeutic applications. Correct identification and synthesis of CPPs using traditional biochemical methods is an extremely slow, expensive and laborious task particularly due to the large volume of unannotated peptide sequences accumulating in the World Bank repository. Hence, a powerful bioinformatics predictor that rapidly identifies CPPs with a high recognition rate is urgently needed. To date, numerous computational methods have been developed for CPP prediction. However, the available machine-learning (ML) tools are unable to distinguish both the CPPs and their uptake efficiencies. This study aimed to develop a two-layer deep learning framework named DeepCPPred to identify both CPPs in the first phase and peptide uptake efficiency in the second phase. The DeepCPPred predictor first uses four types of descriptors that cover evolutionary, energy estimation, reduced sequence and amino-acid contact information. Then, the extracted features are optimized through the elastic net algorithm and fed into a cascade deep forest algorithm to build the final CPP model. The proposed method achieved 99.45 percent overall accuracy with the CPP924 benchmark dataset in the first layer and 95.43 percent accuracy in the second layer with the CPPSite3 dataset using a 5-fold cross-validation test. Thus, our proposed bioinformatics tool surpassed all the existing state-of-the-art sequence-based CPP approaches.
Collapse
|
28
|
Hu RS, Wu J, Zhang L, Zhou X, Zhang Y. CD8TCEI-EukPath: A Novel Predictor to Rapidly Identify CD8+ T-Cell Epitopes of Eukaryotic Pathogens Using a Hybrid Feature Selection Approach. Front Genet 2022; 13:935989. [PMID: 35937988 PMCID: PMC9354802 DOI: 10.3389/fgene.2022.935989] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Accepted: 05/24/2022] [Indexed: 12/02/2022] Open
Abstract
Computational prediction to screen potential vaccine candidates has been proven to be a reliable way to provide guarantees for vaccine discovery in infectious diseases. As an important class of organisms causing infectious diseases, pathogenic eukaryotes (such as parasitic protozoans) have evolved the ability to colonize a wide range of hosts, including humans and animals; meanwhile, protective vaccines are urgently needed. Inspired by the immunological idea that pathogen-derived epitopes are able to mediate the CD8+ T-cell-related host adaptive immune response and with the available positive and negative CD8+ T-cell epitopes (TCEs), we proposed a novel predictor called CD8TCEI-EukPath to detect CD8+ TCEs of eukaryotic pathogens. Our method integrated multiple amino acid sequence-based hybrid features, employed a well-established feature selection technique, and eventually built an efficient machine learning classifier to differentiate CD8+ TCEs from non-CD8+ TCEs. Based on the feature selection results, 520 optimal hybrid features were used for modeling by utilizing the LightGBM algorithm. CD8TCEI-EukPath achieved impressive performance, with an accuracy of 79.255% in ten-fold cross-validation and an accuracy of 78.169% in the independent test. Collectively, CD8TCEI-EukPath will contribute to rapidly screening epitope-based vaccine candidates, particularly from large peptide-coding datasets. To conduct the prediction of CD8+ TCEs conveniently, an online web server is freely accessible (http://lab.malab.cn/∼hrs/CD8TCEI-EukPath/).
Collapse
Affiliation(s)
- Rui-Si Hu
- Yangtze Delta Region Institute, University of Electronic Science and Technology of China, Quzhou, China
| | - Jin Wu
- School of Management, Shenzhen Polytechnic, Shenzhen, China
| | - Lichao Zhang
- School of Intelligent Manufacturing and Equipment, Shenzhen Institute of Information Technology, Shenzhen, China
| | - Xun Zhou
- Beidahuang Industry Group General Hospital, Harbin, China
- *Correspondence: Xun Zhou, ; Ying Zhang,
| | - Ying Zhang
- Department of Anesthesiology, Hospital (T.C.M) Affiliated of Southwest Medical University, Luzhou, China
- *Correspondence: Xun Zhou, ; Ying Zhang,
| |
Collapse
|
29
|
Niu M, Zou Q. SgRNA-RF: Identification of SgRNA On-Target Activity With Imbalanced Datasets. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:2442-2453. [PMID: 33979289 DOI: 10.1109/tcbb.2021.3079116] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Single-guide RNA is a guide RNA (gRNA), which guides the insertion or deletion of uridine residues into kinetoplastid during RNA editing. It is a small non-coding RNA that can be combined with pre -mRNA pairing. SgRNA is a critical component of the CRISPR/Cas9 gene knockout system and play an important role in gene editing and gene regulation. It is important to accurately and quickly identify highly on-target activity sgRNAs. Due to its importance, several computational predictors have been proposed to predict sgRNAs on-target activity. All these methods have clearly contributed to the development of this very important field. However, they also have certain limitations. In the paper, we developed a new classifier SgRNA-RF, which extracts the features of nucleic acid composition and structure of on-target activity sgRNA sequence and identified by random forest algorithm. In addition to solving an imbalanced dataset, this paper proposed a new method called CS-Smote. We compared sgRNA-RF with state-of-the-art predictors on the five datasets, and found SgRNA-RF significantly improved the identification accuracy, with accuracies of 0.8636,0.9161,0.894,0.938,0.965,0.77,0.979,0.973, respectively. The user-friendly web server that implements sgRNA-RF is freely available at http://server.malab.cn/sgRNA-RF/.
Collapse
|
30
|
Prediction of Cell-Penetrating Peptides Using a Novel HSIC-Based Multiview TSK Fuzzy System. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12115383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Cell-penetrating peptides (CPPs) are short peptides that can carry cargo into cells. CPPs are widely utilized due to their powerful loading capacity and transduction efficiency. Identifying CPPs is the basis for studying their functions and mechanisms; however, experimental methods to identify CPPs are expensive and time-consuming. Recently, CPP predictors based on machine learning methods have become a research hotspot. Although considerable progress has been made, some challenges remain unresolved. First, most predictors employ a variety of feature descriptors to transform an original sequence into multiview data; however, extant methods ignore the relationships between different views, limiting further performance improvement. Second, most machine learning models are actually black boxes and cannot offer insightful advice. In this paper, a novel Hilbert–Schmidt independence criterion (HSIC)-based multiview TSK fuzzy system is proposed. Compared with other machine learning methods, TSK fuzzy systems have better interpretability, and the introduction of multiview mechanisms provides comprehensive insight into the intrinsic laws of the data. HSIC is utilized here to measure the independence and enhance the complementarity between different views. Notably, the proposed method attained prediction accuracy results of 92.2% and 96.2% for the training and independent test sets, respectively. The empirical results show that our promising approach features greater recognition performance than the state-of-the-art method.
Collapse
|
31
|
Liu P, Ding Y, Rong Y, Chen D. Prediction of cell penetrating peptides and their uptake efficiency using random forest‐based feature selections. AIChE J 2022. [DOI: 10.1002/aic.17781] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Peng Liu
- Institute of Fundamental and Frontier Sciences University of Electronic Science and Technology of China Chengdu China
- Institute of Yangtze Delta Region (Quzhou) University of Electronic Science and Technology of China Quzhou China
| | - Yijie Ding
- Institute of Yangtze Delta Region (Quzhou) University of Electronic Science and Technology of China Quzhou China
| | - Ying Rong
- Beidahuang Industry Group General Hospital Harbin China
| | - Dong Chen
- College of Electrical and Information Engineering, Quzhou University Quzhou China
| |
Collapse
|
32
|
Chen X, Zhang Q, Li B, Lu C, Yang S, Long J, He B, Chen H, Huang J. BBPpredict: A Web Service for Identifying Blood-Brain Barrier Penetrating Peptides. Front Genet 2022; 13:845747. [PMID: 35656322 PMCID: PMC9152268 DOI: 10.3389/fgene.2022.845747] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Accepted: 03/30/2022] [Indexed: 12/22/2022] Open
Abstract
Blood-brain barrier (BBB) is a major barrier to drug delivery into the brain in the treatment of central nervous system (CNS) diseases. Blood-brain barrier penetrating peptides (BBPs), a class of peptides that can cross BBB through various mechanisms without damaging BBB, are effective drug candidates for CNS diseases. However, identification of BBPs by experimental methods is time-consuming and laborious. To discover more BBPs as drugs for CNS disease, it is urgent to develop computational methods that can quickly and accurately identify BBPs and non-BBPs. In the present study, we created a training dataset that consists of 326 BBPs derived from previous databases and published manuscripts and 326 non-BBPs collected from UniProt, to construct a BBP predictor based on sequence information. We also constructed an independent testing dataset with 99 BBPs and 99 non-BBPs. Multiple machine learning methods were compared based on the training dataset via a nested cross-validation. The final BBP predictor was constructed based on the training dataset and the results showed that random forest (RF) method outperformed other classification algorithms on the training and independent testing dataset. Compared with previous BBP prediction tools, the RF-based predictor, named BBPpredict, performs considerably better than state-of-the-art BBP predictors. BBPpredict is expected to contribute to the discovery of novel BBPs, or at least can be a useful complement to the existing methods in this area. BBPpredict is freely available at http://i.uestc.edu.cn/BBPpredict/cgi-bin/BBPpredict.pl.
Collapse
Affiliation(s)
- Xue Chen
- Medical College, Guizhou University, Guiyang, China
| | | | - Bowen Li
- Medical College, Guizhou University, Guiyang, China
| | - Chunying Lu
- Medical College, Guizhou University, Guiyang, China
| | | | - Jinjin Long
- Medical College, Guizhou University, Guiyang, China
| | - Bifang He
- Medical College, Guizhou University, Guiyang, China
| | - Heng Chen
- Medical College, Guizhou University, Guiyang, China
| | - Jian Huang
- School of Life Science and Technology, University of Electronic Science and Technology of China, Chengdu, China
| |
Collapse
|
33
|
Lokhande KB, Banerjee T, Swamy KV, Ghosh P, Deshpande M. An in silico scientific basis for LL-37 as a therapeutic for Covid-19. Proteins 2022; 90:1029-1043. [PMID: 34333809 PMCID: PMC8441666 DOI: 10.1002/prot.26198] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2021] [Revised: 06/08/2021] [Accepted: 07/28/2021] [Indexed: 01/25/2023]
Abstract
A multi-pronged approach with help in all forms possible is essential to completely overcome the Covid-19 pandemic. There is a requirement to research as many new and different types of approaches as possible to cater to the entire world population, complementing the vaccines with promising results. The need is also because SARS-CoV-2 has several unknown or variable facets which get revealed from time to time. In this work, in silico scientific findings are presented, which are indicative of the potential for the use of the LL-37 human anti-microbial peptide as a therapeutic against SARS-CoV-2. This indication is based on the high structural similarity of LL-37 to the N-terminal helix, with which the virus interacts, of the receptor for SARS-CoV-2, Angiotensin Converting Enzyme 2. Moreover, there is positive prediction of binding of LL-37 to the receptor-binding domain of SARS-CoV-2; this is the first study to have described this interaction. In silico data on the safety of LL-37 are also reported. As Vitamin D is known to upregulate the expression of LL-37, the vitamin is a candidate preventive molecule. This work provides the possible basis for an inverse correlation between Vitamin D levels in the body and the severity of or susceptibility to Covid-19, as widely reported in literature. With the scientific link put forth herein, Vitamin D could be used at an effective, medically prescribed, safe dose as a preventive. The information in this report would be valuable in bolstering the worldwide efforts to eliminate the pandemic as early as possible.
Collapse
Affiliation(s)
- Kiran Bharat Lokhande
- Bioinformatics Research Laboratory, Dr. D.Y. Patil Biotechnology and Bioinformatics InstitutePuneMaharashtraIndia
- Dr. D.Y. Patil Biotechnology and Bioinformatics Institute, Dr. D.Y. Patil Vidyapeeth, Bangalore‐Mumbai HighwayPuneMaharashtraIndia
| | - Tanushree Banerjee
- Dr. D.Y. Patil Biotechnology and Bioinformatics Institute, Dr. D.Y. Patil Vidyapeeth, Bangalore‐Mumbai HighwayPuneMaharashtraIndia
- Molecular Neuroscience Research Laboratory, Dr. D.Y. Patil Biotechnology and Bioinformatics InstitutePuneMaharashtraIndia
| | - Kakumani Venkateswara Swamy
- MIT School of Bioengineering Sciences & Research, A Constituent Unit of MIT ArtDesign and Technology UniversityPuneMaharashtraIndia
| | - Payel Ghosh
- Bioinformatics Centre, Savitribai Phule Pune UniversityPuneMaharashtraIndia
| | - Manisha Deshpande
- Dr. D.Y. Patil Biotechnology and Bioinformatics Institute, Dr. D.Y. Patil Vidyapeeth, Bangalore‐Mumbai HighwayPuneMaharashtraIndia
| |
Collapse
|
34
|
Gupta S, Azadvari N, Hosseinzadeh P. Design of Protein Segments and Peptides for Binding to Protein Targets. BIODESIGN RESEARCH 2022; 2022:9783197. [PMID: 37850124 PMCID: PMC10521657 DOI: 10.34133/2022/9783197] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2021] [Accepted: 03/16/2022] [Indexed: 10/19/2023] Open
Abstract
Recent years have witnessed a rise in methods for accurate prediction of structure and design of novel functional proteins. Design of functional protein fragments and peptides occupy a small, albeit unique, space within the general field of protein design. While the smaller size of these peptides allows for more exhaustive computational methods, flexibility in their structure and sparsity of data compared to proteins, as well as presence of noncanonical building blocks, add additional challenges to their design. This review summarizes the current advances in the design of protein fragments and peptides for binding to targets and discusses the challenges in the field, with an eye toward future directions.
Collapse
Affiliation(s)
- Suchetana Gupta
- Knight Campus Center for Accelerating Scientific Impact, University of Oregon, Eugene OR 97403, USA
| | - Noora Azadvari
- Knight Campus Center for Accelerating Scientific Impact, University of Oregon, Eugene OR 97403, USA
| | - Parisa Hosseinzadeh
- Knight Campus Center for Accelerating Scientific Impact, University of Oregon, Eugene OR 97403, USA
| |
Collapse
|
35
|
MLCPP 2.0: An updated cell-penetrating peptides and their uptake efficiency predictor. J Mol Biol 2022; 434:167604. [DOI: 10.1016/j.jmb.2022.167604] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 04/03/2022] [Accepted: 04/19/2022] [Indexed: 12/12/2022]
|
36
|
Zhou H, Wang H, Ding Y, Tang J. Multivariate Information Fusion for Identifying Antifungal Peptides with
Hilbert-Schmidt Independence Criterion. Curr Bioinform 2022. [DOI: 10.2174/1574893616666210727161003] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
Antifungal Peptides (AFP) have been found to be effective against many fungal
infections.
Objective:
However, it is difficult to identify AFP. Therefore, it is great practical significance to identify
AFP via machine learning methods (with sequence information).
Method:
In this study, a Multi-Kernel Support Vector Machine (MKSVM) with Hilbert-Schmidt Independence
Criterion (HSIC) is proposed. Proteins are encoded with five types of features (188-bit,
AAC, ASDC, CKSAAP, DPC), and then construct kernels using Gaussian kernel function. HSIC are
used to combine kernels and multi-kernel SVM model is built.
Results:
Our model performed well on three AFPs datasets and the performance is better than or comparable
to other state-of-art predictive models.
Conclusion:
Our method will be a useful tool for identifying antifungal peptides.
Collapse
Affiliation(s)
- Haohao Zhou
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin,
300354, China
| | - Hao Wang
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin,
300354, China
| | - Yijie Ding
- School of Electronic and Information Engineering, Suzhou University of Science and Technology, Suzhou,
215009, China
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of
China, Quzhou, 324000, China
| | - Jijun Tang
- Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, USA
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055,
China
| |
Collapse
|
37
|
Khandelwal R, Sharma AK, Biswa BB, Sharma Y. Extracellular Secretagogin is internalized into the cells through endocytosis. FEBS J 2021; 289:3183-3204. [PMID: 34967502 DOI: 10.1111/febs.16338] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Accepted: 11/29/2021] [Indexed: 11/29/2022]
Abstract
Secretagogin (SCGN) is a calcium-sensor protein with a regulatory role in glucose metabolism and the secretion of several peptide hormones. Many, but not all, functions of SCGN can be explained by its intracellular manifestation. Despite early data on SCGN secretion, the secretory mechanism, biological fate, physiological implications, and trans-cellular signaling of extracellular SCGN remain unknown. We here report that extracellular SCGN is readily internalized into the C2C12 cells in an energy-dependent manner. Using endocytosis inhibitors, we demonstrate that SCGN internalizes via clathrin-mediated endocytosis, following which, SCGN localizes largely in the cytosol. Exogenous SCGN treatment induces a global proteomic reprogramming in C2C12 cells. Gene ontology search suggests that SCGN-induced proteomic reprogramming favors protein synthesis and cellular growth. We thus validated the cell proliferative action of SCGN using C2C12, HepG2, and NIH-3T3 cell lines. Based on the data, we propose that circulatory SCGN is internalized into the target cells and modulates the expression of cell growth-related proteins. The work suggests that extracellular SCGN is a functional protein, which communicates with specific cell types and directly modulates cell proliferation.
Collapse
Affiliation(s)
- Radhika Khandelwal
- CSIR-Centre for Cellular and Molecular Biology (CCMB), Uppal Road, Hyderabad, 500 007, India.,Academy of Scientific and Innovative Research (AcSIR), New Delhi, India
| | - Anand Kumar Sharma
- CSIR-Centre for Cellular and Molecular Biology (CCMB), Uppal Road, Hyderabad, 500 007, India
| | - Bhim B Biswa
- CSIR-Centre for Cellular and Molecular Biology (CCMB), Uppal Road, Hyderabad, 500 007, India
| | - Yogendra Sharma
- CSIR-Centre for Cellular and Molecular Biology (CCMB), Uppal Road, Hyderabad, 500 007, India.,Academy of Scientific and Innovative Research (AcSIR), New Delhi, India.,Indian Institute of Science Education and Research (IISER), Berhampur, Odisha, 760010, India
| |
Collapse
|
38
|
Guo Y, Ju Y, Chen D, Wang L. Research on the Computational Prediction of Essential Genes. Front Cell Dev Biol 2021; 9:803608. [PMID: 34938741 PMCID: PMC8685449 DOI: 10.3389/fcell.2021.803608] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2021] [Accepted: 11/22/2021] [Indexed: 11/19/2022] Open
Abstract
Genes, the nucleotide sequences that encode a polypeptide chain or functional RNA, are the basic genetic unit controlling biological traits. They are the guarantee of the basic structures and functions in organisms, and they store information related to biological factors and processes such as blood type, gestation, growth, and apoptosis. The environment and genetics jointly affect important physiological processes such as reproduction, cell division, and protein synthesis. Genes are related to a wide range of phenomena including growth, decline, illness, aging, and death. During the evolution of organisms, there is a class of genes that exist in a conserved form in multiple species. These genes are often located on the dominant strand of DNA and tend to have higher expression levels. The protein encoded by it usually either performs very important functions or is responsible for maintaining and repairing these essential functions. Such genes are called persistent genes. Among them, the irreplaceable part of the body’s life activities is the essential gene. For example, when starch is the only source of energy, the genes related to starch digestion are essential genes. Without them, the organism will die because it cannot obtain enough energy to maintain basic functions. The function of the proteins encoded by these genes is thought to be fundamental to life. Nowadays, DNA can be extracted from blood, saliva, or tissue cells for genetic testing, and detailed genetic information can be obtained using the most advanced scientific instruments and technologies. The information gained from genetic testing is useful to assess the potential risks of disease, and to help determine the prognosis and development of diseases. Such information is also useful for developing personalized medication and providing targeted health guidance to improve the quality of life. Therefore, it is of great theoretical and practical significance to identify important and essential genes. In this paper, the research status of essential genes and the essential genome database of bacteria are reviewed, the computational prediction method of essential genes based on communication coding theory is expounded, and the significance and practical application value of essential genes are discussed.
Collapse
Affiliation(s)
- Yuxin Guo
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China.,Key Laboratory of Computational Science and Application of Hainan Province, Haikou, China.,Key Laboratory of Data Science and Intelligence Education, Hainan Normal University, Ministry of Education, Haikou, China.,School of Mathematics and Statistics, Hainan Normal University, Haikou, China
| | - Ying Ju
- School of Informatics, Xiamen University, Xiamen, China
| | - Dong Chen
- College of Electrical and Information Engineering, Quzhou University, Quzhou, China
| | - Lihong Wang
- Beidahuang Industry Group General Hospital, Harbin, China
| |
Collapse
|
39
|
Sebák F, Horváth LB, Kovács D, Szolomájer J, Tóth GK, Babiczky Á, Bősze S, Bodor A. Novel Lysine-Rich Delivery Peptides of Plant Origin ERD and Human S100: The Effect of Carboxyfluorescein Conjugation, Influence of Aromatic and Proline Residues, Cellular Internalization, and Penetration Ability. ACS OMEGA 2021; 6:34470-34484. [PMID: 34963932 PMCID: PMC8697381 DOI: 10.1021/acsomega.1c04637] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Accepted: 10/25/2021] [Indexed: 06/14/2023]
Abstract
The need for novel drug delivery peptides is an important issue of the modern pharmaceutical research. Here, we test K-rich peptides from plant dehydrin ERD14 (ERD-A, ERD-B, and ERD-C) and the C-terminal CPP-resembling region of S100A4 (S100) using the 5(6)-carboxyfluorescein (Cf) tag at the N-terminus. Via a combined pH-dependent NMR and fluorescence study, we analyze the effect of the Cf conjugation/modification on the structural behavior, separately investigating the (5)-Cf and (6)-Cf forms. Flow cytometry results show that all peptides internalize; however, there is a slight difference between the cellular internalization of (5)- and (6)-Cf-peptides. We indicate the possible importance of residues with an aromatic sidechain and proline. We prove that ERD-A localizes mostly in the cytosol, ERD-B and S100 have partial colocalization with lysosomal staining, and ERD-C mainly localizes within vesicle-like compartments, while the uptake mechanism mainly occurs through energy-dependent paths.
Collapse
Affiliation(s)
- Fanni Sebák
- Institute
of Chemistry, ELTE−Eötvös
Loránd University, Pázmány Péter sétány 1/a, H-1117 Budapest, Hungary
- Doctoral
School of Pharmaceutical Sciences, Semmelweis
University, Üllői
út 26, H-1085 Budapest, Hungary
| | - Lilla Borbála Horváth
- ELKH-ELTE
Research Group of Peptide Chemistry, Eötvös Loránd
Research Network, Eötvös Loránd
University, Pázmány Péter sétány 1/a, H-1117 Budapest, Hungary
- National
Public Health Center, Albert Flórián út 2-6, Budapest H-1097, Hungary
- Hevesy
György PhD School of Chemistry, ELTE
Eötvös Loránd University, Pázmány Péter sétány
1/a, H-1117 Budapest, Hungary
| | - Dániel Kovács
- Institute
of Chemistry, ELTE−Eötvös
Loránd University, Pázmány Péter sétány 1/a, H-1117 Budapest, Hungary
- Hevesy
György PhD School of Chemistry, ELTE
Eötvös Loránd University, Pázmány Péter sétány
1/a, H-1117 Budapest, Hungary
| | - János Szolomájer
- Department
of Medical Chemistry, University of Szeged, Dóm tér 8, H-6720 Szeged, Hungary
| | - Gábor K. Tóth
- Department
of Medical Chemistry, University of Szeged, Dóm tér 8, H-6720 Szeged, Hungary
| | - Ákos Babiczky
- Institute
of Cognitive Neuroscience and Psychology, Research Centre for Natural Sciences, Magyar tudósok körútja 2, H-1117 Budapest, Hungary
- Doctoral
School of Psychology/Cognitive Science, Budapest University of Technology and Economics, Műegyetem rakpart 3, H-1111 Budapest, Hungary
| | - Szilvia Bősze
- ELKH-ELTE
Research Group of Peptide Chemistry, Eötvös Loránd
Research Network, Eötvös Loránd
University, Pázmány Péter sétány 1/a, H-1117 Budapest, Hungary
- National
Public Health Center, Albert Flórián út 2-6, Budapest H-1097, Hungary
| | - Andrea Bodor
- Institute
of Chemistry, ELTE−Eötvös
Loránd University, Pázmány Péter sétány 1/a, H-1117 Budapest, Hungary
| |
Collapse
|
40
|
Jiao S, Zou Q, Guo H, Shi L. iTTCA-RF: a random forest predictor for tumor T cell antigens. J Transl Med 2021; 19:449. [PMID: 34706730 PMCID: PMC8554859 DOI: 10.1186/s12967-021-03084-x] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Accepted: 09/16/2021] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Cancer is one of the most serious diseases threatening human health. Cancer immunotherapy represents the most promising treatment strategy due to its high efficacy and selectivity and lower side effects compared with traditional treatment. The identification of tumor T cell antigens is one of the most important tasks for antitumor vaccines development and molecular function investigation. Although several machine learning predictors have been developed to identify tumor T cell antigen, more accurate tumor T cell antigen identification by existing methodology is still challenging. METHODS In this study, we used a non-redundant dataset of 592 tumor T cell antigens (positive samples) and 393 tumor T cell antigens (negative samples). Four types feature encoding methods have been studied to build an efficient predictor, including amino acid composition, global protein sequence descriptors and grouped amino acid and peptide composition. To improve the feature representation ability of the hybrid features, we further employed a two-step feature selection technique to search for the optimal feature subset. The final prediction model was constructed using random forest algorithm. RESULTS Finally, the top 263 informative features were selected to train the random forest classifier for detecting tumor T cell antigen peptides. iTTCA-RF provides satisfactory performance, with balanced accuracy, specificity and sensitivity values of 83.71%, 78.73% and 88.69% over tenfold cross-validation as well as 73.14%, 62.67% and 83.61% over independent tests, respectively. The online prediction server was freely accessible at http://lab.malab.cn/~acy/iTTCA . CONCLUSIONS We have proven that the proposed predictor iTTCA-RF is superior to the other latest models, and will hopefully become an effective and useful tool for identifying tumor T cell antigens presented in the context of major histocompatibility complex class I.
Collapse
Affiliation(s)
- Shihu Jiao
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China
| | - Quan Zou
- Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | - Huannan Guo
- Department of Oncology, General Hospital of Heilongjiang Province Land Reclamation Bureau, Harbin, China.
| | - Lei Shi
- Department of Spine Surgery, Changzheng Hospital, Naval Medical University, Shanghai, China.
| |
Collapse
|
41
|
Schissel CK, Mohapatra S, Wolfe JM, Fadzen CM, Bellovoda K, Wu CL, Wood JA, Malmberg AB, Loas A, Gómez-Bombarelli R, Pentelute BL. Deep learning to design nuclear-targeting abiotic miniproteins. Nat Chem 2021; 13:992-1000. [PMID: 34373596 PMCID: PMC8819921 DOI: 10.1038/s41557-021-00766-3] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2020] [Accepted: 07/05/2021] [Indexed: 02/08/2023]
Abstract
There are more amino acid permutations within a 40-residue sequence than atoms on Earth. This vast chemical search space hinders the use of human learning to design functional polymers. Here we show how machine learning enables the de novo design of abiotic nuclear-targeting miniproteins to traffic antisense oligomers to the nucleus of cells. We combined high-throughput experimentation with a directed evolution-inspired deep-learning approach in which the molecular structures of natural and unnatural residues are represented as topological fingerprints. The model is able to predict activities beyond the training dataset, and simultaneously deciphers and visualizes sequence-activity predictions. The predicted miniproteins, termed 'Mach', reach an average mass of 10 kDa, are more effective than any previously known variant in cells and can also deliver proteins into the cytosol. The Mach miniproteins are non-toxic and efficiently deliver antisense cargo in mice. These results demonstrate that deep learning can decipher design principles to generate highly active biomolecules that are unlikely to be discovered by empirical approaches.
Collapse
Affiliation(s)
- Carly K. Schissel
- Massachusetts Institute of Technology, Department of Chemistry, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Somesh Mohapatra
- Massachusetts Institute of Technology, Department of Materials Science and Engineering, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Justin M. Wolfe
- Massachusetts Institute of Technology, Department of Chemistry, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Colin M. Fadzen
- Massachusetts Institute of Technology, Department of Chemistry, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Kamela Bellovoda
- Sarepta Therapeutics, 215 First Street, Cambridge, MA 02142, USA
| | - Chia-Ling Wu
- Sarepta Therapeutics, 215 First Street, Cambridge, MA 02142, USA
| | - Jenna A. Wood
- Sarepta Therapeutics, 215 First Street, Cambridge, MA 02142, USA
| | | | - Andrei Loas
- Massachusetts Institute of Technology, Department of Chemistry, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Rafael Gómez-Bombarelli
- Massachusetts Institute of Technology, Department of Materials Science and Engineering, 77 Massachusetts Avenue, Cambridge, MA 02139, USA,Correspondence to: ,
| | - Bradley L. Pentelute
- Massachusetts Institute of Technology, Department of Chemistry, 77 Massachusetts Avenue, Cambridge, MA 02139, USA,The Koch Institute for Integrative Cancer Research, Massachusetts Institute of Technology, 500 Main Street, Cambridge, MA 02142, USA,Center for Environmental Health Sciences, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA,Broad Institute of MIT and Harvard, 415 Main Street, Cambridge, MA 02142, USA,Correspondence to: ,
| |
Collapse
|
42
|
Xue Y, Ye X, Wei L, Zhang X, Sakurai T, Wei L. Better Performance with Transformer: CPPFormer in precise prediction of cell-Penetrating Peptides. Curr Med Chem 2021; 29:881-893. [PMID: 34544332 DOI: 10.2174/0929867328666210920103140] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2021] [Revised: 07/28/2021] [Accepted: 08/07/2021] [Indexed: 11/22/2022]
Abstract
With its superior performance, the Transformer model, which is based on the 'Encoder-Decoder' paradigm, has become the mainstream in natural language processing. On the other hand, bioinformatics has embraced machine learning and made great progress in drug design and protein property prediction. Cell-penetrating peptides (CPPs) are one kind of permeable protein that is convenient as a kind of 'postman' in drug penetration tasks. However, a small number of CPPs have been discovered by research, let alone practical applications in drug permeability. Therefore, correctly identifying the CPPs has opened up a new way to take macromolecules into cells without other potentially harmful materials in the drug. Most of the previous work only uses trivial machine learning techniques and hand-crafted features to construct a simple classifier. In CPPFormer, we learn from the idea of implementing the attention structure of Transformer, rebuilding the network based on the characteristics of CPPs according to its short length, and using an automatic feature extractor with a few manual engineered features to co-direct the predicted results. Compared to all previous methods and other classic text classification models, the empirical result has shown that our proposed deep model-based method has achieved the best performance of 92.16% accuracy in the CPP924 dataset and has passed various index tests.
Collapse
Affiliation(s)
- Yuyang Xue
- Department of Computer Science, University of Tsukuba, Tsukuba. Japan
| | - Xiucai Ye
- Department of Computer Science, University of Tsukuba, Tsukuba. Japan
| | - Lesong Wei
- Department of Computer Science, University of Tsukuba, Tsukuba. Japan
| | - Xin Zhang
- School of Software, Shandong University, Jinan. China
| | - Tetsuya Sakurai
- Department of Computer Science, University of Tsukuba, Tsukuba. Japan
| | - Leyi Wei
- School of Software, Shandong University, Jinan. China
| |
Collapse
|
43
|
Porosk L, Põhako K, Arukuusk P, Langel Ü. Cell-Penetrating Peptides Predicted From CASC3, AKIP1, and AHRR Proteins. Front Pharmacol 2021; 12:716226. [PMID: 34504427 PMCID: PMC8421526 DOI: 10.3389/fphar.2021.716226] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 07/26/2021] [Indexed: 11/13/2022] Open
Abstract
Peptides can be used as research tools and for diagnostic or therapeutic applications. Peptides, alongside small molecules and antibodies, are used and are gaining further interest as protein-protein interaction (PPI) modulators. Peptides have high target specificity and high affinity, but, unlike small molecule modulators, they are not able to cross the cell membranes to reach their intracellular targets. To overcome this limitation, the special property of the cell-penetrating peptides (CPPs) could benefit their cause. CPPs are a class of peptides that can enter the cells and with them also deliver the attached cargoes. Today, with the advancement of in silico prediction tools and the availability of protein databases, designing new and multifunctional peptides that are able to reach intracellular targets and inhibit certain cellular processes in a very specific manner is reachable. Although there are several efficient CPP sequences already known, the discovery of new CPPs is crucial for the development of efficient delivery methods for both biotechnological and therapeutic applications. In this work, we chose 10 human nuclear proteins from which we predicted new potential CPP sequences by using three different CPP predictors: cell-penetrating peptide prediction tool, CellPPD, and SkipCPP-Pred. From each protein, one predicted CPP sequence was synthesized and its internalization into cells was assessed. Out of the tested sequences, three peptides displayed features characteristic to CPPs. These peptides and also the predicted peptide sequences could be used to design and modify new CPPs. In this work, we show that we can use protein sequences as input for generating new peptides with cell internalization properties. Three new CPPs, AHRR8-24, CASC3251-264, and AKIP127-37, can be further used for the delivery of other cargoes or designed into multifunctional peptides with capability of internalizing cells.
Collapse
Affiliation(s)
- Ly Porosk
- Institute of Technology, University of Tartu, Tartu, Estonia
| | - Kaisa Põhako
- Institute of Technology, University of Tartu, Tartu, Estonia
| | - Piret Arukuusk
- Institute of Technology, University of Tartu, Tartu, Estonia
| | - Ülo Langel
- Institute of Technology, University of Tartu, Tartu, Estonia.,Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| |
Collapse
|
44
|
Guo X, Chen L, Wang L, Geng J, Wang T, Hu J, Li J, Liu C, Wang H. In silico identification and experimental validation of cellular uptake and intracellular labeling by a new cell penetrating peptide derived from CDN1. Drug Deliv 2021; 28:1722-1736. [PMID: 34463179 PMCID: PMC8409945 DOI: 10.1080/10717544.2021.1963352] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Bioactive therapeutic molecules are generally impermeable to the cell membrane, hindering their utility and efficacy. A group of peptides called cell-penetrating peptides (CPPs) were found to have the capability of transporting different types of cargo molecules across the cell membrane. Here, we identified a short peptide named P2, which has a higher proportion of basic residues than the CDN1 (cyclin-dependent kinase inhibitor 1) protein it is derived from, and we used bioinformatic analysis and experimental validation to confirm the penetration property of peptide P2. We found that peptide P2 can efficiently enter different cell lines in a concentration-dependent manner. The endocytosis pathway, especially receptor-related endocytosis, may be involved in the process of P2 penetration. Our data also showed that peptide P2 is safe in cultured cell lines and red blood cells. Lastly, peptide P2 can efficiently deliver self-labeling protein HaloTag into cells for imaging. Our study illustrates that peptide P2 is a promising imaging agent delivery vehicle for future applications.
Collapse
Affiliation(s)
- Xiangli Guo
- Department of Pathology and Immunology, Medical School, China Three Gorges University, Yichang, China.,Hubei Key Lab of Tumor Microenvironment and Immunotherapy, China Three Gorges University, Yichang, China
| | - Linlin Chen
- Hubei Key Lab of Tumor Microenvironment and Immunotherapy, China Three Gorges University, Yichang, China.,Affiliated Ren He Hospital of China Three Gorges University, Yichang, China
| | - Lidan Wang
- Department of Pathology and Immunology, Medical School, China Three Gorges University, Yichang, China.,Hubei Key Lab of Tumor Microenvironment and Immunotherapy, China Three Gorges University, Yichang, China
| | - Jingping Geng
- Department of Pathology and Immunology, Medical School, China Three Gorges University, Yichang, China.,Hubei Key Lab of Tumor Microenvironment and Immunotherapy, China Three Gorges University, Yichang, China
| | - Tao Wang
- The First Clinical Medical College of China Three Gorges University, Yichang, China
| | - Jixiong Hu
- College of Life Science, Yangtze University, Jingzhou, China
| | - Jason Li
- Department of Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Changbai Liu
- Hubei Key Lab of Tumor Microenvironment and Immunotherapy, China Three Gorges University, Yichang, China
| | - Hu Wang
- Department of Pathology and Immunology, Medical School, China Three Gorges University, Yichang, China.,Lead Contact
| |
Collapse
|
45
|
Su R, Hu J, Zou Q, Manavalan B, Wei L. Empirical comparison and analysis of web-based cell-penetrating peptide prediction tools. Brief Bioinform 2021; 21:408-420. [PMID: 30649170 DOI: 10.1093/bib/bby124] [Citation(s) in RCA: 107] [Impact Index Per Article: 26.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2018] [Revised: 11/30/2018] [Accepted: 11/30/2018] [Indexed: 12/16/2022] Open
Abstract
Cell-penetrating peptides (CPPs) facilitate the delivery of therapeutically relevant molecules, including DNA, proteins and oligonucleotides, into cells both in vitro and in vivo. This unique ability explores the possibility of CPPs as therapeutic delivery and its potential applications in clinical therapy. Over the last few decades, a number of machine learning (ML)-based prediction tools have been developed, and some of them are freely available as web portals. However, the predictions produced by various tools are difficult to quantify and compare. In particular, there is no systematic comparison of the web-based prediction tools in performance, especially in practical applications. In this work, we provide a comprehensive review on the biological importance of CPPs, CPP database and existing ML-based methods for CPP prediction. To evaluate current prediction tools, we conducted a comparative study and analyzed a total of 12 models from 6 publicly available CPP prediction tools on 2 benchmark validation sets of CPPs and non-CPPs. Our benchmarking results demonstrated that a model from the KELM-CPPpred, namely KELM-hybrid-AAC, showed a significant improvement in overall performance, when compared to the other 11 prediction models. Moreover, through a length-dependency analysis, we find that existing prediction tools tend to more accurately predict CPPs and non-CPPs with the length of 20-25 residues long than peptides in other length ranges.
Collapse
Affiliation(s)
- Ran Su
- College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Jie Hu
- College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China
| | | | - Leyi Wei
- College of Intelligence and Computing, Tianjin University, Tianjin, China
| |
Collapse
|
46
|
B3Pred: A Random-Forest-Based Method for Predicting and Designing Blood-Brain Barrier Penetrating Peptides. Pharmaceutics 2021; 13:pharmaceutics13081237. [PMID: 34452198 PMCID: PMC8399279 DOI: 10.3390/pharmaceutics13081237] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2021] [Revised: 07/07/2021] [Accepted: 07/14/2021] [Indexed: 12/14/2022] Open
Abstract
The blood–brain barrier is a major obstacle in treating brain-related disorders, as it does not allow the delivery of drugs into the brain. We developed a method for predicting blood–brain barrier penetrating peptides to facilitate drug delivery into the brain. These blood–brain barrier penetrating peptides (B3PPs) can act as therapeutics, as well as drug delivery agents. We trained, tested, and evaluated our models on blood–brain barrier peptides obtained from the B3Pdb database. First, we computed a wide range of peptide features. Then, we selected relevant peptide features. Finally, we developed numerous machine-learning-based models for predicting blood–brain barrier peptides using the selected features. The random-forest-based model performed the best with respect to the top 80 selected features and achieved a maximal 85.08% accuracy with an AUROC of 0.93. We also developed a webserver, B3pred, that implements our best models. It has three major modules that allow users to predict/design B3PPs and scan B3PPs in a protein sequence.
Collapse
|
47
|
Nasiri F, Atanaki FF, Behrouzi S, Kavousi K, Bagheri M. CpACpP: In Silico Cell-Penetrating Anticancer Peptide Prediction Using a Novel Bioinformatics Framework. ACS OMEGA 2021; 6:19846-19859. [PMID: 34368571 PMCID: PMC8340416 DOI: 10.1021/acsomega.1c02569] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Accepted: 07/13/2021] [Indexed: 05/12/2023]
Abstract
Cell-penetrating anticancer peptides (Cp-ACPs) are considered promising candidates in solid tumor and hematologic cancer therapies. Current approaches for the design and discovery of Cp-ACPs trust the expensive high-throughput screenings that often give rise to multiple obstacles, including instrumentation adaptation and experimental handling. The application of machine learning (ML) tools developed for peptide activity prediction is importantly of growing interest. In this study, we applied the random forest (RF)-, support vector machine (SVM)-, and eXtreme gradient boosting (XGBoost)-based algorithms to predict the active Cp-ACPs using an experimentally validated data set. The model, CpACpP, was developed on the basis of two independent cell-penetrating peptide (CPP) and anticancer peptide (ACP) subpredictors. Various compositional and physiochemical-based features were combined or selected using the multilayered recursive feature elimination (RFE) method for both data sets. Our results showed that the ACP subclassifiers obtain a mean performance accuracy (ACC) of 0.98 with an area under curve (AUC) ≈ 0.98 vis-à-vis the CPP predictors displaying relevant values of ∼0.94 and ∼0.95 via the hybrid-based features and independent data sets, respectively. Also, the predicting evaluation of Cp-ACPs gave accuracies of ∼0.79 and 0.89 on a series of independent sequences by applying our CPP and ACP classifiers, respectively, which leaves the performance of our predictors better than the earlier reported ACPred, mACPpred, MLCPP, and CPPred-RF. The described consensus-based fusion method additionally reached an AUC of 0.94 for the prediction of Cp-ACP (http://cbb1.ut.ac.ir/CpACpP/Index).
Collapse
Affiliation(s)
- Farid Nasiri
- Peptide
Chemistry Laboratory, Department of Biochemistry, Institute of Biochemistry
and Biophysics (IBB), University of Tehran, Tehran 14176-14335, Iran
| | - Fereshteh Fallah Atanaki
- Laboratory
of Complex Biological Systems and Bioinformatics (CBB), Department
of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Saman Behrouzi
- Laboratory
of Complex Biological Systems and Bioinformatics (CBB), Department
of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Kaveh Kavousi
- Laboratory
of Complex Biological Systems and Bioinformatics (CBB), Department
of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran 14176-14411, Iran
| | - Mojtaba Bagheri
- Peptide
Chemistry Laboratory, Department of Biochemistry, Institute of Biochemistry
and Biophysics (IBB), University of Tehran, Tehran 14176-14335, Iran
| |
Collapse
|
48
|
Chen L, Guo X, Wang L, Geng J, Wu J, Hu B, Wang T, Li J, Liu C, Wang H. In silico identification and experimental validation of cellular uptake by a new cell penetrating peptide P1 derived from MARCKS. Drug Deliv 2021; 28:1637-1648. [PMID: 34338123 PMCID: PMC8330795 DOI: 10.1080/10717544.2021.1960922] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Viral vectors for vaccine delivery are challenged by recently reported safety issues like immunogenicity and risk for cancer development, and thus there is a growing need for the development of non-viral vectors. Cell penetrating peptides (CPPs) are non-viral vectors that can enter plasma membranes efficiently and deliver a broad range of cargoes. Our bioinformatic prediction and wet-lab validation data suggested that peptide P1 derived from MARCKS protein phosphorylation site domain is a new potential CPP candidate. We found that peptide P1 can efficiently internalize into various cell lines in a concentration-dependent manner. Receptor-mediated endocytosis pathway is the major mechanism of P1 penetration, although P1 also directly penetrates the plasma membrane. We also found that peptide P1 has low cytotoxicity in cultured cell lines as well as mouse red blood cells. Furthermore, peptide P1 not only can enter into cultured cells itself, but it also can interact with plasmid DNA and mediate the functional delivery of plasmid DNA into cultured cells, even in hard-to-transfect cells. Combined, these findings indicate that P1 may be a promising vector for efficient intracellular delivery of bioactive cargos.
Collapse
Affiliation(s)
- Linlin Chen
- Department of Pathology and Immunology, Medical School, China Three Gorges University, Yichang, China.,Hubei Key Laboratory of Tumor Microenvironment and Immunotherapy, China Three Gorges University, Yichang, China.,Affiliated Ren He Hospital of China Three Gorges University, Yichang, China
| | - Xiangli Guo
- Department of Pathology and Immunology, Medical School, China Three Gorges University, Yichang, China.,Hubei Key Laboratory of Tumor Microenvironment and Immunotherapy, China Three Gorges University, Yichang, China
| | - Lidan Wang
- Department of Pathology and Immunology, Medical School, China Three Gorges University, Yichang, China.,Hubei Key Laboratory of Tumor Microenvironment and Immunotherapy, China Three Gorges University, Yichang, China
| | - Jingping Geng
- Department of Pathology and Immunology, Medical School, China Three Gorges University, Yichang, China.,Hubei Key Laboratory of Tumor Microenvironment and Immunotherapy, China Three Gorges University, Yichang, China
| | - Jiao Wu
- Affiliated Ren He Hospital of China Three Gorges University, Yichang, China
| | - Bin Hu
- Affiliated Ren He Hospital of China Three Gorges University, Yichang, China
| | - Tao Wang
- The First Clinical Medical College of China Three Gorges University, Yichang, China
| | - Jason Li
- Department of Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Changbai Liu
- Department of Pathology and Immunology, Medical School, China Three Gorges University, Yichang, China.,Hubei Key Laboratory of Tumor Microenvironment and Immunotherapy, China Three Gorges University, Yichang, China
| | - Hu Wang
- Department of Pathology and Immunology, Medical School, China Three Gorges University, Yichang, China
| |
Collapse
|
49
|
Zhou J, Li Y, Huang W, Shi W, Qian H. Source and exploration of the peptides used to construct peptide-drug conjugates. Eur J Med Chem 2021; 224:113712. [PMID: 34303870 DOI: 10.1016/j.ejmech.2021.113712] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2021] [Revised: 07/12/2021] [Accepted: 07/17/2021] [Indexed: 12/16/2022]
Abstract
Peptide-drug conjugates (PDCs) are a class of novel molecules widely designed and synthesized for delivering payload drugs. The peptide part plays a vital role in the whole molecule, because they determine the ability of the molecules to penetrate the membrane and target to the specific targets. Here, we introduce the source of different kinds of cell-penetrating peptides (CPPs) and cell-targeting peptides (CTPs) that have been used or could be used in constructing PDCs as well as their latest application in delivering drugs. What's more, the approaches of developing CPPs and CTPs and the techniques to discover novel peptides are focused on and summarized in the review. This review aims to help relevant researchers fast understand the research status of peptides in PDCs and carry forward the process of novel peptides discovery.
Collapse
Affiliation(s)
- Jiaqi Zhou
- Centre of Drug Discovery, State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing, 210009, PR China
| | - Yuanyuan Li
- Centre of Drug Discovery, State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing, 210009, PR China
| | - Wenlong Huang
- Centre of Drug Discovery, State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing, 210009, PR China; Jiangsu Key Laboratory of Drug Discovery for Metabolic Disease, China Pharmaceutical University, 24 Tongjiaxiang, Nanjing, 210009, PR China
| | - Wei Shi
- Centre of Drug Discovery, State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing, 210009, PR China.
| | - Hai Qian
- Centre of Drug Discovery, State Key Laboratory of Natural Medicines, China Pharmaceutical University, Nanjing, 210009, PR China; Jiangsu Key Laboratory of Drug Discovery for Metabolic Disease, China Pharmaceutical University, 24 Tongjiaxiang, Nanjing, 210009, PR China.
| |
Collapse
|
50
|
Holl NJ, Lee HJ, Huang YW. Evolutionary Timeline of Genetic Delivery and Gene Therapy. Curr Gene Ther 2021; 21:89-111. [PMID: 33292120 DOI: 10.2174/1566523220666201208092517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 11/17/2020] [Accepted: 11/22/2020] [Indexed: 11/22/2022]
Abstract
There are more than 3,500 genes that are being linked to hereditary diseases or correlated with an elevated risk of certain illnesses. As an alternative to conventional treatments with small molecule drugs, gene therapy has arisen as an effective treatment with the potential to not just alleviate disease conditions but also cure them completely. In order for these treatment regimens to work, genes or editing tools intended to correct diseased genetic material must be efficiently delivered to target sites. There have been many techniques developed to achieve such a goal. In this article, we systematically review a variety of gene delivery and therapy methods that include physical methods, chemical and biochemical methods, viral methods, and genome editing. We discuss their historical discovery, mechanisms, advantages, limitations, safety, and perspectives.
Collapse
Affiliation(s)
- Natalie J Holl
- Department of Biological Sciences, College of Arts, Sciences, and Business, Missouri University of Science and Technology, Rolla, MO 65409, United States
| | - Han-Jung Lee
- Department of Natural Resources and Environmental Studies, College of Environmental Studies, National Dong Hwa University, Hualien 974301, Taiwan
| | - Yue-Wern Huang
- Department of Biological Sciences, College of Arts, Sciences, and Business, Missouri University of Science and Technology, Rolla, MO 65409, United States
| |
Collapse
|