Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Wei L, Tang J, Zou Q. SkipCPP-Pred: an improved and promising sequence-based predictor for predicting cell-penetrating peptides. BMC Genomics 2017. [PMID: 29513192 PMCID: PMC5657092 DOI: 10.1186/s12864-017-4128-1] [Citation(s) in RCA: 76] [Impact Index Per Article: 9.5] [Indexed: 02/08/2023] Open

Cited by Other Article(s)

Imre A, Balogh B, Mándity I. GraphCPP: The new state-of-the-art method for cell-penetrating peptide prediction via graph neural networks. Br J Pharmacol 2025;182:495-509. [PMID: 39568115 DOI: 10.1111/bph.17388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/01/2023] [Revised: 08/07/2024] [Accepted: 10/07/2024] [Indexed: 11/22/2024] Open

Kumar N, Du Z, Li Y. pLM4CPPs: Protein Language Model-Based Predictor for Cell Penetrating Peptides. J Chem Inf Model 2025. [PMID: 39878455 DOI: 10.1021/acs.jcim.4c01338] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/31/2025]

Abstract

Cell-penetrating peptides (CPPs) are short peptides capable of penetrating cell membranes, making them valuable for drug delivery and intracellular targeting. Accurate prediction of CPPs can streamline experimental validation in the lab. This study aims to assess pretrained protein language models (pLMs) for their effectiveness in representing CPPs and develop a reliable model for CPP classification. We evaluated peptide embeddings generated from BEPLER, CPCProt, SeqVec, various ESM variants (ESM, ESM-2 with expanded feature set, ESM-1b, and ESM-1v), ProtT5-XL UniRef50, ProtT5-XL BFD, and ProtBERT. We developed pLM4CPPs, a novel deep learning architecture using convolutional neural networks (CNNs) as the classifier for binary classification of CPPs. pLM4CPPs demonstrated superior performance over existing state-of-the-art CPP prediction models, achieving improvements in accuracy (ACC) by 4.9-5.5%, Matthews correlation coefficient (MCC) by 9.3-10.2%, and sensitivity (Sn) by 14.1-19.6%. Among all the tested models, ESM-1280 and ProtT5-XL BFD demonstrated the highest overall performance on the kelm data set. ESM-1280 achieved an ACC of 0.896, an MCC of 0.796, an Sn of 0.844, and a specificity (Sp) of 0.978. ProtT5-XL BFD exhibited superior performance with an ACC of 0.901, an MCC of 0.802, an Sn of 0.885, and an Sp of 0.917. pLM4CPPs combines predictions from multiple models to provide a consensus on whether a given peptide sequence is classified as a CPP or non-CPP. This approach will enhance prediction reliability by leveraging the strengths of each individual model. A user-friendly web server for bioactivity predictions, along with data sets, is available at https://ry2acnp6ep.us-east-1.awsapprunner.com. The source code and protocol for adapting pLM4CPPs can be accessed on GitHub at https://github.com/drkumarnandan/pLM4CPPs. This platform aims to advance CPP prediction and peptide functionality modeling, aiding researchers in exploring peptide functionality effectively.
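
As an illustration of the metrics quoted in this abstract, the following minimal sketch computes ACC, MCC, Sn, and Sp from binary labels and combines several models by a simple majority vote. The abstract does not say how the consensus is formed, so the majority rule, the function names, and the toy labels here are assumptions made for illustration only and are not taken from the pLM4CPPs code.

```python
from math import sqrt


def confusion_counts(y_true, y_pred):
    """Count TP, TN, FP, FN for binary labels (1 = CPP, 0 = non-CPP)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def binary_metrics(y_true, y_pred):
    """Return accuracy, Matthews correlation, sensitivity, and specificity."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    total = tp + tn + fp + fn
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "ACC": (tp + tn) / total if total else 0.0,
        "MCC": (tp * tn - fp * fn) / denom if denom else 0.0,
        "Sn": tp / (tp + fn) if (tp + fn) else 0.0,
        "Sp": tn / (tn + fp) if (tn + fp) else 0.0,
    }


def majority_vote(per_model_preds):
    """Hypothetical consensus rule: a peptide is called a CPP when more
    than half of the models predict 1."""
    n_models = len(per_model_preds)
    return [1 if 2 * sum(votes) > n_models else 0
            for votes in zip(*per_model_preds)]


# Toy labels, invented for this example.
y_true = [1, 1, 1, 0, 0, 0]
model_a = [1, 1, 0, 0, 0, 1]
model_b = [1, 0, 1, 0, 0, 0]
model_c = [1, 1, 1, 0, 1, 0]

consensus = majority_vote([model_a, model_b, model_c])
single = binary_metrics(y_true, model_a)
combined = binary_metrics(y_true, consensus)
```

In this toy case each model makes different mistakes, so the vote recovers every label; real models often share errors, and a consensus then helps less.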

Ramasundaram M, Sohn H, Madhavan T. A bird's-eye view of the biological mechanism and machine learning prediction approaches for cell-penetrating peptides. Front Artif Intell 2025;7:1497307. [PMID: 39839972 PMCID: PMC11747587 DOI: 10.3389/frai.2024.1497307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/16/2024] [Accepted: 12/13/2024] [Indexed: 01/23/2025] Open

Moreno-Vargas LM, Prada-Gracia D. Exploring the Chemical Features and Biomedical Relevance of Cell-Penetrating Peptides. Int J Mol Sci 2024;26:59. [PMID: 39795918 PMCID: PMC11720145 DOI: 10.3390/ijms26010059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/23/2024] [Revised: 11/27/2024] [Accepted: 11/28/2024] [Indexed: 01/13/2025] Open

Zhu L, Chen Z, Yang S. EnDM-CPP: A Multi-view Explainable Framework Based on Deep Learning and Machine Learning for Identifying Cell-Penetrating Peptides with Transformers and Analyzing Sequence Information. Interdiscip Sci 2024:10.1007/s12539-024-00673-4. [PMID: 39714579 DOI: 10.1007/s12539-024-00673-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/04/2024] [Revised: 10/28/2024] [Accepted: 11/01/2024] [Indexed: 12/24/2024]

Weckbecker M, Anžel A, Yang Z, Hattab G. Interpretable molecular encodings and representations for machine learning tasks. Comput Struct Biotechnol J 2024;23:2326-2336. [PMID: 38867722 PMCID: PMC11167246 DOI: 10.1016/j.csbj.2024.05.035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/27/2024] [Revised: 05/13/2024] [Accepted: 05/19/2024] [Indexed: 06/14/2024] Open

Li Y, Li XM, Wei LS, Ye JF. Advancements in mitochondrial-targeted nanotherapeutics: overcoming biological obstacles and optimizing drug delivery. Front Immunol 2024;15:1451989. [PMID: 39483479 PMCID: PMC11524880 DOI: 10.3389/fimmu.2024.1451989] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/20/2024] [Accepted: 09/19/2024] [Indexed: 11/03/2024] Open

Li H, Meng J, Wang Z, Luan Y. misORFPred: A Novel Method to Mine Translatable sORFs in Plant Pri-miRNAs Using Enhanced Scalable k-mer and Dynamic Ensemble Voting Strategy. Interdiscip Sci 2024:10.1007/s12539-024-00661-8. [PMID: 39397199 DOI: 10.1007/s12539-024-00661-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/27/2023] [Revised: 09/18/2024] [Accepted: 09/22/2024] [Indexed: 10/15/2024]

Ma H, Zhou X, Zhang Z, Weng Z, Li G, Zhou Y, Yao Y. AI-Driven Design of Cell-Penetrating Peptides for Therapeutic Biotechnology. Int J Pept Res Ther 2024;30:69. [DOI: 10.1007/s10989-024-10654-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 09/22/2024] [Indexed: 01/05/2025]

Uddin I, Awan HH, Khalid M, Khan S, Akbar S, Sarker MR, Abdolrasol MGM, Alghamdi TAH. A hybrid residue based sequential encoding mechanism with XGBoost improved ensemble model for identifying 5-hydroxymethylcytosine modifications. Sci Rep 2024;14:20819. [PMID: 39242695 PMCID: PMC11379919 DOI: 10.1038/s41598-024-71568-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/06/2024] [Accepted: 08/29/2024] [Indexed: 09/09/2024] Open

Cherene MB, Taveira GB, Almeida-Silva F, da Silva MS, Cavaco MC, da Silva-Ferreira AT, Perales JEA, de Oliveira Carvalho A, Venâncio TM, da Motta OV, Rodrigues R, Castanho MARB, Gomes VM. Structural and Biochemical Characterization of Three Antimicrobial Peptides from Capsicum annuum L. var. annuum Leaves for Anti-Candida Use. Probiotics Antimicrob Proteins 2024;16:1270-1287. [PMID: 37365421 DOI: 10.1007/s12602-023-10112-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Accepted: 06/14/2023] [Indexed: 06/28/2023]

Affiliation(s)

Milena Bellei Cherene Laboratório de Fisiologia e Bioquímica de Microrganismos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
Gabriel Bonan Taveira Laboratório de Fisiologia e Bioquímica de Microrganismos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
Fabricio Almeida-Silva Laboratório de Química e Função de Proteínas e Peptídeos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
Marciele Souza da Silva Laboratório de Fisiologia e Bioquímica de Microrganismos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
Marco Calvinho Cavaco Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina da Universidade de Lisboa, Lisbon, Portugal
André Teixeira da Silva-Ferreira Laboratório de Toxinologia, Instituto Oswaldo Cruz, FIOCRUZ, Rio de Janeiro, RJ, Brazil
Jonas Enrique Aguilar Perales Laboratório de Toxinologia, Instituto Oswaldo Cruz, FIOCRUZ, Rio de Janeiro, RJ, Brazil
André de Oliveira Carvalho Laboratório de Fisiologia e Bioquímica de Microrganismos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
Thiago Motta Venâncio Laboratório de Química e Função de Proteínas e Peptídeos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
Olney Vieira da Motta Laboratório de Sanidade Animal, Centro de Ciências e Tecnologias Agropecuárias, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
Rosana Rodrigues Laboratório de Melhoramento e Genética Vegetal, Centro de Ciências e Tecnologias Agropecuárias, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil
Miguel Augusto Rico Botas Castanho Instituto de Medicina Molecular João Lobo Antunes, Faculdade de Medicina da Universidade de Lisboa, Lisbon, Portugal
Valdirene Moreira Gomes Laboratório de Fisiologia e Bioquímica de Microrganismos, Centro de Biociências e Biotecnologia, Universidade Estadual do Norte Fluminense Darcy Ribeiro, Campos dos Goytacazes, RJ, 28013-602, Brazil.

Zhao F, Qiu J, Xiang D, Jiao P, Cao Y, Xu Q, Qiao D, Xu H, Cao Y. deepAMPNet: a novel antimicrobial peptide predictor employing AlphaFold2 predicted structures and a bi-directional long short-term memory protein language model. PeerJ 2024;12:e17729. [PMID: 39040937 PMCID: PMC11262304 DOI: 10.7717/peerj.17729] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/28/2024] [Accepted: 06/20/2024] [Indexed: 07/24/2024] Open

Serebrennikova M, Grafskaia E, Maltsev D, Ivanova K, Bashkirov P, Kornilov F, Volynsky P, Efremov R, Bocharov E, Lazarev V. TriplEP-CPP: Algorithm for Predicting the Properties of Peptide Sequences. Int J Mol Sci 2024;25:6869. [PMID: 38999985 PMCID: PMC11241344 DOI: 10.3390/ijms25136869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/25/2024] [Revised: 06/18/2024] [Accepted: 06/20/2024] [Indexed: 07/14/2024] Open

Affiliation(s)

Maria Serebrennikova Laboratory of Genetic Engineering, Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow 119435, Russia; Moscow Center for Advanced Studies 20, Kulakova Str., Moscow 123592, Russia
Ekaterina Grafskaia Laboratory of Genetic Engineering, Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow 119435, Russia
Dmitriy Maltsev Federal Center of Brain Research and Neurotechnologies, Federal Medical Biological Agency, Moscow 117997, Russia; Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia; Center for Precision Genome Editing and Genetic Technologies for Biomedicine, Pirogov Russian National Research Medical University, Moscow 117997, Russia
Kseniya Ivanova Laboratory of Genetic Engineering, Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow 119435, Russia; Moscow Center for Advanced Studies 20, Kulakova Str., Moscow 123592, Russia; Research Institute for Systems Biology and Medicine, Moscow 117246, Russia
Pavel Bashkirov Moscow Center for Advanced Studies 20, Kulakova Str., Moscow 123592, Russia; Research Institute for Systems Biology and Medicine, Moscow 117246, Russia
Fedor Kornilov Moscow Center for Advanced Studies 20, Kulakova Str., Moscow 123592, Russia; Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia
Pavel Volynsky Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia; Institute of Cytology, Russian Academy of Sciences, St. Petersburg 194064, Russia
Roman Efremov Moscow Center for Advanced Studies 20, Kulakova Str., Moscow 123592, Russia; Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia
Eduard Bocharov Moscow Center for Advanced Studies 20, Kulakova Str., Moscow 123592, Russia; Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia
Vassili Lazarev Laboratory of Genetic Engineering, Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Moscow 119435, Russia; Moscow Center for Advanced Studies 20, Kulakova Str., Moscow 123592, Russia

Li H, Meng J, Wang Z, Tang Y, Xia S, Wang Y, Qin Z, Luan Y. miPEPPred-FRL: A Novel Method for Predicting Plant MiRNA-Encoded Peptides Using Adaptive Feature Representation Learning. J Chem Inf Model 2024;64:2889-2900. [PMID: 37733290 DOI: 10.1021/acs.jcim.3c01020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/22/2023]

Datta S, Nabeel Asim M, Dengel A, Ahmed S. NTpred: a robust and precise machine learning framework for in silico identification of Tyrosine nitration sites in protein sequences. Brief Funct Genomics 2024;23:163-179. [PMID: 37248673 DOI: 10.1093/bfgp/elad018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/14/2023] [Revised: 04/12/2023] [Accepted: 05/02/2023] [Indexed: 05/31/2023] Open

Abstract

Post-translational modifications (PTMs) either enhance a protein's activity in various sub-cellular processes or degrade its activity, which leads to failure of intracellular processes. Tyrosine nitration (NT) modification degrades a protein's activity, which initiates and propagates various diseases including neurodegenerative, cardiovascular, and autoimmune diseases and carcinogenesis. Identification of NT modification supports development of novel therapies and drug discoveries for associated diseases. Identification of NT modification in biochemical labs is expensive, time-consuming and error-prone. To supplement this process, several computational approaches have been proposed. However, these approaches fail to precisely identify NT modification, due to the extraction of irrelevant, redundant and less discriminative features from protein sequences. This paper presents the NTpred framework that is competent in extracting comprehensive features from raw protein sequences using four different sequence encoders. To reap the benefits of different encoders, it generates four additional feature spaces by fusing different combinations of individual encodings. Furthermore, it eradicates irrelevant and redundant features from eight different feature spaces through a Recursive Feature Elimination process. Selected features of four individual encodings and four feature fusion vectors are used to train eight different Gradient Boosted Tree classifiers. The probability scores from the trained classifiers are utilized to generate a new probabilistic feature space, which is used to train a Logistic Regression classifier. On the BD1 benchmark dataset, the proposed framework outperforms the existing best-performing predictor in 5-fold cross validation and independent test evaluation with combined improvement of 13.7% in MCC and 20.1% in AUC. Similarly, on the BD2 benchmark dataset, the proposed framework outperforms the existing best-performing predictor with combined improvement of 5.3% in MCC and 1.0% in AUC. NTpred is publicly available for further experimentation and predictive use at: https://sds_genetic_analysis.opendfki.de/PredNTS/.

Preto AJ, Caniceiro AB, Duarte F, Fernandes H, Ferreira L, Mourão J, Moreira IS. POSEIDON: Peptidic Objects SEquence-based Interaction with cellular DOmaiNs: a new database and predictor. J Cheminform 2024;16:18. [PMID: 38365724 PMCID: PMC10874016 DOI: 10.1186/s13321-024-00810-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/17/2023] [Accepted: 02/07/2024] [Indexed: 02/18/2024] Open

Abstract

Cell-penetrating peptides (CPPs) are short chains of amino acids that have shown remarkable potential to cross the cell membrane and deliver coupled therapeutic cargoes into cells. Designing and testing different CPPs to target specific cells or tissues is crucial to ensure high delivery efficiency and reduced toxicity. However, in vivo/in vitro testing of various CPPs can be both time-consuming and costly, which has led to interest in computational methodologies, such as Machine Learning (ML) approaches, as faster and cheaper methods for CPP design and uptake prediction. However, most ML models developed to date focus on classification rather than regression techniques, because of the lack of informative quantitative uptake values. To address these challenges, we developed POSEIDON, an open-access and up-to-date curated database that provides experimental quantitative uptake values for over 2,300 entries and physicochemical properties of 1,315 peptides. POSEIDON also offers physicochemical properties, such as cell line, cargo, and sequence, among others. By leveraging this database along with cell line genomic features, we processed a dataset of over 1,200 entries to develop an ML regression CPP uptake predictor. Our results demonstrated that POSEIDON accurately predicted peptide cell line uptake, achieving a Pearson correlation of 0.87, Spearman correlation of 0.88, and R² score of 0.76, on an independent test set. With its comprehensive and novel dataset, along with its potent predictive capabilities, the POSEIDON database and its associated ML predictor signify a significant leap forward in CPP research and development. The POSEIDON database and ML Predictor are available for free and with a user-friendly interface at https://moreiralab.com/resources/poseidon/, making them valuable resources for advancing research on CPP-related topics.
Scientific Contribution Statement: Our research addresses the critical need for more efficient and cost-effective methodologies in Cell-Penetrating Peptide (CPP) research. We introduced POSEIDON, a comprehensive and freely accessible database that delivers quantitative uptake values for over 2,300 entries, along with detailed physicochemical profiles for 1,315 peptides. Recognizing the limitations of current Machine Learning (ML) models for CPP design, our work leveraged the rich dataset provided by POSEIDON to develop a highly accurate ML regression model for predicting CPP uptake.
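
For readers less familiar with the three regression scores reported in this abstract, here is a minimal, dependency-free sketch of how Pearson correlation, Spearman correlation, and the coefficient of determination are commonly computed. The function names and the toy uptake values are invented for illustration; this is not the POSEIDON evaluation code.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation: covariance scaled by both standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def average_ranks(values):
    """1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


# Toy measured vs. predicted uptake values, invented for this example.
measured = [1.0, 2.0, 3.0, 4.0]
predicted = [1.1, 1.9, 3.2, 3.8]

scores = {
    "pearson": pearson(measured, predicted),
    "spearman": spearman(measured, predicted),
    "r2": r2_score(measured, predicted),
}
```

Pearson and Spearman only measure how well predictions track the measurements (linearly or by rank), while R² also penalizes systematic offsets, which is one reason an R² of 0.76 can sit alongside correlations near 0.87.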

Malik A, Jayarathna DK, Fisher M, Barbhuiya TK, Gandhi NS, Batra J. Dynamics and recognition of homeodomain containing protein-DNA complex of IRX4. Proteins 2024;92:282-301. [PMID: 37861198 DOI: 10.1002/prot.26604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/15/2022] [Revised: 09/15/2023] [Accepted: 09/25/2023] [Indexed: 10/21/2023]

Abstract

Iroquois Homeobox 4 (IRX4) belongs to a family of homeobox TFs having roles in embryogenesis, cell specification, and organ development. Recently, large scale genome-wide association studies and epigenetic studies have highlighted the role of IRX4 and its associated variants in prostate cancer. No studies have investigated and characterized the structural aspect of the IRX4 homeodomain and its potential to bind to DNA. The current study uses sequence analysis, homology modeling, and molecular dynamics simulations to explore IRX4 homeodomain-DNA recognition mechanisms and the role of somatic mutations affecting these interactions. Using publicly available databases, gene expression of IRX4 was found in different tissues, including prostate, heart, skin, vagina, and the protein expression was found in cancer cell lines (HCT166, HEK293), B cells, ascitic fluid, and brain. Sequence conservation of the homeodomain shed light on the importance of N- and C-terminal residues involved in DNA binding. The specificity of IRX4 homodimer bound to consensus human DNA sequence was confirmed by molecular dynamics simulations, representing the role of conserved amino acids including R145, A194, N195, S190, R198, and R199 in binding to DNA. Additional N-terminal residues like T144 and G143 were also found to have specific interactions highlighting the importance of N-terminus of the homeodomain in DNA recognition. Additionally, the effects of somatic mutations, including the conserved Arginine (R145, R198, and R199) residues on DNA binding elucidated the importance of these residues in stabilizing the protein-DNA complex. Secondary structure and hydrogen bonding analysis showed the roles of specific residues (R145, T191, A194, N195, R198, and R199) in maintaining the homogeneity of the structure and its interaction with DNA. The differences in relative binding free energies of all the mutants shed light on the structural modularity of this protein and the dynamics behind protein-DNA interaction. We also have predicted that the C-terminal sequence of the IRX4 homeodomain could act as a potential cell-penetrating peptide, emphasizing the role these small peptides could play in targeting homeobox TFs.

Shi K, Xiong Y, Wang Y, Deng Y, Wang W, Jing B, Gao X. PractiCPP: a deep learning approach tailored for extremely imbalanced datasets in cell-penetrating peptide prediction. Bioinformatics 2024;40:btae058. [PMID: 38305405 PMCID: PMC11212486 DOI: 10.1093/bioinformatics/btae058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/29/2023] [Revised: 01/25/2024] [Accepted: 01/30/2024] [Indexed: 02/03/2024] Open

Abstract

MOTIVATION

Effective drug delivery systems are paramount in enhancing pharmaceutical outcomes, particularly through the use of cell-penetrating peptides (CPPs). These peptides are gaining prominence due to their ability to penetrate eukaryotic cells efficiently without inflicting significant damage to the cellular membrane, thereby ensuring optimal drug delivery. However, the identification and characterization of CPPs remain a challenge due to the laborious and time-consuming nature of conventional methods, despite advances in proteomics. Current computational models, however, are predominantly tailored for balanced datasets, an approach that falls short in real-world applications characterized by a scarcity of known positive CPP instances.

RESULTS

To navigate this shortfall, we introduce PractiCPP, a novel deep-learning framework tailored for CPP prediction in highly imbalanced data scenarios. Uniquely designed with the integration of hard negative sampling and a sophisticated feature extraction and prediction module, PractiCPP facilitates an intricate understanding and learning from imbalanced data. Our extensive computational validations highlight PractiCPP's exceptional ability to outperform existing state-of-the-art methods, demonstrating remarkable accuracy, even in datasets with an extreme positive-to-negative ratio of 1:1000. Furthermore, through methodical embedding visualizations, we have established that models trained on balanced datasets are not conducive to practical, large-scale CPP identification, as they do not accurately reflect real-world complexities. In summary, PractiCPP potentially offers new perspectives in CPP prediction methodologies. Its design and validation, informed by real-world dataset constraints, suggest its utility as a valuable tool in supporting the acceleration of drug delivery advancements.

AVAILABILITY AND IMPLEMENTATION

The source code of PractiCPP is available on Figshare at https://doi.org/10.6084/m9.figshare.25053878.v1.
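
The point this abstract makes about balanced versus imbalanced data can be illustrated with a short worked calculation. The sketch below estimates the expected precision of a classifier with fixed sensitivity and specificity as the number of negatives per positive grows. The sensitivity and specificity values are hypothetical and chosen only for illustration; they are not results from PractiCPP.

```python
def precision_at_ratio(sensitivity, specificity, negatives_per_positive):
    """Expected precision when each true CPP is accompanied by
    `negatives_per_positive` non-CPPs in the screened pool."""
    expected_tp = sensitivity  # true positives per positive example
    expected_fp = (1 - specificity) * negatives_per_positive
    total_called = expected_tp + expected_fp
    return expected_tp / total_called if total_called else 0.0


# Hypothetical classifier: Sn = 0.90, Sp = 0.99 (assumed values).
balanced = precision_at_ratio(0.90, 0.99, 1)
imbalanced = precision_at_ratio(0.90, 0.99, 1000)
```

With these assumed values, precision is about 0.99 on a 1:1 pool but only about 0.08 at 1:1000, because even a 1% false-positive rate produces roughly ten false hits for every true CPP.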

Cui Z, Wang SG, He Y, Chen ZH, Zhang QH. DeepTPpred: A Deep Learning Approach With Matrix Factorization for Predicting Therapeutic Peptides by Integrating Length Information. IEEE J Biomed Health Inform 2023;27:4611-4622. [PMID: 37368803 DOI: 10.1109/jbhi.2023.3290014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/29/2023]

Chen S, Liao Y, Zhao J, Bin Y, Zheng C. PACVP: Prediction of Anti-Coronavirus Peptides Using a Stacking Learning Strategy With Effective Feature Representation. IEEE/ACM Trans Comput Biol Bioinform 2023;20:3106-3116. [PMID: 37022025 DOI: 10.1109/tcbb.2023.3238370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/19/2023]

Meng C, Pei Y, Zou Q, Yuan L. DP-AOP: A novel SVM-based antioxidant proteins identifier. Int J Biol Macromol 2023;247:125499. [PMID: 37414318 DOI: 10.1016/j.ijbiomac.2023.125499] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/04/2023] [Revised: 06/01/2023] [Accepted: 06/19/2023] [Indexed: 07/08/2023]

Hsueh HT, Chou RT, Rai U, Liyanage W, Kim YC, Appell MB, Pejavar J, Leo KT, Davison C, Kolodziejski P, Mozzer A, Kwon H, Sista M, Anders NM, Hemingway A, Rompicharla SVK, Edwards M, Pitha I, Hanes J, Cummings MP, Ensign LM. Machine learning-driven multifunctional peptide engineering for sustained ocular drug delivery. Nat Commun 2023;14:2509. [PMID: 37130851 PMCID: PMC10154330 DOI: 10.1038/s41467-023-38056-w] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 08/17/2022] [Accepted: 04/12/2023] [Indexed: 05/04/2023] Open

Affiliation(s)

Henry T Hsueh Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
Renee Ti Chou Center for Bioinformatics and Computational Biology, University of Maryland, College Park, College Park, MD, USA
Usha Rai Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Wathsala Liyanage Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Yoo Chun Kim Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Matthew B Appell Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD, USA
Jahnavi Pejavar Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
Kirby T Leo Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
Charlotte Davison Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
Patricia Kolodziejski Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
Ann Mozzer Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
HyeYoung Kwon Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
Maanasa Sista Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Biomedical Engineering, Case Western Reserve University, Cleveland, OH, USA
Nicole M Anders The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA
Avelina Hemingway The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA
Sri Vishnu Kiran Rompicharla Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Malia Edwards Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Ian Pitha Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Justin Hanes Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD, USA Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA
Michael P Cummings Center for Bioinformatics and Computational Biology, University of Maryland, College Park, College Park, MD, USA.
Laura M Ensign Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA. Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA. Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA. Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD, USA. Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA. The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA.

Zhou W, Liu Y, Li Y, Kong S, Wang W, Ding B, Han J, Mou C, Gao X, Liu J. TriNet: A tri-fusion neural network for the prediction of anticancer and antimicrobial peptides. PATTERNS (NEW YORK, N.Y.) 2023;4:100702. [PMID: 36960450 PMCID: PMC10028424 DOI: 10.1016/j.patter.2023.100702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/09/2022] [Revised: 12/20/2022] [Accepted: 02/03/2023] [Indexed: 03/04/2023]

Essus VA, Souza Júnior GSE, Nunes GHP, Oliveira JDS, de Faria BM, Romão LF, Cortines JR. Bacteriophage P22 Capsid as a Pluripotent Nanotechnology Tool. Viruses 2023;15:516. [PMID: 36851730 PMCID: PMC9962691 DOI: 10.3390/v15020516] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 12/31/2022] [Revised: 02/09/2023] [Accepted: 02/10/2023] [Indexed: 02/15/2023] Open

Hasanzadeh A, Hamblin MR, Kiani J, Noori H, Hardie JM, Karimi M, Shafiee H. Could artificial intelligence revolutionize the development of nanovectors for gene therapy and mRNA vaccines? NANO TODAY 2022;47:101665. [PMID: 37034382 PMCID: PMC10081506 DOI: 10.1016/j.nantod.2022.101665] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Indexed: 06/19/2023]

Affiliation(s)

Akbar Hasanzadeh Cellular and Molecular Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran 1449614535, Iran
Michael R Hamblin Laser Research Centre, Faculty of Health Science, University of Johannesburg, Doornfontein 2028, South Africa Radiation Biology Research Center, Iran University of Medical Sciences, Tehran, Iran
Jafar Kiani Oncopathology Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran Department of Molecular Medicine, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran, Iran
Hamid Noori Cellular and Molecular Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran 1449614535, Iran
Joseph M. Hardie Division of Engineering in Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, 02139 USA
Mahdi Karimi Cellular and Molecular Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran 1449614535, Iran Oncopathology Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran Research Center for Science and Technology in Medicine, Tehran University of Medical Sciences, Tehran 141556559, Iran Applied Biotechnology Research Centre, Tehran Medical Science, Islamic Azad University, Tehran 1584743311, Iran
Hadi Shafiee Division of Engineering in Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, 02139 USA

Antimicrobial peptides with cell-penetrating activity as prophylactic and treatment drugs. Biosci Rep 2022;42:231731. [PMID: 36052730 PMCID: PMC9508529 DOI: 10.1042/bsr20221789] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 08/14/2022] [Revised: 08/31/2022] [Accepted: 09/01/2022] [Indexed: 01/18/2023] Open

Arif M, Kabir M, Ahmed S, Khan A, Ge F, Khelifi A, Yu DJ. DeepCPPred: A Deep Learning Framework for the Discrimination of Cell-Penetrating Peptides and Their Uptake Efficiencies. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022;19:2749-2759. [PMID: 34347603 DOI: 10.1109/tcbb.2021.3102133] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Indexed: 06/13/2023]

Hu RS, Wu J, Zhang L, Zhou X, Zhang Y. CD8TCEI-EukPath: A Novel Predictor to Rapidly Identify CD8+ T-Cell Epitopes of Eukaryotic Pathogens Using a Hybrid Feature Selection Approach. Front Genet 2022;13:935989. [PMID: 35937988 PMCID: PMC9354802 DOI: 10.3389/fgene.2022.935989] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/04/2022] [Accepted: 05/24/2022] [Indexed: 12/02/2022] Open

Niu M, Zou Q. SgRNA-RF: Identification of SgRNA On-Target Activity With Imbalanced Datasets. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022;19:2442-2453. [PMID: 33979289 DOI: 10.1109/tcbb.2021.3079116] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Indexed: 06/12/2023]

Prediction of Cell-Penetrating Peptides Using a Novel HSIC-Based Multiview TSK Fuzzy System. APPLIED SCIENCES-BASEL 2022. [DOI: 10.3390/app12115383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/16/2022]

Liu P, Ding Y, Rong Y, Chen D. Prediction of cell penetrating peptides and their uptake efficiency using random forest‐based feature selections. AIChE J 2022. [DOI: 10.1002/aic.17781] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/07/2022]

Chen X, Zhang Q, Li B, Lu C, Yang S, Long J, He B, Chen H, Huang J. BBPpredict: A Web Service for Identifying Blood-Brain Barrier Penetrating Peptides. Front Genet 2022;13:845747. [PMID: 35656322 PMCID: PMC9152268 DOI: 10.3389/fgene.2022.845747] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 12/30/2021] [Accepted: 03/30/2022] [Indexed: 12/22/2022] Open

Lokhande KB, Banerjee T, Swamy KV, Ghosh P, Deshpande M. An in silico scientific basis for LL-37 as a therapeutic for Covid-19. Proteins 2022;90:1029-1043. [PMID: 34333809 PMCID: PMC8441666 DOI: 10.1002/prot.26198] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Received: 01/03/2021] [Revised: 06/08/2021] [Accepted: 07/28/2021] [Indexed: 01/25/2023]

Gupta S, Azadvari N, Hosseinzadeh P. Design of Protein Segments and Peptides for Binding to Protein Targets. BIODESIGN RESEARCH 2022;2022:9783197. [PMID: 37850124 PMCID: PMC10521657 DOI: 10.34133/2022/9783197] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 12/29/2021] [Accepted: 03/16/2022] [Indexed: 10/19/2023] Open

MLCPP 2.0: An updated cell-penetrating peptides and their uptake efficiency predictor. J Mol Biol 2022;434:167604. [DOI: 10.1016/j.jmb.2022.167604] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 11/30/2021] [Revised: 04/03/2022] [Accepted: 04/19/2022] [Indexed: 12/12/2022]

Zhou H, Wang H, Ding Y, Tang J. Multivariate Information Fusion for Identifying Antifungal Peptides with Hilbert-Schmidt Independence Criterion. Curr Bioinform 2022. [DOI: 10.2174/1574893616666210727161003] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 11/22/2022]

Khandelwal R, Sharma AK, Biswa BB, Sharma Y. Extracellular Secretagogin is internalized into the cells through endocytosis. FEBS J 2021;289:3183-3204. [PMID: 34967502 DOI: 10.1111/febs.16338] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 11/25/2021] [Accepted: 11/29/2021] [Indexed: 11/29/2022]

Guo Y, Ju Y, Chen D, Wang L. Research on the Computational Prediction of Essential Genes. Front Cell Dev Biol 2021;9:803608. [PMID: 34938741 PMCID: PMC8685449 DOI: 10.3389/fcell.2021.803608] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 10/28/2021] [Accepted: 11/22/2021] [Indexed: 11/19/2022] Open

Abstract

Genes, the nucleotide sequences that encode a polypeptide chain or functional RNA, are the basic genetic units controlling biological traits. They underpin the basic structures and functions of organisms, and they store information related to biological factors and processes such as blood type, gestation, growth, and apoptosis. The environment and genetics jointly affect important physiological processes such as reproduction, cell division, and protein synthesis. Genes are related to a wide range of phenomena including growth, decline, illness, aging, and death. During the evolution of organisms, a class of genes has been conserved across multiple species. These genes are often located on the dominant strand of DNA and tend to have higher expression levels. The proteins they encode usually either perform very important functions or are responsible for maintaining and repairing these essential functions. Such genes are called persistent genes. Among them, those that are irreplaceable for the organism's life activities are the essential genes. For example, when starch is the only source of energy, the genes related to starch digestion are essential genes. Without them, the organism will die because it cannot obtain enough energy to maintain basic functions. The function of the proteins encoded by these genes is thought to be fundamental to life. Nowadays, DNA can be extracted from blood, saliva, or tissue cells for genetic testing, and detailed genetic information can be obtained using the most advanced scientific instruments and technologies. The information gained from genetic testing is useful for assessing the potential risks of disease and for helping to determine the prognosis and development of diseases. Such information is also useful for developing personalized medication and providing targeted health guidance to improve the quality of life. Therefore, it is of great theoretical and practical significance to identify important and essential genes.
In this paper, the research status of essential genes and the essential genome database of bacteria are reviewed, the computational prediction method of essential genes based on communication coding theory is expounded, and the significance and practical application value of essential genes are discussed.

Sebák F, Horváth LB, Kovács D, Szolomájer J, Tóth GK, Babiczky Á, Bősze S, Bodor A. Novel Lysine-Rich Delivery Peptides of Plant Origin ERD and Human S100: The Effect of Carboxyfluorescein Conjugation, Influence of Aromatic and Proline Residues, Cellular Internalization, and Penetration Ability. ACS OMEGA 2021;6:34470-34484. [PMID: 34963932 PMCID: PMC8697381 DOI: 10.1021/acsomega.1c04637] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 08/25/2021] [Accepted: 10/25/2021] [Indexed: 06/14/2023]

Jiao S, Zou Q, Guo H, Shi L. iTTCA-RF: a random forest predictor for tumor T cell antigens. J Transl Med 2021;19:449. [PMID: 34706730 PMCID: PMC8554859 DOI: 10.1186/s12967-021-03084-x] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Received: 06/23/2021] [Accepted: 09/16/2021] [Indexed: 12/21/2022] Open

Abstract

BACKGROUND

Cancer is one of the most serious diseases threatening human health. Cancer immunotherapy represents the most promising treatment strategy due to its high efficacy and selectivity and lower side effects compared with traditional treatment. The identification of tumor T cell antigens is one of the most important tasks for antitumor vaccine development and molecular function investigation. Although several machine learning predictors have been developed to identify tumor T cell antigens, accurate identification with existing methods remains challenging.

METHODS

In this study, we used a non-redundant dataset of 592 tumor T cell antigens (positive samples) and 393 non-tumor T cell antigens (negative samples). Four types of feature encoding methods were studied to build an efficient predictor, including amino acid composition, global protein sequence descriptors, and grouped amino acid and peptide composition. To improve the representation ability of the hybrid features, we further employed a two-step feature selection technique to search for the optimal feature subset. The final prediction model was constructed using the random forest algorithm.

RESULTS

Finally, the top 263 informative features were selected to train the random forest classifier for detecting tumor T cell antigen peptides. iTTCA-RF provides satisfactory performance, with balanced accuracy, specificity and sensitivity values of 83.71%, 78.73% and 88.69% over tenfold cross-validation as well as 73.14%, 62.67% and 83.61% over independent tests, respectively. The online prediction server is freely accessible at http://lab.malab.cn/~acy/iTTCA.

CONCLUSIONS

We have proven that the proposed predictor iTTCA-RF is superior to the other latest models, and will hopefully become an effective and useful tool for identifying tumor T cell antigens presented in the context of major histocompatibility complex class I.
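
The balanced accuracy figures quoted in this abstract are consistent with the usual definition of balanced accuracy as the mean of sensitivity and specificity. The abstract does not state the formula, so that definition is an assumption here, and the helper name `balanced_accuracy` is hypothetical. A minimal Python sketch of the arithmetic:

```python
def balanced_accuracy(sensitivity: float, specificity: float) -> float:
    """Mean of sensitivity and specificity (assumed standard definition)."""
    return (sensitivity + specificity) / 2.0


# Percentages copied from the iTTCA-RF abstract.
cross_validation = balanced_accuracy(sensitivity=88.69, specificity=78.73)
independent_test = balanced_accuracy(sensitivity=83.61, specificity=62.67)

print(f"tenfold cross-validation: {cross_validation:.2f}%")
print(f"independent test: {independent_test:.2f}%")
```

Under that assumption, both computed values agree with the reported 83.71% and 73.14%.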

Schissel CK, Mohapatra S, Wolfe JM, Fadzen CM, Bellovoda K, Wu CL, Wood JA, Malmberg AB, Loas A, Gómez-Bombarelli R, Pentelute BL. Deep learning to design nuclear-targeting abiotic miniproteins. Nat Chem 2021;13:992-1000. [PMID: 34373596 PMCID: PMC8819921 DOI: 10.1038/s41557-021-00766-3] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Received: 06/26/2020] [Accepted: 07/05/2021] [Indexed: 02/08/2023]

Xue Y, Ye X, Wei L, Zhang X, Sakurai T, Wei L. Better Performance with Transformer: CPPFormer in precise prediction of cell-Penetrating Peptides. Curr Med Chem 2021;29:881-893. [PMID: 34544332 DOI: 10.2174/0929867328666210920103140] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 03/11/2021] [Revised: 07/28/2021] [Accepted: 08/07/2021] [Indexed: 11/22/2022]

Porosk L, Põhako K, Arukuusk P, Langel Ü. Cell-Penetrating Peptides Predicted From CASC3, AKIP1, and AHRR Proteins. Front Pharmacol 2021;12:716226. [PMID: 34504427 PMCID: PMC8421526 DOI: 10.3389/fphar.2021.716226] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 05/28/2021] [Accepted: 07/26/2021] [Indexed: 11/13/2022] Open

Abstract

Peptides can be used as research tools and for diagnostic or therapeutic applications. Peptides, alongside small molecules and antibodies, are used and are gaining further interest as protein-protein interaction (PPI) modulators. Peptides have high target specificity and high affinity, but, unlike small molecule modulators, they are not able to cross cell membranes to reach their intracellular targets. To overcome this limitation, the special property of cell-penetrating peptides (CPPs) can be exploited. CPPs are a class of peptides that can enter cells and deliver attached cargoes along with them. Today, with the advancement of in silico prediction tools and the availability of protein databases, designing new and multifunctional peptides that are able to reach intracellular targets and inhibit certain cellular processes in a very specific manner is achievable. Although several efficient CPP sequences are already known, the discovery of new CPPs is crucial for the development of efficient delivery methods for both biotechnological and therapeutic applications. In this work, we chose 10 human nuclear proteins from which we predicted new potential CPP sequences by using three different CPP predictors: cell-penetrating peptide prediction tool, CellPPD, and SkipCPP-Pred. From each protein, one predicted CPP sequence was synthesized and its internalization into cells was assessed. Out of the tested sequences, three peptides displayed features characteristic of CPPs. These peptides and also the predicted peptide sequences could be used to design and modify new CPPs. In this work, we show that we can use protein sequences as input for generating new peptides with cell internalization properties. Three new CPPs, AHRR_8-24, CASC3_251-264, and AKIP1_27-37, can be further used for the delivery of other cargoes or designed into multifunctional peptides capable of internalizing into cells.

Guo X, Chen L, Wang L, Geng J, Wang T, Hu J, Li J, Liu C, Wang H. In silico identification and experimental validation of cellular uptake and intracellular labeling by a new cell penetrating peptide derived from CDN1. Drug Deliv 2021;28:1722-1736. [PMID: 34463179 PMCID: PMC8409945 DOI: 10.1080/10717544.2021.1963352] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Indexed: 12/18/2022] Open

Su R, Hu J, Zou Q, Manavalan B, Wei L. Empirical comparison and analysis of web-based cell-penetrating peptide prediction tools. Brief Bioinform 2021;21:408-420. [PMID: 30649170 DOI: 10.1093/bib/bby124] [Citation(s) in RCA: 107] [Impact Index Per Article: 26.8] [Received: 10/17/2018] [Revised: 11/30/2018] [Accepted: 11/30/2018] [Indexed: 12/16/2022] Open

B3Pred: A Random-Forest-Based Method for Predicting and Designing Blood-Brain Barrier Penetrating Peptides. Pharmaceutics 2021;13:1237. [PMID: 34452198 PMCID: PMC8399279 DOI: 10.3390/pharmaceutics13081237] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Received: 06/07/2021] [Revised: 07/07/2021] [Accepted: 07/14/2021] [Indexed: 12/14/2022] Open

Nasiri F, Atanaki FF, Behrouzi S, Kavousi K, Bagheri M. CpACpP: In Silico Cell-Penetrating Anticancer Peptide Prediction Using a Novel Bioinformatics Framework. ACS OMEGA 2021;6:19846-19859. [PMID: 34368571 PMCID: PMC8340416 DOI: 10.1021/acsomega.1c02569] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Received: 05/17/2021] [Accepted: 07/13/2021] [Indexed: 05/12/2023]

Chen L, Guo X, Wang L, Geng J, Wu J, Hu B, Wang T, Li J, Liu C, Wang H. In silico identification and experimental validation of cellular uptake by a new cell penetrating peptide P1 derived from MARCKS. Drug Deliv 2021;28:1637-1648. [PMID: 34338123 PMCID: PMC8330795 DOI: 10.1080/10717544.2021.1960922] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Indexed: 12/28/2022] Open

Zhou J, Li Y, Huang W, Shi W, Qian H. Source and exploration of the peptides used to construct peptide-drug conjugates. Eur J Med Chem 2021;224:113712. [PMID: 34303870 DOI: 10.1016/j.ejmech.2021.113712] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Received: 04/24/2021] [Revised: 07/12/2021] [Accepted: 07/17/2021] [Indexed: 12/16/2022]

Holl NJ, Lee HJ, Huang YW. Evolutionary Timeline of Genetic Delivery and Gene Therapy. Curr Gene Ther 2021;21:89-111. [PMID: 33292120 DOI: 10.2174/1566523220666201208092517] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/07/2020] [Revised: 11/17/2020] [Accepted: 11/22/2020] [Indexed: 11/22/2022]