Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Manavalan B, Shin TH, Kim MO, Lee G. PIP-EL: A New Ensemble Learning Method for Improved Proinflammatory Peptide Predictions. Front Immunol 2018;9:1783. [PMID: 30108593 PMCID: PMC6079197 DOI: 10.3389/fimmu.2018.01783] [Citation(s) in RCA: 88] [Impact Index Per Article: 14.7] [Received: 03/08/2018] [Accepted: 07/19/2018] [Indexed: 02/03/2023]

Cited by Other Article(s)

Weckbecker M, Anžel A, Yang Z, Hattab G. Interpretable molecular encodings and representations for machine learning tasks. Comput Struct Biotechnol J 2024;23:2326-2336. [PMID: 38867722 PMCID: PMC11167246 DOI: 10.1016/j.csbj.2024.05.035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/27/2024] [Revised: 05/13/2024] [Accepted: 05/19/2024] [Indexed: 06/14/2024]

Basith S, Sangaraju VK, Manavalan B, Lee G. mHPpred: Accurate identification of peptide hormones using multi-view feature learning. Comput Biol Med 2024;183:109297. [PMID: 39442438 DOI: 10.1016/j.compbiomed.2024.109297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/20/2024] [Revised: 10/04/2024] [Accepted: 10/15/2024] [Indexed: 10/25/2024]

Abstract

Peptide hormones were first used in medicine in the early 20th century, with the pivotal event being the isolation and purification of insulin in 1921. These hormones are integral to a sophisticated system that emerged early in evolution to regulate growth, development, and homeostasis. They serve as targeted signaling molecules that transfer specific information between cells and organs, ensuring coordinated and precise physiological responses. While experimental methods for identifying peptide hormones present challenges such as low abundance, stability issues, and complexity, computational methods offer promising alternatives. Advances in machine learning and bioinformatics have facilitated the prediction of peptide hormones, further enhancing their therapeutic potential. In this study, we explored three different computational frameworks for peptide hormone identification and determined that the meta-approach was the most suitable. Firstly, we evaluated the discriminative power of 26 feature descriptors using a series of baseline models and identified seven feature descriptors with high predictive potential. Through a systematic approach, we then selected the top 20 performing baseline models and integrated their predicted probabilities to train a meta-model, leveraging the strengths of multiple prediction strategies. Our final light gradient boosting-based meta-model, mHPpred, significantly outperformed the existing method, HOPPred, on both benchmarking and independent datasets. Notably, mHPpred also demonstrated superior performance compared to the hybrid and integrative framework approaches employed in this study. This superiority demonstrates the effectiveness of our multi-view feature learning strategy in capturing discriminative features and providing a more accurate prediction model for peptide hormones. mHPpred is publicly accessible at: https://balalab-skku.org/mHPpred.

Yan C, Geng A, Pan Z, Zhang Z, Cui F. MultiFeatVotPIP: a voting-based ensemble learning framework for predicting proinflammatory peptides. Brief Bioinform 2024;25:bbae505. [PMID: 39406523 PMCID: PMC11479713 DOI: 10.1093/bib/bbae505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/05/2024] [Revised: 09/01/2024] [Accepted: 09/30/2024] [Indexed: 10/20/2024]

Kang Y, Zhang H, Wang X, Yang Y, Jia Q. MMDB: Multimodal dual-branch model for multi-functional bioactive peptide prediction. Anal Biochem 2024;690:115491. [PMID: 38460901 DOI: 10.1016/j.ab.2024.115491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/12/2023] [Revised: 01/21/2024] [Accepted: 02/19/2024] [Indexed: 03/11/2024]

Madugula SS, Pujar P, Nammi B, Wang S, Jayasinghe-Arachchige VM, Pham T, Mashburn D, Artiles M, Liu J. Identification of Family-Specific Features in Cas9 and Cas12 Proteins: A Machine Learning Approach Using Complete Protein Feature Spectrum. J Chem Inf Model 2024;64:4897-4911. [PMID: 38838358 DOI: 10.1021/acs.jcim.4c00625] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/07/2024]

Abstract

The recent development of CRISPR-Cas technology holds promise to correct gene-level defects for genetic diseases. The key element of the CRISPR-Cas system is the Cas protein, a nuclease that can edit the gene of interest assisted by guide RNA. However, these Cas proteins suffer from inherent limitations such as large size, low cleavage efficiency, and off-target effects, hindering their widespread application as a gene editing tool. Therefore, there is a need to identify novel Cas proteins with improved editing properties, for which it is necessary to understand the underlying features governing the Cas families. In this study, we aim to elucidate the unique protein features associated with Cas9 and Cas12 families and identify the features distinguishing each family from non-Cas proteins. Here, we built Random Forest (RF) binary classifiers to distinguish Cas12 and Cas9 proteins from non-Cas proteins, respectively, using the complete protein feature spectrum (13,494 features) encoding various physiochemical, topological, constitutional, and coevolutionary information on Cas proteins. Furthermore, we built multiclass RF classifiers differentiating Cas9, Cas12, and non-Cas proteins. All the models were evaluated rigorously on the test and independent data sets. The Cas12 and Cas9 binary models achieved a high overall accuracy of 92% and 95% on their respective independent data sets, while the multiclass classifier achieved an F1 score of close to 0.98. We observed that Quasi-Sequence-Order (QSO) descriptors like Schneider.lag and Composition descriptors like charge, volume, and polarizability are predominant in the Cas12 family. Conversely Amino Acid Composition descriptors, especially Tripeptide Composition (TPC), predominate the Cas9 family. 
Four of the top 10 descriptors identified in Cas9 classification are tripeptides PWN, PYY, HHA, and DHI, which are seen to be conserved across all Cas9 proteins and located within different catalytically important domains of the Streptococcus pyogenes Cas9 (SpCas9) structure. Among these, DHI and HHA are well-known to be involved in the DNA cleavage activity of the SpCas9 protein. Mutation studies have highlighted the significance of the PWN tripeptide in PAM recognition and DNA cleavage activity of SpCas9, while Y450 from the PYY tripeptide plays a crucial role in reducing off-target effects and improving the specificity in SpCas9. Leveraging our machine learning (ML) pipeline, we identified numerous Cas9 and Cas12 family-specific features. These features offer valuable insights for future experimental and computational studies aiming at designing Cas systems with enhanced gene-editing properties. These features suggest plausible structural modifications that can effectively guide the development of Cas proteins with improved editing capabilities.

Affiliation(s)

Sita Sirisha Madugula Department of Pharmaceutical Sciences, University of North Texas System College of Pharmacy, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, Fort Worth, Texas 76107, United States
Pranav Pujar Department of Industrial, Manufacturing and Systems Engineering, University of Texas at Arlington, 701 South Nedderman Drive, Arlington, Texas 76019, United States
Bharani Nammi Department of Industrial, Manufacturing and Systems Engineering, University of Texas at Arlington, 701 South Nedderman Drive, Arlington, Texas 76019, United States
Shouyi Wang Department of Industrial, Manufacturing and Systems Engineering, University of Texas at Arlington, 701 South Nedderman Drive, Arlington, Texas 76019, United States
Vindi M Jayasinghe-Arachchige Department of Pharmaceutical Sciences, University of North Texas System College of Pharmacy, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, Fort Worth, Texas 76107, United States
Tyler Pham School of Biomedical Sciences, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, Fort Worth, Texas 76107, United States
Dominic Mashburn Department of Pharmaceutical Sciences, University of North Texas System College of Pharmacy, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, Fort Worth, Texas 76107, United States
Maria Artiles School of Biomedical Sciences, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, Fort Worth, Texas 76107, United States
Jin Liu Department of Pharmaceutical Sciences, University of North Texas System College of Pharmacy, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, Fort Worth, Texas 76107, United States School of Biomedical Sciences, University of North Texas Health Science Center, 3500 Camp Bowie Blvd, Fort Worth, Texas 76107, United States

Teixeira DG, Rodrigues-Neto JF, da Cunha DCS, Jeronimo SMB. Understanding SARS-CoV-2 spike glycoprotein clusters and their impact on immunity of the population from Rio Grande do Norte, Brazil. Infect Genet Evol 2024;118:105556. [PMID: 38242186 DOI: 10.1016/j.meegid.2024.105556] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/15/2023] [Revised: 01/09/2024] [Accepted: 01/15/2024] [Indexed: 01/21/2024]

Madugula SS, Pujar P, Bharani N, Wang S, Jayasinghe-Arachchige VM, Pham T, Mashburn D, Artilis M, Liu J. Identification of Family-Specific Features in Cas9 and Cas12 Proteins: A Machine Learning Approach Using Complete Protein Feature Spectrum. bioRxiv 2024:2024.01.22.576286. [PMID: 38328240 PMCID: PMC10849529 DOI: 10.1101/2024.01.22.576286] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/09/2024]

Abstract

The recent development of CRISPR-Cas technology holds promise to correct gene-level defects for genetic diseases. The key element of the CRISPR-Cas system is the Cas protein, a nuclease that can edit the gene of interest assisted by guide RNA. However, these Cas proteins suffer from inherent limitations like large size, low cleavage efficiency, and off-target effects, hindering their widespread application as a gene editing tool. Therefore, there is a need to identify novel Cas proteins with improved editing properties, for which it is necessary to understand the underlying features governing the Cas families. In the current study, we aim to elucidate the unique protein attributes associated with Cas9 and Cas12 families and identify the features that distinguish each family from the other. Here, we built Random Forest (RF) binary classifiers to distinguish Cas12 and Cas9 proteins from non-Cas proteins, respectively, using the complete protein feature spectrum (13,495 features) encoding various physiochemical, topological, constitutional, and coevolutionary information of Cas proteins. Furthermore, we built multiclass RF classifiers differentiating Cas9, Cas12, and Non-Cas proteins. All the models were evaluated rigorously on the test and independent datasets. The Cas12 and Cas9 binary models achieved a high overall accuracy of 95% and 97% on their respective independent datasets, while the multiclass classifier achieved a high F1 score of 0.97. We observed that Quasi-sequence-order descriptors like Schneider-lag descriptors and Composition descriptors like charge, volume, and polarizability are essential for the Cas12 family. More interestingly, we discovered that Amino Acid Composition descriptors, especially the Tripeptide Composition (TPC) descriptors, are important for the Cas9 family. 
Four of the important descriptors identified for Cas9 classification are the tripeptides PWN, PYY, HHA, and DHI, which are conserved across all the Cas9 proteins and are located within different catalytically important domains of the Cas9 protein structure. Among these four tripeptides, DHI and HHA are well known to be involved in the DNA cleavage activity of the Cas9 protein. We therefore propose that the other two tripeptides, PWN and PYY, may also be essential for the Cas9 family. The important descriptors we identified enhance the understanding of the catalytic mechanisms of Cas9 and Cas12 proteins and provide valuable insights into the design of novel Cas systems with enhanced gene-editing properties.

Su R, Zhuang J, Liu S, Liu D, Feng K. EnILs: A General Ensemble Computational Approach for Predicting Inducing Peptides of Multiple Interleukins. J Comput Biol 2023;30:1289-1304. [PMID: 38010531 DOI: 10.1089/cmb.2023.0002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/29/2023]

Naorem LD, Sharma N, Raghava GPS. A web server for predicting and scanning of IL-5 inducing peptides using alignment-free and alignment-based method. Comput Biol Med 2023;158:106864. [PMID: 37058758 DOI: 10.1016/j.compbiomed.2023.106864] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 11/01/2022] [Revised: 03/06/2023] [Accepted: 03/30/2023] [Indexed: 04/16/2023]

Spänig S, Michel A, Heider D. Unsupervised encoding selection through ensemble pruning for biomedical classification. BioData Min 2023;16:10. [PMID: 36927546 PMCID: PMC10018861 DOI: 10.1186/s13040-022-00317-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/24/2022] [Accepted: 11/27/2022] [Indexed: 03/18/2023]

Abstract

BACKGROUND

Owing to the rising levels of multi-resistant pathogens, antimicrobial peptides, an alternative strategy to classic antibiotics, have gained more attention. A crucial part is thereby the costly identification and validation. With the ever-growing amount of annotated peptides, researchers leverage artificial intelligence to circumvent the cumbersome, wet-lab-based identification and automate the detection of promising candidates. However, the prediction of a peptide's function is not limited to antimicrobial efficiency. To date, multiple studies have successfully classified additional properties, e.g., antiviral or cell-penetrating effects. In this light, ensemble classifiers are employed with the aim of further improving the prediction. Although we recently presented a workflow to significantly diminish the initial encoding choice, an entirely unsupervised encoding selection, considering various machine learning models, is still lacking.

RESULTS

We developed a workflow, automatically selecting encodings and generating classifier ensembles by employing sophisticated pruning methods. We observed that the Pareto frontier pruning is a good method to create encoding ensembles for the datasets at hand. In addition, encodings combined with the Decision Tree classifier as the base model are often superior. However, our results also demonstrate that none of the ensemble building techniques is outstanding for all datasets.

CONCLUSION

The workflow conducts multiple pruning methods to evaluate ensemble classifiers composed from a wide range of peptide encodings and base models. Consequently, researchers can use the workflow for unsupervised encoding selection and ensemble creation. Ultimately, the extensible workflow can be used as a plugin for the PEPTIDE REACToR, further establishing it as a versatile tool in the domain.

Malik A, Shoombuatong W, Kim CB, Manavalan B. GPApred: The first computational predictor for identifying proteins with LPXTG-like motif using sequence-based optimal features. Int J Biol Macromol 2023;229:529-538. [PMID: 36596370 DOI: 10.1016/j.ijbiomac.2022.12.315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/19/2022] [Revised: 12/19/2022] [Accepted: 12/28/2022] [Indexed: 01/02/2023]

Ghaly G, Tallima H, Dabbish E, Badr ElDin N, Abd El-Rahman MK, Ibrahim MAA, Shoeib T. Anti-Cancer Peptides: Status and Future Prospects. Molecules 2023;28:1148. [PMID: 36770815 PMCID: PMC9920184 DOI: 10.3390/molecules28031148] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Received: 11/27/2022] [Revised: 12/26/2022] [Accepted: 01/19/2023] [Indexed: 01/26/2023]

Dhanda SK, Mahajan S, Manoharan M. Neoepitopes prediction strategies: an integration of cancer genomics and immunoinformatics approaches. Brief Funct Genomics 2023;22:1-8. [PMID: 36398967 DOI: 10.1093/bfgp/elac041] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 07/28/2022] [Revised: 09/28/2022] [Accepted: 10/14/2022] [Indexed: 11/19/2022]

Barh D, Tiwari S, Rodrigues Gomes LG, Ramalho Pinto CH, Andrade BS, Ahmad S, Aljabali AAA, Alzahrani KJ, Banjer HJ, Hassan SS, Redwan EM, Raza K, Góes-Neto A, Sabino-Silva R, Lundstrom K, Uversky VN, Azevedo V, Tambuwala MM. SARS-CoV-2 Variants Show a Gradual Declining Pathogenicity and Pro-Inflammatory Cytokine Stimulation, an Increasing Antigenic and Anti-Inflammatory Cytokine Induction, and Rising Structural Protein Instability: A Minimal Number Genome-Based Approach. Inflammation 2023;46:297-312. [PMID: 36215001 PMCID: PMC9549046 DOI: 10.1007/s10753-022-01734-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Received: 07/19/2022] [Revised: 08/12/2022] [Accepted: 08/23/2022] [Indexed: 11/30/2022]

Affiliation(s)

Debmalya Barh Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, West Bengal, 721172, Purba Medinipur, India; Laboratory of Cellular and Molecular Genetics (LGCM) and PG Program in Bioinformatics, Department of Genetics, Ecology and Evolution, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, CEP 31270-901 Brazil
Sandeep Tiwari Laboratory of Cellular and Molecular Genetics (LGCM) and PG Program in Bioinformatics, Department of Genetics, Ecology and Evolution, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, CEP 31270-901 Brazil
Lucas Gabriel Rodrigues Gomes Laboratory of Cellular and Molecular Genetics (LGCM) and PG Program in Bioinformatics, Department of Genetics, Ecology and Evolution, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, CEP 31270-901 Brazil
Cecília Horta Ramalho Pinto Department of Biochemistry and Immunology, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, 31270-901 Brazil
Bruno Silva Andrade Laboratory of Bioinformatics and Computational Chemistry, Department of Biological Sciences, State University of Southwest Bahia (UESB), Jequié, 45206-190 Brazil
Shaban Ahmad Department of Computer Science, Jamia Millia Islamia, New Delhi, 110025 India
Alaa A. A. Aljabali Department of Pharmaceutics and Pharmaceutical Technology, Faculty of Pharmacy, Yarmouk University, P O BOX 566, Irbid, 21163 Jordan
Khalid J. Alzahrani Department of Clinical Laboratories Sciences, College of Applied Medical Sciences, Taif University, P.O. Box 11099, Taif, 21944 Saudi Arabia
Hamsa Jameel Banjer Department of Clinical Laboratories Sciences, College of Applied Medical Sciences, Taif University, P.O. Box 11099, Taif, 21944 Saudi Arabia
Sk. Sarif Hassan Department of Mathematics, Pingla Thana Mahavidyalaya, Maligram, 721140 India
Elrashdy M. Redwan Department of Biological Science, Faculty of Science, King Abdulaziz University, Jeddah, 21589 Saudi Arabia
Khalid Raza Department of Computer Science, Jamia Millia Islamia, New Delhi, 110025 India
Aristóteles Góes-Neto Laboratory of Cellular and Molecular Genetics (LGCM) and PG Program in Bioinformatics, Department of Genetics, Ecology and Evolution, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, CEP 31270-901 Brazil
Robinson Sabino-Silva Department of Physiology, Institute of Biomedical Sciences, Federal University of Uberlandia, Minas Gerais, Uberlandia, CEP 38400-902 Brazil
Kenneth Lundstrom PanTherapeutics, 1095 Lutry, Switzerland
Vladimir N. Uversky Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, FL 33612 USA
Vasco Azevedo Laboratory of Cellular and Molecular Genetics (LGCM) and PG Program in Bioinformatics, Department of Genetics, Ecology and Evolution, Institute of Biological Sciences, Federal University of Minas Gerais, Belo Horizonte, CEP 31270-901 Brazil
Murtaza M. Tambuwala Lincoln Medical School, University of Lincoln, Brayford Pool Campus, Lincoln, LN6 7TS UK

Jain S, Dhall A, Patiyal S, Raghava GPS. In Silico Tool for Identification, Designing, and Searching of IL13-Inducing Peptides in Antigens. Methods Mol Biol 2023;2673:329-338. [PMID: 37258925 DOI: 10.1007/978-1-0716-3239-0_23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/02/2023]

Characterisation of a novel crustin isoform from mud crab, Scylla serrata (Forsskål, 1775) and its functional analysis in silico. In Silico Pharmacol 2022;11:2. [PMID: 36582926 PMCID: PMC9795441 DOI: 10.1007/s40203-022-00138-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/17/2022] [Accepted: 12/18/2022] [Indexed: 12/29/2022]

Dhanda SK, Malviya J, Gupta S. Not all T cell epitopes are equally desired: a review of in silico tools for the prediction of cytokine-inducing potential of T-cell epitopes. Brief Bioinform 2022;23:6692551. [PMID: 36070623 DOI: 10.1093/bib/bbac382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/07/2022] [Revised: 08/01/2022] [Accepted: 08/09/2022] [Indexed: 11/13/2022]

Considering epitopes conservity in targeting SARS-CoV-2 mutations in variants: a novel immunoinformatics approach to vaccine design. Sci Rep 2022;12:14017. [PMID: 35982065 PMCID: PMC9386201 DOI: 10.1038/s41598-022-18152-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 04/10/2022] [Accepted: 08/05/2022] [Indexed: 11/08/2022]

Bhargav A, Fatima F, Chaurasia P, Seth S, Ramachandran S. Computer-Aided Tools and Resources for Fungal Pathogens: An Application of Reverse Vaccinology for Mucormycosis. Monoclon Antib Immunodiagn Immunother 2022;41:243-254. [PMID: 35939284 DOI: 10.1089/mab.2021.0039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/12/2022]

Zou H. iRNA5hmC-HOC: High-order correlation information for identifying RNA 5-hydroxymethylcytosine modification. J Bioinform Comput Biol 2022;20:2250017. [DOI: 10.1142/s0219720022500172] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/18/2022]

Li Y, Li X, Liu Y, Yao Y, Huang G. MPMABP: A CNN and Bi-LSTM-Based Method for Predicting Multi-Activities of Bioactive Peptides. Pharmaceuticals (Basel) 2022;15:707. [PMID: 35745625 PMCID: PMC9231127 DOI: 10.3390/ph15060707] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 04/22/2022] [Revised: 05/23/2022] [Accepted: 05/30/2022] [Indexed: 12/30/2022]

Development of Anticancer Peptides Using Artificial Intelligence and Combinational Therapy for Cancer Therapeutics. Pharmaceutics 2022;14:997. [PMID: 35631583 PMCID: PMC9147327 DOI: 10.3390/pharmaceutics14050997] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Received: 03/29/2022] [Revised: 04/28/2022] [Accepted: 05/04/2022] [Indexed: 01/27/2023]

Shishir TA, Jannat T, Naser IB. An in-silico study of the mutation-associated effects on the spike protein of SARS-CoV-2, Omicron variant. PLoS One 2022;17:e0266844. [PMID: 35446879 PMCID: PMC9022835 DOI: 10.1371/journal.pone.0266844] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Received: 01/24/2022] [Accepted: 03/28/2022] [Indexed: 01/16/2023]

Yang J, Han SC, Poon J. A survey on extraction of causal relations from natural language text. Knowl Inf Syst 2022. [DOI: 10.1007/s10115-022-01665-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/18/2022]

Gong W, Pan C, Cheng P, Wang J, Zhao G, Wu X. Peptide-Based Vaccines for Tuberculosis. Front Immunol 2022;13:830497. [PMID: 35173740 PMCID: PMC8841753 DOI: 10.3389/fimmu.2022.830497] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Received: 12/07/2021] [Accepted: 01/10/2022] [Indexed: 12/12/2022]

Manavalan B, Basith S, Lee G. Comparative analysis of machine learning-based approaches for identifying therapeutic peptides targeting SARS-CoV-2. Brief Bioinform 2022;23:bbab412. [PMID: 34595489 PMCID: PMC8500067 DOI: 10.1093/bib/bbab412] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Received: 07/01/2021] [Revised: 08/27/2021] [Accepted: 09/07/2021] [Indexed: 01/08/2023]

Qiu WR, Guan MY, Wang QK, Lou LL, Xiao X. Identifying Pupylation Proteins and Sites by Incorporating Multiple Methods. Front Endocrinol (Lausanne) 2022;13:849549. [PMID: 35557849 PMCID: PMC9088680 DOI: 10.3389/fendo.2022.849549] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 01/06/2022] [Accepted: 03/07/2022] [Indexed: 11/20/2022]

Wang G, Vaisman II, van Hoek ML. Machine Learning Prediction of Antimicrobial Peptides. Methods Mol Biol 2022;2405:1-37. [PMID: 35298806 PMCID: PMC9126312 DOI: 10.1007/978-1-0716-1855-4_1] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Indexed: 01/03/2023]

Tarrahimofrad H, Rahimnahal S, Zamani J, Jahangirian E, Aminzadeh S. Designing a multi-epitope vaccine to provoke the robust immune response against influenza A H7N9. Sci Rep 2021;11:24485. [PMID: 34966175 PMCID: PMC8716528 DOI: 10.1038/s41598-021-03932-2] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Received: 05/09/2021] [Accepted: 12/13/2021] [Indexed: 12/12/2022]

Abstract

A new strain of Influenza A Virus (IAV), so-called "H7N9 Avian Influenza", is the first strain of this virus in which a human is infected by transmitting the N9 of influenza virus. Although continuous human-to-human transmission has not been reported, the occurrence of various H7N9-associated epidemics and the lack of production of strong antibodies against H7N9 in humans warn of the potential for H7N9 to become a new pandemic. Therefore, the need for effective vaccination against H7N9 as a life-threatening viral pathogen has become a major concern. The current study reports the design of a multi-epitope vaccine against Hemagglutinin (HA) and Neuraminidase (NA) proteins of H7N9 Influenza A virus by prediction of Cytotoxic T lymphocyte (CTL), Helper T lymphocyte (HTL), IFN-γ and B-cell epitopes. Human β-defensin-3 (HβD-3) and pan HLA DR-binding epitope (PADRE) sequence were considered as adjuvants. EAAAK, AAY, GPGPG, HEYGAEALERAG, KK and RVRR linkers were used to connect the epitopes. The final construct contained 777 amino acids that are expected to be a recombinant protein of about 86.38 kDa with antigenic and non-allergenic properties after expression. Modeled protein analysis based on tertiary structure validation, docking studies, and molecular dynamics simulation results such as Root-mean-square deviation (RMSD), Gyration, Root-mean-square fluctuation (RMSF) and Molecular Mechanics Poisson-Boltzmann Surface Area (MM/PBSA) showed that this protein has a stable construct and is capable of interacting with Toll-like receptor 7 (TLR7), TLR8 and m826 antibody. Analysis of the obtained data demonstrates that the suggested vaccine has the potential to induce the immune response by stimulating T and B cells, and may be utilizable for prevention purposes against Avian Influenza A (H7N9).

Dhall A, Jain S, Sharma N, Naorem LD, Kaur D, Patiyal S, Raghava GPS. In silico tools and databases for designing cancer immunotherapy. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2021;129:1-50. [PMID: 35305716 DOI: 10.1016/bs.apcsb.2021.11.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

Malik A, Subramaniyam S, Kim CB, Manavalan B. SortPred: The first machine learning based predictor to identify bacterial sortases and their classes using sequence-derived information. Comput Struct Biotechnol J 2021;20:165-174. [PMID: 34976319 PMCID: PMC8703055 DOI: 10.1016/j.csbj.2021.12.014] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Revised: 12/08/2021] [Accepted: 12/09/2021] [Indexed: 12/12/2022] Open

Zhao YW, Zhang S, Ding H. Recent development of machine learning methods in sumoylation sites prediction. Curr Med Chem 2021;29:894-907. [PMID: 34525906 DOI: 10.2174/0929867328666210915112030] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2021] [Revised: 07/24/2021] [Accepted: 08/07/2021] [Indexed: 11/22/2022]

Melo MCR, Maasch JRMA, de la Fuente-Nunez C. Accelerating antibiotic discovery through artificial intelligence. Commun Biol 2021;4:1050. [PMID: 34504303 PMCID: PMC8429579 DOI: 10.1038/s42003-021-02586-0] [Citation(s) in RCA: 66] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2021] [Accepted: 07/16/2021] [Indexed: 02/07/2023] Open

Jia C, Zhang M, Fan C, Li F, Song J. Formator: Predicting Lysine Formylation Sites Based on the Most Distant Undersampling and Safe-Level Synthetic Minority Oversampling. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021;18:1937-1945. [PMID: 31804942 DOI: 10.1109/tcbb.2019.2957758] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]

iBitter-Fuse: A Novel Sequence-Based Bitter Peptide Predictor by Fusing Multi-View Features. Int J Mol Sci 2021;22:ijms22168958. [PMID: 34445663 PMCID: PMC8396555 DOI: 10.3390/ijms22168958] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2021] [Revised: 08/08/2021] [Accepted: 08/17/2021] [Indexed: 12/19/2022] Open

Charoenkwan P, Chiangjong W, Hasan MM, Nantasenamat C, Shoombuatong W. Review and comparative analysis of machine learning-based predictors for predicting and analyzing of anti-angiogenic peptides. Curr Med Chem 2021;29:849-864. [PMID: 34375178 DOI: 10.2174/0929867328666210810145806] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Revised: 06/17/2021] [Accepted: 06/22/2021] [Indexed: 11/22/2022]

Charoenkwan P, Anuwongcharoen N, Nantasenamat C, Hasan MM, Shoombuatong W. In Silico Approaches for the Prediction and Analysis of Antiviral Peptides: A Review. Curr Pharm Des 2021;27:2180-2188. [PMID: 33138759 DOI: 10.2174/1381612826666201102105827] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2020] [Accepted: 08/20/2020] [Indexed: 11/22/2022]

Perpetuo L, Klein J, Ferreira R, Guedes S, Amado F, Leite-Moreira A, Silva AMS, Thongboonkerd V, Vitorino R. How can artificial intelligence be used for peptidomics? Expert Rev Proteomics 2021;18:527-556. [PMID: 34343059 DOI: 10.1080/14789450.2021.1962303] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]

Bhatt P, Sharma M, Sharma S. Prediction and identification of T cell epitopes of COVID-19 with balanced cytokine response for the development of peptide based vaccines. In Silico Pharmacol 2021;9:40. [PMID: 34221846 PMCID: PMC8237047 DOI: 10.1007/s40203-021-00098-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Accepted: 06/05/2021] [Indexed: 12/27/2022] Open

Abstract

The recent outbreak of the 2019 novel coronavirus poses a serious challenge to the global health system. Given the paucity of experimental data, tools and even a basic understanding of host immune responses against SARS-CoV-2, well-thought-out, effective measures are needed to control the COVID-19 pandemic. We have identified specific overlapping antigenic peptide epitopes (OAPE) within the 4 structural proteins of SARS-CoV-2 predictive of triggering robust CD4 and CD8 T cell responses in the host using bioinformatics tools (NetMHC4.0, IEDB, and Vaxijen2.0). We speculate an early release of pro-inflammatory cytokines for protection and a later release of anti-inflammatory cytokines for prevention of immunopathology in designing a vaccine for COVID-19. Therefore, the selected immunogenic OAPE were subjected to in silico tools (IL-6-Pred, IFNepitope and PIP-EL) for analyzing their pro-inflammatory response. The OAPEs found to be pro-inflammatory in nature were further subjected to prediction servers (IL-4-Pred, IL-10-Pred, Pre-AIP) to characterize them as inducers of an anti-inflammatory response as well. We finally filtered out 12 OAPE which had affinity for both CD4 and CD8 T cells and were inducers of both pro-inflammatory and anti-inflammatory cytokines. On confirmation of OAPE binding affinity for the respective T cell specific MHC allele using docking studies (pepATTRACT, Hex8.0 and Discovery studio), they were found to have more immunogenic potential than the 3 negative control peptides (NCPs) included in the study. Additionally, we constructed a CTxB-adjuvanted multi-epitopic vaccine inclusive of the 12 OAPEs which was non-toxic, non-allergenic and capable of inducing both pro-inflammatory and anti-inflammatory cytokines. A successful in silico cloning and docking of the modeled subunit vaccine construct with Toll-like receptor-2 (TLR-2) confirmed the high efficacy of our multi-epitopic vaccine, which can, through a balanced interplay of cytokines, help in creating a steady-state immune equilibrium. In silico immune simulation studies with the vaccine using the C-ImmSim server also showed a higher percentage of T cells along with production of pro-inflammatory as well as some anti-inflammatory cytokines. Experimental validation of this prediction-based study on Peripheral Blood Mononuclear Cells (PBMCs) of uninfected individuals, patients and recovered individuals will facilitate production of a high-priority, effective SARS-CoV-2 vaccine candidate.

Supplementary Information

The online version contains supplementary material available at 10.1007/s40203-021-00098-7.

Neelima S, Archana K, Athira PP, Anju MV, Anooja VV, Bright Singh IS, Philip R. Molecular characterization of a novel β-defensin isoform from the red-toothed trigger fish, Odonus niger (Ruppel, 1836). J Genet Eng Biotechnol 2021;19:71. [PMID: 33978838 PMCID: PMC8116387 DOI: 10.1186/s43141-021-00175-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 05/03/2021] [Indexed: 11/10/2022]

Abstract

Background

The concern regarding a post-antibiotic era with increasing drug resistance by pathogens imposes the need to discover alternatives for existing antibiotics. Antimicrobial peptides (AMPs) with their versatile therapeutic properties are a group of promising molecules with curative potentials. These evolutionarily conserved molecules play important roles in the innate immune system of several organisms. The β-defensins are a group of cysteine rich cationic antimicrobial peptides that play an important role in the innate immune system by their antimicrobial activity against the invading pathogens. The present study deals with a novel β-defensin isoform from the red-toothed trigger fish, Odonus niger. Total RNA was isolated from the gills, cDNA was synthesized and the β-defensin isoform obtained by polymerase chain reaction was cloned and subjected to structural and functional characterization in silico.

Results

A β-defensin isoform could be detected from the gill mRNA of red-toothed trigger fish, Odonus niger. The cDNA encoded a 63 amino acid peptide, β-defensin, with a 20 amino acid signal sequence followed by 43 amino acid cationic mature peptide (On-Def) having a molecular weight of 5.214 kDa and theoretical pI of 8.89. On-Def possessed six highly conserved cysteine residues forming disulfide bonds between C1–C5, C2–C4, and C3–C6, typical of β-defensins. An anionic pro-region was observed prior to the β-defensin domain within the mature peptide. Clustal alignment and phylogenetic analyses revealed On-Def as a group 2 β-defensin. Furthermore, it shared some structural similarities and functional motifs with β-defensins from other organisms. On-Def was predicted to be non-hemolytic with anti-bacterial, anti-viral, anti-fungal, anti-cancer, and immunomodulatory potential.

Conclusion

On-Def is the first report of a β-defensin from the red-toothed trigger fish, Odonus niger. The antimicrobial profile showed the potential for further studies as a suitable candidate for antimicrobial peptide therapeutics.

Charoenkwan P, Chiangjong W, Nantasenamat C, Hasan MM, Manavalan B, Shoombuatong W. StackIL6: a stacking ensemble model for improving the prediction of IL-6 inducing peptides. Brief Bioinform 2021;22:6271998. [PMID: 33963832 DOI: 10.1093/bib/bbab172] [Citation(s) in RCA: 87] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2021] [Revised: 03/30/2021] [Accepted: 04/10/2021] [Indexed: 12/13/2022] Open

Abstract

The release of interleukin (IL)-6 is stimulated by antigenic peptides from pathogens as well as by immune cells for activating aggressive inflammation. IL-6 inducing peptides are derived from pathogens and can be used as diagnostic biomarkers for predicting various stages of disease severity as well as being used as IL-6 inhibitors for the suppression of aggressive multi-signaling immune responses. Thus, the accurate identification of IL-6 inducing peptides is of great importance for investigating their mechanism of action as well as for developing diagnostic and immunotherapeutic applications. This study proposes a novel stacking ensemble model (termed StackIL6) for accurately identifying IL-6 inducing peptides. More specifically, StackIL6 was constructed from twelve different feature descriptors derived from three major groups of features (composition-based features, composition-transition-distribution-based features and physicochemical properties-based features) and five popular machine learning algorithms (extremely randomized trees, logistic regression, multi-layer perceptron, support vector machine and random forest). To enhance the utility of baseline models, they were effectively and systematically integrated through a stacking strategy to build the final meta-based model. Extensive benchmarking experiments demonstrated that StackIL6 could achieve significantly better performance than the existing method (IL6PRED) and outperformed its constituent baseline models on both training and independent test datasets, thereby supporting its excellent discrimination and generalization abilities. To facilitate easy access to the StackIL6 model, it was established as a freely available web server accessible at http://camt.pythonanywhere.com/StackIL6. It is anticipated that StackIL6 can help to facilitate rapid screening of promising IL-6 inducing peptides for the development of diagnostic and immunotherapeutic applications in the future.

Convolutional neural networks with image representation of amino acid sequences for protein function prediction. Comput Biol Chem 2021;92:107494. [PMID: 33930742 DOI: 10.1016/j.compbiolchem.2021.107494] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2021] [Accepted: 04/21/2021] [Indexed: 01/11/2023]

Dhall A, Patiyal S, Sharma N, Usmani SS, Raghava GPS. Computer-aided prediction and design of IL-6 inducing peptides: IL-6 plays a crucial role in COVID-19. Brief Bioinform 2021;22:936-945. [PMID: 33034338 PMCID: PMC7665369 DOI: 10.1093/bib/bbaa259] [Citation(s) in RCA: 67] [Impact Index Per Article: 22.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2020] [Revised: 08/28/2020] [Accepted: 09/13/2020] [Indexed: 12/16/2022] Open

Zhang ZM, Guan ZX, Wang F, Zhang D, Ding H. Application of Machine Learning Methods in Predicting Nuclear Receptors and their Families. Med Chem 2021;16:594-604. [PMID: 31584374 DOI: 10.2174/1573406415666191004125551] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2019] [Revised: 06/18/2019] [Accepted: 08/23/2019] [Indexed: 11/22/2022]

Abstract

Nuclear receptors (NRs) are a superfamily of ligand-dependent transcription factors that are closely related to cell development, differentiation, reproduction, homeostasis, and metabolism. According to the alignments of the conserved domains, NRs are classified into the following seven or eight subfamilies: (1) NR1: thyroid hormone like (thyroid hormone, retinoic acid, RAR-related orphan receptor, peroxisome proliferator activated, vitamin D3-like), (2) NR2: HNF4-like (hepatocyte nuclear factor 4, retinoic acid X, tailless-like, COUP-TF-like, USP), (3) NR3: estrogen-like (estrogen, estrogen-related, glucocorticoid-like), (4) NR4: nerve growth factor IB-like (NGFI-B-like), (5) NR5: fushi tarazu-F1 like (fushi tarazu-F1 like), (6) NR6: germ cell nuclear factor like (germ cell nuclear factor), and (7) NR0: knirps like (knirps, knirps-related, embryonic gonad protein, ODR7, trithorax) and DAX like (DAX, SHP), or dividing NR0 into (7) NR7: knirps like and (8) NR8: DAX like. Different NR families have different structural features and functions. Since the function of an NR is closely correlated with the subfamily it belongs to, it is highly desirable to identify NRs and their subfamilies rapidly and effectively. The knowledge acquired is essential for a proper understanding of normal and abnormal cellular mechanisms. With the advent of the post-genomics era, the number of proteins with known sequences has increased explosively. Conventional methods for accurately classifying the family of NRs are experimental means with high cost and low efficiency. This has created a greater need for bioinformatics tools to effectively recognize NRs and their subfamilies for the purpose of understanding their biological function. In this review, we summarized the application of machine learning methods in the prediction of NRs from different aspects. We hope that this review will provide a reference for further research on the classification of NRs and their families.

Zhang S, Zhu F, Yu Q, Zhu X. Identifying DNA-binding proteins based on multi-features and LASSO feature selection. Biopolymers 2021;112:e23419. [PMID: 33476047 DOI: 10.1002/bip.23419] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Revised: 01/08/2021] [Accepted: 01/08/2021] [Indexed: 01/22/2023]

Lathwal A, Kumar R, Raghava GPS. In-silico identification of subunit vaccine candidates against lung cancer-associated oncogenic viruses. Comput Biol Med 2021;130:104215. [PMID: 33465550 DOI: 10.1016/j.compbiomed.2021.104215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 01/08/2021] [Accepted: 01/08/2021] [Indexed: 10/22/2022]

Abstract

Globally, ~20% of cancer malignancies are associated with virus infections. Lung cancer is the most prevalent cancer and has a 10% 5-year survival rate when diagnosed at stage IV. Cancer vaccines and oncolytic immunotherapy are promising treatment strategies for better clinical outcomes in advanced-stage cancer patients. Here, we used a reverse vaccinology approach to devise subunit vaccine candidates against lung cancer-causing oncogenic viruses. Protein components (945) from nine oncogenic virus species were systematically analyzed to identify epitope-based subunit vaccine candidates. Best vaccine candidates were identified based on their predicted ability to stimulate humoral and cell-mediated immunity and avoid self-tolerance. Using a rigorous integrative approach, we identified 125 best antigenic epitopes with predicted B-cell, T-cell, and/or MHC-binding capability and vaccine adjuvant potential. Thirty-two of these antigenic epitopes were predicted to have IL-4/IFN-gamma inducing potential and IL-10 non-inducing potential and were predicted to bind 15 MHC-type I and 49 MHC-type II alleles. All 32 epitopes were non-allergenic and 31 were non-toxic. The identified epitopes showed good conservancy and likely bind a broad class of human HLA alleles, indicating promiscuous potential. The majority of best antigenic epitopes were derived from Human papillomavirus and Epstein-Barr virus proteins. Of the 32 epitopes, 25 promiscuous epitopes were related to E1 and E6 envelope genes and were present in multiple viral strains/species, potentially providing heterologous immunity. Further validating our results, 38 antigenic epitopes were also present in the largest experimentally-validated epitope resource, Immune Epitope Database and Analysis Resource. We further narrowed the selection to 29 antigenic epitopes with the highest immunogenic/immune-boosting potential. These epitopes possess tremendous therapeutic potential as vaccines against lung cancer-causing viruses and should be validated in future experiments. All findings are available at https://webs.iiitd.edu.in/raghava/vlcvirus/.

Hasan MM, Alam MA, Shoombuatong W, Kurata H. IRC-Fuse: improved and robust prediction of redox-sensitive cysteine by fusing of multiple feature representations. J Comput Aided Mol Des 2021;35:315-323. [PMID: 33392948 DOI: 10.1007/s10822-020-00368-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Accepted: 12/06/2020] [Indexed: 12/11/2022]

Zhai Y, Chen Y, Teng Z, Zhao Y. Identifying Antioxidant Proteins by Using Amino Acid Composition and Protein-Protein Interactions. Front Cell Dev Biol 2020;8:591487. [PMID: 33195258 PMCID: PMC7658297 DOI: 10.3389/fcell.2020.591487] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2020] [Accepted: 09/18/2020] [Indexed: 12/13/2022] Open

Charoenkwan P, Yana J, Nantasenamat C, Hasan MM, Shoombuatong W. iUmami-SCM: A Novel Sequence-Based Predictor for Prediction and Analysis of Umami Peptides Using a Scoring Card Method with Propensity Scores of Dipeptides. J Chem Inf Model 2020;60:6666-6678. [DOI: 10.1021/acs.jcim.0c00707] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]

Charoenkwan P, Kanthawong S, Nantasenamat C, Hasan MM, Shoombuatong W. iAMY-SCM: Improved prediction and analysis of amyloid proteins using a scoring card method with propensity scores of dipeptides. Genomics 2020;113:689-698. [PMID: 33017626 DOI: 10.1016/j.ygeno.2020.09.065] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 09/21/2020] [Accepted: 09/30/2020] [Indexed: 01/09/2023]

Abstract

Fast, accurate identification and characterization of amyloid proteins at a large scale is essential for understanding their role in therapeutic intervention strategies. At present, there exists only one in silico model for amyloid protein identification, namely RFAmy, which uses the random forest (RF) model in conjunction with various feature types. However, it suffers from low interpretability for biologists. Thus, it is highly desirable to develop a simple and easily interpretable prediction method with robust accuracy as compared to the existing complicated model. In this study, we propose iAMY-SCM, the first scoring card method-based predictor for predicting and analyzing amyloid proteins. Herein, iAMY-SCM made use of a simple weighted-sum function in conjunction with the propensity scores of dipeptides for amyloid protein identification. Cross-validation results indicated that iAMY-SCM provided an accuracy of 0.895, which corresponded to 10-22% higher performance than that of widely used machine learning models. Furthermore, iAMY-SCM achieved an accuracy of 0.827 as evaluated by an independent test, which was found to be comparable to that of RFAmy and was approximately 9-13% higher than widely used machine learning models. In addition, an analysis of the estimated propensity scores of amino acids and dipeptides was performed to provide insights into the biophysical and biochemical properties of amyloid proteins. As such, this demonstrates that the proposed iAMY-SCM is efficient and reliable in terms of simplicity, interpretability and implementation. To facilitate ease of use of the proposed iAMY-SCM, a user-friendly and publicly accessible web server at http://camt.pythonanywhere.com/iAMY-SCM has been established. We anticipate that iAMY-SCM will be an important tool for facilitating the large-scale prediction and characterization of amyloid proteins.
