1
|
Gillani M, Pollastri G. Protein subcellular localization prediction tools. Comput Struct Biotechnol J 2024; 23:1796-1807. [PMID: 38707539 PMCID: PMC11066471 DOI: 10.1016/j.csbj.2024.04.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Revised: 04/11/2024] [Accepted: 04/11/2024] [Indexed: 05/07/2024] Open
Abstract
Protein subcellular localization prediction is of great significance in bioinformatics and biological research. Most of the proteins do not have experimentally determined localization information, computational prediction methods and tools have been acting as an active research area for more than two decades now. Knowledge of the subcellular location of a protein provides valuable information about its functionalities, the functioning of the cell, and other possible interactions with proteins. Fast, reliable, and accurate predictors provides platforms to harness the abundance of sequence data to predict subcellular locations accordingly. During the last decade, there has been a considerable amount of research effort aimed at developing subcellular localization predictors. This paper reviews recent subcellular localization prediction tools in the Eukaryotic, Prokaryotic, and Virus-based categories followed by a detailed analysis. Each predictor is discussed based on its main features, strengths, weaknesses, algorithms used, prediction techniques, and analysis. This review is supported by prediction tools taxonomies that highlight their rele- vant area and examples for uncomplicated categorization and ease of understandability. These taxonomies help users find suitable tools according to their needs. Furthermore, recent research gaps and challenges are discussed to cover areas that need the utmost attention. This survey provides an in-depth analysis of the most recent prediction tools to facilitate readers and can be considered a quick guide for researchers to identify and explore the recent literature advancements.
Collapse
Affiliation(s)
- Maryam Gillani
- School of Computer Science, University College Dublin (UCD), Dublin, D04 V1W8, Ireland
| | - Gianluca Pollastri
- School of Computer Science, University College Dublin (UCD), Dublin, D04 V1W8, Ireland
| |
Collapse
|
2
|
Asadinezhad M, Pakzad I, Asadollahi P, Ghafourian S, Kalani BS. Proteomics Exploration of Brucella melitensis to Design an Innovative Multi-Epitope mRNA Vaccine. Bioinform Biol Insights 2024; 18:11779322241272404. [PMID: 39220468 PMCID: PMC11365029 DOI: 10.1177/11779322241272404] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 07/09/2024] [Indexed: 09/04/2024] Open
Abstract
Brucellosis is a chronic and debilitating disease in humans, causing great economic losses in the livestock industry. Making an effective vaccine is one of the most important concerns for this disease. The new mRNA vaccine technology due to its accuracy and high efficiency has given promising results in various diseases. The objective of this research was to create a novel mRNA vaccine with multiple epitopes targeting Brucella melitensis. Seventeen antigenic proteins and their appropriate epitopes were selected with immunoinformatic tools and surveyed in terms of toxicity, allergenicity, and homology. Then, their presentation and identification by MHC cells and other immune cells were checked with valid tools such as molecular docking, and a multi-epitope protein was modeled, and after optimization, mRNA was analyzed in terms of structure and stability. Ultimately, the immune system's reaction to this novel vaccine was evaluated and the results disclosed that the designed mRNA construct can be an effective and promising vaccine that requires laboratory and clinical trials.
Collapse
Affiliation(s)
- Maryam Asadinezhad
- Students Research Committee, Ilam University of Medical Sciences, Ilam, Iran
- Department of Microbiology, Faculty of Medicine, Ilam University of Medical Sciences, Ilam, Iran
| | - Iraj Pakzad
- Department of Microbiology, Faculty of Medicine, Ilam University of Medical Sciences, Ilam, Iran
- Clinical Microbiology Research Center, Ilam University of Medical Sciences, Ilam, Iran
| | - Parisa Asadollahi
- Department of Microbiology, Faculty of Medicine, Ilam University of Medical Sciences, Ilam, Iran
- Clinical Microbiology Research Center, Ilam University of Medical Sciences, Ilam, Iran
| | - Sobhan Ghafourian
- Department of Microbiology, Faculty of Medicine, Ilam University of Medical Sciences, Ilam, Iran
- Clinical Microbiology Research Center, Ilam University of Medical Sciences, Ilam, Iran
| | - Behrooz Sadeghi Kalani
- Department of Microbiology, Faculty of Medicine, Ilam University of Medical Sciences, Ilam, Iran
- Clinical Microbiology Research Center, Ilam University of Medical Sciences, Ilam, Iran
- Department of Medical Microbiology, School of Medicine, Ilam University of Medical Sciences, Ilam, Iran
| |
Collapse
|
3
|
Nielsen H. Protein Sorting Prediction. Methods Mol Biol 2024; 2715:27-63. [PMID: 37930519 DOI: 10.1007/978-1-0716-3445-5_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2023]
Abstract
Many computational methods are available for predicting protein sorting in bacteria. When comparing them, it is important to know that they can be grouped into three fundamentally different approaches: signal-based, global property-based, and homology-based prediction. In this chapter, the strengths and drawbacks of each of these approaches are described through many examples of methods that predict secretion, integration into membranes, or subcellular locations in general. The aim of this chapter is to provide a user-level introduction to the field with a minimum of computational theory.
Collapse
Affiliation(s)
- Henrik Nielsen
- Department of Health Technology, Technical University of Denmark, Lyngby, Denmark.
| |
Collapse
|
4
|
Wagner N, Alburquerque M, Ecker N, Dotan E, Zerah B, Pena MM, Potnis N, Pupko T. Natural language processing approach to model the secretion signal of type III effectors. FRONTIERS IN PLANT SCIENCE 2022; 13:1024405. [PMID: 36388586 PMCID: PMC9659976 DOI: 10.3389/fpls.2022.1024405] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Accepted: 10/11/2022] [Indexed: 06/16/2023]
Abstract
Type III effectors are proteins injected by Gram-negative bacteria into eukaryotic hosts. In many plant and animal pathogens, these effectors manipulate host cellular processes to the benefit of the bacteria. Type III effectors are secreted by a type III secretion system that must "classify" each bacterial protein into one of two categories, either the protein should be translocated or not. It was previously shown that type III effectors have a secretion signal within their N-terminus, however, despite numerous efforts, the exact biochemical identity of this secretion signal is generally unknown. Computational characterization of the secretion signal is important for the identification of novel effectors and for better understanding the molecular translocation mechanism. In this work we developed novel machine-learning algorithms for characterizing the secretion signal in both plant and animal pathogens. Specifically, we represented each protein as a vector in high-dimensional space using Facebook's protein language model. Classification algorithms were next used to separate effectors from non-effector proteins. We subsequently curated a benchmark dataset of hundreds of effectors and thousands of non-effector proteins. We showed that on this curated dataset, our novel approach yielded substantially better classification accuracy compared to previously developed methodologies. We have also tested the hypothesis that plant and animal pathogen effectors are characterized by different secretion signals. Finally, we integrated the novel approach in Effectidor, a web-server for predicting type III effector proteins, leading to a more accurate classification of effectors from non-effectors.
Collapse
Affiliation(s)
- Naama Wagner
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Michael Alburquerque
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Noa Ecker
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Edo Dotan
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Ben Zerah
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Michelle Mendonca Pena
- Department of Entomology and Plant Pathology, Auburn University, Auburn, AL, United States
| | - Neha Potnis
- Department of Entomology and Plant Pathology, Auburn University, Auburn, AL, United States
| | - Tal Pupko
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| |
Collapse
|
5
|
Nakai K, Wei L. Recent Advances in the Prediction of Subcellular Localization of Proteins and Related Topics. FRONTIERS IN BIOINFORMATICS 2022; 2:910531. [PMID: 36304291 PMCID: PMC9580943 DOI: 10.3389/fbinf.2022.910531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 04/25/2022] [Indexed: 11/13/2022] Open
Abstract
Prediction of subcellular localization of proteins from their amino acid sequences has a long history in bioinformatics and is still actively developing, incorporating the latest advances in machine learning and proteomics. Notably, deep learning-based methods for natural language processing have made great contributions. Here, we review recent advances in the field as well as its related fields, such as subcellular proteomics and the prediction/recognition of subcellular localization from image data.
Collapse
Affiliation(s)
- Kenta Nakai
- Institute of Medical Science, The University of Tokyo, Minato-Ku, Japan
- *Correspondence: Kenta Nakai,
| | - Leyi Wei
- School of Software, Shandong University, Jinan, China
| |
Collapse
|
6
|
Wan Q, Bai T, Liu M, Liu Y, Xie Y, Zhang T, Huang M, Zhang J. Comparative Analysis of the Chalcone-Flavanone Isomerase Genes in Six Citrus Species and Their Expression Analysis in Sweet Orange (Citrus sinensis). Front Genet 2022; 13:848141. [PMID: 35495138 PMCID: PMC9039136 DOI: 10.3389/fgene.2022.848141] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2022] [Accepted: 03/15/2022] [Indexed: 12/14/2022] Open
Abstract
Citrus fruit contains rich nutrients which is edible and of officinal value. Citrus flavanones are widely used in the treatment of cardiovascular and other diseases, and they are a foundational material of Chinese medicine. The chalcone-flavanone isomerase (CHI) plays a key role in flavanone synthesis. Therefore, we comprehensively analyzed CHI genes in Citrus species. Here, thirty CHI genes were identified for the first time in six Citrus species, which were divided into CHI and FAP groups. Evolutionary analysis showed that CHI gene members were highly conserved and were an ancient family. All CsCHI genes showed the highest expression level after the second physiological fruit-falling period in C. sinensis. CsCHI1 and CsCHI3 were highly expressed at 50 days after the flowering (DAF) stage in albedo. The expression of CsFAP2 and CsCHI3 genes at the 50 DAF stage was 16.5 and 24.3 times higher than that at the 220 DAF stage, respectively. The expression of CsCHI1, CsCHI3, and CsFAP2 genes in the peel was higher than that in the pulp, especially in common sweet orange. The CsCHI3 gene maintained a high expression level in the epicarp and juice sac at all periods. The members of CHIs interacted with chalcone synthase (CHS), flavonol synthase/flavanone 3-hydroxylase (FLS) and naringenin, and 2-oxoglutarate 3-dioxygenase (F3H) to form heterodimers, which might together play a regulatory role and participate in the flavonoid pathway. This study will provide the basis for the selection of flavonoids in plant tissues and periods and fundamental information for further functional studies.
Collapse
Affiliation(s)
- Quan Wan
- School of Pharmacy, Jiangxi University of Chinese Medicine, Nanchang, China
- Affiliated Hospital of Inner Mongolia Minzu University, Inner Mongolia Minzu University, Tongliao, China
- *Correspondence: Quan Wan, ; Jinlian Zhang,
| | - Tingting Bai
- School of Pharmacy, Jiangxi University of Chinese Medicine, Nanchang, China
| | - Minmin Liu
- School of Pharmacy, Jiangxi University of Chinese Medicine, Nanchang, China
| | - Ying Liu
- School of Pharmacy, Jiangxi University of Chinese Medicine, Nanchang, China
| | - Yating Xie
- School of Pharmacy, Jiangxi University of Chinese Medicine, Nanchang, China
| | - Tao Zhang
- School of Pharmacy, Jiangxi University of Chinese Medicine, Nanchang, China
| | - Min Huang
- School of Pharmacy, Jiangxi University of Chinese Medicine, Nanchang, China
| | - Jinlian Zhang
- School of Pharmacy, Jiangxi University of Chinese Medicine, Nanchang, China
- *Correspondence: Quan Wan, ; Jinlian Zhang,
| |
Collapse
|
7
|
Lin K, Zhao N, Cai Y, Lin Y, Han S, Zheng S. Genome-Scale Mining of Novel Anchor Proteins of Corynebacterium glutamicum. Front Microbiol 2022; 12:677702. [PMID: 35185806 PMCID: PMC8854784 DOI: 10.3389/fmicb.2021.677702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2021] [Accepted: 12/17/2021] [Indexed: 11/26/2022] Open
Abstract
The display of recombinant proteins on the surfaces of bacteria is a research topic with many possible biotechnology applications—among which, the choice of host cell and anchoring motif is the key for efficient display. Corynebacterium glutamicum is a promising host for surface display due to its natural advantages, while single screening methods and fewer anchor proteins restrict its application. In this study, the subcellular localization (SCL) predictor LocateP and tied-mixture hidden Markov models were used to analyze all five known endogenous anchor proteins of C. glutamicum and test the accuracy of the predictions. Using these two tools, the SCLs of all proteins encoded by the genome of C. glutamicum 13032 were predicted, and 14 potential anchor proteins were screened. Compared with the positive controls NCgl1221 and NCgl1337, three anchoring proteins—NCgl1307, NCgl2775, and NCgl0717—performed better. This study also discussed the applicability of the anchor protein screening method used in this experiment to other bacteria.
Collapse
Affiliation(s)
- Kerui Lin
- Guangdong Key Laboratory of Fermentation and Enzyme Engineering, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, China
- Guangdong Research Center of Industrial Enzyme and Green Manufacturing Technology, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, China
| | - Nannan Zhao
- Guangdong Key Laboratory of Fermentation and Enzyme Engineering, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, China
- Guangdong Research Center of Industrial Enzyme and Green Manufacturing Technology, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, China
| | - Youhua Cai
- Star Lake Bioscience Co. Inc., Zhaoqing Guangdong, Zhaoqing, China
| | - Ying Lin
- Guangdong Key Laboratory of Fermentation and Enzyme Engineering, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, China
- Guangdong Research Center of Industrial Enzyme and Green Manufacturing Technology, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, China
| | - Shuangyan Han
- Guangdong Key Laboratory of Fermentation and Enzyme Engineering, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, China
- Guangdong Research Center of Industrial Enzyme and Green Manufacturing Technology, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, China
- Shuangyan Han,
| | - Suiping Zheng
- Guangdong Key Laboratory of Fermentation and Enzyme Engineering, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, China
- Guangdong Research Center of Industrial Enzyme and Green Manufacturing Technology, School of Biology and Biological Engineering, South China University of Technology, Guangzhou, China
- *Correspondence: Suiping Zheng,
| |
Collapse
|
8
|
Lau WYV, Hoad GR, Jin V, Winsor GL, Madyan A, Gray KL, Laird MR, Lo R, Brinkman FSL. PSORTdb 4.0: expanded and redesigned bacterial and archaeal protein subcellular localization database incorporating new secondary localizations. Nucleic Acids Res 2021; 49:D803-D808. [PMID: 33313828 PMCID: PMC7778896 DOI: 10.1093/nar/gkaa1095] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 10/23/2020] [Accepted: 12/10/2020] [Indexed: 01/18/2023] Open
Abstract
Protein subcellular localization (SCL) is important for understanding protein function, genome annotation, and aids identification of potential cell surface diagnostic markers, drug targets, or vaccine components. PSORTdb comprises ePSORTdb, a manually curated database of experimentally verified protein SCLs, and cPSORTdb, a pre-computed database of PSORTb-predicted SCLs for NCBI's RefSeq deduced bacterial and archaeal proteomes. We now report PSORTdb 4.0 (http://db.psort.org/). It features a website refresh, in particular a more user-friendly database search. It also addresses the need to uniquely identify proteins from NCBI genomes now that GI numbers have been retired. It further expands both ePSORTdb and cPSORTdb, including additional data about novel secondary localizations, such as proteins found in bacterial outer membrane vesicles. Protein predictions in cPSORTdb have increased along with the number of available microbial genomes, from approximately 13 million when PSORTdb 3.0 was released, to over 66 million currently. Now, analyses of both complete and draft genomes are included. This expanded database will be of wide use to researchers developing SCL predictors or studying diverse microbes, including medically, agriculturally and industrially important species that have both classic or atypical cell envelope structures or vesicles.
Collapse
Affiliation(s)
- Wing Yin Venus Lau
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada
| | - Gemma R Hoad
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada
| | - Vivian Jin
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada
| | - Geoffrey L Winsor
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada
| | - Ashmeet Madyan
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada
| | - Kristen L Gray
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada
| | - Matthew R Laird
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada
| | - Raymond Lo
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada
| | - Fiona S L Brinkman
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada
| |
Collapse
|
9
|
Imai K, Nakai K. Tools for the Recognition of Sorting Signals and the Prediction of Subcellular Localization of Proteins From Their Amino Acid Sequences. Front Genet 2020; 11:607812. [PMID: 33324450 PMCID: PMC7723863 DOI: 10.3389/fgene.2020.607812] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Accepted: 11/03/2020] [Indexed: 12/13/2022] Open
Abstract
At the time of translation, nascent proteins are thought to be sorted into their final subcellular localization sites, based on the part of their amino acid sequences (i.e., sorting or targeting signals). Thus, it is interesting to computationally recognize these signals from the amino acid sequences of any given proteins and to predict their final subcellular localization with such information, supplemented with additional information (e.g., k-mer frequency). This field has a long history and many prediction tools have been released. Even in this era of proteomic atlas at the single-cell level, researchers continue to develop new algorithms, aiming at accessing the impact of disease-causing mutations/cell type-specific alternative splicing, for example. In this article, we overview the entire field and discuss its future direction.
Collapse
Affiliation(s)
- Kenichiro Imai
- Cellular and Molecular Biotechnology Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan
| | - Kenta Nakai
- The Institute of Medical Science, The University of Tokyo, Tokyo, Japan
| |
Collapse
|