1
|
Gillani M, Pollastri G. Protein subcellular localization prediction tools. Comput Struct Biotechnol J 2024; 23:1796-1807. [PMID: 38707539 PMCID: PMC11066471 DOI: 10.1016/j.csbj.2024.04.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Revised: 04/11/2024] [Accepted: 04/11/2024] [Indexed: 05/07/2024] Open
Abstract
Protein subcellular localization prediction is of great significance in bioinformatics and biological research. Most of the proteins do not have experimentally determined localization information, computational prediction methods and tools have been acting as an active research area for more than two decades now. Knowledge of the subcellular location of a protein provides valuable information about its functionalities, the functioning of the cell, and other possible interactions with proteins. Fast, reliable, and accurate predictors provides platforms to harness the abundance of sequence data to predict subcellular locations accordingly. During the last decade, there has been a considerable amount of research effort aimed at developing subcellular localization predictors. This paper reviews recent subcellular localization prediction tools in the Eukaryotic, Prokaryotic, and Virus-based categories followed by a detailed analysis. Each predictor is discussed based on its main features, strengths, weaknesses, algorithms used, prediction techniques, and analysis. This review is supported by prediction tools taxonomies that highlight their rele- vant area and examples for uncomplicated categorization and ease of understandability. These taxonomies help users find suitable tools according to their needs. Furthermore, recent research gaps and challenges are discussed to cover areas that need the utmost attention. This survey provides an in-depth analysis of the most recent prediction tools to facilitate readers and can be considered a quick guide for researchers to identify and explore the recent literature advancements.
Collapse
Affiliation(s)
- Maryam Gillani
- School of Computer Science, University College Dublin (UCD), Dublin, D04 V1W8, Ireland
| | - Gianluca Pollastri
- School of Computer Science, University College Dublin (UCD), Dublin, D04 V1W8, Ireland
| |
Collapse
|
2
|
Gillani M, Pollastri G. SCLpred-ECL: Subcellular Localization Prediction by Deep N-to-1 Convolutional Neural Networks. Int J Mol Sci 2024; 25:5440. [PMID: 38791479 PMCID: PMC11121631 DOI: 10.3390/ijms25105440] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2024] [Revised: 05/09/2024] [Accepted: 05/11/2024] [Indexed: 05/26/2024] Open
Abstract
The subcellular location of a protein provides valuable insights to bioinformaticians in terms of drug designs and discovery, genomics, and various other aspects of medical research. Experimental methods for protein subcellular localization determination are time-consuming and expensive, whereas computational methods, if accurate, would represent a much more efficient alternative. This article introduces an ab initio protein subcellular localization predictor based on an ensemble of Deep N-to-1 Convolutional Neural Networks. Our predictor is trained and tested on strict redundancy-reduced datasets and achieves 63% accuracy for the diverse number of classes. This predictor is a step towards bridging the gap between a protein sequence and the protein's function. It can potentially provide information about protein-protein interaction to facilitate drug design and processes like vaccine production that are essential to disease prevention.
Collapse
Affiliation(s)
- Maryam Gillani
- School of Computer Science, University College Dublin (UCD), D04 V1W8 Dublin, Ireland;
| | | |
Collapse
|
3
|
Lai D, Zhang K, He Y, Fan Y, Li W, Shi Y, Gao Y, Huang X, He J, Zhao H, Lu X, Xiao Y, Cheng J, Ruan J, Georgiev MI, Fernie AR, Zhou M. Multi-omics identification of a key glycosyl hydrolase gene FtGH1 involved in rutin hydrolysis in Tartary buckwheat (Fagopyrum tataricum). PLANT BIOTECHNOLOGY JOURNAL 2024; 22:1206-1223. [PMID: 38062934 PMCID: PMC11022807 DOI: 10.1111/pbi.14259] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 10/16/2023] [Accepted: 11/20/2023] [Indexed: 04/18/2024]
Abstract
Rutin, a flavonoid rich in buckwheat, is important for human health and plant resistance to external stresses. The hydrolysis of rutin to quercetin underlies the bitter taste of Tartary buckwheat. In order to identify rutin hydrolysis genes, a 200 genotypes mini-core Tartary buckwheat germplasm resource was re-sequenced with 30-fold coverage depth. By combining the content of the intermediate metabolites of rutin metabolism with genome resequencing data, metabolite genome-wide association analyses (GWAS) eventually identified a glycosyl hydrolase gene FtGH1, which could hydrolyse rutin to quercetin. This function was validated both in Tartary buckwheat overexpression hairy roots and in vitro enzyme activity assays. Mutation of the two key active sites, which were determined by molecular docking and experimentally verified via overexpression in hairy roots and transient expression in tobacco leaves, exhibited abnormal subcellular localization, suggesting functional changes. Sequence analysis revealed that mutation of the FtGH1 promoter in accessions of two haplotypes might be necessary for enzymatic activity. Co-expression analysis and GWAS revealed that FtbHLH165 not only repressed FtGH1 expression, but also increased seed length. This work reveals a potential mechanism behind rutin metabolism, which should provide both theoretical support in the study of flavonoid metabolism and in the molecular breeding of Tartary buckwheat.
Collapse
Affiliation(s)
- Dili Lai
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop SciencesChinese Academy of Agricultural SciencesBeijingChina
- College of AgricultureGuizhou UniversityGuiyangChina
| | - Kaixuan Zhang
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Yuqi He
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Yu Fan
- School of Food and Biological EngineeringChengdu UniversityChengduChina
| | - Wei Li
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Yaliang Shi
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Yuanfen Gao
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Xu Huang
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Jiayue He
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Hui Zhao
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Xiang Lu
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop SciencesChinese Academy of Agricultural SciencesBeijingChina
| | - Yawen Xiao
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop SciencesChinese Academy of Agricultural SciencesBeijingChina
| | | | - Jingjun Ruan
- College of AgricultureGuizhou UniversityGuiyangChina
| | - Milen I. Georgiev
- Laboratory of Metabolomics, Institute of MicrobiologyBulgarian Academy of SciencesPlovdivBulgaria
- Center of Plant Systems Biology and BiotechnologyPlovdivBulgaria
| | - Alisdair R. Fernie
- Center of Plant Systems Biology and BiotechnologyPlovdivBulgaria
- Department of Molecular PhysiologyMax‐Planck‐Institute of Molecular Plant PhysiologyPotsdam‐GolmGermany
| | - Meiliang Zhou
- State Key Laboratory of Crop Gene Resources and Breeding, Institute of Crop SciencesChinese Academy of Agricultural SciencesBeijingChina
| |
Collapse
|
4
|
Chen S, Li K, Tan B, Wei Y, Li H. Identification of Pyrabactin resistance 1-like (PYL) genes in Brachypodium distachyon and functional characterization of BdPYL5. JOURNAL OF PLANT PHYSIOLOGY 2023; 283:153949. [PMID: 36842335 DOI: 10.1016/j.jplph.2023.153949] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Revised: 01/28/2023] [Accepted: 02/18/2023] [Indexed: 06/18/2023]
Abstract
Abscisic acid (ABA) is an endogenous phytohormone that plays an important role in regulating plant growth, development, and stress response. Pyrabactin resistance 1-like (PYR/PYL) proteins are ABA receptors and core components of ABA signalling in plants. This study identified nine PYL genes in the Brachypodium distachyon genome and they distribute on three chromosomes. Phylogenetical BdPYLs were classified into three clades. 81 protein-protein interactions between 9 BdPYLs and 9 BdPP2C proteins were predicted and 66 pairs were verified by yeast two-hybrid assay previously. Relatively, BdPYL genes are expressed in leaves at high level, and ABA and drought regulate their expression. A homologue of Arabidopsis PYL9, BdPYL5 was selected to overexpress in Arabidopsis to characterize its function. In general, overexpression of BdPYL5 enhanced ABA sensitivity and drought tolerance, implying its conserved function. Our study lays the foundation for further functional elucidation of BdPYL genes.
Collapse
Affiliation(s)
- Shoukun Chen
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, 712000, China.
| | - Kunjie Li
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, 712000, China.
| | - Bin Tan
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, 712000, China.
| | - Yaning Wei
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, 712000, China.
| | - Haifeng Li
- State Key Laboratory of Crop Stress Biology for Arid Areas, College of Agronomy, Northwest A&F University, Yangling, 712000, China.
| |
Collapse
|
5
|
Systems biology's role in leveraging microalgal biomass potential: Current status and future perspectives. ALGAL RES 2022. [DOI: 10.1016/j.algal.2022.102963] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
6
|
Abstract
Western blotting (WB), also known as immunoblotting, is a well-known molecular biology method that biologists often use to investigate many features of the protein, ranging from basic protein analysis to disease detection. WB is simple, unique, rapid, widely used routine tool with easy interpretation and definite results. It is being used in various fields of science, research and development, diagnostic labs and hospitals. The principle of WB is to accomplish the separation of proteins based on molecular weight and charge. This review addresses in detail the individual steps involved in the WB technique, its troubleshooting, internal loading controls, total protein staining and its diverse applications in scientific research and clinical settings, along with its future perspectives.
Collapse
|
7
|
Nakai K, Wei L. Recent Advances in the Prediction of Subcellular Localization of Proteins and Related Topics. FRONTIERS IN BIOINFORMATICS 2022; 2:910531. [PMID: 36304291 PMCID: PMC9580943 DOI: 10.3389/fbinf.2022.910531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 04/25/2022] [Indexed: 11/13/2022] Open
Abstract
Prediction of subcellular localization of proteins from their amino acid sequences has a long history in bioinformatics and is still actively developing, incorporating the latest advances in machine learning and proteomics. Notably, deep learning-based methods for natural language processing have made great contributions. Here, we review recent advances in the field as well as its related fields, such as subcellular proteomics and the prediction/recognition of subcellular localization from image data.
Collapse
Affiliation(s)
- Kenta Nakai
- Institute of Medical Science, The University of Tokyo, Minato-Ku, Japan
- *Correspondence: Kenta Nakai,
| | - Leyi Wei
- School of Software, Shandong University, Jinan, China
| |
Collapse
|
8
|
Liu T, Chen J, Zhang Q, Hippe K, Hunt C, Le T, Cao R, Tang H. The Development of Machine Learning Methods in discriminating Secretory Proteins of Malaria Parasite. Curr Med Chem 2021; 29:807-821. [PMID: 34636289 DOI: 10.2174/0929867328666211005140625] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Revised: 07/28/2021] [Accepted: 08/15/2021] [Indexed: 11/22/2022]
Abstract
Malaria caused by Plasmodium falciparum is one of the major infectious diseases in the world. It is essential to exploit an effective method to predict secretory proteins of malaria parasites to develop effective cures and treatment. Biochemical assays can provide details for accurate identification of the secretory proteins, but these methods are expensive and time-consuming. In this paper, we summarized the machine learning-based identification algorithms and compared the construction strategies between different computational methods. Also, we discussed the use of machine learning to improve the ability of algorithms to identify proteins secreted by malaria parasites.
Collapse
Affiliation(s)
- Ting Liu
- School of Basic Medical Sciences, Southwest Medical University, Luzhou. China
| | - Jiamao Chen
- School of Basic Medical Sciences, Southwest Medical University, Luzhou. China
| | - Qian Zhang
- School of Basic Medical Sciences, Southwest Medical University, Luzhou. China
| | - Kyle Hippe
- Department of Computer Science, Pacific Lutheran University. United States
| | - Cassandra Hunt
- Department of Computer Science, Pacific Lutheran University. United States
| | - Thu Le
- Department of Computer Science, Pacific Lutheran University. United States
| | - Renzhi Cao
- Department of Computer Science, Pacific Lutheran University. United States
| | - Hua Tang
- School of Basic Medical Sciences, Southwest Medical University, Luzhou. China
| |
Collapse
|