Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Dong X, Zhang YJ, Zhang Z. Using weakly conserved motifs hidden in secretion signals to identify type-III effectors from bacterial pathogen genomes. PLoS One 2013;8:e56632. [PMID: 23437191 PMCID: PMC3577856 DOI: 10.1371/journal.pone.0056632] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2012] [Accepted: 01/11/2013] [Indexed: 11/25/2022] Open

For:	Dong X, Zhang YJ, Zhang Z. Using weakly conserved motifs hidden in secretion signals to identify type-III effectors from bacterial pathogen genomes. PLoS One 2013;8:e56632. [PMID: 23437191 PMCID: PMC3577856 DOI: 10.1371/journal.pone.0056632] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2012] [Accepted: 01/11/2013] [Indexed: 11/25/2022] Open

Number

Cited by Other Article(s)

Nielsen H. Protein Sorting Prediction. Methods Mol Biol 2024;2715:27-63. [PMID: 37930519 DOI: 10.1007/978-1-0716-3445-5_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2023]

Wagner N, Alburquerque M, Ecker N, Dotan E, Zerah B, Pena MM, Potnis N, Pupko T. Natural language processing approach to model the secretion signal of type III effectors. FRONTIERS IN PLANT SCIENCE 2022;13:1024405. [PMID: 36388586 PMCID: PMC9659976 DOI: 10.3389/fpls.2022.1024405] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/21/2022] [Accepted: 10/11/2022] [Indexed: 06/16/2023]

Jing R, Wen T, Liao C, Xue L, Liu F, Yu L, Luo J. DeepT3 2.0: improving type III secreted effector predictions by an integrative deep learning framework. NAR Genom Bioinform 2021;3:lqab086. [PMID: 34617013 PMCID: PMC8489581 DOI: 10.1093/nargab/lqab086] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Revised: 08/12/2021] [Accepted: 09/09/2021] [Indexed: 11/13/2022] Open

Hasan MM, Alam MA, Shoombuatong W, Deng HW, Manavalan B, Kurata H. NeuroPred-FRL: an interpretable prediction model for identifying neuropeptide using feature representation learning. Brief Bioinform 2021;22:6272801. [PMID: 33975333 DOI: 10.1093/bib/bbab167] [Citation(s) in RCA: 48] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2021] [Revised: 03/23/2021] [Accepted: 04/09/2021] [Indexed: 12/13/2022] Open

Computational prediction of secreted proteins in gram-negative bacteria. Comput Struct Biotechnol J 2021;19:1806-1828. [PMID: 33897982 PMCID: PMC8047123 DOI: 10.1016/j.csbj.2021.03.019] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Revised: 03/18/2021] [Accepted: 03/18/2021] [Indexed: 12/29/2022] Open

Yu L, Liu F, Li Y, Luo J, Jing R. DeepT3_4: A Hybrid Deep Neural Network Model for the Distinction Between Bacterial Type III and IV Secreted Effectors. Front Microbiol 2021;12:605782. [PMID: 33552038 PMCID: PMC7858263 DOI: 10.3389/fmicb.2021.605782] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2020] [Accepted: 01/04/2021] [Indexed: 01/17/2023] Open

iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2021;2021:6690299. [PMID: 33505516 PMCID: PMC7806399 DOI: 10.1155/2021/6690299] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Revised: 12/24/2020] [Accepted: 12/26/2020] [Indexed: 11/18/2022]

Jing R, Li Y, Xue L, Liu F, Li M, Luo J. autoBioSeqpy: A Deep Learning Tool for the Classification of Biological Sequences. J Chem Inf Model 2020;60:3755-3764. [PMID: 32786512 DOI: 10.1021/acs.jcim.0c00409] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

ACNNT3: Attention-CNN Framework for Prediction of Sequence-Based Bacterial Type III Secreted Effectors. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2020;2020:3974598. [PMID: 32328150 PMCID: PMC7157791 DOI: 10.1155/2020/3974598] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/06/2020] [Revised: 03/09/2020] [Accepted: 03/17/2020] [Indexed: 12/18/2022]

Li J, Wei L, Guo F, Zou Q. EP3: an ensemble predictor that accurately identifies type III secreted effectors. Brief Bioinform 2020;22:1918-1928. [PMID: 32043137 DOI: 10.1093/bib/bbaa008] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Revised: 12/25/2019] [Accepted: 01/10/2020] [Indexed: 01/09/2023] Open

Park J, Tae Eom G, Young Oh J, Hyun Park J, Chang Kim S, Kwang Song J, Hoon Ahn J. High-Level Production of Bacteriotoxic Phospholipase A1 in Bacterial Host Pseudomonas fluorescens Via ABC Transporter-Mediated Secretion and Inducible Expression. Microorganisms 2020;8:microorganisms8020239. [PMID: 32053917 PMCID: PMC7074900 DOI: 10.3390/microorganisms8020239] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2020] [Revised: 02/05/2020] [Accepted: 02/09/2020] [Indexed: 02/03/2023] Open

Fu X, Yang Y. WEDeepT3: predicting type III secreted effectors based on word embedding and deep learning. QUANTITATIVE BIOLOGY 2019. [DOI: 10.1007/s40484-019-0184-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Zeng C, Zou L. An account of in silico identification tools of secreted effector proteins in bacteria and future challenges. Brief Bioinform 2019;20:110-129. [PMID: 28981574 DOI: 10.1093/bib/bbx078] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2017] [Indexed: 01/08/2023] Open

Hasan MM, Rashid MM, Khatun MS, Kurata H. Computational identification of microbial phosphorylation sites by the enhanced characteristics of sequence information. Sci Rep 2019;9:8258. [PMID: 31164681 PMCID: PMC6547684 DOI: 10.1038/s41598-019-44548-x] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2018] [Accepted: 05/20/2019] [Indexed: 11/30/2022] Open

Dhroso A, Eidson S, Korkin D. Genome-wide prediction of bacterial effector candidates across six secretion system types using a feature-based statistical framework. Sci Rep 2018;8:17209. [PMID: 30464223 PMCID: PMC6249201 DOI: 10.1038/s41598-018-33874-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2017] [Accepted: 10/06/2018] [Indexed: 01/12/2023] Open

Xue L, Tang B, Chen W, Luo J. DeepT3: deep convolutional neural networks accurately identify Gram-negative bacterial type III secreted effectors using the N-terminal sequence. Bioinformatics 2018;35:2051-2057. [DOI: 10.1093/bioinformatics/bty931] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2018] [Revised: 10/22/2018] [Accepted: 11/07/2018] [Indexed: 11/12/2022] Open

Wang J, Li J, Yang B, Xie R, Marquez-Lago TT, Leier A, Hayashida M, Akutsu T, Zhang Y, Chou KC, Selkrig J, Zhou T, Song J, Lithgow T. Bastion3: a two-layer ensemble predictor of type III secreted effectors. Bioinformatics 2018;35:2017-2028. [PMID: 30388198 PMCID: PMC7963071 DOI: 10.1093/bioinformatics/bty914] [Citation(s) in RCA: 60] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Revised: 10/15/2018] [Accepted: 10/31/2018] [Indexed: 01/31/2023] Open

Abstract

MOTIVATION

Type III secreted effectors (T3SEs) can be injected into host cell cytoplasm via type III secretion systems (T3SSs) to modulate interactions between Gram-negative bacterial pathogens and their hosts. Due to their relevance in pathogen-host interactions, significant computational efforts have been put toward identification of T3SEs and these in turn have stimulated new T3SE discoveries. However, as T3SEs with new characteristics are discovered, these existing computational tools reveal important limitations: (i) most of the trained machine learning models are based on the N-terminus (or incorporating also the C-terminus) instead of the proteins' complete sequences, and (ii) the underlying models (trained with classic algorithms) employed only few features, most of which were extracted based on sequence-information alone. To achieve better T3SE prediction, we must identify more powerful, informative features and investigate how to effectively integrate these into a comprehensive model.

RESULTS

In this work, we present Bastion3, a two-layer ensemble predictor developed to accurately identify type III secreted effectors from protein sequence data. In contrast with existing methods that employ single models with few features, Bastion3 explores a wide range of features, from various types, trains single models based on these features and finally integrates these models through ensemble learning. We trained the models using a new gradient boosting machine, LightGBM and further boosted the models' performances through a novel genetic algorithm (GA) based two-step parameter optimization strategy. Our benchmark test demonstrates that Bastion3 achieves a much better performance compared to commonly used methods, with an ACC value of 0.959, F-value of 0.958, MCC value of 0.917 and AUC value of 0.956, which comprehensively outperformed all other toolkits by more than 5.6% in ACC value, 5.7% in F-value, 12.4% in MCC value and 5.8% in AUC value. Based on our proposed two-layer ensemble model, we further developed a user-friendly online toolkit, maximizing convenience for experimental scientists toward T3SE prediction. With its design to ease future discoveries of novel T3SEs and improved performance, Bastion3 is poised to become a widely used, state-of-the-art toolkit for T3SE prediction.

AVAILABILITY AND IMPLEMENTATION

http://bastion3.erc.monash.edu/.

CONTACT

selkrig@embl.de or wyztli@163.com or or trevor.lithgow@monash.edu.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Affiliation(s)

Jiawei Wang Infection and Immunity Program, Biomedicine Discovery Institute and Department of Microbiology, Monash University, Melbourne, VIC, Australia
Jiahui Li Infection and Immunity Program, Biomedicine Discovery Institute and Department of Microbiology, Monash University, Melbourne, VIC, Australia,Department of Clinical Laboratory, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
Bingjiao Yang School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, China
Ruopeng Xie School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, China
Tatiana T Marquez-Lago Department of Genetics, School of Medicine, University of Alabama at Birmingham, AL, USA,Department of Cell, Developmental and Integrative Biology, School of Medicine, University of Alabama at Birmingham, AL, USA
André Leier Department of Genetics, School of Medicine, University of Alabama at Birmingham, AL, USA,Department of Cell, Developmental and Integrative Biology, School of Medicine, University of Alabama at Birmingham, AL, USA
Morihiro Hayashida National Institute of Technology, Matsue College, Matsue, Shimane, Japan
Tatsuya Akutsu Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto, Japan
Yanju Zhang School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, China
Kuo-Chen Chou Gordon Life Science Institute, Boston, MA, USA,Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu, China,Center of Excellence in Genomic Medicine Research (CEGMR), King Abdulaziz University, Jeddah, Saudi Arabia
Joel Selkrig European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
Tieli Zhou Department of Clinical Laboratory, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou, China
Jiangning Song To whom correspondence should be addressed.
Trevor Lithgow Infection and Immunity Program, Biomedicine Discovery Institute and Department of Microbiology, Monash University, Melbourne, VIC, Australia

Collapse

An Y, Wang J, Li C, Leier A, Marquez-Lago T, Wilksch J, Zhang Y, Webb GI, Song J, Lithgow T. Comprehensive assessment and performance improvement of effector protein predictors for bacterial secretion systems III, IV and VI. Brief Bioinform 2018;19:148-161. [PMID: 27777222 DOI: 10.1093/bib/bbw100] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2016] [Indexed: 11/15/2022] Open

Hasan MM, Kurata H. GPSuc: Global Prediction of Generic and Species-specific Succinylation Sites by aggregating multiple sequence features. PLoS One 2018;13:e0200283. [PMID: 30312302 PMCID: PMC6193575 DOI: 10.1371/journal.pone.0200283] [Citation(s) in RCA: 49] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2018] [Accepted: 06/22/2018] [Indexed: 01/09/2023] Open

Hasan MM, Khatun MS, Mollah MNH, Yong C, Dianjing G. NTyroSite: Computational Identification of Protein Nitrotyrosine Sites Using Sequence Evolutionary Features. Molecules 2018;23:E1667. [PMID: 29987232 PMCID: PMC6099560 DOI: 10.3390/molecules23071667] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 06/28/2018] [Accepted: 06/28/2018] [Indexed: 02/06/2023] Open

Nielsen H. Protein Sorting Prediction. Methods Mol Biol 2018;1615:23-57. [PMID: 28667600 DOI: 10.1007/978-1-4939-7033-9_2] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]

Hasan MM, Khatun MS, Mollah MNH, Yong C, Guo D. A systematic identification of species-specific protein succinylation sites using joint element features information. Int J Nanomedicine 2017;12:6303-6315. [PMID: 28894368 PMCID: PMC5584904 DOI: 10.2147/ijn.s140875] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open

An Y, Wang J, Li C, Revote J, Zhang Y, Naderer T, Hayashida M, Akutsu T, Webb GI, Lithgow T, Song J. SecretEPDB: a comprehensive web-based resource for secreted effector proteins of the bacterial types III, IV and VI secretion systems. Sci Rep 2017;7:41031. [PMID: 28112271 PMCID: PMC5253721 DOI: 10.1038/srep41031] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2016] [Accepted: 12/14/2016] [Indexed: 12/28/2022] Open

Hasan MM, Guo D, Kurata H. Computational identification of protein S-sulfenylation sites by incorporating the multiple sequence features information. MOLECULAR BIOSYSTEMS 2017;13:2545-2550. [DOI: 10.1039/c7mb00491e] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Scheibner F, Schulz S, Hausner J, Marillonnet S, Büttner D. Type III-Dependent Translocation of HrpB2 by a Nonpathogenic hpaABC Mutant of the Plant-Pathogenic Bacterium Xanthomonas campestris pv. vesicatoria. Appl Environ Microbiol 2016;82:3331-3347. [PMID: 27016569 PMCID: PMC4959247 DOI: 10.1128/aem.00537-16] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2016] [Accepted: 03/21/2016] [Indexed: 11/20/2022] Open

Abstract

UNLABELLED

The plant-pathogenic bacterium Xanthomonas campestris pv. vesicatoria employs a type III secretion (T3S) system to translocate effector proteins into plant cells. The T3S apparatus spans both bacterial membranes and is associated with an extracellular pilus and a channel-like translocon in the host plasma membrane. T3S is controlled by the switch protein HpaC, which suppresses secretion and translocation of the predicted inner rod protein HrpB2 and promotes secretion of translocon and effector proteins. We previously reported that HrpB2 interacts with HpaC and the cytoplasmic domain of the inner membrane protein HrcU (C. Lorenz, S. Schulz, T. Wolsch, O. Rossier, U. Bonas, and D. Büttner, PLoS Pathog 4:e1000094, 2008, http://dx.doi.org/10.1371/journal.ppat.1000094). However, the molecular mechanisms underlying the control of HrpB2 secretion are not yet understood. Here, we located a T3S and translocation signal in the N-terminal 40 amino acids of HrpB2. The results of complementation experiments with HrpB2 deletion derivatives revealed that the T3S signal of HrpB2 is essential for protein function. Furthermore, interaction studies showed that the N-terminal region of HrpB2 interacts with the cytoplasmic domain of HrcU, suggesting that the T3S signal of HrpB2 contributes to substrate docking. Translocation of HrpB2 is suppressed not only by HpaC but also by the T3S chaperone HpaB and its secreted regulator, HpaA. Deletion of hpaA, hpaB, and hpaC leads to a loss of pathogenicity but allows the translocation of fusion proteins between the HrpB2 T3S signal and effector proteins into leaves of host and non-host plants.

IMPORTANCE

The T3S system of the plant-pathogenic bacterium Xanthomonas campestris pv. vesicatoria is essential for pathogenicity and delivers effector proteins into plant cells. T3S depends on HrpB2, which is a component of the predicted periplasmic inner rod structure of the secretion apparatus. HrpB2 is secreted during the early stages of the secretion process and interacts with the cytoplasmic domain of the inner membrane protein HrcU. Here, we localized the secretion and translocation signal of HrpB2 in the N-terminal 40 amino acids and show that this region is sufficient for the interaction with the cytoplasmic domain of HrcU. Our results suggest that the T3S signal of HrpB2 is required for the docking of HrpB2 to the secretion apparatus. Furthermore, we provide experimental evidence that the N-terminal region of HrpB2 is sufficient to target effector proteins for translocation in a nonpathogenic X. campestris pv. vesicatoria strain.

Collapse

Sonah H, Deshmukh RK, Bélanger RR. Computational Prediction of Effector Proteins in Fungi: Opportunities and Challenges. FRONTIERS IN PLANT SCIENCE 2016;7:126. [PMID: 26904083 PMCID: PMC4751359 DOI: 10.3389/fpls.2016.00126] [Citation(s) in RCA: 71] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/27/2015] [Accepted: 01/23/2016] [Indexed: 05/20/2023]

Dong X, Lu X, Zhang Z. BEAN 2.0: an integrated web resource for the identification and functional analysis of type III secreted effectors. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2015;2015:bav064. [PMID: 26120140 PMCID: PMC4483310 DOI: 10.1093/database/bav064] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2015] [Accepted: 06/02/2015] [Indexed: 11/13/2022]

Hasan MM, Zhou Y, Lu X, Li J, Song J, Zhang Z. Computational Identification of Protein Pupylation Sites by Using Profile-Based Composition of k-Spaced Amino Acid Pairs. PLoS One 2015;10:e0129635. [PMID: 26080082 PMCID: PMC4469302 DOI: 10.1371/journal.pone.0129635] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2015] [Accepted: 05/10/2015] [Indexed: 11/20/2022] Open

Luo J, Li W, Liu Z, Guo Y, Pu X, Li M. A sequence-based two-level method for the prediction of type I secreted RTX proteins. Analyst 2015;140:3048-56. [PMID: 25800819 DOI: 10.1039/c5an00311c] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Abstract

Many Gram-negative bacteria use the type I secretion system (T1SS) to translocate a wide range of substrates (type I secreted RTX proteins, T1SRPs) from the cytoplasm across the inner and outer membrane in one step to the extracellular space. Since T1SRPs play an important role in pathogen-host interactions, identifying them is crucial for a full understanding of the pathogenic mechanism of T1SS. However, experimental identification is often time-consuming and expensive. In the post-genomic era, it becomes imperative to predict new T1SRPs using information from the amino acid sequence alone when new proteins are being identified in a high-throughput mode. In this study, we report a two-level method for the first attempt to identify T1SRPs using sequence-derived features and the random forest (RF) algorithm. At the full-length sequence level, the results show that the unique feature of T1SRPs is the presence of variable numbers of the calcium-binding RTX repeats. These RTX repeats have a strong predictive power and so T1SRPs can be well distinguished from non-T1SRPs. At another level, different from that of the secretion signal, we find that a sequence segment located at the last 20-30 C-terminal amino acids may contain important signal information for T1SRP secretion because obvious differences were shown between the corresponding positions of T1SRPs and non-T1SRPs in terms of amino acid and secondary structure compositions. Using five-fold cross-validation, overall accuracies of 97% at the full-length sequence level and 89% at the secretion signal level were achieved through feature evaluation and optimization. Benchmarking on an independent dataset, our method could correctly predict 63 and 66 of 74 T1SRPs at the full-length sequence and secretion signal levels, respectively. We believe that this study will be useful in elucidating the secretion mechanism of T1SS and facilitating hypothesis-driven experimental design and validation.

Collapse

Yang X, Guo Y, Luo J, Pu X, Li M. Effective identification of Gram-negative bacterial type III secreted effectors using position-specific residue conservation profiles. PLoS One 2013;8:e84439. [PMID: 24391954 PMCID: PMC3877298 DOI: 10.1371/journal.pone.0084439] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2013] [Accepted: 11/07/2013] [Indexed: 11/18/2022] Open

Abstract

BACKGROUND

Type III secretion systems (T3SSs) are central to the pathogenesis and specifically deliver their secreted substrates (type III secreted proteins, T3SPs) into host cells. Since T3SPs play a crucial role in pathogen-host interactions, identifying them is crucial to our understanding of the pathogenic mechanisms of T3SSs. This study reports a novel and effective method for identifying the distinctive residues which are conserved different from other SPs for T3SPs prediction. Moreover, the importance of several sequence features was evaluated and further, a promising prediction model was constructed.

RESULTS

Based on the conservation profiles constructed by a position-specific scoring matrix (PSSM), 52 distinctive residues were identified. To our knowledge, this is the first attempt to identify the distinct residues of T3SPs. Of the 52 distinct residues, the first 30 amino acid residues are all included, which is consistent with previous studies reporting that the secretion signal generally occurs within the first 30 residue positions. However, the remaining 22 positions span residues 30-100 were also proven by our method to contain important signal information for T3SP secretion because the translocation of many effectors also depends on the chaperone-binding residues that follow the secretion signal. For further feature optimisation and compression, permutation importance analysis was conducted to select 62 optimal sequence features. A prediction model across 16 species was developed using random forest to classify T3SPs and non-T3 SPs, with high receiver operating curve of 0.93 in the 10-fold cross validation and an accuracy of 94.29% for the test set. Moreover, when performing on a common independent dataset, the results demonstrate that our method outperforms all the others published to date. Finally, the novel, experimentally confirmed T3 effectors were used to further demonstrate the model's correct application. The model and all data used in this paper are freely available at http://cic.scu.edu.cn/bioinformatics/T3SPs.zip.

Collapse

Tung CW. Prediction of pupylation sites using the composition of k-spaced amino acid pairs. J Theor Biol 2013;336:11-7. [PMID: 23871866 DOI: 10.1016/j.jtbi.2013.07.009] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2013] [Revised: 07/05/2013] [Accepted: 07/10/2013] [Indexed: 11/24/2022]

More Evidence for Secretion Signals within the mRNA of Type 3 Secreted Effectors. J Bacteriol 2013;195:2117-8. [DOI: 10.1128/jb.00303-13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open