Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Ding S, Li Y, Shi Z, Yan S. A protein structural classes prediction method based on predicted secondary structure and PSI-BLAST profile. Biochimie 2014;97:60-5. [DOI: 10.1016/j.biochi.2013.09.013] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2013] [Accepted: 09/16/2013] [Indexed: 10/26/2022]

For:	Ding S, Li Y, Shi Z, Yan S. A protein structural classes prediction method based on predicted secondary structure and PSI-BLAST profile. Biochimie 2014;97:60-5. [DOI: 10.1016/j.biochi.2013.09.013] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2013] [Accepted: 09/16/2013] [Indexed: 10/26/2022]

Number

Cited by Other Article(s)

Reis WF, Silva MES, Gondim ACS, Torres RCF, Carneiro RF, Nagano CS, Sampaio AH, Teixeira CS, Gomes LCBF, Sousa BL, Andrade AL, Teixeira EH, Vasconcelos MA. Glucose-Binding Dioclea bicolor Lectin (DBL): Purification, Characterization, Structural Analysis, and Antibacterial Properties. Protein J 2024;43:559-576. [PMID: 38615284 DOI: 10.1007/s10930-024-10199-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/07/2024] [Indexed: 04/15/2024]

Affiliation(s)

Willian F Reis Departamento de Ciências da Natureza E da Terra, Universidade Do Estado de Minas Gerais, Unidade de Divinópolis, Divinópolis, MG, Brazil
Marcos E S Silva Faculdade de Educação de Itapipoca, Universidade Estadual Do Ceará, Itapipoca, CE, Brazil Faculdade de Ciências Exatas E Naturais, Universidade Do Estado Do Rio Grande Do Norte, Mossoró, RN, Brazil
Ana C S Gondim Departamento de Química Orgânica E Inorgânica, Universidade Federal Do Ceará, Fortaleza, CE, Brazil
Renato C F Torres Centro de Ciências Agrárias E da Biodiversidade, Universidade Federal Do Cariri, Crato, CE, Brazil
Rômulo F Carneiro Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal Do Ceará, Fortaleza, CE, Brazil
Celso S Nagano Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal Do Ceará, Fortaleza, CE, Brazil
Alexandre H Sampaio Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal Do Ceará, Fortaleza, CE, Brazil
Claudener S Teixeira Centro de Ciências Agrárias E da Biodiversidade, Universidade Federal Do Cariri, Crato, CE, Brazil
Lenita C B F Gomes Faculdade de Filosofia Dom Aureliano Matos, Universidade Estadual Do Ceará, Limoeiro Do Norte, CE, Brazil
Bruno L Sousa Faculdade de Filosofia Dom Aureliano Matos, Universidade Estadual Do Ceará, Limoeiro Do Norte, CE, Brazil
Alexandre L Andrade Laboratório Integrado de Biomoléculas - LIBS, Departamento de Patologia E Medicina Legal, Universidade Federal Do Ceará, Fortaleza, CE, Brazil
Edson H Teixeira Laboratório Integrado de Biomoléculas - LIBS, Departamento de Patologia E Medicina Legal, Universidade Federal Do Ceará, Fortaleza, CE, Brazil
Mayron A Vasconcelos Departamento de Ciências da Natureza E da Terra, Universidade Do Estado de Minas Gerais, Unidade de Divinópolis, Divinópolis, MG, Brazil. Faculdade de Educação de Itapipoca, Universidade Estadual Do Ceará, Itapipoca, CE, Brazil. Faculdade de Ciências Exatas E Naturais, Universidade Do Estado Do Rio Grande Do Norte, Mossoró, RN, Brazil.

Collapse

Abbass J, Parisi C. Machine learning-based prediction of proteins' architecture using sequences of amino acids and structural alphabets. J Biomol Struct Dyn 2024:1-16. [PMID: 38505995 DOI: 10.1080/07391102.2024.2328736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Accepted: 03/05/2024] [Indexed: 03/21/2024]

Wu J, Qing H, Ouyang J, Zhou J, Gao Z, Mason CE, Liu Z, Shi T. HiFun: homology independent protein function prediction by a novel protein-language self-attention model. Brief Bioinform 2023;24:bbad311. [PMID: 37649370 DOI: 10.1093/bib/bbad311] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 07/31/2023] [Accepted: 08/08/2023] [Indexed: 09/01/2023] Open

Li Y, Wei Y, Xu S, Tan Q, Zong L, Wang J, Wang Y, Chen J, Hong L, Li Y. AcrNET: predicting anti-CRISPR with deep learning. Bioinformatics 2023;39:btad259. [PMID: 37084259 PMCID: PMC10174705 DOI: 10.1093/bioinformatics/btad259] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Revised: 04/08/2023] [Accepted: 04/12/2023] [Indexed: 04/22/2023] Open

Abstract

MOTIVATION

As an important group of proteins discovered in phages, anti-CRISPR inhibits the activity of the immune system of bacteria (i.e. CRISPR-Cas), offering promise for gene editing and phage therapy. However, the prediction and discovery of anti-CRISPR are challenging due to their high variability and fast evolution. Existing biological studies rely on known CRISPR and anti-CRISPR pairs, which may not be practical considering the huge number. Computational methods struggle with prediction performance. To address these issues, we propose a novel deep neural network for anti-CRISPR analysis (AcrNET), which achieves significant performance.

RESULTS

On both the cross-fold and cross-dataset validation, our method outperforms the state-of-the-art methods. Notably, AcrNET improves the prediction performance by at least 15% regarding the F1 score for the cross-dataset test problem comparing with state-of-art Deep Learning method. Moreover, AcrNET is the first computational method to predict the detailed anti-CRISPR classes, which may help illustrate the anti-CRISPR mechanism. Taking advantage of a Transformer protein language model ESM-1b, which was pre-trained on 250 million protein sequences, AcrNET overcomes the data scarcity problem. Extensive experiments and analysis suggest that the Transformer model feature, evolutionary feature, and local structure feature complement each other, which indicates the critical properties of anti-CRISPR proteins. AlphaFold prediction, further motif analysis, and docking experiments further demonstrate that AcrNET can capture the evolutionarily conserved pattern and the interaction between anti-CRISPR and the target implicitly.

AVAILABILITY AND IMPLEMENTATION

Web server: https://proj.cse.cuhk.edu.hk/aihlab/AcrNET/. Training code and pre-trained model are available at.

Collapse

Chen Y, Gao L, Zhang T. Stack-VTP: prediction of vesicle transport proteins based on stacked ensemble classifier and evolutionary information. BMC Bioinformatics 2023;24:137. [PMID: 37029385 PMCID: PMC10080812 DOI: 10.1186/s12859-023-05257-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2022] [Accepted: 03/28/2023] [Indexed: 04/09/2023] Open

Zhu L, Wang X, Li F, Song J. PreAcrs: a machine learning framework for identifying anti-CRISPR proteins. BMC Bioinformatics 2022;23:444. [PMID: 36284264 PMCID: PMC9597991 DOI: 10.1186/s12859-022-04986-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2022] [Accepted: 10/14/2022] [Indexed: 11/10/2022] Open

Xu Y, Wojtczak D. Dive into machine learning algorithms for influenza virus host prediction with hemagglutinin sequences. Biosystems 2022;220:104740. [DOI: 10.1016/j.biosystems.2022.104740] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2022] [Revised: 07/02/2022] [Accepted: 07/16/2022] [Indexed: 11/26/2022]

Pritam M, Singh G, Kumar R, Singh SP. Screening of potential antigens from whole proteome and development of multi-epitope vaccine against Rhizopus delemar using immunoinformatics approaches. J Biomol Struct Dyn 2022;41:2118-2145. [PMID: 35067195 DOI: 10.1080/07391102.2022.2028676] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]

Gong Y, Dong B, Zhang Z, Zhai Y, Gao B, Zhang T, Zhang J. VTP-Identifier: Vesicular Transport Proteins Identification Based on PSSM Profiles and XGBoost. Front Genet 2022;12:808856. [PMID: 35047020 PMCID: PMC8762342 DOI: 10.3389/fgene.2021.808856] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Accepted: 11/29/2021] [Indexed: 11/13/2022] Open

Yang C, Yang Z, Tong K, Wang J, Yang W, Yu R, Jiang F, Ji Y. Homology modeling and molecular docking simulation of martentoxin as a specific inhibitor of the BK channel. ANNALS OF TRANSLATIONAL MEDICINE 2022;10:71. [PMID: 35282126 PMCID: PMC8848368 DOI: 10.21037/atm-21-6967] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/11/2021] [Accepted: 01/13/2022] [Indexed: 11/18/2022]

Jia Y, Huang S, Zhang T. KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest. Front Genet 2021;12:811158. [PMID: 34912382 PMCID: PMC8667860 DOI: 10.3389/fgene.2021.811158] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Accepted: 11/15/2021] [Indexed: 02/04/2023] Open

Sousa ARDO, Andrade FRN, Chaves RP, Sousa BLD, Lima DBD, Souza RODS, da Silva CGL, Teixeira CS, Sampaio AH, Nagano CS, Carneiro RF. Structural characterization of a galectin isolated from the marine sponge Chondrilla caribensis with leishmanicidal potential. Biochim Biophys Acta Gen Subj 2021;1865:129992. [PMID: 34508835 DOI: 10.1016/j.bbagen.2021.129992] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2021] [Revised: 08/11/2021] [Accepted: 08/19/2021] [Indexed: 01/08/2023]

Affiliation(s)

Andressa Rocha de Oliveira Sousa Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal do Ceará, Campus do Pici s/n, bloco 871, 60440-970 Fortaleza, Ceará, Brazil
Francisco Regivânio Nascimento Andrade Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal do Ceará, Campus do Pici s/n, bloco 871, 60440-970 Fortaleza, Ceará, Brazil
Renata Pinheiro Chaves Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal do Ceará, Campus do Pici s/n, bloco 871, 60440-970 Fortaleza, Ceará, Brazil
Bruno Lopes de Sousa Faculdade de Filosofia Dom Aureliano Matos, Universidade Estadual do Ceará, Brazil
Dimas Batista de Lima Faculdade de Medicina, Universidade Federal do Cariri, Barbalha, CE, Brazil
Racquel Oliveira da Silva Souza Faculdade de Medicina, Universidade Federal do Cariri, Barbalha, CE, Brazil
Cláudio Gleidiston Lima da Silva Faculdade de Medicina, Universidade Federal do Cariri, Barbalha, CE, Brazil
Claudener Souza Teixeira Centro de Ciências Agrárias e da Biodiversidade, Universidade Federal do Cariri, Crato, CE, Brazil
Alexandre Holanda Sampaio Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal do Ceará, Campus do Pici s/n, bloco 871, 60440-970 Fortaleza, Ceará, Brazil
Celso Shiniti Nagano Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal do Ceará, Campus do Pici s/n, bloco 871, 60440-970 Fortaleza, Ceará, Brazil
Rômulo Farias Carneiro Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal do Ceará, Campus do Pici s/n, bloco 871, 60440-970 Fortaleza, Ceará, Brazil.

Collapse

Zervou MA, Doutsi E, Pavlidis P, Tsakalides P. Structural classification of proteins based on the computationally efficient recurrence quantification analysis and horizontal visibility graphs. Bioinformatics 2021;37:1796-1804. [PMID: 34048559 DOI: 10.1093/bioinformatics/btab407] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Revised: 04/13/2021] [Accepted: 05/27/2021] [Indexed: 11/14/2022] Open

iT3SE-PX: Identification of Bacterial Type III Secreted Effectors Using PSSM Profiles and XGBoost Feature Selection. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2021;2021:6690299. [PMID: 33505516 PMCID: PMC7806399 DOI: 10.1155/2021/6690299] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Revised: 12/24/2020] [Accepted: 12/26/2020] [Indexed: 11/18/2022]

Zhang J, Lv L, Lu D, Kong D, Al-Alashaari MAA, Zhao X. Variable selection from a feature representing protein sequences: a case of classification on bacterial type IV secreted effectors. BMC Bioinformatics 2020;21:480. [PMID: 33109082 PMCID: PMC7590791 DOI: 10.1186/s12859-020-03826-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2020] [Accepted: 10/19/2020] [Indexed: 12/13/2022] Open

Abstract

Background

Classification of certain proteins with specific functions is momentous for biological research. Encoding approaches of protein sequences for feature extraction play an important role in protein classification. Many computational methods (namely classifiers) are used for classification on protein sequences according to various encoding approaches. Commonly, protein sequences keep certain labels corresponding to different categories of biological functions (e.g., bacterial type IV secreted effectors or not), which makes protein prediction a fantasy. As to protein prediction, a kernel set of protein sequences keeping certain labels certified by biological experiments should be existent in advance. However, it has been hardly ever seen in prevailing researches. Therefore, unsupervised learning rather than supervised learning (e.g. classification) should be considered. As to protein classification, various classifiers may help to evaluate the effectiveness of different encoding approaches. Besides, variable selection from an encoded feature representing protein sequences is an important issue that also needs to be considered.

Results

Focusing on the latter problem, we propose a new method for variable selection from an encoded feature representing protein sequences. Taking a benchmark dataset containing 1947 protein sequences as a case, experiments are made to identify bacterial type IV secreted effectors (T4SE) from protein sequences, which are composed of 399 T4SE and 1548 non-T4SE. Comparable and quantified results are obtained only using certain components of the encoded feature, i.e., position-specific scoring matix, and that indicates the effectiveness of our method.

Conclusions

Certain variables other than an encoded feature they belong to do work for discrimination between different types of proteins. In addition, ensemble classifiers with an automatic assignment of different base classifiers do achieve a better classification result.

Collapse

Wang J, Dai W, Li J, Xie R, Dunstan RA, Stubenrauch C, Zhang Y, Lithgow T. PaCRISPR: a server for predicting and visualizing anti-CRISPR proteins. Nucleic Acids Res 2020;48:W348-W357. [PMID: 32459325 PMCID: PMC7319593 DOI: 10.1093/nar/gkaa432] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Revised: 04/22/2020] [Accepted: 05/13/2020] [Indexed: 01/09/2023] Open

Pritam M, Singh G, Swaroop S, Singh AK, Pandey B, Singh SP. A cutting-edge immunoinformatics approach for design of multi-epitope oral vaccine against dreadful human malaria. Int J Biol Macromol 2020;158:159-179. [PMID: 32360460 PMCID: PMC7189201 DOI: 10.1016/j.ijbiomac.2020.04.191] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2020] [Revised: 03/28/2020] [Accepted: 04/22/2020] [Indexed: 12/18/2022]

Ge Y, Zhao S, Zhao X. A step-by-step classification algorithm of protein secondary structures based on double-layer SVM model. Genomics 2020;112:1941-1946. [DOI: 10.1016/j.ygeno.2019.11.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2019] [Revised: 10/15/2019] [Accepted: 11/11/2019] [Indexed: 11/26/2022]

Apurva M, Mazumdar H. Predicting structural class for protein sequences of 40% identity based on features of primary and secondary structure using Random Forest algorithm. Comput Biol Chem 2020;84:107164. [DOI: 10.1016/j.compbiolchem.2019.107164] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2019] [Revised: 10/25/2019] [Accepted: 11/10/2019] [Indexed: 02/08/2023]

Guo L, Wang S, Li M, Cao Z. Accurate classification of membrane protein types based on sequence and evolutionary information using deep learning. BMC Bioinformatics 2019;20:700. [PMID: 31874615 PMCID: PMC6929490 DOI: 10.1186/s12859-019-3275-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open

Wang S, Wang X. Prediction of protein structural classes by different feature expressions based on 2-D wavelet denoising and fusion. BMC Bioinformatics 2019;20:701. [PMID: 31874617 PMCID: PMC6929547 DOI: 10.1186/s12859-019-3276-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open

Zhu XJ, Feng CQ, Lai HY, Chen W, Hao L. Predicting protein structural classes for low-similarity sequences by evaluating different features. Knowl Based Syst 2019. [DOI: 10.1016/j.knosys.2018.10.007] [Citation(s) in RCA: 69] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

A novel feature selection method to predict protein structural class. Comput Biol Chem 2018;76:118-129. [DOI: 10.1016/j.compbiolchem.2018.06.007] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2018] [Revised: 05/14/2018] [Accepted: 06/30/2018] [Indexed: 01/05/2023]

Wang J, Yang B, Revote J, Leier A, Marquez-Lago TT, Webb G, Song J, Chou KC, Lithgow T. POSSUM: a bioinformatics toolkit for generating numerical sequence feature descriptors based on PSSM profiles. Bioinformatics 2018;33:2756-2758. [PMID: 28903538 DOI: 10.1093/bioinformatics/btx302] [Citation(s) in RCA: 107] [Impact Index Per Article: 17.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2017] [Accepted: 05/09/2017] [Indexed: 11/13/2022] Open

Liang Y, Zhang S. Predict protein structural class by incorporating two different modes of evolutionary information into Chou's general pseudo amino acid composition. J Mol Graph Model 2017;78:110-117. [DOI: 10.1016/j.jmgm.2017.10.003] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2017] [Revised: 10/03/2017] [Accepted: 10/03/2017] [Indexed: 11/27/2022]

Yu B, Lou L, Li S, Zhang Y, Qiu W, Wu X, Wang M, Tian B. Prediction of protein structural class for low-similarity sequences using Chou’s pseudo amino acid composition and wavelet denoising. J Mol Graph Model 2017;76:260-273. [DOI: 10.1016/j.jmgm.2017.07.012] [Citation(s) in RCA: 60] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2017] [Revised: 07/11/2017] [Accepted: 07/12/2017] [Indexed: 11/25/2022]

Yuan M, Yang Z, Huang G, Ji G. Feature selection by maximizing correlation information for integrated high-dimensional protein data. Pattern Recognit Lett 2017. [DOI: 10.1016/j.patrec.2017.03.011] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

Carneiro RF, Torres RCF, Chaves RP, de Vasconcelos MA, de Sousa BL, Goveia ACR, Arruda FV, Matos MNC, Matthews-Cascon H, Freire VN, Teixeira EH, Nagano CS, Sampaio AH. Purification, Biochemical Characterization, and Amino Acid Sequence of a Novel Type of Lectin from Aplysia dactylomela Eggs with Antibacterial/Antibiofilm Potential. MARINE BIOTECHNOLOGY (NEW YORK, N.Y.) 2017;19:49-64. [PMID: 28150103 DOI: 10.1007/s10126-017-9728-x] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2016] [Accepted: 01/08/2017] [Indexed: 06/06/2023]

Affiliation(s)

Rômulo Farias Carneiro Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal do Ceará, Campus do Pici s/n, bloco 871, Av. Mister Hull, Box 6043, Fortaleza, Ceará, 60440-970, Brazil
Renato Cézar Farias Torres Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal do Ceará, Campus do Pici s/n, bloco 871, Av. Mister Hull, Box 6043, Fortaleza, Ceará, 60440-970, Brazil
Renata Pinheiro Chaves Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal do Ceará, Campus do Pici s/n, bloco 871, Av. Mister Hull, Box 6043, Fortaleza, Ceará, 60440-970, Brazil
Mayron Alves de Vasconcelos Laboratório Integrado de Biomoléculas - LIBS, Departamento de Patologia e Medicina Legal, Universidade Federal do Ceará, Monsenhor Furtado, s/n, Fortaleza, Ceará, 60430-160, Brazil
Bruno Lopes de Sousa Departamento de Física, Universidade Federal do Ceará, Campus do Pici s/n, bloco 871, Fortaleza, Ceará, 60440-970, Brazil
André Castelo Rodrigues Goveia Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal do Ceará, Campus do Pici s/n, bloco 871, Av. Mister Hull, Box 6043, Fortaleza, Ceará, 60440-970, Brazil
Francisco Vassiliepe Arruda Laboratório Integrado de Biomoléculas - LIBS, Departamento de Patologia e Medicina Legal, Universidade Federal do Ceará, Monsenhor Furtado, s/n, Fortaleza, Ceará, 60430-160, Brazil
Maria Nágila Carneiro Matos Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal do Ceará, Campus do Pici s/n, bloco 871, Av. Mister Hull, Box 6043, Fortaleza, Ceará, 60440-970, Brazil
Helena Matthews-Cascon Laboratório de Invertebrados Marinhos do Ceará - LIMCE, Departamento de Biologia, Universidade Federal do Ceará, Campus do Pici s/n, bloco 906, Fortaleza, CE, 60455-760, Brazil
Valder Nogueira Freire Departamento de Física, Universidade Federal do Ceará, Campus do Pici s/n, bloco 871, Fortaleza, Ceará, 60440-970, Brazil
Edson Holanda Teixeira Laboratório Integrado de Biomoléculas - LIBS, Departamento de Patologia e Medicina Legal, Universidade Federal do Ceará, Monsenhor Furtado, s/n, Fortaleza, Ceará, 60430-160, Brazil
Celso Shiniti Nagano Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal do Ceará, Campus do Pici s/n, bloco 871, Av. Mister Hull, Box 6043, Fortaleza, Ceará, 60440-970, Brazil
Alexandre Holanda Sampaio Laboratório de Biotecnologia Marinha - BioMar-Lab, Departamento de Engenharia de Pesca, Universidade Federal do Ceará, Campus do Pici s/n, bloco 871, Av. Mister Hull, Box 6043, Fortaleza, Ceará, 60440-970, Brazil.

Collapse

Xu Y, Li L, Ding J, Wu LY, Mai G, Zhou F. Gly-PseAAC: Identifying protein lysine glycation through sequences. Gene 2017;602:1-7. [DOI: 10.1016/j.gene.2016.11.021] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2016] [Revised: 08/29/2016] [Accepted: 11/10/2016] [Indexed: 11/29/2022]

Fan GL, Liu YL, Wang H. Identification of thermophilic proteins by incorporating evolutionary and acid dissociation information into Chou's general pseudo amino acid composition. J Theor Biol 2016;407:138-142. [DOI: 10.1016/j.jtbi.2016.07.010] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2016] [Revised: 06/24/2016] [Accepted: 07/07/2016] [Indexed: 10/21/2022]

Kong L, Kong L, Jing R. Improving the Prediction of Protein Structural Class for Low-Similarity Sequences by Incorporating Evolutionaryand Structural Information. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS 2016. [DOI: 10.20965/jaciii.2016.p0402] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Prediction of sumoylation sites in proteins using linear discriminant analysis. Gene 2016;576:99-104. [DOI: 10.1016/j.gene.2015.09.072] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2015] [Revised: 08/24/2015] [Accepted: 09/28/2015] [Indexed: 01/05/2023]

Liu L, Cui J, Zhou J. A Novel Prediction Method of Protein Structural Classes Based on Protein Super-Secondary Structure. ACTA ACUST UNITED AC 2016. [DOI: 10.4236/jcc.2016.415005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]

Prediction of Protein Structural Classes for Low-Similarity Sequences Based on Consensus Sequence and Segmented PSSM. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE 2015;2015:370756. [PMID: 26788119 PMCID: PMC4693000 DOI: 10.1155/2015/370756] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/31/2015] [Revised: 11/19/2015] [Accepted: 12/01/2015] [Indexed: 11/17/2022]

Xu Y, Ding YX, Ding J, Wu LY, Deng NY. Phogly–PseAAC: Prediction of lysine phosphoglycerylation in proteins incorporating with position-specific propensity. J Theor Biol 2015;379:10-5. [DOI: 10.1016/j.jtbi.2015.04.016] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2015] [Revised: 03/17/2015] [Accepted: 04/11/2015] [Indexed: 01/04/2023]

Abbass J, Nebel JC. Customised fragments libraries for protein structure prediction based on structural class annotations. BMC Bioinformatics 2015;16:136. [PMID: 25925397 PMCID: PMC4419399 DOI: 10.1186/s12859-015-0576-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2014] [Accepted: 04/17/2015] [Indexed: 12/05/2022] Open

Abstract

Background

Since experimental techniques are time and cost consuming, in silico protein structure prediction is essential to produce conformations of protein targets. When homologous structures are not available, fragment-based protein structure prediction has become the approach of choice. However, it still has many issues including poor performance when targets’ lengths are above 100 residues, excessive running times and sub-optimal energy functions. Taking advantage of the reliable performance of structural class prediction software, we propose to address some of the limitations of fragment-based methods by integrating structural constraints in their fragment selection process.

Results

Using Rosetta, a state-of-the-art fragment-based protein structure prediction package, we evaluated our proposed pipeline on 70 former CASP targets containing up to 150 amino acids. Using either CATH or SCOP-based structural class annotations, enhancement of structure prediction performance is highly significant in terms of both GDT_TS (at least +2.6, p-values < 0.0005) and RMSD (−0.4, p-values < 0.005). Although CATH and SCOP classifications are different, they perform similarly. Moreover, proteins from all structural classes benefit from the proposed methodology. Further analysis also shows that methods relying on class-based fragments produce conformations which are more relevant to user and converge quicker towards the best model as estimated by GDT_TS (up to 10% in average). This substantiates our hypothesis that usage of structurally relevant templates conducts to not only reducing the size of the conformation space to be explored, but also focusing on a more relevant area.

Conclusions

Since our methodology produces models the quality of which is up to 7% higher in average than those generated by a standard fragment-based predictor, we believe it should be considered before conducting any fragment-based protein structure prediction. Despite such progress, ab initio prediction remains a challenging task, especially for proteins of average and large sizes. Apart from improving search strategies and energy functions, integration of additional constraints seems a promising route, especially if they can be accurately predicted from sequence alone.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0576-2) contains supplementary material, which is available to authorized users.

Collapse

Zhang L, Zhao X, Kong L. Predict protein structural class for low-similarity sequences by evolutionary difference information into the general form of Chou's pseudo amino acid composition. J Theor Biol 2014;355:105-10. [PMID: 24735902 DOI: 10.1016/j.jtbi.2014.04.008] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2013] [Revised: 02/26/2014] [Accepted: 04/04/2014] [Indexed: 10/25/2022]

Kong L, Zhang L. Novel structure-driven features for accurate prediction of protein structural class. Genomics 2014;103:292-7. [DOI: 10.1016/j.ygeno.2014.04.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2013] [Revised: 04/05/2014] [Accepted: 04/07/2014] [Indexed: 11/25/2022]

PSSP-RFE: accurate prediction of protein structural class by recursive feature extraction from PSI-BLAST profile, physical-chemical property and functional annotations. PLoS One 2014;9:e92863. [PMID: 24675610 PMCID: PMC3968047 DOI: 10.1371/journal.pone.0092863] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2013] [Accepted: 02/27/2014] [Indexed: 02/05/2023] Open