Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zhang Y. Interplay of I-TASSER and QUARK for template-based and ab initio protein structure prediction in CASP10. Proteins 2013;82 Suppl 2:175-87. [PMID: 23760925 DOI: 10.1002/prot.24341] [Citation(s) in RCA: 89] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2013] [Revised: 05/15/2013] [Accepted: 05/23/2013] [Indexed: 11/09/2022]

For:	Zhang Y. Interplay of I-TASSER and QUARK for template-based and ab initio protein structure prediction in CASP10. Proteins 2013;82 Suppl 2:175-87. [PMID: 23760925 DOI: 10.1002/prot.24341] [Citation(s) in RCA: 89] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2013] [Revised: 05/15/2013] [Accepted: 05/23/2013] [Indexed: 11/09/2022]

Number

Cited by Other Article(s)

Chen L, Li Q, Nasif KFA, Xie Y, Deng B, Niu S, Pouriyeh S, Dai Z, Chen J, Xie CY. AI-Driven Deep Learning Techniques in Protein Structure Prediction. Int J Mol Sci 2024;25:8426. [PMID: 39125995 PMCID: PMC11313475 DOI: 10.3390/ijms25158426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Revised: 07/29/2024] [Accepted: 07/29/2024] [Indexed: 08/12/2024] Open

Abstract

Protein structure prediction is important for understanding their function and behavior. This review study presents a comprehensive review of the computational models used in predicting protein structure. It covers the progression from established protein modeling to state-of-the-art artificial intelligence (AI) frameworks. The paper will start with a brief introduction to protein structures, protein modeling, and AI. The section on established protein modeling will discuss homology modeling, ab initio modeling, and threading. The next section is deep learning-based models. It introduces some state-of-the-art AI models, such as AlphaFold (AlphaFold, AlphaFold2, AlphaFold3), RoseTTAFold, ProteinBERT, etc. This section also discusses how AI techniques have been integrated into established frameworks like Swiss-Model, Rosetta, and I-TASSER. The model performance is compared using the rankings of CASP14 (Critical Assessment of Structure Prediction) and CASP15. CASP16 is ongoing, and its results are not included in this review. Continuous Automated Model EvaluatiOn (CAMEO) complements the biennial CASP experiment. Template modeling score (TM-score), global distance test total score (GDT_TS), and Local Distance Difference Test (lDDT) score are discussed too. This paper then acknowledges the ongoing difficulties in predicting protein structure and emphasizes the necessity of additional searches like dynamic protein behavior, conformational changes, and protein-protein interactions. In the application section, this paper introduces some applications in various fields like drug design, industry, education, and novel protein development. In summary, this paper provides a comprehensive overview of the latest advancements in established protein modeling and deep learning-based models for protein structure predictions. It emphasizes the significant advancements achieved by AI and identifies potential areas for further investigation.

Collapse

Williams ME. HIV-1 Vif protein sequence variations in South African people living with HIV and their influence on Vif-APOBEC3G interaction. Eur J Clin Microbiol Infect Dis 2024;43:325-338. [PMID: 38072879 PMCID: PMC10821834 DOI: 10.1007/s10096-023-04728-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Accepted: 11/28/2023] [Indexed: 01/28/2024]

Zheng W, Wuyun Q, Freddolino PL, Zhang Y. Integrating deep learning, threading alignments, and a multi-MSA strategy for high-quality protein monomer and complex structure prediction in CASP15. Proteins 2023;91:1684-1703. [PMID: 37650367 PMCID: PMC10840719 DOI: 10.1002/prot.26585] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 08/04/2023] [Accepted: 08/14/2023] [Indexed: 09/01/2023]

Abstract

We report the results of the "UM-TBM" and "Zheng" groups in CASP15 for protein monomer and complex structure prediction. These prediction sets were obtained using the D-I-TASSER and DMFold-Multimer algorithms, respectively. For monomer structure prediction, D-I-TASSER introduced four new features during CASP15: (i) a multiple sequence alignment (MSA) generation protocol that combines multi-source MSA searching and a structural modeling-based MSA ranker; (ii) attention-network based spatial restraints; (iii) a multi-domain module containing domain partition and arrangement for domain-level templates and spatial restraints; (iv) an optimized I-TASSER-based folding simulation system for full-length model creation guided by a combination of deep learning restraints, threading alignments, and knowledge-based potentials. For 47 free modeling targets in CASP15, the final models predicted by D-I-TASSER showed average TM-score 19% higher than the standard AlphaFold2 program. We thus showed that traditional Monte Carlo-based folding simulations, when appropriately coupled with deep learning algorithms, can generate models with improved accuracy over end-to-end deep learning methods alone. For protein complex structure prediction, DMFold-Multimer generated models by integrating a new MSA generation algorithm (DeepMSA2) with the end-to-end modeling module from AlphaFold2-Multimer. For the 38 complex targets, DMFold-Multimer generated models with an average TM-score of 0.83 and Interface Contact Score of 0.60, both significantly higher than those of competing complex prediction tools. Our analyses on complexes highlighted the critical role played by MSA generating, ranking, and pairing in protein complex structure prediction. We also discuss future room for improvement in the areas of viral protein modeling and complex model ranking.

Collapse

Li J, Kang G, Wang J, Yuan H, Wu Y, Meng S, Wang P, Zhang M, Wang Y, Feng Y, Huang H, de Marco A. Affinity maturation of antibody fragments: A review encompassing the development from random approaches to computational rational optimization. Int J Biol Macromol 2023;247:125733. [PMID: 37423452 DOI: 10.1016/j.ijbiomac.2023.125733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Revised: 07/04/2023] [Accepted: 07/06/2023] [Indexed: 07/11/2023]

Affiliation(s)

Jiaqi Li School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China
Guangbo Kang School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China
Jiewen Wang School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China
Haibin Yuan School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China
Yili Wu Zhejiang Provincial Clinical Research Center for Mental Disorders, School of Mental Health and the Affiliated Kangning Hospital, Institute of Aging, Key Laboratory of Alzheimer's Disease of Zhejiang Province, Wenzhou Medical University, Oujiang Laboratory, Wenzhou, Zhejiang 325035, China
Shuxian Meng School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China
Ping Wang New Technology R&D Department, Tianjin Modern Innovative TCM Technology Company Limited, Tianjin 300392, China
Miao Zhang School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; China Resources Biopharmaceutical Company Limited, Beijing 100029, China
Yuli Wang School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Tianjin Pharmaceutical Da Ren Tang Group Corporation Limited, Traditional Chinese Pharmacy Research Institute, Tianjin Key Laboratory of Quality Control in Chinese Medicine, Tianjin 300457, China; State Key Laboratory of Drug Delivery Technology and Pharmacokinetics, Tianjin Institute of Pharmaceutical Research, Tianjin 300193, China
Yuanhang Feng School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China
He Huang School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin 300072, China.
Ario de Marco Laboratory for Environmental and Life Sciences, University of Nova Gorica, Nova Gorica, Slovenia.

Collapse

I-TASSER-MTD: a deep-learning-based platform for multi-domain protein structure and function prediction. Nat Protoc 2022;17:2326-2353. [PMID: 35931779 DOI: 10.1038/s41596-022-00728-0] [Citation(s) in RCA: 135] [Impact Index Per Article: 67.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2022] [Accepted: 05/24/2022] [Indexed: 01/17/2023]

Lee J, Shamim A, Park J, Jang JH, Kim JH, Kwon JY, Kim JW, Kim KK, Lee J. Functional and Structural Changes in the Membrane-Bound O-Acyltransferase Family Member 7 (MBOAT7) Protein: The Pathomechanism of a Novel MBOAT7 Variant in Patients With Intellectual Disability. Front Neurol 2022;13:836954. [PMID: 35509994 PMCID: PMC9058081 DOI: 10.3389/fneur.2022.836954] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 03/11/2022] [Indexed: 12/05/2022] Open

Vishwakarma P, Vattekatte AM, Shinada N, Diharce J, Martins C, Cadet F, Gardebien F, Etchebest C, Nadaradjane AA, de Brevern AG. V_HH Structural Modelling Approaches: A Critical Review. Int J Mol Sci 2022;23:3721. [PMID: 35409081 PMCID: PMC8998791 DOI: 10.3390/ijms23073721] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 03/23/2022] [Accepted: 03/23/2022] [Indexed: 12/20/2022] Open

Affiliation(s)

Poonam Vishwakarma INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-75015 Paris, France; (P.V.); (A.M.V.); (J.D.); (C.M.); (C.E.); (A.A.N.) INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-97715 Saint Denis Messag, France; (F.C.); (F.G.)
Akhila Melarkode Vattekatte INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-75015 Paris, France; (P.V.); (A.M.V.); (J.D.); (C.M.); (C.E.); (A.A.N.) INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-97715 Saint Denis Messag, France; (F.C.); (F.G.)
Nicolas Shinada 3 SBX Corp., Tokyo-to, Shinagawa-ku, Tokyo 141-0022, Japan;
Julien Diharce INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-75015 Paris, France; (P.V.); (A.M.V.); (J.D.); (C.M.); (C.E.); (A.A.N.)
Carla Martins INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-75015 Paris, France; (P.V.); (A.M.V.); (J.D.); (C.M.); (C.E.); (A.A.N.) INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-97715 Saint Denis Messag, France; (F.C.); (F.G.)
Frédéric Cadet INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-97715 Saint Denis Messag, France; (F.C.); (F.G.) PEACCEL, Artificial Intelligence Department, Square Albin Cachot, F-75013 Paris, France
Fabrice Gardebien INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-97715 Saint Denis Messag, France; (F.C.); (F.G.)
Catherine Etchebest INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-75015 Paris, France; (P.V.); (A.M.V.); (J.D.); (C.M.); (C.E.); (A.A.N.)
Aravindan Arun Nadaradjane INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-75015 Paris, France; (P.V.); (A.M.V.); (J.D.); (C.M.); (C.E.); (A.A.N.) INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-97715 Saint Denis Messag, France; (F.C.); (F.G.)
Alexandre G. de Brevern INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-75015 Paris, France; (P.V.); (A.M.V.); (J.D.); (C.M.); (C.E.); (A.A.N.) INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-97715 Saint Denis Messag, France; (F.C.); (F.G.)

Collapse

Zheng W, Li Y, Zhang C, Zhou X, Pearce R, Bell EW, Huang X, Zhang Y. Protein structure prediction using deep learning distance and hydrogen-bonding restraints in CASP14. Proteins 2021;89:1734-1751. [PMID: 34331351 PMCID: PMC8616857 DOI: 10.1002/prot.26193] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2021] [Revised: 07/06/2021] [Accepted: 07/22/2021] [Indexed: 11/10/2022]

Ding Y, Tang J, Guo F. Protein Crystallization Identification via Fuzzy Model on Linear Neighborhood Representation. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021;18:1986-1995. [PMID: 31751248 DOI: 10.1109/tcbb.2019.2954826] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]

A physiologic rise in cytoplasmic calcium ion signal increases pannexin1 channel activity via a C-terminus phosphorylation by CaMKII. Proc Natl Acad Sci U S A 2021;118:2108967118. [PMID: 34301850 DOI: 10.1073/pnas.2108967118] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open

Abstract

Pannexin1 (Panx1) channels are ubiquitously expressed in vertebrate cells and are widely accepted as adenosine triphosphate (ATP)-releasing membrane channels. Activation of Panx1 has been associated with phosphorylation in a specific tyrosine residue or cleavage of its C-terminal domains. In the present work, we identified a residue (S394) as a putative phosphorylation site by Ca²⁺/calmodulin-dependent kinase II (CaMKII). In HeLa cells transfected with rat Panx1 (rPanx1), membrane stretch (MS)-induced activation-measured by changes in DAPI uptake rate-was drastically reduced by either knockdown of Piezo1 or pharmacological inhibition of calmodulin or CaMKII. By site-directed mutagenesis we generated rPanx1S394A-EGFP (enhanced green fluorescent protein), which lost its sensitivity to MS, and rPanx1S394D-EGFP, mimicking phosphorylation, which shows high DAPI uptake rate without MS stimulation or cleavage of the C terminus. Using whole-cell patch-clamp and outside-out excised patch configurations, we found that rPanx1-EGFP and rPanx1S394D-EGFP channels showed current at all voltages between ±100 mV, similar single channel currents with outward rectification, and unitary conductance (∼30 to 70 pS). However, using cell-attached configuration we found that rPanx1S394D-EGFP channels show increased spontaneous unitary events independent of MS stimulation. In silico studies revealed that phosphorylation of S394 caused conformational changes in the selectivity filter and increased the average volume of lateral tunnels, allowing ATP to be released via these conduits and DAPI uptake directly from the channel mouth to the cytoplasmic space. These results could explain one possible mechanism for activation of rPanx1 upon increase in cytoplasmic Ca²⁺ signal elicited by diverse physiological conditions in which the C-terminal domain is not cleaved.

Collapse

A Peptides Prediction Methodology for Tertiary Structure Based on Simulated Annealing. MATHEMATICAL AND COMPUTATIONAL APPLICATIONS 2021. [DOI: 10.3390/mca26020039] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Zhang GJ, Xie TY, Zhou XG, Wang LJ, Hu J. Protein Structure Prediction Using Population-Based Algorithm Guided by Information Entropy. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021;18:697-707. [PMID: 31180869 DOI: 10.1109/tcbb.2019.2921958] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Wang Y, Ding Y, Tang J, Dai Y, Guo F. CrystalM: A Multi-View Fusion Approach for Protein Crystallization Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021;18:325-335. [PMID: 31027046 DOI: 10.1109/tcbb.2019.2912173] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Abstract

Improving the accuracy of predicting protein crystallization is very important for protein crystallization projects, which is a critical step for the determination of protein structure by X-ray crystallography. At present, many machine learning methods are used to predict protein crystallization. Here, we use a novel feature combination to construct a SVM model in the prediction of protein crystallization, called as CrystalM. In this work, we extract six features to represent protein sequences, namely Average Block-Position specific scoring matrix (AVBlock-PSSM), Average Block-Secondary Structure (AVBlock-SS), Global Encoding (GE), Pseudo-Position specific scoring matrix (PsePSSM), Protscale, and Discrete Wavelet Transform-Position specific scoring matrix (DWT-PSSM). Moreover, we employ two training datasets (TRAIN3587 and TRAIN1500) and their corresponding independent test datasets (TEST3585 and TEST500) to evaluate CrystalM by feeding multi-view features into Support Vector Machine (SVM) classifier. Two training datasets are employed for five-fold cross validation, and two test datasets are separately used to test the corresponding datasets. Finally, we compare CrystalM with other existing methods in the performance. For the datasets of TRAIN3587 and TEST3585, CrystalM achieves best Accuracy (ACC), best Specificity (SP), and the same Mathew's correlation coefficient (MCC) as the previous outperforming methods in the five-fold cross validation. In particular, ACC, SP, and MCC have surpassed the existing methods in independent test, which proves the effectiveness of CrystalM. Meanwhile, ACC, SP, and MCC are higher than existing methods in the five-fold cross validation for TRAIN1500. Although the performance of independent test for TEST500 is not the best, CrystalM also has a certain predictability in the prediction of protein crystallization. In addition, we find that only choosing the first four features can improve the performance of prediction for TRAIN1500 and TEST500, not only in independent tests but also in five-fold cross validation. This phenomenon indicates that the latter two features can not effectively represent proteins of TRAIN1500 and TEST500. CrystalM is a sequence-based protein crystallization prediction method. The good performance on the datasets proves the effectiveness of CrystalM and the better performance on large datasets further demonstrates the stability and superiority of CrystalM.

Collapse

Abbass J, Nebel JC. Rosetta and the Journey to Predict Proteins’ Structures, 20 Years on. Curr Bioinform 2020. [DOI: 10.2174/1574893615999200504103643] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Zhang GJ, Wang XQ, Ma LF, Wang LJ, Hu J, Zhou XG. Two-Stage Distance Feature-based Optimization Algorithm for De novo Protein Structure Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020;17:2119-2130. [PMID: 31107659 DOI: 10.1109/tcbb.2019.2917452] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Dhingra S, Sowdhamini R, Cadet F, Offmann B. A glance into the evolution of template-free protein structure prediction methodologies. Biochimie 2020;175:85-92. [DOI: 10.1016/j.biochi.2020.04.026] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 04/24/2020] [Accepted: 04/27/2020] [Indexed: 11/26/2022]

Noncanonical type 2B von Willebrand disease associated with mutations in the VWF D'D3 and D4 domains. Blood Adv 2020;4:3405-3415. [PMID: 32722784 DOI: 10.1182/bloodadvances.2020002334] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Accepted: 06/22/2020] [Indexed: 11/20/2022] Open

Abbass J, Nebel JC. Enhancing fragment-based protein structure prediction by customising fragment cardinality according to local secondary structure. BMC Bioinformatics 2020;21:170. [PMID: 32357827 PMCID: PMC7195757 DOI: 10.1186/s12859-020-3491-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2019] [Accepted: 04/13/2020] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

Whenever suitable template structures are not available, usage of fragment-based protein structure prediction becomes the only practical alternative as pure ab initio techniques require massive computational resources even for very small proteins. However, inaccuracy of their energy functions and their stochastic nature imposes generation of a large number of decoys to explore adequately the solution space, limiting their usage to small proteins. Taking advantage of the uneven complexity of the sequence-structure relationship of short fragments, we adjusted the fragment insertion process by customising the number of available fragment templates according to the expected complexity of the predicted local secondary structure. Whereas the number of fragments is kept to its default value for coil regions, important and dramatic reductions are proposed for beta sheet and alpha helical regions, respectively.

RESULTS

The evaluation of our fragment selection approach was conducted using an enhanced version of the popular Rosetta fragment-based protein structure prediction tool. It was modified so that the number of fragment candidates used in Rosetta could be adjusted based on the local secondary structure. Compared to Rosetta's standard predictions, our strategy delivered improved first models, + 24% and + 6% in terms of GDT, when using 2000 and 20,000 decoys, respectively, while reducing significantly the number of fragment candidates. Furthermore, our enhanced version of Rosetta is able to deliver with 2000 decoys a performance equivalent to that produced by standard Rosetta while using 20,000 decoys. We hypothesise that, as the fragment insertion process focuses on the most challenging regions, such as coils, fewer decoys are needed to explore satisfactorily conformation spaces.

CONCLUSIONS

Taking advantage of the high accuracy of sequence-based secondary structure predictions, we showed the value of that information to customise the number of candidates used during the fragment insertion process of fragment-based protein structure prediction. Experimentations conducted using standard Rosetta showed that, when using the recommended number of decoys, i.e. 20,000, our strategy produces better results. Alternatively, similar results can be achieved using only 2000 decoys. Consequently, we recommend the adoption of this strategy to either improve significantly model quality or reduce processing times by a factor 10.

Collapse

Discriminative margin-sensitive autoencoder for collective multi-view disease analysis. Neural Netw 2020;123:94-107. [DOI: 10.1016/j.neunet.2019.11.013] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2019] [Revised: 08/18/2019] [Accepted: 11/13/2019] [Indexed: 12/18/2022]

Zheng W, Li Y, Zhang C, Pearce R, Mortuza SM, Zhang Y. Deep-learning contact-map guided protein structure prediction in CASP13. Proteins 2019;87:1149-1164. [PMID: 31365149 PMCID: PMC6851476 DOI: 10.1002/prot.25792] [Citation(s) in RCA: 131] [Impact Index Per Article: 26.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Revised: 07/14/2019] [Accepted: 07/27/2019] [Indexed: 12/28/2022]

Abstract

We report the results of two fully automated structure prediction pipelines, "Zhang-Server" and "QUARK", in CASP13. The pipelines were built upon the C-I-TASSER and C-QUARK programs, which in turn are based on I-TASSER and QUARK but with three new modules: (a) a novel multiple sequence alignment (MSA) generation protocol to construct deep sequence-profiles for contact prediction; (b) an improved meta-method, NeBcon, which combines multiple contact predictors, including ResPRE that predicts contact-maps by coupling precision-matrices with deep residual convolutional neural-networks; and (c) an optimized contact potential to guide structure assembly simulations. For 50 CASP13 FM domains that lacked homologous templates, average TM-scores of the first models produced by C-I-TASSER and C-QUARK were 28% and 56% higher than those constructed by I-TASSER and QUARK, respectively. For the first time, contact-map predictions demonstrated usefulness on TBM domains with close homologous templates, where TM-scores of C-I-TASSER models were significantly higher than those of I-TASSER models with a P-value <.05. Detailed data analyses showed that the success of C-I-TASSER and C-QUARK was mainly due to the increased accuracy of deep-learning-based contact-maps, as well as the careful balance between sequence-based contact restraints, threading templates, and generic knowledge-based potentials. Nevertheless, challenges still remain for predicting quaternary structure of multi-domain proteins, due to the difficulties in domain partitioning and domain reassembly. In addition, contact prediction in terminal regions was often unsatisfactory due to the sparsity of MSAs. Development of new contact-based domain partitioning and assembly methods and training contact models on sparse MSAs may help address these issues.

Collapse

Wang Y, Shi Q, Yang P, Zhang C, Mortuza SM, Xue Z, Ning K, Zhang Y. Fueling ab initio folding with marine metagenomics enables structure and function predictions of new protein families. Genome Biol 2019;20:229. [PMID: 31676016 PMCID: PMC6825341 DOI: 10.1186/s13059-019-1823-z] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Accepted: 09/13/2019] [Indexed: 02/01/2023] Open

Wang Y, Virtanen J, Xue Z, Zhang Y. I-TASSER-MR: automated molecular replacement for distant-homology proteins using iterative fragment assembly and progressive sequence truncation. Nucleic Acids Res 2019;45:W429-W434. [PMID: 28472524 PMCID: PMC5793832 DOI: 10.1093/nar/gkx349] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2017] [Accepted: 04/20/2017] [Indexed: 11/16/2022] Open

Wang Y, Wang J, Li R, Shi Q, Xue Z, Zhang Y. ThreaDomEx: a unified platform for predicting continuous and discontinuous protein domains by multiple-threading and segment assembly. Nucleic Acids Res 2019;45:W400-W407. [PMID: 28498994 PMCID: PMC5793814 DOI: 10.1093/nar/gkx410] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2017] [Accepted: 04/28/2017] [Indexed: 12/21/2022] Open

Tang L, Yang J, Chen J, Zhang J, Yu H, Shen Z. Design of salt-bridge cyclization peptide tags for stability and activity enhancement of enzymes. Process Biochem 2019. [DOI: 10.1016/j.procbio.2019.03.002] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Xu G, Ma T, Wang Q, Ma J. OPUS-SSF: A side-chain-inclusive scoring function for ranking protein structural models. Protein Sci 2019;28:1157-1162. [PMID: 30919509 DOI: 10.1002/pro.3608] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2018] [Revised: 03/21/2019] [Accepted: 03/27/2019] [Indexed: 12/21/2022]

Blaszczyk M, Gront D, Kmiecik S, Kurcinski M, Kolinski M, Ciemny MP, Ziolkowska K, Panek M, Kolinski A. Protein Structure Prediction Using Coarse-Grained Models. SPRINGER SERIES ON BIO- AND NEUROSYSTEMS 2019. [DOI: 10.1007/978-3-319-95843-9_2] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Kc DB. Recent advances in sequence-based protein structure prediction. Brief Bioinform 2018;18:1021-1032. [PMID: 27562963 DOI: 10.1093/bib/bbw070] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2016] [Indexed: 11/13/2022] Open

Avishek K, Ahuja K, Pradhan D, Gannavaram S, Selvapandiyan A, Nakhasi HL, Salotra P. A Leishmania-specific gene upregulated at the amastigote stage is crucial for parasite survival. Parasitol Res 2018;117:3215-3228. [DOI: 10.1007/s00436-018-6020-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2017] [Accepted: 07/17/2018] [Indexed: 01/03/2023]

Guzenko D, Strelkov SV. Granular clustering of de novo protein models. Bioinformatics 2018;33:390-396. [PMID: 28171609 DOI: 10.1093/bioinformatics/btw628] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2016] [Revised: 09/19/2016] [Accepted: 09/27/2016] [Indexed: 11/12/2022] Open

Keasar C, McGuffin LJ, Wallner B, Chopra G, Adhikari B, Bhattacharya D, Blake L, Bortot LO, Cao R, Dhanasekaran BK, Dimas I, Faccioli RA, Faraggi E, Ganzynkowicz R, Ghosh S, Ghosh S, Giełdoń A, Golon L, He Y, Heo L, Hou J, Khan M, Khatib F, Khoury GA, Kieslich C, Kim DE, Krupa P, Lee GR, Li H, Li J, Lipska A, Liwo A, Maghrabi AHA, Mirdita M, Mirzaei S, Mozolewska MA, Onel M, Ovchinnikov S, Shah A, Shah U, Sidi T, Sieradzan AK, Ślusarz M, Ślusarz R, Smadbeck J, Tamamis P, Trieber N, Wirecki T, Yin Y, Zhang Y, Bacardit J, Baranowski M, Chapman N, Cooper S, Defelicibus A, Flatten J, Koepnick B, Popović Z, Zaborowski B, Baker D, Cheng J, Czaplewski C, Delbem ACB, Floudas C, Kloczkowski A, Ołdziej S, Levitt M, Scheraga H, Seok C, Söding J, Vishveshwara S, Xu D, Crivelli SN. An analysis and evaluation of the WeFold collaborative for protein structure prediction and its pipelines in CASP11 and CASP12. Sci Rep 2018;8:9939. [PMID: 29967418 PMCID: PMC6028396 DOI: 10.1038/s41598-018-26812-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Accepted: 05/17/2018] [Indexed: 01/14/2023] Open

Affiliation(s)

Chen Keasar Department of Computer Science, Ben Gurion University of the Negev, Be'er sheva, Israel
Liam J McGuffin Biomedical Sciences Division, School of Biological Sciences, University of Reading, Reading, RG6 6AS, UK
Björn Wallner Division of Bioinformatics, Department of Physics, Chemistry, and Biology, Linköping University, Linköping, Sweden
Gaurav Chopra Department of Chemistry, College of Science, Purdue University, West Lafayette, IN, USA Purdue Institute for Drug Discovery, Purdue University, West Lafayette, IN, USA Purdue Center for Cancer Research, Purdue University, West Lafayette, IN, USA Purdue Institute for Inflammation, Immunology and Infectious Disease, Purdue University, West Lafayette, IN, USA Purdue Institute for Integrative Neuroscience, Purdue University, West Lafayette, IN, USA
Badri Adhikari Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
Debswapna Bhattacharya Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA Department of Computer Science and Software Engineering, Auburn University, Auburn, AL, USA
Lauren Blake Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Leandro Oliveira Bortot Laboratory of Biological Physics, Faculty of Pharmaceutical Sciences at Ribeirão Preto, University of São Paulo, São Paulo, Brazil
Renzhi Cao Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
B K Dhanasekaran Molecular Biophysics Unit and IISC Mathematics Initiative, Indian Institute of Science, Bangalore, India
Itzhel Dimas Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Rodrigo Antonio Faccioli Institute of Mathematical and Computer Sciences, University of São Paulo, São Paulo, Brazil
Eshel Faraggi Research and Information Systems, LLC, Carmel, IN, USA Department of Biochemistry and Molecular Biology, IU School of Medicine, Indianapolis, IN, USA Batelle Center for Mathematical Medicine, The Research Institute at Nationwide Children's Hospital, Columbus, OH, USA
Robert Ganzynkowicz Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
Sambit Ghosh Molecular Biophysics Unit and IISC Mathematics Initiative, Indian Institute of Science, Bangalore, India
Soma Ghosh Molecular Biophysics Unit and IISC Mathematics Initiative, Indian Institute of Science, Bangalore, India
Artur Giełdoń Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
Lukasz Golon Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
Yi He School of Engineering, University of California, Merced, CA, USA
Lim Heo Department of Chemistry, Seoul National University, Seoul, Republic of Korea
Jie Hou Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
Main Khan Department of Computer and Information Science, University of Massachusetts Dartmouth, MA, USA
Firas Khatib Department of Computer and Information Science, University of Massachusetts Dartmouth, MA, USA
George A Khoury Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ, USA
Chris Kieslich Texas A&M Energy Institute, Texas A&M University, College Station, TX, USA
David E Kim Department of Biochemistry, University of Washington, Seattle, WA, USA Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
Pawel Krupa Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
Gyu Rie Lee Department of Chemistry, Seoul National University, Seoul, Republic of Korea
Hongbo Li Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA School of Computer Science and Information Technology, NorthEast Normal University, Changchun, China Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
Jilong Li Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
Agnieszka Lipska Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
Adam Liwo Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
Ali Hassan A Maghrabi Biomedical Sciences Division, School of Biological Sciences, University of Reading, Reading, RG6 6AS, UK
Milot Mirdita Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
Shokoufeh Mirzaei Lawrence Berkeley National Laboratory, Berkeley, CA, USA California State Polytechnic University, Pomona, CA, USA
Magdalena A Mozolewska Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
Melis Onel Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, TX, USA
Sergey Ovchinnikov Department of Biochemistry, University of Washington, Seattle, WA, USA Institute for Protein Design, University of Washington, Seattle, WA, USA
Anand Shah Department of Computer and Information Science, University of Massachusetts Dartmouth, MA, USA
Utkarsh Shah Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, TX, USA
Tomer Sidi Department of Computer Science, Ben Gurion University of the Negev, Be'er sheva, Israel
Adam K Sieradzan Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
Magdalena Ślusarz Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
Rafal Ślusarz Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
James Smadbeck Department of Chemical and Biological Engineering, Princeton University, Princeton, NJ, USA
Phanourios Tamamis Texas A&M Energy Institute, Texas A&M University, College Station, TX, USA Artie McFerrin Department of Chemical Engineering, Texas A&M University, College Station, TX, USA
Nicholas Trieber Department of Computer and Information Science, University of Massachusetts Dartmouth, MA, USA
Tomasz Wirecki Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
Yanping Yin Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY, USA
Yang Zhang Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
Jaume Bacardit Interdisciplinary Computing and Complex BioSystems (ICOS) research group, School of Computing, Newcastle University, Newcastle-upon-Tyne, UK
Maciej Baranowski Intercollegiate Faculty of Biotechnology, University of Gdańsk and Medical University of Gdańsk, Gdańsk, Poland
Nicholas Chapman Center for Game Science, Department of Computer Science & Engineering, University of Washington, Seattle, WA, USA
Seth Cooper College of Computer and Information Science, Northeastern University, Boston, MA, USA
Alexandre Defelicibus Institute of Mathematical and Computer Sciences, University of São Paulo, São Paulo, Brazil
Jeff Flatten Center for Game Science, Department of Computer Science & Engineering, University of Washington, Seattle, WA, USA
Brian Koepnick Department of Biochemistry, University of Washington, Seattle, WA, USA
Zoran Popović Center for Game Science, Department of Computer Science & Engineering, University of Washington, Seattle, WA, USA
Bartlomiej Zaborowski Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
David Baker Department of Biochemistry, University of Washington, Seattle, WA, USA Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA Center for Game Science, Department of Computer Science & Engineering, University of Washington, Seattle, WA, USA
Jianlin Cheng Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
Cezary Czaplewski Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
Alexandre Cláudio Botazzo Delbem Institute of Mathematical and Computer Sciences, University of São Paulo, São Paulo, Brazil
Christodoulos Floudas Texas A&M Energy Institute, Texas A&M University, College Station, TX, USA
Andrzej Kloczkowski Faculty of Chemistry, University of Gdansk, Gdańsk, Poland
Stanislaw Ołdziej Intercollegiate Faculty of Biotechnology, University of Gdańsk and Medical University of Gdańsk, Gdańsk, Poland
Michael Levitt Department of Structural Biology, School of Medicine, Stanford University, Stanford, CA, USA
Harold Scheraga Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY, USA
Chaok Seok Department of Chemistry, Seoul National University, Seoul, Republic of Korea
Johannes Söding Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
Saraswathi Vishveshwara Molecular Biophysics Unit and IISC Mathematics Initiative, Indian Institute of Science, Bangalore, India
Dong Xu Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
Silvia N Crivelli Lawrence Berkeley National Laboratory, Berkeley, CA, USA. Department of Computer Science, University of California, Davis, CA, USA.

Collapse

Xie M, Muchero W, Bryan AC, Yee K, Guo HB, Zhang J, Tschaplinski TJ, Singan VR, Lindquist E, Payyavula RS, Barros-Rios J, Dixon R, Engle N, Sykes RW, Davis M, Jawdy SS, Gunter LE, Thompson O, DiFazio SP, Evans LM, Winkeler K, Collins C, Schmutz J, Guo H, Kalluri U, Rodriguez M, Feng K, Chen JG, Tuskan GA. A 5-Enolpyruvylshikimate 3-Phosphate Synthase Functions as a Transcriptional Repressor in Populus. THE PLANT CELL 2018;30:1645-1660. [PMID: 29891568 PMCID: PMC6096593 DOI: 10.1105/tpc.18.00168] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/01/2018] [Revised: 04/17/2018] [Accepted: 06/05/2018] [Indexed: 05/21/2023]

Affiliation(s)

Meng Xie BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
Wellington Muchero BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
Anthony C Bryan BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
Kelsey Yee BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
Hao-Bo Guo Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, Tennessee 37996
Jin Zhang BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
Timothy J Tschaplinski BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
Vasanth R Singan U.S. Department of Energy Joint Genome Institute, Walnut Creek, California 94598
Erika Lindquist U.S. Department of Energy Joint Genome Institute, Walnut Creek, California 94598
Raja S Payyavula BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
Jaime Barros-Rios BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, Texas 76203
Richard Dixon BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, Texas 76203
Nancy Engle BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
Robert W Sykes Bioscience Center, National Renewable Energy Laboratory, Golden, Colorado 80401
Mark Davis Bioscience Center, National Renewable Energy Laboratory, Golden, Colorado 80401
Sara S Jawdy BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
Lee E Gunter BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
Olivia Thompson BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
Stephen P DiFazio Department of Biology, West Virginia University, Morgantown, West Virginia 26506
Luke M Evans Department of Biology, West Virginia University, Morgantown, West Virginia 26506
Kim Winkeler ArborGen, Ridgeville, South Carolina 29472
Cassandra Collins ArborGen, Ridgeville, South Carolina 29472
Jeremy Schmutz U.S. Department of Energy Joint Genome Institute, Walnut Creek, California 94598 HudsonAlpha Institute for Biotechnology, Huntsville, Alabama 35806
Hong Guo Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, Tennessee 37996
Udaya Kalluri BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
Miguel Rodriguez BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
Kai Feng BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
Jin-Gui Chen BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
Gerald A Tuskan BioEnergy Science Center and Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831 U.S. Department of Energy Joint Genome Institute, Walnut Creek, California 94598

Collapse

Gao S, Song S, Cheng J, Todo Y, Zhou M. Incorporation of Solvent Effect into Multi-Objective Evolutionary Algorithm for Improved Protein Structure Prediction. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018;15:1365-1378. [PMID: 28534784 DOI: 10.1109/tcbb.2017.2705094] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]

Kozic M, Fox SJ, Thomas JM, Verma CS, Rigden DJ. Large scale ab initio modeling of structurally uncharacterized antimicrobial peptides reveals known and novel folds. Proteins 2018;86:548-565. [DOI: 10.1002/prot.25473] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2017] [Revised: 01/16/2018] [Accepted: 01/29/2018] [Indexed: 12/20/2022]

Usability as the Key Factor to the Design of a Web Server for the CReF Protein Structure Predictor: The wCReF. INFORMATION 2018. [DOI: 10.3390/info9010020] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Yerabham ASK, Müller-Schiffmann A, Ziehm T, Stadler A, Köber S, Indurkhya X, Marreiros R, Trossbach SV, Bradshaw NJ, Prikulis I, Willbold D, Weiergräber OH, Korth C. Biophysical insights from a single chain camelid antibody directed against the Disrupted-in-Schizophrenia 1 protein. PLoS One 2018;13:e0191162. [PMID: 29324815 PMCID: PMC5764400 DOI: 10.1371/journal.pone.0191162] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2017] [Accepted: 12/31/2017] [Indexed: 01/17/2023] Open

Barradas-Bautista D, Rosell M, Pallara C, Fernández-Recio J. Structural Prediction of Protein–Protein Interactions by Docking: Application to Biomedical Problems. PROTEIN-PROTEIN INTERACTIONS IN HUMAN DISEASE, PART A 2018;110:203-249. [DOI: 10.1016/bs.apcsb.2017.06.003] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Zhang C, Mortuza SM, He B, Wang Y, Zhang Y. Template-based and free modeling of I-TASSER and QUARK pipelines using predicted contact maps in CASP12. Proteins 2017;86 Suppl 1:136-151. [PMID: 29082551 DOI: 10.1002/prot.25414] [Citation(s) in RCA: 64] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2017] [Revised: 10/09/2017] [Accepted: 10/27/2017] [Indexed: 12/26/2022]

Zhang GJ, Zhou XG, Yu XF, Hao XH, Yu L. Enhancing Protein Conformational Space Sampling Using Distance Profile-Guided Differential Evolution. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2017;14:1288-1301. [PMID: 28113726 DOI: 10.1109/tcbb.2016.2566617] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]

Kato K, Nakayoshi T, Fukuyoshi S, Kurimoto E, Oda A. Validation of Molecular Dynamics Simulations for Prediction of Three-Dimensional Structures of Small Proteins. Molecules 2017;22:molecules22101716. [PMID: 29023395 PMCID: PMC6151455 DOI: 10.3390/molecules22101716] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2017] [Revised: 10/05/2017] [Accepted: 10/10/2017] [Indexed: 12/14/2022] Open

Hao XH, Zhang GJ, Zhou XG. Conformational Space Sampling Method Using Multi-Subpopulation Differential Evolution for De novo Protein Structure Prediction. IEEE Trans Nanobioscience 2017;16:618-633. [DOI: 10.1109/tnb.2017.2749243] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Jalily Hasani H, Ahmed M, Barakat K. A comprehensive structural model for the human KCNQ1/KCNE1 ion channel. J Mol Graph Model 2017;78:26-47. [PMID: 28992529 DOI: 10.1016/j.jmgm.2017.09.019] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2017] [Revised: 09/25/2017] [Accepted: 09/26/2017] [Indexed: 10/18/2022]

Mackenzie CO, Grigoryan G. Protein structural motifs in prediction and design. Curr Opin Struct Biol 2017;44:161-167. [PMID: 28460216 PMCID: PMC5513761 DOI: 10.1016/j.sbi.2017.03.012] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2016] [Revised: 03/18/2017] [Accepted: 03/28/2017] [Indexed: 01/11/2023]

Annotation of Alternatively Spliced Proteins and Transcripts with Protein-Folding Algorithms and Isoform-Level Functional Networks. Methods Mol Biol 2017;1558:415-436. [PMID: 28150250 DOI: 10.1007/978-1-4939-6783-4_20] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Liu D, Liu J, Wang W, Xia L, Yang J, Sun S, Zhang F. Computational and Experimental Investigation of the Antimicrobial Peptide Cecropin XJ and its Ligands as the Impact Factors of Antibacterial Activity. FOOD BIOPHYS 2016. [DOI: 10.1007/s11483-016-9445-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Zhang S, Cui FC, Cao Y, Li YQ. Sequence identification, structure prediction and validation of tannase from Aspergillusniger N5-5. CHINESE CHEM LETT 2016. [DOI: 10.1016/j.cclet.2016.04.013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Kmiecik S, Gront D, Kolinski M, Wieteska L, Dawid AE, Kolinski A. Coarse-Grained Protein Models and Their Applications. Chem Rev 2016;116:7898-936. [DOI: 10.1021/acs.chemrev.6b00163] [Citation(s) in RCA: 555] [Impact Index Per Article: 69.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Modi V, Dunbrack RL. Assessment of refinement of template-based models in CASP11. Proteins 2016;84 Suppl 1:260-81. [PMID: 27081793 DOI: 10.1002/prot.25048] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2015] [Revised: 03/13/2016] [Accepted: 04/11/2016] [Indexed: 12/26/2022]

Hu J, Han K, Li Y, Yang JY, Shen HB, Yu DJ. TargetCrys: protein crystallization prediction by fusing multi-view features with two-layered SVM. Amino Acids 2016;48:2533-2547. [DOI: 10.1007/s00726-016-2274-4] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2015] [Accepted: 06/07/2016] [Indexed: 12/12/2022]

Bhattacharya D, Cao R, Cheng J. UniCon3D: de novo protein structure prediction using united-residue conformational search via stepwise, probabilistic sampling. Bioinformatics 2016;32:2791-9. [PMID: 27259540 PMCID: PMC5018369 DOI: 10.1093/bioinformatics/btw316] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2016] [Accepted: 05/15/2016] [Indexed: 12/20/2022] Open

Abstract

MOTIVATION

Recent experimental studies have suggested that proteins fold via stepwise assembly of structural units named 'foldons' through the process of sequential stabilization. Alongside, latest developments on computational side based on probabilistic modeling have shown promising direction to perform de novo protein conformational sampling from continuous space. However, existing computational approaches for de novo protein structure prediction often randomly sample protein conformational space as opposed to experimentally suggested stepwise sampling.

RESULTS

Here, we develop a novel generative, probabilistic model that simultaneously captures local structural preferences of backbone and side chain conformational space of polypeptide chains in a united-residue representation and performs experimentally motivated conditional conformational sampling via stepwise synthesis and assembly of foldon units that minimizes a composite physics and knowledge-based energy function for de novo protein structure prediction. The proposed method, UniCon3D, has been found to (i) sample lower energy conformations with higher accuracy than traditional random sampling in a small benchmark of 6 proteins; (ii) perform comparably with the top five automated methods on 30 difficult target domains from the 11th Critical Assessment of Protein Structure Prediction (CASP) experiment and on 15 difficult target domains from the 10th CASP experiment; and (iii) outperform two state-of-the-art approaches and a baseline counterpart of UniCon3D that performs traditional random sampling for protein modeling aided by predicted residue-residue contacts on 45 targets from the 10th edition of CASP.

AVAILABILITY AND IMPLEMENTATION

Source code, executable versions, manuals and example data of UniCon3D for Linux and OSX are freely available to non-commercial users at http://sysbio.rnet.missouri.edu/UniCon3D/ CONTACT: chengji@missouri.edu

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

Collapse

Figueroa M, Sleutel M, Vandevenne M, Parvizi G, Attout S, Jacquin O, Vandenameele J, Fischer AW, Damblon C, Goormaghtigh E, Valerio-Lepiniec M, Urvoas A, Durand D, Pardon E, Steyaert J, Minard P, Maes D, Meiler J, Matagne A, Martial JA, Van de Weerdt C. The unexpected structure of the designed protein Octarellin V.1 forms a challenge for protein structure prediction tools. J Struct Biol 2016;195:19-30. [PMID: 27181418 DOI: 10.1016/j.jsb.2016.05.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2016] [Revised: 04/19/2016] [Accepted: 05/12/2016] [Indexed: 12/26/2022]

Affiliation(s)

Maximiliano Figueroa GIGA-Research, Molecular Biomimetics and Protein Engineering, University of Liège, Liège, Belgium.
Mike Sleutel Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
Marylene Vandevenne GIGA-Research, Molecular Biomimetics and Protein Engineering, University of Liège, Liège, Belgium
Gregory Parvizi GIGA-Research, Molecular Biomimetics and Protein Engineering, University of Liège, Liège, Belgium
Sophie Attout GIGA-Research, Molecular Biomimetics and Protein Engineering, University of Liège, Liège, Belgium
Olivier Jacquin GIGA-Research, Molecular Biomimetics and Protein Engineering, University of Liège, Liège, Belgium
Julie Vandenameele Laboratoire d'Enzymologie et Repliement des Protéines, Centre for Protein Engineering, University of Liège, Liège, Belgium
Axel W Fischer Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, TN, United States
Christian Damblon Department of Chemistry, Univeristy of Liège, Belgium
Erik Goormaghtigh Laboratory for the Structure and Function of Biological Membranes, Center for Structural Biology and Bioinformatics, Université Libre de Bruxelles, Brussels, Belgium
Marie Valerio-Lepiniec Institute for Integrative Biology of the Cell (I2BC), UMT 9198, CEA, CNRS, Université Paris-Sud, Orsay, France
Agathe Urvoas Institute for Integrative Biology of the Cell (I2BC), UMT 9198, CEA, CNRS, Université Paris-Sud, Orsay, France
Dominique Durand Institute for Integrative Biology of the Cell (I2BC), UMT 9198, CEA, CNRS, Université Paris-Sud, Orsay, France
Els Pardon Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium; Structural Biology Research Center, VIB, Pleinlaan 2, 1050 Brussels, Belgium
Jan Steyaert Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium; Structural Biology Research Center, VIB, Pleinlaan 2, 1050 Brussels, Belgium
Philippe Minard Institute for Integrative Biology of the Cell (I2BC), UMT 9198, CEA, CNRS, Université Paris-Sud, Orsay, France
Dominique Maes Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium
Jens Meiler Department of Chemistry, Center for Structural Biology, Vanderbilt University, Nashville, TN, United States
André Matagne Laboratoire d'Enzymologie et Repliement des Protéines, Centre for Protein Engineering, University of Liège, Liège, Belgium
Joseph A Martial GIGA-Research, Molecular Biomimetics and Protein Engineering, University of Liège, Liège, Belgium
Cécile Van de Weerdt GIGA-Research, Molecular Biomimetics and Protein Engineering, University of Liège, Liège, Belgium.

Collapse