1
|
Rubach P, Sikora M, Jarmolinska A, Perlinska A, Sulkowska J. AlphaKnot 2.0: a web server for the visualization of proteins' knotting and a database of knotted AlphaFold-predicted models. Nucleic Acids Res 2024; 52:W187-W193. [PMID: 38842945 PMCID: PMC11223836 DOI: 10.1093/nar/gkae443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2024] [Revised: 04/29/2024] [Accepted: 05/10/2024] [Indexed: 07/06/2024] Open
Abstract
The availability of 3D protein models is rapidly increasing with the development of structure prediction algorithms. With the expanding availability of data, new ways of analysis, especially topological analysis, of those predictions are becoming necessary. Here, we present the updated version of the AlphaKnot service that provides a straightforward way of analyzing structure topology. It was designed specifically to determine knot types of the predicted structure models, however, it can be used for all structures, including the ones solved experimentally. AlphaKnot 2.0 provides the user's ability to obtain the knowledge necessary to assess the topological correctness of the model. Both probabilistic and deterministic knot detection methods are available, together with various visualizations (including a trajectory of simplification steps to highlight the topological complexities). Moreover, the web server provides a list of proteins similar to the queried model within AlphaKnot's database and returns their knot types for direct comparison. We pre-calculated the topology of high-quality models from the AlphaFold Database (4th version) and there are now more than 680.000 knotted models available in the AlphaKnot database. AlphaKnot 2.0 is available at https://alphaknot.cent.uw.edu.pl/.
Collapse
Affiliation(s)
- Pawel Rubach
- Warsaw School of Economics, Al. Niepodleglosci 162, 02-554 Warsaw, Poland
| | - Maciej Sikora
- Centre of New Technologies, University of Warsaw, Banacha 2c, 02-097 Warsaw, Poland
| | | | - Agata P Perlinska
- Centre of New Technologies, University of Warsaw, Banacha 2c, 02-097 Warsaw, Poland
| | - Joanna I Sulkowska
- Centre of New Technologies, University of Warsaw, Banacha 2c, 02-097 Warsaw, Poland
| |
Collapse
|
2
|
Sikora M, Klimentova E, Uchal D, Sramkova D, Perlinska AP, Nguyen ML, Korpacz M, Malinowska R, Nowakowski S, Rubach P, Simecek P, Sulkowska JI. Knot or not? Identifying unknotted proteins in knotted families with sequence-based Machine Learning model. Protein Sci 2024; 33:e4998. [PMID: 38888487 PMCID: PMC11184937 DOI: 10.1002/pro.4998] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Revised: 03/14/2024] [Accepted: 04/09/2024] [Indexed: 06/20/2024]
Abstract
Knotted proteins, although scarce, are crucial structural components of certain protein families, and their roles continue to be a topic of intense research. Capitalizing on the vast collection of protein structure predictions offered by AlphaFold (AF), this study computationally examines the entire UniProt database to create a robust dataset of knotted and unknotted proteins. Utilizing this dataset, we develop a machine learning (ML) model capable of accurately predicting the presence of knots in protein structures solely from their amino acid sequences. We tested the model's capabilities on 100 proteins whose structures had not yet been predicted by AF and found agreement with our local prediction in 92% cases. From the point of view of structural biology, we found that all potentially knotted proteins predicted by AF can be classified only into 17 families. This allows us to discover the presence of unknotted proteins in families with a highly conserved knot. We found only three new protein families: UCH, DUF4253, and DUF2254, that contain both knotted and unknotted proteins, and demonstrate that deletions within the knot core could potentially account for the observed unknotted (trivial) topology. Finally, we have shown that in the majority of knotted families (11 out of 15), the knotted topology is strictly conserved in functional proteins with very low sequence similarity. We have conclusively demonstrated that proteins AF predicts as unknotted are structurally accurate in their unknotted configurations. However, these proteins often represent nonfunctional fragments, lacking significant portions of the knot core (amino acid sequence).
Collapse
Affiliation(s)
- Maciej Sikora
- Centre of New Technologies, University of WarsawWarsawPoland
- Faculty of Mathematics, Informatics and Mechanics, University of WarsawWarsawPoland
| | - Eva Klimentova
- Central European Institute of Technology, Masaryk UniversityBrnoCzech Republic
- National Centre for Biomolecular Research, Faculty of Science, Masaryk UniversityBrnoCzech Republic
| | - Dawid Uchal
- Centre of New Technologies, University of WarsawWarsawPoland
- Faculty of Physics, University of WarsawWarsawPoland
| | - Denisa Sramkova
- Central European Institute of Technology, Masaryk UniversityBrnoCzech Republic
- National Centre for Biomolecular Research, Faculty of Science, Masaryk UniversityBrnoCzech Republic
| | | | - Mai Lan Nguyen
- Centre of New Technologies, University of WarsawWarsawPoland
| | - Marta Korpacz
- Centre of New Technologies, University of WarsawWarsawPoland
- Faculty of Mathematics, Informatics and Mechanics, University of WarsawWarsawPoland
| | - Roksana Malinowska
- Centre of New Technologies, University of WarsawWarsawPoland
- Faculty of Mathematics, Informatics and Mechanics, University of WarsawWarsawPoland
| | - Szymon Nowakowski
- Faculty of Mathematics, Informatics and Mechanics, University of WarsawWarsawPoland
- Faculty of Physics, University of WarsawWarsawPoland
| | - Pawel Rubach
- Centre of New Technologies, University of WarsawWarsawPoland
- Warsaw School of EconomicsWarsawPoland
| | - Petr Simecek
- Central European Institute of Technology, Masaryk UniversityBrnoCzech Republic
| | | |
Collapse
|
3
|
Gren BA, Antczak M, Zok T, Sulkowska JI, Szachniuk M. Knotted artifacts in predicted 3D RNA structures. PLoS Comput Biol 2024; 20:e1011959. [PMID: 38900780 PMCID: PMC11218946 DOI: 10.1371/journal.pcbi.1011959] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Revised: 07/02/2024] [Accepted: 06/01/2024] [Indexed: 06/22/2024] Open
Abstract
Unlike proteins, RNAs deposited in the Protein Data Bank do not contain topological knots. Recently, admittedly, the first trefoil knot and some lasso-type conformations have been found in experimental RNA structures, but these are still exceptional cases. Meanwhile, algorithms predicting 3D RNA models have happened to form knotted structures not so rarely. Interestingly, machine learning-based predictors seem to be more prone to generate knotted RNA folds than traditional methods. A similar situation is observed for the entanglements of structural elements. In this paper, we analyze all models submitted to the CASP15 competition in the 3D RNA structure prediction category. We show what types of topological knots and structure element entanglements appear in the submitted models and highlight what methods are behind the generation of such conformations. We also study the structural aspect of susceptibility to entanglement. We suggest that predictors take care of an evaluation of RNA models to avoid publishing structures with artifacts, such as unusual entanglements, that result from hallucinations of predictive algorithms.
Collapse
Affiliation(s)
- Bartosz A. Gren
- Centre of New Technologies, University of Warsaw, Warsaw, Poland
| | - Maciej Antczak
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| | - Tomasz Zok
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
| | | | - Marta Szachniuk
- Institute of Computing Science, Poznan University of Technology, Poznan, Poland
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Poznan, Poland
| |
Collapse
|
4
|
Sriramoju MK, Ko KT, Hsu STD. Tying a true topological protein knot by cyclization. Biochem Biophys Res Commun 2024; 696:149470. [PMID: 38244314 DOI: 10.1016/j.bbrc.2024.149470] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 12/23/2023] [Accepted: 01/03/2024] [Indexed: 01/22/2024]
Abstract
Knotted proteins are fascinating to biophysicists because of their robust ability to fold into intricately defined three-dimensional structures with complex and topologically knotted arrangements. Exploring the biophysical properties of the knotted proteins is of significant interest, as they could offer enhanced chemical, thermal, and mechanostabilities. A true mathematical knot requires a closed path; in contrast, knotted protein structures have open N- and C-termini. To address the question of how a truly knotted protein differs from the naturally occurring counterpart, we enzymatically cyclized a 31 knotted YibK protein from Haemophilus influenza (HiYibK) to investigate the impact of path closure on its structure-function relationship and folding stability. Through the use of a multitude of structural and biophysical tools, including X-ray crystallography, NMR spectroscopy, small angle X-ray scattering, differential scanning calorimetry, and isothermal calorimetry, we showed that the path closure minimally perturbs the native structure and ligand binding of HiYibK. Nevertheless, the cyclization did alter the folding stability and mechanism according to chemical and thermal unfolding analysis. These molecular insights contribute to our fundamental understanding of protein folding and knotting that could have implications in the protein design with higher stabilities.
Collapse
Affiliation(s)
| | - Kuang-Ting Ko
- Institute of Biological Chemistry, Academia Sinica, Taipei, 11529, Taiwan
| | - Shang-Te Danny Hsu
- Institute of Biological Chemistry, Academia Sinica, Taipei, 11529, Taiwan; Institute of Biochemical Sciences, National Taiwan University, Taipei, 106319, Taiwan; International Institute for Sustainability with Knotted Chiral Meta Matter (SKCM(2)), Hiroshima University, Higashihiroshima, 739-8527, Japan.
| |
Collapse
|
5
|
Hsu MF, Sriramoju MK, Lai CH, Chen YR, Huang JS, Ko TP, Huang KF, Hsu STD. Structure, dynamics, and stability of the smallest and most complex 7 1 protein knot. J Biol Chem 2024; 300:105553. [PMID: 38072060 PMCID: PMC10840475 DOI: 10.1016/j.jbc.2023.105553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Revised: 11/21/2023] [Accepted: 12/04/2023] [Indexed: 12/29/2023] Open
Abstract
Proteins can spontaneously tie a variety of intricate topological knots through twisting and threading of the polypeptide chains. Recently developed artificial intelligence algorithms have predicted several new classes of topological knotted proteins, but the predictions remain to be authenticated experimentally. Here, we showed by X-ray crystallography and solution-state NMR spectroscopy that Q9PR55, an 89-residue protein from Ureaplasma urealyticum, possesses a novel 71 knotted topology that is accurately predicted by AlphaFold 2, except for the flexible N terminus. Q9PR55 is monomeric in solution, making it the smallest and most complex knotted protein known to date. In addition to its exceptional chemical stability against urea-induced unfolding, Q9PR55 is remarkably robust to resist the mechanical unfolding-coupled proteolysis by a bacterial proteasome, ClpXP. Our results suggest that the mechanical resistance against pulling-induced unfolding is determined by the complexity of the knotted topology rather than the size of the molecule.
Collapse
Affiliation(s)
- Min-Feng Hsu
- Institute of Biological Chemistry, Academia Sinica, Taipei, Taiwan
| | | | - Chih-Hsuan Lai
- Institute of Biological Chemistry, Academia Sinica, Taipei, Taiwan
| | - Yun-Ru Chen
- Academia Sinica Protein Clinic, Academia Sinica, Taipei, Taiwan
| | - Jing-Siou Huang
- Academia Sinica Protein Clinic, Academia Sinica, Taipei, Taiwan
| | - Tzu-Ping Ko
- Institute of Biological Chemistry, Academia Sinica, Taipei, Taiwan
| | - Kai-Fa Huang
- Institute of Biological Chemistry, Academia Sinica, Taipei, Taiwan; Academia Sinica Protein Clinic, Academia Sinica, Taipei, Taiwan
| | - Shang-Te Danny Hsu
- Institute of Biological Chemistry, Academia Sinica, Taipei, Taiwan; Academia Sinica Protein Clinic, Academia Sinica, Taipei, Taiwan; Institute of Biochemical Sciences, National Taiwan University, Taipei, Taiwan; International Institute for Sustainability with Knotted Chiral Meta Matter (WPI-SKCM(2)), Hiroshima University, Higashihiroshima, Japan.
| |
Collapse
|