1
|
Vats S, Bobrovs R, Söderhjelm P, Bhakat S. AlphaFold-SFA: Accelerated sampling of cryptic pocket opening, protein-ligand binding and allostery by AlphaFold, slow feature analysis and metadynamics. PLoS One 2024; 19:e0307226. [PMID: 39190764 DOI: 10.1371/journal.pone.0307226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Accepted: 07/02/2024] [Indexed: 08/29/2024] Open
Abstract
Sampling rare events in proteins is crucial for comprehending complex phenomena like cryptic pocket opening, where transient structural changes expose new binding sites. Understanding these rare events also sheds light on protein-ligand binding and allosteric communications, where distant site interactions influence protein function. Traditional unbiased molecular dynamics simulations often fail to sample such rare events, as the free energy barrier between metastable states is large relative to the thermal energy. This renders these events inaccessible on the timescales typically simulated by unbiased molecular dynamics, limiting our understanding of these critical processes. In this paper, we proposed a novel unsupervised learning approach termed as slow feature analysis (SFA) which aims to extract slowly varying features from high-dimensional temporal data. SFA trained on small unbiased molecular dynamics simulations launched from AlphaFold generated conformational ensembles manages to capture rare events governing cryptic pocket opening, protein-ligand binding, and allosteric communications in a kinase. Metadynamics simulations using SFA as collective variables manage to sample 'deep' cryptic pocket opening within a few hundreds of nanoseconds which was beyond the reach of microsecond long unbiased molecular dynamics simulations. SFA augmented metadynamics also managed to capture conformational plasticity of protein upon ligand binding/unbinding and provided novel insights into allosteric communication in receptor-interacting protein kinase 2 (RIPK2) which dictates protein-protein interaction. Taken together, our results show how SFA acts as a dimensionality reduction tool which bridges the gap between AlphaFold, molecular dynamics simulation and metadynamics in context of capturing rare events in biomolecules, extending the scope of structure-based drug discovery in the era of AlphaFold.
Collapse
Affiliation(s)
- Shray Vats
- Department of Computer Science, University of Texas at Austin, Austin, TX, United States of America
| | | | - Pär Söderhjelm
- Division of Biophysical Chemistry, Chemical Center, Lund University, Lund, Sweden
| | | |
Collapse
|
2
|
Guo Y, Zhou J, Jia W, Gao H, Zhang H, Zhang C. Characterization of a Novel Milk-Clotting Aspartic Protease from Penicillium sp. and Structural Explanation for its High Milk-Clotting Index. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2023. [PMID: 37017929 DOI: 10.1021/acs.jafc.2c07303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
A novel milk-clotting enzyme isolated from Penicillium sp. ACCC 39790 (PsMCE) was prepared by heterologous expression. The recombinant PsMCE had an apparent molecular mass of 45 kDa and exhibited maximum casein hydrolysis activity at pH 4.0 and 50 °C. The PsMCE activity was enhanced by calcium ions and strongly inhibited by pepstatin A. Through hydrolysis pattern and cleavage site analyses, the milk-clotting activity of PsMCE was related to its specific hydrolysis between Phe105 and Met106 in the κ-casein proteins. The structural basis of PsMCE was characterized using homology modeling, molecular docking, and interactional analysis. The P1' region of PsMCE is critical for its selective binding to the hydrolytic site in κ-casein, and the hydrophobic forces play a decisive role in the specific cleavage of Phe105 and Met106. These interactional analyses between PsMCE and the ligand peptide clarified the fundamentals of its high milk-clotting index (MCI). PsMCE could be applied in cheese making due to its thermolability and high MCI value as a potential milk-clotting enzyme.
Collapse
Affiliation(s)
- Yujie Guo
- Key Laboratory of Agro-Products Processing, Ministry of Agriculture and Rural Affairs, Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Beijing 100193, China
- Xinjiang Taikun Group Co., Ltd., Xinjiang Uygur Autonomous Region, Changji 831100, People's Republic of China
| | - Jiaojiao Zhou
- Key Laboratory of Agro-Products Processing, Ministry of Agriculture and Rural Affairs, Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Wei Jia
- Key Laboratory of Agro-Products Processing, Ministry of Agriculture and Rural Affairs, Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Beijing 100193, China
- Xinjiang Taikun Group Co., Ltd., Xinjiang Uygur Autonomous Region, Changji 831100, People's Republic of China
| | - Hongwei Gao
- Key Laboratory of Agro-Products Processing, Ministry of Agriculture and Rural Affairs, Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Beijing 100193, China
- Xinjiang Taikun Group Co., Ltd., Xinjiang Uygur Autonomous Region, Changji 831100, People's Republic of China
| | - Hongru Zhang
- Key Laboratory of Agro-Products Processing, Ministry of Agriculture and Rural Affairs, Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Chunhui Zhang
- Key Laboratory of Agro-Products Processing, Ministry of Agriculture and Rural Affairs, Institute of Food Science and Technology, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| |
Collapse
|
3
|
Meller A, Bhakat S, Solieva S, Bowman GR. Accelerating Cryptic Pocket Discovery Using AlphaFold. J Chem Theory Comput 2023. [PMID: 36948209 PMCID: PMC10373493 DOI: 10.1021/acs.jctc.2c01189] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/24/2023]
Abstract
Cryptic pockets, or pockets absent in ligand-free, experimentally determined structures, hold great potential as drug targets. However, cryptic pocket openings are often beyond the reach of conventional biomolecular simulations because certain cryptic pocket openings involve slow motions. Here, we investigate whether AlphaFold can be used to accelerate cryptic pocket discovery either by generating structures with open pockets directly or generating structures with partially open pockets that can be used as starting points for simulations. We use AlphaFold to generate ensembles for 10 known cryptic pocket examples, including five that were deposited after AlphaFold's training data were extracted from the PDB. We find that in 6 out of 10 cases AlphaFold samples the open state. For plasmepsin II, an aspartic protease from the causative agent of malaria, AlphaFold only captures a partial pocket opening. As a result, we ran simulations from an ensemble of AlphaFold-generated structures and show that this strategy samples cryptic pocket opening, even though an equivalent amount of simulations launched from a ligand-free experimental structure fails to do so. Markov state models (MSMs) constructed from the AlphaFold-seeded simulations quickly yield a free energy landscape of cryptic pocket opening that is in good agreement with the same landscape generated with well-tempered metadynamics. Taken together, our results demonstrate that AlphaFold has a useful role to play in cryptic pocket discovery but that many cryptic pockets may remain difficult to sample using AlphaFold alone.
Collapse
Affiliation(s)
- Artur Meller
- Department of Biochemistry and Molecular Biophysics, Washington University in St. Louis, 660 S. Euclid Ave., St. Louis, Missouri 63110, United States
- Medical Scientist Training Program, Washington University in St. Louis, 660 S. Euclid Ave., St. Louis, Missouri 63110, United States
| | - Soumendranath Bhakat
- Department of Biochemistry and Molecular Biophysics, Washington University in St. Louis, 660 S. Euclid Ave., St. Louis, Missouri 63110, United States
- Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - Shahlo Solieva
- Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - Gregory R Bowman
- Department of Biochemistry and Molecular Biophysics, Washington University in St. Louis, 660 S. Euclid Ave., St. Louis, Missouri 63110, United States
- Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| |
Collapse
|
4
|
Polino AJ, Miller JJ, Bhakat S, Mukherjee S, Bobba S, Bowman GR, Goldberg DE. The nepenthesin insert in the Plasmodium falciparum aspartic protease plasmepsin V is necessary for enzyme function. J Biol Chem 2022; 298:102355. [PMID: 35952758 PMCID: PMC9478907 DOI: 10.1016/j.jbc.2022.102355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 08/01/2022] [Accepted: 08/02/2022] [Indexed: 11/19/2022] Open
Abstract
Plasmepsin V (PM V) is a pepsin-like aspartic protease essential for growth of the malarial parasite Plasmodium falciparum. Previous work has shown PM V to be an endoplasmic reticulum-resident protease that processes parasite proteins destined for export into the host cell. Depletion or inhibition of the enzyme is lethal during asexual replication within red blood cells as well as during the formation of sexual stage gametocytes. The structure of the Plasmodium vivax PM V has been characterized by X-ray crystallography, revealing a canonical pepsin fold punctuated by structural features uncommon to secretory aspartic proteases; however, the function of this unique structure is unclear. Here, we used parasite genetics to probe these structural features by attempting to rescue lethal PM V depletion with various mutant enzymes. We found an unusual nepenthesin 1-type insert in the PM V gene to be essential for parasite growth and PM V activity. Mutagenesis of the nepenthesin insert suggests that both its amino acid sequence and one of the two disulfide bonds that undergird its structure are required for the insert's role in PM V function. Furthermore, molecular dynamics simulations paired with Markov state modeling suggest that mutations to the nepenthesin insert may allosterically affect PM V catalysis through multiple mechanisms. Taken together, these data provide further insights into the structure of the P. falciparum PM V protease.
Collapse
Affiliation(s)
- Alexander J Polino
- Division of Infectious Diseases, Department of Medicine and Department of Molecular Microbiology, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Justin J Miller
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Soumendranath Bhakat
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Sumit Mukherjee
- Division of Infectious Diseases, Department of Medicine and Department of Molecular Microbiology, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Suhas Bobba
- Division of Infectious Diseases, Department of Medicine and Department of Molecular Microbiology, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Gregory R Bowman
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, Missouri, USA
| | - Daniel E Goldberg
- Division of Infectious Diseases, Department of Medicine and Department of Molecular Microbiology, Washington University School of Medicine, St. Louis, Missouri, USA.
| |
Collapse
|
5
|
Bhakat S. Collective variable discovery in the age of machine learning: reality, hype and everything in between. RSC Adv 2022; 12:25010-25024. [PMID: 36199882 PMCID: PMC9437778 DOI: 10.1039/d2ra03660f] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Accepted: 08/20/2022] [Indexed: 11/21/2022] Open
Abstract
Understanding the kinetics and thermodynamics profile of biomolecules is necessary to understand their functional roles which has a major impact in mechanism driven drug discovery. Molecular dynamics simulation has been routinely used to understand conformational dynamics and molecular recognition in biomolecules. Statistical analysis of high-dimensional spatiotemporal data generated from molecular dynamics simulation requires identification of a few low-dimensional variables which can describe the essential dynamics of a system without significant loss of information. In physical chemistry, these low-dimensional variables are often called collective variables. Collective variables are used to generate reduced representations of free energy surfaces and calculate transition probabilities between different metastable basins. However the choice of collective variables is not trivial for complex systems. Collective variables range from geometric criteria such as distances and dihedral angles to abstract ones such as weighted linear combinations of multiple geometric variables. The advent of machine learning algorithms led to increasing use of abstract collective variables to represent biomolecular dynamics. In this review, I will highlight several nuances of commonly used collective variables ranging from geometric to abstract ones. Further, I will put forward some cases where machine learning based collective variables were used to describe simple systems which in principle could have been described by geometric ones. Finally, I will put forward my thoughts on artificial general intelligence and how it can be used to discover and predict collective variables from spatiotemporal data generated by molecular dynamics simulations.
Collapse
Affiliation(s)
- Soumendranath Bhakat
- Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania Pennsylvania 19104-6059 USA +1 30549 32620
| |
Collapse
|
6
|
Bobrovs R, Basens EE, Drunka L, Kanepe I, Matisone S, Velins KK, Andrianov V, Leitis G, Zelencova-Gopejenko D, Rasina D, Jirgensons A, Jaudzems K. Exploring Aspartic Protease Inhibitor Binding to Design-Selective Antimalarials. J Chem Inf Model 2022; 62:3263-3273. [PMID: 35712895 DOI: 10.1021/acs.jcim.2c00422] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Selectivity is a major issue in the development of drugs targeting pathogen aspartic proteases. Here, we explore the selectivity-determining factors by studying specifically designed malaria aspartic protease (plasmepsin) open-flap inhibitors. Metadynamics simulations are used to uncover the complex binding/unbinding pathways of these inhibitors and describe the critical transition states in atomistic resolution. The simulation results are compared with experimentally determined enzymatic activities. Our findings demonstrate that plasmepsin inhibitor selectivity can be achieved by targeting the flap loop with hydrophobic substituents that enable ligand binding under the flap loop, as such a behavior is not observed for several other aspartic proteases. The ability to estimate the selectivity of compounds before they are synthesized is of considerable importance in drug design; therefore, we expect that our approach will be useful in selective inhibitor designs against not only aspartic proteases but also other enzyme classes.
Collapse
Affiliation(s)
- Raitis Bobrovs
- Latvian Institute of Organic Synthesis, Aizkraukles 21, Riga LV1006, Latvia
| | | | - Laura Drunka
- Latvian Institute of Organic Synthesis, Aizkraukles 21, Riga LV1006, Latvia
| | - Iveta Kanepe
- Latvian Institute of Organic Synthesis, Aizkraukles 21, Riga LV1006, Latvia
| | - Sofija Matisone
- Latvian Institute of Organic Synthesis, Aizkraukles 21, Riga LV1006, Latvia
| | | | - Victor Andrianov
- Latvian Institute of Organic Synthesis, Aizkraukles 21, Riga LV1006, Latvia
| | - Gundars Leitis
- Latvian Institute of Organic Synthesis, Aizkraukles 21, Riga LV1006, Latvia
| | | | - Dace Rasina
- Latvian Institute of Organic Synthesis, Aizkraukles 21, Riga LV1006, Latvia
| | - Aigars Jirgensons
- Latvian Institute of Organic Synthesis, Aizkraukles 21, Riga LV1006, Latvia
| | - Kristaps Jaudzems
- Latvian Institute of Organic Synthesis, Aizkraukles 21, Riga LV1006, Latvia
| |
Collapse
|