1
|
Díaz-Holguín A, Saarinen M, Vo DD, Sturchio A, Branzell N, Cabeza de Vaca I, Hu H, Mitjavila-Domènech N, Lindqvist A, Baranczewski P, Millan MJ, Yang Y, Carlsson J, Svenningsson P. AlphaFold accelerated discovery of psychotropic agonists targeting the trace amine-associated receptor 1. Science Advances 2024; 10:eadn1524. [PMID: 39110804 PMCID: PMC11305387 DOI: 10.1126/sciadv.adn1524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/25/2023] [Accepted: 06/28/2024] [Indexed: 08/10/2024]
Abstract
Artificial intelligence is revolutionizing protein structure prediction, providing unprecedented opportunities for drug design. To assess the potential impact on ligand discovery, we compared virtual screens using protein structures generated by the AlphaFold machine learning method and traditional homology modeling. More than 16 million compounds were docked to models of the trace amine-associated receptor 1 (TAAR1), a G protein-coupled receptor of unknown structure and target for treating neuropsychiatric disorders. Sets of 30 and 32 highly ranked compounds from the AlphaFold and homology model screens, respectively, were experimentally evaluated. Of these, 25 were TAAR1 agonists with potencies ranging from 12 to 0.03 μM. The AlphaFold screen yielded a more than twofold higher hit rate (60%) than the homology model and discovered the most potent agonists. A TAAR1 agonist with a promising selectivity profile and drug-like properties showed physiological and antipsychotic-like effects in wild-type but not in TAAR1 knockout mice. These results demonstrate that AlphaFold structures can accelerate drug discovery.
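The hit rates quoted in this abstract can be cross-checked with a little arithmetic. The sketch below assumes that the 60% figure refers to the 30 compounds from the AlphaFold screen and that all remaining agonists (out of 25 in total) came from the 32 homology-model compounds; the per-screen hit counts are inferred here, not stated in the abstract.

```python
# Totals and the 60% rate come from the abstract; the per-screen split is inferred.
af_tested, hm_tested, total_hits = 30, 32, 25

af_hits = round(0.60 * af_tested)   # assumed: 60% of the 30 AlphaFold-screen compounds
hm_hits = total_hits - af_hits      # assumed: every other agonist came from the homology screen

af_rate = af_hits / af_tested
hm_rate = hm_hits / hm_tested
fold = af_rate / hm_rate            # ratio of the two hit rates

print(f"AlphaFold: {af_rate:.0%}, homology model: {hm_rate:.1%}, ratio: {fold:.2f}x")
```

Under these assumptions the ratio comes out above 2, which is consistent with the "more than twofold" statement.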
Affiliation(s)
- Alejandro Díaz-Holguín
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Box 596, SE-751 24 Uppsala, Sweden
| | - Marcus Saarinen
- Neuro Svenningsson, Department of Clinical Neuroscience, Karolinska Institute, SE-171 76 Stockholm, Sweden
| | - Duc Duy Vo
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Box 596, SE-751 24 Uppsala, Sweden
| | - Andrea Sturchio
- Neuro Svenningsson, Department of Clinical Neuroscience, Karolinska Institute, SE-171 76 Stockholm, Sweden
- Department of Neurology, James J. and Joan A. Gardner Family Center for Parkinson's Disease and Movement Disorders, University of Cincinnati, Cincinnati, OH, USA
| | - Niclas Branzell
- Neuro Svenningsson, Department of Clinical Neuroscience, Karolinska Institute, SE-171 76 Stockholm, Sweden
| | - Israel Cabeza de Vaca
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Box 596, SE-751 24 Uppsala, Sweden
| | - Huabin Hu
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Box 596, SE-751 24 Uppsala, Sweden
| | - Núria Mitjavila-Domènech
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Box 596, SE-751 24 Uppsala, Sweden
| | - Annika Lindqvist
- Department of Pharmacy, SciLifeLab Drug Discovery and Development Platform, Uppsala University, Box 580, SE-751 23 Uppsala, Sweden
| | - Pawel Baranczewski
- Department of Pharmacy, SciLifeLab Drug Discovery and Development Platform, Uppsala University, Box 580, SE-751 23 Uppsala, Sweden
| | - Mark J. Millan
- Neuroinflammation Therapeutic Area, Institut de Recherches Servier, Centre de Recherches de Croissy, Paris, France and Institute of Neuroscience and Psychology, College of Medicine, Vet and Life Sciences, Glasgow University, Scotland, Glasgow, UK
| | - Yunting Yang
- Neuro Svenningsson, Department of Clinical Neuroscience, Karolinska Institute, SE-171 76 Stockholm, Sweden
| | - Jens Carlsson
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Box 596, SE-751 24 Uppsala, Sweden
| | - Per Svenningsson
- Neuro Svenningsson, Department of Clinical Neuroscience, Karolinska Institute, SE-171 76 Stockholm, Sweden
| |
|
2
|
Ohnuki J, Okazaki KI. Integration of AlphaFold with Molecular Dynamics for Efficient Conformational Sampling of Transporter Protein NarK. J Phys Chem B 2024. [PMID: 39066727 DOI: 10.1021/acs.jpcb.4c02726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/30/2024]
Abstract
Transporter proteins carry their substrate across the cell membrane by changing their conformation. Thus, conformational dynamics are crucial for transport function. However, clarifying the complete transport cycle is challenging even with current structural biology approaches. Molecular dynamics (MD) simulation is a computational approach that can provide the time-resolved conformational dynamics of transporter proteins in atomic detail but suffers from a high computational cost. Here, we integrate the state-of-the-art protein structure prediction AI, AlphaFold2 (AF2), with MD simulation to reduce the computational cost. Focusing on the transporter protein NarK, we first show that AF2 sampled broad conformations of NarK, including the inward-open, occluded, and outward-open states. We also applied the coevolution-informed mutation in AF2, identifying state-shifting mutations. Then, we show that MD simulations from the AF2-generated outward-open conformation, which is experimentally unresolved, captured the essence of the conformational state. We also found that MD simulations from AF2-generated intermediates showed transient dynamics like a transition state connecting two conformational states. This study paves the way for efficient conformational sampling of transporter proteins.
Affiliation(s)
- Jun Ohnuki
- Research Center for Computational Science, Institute for Molecular Science, National Institutes of Natural Sciences, Okazaki, Aichi 444-8585, Japan
- Graduate Institute for Advanced Studies, SOKENDAI, Okazaki, Aichi 444-8585, Japan
| | - Kei-Ichi Okazaki
- Research Center for Computational Science, Institute for Molecular Science, National Institutes of Natural Sciences, Okazaki, Aichi 444-8585, Japan
- Graduate Institute for Advanced Studies, SOKENDAI, Okazaki, Aichi 444-8585, Japan
| |
|
3
|
Swapna GVT, Dube N, Roth MJ, Montelione GT. Modeling Alternative Conformational States of Pseudo-Symmetric Solute Carrier Transporters using Methods from Machine Learning. bioRxiv 2024:2024.07.15.603529. [PMID: 39071413 PMCID: PMC11275918 DOI: 10.1101/2024.07.15.603529] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/30/2024]
Abstract
The Solute Carrier (SLC) superfamily of integral membrane proteins functions to transport a wide array of solutes across the plasma and organelle membranes. SLC proteins also function as important drug transporters and as viral receptors. Despite being classified as a single superfamily, SLC proteins do not share a single common fold classification; however, most belong to multi-pass transmembrane helical protein fold families. SLC proteins populate different conformational states during the solute transport process, including outward open, intermediate (occluded), and inward open conformational states. For some SLC fold families this structural "flipping" corresponds to swapping between conformations of their N-terminal and C-terminal symmetry-related sub-structures. Conventional AlphaFold2 or Evolutionary Scale Modeling methods typically generate models for only one of these multiple conformational states of SLC proteins. Here we describe a fast and simple approach for modeling multiple conformational states of SLC proteins using a combined ESM-AF2 process. The resulting multi-state models are validated by comparison with sequence-based evolutionary co-variance data (ECs) that encode information about contacts present in the various conformational states adopted by the protein. We also explored the impact of mutations on conformational distributions of SLC proteins modeled by AlphaFold2 using both conventional and enhanced sampling methods. This approach for modeling conformational landscapes of pseudo-symmetric SLC proteins is demonstrated for several integral membrane protein transporters, including SLC35F2, the receptor of a feline leukemia virus envelope protein required for viral entry into eukaryotic cells.
Affiliation(s)
- G V T Swapna
- Dept. of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, New York, 12180 USA
- Department of Pharmacology, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway NJ 08854 USA
| | - Namita Dube
- Dept. of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, New York, 12180 USA
| | - Monica J Roth
- Department of Pharmacology, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway NJ 08854 USA
| | - Gaetano T Montelione
- Dept. of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, New York, 12180 USA
| |
|
4
|
Samanta R, Harmalkar A, Prathima P, Gray JJ. Advancing membrane-associated protein docking with improved sampling and scoring in Rosetta. bioRxiv 2024:2024.07.09.602802. [PMID: 39026849 PMCID: PMC11257521 DOI: 10.1101/2024.07.09.602802] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/20/2024]
Abstract
The oligomerization of protein macromolecules on cell membranes plays a fundamental role in regulating cellular function. From modulating signal transduction to directing immune response, membrane proteins (MPs) play a crucial role in biological processes and are often the target of many pharmaceutical drugs. Despite their biological relevance, the challenges in experimental determination have hampered the structural availability of membrane proteins and their complexes. Computational docking provides a promising alternative to model membrane protein complex structures. Here, we present Rosetta-MPDock, a flexible transmembrane (TM) protein docking protocol that captures binding-induced conformational changes. Rosetta-MPDock samples large conformational ensembles of flexible monomers and docks them within an implicit membrane environment. We benchmarked this method on 29 TM-protein complexes of variable backbone flexibility. These complexes are classified based on the root-mean-square deviation between the unbound and bound states (RMSD_UB) as: rigid (RMSD_UB < 1.2 Å), moderately flexible (RMSD_UB ∈ [1.2, 2.2) Å), and flexible targets (RMSD_UB > 2.2 Å). In a local docking scenario, i.e. with membrane protein partners starting ≈10 Å apart embedded in the membrane in their unbound conformations, Rosetta-MPDock successfully predicts the correct interface (success defined as achieving 3 near-native structures in the 5 top-ranked models) for 67% of the moderately flexible targets and 60% of the highly flexible targets, a substantial improvement over existing membrane protein docking methods. Further, by integrating AlphaFold2-multimer for structure determination and using Rosetta-MPDock for docking and refinement, we demonstrate improved success rates over the benchmark targets from 64% to 73%.
Rosetta-MPDock advances the capabilities for membrane protein complex structure prediction and modeling to tackle key biological questions and elucidate functional mechanisms in the membrane environment. The benchmark set and the code are available for public use at github.com/Graylab/MPDock.
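The flexibility classes and the success criterion described in this abstract amount to two simple threshold rules. The following is a minimal sketch of those rules only; it is not code from the MPDock repository, the function names are hypothetical, and two readings are assumptions: a value of exactly 2.2 Å is treated as flexible, and "3 near-native structures in the 5 top-ranked models" is read as "at least 3".

```python
def classify_flexibility(rmsd_ub):
    """Bin a target by its unbound-to-bound RMSD (in Å) using the cutoffs quoted in the abstract."""
    if rmsd_ub < 1.2:
        return "rigid"
    if rmsd_ub < 2.2:
        return "moderately flexible"
    # Assumption: the abstract says "> 2.2", so the class of exactly 2.2 is not stated.
    return "flexible"


def is_docking_success(ranked_near_native):
    """Assumed reading of the criterion: at least 3 near-native models among the 5 top-ranked.

    ranked_near_native is a list of booleans ordered from best-ranked to worst-ranked.
    """
    return sum(ranked_near_native[:5]) >= 3


print(classify_flexibility(0.8), classify_flexibility(1.5), classify_flexibility(3.1))
print(is_docking_success([True, False, True, True, False, True]))
```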
Affiliation(s)
- Rituparna Samanta
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, MD 21218, USA
- Current affiliation: University of South Florida, Tampa, FL, USA
| | - Ameya Harmalkar
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, MD 21218, USA
- Current affiliation: Generate Biomedicines Inc., Cambridge, MA, USA
| | - Priyamvada Prathima
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, MD 21218, USA
- Current affiliation: Department of Immunology, Blavatnik Institute, Harvard Medical School, Boston, MA, USA
| | - Jeffrey J. Gray
- Department of Chemical and Biomolecular Engineering, The Johns Hopkins University, Baltimore, MD 21218, USA
| |
|
5
|
Licht JA, Berry SP, Gutierrez MA, Gaudet R. They all rock: A systematic comparison of conformational movements in LeuT-fold transporters. Structure 2024:S0969-2126(24)00233-8. [PMID: 39025067 DOI: 10.1016/j.str.2024.06.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/24/2024] [Revised: 05/30/2024] [Accepted: 06/21/2024] [Indexed: 07/20/2024]
Abstract
Many membrane transporters share the LeuT fold: two five-helix repeats inverted across the membrane plane. Despite hundreds of structures, whether distinct conformational mechanisms are supported by the LeuT fold has not been systematically determined. After annotating published LeuT-fold structures, we analyzed distance difference matrices (DDMs) for nine proteins with multiple available conformations. We identified rigid bodies and relative movements of transmembrane helices (TMs) during distinct steps of the transport cycle. In all transporters, the bundle (first two TMs of each repeat) rotates relative to the hash (third and fourth TMs). Motions of the arms (fifth TM) to close or open the intracellular and outer vestibules are common, as is a TM1a swing, with notable variations in the opening-closing motions of the outer vestibule. Our analyses suggest that LeuT-fold transporters layer distinct motions on a common bundle-hash rock and demonstrate that systematic analyses can provide new insights into large structural datasets.
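A distance difference matrix, as used in this abstract, is the element-wise difference between the pairwise distance matrices of two conformations of the same protein. The sketch below illustrates the general idea with toy coordinates; it is not the authors' pipeline, and the function names and the three-point example are hypothetical. It assumes both conformations list the same atoms (for instance Cα atoms) in the same order.

```python
import math


def distance_matrix(coords):
    """Pairwise Euclidean distances between all points of one conformation."""
    return [[math.dist(a, b) for b in coords] for a in coords]


def distance_difference_matrix(coords_a, coords_b):
    """Element-wise difference D_b - D_a between two conformations' distance matrices."""
    if len(coords_a) != len(coords_b):
        raise ValueError("both conformations must contain the same atoms in the same order")
    d_a = distance_matrix(coords_a)
    d_b = distance_matrix(coords_b)
    n = len(coords_a)
    return [[d_b[i][j] - d_a[i][j] for j in range(n)] for i in range(n)]


# Toy example: point 1 moves away from point 0, while points 0 and 2 keep their spacing.
state_a = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0), (0.0, 4.0, 0.0)]
state_b = [(0.0, 0.0, 0.0), (6.0, 0.0, 0.0), (0.0, 4.0, 0.0)]
ddm = distance_difference_matrix(state_a, state_b)
for row in ddm:
    print([round(value, 2) for value in row])
```

Pairs of atoms that move together as a rigid body give entries near zero, which is why blocks of small values in such a matrix are commonly used to pick out rigid bodies. The matrix is also independent of how the two structures are superimposed.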
Affiliation(s)
- Jacob A Licht
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138, USA
| | - Samuel P Berry
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138, USA
| | - Michael A Gutierrez
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138, USA
| | - Rachelle Gaudet
- Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02138, USA.
| |
|
6
|
Lombard V, Grudinin S, Laine E. Explaining Conformational Diversity in Protein Families through Molecular Motions. Sci Data 2024; 11:752. [PMID: 38987561 PMCID: PMC11237097 DOI: 10.1038/s41597-024-03524-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/31/2024] [Accepted: 06/14/2024] [Indexed: 07/12/2024]
Abstract
Proteins play a central role in biological processes, and understanding their conformational variability is crucial for unraveling their functional mechanisms. Recent advancements in high-throughput technologies have enhanced our knowledge of protein structures, yet predicting their multiple conformational states and motions remains challenging. This study introduces Dimensionality Analysis for protein Conformational Exploration (DANCE) for a systematic and comprehensive description of protein families' conformational variability. DANCE accommodates both experimental and predicted structures. It is suitable for analysing anything from single proteins to superfamilies. Employing it, we clustered all experimentally resolved protein structures available in the Protein Data Bank into conformational collections and characterized them as sets of linear motions. The resource facilitates access and exploitation of the multiple states adopted by a protein and its homologs. Beyond descriptive analysis, we assessed classical dimensionality reduction techniques for sampling unseen states on a representative benchmark. This work improves our understanding of how proteins deform to perform their functions and opens ways to a standardised evaluation of methods designed to sample and generate protein conformations.
Affiliation(s)
- Valentin Lombard
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005, Paris, France
| | - Sergei Grudinin
- Université Grenoble Alpes, CNRS, Grenoble INP, LJK, 38000, Grenoble, France.
| | - Elodie Laine
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005, Paris, France.
- Institut Universitaire de France (IUF), Paris, France.
| |
|
7
|
Raisinghani N, Alshahrani M, Gupta G, Verkhivker G. Atomistic Prediction of Structures, Conformational Ensembles and Binding Energetics for the SARS-CoV-2 Spike JN.1, KP.2 and KP.3 Variants Using AlphaFold2 and Molecular Dynamics Simulations: Mutational Profiling and Binding Free Energy Analysis Reveal Epistatic Hotspots of the ACE2 Affinity and Immune Escape. bioRxiv 2024:2024.07.09.602810. [PMID: 39026832 PMCID: PMC11257589 DOI: 10.1101/2024.07.09.602810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/20/2024]
Abstract
The most recent wave of SARS-CoV-2 Omicron variants descending from BA.2 and BA.2.86 exhibited improved viral growth and fitness due to convergent evolution of functional hotspots. These hotspots operate in tandem to optimize both receptor binding for effective infection and immune evasion efficiency, thereby maintaining overall viral fitness. The lack of molecular details on structure, dynamics and binding energetics of the latest FLiRT and FLuQE variants with the ACE2 receptor and antibodies presents a considerable challenge that is explored in this study. We combined AlphaFold2-based atomistic predictions of structures and conformational ensembles of the SARS-CoV-2 Spike complexes with the host receptor ACE2 for the most dominant Omicron variants JN.1, KP.1, KP.2 and KP.3 to examine the mechanisms underlying the role of convergent evolution hotspots in balancing ACE2 binding and antibody evasion. Using the ensemble-based mutational scanning of the spike protein residues and computations of binding affinities, we identified binding energy hotspots and characterized the molecular basis underlying epistatic couplings between convergent mutational hotspots. The results suggested the existence of epistatic interactions between convergent mutational sites at the L455, F456 and Q493 positions that protect and restore ACE2 binding affinity while conferring beneficial immune escape. To examine immune escape mechanisms, we performed structure-based mutational profiling of the spike protein binding with several classes of antibodies that displayed impaired neutralization against BA.2.86, JN.1, KP.2 and KP.3. The results confirmed the experimental data that JN.1, KP.2 and KP.3 harboring the L455S and F456L mutations can significantly impair the neutralizing activity of class-1 monoclonal antibodies, while the epistatic effects mediated by F456L can facilitate the subsequent convergence of Q493E changes to rescue ACE2 binding.
Structural and energetic analysis provided a rationale for the experimental results showing that the BD55-5840 and BD55-5514 antibodies, which bind to different epitopes, can retain neutralizing efficacy against all examined variants BA.2.86, JN.1, KP.2 and KP.3. The results support the notion that evolution of Omicron variants may favor emergence of lineages with beneficial combinations of mutations involving mediators of epistatic couplings that control the balance of high ACE2 affinity and immune evasion.
|
8
|
Gu X, Aranganathan A, Tiwary P. Empowering AlphaFold2 for protein conformation selective drug discovery with AlphaFold2-RAVE. arXiv 2024:arXiv:2404.07102v3. [PMID: 38659642 PMCID: PMC11042445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/26/2024]
Abstract
Small molecule drug design hinges on obtaining co-crystallized ligand-protein structures. Despite AlphaFold2's strides in protein native structure prediction, its focus on apo structures overlooks ligands and associated holo structures. Moreover, designing selective drugs often benefits from the targeting of diverse metastable conformations. Therefore, direct application of AlphaFold2 models in virtual screening and drug discovery remains tentative. Here, we demonstrate an AlphaFold2 based framework combined with all-atom enhanced sampling molecular dynamics and induced fit docking, named AF2RAVE-Glide, to conduct computational model based small molecule binding of metastable protein kinase conformations, initiated from protein sequences. We demonstrate the AF2RAVE-Glide workflow on three different protein kinases and their type I and II inhibitors, with special emphasis on binding of known type II kinase inhibitors which target the metastable classical DFG-out state. These states are not easy to sample from AlphaFold2. Here we demonstrate how with AF2RAVE these metastable conformations can be sampled for different kinases with high enough accuracy to enable subsequent docking of known type II kinase inhibitors with more than 50% success rates across docking calculations. We believe the protocol should be deployable for other kinases and more proteins generally.
Affiliation(s)
- Xinyu Gu
- Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA
- University of Maryland Institute for Health Computing, Bethesda, United States
| | - Akashnathan Aranganathan
- Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA
- Biophysics Program, University of Maryland, College Park 20742, USA
| | - Pratyush Tiwary
- Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA
- Department of Chemistry and Biochemistry, University of Maryland, College Park 20742, USA
- University of Maryland Institute for Health Computing, Bethesda, United States
| |
|
9
|
Nguyen ATN, Nguyen DTN, Koh HY, Toskov J, MacLean W, Xu A, Zhang D, Webb GI, May LT, Halls ML. The application of artificial intelligence to accelerate G protein-coupled receptor drug discovery. Br J Pharmacol 2024; 181:2371-2384. [PMID: 37161878 DOI: 10.1111/bph.16140] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 09/26/2022] [Revised: 04/14/2023] [Accepted: 04/27/2023] [Indexed: 05/11/2023]
Abstract
The application of artificial intelligence (AI) approaches to drug discovery for G protein-coupled receptors (GPCRs) is a rapidly expanding area. Artificial intelligence can be used at multiple stages during the drug discovery process, from aiding our understanding of the fundamental actions of GPCRs to the discovery of new ligand-GPCR interactions or the prediction of clinical responses. Here, we provide an overview of the concepts behind artificial intelligence, including the subfields of machine learning and deep learning. We summarise the published applications of artificial intelligence to different stages of the GPCR drug discovery process. Finally, we reflect on the benefits and limitations of artificial intelligence and share our vision for the exciting potential for further development of applications to aid GPCR drug discovery. In addition to making the drug discovery process "faster, smarter and cheaper," we anticipate that the application of artificial intelligence will create exciting new opportunities for GPCR drug discovery. LINKED ARTICLES: This article is part of a themed issue Therapeutic Targeting of G Protein-Coupled Receptors: hot topics from the Australasian Society of Clinical and Experimental Pharmacologists and Toxicologists 2021 Virtual Annual Scientific Meeting. To view the other articles in this section visit http://onlinelibrary.wiley.com/doi/10.1111/bph.v181.14/issuetoc.
Affiliation(s)
- Anh T N Nguyen
- Drug Discovery Biology Theme, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria, Australia
| | - Diep T N Nguyen
- Department of Information Technology, Faculty of Engineering and Technology, Vietnam National University, Cau Giay, Hanoi, Vietnam
| | - Huan Yee Koh
- Drug Discovery Biology Theme, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria, Australia
- Monash Data Futures Institute and Department of Data Science and Artificial Intelligence, Monash University, Clayton, Victoria, Australia
| | - Jason Toskov
- Monash DeepNeuron, Monash University, Clayton, Victoria, Australia
| | - William MacLean
- Monash DeepNeuron, Monash University, Clayton, Victoria, Australia
| | - Andrew Xu
- Monash DeepNeuron, Monash University, Clayton, Victoria, Australia
| | - Daokun Zhang
- Drug Discovery Biology Theme, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria, Australia
- Monash Data Futures Institute and Department of Data Science and Artificial Intelligence, Monash University, Clayton, Victoria, Australia
| | - Geoffrey I Webb
- Monash Data Futures Institute and Department of Data Science and Artificial Intelligence, Monash University, Clayton, Victoria, Australia
| | - Lauren T May
- Drug Discovery Biology Theme, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria, Australia
| | - Michelle L Halls
- Drug Discovery Biology Theme, Monash Institute of Pharmaceutical Sciences, Monash University, Parkville, Victoria, Australia
| |
|
10
|
Herrington NB, Li YC, Stein D, Pandey G, Schlessinger A. A comprehensive exploration of the druggable conformational space of protein kinases using AI-predicted structures. PLoS Comput Biol 2024; 20:e1012302. [PMID: 39046952 PMCID: PMC11268620 DOI: 10.1371/journal.pcbi.1012302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/09/2024] [Accepted: 07/09/2024] [Indexed: 07/27/2024]
Abstract
Protein kinase function and interactions with drugs are controlled in part by the movement of the DFG and αC-helix motifs that are related to the catalytic activity of the kinase. Small molecule ligands elicit therapeutic effects with distinct selectivity profiles and residence times that often depend on the active or inactive kinase conformation(s) they bind. Modern AI-based structural modeling methods have the potential to expand upon the limited availability of experimentally determined kinase structures in inactive states. Here, we first explored the conformational space of kinases in the PDB and models generated by AlphaFold2 (AF2) and ESMFold, two prominent AI-based protein structure prediction methods. Our investigation of AF2's ability to explore the conformational diversity of the kinome at various multiple sequence alignment (MSA) depths showed a bias within the predicted structures of kinases in DFG-in conformations, particularly those controlled by the DFG motif, based on their overabundance in the PDB. We demonstrate that predicting kinase structures using AF2 at lower MSA depths explored these alternative conformations more extensively, including identifying previously unobserved conformations for 398 kinases. Ligand enrichment analyses for 23 kinases showed that, on average, docked models distinguished between active molecules and decoys better than random (average AUC (avgAUC) of 64.58), but select models perform well (e.g., avgAUCs for PTK2 and JAK2 were 79.28 and 80.16, respectively). Further analysis explained the ligand enrichment discrepancy between low- and high-performing kinase models as binding site occlusions that would preclude docking.
The overall results of our analyses suggested that, although AF2 explored previously uncharted regions of the kinase conformational space and select models exhibited enrichment scores suitable for rational drug discovery, rigorous refinement of AF2 models is likely still necessary for drug discovery campaigns.
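The enrichment figures in this abstract are AUC values, which measure how well a docked model ranks known active molecules above decoys. As a rough illustration, the sketch below computes the ROC AUC as the fraction of active/decoy pairs in which the active receives the better score, counting ties as half. The function name and the toy scores are hypothetical, and the convention that a higher score means a better-ranked compound is an assumption (raw docking energies are often the other way round and would need their sign flipped).

```python
def roc_auc(active_scores, decoy_scores):
    """ROC AUC as the probability that a random active outscores a random decoy.

    Assumes higher scores mean better-ranked compounds; ties count as half a win.
    """
    if not active_scores or not decoy_scores:
        raise ValueError("need at least one active and one decoy")
    wins = 0.0
    for active in active_scores:
        for decoy in decoy_scores:
            if active > decoy:
                wins += 1.0
            elif active == decoy:
                wins += 0.5
    return wins / (len(active_scores) * len(decoy_scores))


# Toy scores, not data from the paper.
actives = [0.9, 0.7, 0.4]
decoys = [0.8, 0.5, 0.3, 0.1]
auc = roc_auc(actives, decoys)
print(f"AUC = {auc:.2f} ({100 * auc:.0f} on a 0-100 scale)")
```

A value of 0.5 (50 on a percentage scale) corresponds to random ranking, which is the baseline the abstract compares against; the reported values appear to be on the 0-100 scale.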
Affiliation(s)
- Noah B. Herrington
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - Yan Chak Li
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - David Stein
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - Gaurav Pandey
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
- Department of Artificial Intelligence and Human Health, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| | - Avner Schlessinger
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, New York, United States of America
| |
|
11
|
Basu S, Subedi U, Tonelli M, Afshinpour M, Tiwari N, Fuentes EJ, Chakravarty S. Assessing the functional roles of coevolving PHD finger residues. Protein Sci 2024; 33:e5065. [PMID: 38923615 PMCID: PMC11201814 DOI: 10.1002/pro.5065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/31/2023] [Revised: 04/21/2024] [Accepted: 05/16/2024] [Indexed: 06/28/2024]
Abstract
Although in silico folding based on coevolving residue constraints in the deep-learning era has transformed protein structure prediction, the contributions of coevolving residues to protein folding, stability, and other functions in physical contexts remain to be clarified and experimentally validated. Herein, the PHD finger module, a well-known histone reader with distinct subtypes containing subtype-specific coevolving residues, was used as a model to experimentally assess the contributions of coevolving residues and to clarify their specific roles. The results of the assessment, including proteolysis and thermal unfolding of wildtype and mutant proteins, suggested that coevolving residues have varying contributions, despite their large in silico constraints. Residue positions with large constraints were found to contribute to stability in one subtype but not others. Computational sequence design and generative model-based energy estimates of individual structures were also implemented to complement the experimental assessment. Sequence design and energy estimates distinguish coevolving residues that contribute to folding from those that do not. The results of proteolytic analysis of mutations at positions contributing to folding were consistent with those suggested by sequence design and energy estimation. Thus, we report a comprehensive assessment of the contributions of coevolving residues, as well as a strategy based on a combination of approaches that should enable detailed understanding of the residue contributions in other large protein families.
Affiliation(s)
- Shraddha Basu
- Department of Chemistry & Biochemistry, South Dakota State University, Brookings, South Dakota, USA
| | - Ujwal Subedi
- Department of Chemistry & Biochemistry, South Dakota State University, Brookings, South Dakota, USA
| | - Marco Tonelli
- National Magnetic Resonance Facility at Madison (NMRFAM), University of Wisconsin-Madison, Madison, Wisconsin, USA
| | - Maral Afshinpour
- Department of Chemistry & Biochemistry, South Dakota State University, Brookings, South Dakota, USA
| | - Nitija Tiwari
- Department of Biochemistry & Molecular Biology, University of Iowa, Iowa City, Iowa, USA
| | - Ernesto J. Fuentes
- Department of Biochemistry & Molecular Biology, University of Iowa, Iowa City, Iowa, USA
| | - Suvobrata Chakravarty
- Department of Chemistry & Biochemistry, South Dakota State University, Brookings, South Dakota, USA
| |
|
12
|
Hong L, Kortemme T. An integrative approach to protein sequence design through multiobjective optimization. PLoS Comput Biol 2024; 20:e1011953. [PMID: 38991035 PMCID: PMC11265717 DOI: 10.1371/journal.pcbi.1011953] [Received: 02/28/2024] [Revised: 07/23/2024] [Accepted: 06/25/2024] [Indexed: 07/13/2024] Open
Abstract
With recent methodological advances in the field of computational protein design, in particular those based on deep learning, there is an increasing need for frameworks that allow for coherent, direct integration of different models and objective functions into the generative design process. Here we demonstrate how evolutionary multiobjective optimization techniques can be adapted to provide such an approach. With the established Non-dominated Sorting Genetic Algorithm II (NSGA-II) as the optimization framework, we use AlphaFold2 and ProteinMPNN confidence metrics to define the objective space, and a mutation operator composed of ESM-1v and ProteinMPNN to rank and then redesign the least favorable positions. Using the two-state design problem of the foldswitching protein RfaH as an in-depth case study, and PapD and calmodulin as examples of higher-dimensional design problems, we show that the evolutionary multiobjective optimization approach leads to significant reduction in the bias and variance in RfaH native sequence recovery, compared to a direct application of ProteinMPNN. We suggest that this improvement is due to three factors: (i) the use of an informative mutation operator that accelerates the sequence space exploration, (ii) the parallel, iterative design process inherent to the genetic algorithm that improves upon the ProteinMPNN autoregressive sequence decoding scheme, and (iii) the explicit approximation of the Pareto front that leads to optimal design candidates representing diverse tradeoff conditions. We anticipate this approach to be readily adaptable to different models and broadly relevant for protein design tasks with complex specifications.
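The Pareto-front idea named in this abstract can be illustrated with a minimal Python sketch. It groups candidate designs into successive non-dominated fronts, which is the ranking step that NSGA-II builds on. This is a simple quadratic-time version written for clarity, not the bookkeeping-based fast sort used in NSGA-II implementations, and it is not the authors' code. The objective values are hypothetical, and the convention that lower is better is an assumption made for the example.

```python
from typing import List, Sequence


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if a is no worse than b on every objective and strictly better on one.

    Assumes every objective is minimized.
    """
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def non_dominated_sort(scores: List[Sequence[float]]) -> List[List[int]]:
    """Group candidate indices into successive Pareto fronts (front 0 is the best)."""
    remaining = set(range(len(scores)))
    fronts = []
    while remaining:
        # A candidate belongs to the current front if nothing left dominates it.
        front = sorted(
            i for i in remaining
            if not any(dominates(scores[j], scores[i]) for j in remaining if j != i)
        )
        fronts.append(front)
        remaining -= set(front)
    return fronts


# Hypothetical two-objective scores for five designs (lower is better on both).
scores = [(0.2, 0.9), (0.4, 0.4), (0.9, 0.1), (0.5, 0.5), (0.95, 0.95)]
fronts = non_dominated_sort(scores)
print(fronts)
```

Designs 0, 1 and 2 each win on a different trade-off and so share the first front, while designs 3 and 4 are each beaten on both objectives by an earlier candidate.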
Affiliation(s)
- Lu Hong
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, United States of America
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California, United States of America
- Quantitative Biosciences Institute, University of California, San Francisco, California, United States of America
- Chan Zuckerberg Biohub, San Francisco, California, United States of America
| |
|
13
|
Huang YJ, Montelione GT. Hidden Structural States of Proteins Revealed by Conformer Selection with AlphaFold-NMR. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.26.600902. [PMID: 38979209 PMCID: PMC11230435 DOI: 10.1101/2024.06.26.600902] [Indexed: 07/10/2024]
Abstract
Recent advances in molecular modeling using deep learning can revolutionize our understanding of dynamic protein structures. NMR is particularly well-suited for determining dynamic features of biomolecular structures. The conventional process for determining biomolecular structures from experimental NMR data involves its representation as conformation-dependent restraints, followed by generation of structural models guided by these spatial restraints. Here we describe an alternative approach: generating a distribution of realistic protein conformational models using artificial intelligence (AI)-based methods and then selecting the sets of conformers that best explain the experimental data. We applied this conformational selection approach to redetermine the solution NMR structure of the enzyme Gaussia luciferase (Gluc). First, we generated a diverse set of conformer models using AlphaFold2 (AF2) with an enhanced sampling protocol. The models that best fit NOESY and chemical shift data were then selected with a Bayesian scoring metric. The resulting models include features of both the published NMR structure and the standard AF2 model generated without enhanced sampling. This "AlphaFold-NMR" protocol also generated an alternative "open" conformational state that fits nearly as well to the overall NMR data but accounts for some NOESY data that are not consistent with the first "closed" conformational state, while other NOESY data consistent with this second state are not consistent with the first conformational state. The structure of this "open" structural state differs from that of the "closed" state primarily by the position of a thumb-shaped loop between α-helices H5 and H6, revealing a cryptic surface pocket. These alternative conformational states of Gluc are supported by "double recall" analysis of NOESY data and AF2 models.
Additional structural states are also indicated by backbone chemical shift data indicating partially-disordered conformations for the C-terminal segment. Considered as a multistate ensemble, these multiple states of Gluc together fit the NOESY and chemical shift data better than the "restraint-based" NMR structure and provide novel insights into its structure-dynamic-function relationships. This study demonstrates the potential of AI-based modeling with enhanced sampling to generate conformational ensembles followed by conformer selection with experimental data as an alternative to conventional restraint satisfaction protocols for protein NMR structure determination.
Affiliation(s)
- Yuanpeng J. Huang
- Dept of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, New York, 12180 USA
| | - Gaetano T. Montelione
- Dept of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, New York, 12180 USA
| |
|
14
|
Raisinghani N, Alshahrani M, Gupta G, Xiao S, Tao P, Verkhivker G. Exploring conformational landscapes and binding mechanisms of convergent evolution for the SARS-CoV-2 spike Omicron variant complexes with the ACE2 receptor using AlphaFold2-based structural ensembles and molecular dynamics simulations. Phys Chem Chem Phys 2024; 26:17720-17744. [PMID: 38869513 DOI: 10.1039/d4cp01372g] [Indexed: 06/14/2024]
Abstract
In this study, we combined AlphaFold-based approaches for atomistic modeling of multiple protein states and microsecond molecular simulations to accurately characterize conformational ensembles evolution and binding mechanisms of convergent evolution for the SARS-CoV-2 spike Omicron variants BA.1, BA.2, BA.2.75, BA.3, BA.4/BA.5 and BQ.1.1. We employed and validated several different adaptations of the AlphaFold methodology for modeling of conformational ensembles including the introduced randomized full sequence scanning for manipulation of sequence variations to systematically explore conformational dynamics of Omicron spike protein complexes with the ACE2 receptor. Microsecond atomistic molecular dynamics (MD) simulations provide a detailed characterization of the conformational landscapes and thermodynamic stability of the Omicron variant complexes. By integrating the predictions of conformational ensembles from different AlphaFold adaptations and applying statistical confidence metrics we can expand characterization of the conformational ensembles and identify functional protein conformations that determine the equilibrium dynamics for the Omicron spike complexes with the ACE2. Conformational ensembles of the Omicron RBD-ACE2 complexes obtained using AlphaFold-based approaches for modeling protein states and MD simulations are employed for accurate comparative prediction of the binding energetics revealing an excellent agreement with the experimental data. In particular, the results demonstrated that AlphaFold-generated extended conformational ensembles can produce accurate binding energies for the Omicron RBD-ACE2 complexes. The results of this study suggested complementarities and potential synergies between AlphaFold predictions of protein conformational ensembles and MD simulations showing that integrating information from both methods can potentially yield a more adequate characterization of the conformational landscapes for the Omicron RBD-ACE2 complexes. 
This study provides insights into the interplay between conformational dynamics and binding, showing that evolution of Omicron variants through acquisition of convergent mutational sites may leverage conformational adaptability and dynamic couplings between key binding energy hotspots to optimize ACE2 binding affinity and enable immune evasion.
Affiliation(s)
- Nishank Raisinghani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, CA 92866, USA.
| | - Mohammed Alshahrani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, CA 92866, USA.
| | - Grace Gupta
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, CA 92866, USA.
| | - Sian Xiao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas, 75275, USA
| | - Peng Tao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas, 75275, USA
| | - Gennady Verkhivker
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, CA 92866, USA.
- Department of Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, Irvine, CA 92618, USA
| |
|
15
|
Raisinghani N, Alshahrani M, Gupta G, Tian H, Xiao S, Tao P, Verkhivker GM. Integration of a Randomized Sequence Scanning Approach in AlphaFold2 and Local Frustration Profiling of Conformational States Enable Interpretable Atomistic Characterization of Conformational Ensembles and Detection of Hidden Allosteric States in the ABL1 Protein Kinase. J Chem Theory Comput 2024; 20:5317-5336. [PMID: 38865109 DOI: 10.1021/acs.jctc.4c00222] [Indexed: 06/13/2024]
Abstract
Despite the success of AlphaFold methods in predicting single protein structures, these methods showed intrinsic limitations in the characterization of multiple functional conformations of allosteric proteins. The recent NMR-based structural determination of the unbound ABL kinase in the active state and discovery of the inactive low-populated functional conformations that are unique for ABL kinase present an ideal challenge for the AlphaFold2 approaches. In the current study, we employ several adaptations of the AlphaFold2 methodology to predict protein conformational ensembles and allosteric states of the ABL kinase including randomized alanine sequence scanning combined with the multiple sequence alignment subsampling proposed in this study. We show that the proposed new AlphaFold2 adaptation combined with local frustration profiling of conformational states enables accurate prediction of the protein kinase structures and conformational ensembles, also offering a robust approach for interpretable characterization of the AlphaFold2 predictions and detection of hidden allosteric states. We found that the large high frustration residue clusters are uniquely characteristic of the low-populated, fully inactive ABL form and can define energetically frustrated cracking sites of conformational transitions, presenting difficult targets for AlphaFold2. The results of this study uncovered previously unappreciated fundamental connections between local frustration profiles of the functional allosteric states and the ability of AlphaFold2 methods to predict protein structural ensembles of the active and inactive states. 
This study showed that integration of the randomized sequence scanning adaptation of AlphaFold2 with a robust landscape-based analysis allows for interpretable atomistic predictions and characterization of protein conformational ensembles, providing a physical basis for the successes and limitations of current AlphaFold2 methods in detecting functional allosteric states that play a significant role in protein kinase regulation.
Affiliation(s)
- Nishank Raisinghani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
| | - Mohammed Alshahrani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
| | - Grace Gupta
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
| | - Hao Tian
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75275, United States
| | - Sian Xiao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75275, United States
| | - Peng Tao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75275, United States
| | - Gennady M Verkhivker
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
- Department of Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, Irvine, California 92618, United States
- Department of Pharmacology, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, 9500 Gilman Drive, La Jolla, California 92093, United States
| |
|
16
|
Pak S, Ryu H, Lim S, Nguyen TL, Yang S, Kang S, Yu YG, Woo J, Kim C, Fenollar-Ferrer C, Wood JN, Lee MO, Hong GS, Han K, Kim TS, Oh U. Tentonin 3 is a pore-forming subunit of a slow inactivation mechanosensitive channel. Cell Rep 2024; 43:114334. [PMID: 38850532 PMCID: PMC11310380 DOI: 10.1016/j.celrep.2024.114334] [Received: 12/13/2023] [Revised: 04/25/2024] [Accepted: 05/23/2024] [Indexed: 06/10/2024] Open
Abstract
Mechanically activating (MA) channels transduce numerous physiological functions. Tentonin 3/TMEM150C (TTN3) confers MA currents with slow inactivation kinetics in somato- and barosensory neurons. However, questions were raised about its role as a Piezo1 regulator and its potential as a channel pore. Here, we demonstrate that purified TTN3 proteins incorporated into the lipid bilayer displayed spontaneous and pressure-sensitive channel currents. These MA currents were conserved across vertebrates and differ from Piezo1 in activation threshold and pharmacological response. Deep neural network structure prediction programs coupled with mutagenetic analysis predicted a rectangular-shaped, tetrameric structure with six transmembrane helices and a pore at the inter-subunit center. The putative pore aligned with two helices of each subunit and had constriction sites whose mutations changed the MA currents. These findings suggest that TTN3 is a pore-forming subunit of a distinct slow inactivation MA channel, potentially possessing a tetrameric structure.
Affiliation(s)
- Sungmin Pak
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Korea; College of Pharmacy, Seoul National University, Seoul 08826, Korea
| | - Hyunil Ryu
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Korea
| | - Sujin Lim
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Korea; Department of Molecular Medicine and Biopharmaceutical Sciences, Graduate School of Convergence Science and Technology, Seoul National University, Seoul 08826, Korea
| | - Thien-Luan Nguyen
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Korea; College of Pharmacy, Seoul National University, Seoul 08826, Korea
| | - Sungwook Yang
- Artificial Intelligence and Robotics Institute, KIST, Seoul 02792, Korea
| | - Sumin Kang
- Department of Chemistry, Kookmin University, Seoul 02707, Korea
| | - Yeon Gyu Yu
- Department of Chemistry, Kookmin University, Seoul 02707, Korea
| | - Junhyuk Woo
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Korea
| | - Chanjin Kim
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Korea
| | - Cristina Fenollar-Ferrer
- Stiles-Nicholson Brain Institute at Florida Atlantic University, Jupiter, FL 33458, USA; Laboratory of Molecular Genetics, NIDCD, NIH, Bethesda, MD 20892, USA
| | - John N Wood
- Molecular Nociception Group, Wolfson Institute for Biomedical Research, University College London, London WC1E 6BT, UK
| | - Mi-Ock Lee
- College of Pharmacy, Seoul National University, Seoul 08826, Korea
| | - Gyu-Sang Hong
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Korea; Division of Bio-Medical Science & Technology, KIST School, University of Science and Technology, Seoul 02792, Korea; Department of Molecular Medicine and Biopharmaceutical Sciences, Graduate School of Convergence Science and Technology, Seoul National University, Seoul 08826, Korea.
| | - Kyungreem Han
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Korea; Division of Bio-Medical Science & Technology, KIST School, University of Science and Technology, Seoul 02792, Korea.
| | - Tae Song Kim
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Korea.
| | - Uhtaek Oh
- Brain Science Institute, Korea Institute of Science and Technology (KIST), Seoul 02792, Korea; Department of Molecular Medicine and Biopharmaceutical Sciences, Graduate School of Convergence Science and Technology, Seoul National University, Seoul 08826, Korea.
| |
|
17
|
Duran C, Casadevall G, Osuna S. Harnessing conformational dynamics in enzyme catalysis to achieve nature-like catalytic efficiencies: the shortest path map tool for computational enzyme redesign. Faraday Discuss 2024. [PMID: 38910409 DOI: 10.1039/d3fd00156c] [Indexed: 06/25/2024]
Abstract
Enzymes exhibit diverse conformations, as represented in the free energy landscape (FEL). Such conformational diversity provides enzymes with the ability to evolve towards novel functions. The challenge lies in identifying mutations that enhance specific conformational changes, especially if located at sites distal from the active site cavity. The shortest path map (SPM) method, which we developed to address this challenge, constructs a graph based on the distances and correlated motions of residues observed in nanosecond timescale molecular dynamics (MD) simulations. We recently introduced a template-based AlphaFold2 (tAF2) approach coupled with 10 nanosecond MD simulations to quickly estimate the conformational landscape of enzymes and assess how the FEL is shifted after mutation. In this study, we evaluate the potential of SPM when coupled with tAF2-MD in estimating conformational heterogeneity and identifying key conformationally relevant positions. The selected model system is the beta subunit of tryptophan synthase (TrpB). We compare how the SPM pathways differ when integrating tAF2 with different MD simulation lengths, from as short as 10 ns up to 50 ns, and considering two distinct Amber force field and water model combinations (ff14SB/TIP3P versus ff19SB/OPC). The new methodology can more effectively capture the distal mutations found in laboratory evolution, thus showcasing the efficacy of tAF2-MD-SPM in rapidly estimating enzyme dynamics and identifying the key conformationally relevant hotspots for computational enzyme engineering.
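The graph construction described in this abstract can be sketched in Python as follows. This is a sketch under stated assumptions, not the authors' SPM tool. The abstract says only that the graph is built from residue distances and correlated motions, so the 6.0 distance cutoff, the -log|correlation| edge weight, and the four-residue toy matrices are all hypothetical choices made for illustration. The sketch links residues that are close in space, makes strongly correlated pairs cheap to traverse, and counts how often each edge lies on a shortest path between any two residues.

```python
import heapq
import math
from collections import Counter


def build_graph(dist, corr, cutoff=6.0):
    """Connect residue pairs within `cutoff`; weight = -log|correlation| (assumed)."""
    n = len(dist)
    graph = {i: {} for i in range(n)}
    for i in range(n):
        for j in range(i + 1, n):
            c = min(abs(corr[i][j]), 1.0)
            if dist[i][j] <= cutoff and c > 0.0:
                w = -math.log(c)  # strongly correlated pairs get short edges
                graph[i][j] = w
                graph[j][i] = w
    return graph


def shortest_path(graph, src, dst):
    """Dijkstra's algorithm; returns the node list from src to dst, or [] if unreachable."""
    best = {src: 0.0}
    prev = {}
    heap = [(0.0, src)]
    while heap:
        d, u = heapq.heappop(heap)
        if u == dst:
            break
        if d > best.get(u, math.inf):
            continue  # stale heap entry
        for v, w in graph[u].items():
            nd = d + w
            if nd < best.get(v, math.inf):
                best[v] = nd
                prev[v] = u
                heapq.heappush(heap, (nd, v))
    if dst not in best:
        return []
    path = [dst]
    while path[-1] != src:
        path.append(prev[path[-1]])
    return path[::-1]


def edge_usage(graph):
    """Count how many all-pairs shortest paths pass through each edge."""
    usage = Counter()
    nodes = sorted(graph)
    for a in nodes:
        for b in nodes:
            if a < b:
                path = shortest_path(graph, a, b)
                for u, v in zip(path, path[1:]):
                    usage[(min(u, v), max(u, v))] += 1
    return usage


# Toy 4-residue chain: hypothetical mean distances and correlation values.
dist = [[0, 4, 7, 10], [4, 0, 4, 7], [7, 4, 0, 4], [10, 7, 4, 0]]
corr = [[1.0, 0.9, 0.5, 0.5], [0.9, 1.0, 0.8, 0.5],
        [0.5, 0.8, 1.0, 0.9], [0.5, 0.5, 0.9, 1.0]]
usage = edge_usage(build_graph(dist, corr))
print(usage.most_common())
```

In this toy chain the central edge between residues 1 and 2 carries the most shortest paths, which is the kind of signal that would flag a position as conformationally relevant.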
Affiliation(s)
- Cristina Duran
- Departament de Química, Institut de Química Computacional i Catàlisi, Universitat de Girona, c/Maria Aurèlia Capmany 69, 17003, Girona, Spain.
| | - Guillem Casadevall
- Departament de Química, Institut de Química Computacional i Catàlisi, Universitat de Girona, c/Maria Aurèlia Capmany 69, 17003, Girona, Spain.
| | - Sílvia Osuna
- Departament de Química, Institut de Química Computacional i Catàlisi, Universitat de Girona, c/Maria Aurèlia Capmany 69, 17003, Girona, Spain.
- ICREA, Pg. Lluís Companys 23, 08010, Barcelona, Spain
| |
|
18
|
Agarwal V, McShan AC. The power and pitfalls of AlphaFold2 for structure prediction beyond rigid globular proteins. Nat Chem Biol 2024:10.1038/s41589-024-01638-w. [PMID: 38907110 DOI: 10.1038/s41589-024-01638-w] [Received: 05/22/2023] [Accepted: 04/29/2024] [Indexed: 06/23/2024]
Abstract
Artificial intelligence-driven advances in protein structure prediction in recent years have raised the question: has the protein structure-prediction problem been solved? Here, with a focus on nonglobular proteins, we highlight the many strengths and potential weaknesses of DeepMind's AlphaFold2 in the context of its biological and therapeutic applications. We summarize the subtleties associated with evaluation of AlphaFold2 model quality and reliability using the predicted local distance difference test (pLDDT) and predicted aligned error (PAE) values. We highlight various classes of proteins that AlphaFold2 can be applied to and the caveats involved. Concrete examples of how AlphaFold2 models can be integrated with experimental data in the form of small-angle X-ray scattering (SAXS), solution NMR, cryo-electron microscopy (cryo-EM) and X-ray diffraction are discussed. Finally, we highlight the need to move beyond structure prediction of rigid, static structural snapshots toward conformational ensembles and alternate biologically relevant states. The overarching theme is that careful consideration is due when using AlphaFold2-generated models to generate testable hypotheses and structural models, rather than treating predicted models as de facto ground truth structures.
Affiliation(s)
- Vinayak Agarwal
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, USA.
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA.
| | - Andrew C McShan
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, USA.
| |
|
19
|
Sundaraswamy PM, Minami Y, Jayaprakash J, B Gowda SG, Takatsu H, Gowda D, Shin HW, Hui SP. A facile method for monitoring sphingomyelin synthase activity in HeLa cells using liquid chromatography/mass spectrometry. Analyst 2024; 149:3293-3301. [PMID: 38713069 DOI: 10.1039/d4an00304g] [Indexed: 05/08/2024]
Abstract
Sphingomyelin synthase (SMS) is a sphingolipid-metabolizing enzyme involved in the de novo synthesis of sphingomyelin (SM) from ceramide (Cer). Recent studies have indicated that SMS is a key therapeutic target for metabolic diseases such as fatty liver, type 2 diabetes, atherosclerosis, and colorectal cancer. However, very few SMS inhibitors have been identified because of the limited sensitivity and selectivity of the current fluorescence-based screening assay. In this study, we developed a simple cell-based assay coupled with liquid chromatography/tandem mass spectrometry (LC-MS/MS) to screen for SMS inhibitors. HeLa cells stably expressing SMS1 or SMS2 were used for the screening. A non-fluorescent unnatural C6-Cer was used as a substrate for SMS to produce C6-SM. C6-Cer and C6-SM levels in the cells were monitored and quantified using LC-MS/MS. The activity of ginkgolic acid C15:1 (GA), a known SMS inhibitor, was measured. GA had half-maximal inhibitory concentrations of 5.5 μM and 3.6 μM for SMS1 and SMS2, respectively. To validate these findings, hSMS1 and hSMS2 proteins were optimized for molecular docking studies. In silico analyses were conducted to assess the interaction of GA with SMS1 and SMS2, and its binding affinity. This study offers an analytical approach for screening novel SMS inhibitors and provides in silico support for the experimental findings.
Affiliation(s)
- Punith M Sundaraswamy
- Graduate School of Global Food Resources, Hokkaido University, Kita-9, Nishi-9, Kita-Ku, Sapporo 060-0809, Japan.
| | - Yusuke Minami
- Graduate School of Health Sciences, Hokkaido University, Kita-12, Nishi-5, Kita-ku, Sapporo 060-0812, Japan
| | - Jayashankar Jayaprakash
- Graduate School of Global Food Resources, Hokkaido University, Kita-9, Nishi-9, Kita-Ku, Sapporo 060-0809, Japan.
| | - Siddabasave Gowda B Gowda
- Graduate School of Global Food Resources, Hokkaido University, Kita-9, Nishi-9, Kita-Ku, Sapporo 060-0809, Japan.
- Faculty of Health Sciences, Hokkaido University, Kita-12, Nishi-5, Kita-ku, Sapporo 060-0812, Japan.
| | - Hiroyuki Takatsu
- Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto 606-8501, Japan
| | - Divyavani Gowda
- Faculty of Health Sciences, Hokkaido University, Kita-12, Nishi-5, Kita-ku, Sapporo 060-0812, Japan.
| | - Hye-Won Shin
- Graduate School of Pharmaceutical Sciences, Kyoto University, Kyoto 606-8501, Japan
| | - Shu-Ping Hui
- Faculty of Health Sciences, Hokkaido University, Kita-12, Nishi-5, Kita-ku, Sapporo 060-0812, Japan.
| |
|
20
|
Urvas L, Chiesa L, Bret G, Jacquemard C, Kellenberger E. Benchmarking AlphaFold-Generated Structures of Chemokine-Chemokine Receptor Complexes. J Chem Inf Model 2024; 64:4587-4600. [PMID: 38809680 DOI: 10.1021/acs.jcim.3c01835] [Indexed: 05/31/2024]
Abstract
AlphaFold and AlphaFold-Multimer have become two essential tools for the modeling of unknown structures of proteins and protein complexes. In this work, we extensively benchmarked the quality of chemokine-chemokine receptor structures generated by AlphaFold-Multimer against experimentally determined structures. Our analysis considered both the global quality of the model, as well as key structural features for chemokine recognition. To study the effects of template and multiple sequence alignment parameters on the results, a new prediction pipeline called LIT-AlphaFold (https://github.com/LIT-CCM-lab/LIT-AlphaFold) was developed, allowing extensive input customization. AlphaFold-Multimer correctly predicted differences in chemokine binding orientation and accurately reproduced the unique binding orientation of the CXCL12-ACKR3 complex. Further, the predictions of the full receptor N-terminus provided insights into a putative chemokine recognition site 0.5. The accuracy of chemokine N-terminus binding mode prediction varied between complexes, but the confidence score permitted the distinguishing of residues that were very likely well positioned. Finally, we generated a high-confidence model of the unsolved CXCL12-CXCR4 complex, which agreed with experimental mutagenesis and cross-linking data.
Affiliation(s)
- Lauri Urvas
- Laboratoire d'Innovation Thérapeutique, UMR 7200 CNRS, Université de Strasbourg, 67400 Illkirch, France
| | - Luca Chiesa
- Laboratoire d'Innovation Thérapeutique, UMR 7200 CNRS, Université de Strasbourg, 67400 Illkirch, France
| | - Guillaume Bret
- Laboratoire d'Innovation Thérapeutique, UMR 7200 CNRS, Université de Strasbourg, 67400 Illkirch, France
| | - Célien Jacquemard
- Laboratoire d'Innovation Thérapeutique, UMR 7200 CNRS, Université de Strasbourg, 67400 Illkirch, France
| | - Esther Kellenberger
- Laboratoire d'Innovation Thérapeutique, UMR 7200 CNRS, Université de Strasbourg, 67400 Illkirch, France
| |
|
21
|
Hetmann M, Parigger L, Sirelkhatim H, Stern A, Krassnigg A, Gruber K, Steinkellner G, Ruau D, Gruber CC. Folding the human proteome using BioNeMo: A fused dataset of structural models for machine learning purposes. Sci Data 2024; 11:591. [PMID: 38844754 PMCID: PMC11156891 DOI: 10.1038/s41597-024-03403-z] [Received: 12/05/2023] [Accepted: 05/22/2024] [Indexed: 06/09/2024] Open
Abstract
Human proteins are crucial players in both health and disease. Understanding their molecular landscape is a central topic in biological research. Here, we present an extensive dataset of predicted protein structures for 42,042 distinct human proteins, including splicing variants, derived from the UniProt reference proteome UP000005640. To ensure high quality and comparability, the dataset was generated by combining the state-of-the-art modeling tools AlphaFold 2, OpenFold, and ESMFold, provided within NVIDIA's BioNeMo platform, as well as homology modeling using Innophore's CavitomiX platform. Our dataset is offered in both unedited and edited formats for diverse research requirements. The unedited version contains structures as generated by the different prediction methods, whereas the edited version contains refinements, including a dataset of structures without low prediction-confidence regions and structures in complex with predicted ligands based on homologs in the PDB. We are confident that this dataset represents the most comprehensive collection of human protein structures available today, facilitating diverse applications such as structure-based drug design and the prediction of protein function and interactions.
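Removing low prediction-confidence regions, as mentioned for the edited dataset, might look like the following Python sketch. It relies on the fact that AlphaFold writes the per-residue pLDDT score into the B-factor column of its PDB output. The threshold of 70 is an assumed value, since the abstract does not state the dataset's actual criterion, and `make_ca_line` is a hypothetical helper that exists only to build column-aligned toy input. Whether other predictors use the same column and the same 0 to 100 scale should be checked per tool.

```python
def filter_by_plddt(pdb_text: str, threshold: float = 70.0) -> str:
    """Drop ATOM/HETATM records whose B-factor field (read as pLDDT) is below threshold."""
    kept = []
    for line in pdb_text.splitlines():
        if line.startswith(("ATOM", "HETATM")):
            # PDB fixed-width format: the temperature factor occupies columns 61-66.
            if float(line[60:66]) < threshold:
                continue
        kept.append(line)
    return "\n".join(kept)


def make_ca_line(serial: int, resseq: int, plddt: float) -> str:
    """Hypothetical helper: build a column-aligned CA record for the toy input."""
    return (f"ATOM  {serial:5d}  CA  ALA A{resseq:4d}    "
            f"{0.0:8.3f}{0.0:8.3f}{0.0:8.3f}{1.0:6.2f}{plddt:6.2f}")


# Three toy residues with made-up pLDDT values; the middle one is low confidence.
toy = "\n".join(make_ca_line(i, i, p) for i, p in enumerate([92.5, 45.0, 71.3], start=1))
filtered = filter_by_plddt(toy)
kept_residues = [int(line[22:26]) for line in filtered.splitlines()]
print(kept_residues)
```

Because the same pLDDT value is written on every atom of a residue, filtering atom records in this way removes whole residues.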
|
22
|
Jiang T, Wan G, Zhang H, Gyawali YP, Underbakke ES, Feng C. Mapping the Intersubunit Interdomain FMN-Heme Interactions in Neuronal Nitric Oxide Synthase by Targeted Quantitative Cross-Linking Mass Spectrometry. Biochemistry 2024; 63:1395-1411. [PMID: 38747545 DOI: 10.1021/acs.biochem.4c00157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/29/2024]
Abstract
Nitric oxide synthase (NOS) in mammals is a family of multidomain proteins in which interdomain electron transfer (IET) is controlled by domain-domain interactions. Calmodulin (CaM) binds to the canonical CaM-binding site in the linker region between the FMN and heme domains of NOS and allows tethered FMN domain motions, enabling an intersubunit FMN-heme IET in the output state for NO production. Our previous cross-linking mass spectrometric (XL MS) results demonstrated site-specific protein dynamics in the CaM-responsive regions of rat neuronal NOS (nNOS) reductase construct, a monomeric protein [Jiang et al., Biochemistry, 2023, 62, 2232-2237]. In this work, we have extended our combined approach of XL MS structural mapping and AlphaFold structural prediction to examine the homodimeric nNOS oxygenase/FMN (oxyFMN) construct, an established model of the NOS output state. We employed parallel reaction monitoring (PRM) based quantitative XL MS (qXL MS) to assess the CaM-induced changes in interdomain dynamics and interactions. Intersubunit cross-links were identified by mapping the cross-links onto top AlphaFold structural models, which was complemented by comparing their relative abundances in the cross-linked dimeric and monomeric bands. Furthermore, contrasting the CaM-free and CaM-bound nNOS samples shows that CaM enables the formation of the intersubunit FMN-heme docking complex and that CaM binding induces extensive, allosteric conformational changes across the NOS regions. Moreover, the observed cross-link sites specifically respond to changes in ionic strength. This indicates that interdomain salt bridges are responsible for stabilizing and orienting the output state for efficient FMN-heme IET. Taken together, our targeted qXL MS results have revealed that CaM and ionic strength modulate specific dynamic changes in the CaM/FMN/heme complexes, particularly in the context of intersubunit interdomain FMN-heme interactions.
Affiliation(s)
- Ting Jiang
- Department of Pharmaceutical Sciences, College of Pharmacy, University of New Mexico, Albuquerque, New Mexico 87131, United States
| | - Guanghua Wan
- Department of Pharmaceutical Sciences, College of Pharmacy, University of New Mexico, Albuquerque, New Mexico 87131, United States
| | - Haikun Zhang
- Department of Pharmaceutical Sciences, College of Pharmacy, University of New Mexico, Albuquerque, New Mexico 87131, United States
| | - Yadav Prasad Gyawali
- Department of Pharmaceutical Sciences, College of Pharmacy, University of New Mexico, Albuquerque, New Mexico 87131, United States
| | - Eric S Underbakke
- Roy J. Carver Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, Iowa 50011, United States
| | - Changjian Feng
- Department of Pharmaceutical Sciences, College of Pharmacy, University of New Mexico, Albuquerque, New Mexico 87131, United States
- Department of Chemistry and Chemical Biology, University of New Mexico, Albuquerque, New Mexico 87131, United States
|
23
|
Dahlström KM, Salminen TA. Apprehensions and emerging solutions in ML-based protein structure prediction. Curr Opin Struct Biol 2024; 86:102819. [PMID: 38631107 DOI: 10.1016/j.sbi.2024.102819] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/19/2024] [Revised: 03/05/2024] [Accepted: 03/31/2024] [Indexed: 04/19/2024]
Abstract
The three-dimensional structure of proteins determines their function in vital biological processes. Thus, when the structure is known, the molecular mechanism of protein function can be understood in more detail and obtained information utilized in biotechnological, diagnostics, and therapeutic applications. Over the past five years, machine learning (ML)-based modeling has pushed protein structure prediction to the next level with AlphaFold in the front line, predicting the structure for hundreds of millions of proteins. Further advances recently report promising ML-based approaches for solving remaining challenges by incorporating functionally important metals, co-factors, post-translational modifications, structural dynamics, and interdomain and multimer interactions in the structure prediction process.
Affiliation(s)
- Käthe M Dahlström
- Structural Bioinformatics Laboratory, Biochemistry, Faculty of Science and Engineering, Åbo Akademi University, Tykistökatu 6A, 20520 Turku, Finland; InFLAMES Research Flagship Center, Åbo Akademi University, 20520 Turku, Finland
| | - Tiina A Salminen
- Structural Bioinformatics Laboratory, Biochemistry, Faculty of Science and Engineering, Åbo Akademi University, Tykistökatu 6A, 20520 Turku, Finland; InFLAMES Research Flagship Center, Åbo Akademi University, 20520 Turku, Finland.
|
24
|
Wang L, Wen Z, Liu SW, Zhang L, Finley C, Lee HJ, Fan HJS. Overview of AlphaFold2 and breakthroughs in overcoming its limitations. Comput Biol Med 2024; 176:108620. [PMID: 38761500 DOI: 10.1016/j.compbiomed.2024.108620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/29/2023] [Revised: 05/01/2024] [Accepted: 05/14/2024] [Indexed: 05/20/2024]
Abstract
Predicting three-dimensional (3D) protein structures has been challenging for decades. The emergence of AlphaFold2 (AF2), a deep learning-based machine learning method developed by DeepMind, became a game changer in the protein folding community. AF2 can predict a protein's three-dimensional structure with high confidence based on its amino acid sequence. Accurate prediction of protein structures can dramatically accelerate our understanding of biological mechanisms and provide a solid foundation for reliable drug design. Although AF2 breaks through the barriers in predicting protein structures, much room remains for further study. This review provides a brief historical overview of the development of protein structure prediction, covering template-based, template-free, and machine learning-based methods. In addition to reviewing the potential benefits (Pros) and considerations (Cons) of using AF2, this review summarizes the diverse applications, including protein structure predictions, dynamic changes, point mutation, integration of language model and experimental data, protein complex, and protein-peptide interaction. It underscores recent advancements in efficiency, reliability, and broad application of AF2. This comprehensive review offers valuable insights into the applications of AF2 and AF2-inspired AI methods in structural biology and its potential for clinically significant drug target discovery.
Affiliation(s)
- Lei Wang
- College of Chemical Engineering, Sichuan University of Science and Engineering, Zigong City, Sichuan Province, 64300, China
| | - Zehua Wen
- College of Chemical Engineering, Sichuan University of Science and Engineering, Zigong City, Sichuan Province, 64300, China
| | - Shi-Wei Liu
- College of Chemical Engineering, Sichuan University of Science and Engineering, Zigong City, Sichuan Province, 64300, China
| | - Lihong Zhang
- Digestive Department, Binhai New Area Hospital of TCM Tianjin, Tianjin, 300451, China
| | - Cierra Finley
- Department of Natural Sciences, Southwest Tennessee Community College, Memphis, TN, 38015, USA
| | - Ho-Jin Lee
- Department of Natural Sciences, Southwest Tennessee Community College, Memphis, TN, 38015, USA; Division of Natural & Mathematical Sciences, LeMoyne-Owen College, Memphis, TN, 38126, USA.
| | - Hua-Jun Shawn Fan
- College of Chemical Engineering, Sichuan University of Science and Engineering, Zigong City, Sichuan Province, 64300, China.
|
25
|
Middendorf L, Eicholt LA. Random, de novo, and conserved proteins: How structure and disorder predictors perform differently. Proteins 2024; 92:757-767. [PMID: 38226524 DOI: 10.1002/prot.26652] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/18/2023] [Revised: 10/18/2023] [Accepted: 12/01/2023] [Indexed: 01/17/2024]
Abstract
Understanding the emergence and structural characteristics of de novo and random proteins is crucial for unraveling protein evolution and designing novel enzymes. However, experimental determination of their structures remains challenging. Recent advancements in protein structure prediction, particularly with AlphaFold2 (AF2), have expanded our knowledge of protein structures, but their applicability to de novo and random proteins is unclear. In this study, we investigate the structural predictions and confidence scores of AF2 and protein language model-based predictor ESMFold for de novo and conserved proteins from Drosophila and a dataset of comparable random proteins. We find that the structural predictions for de novo and random proteins differ significantly from conserved proteins. Interestingly, a positive correlation between disorder and confidence scores (pLDDT) is observed for de novo and random proteins, in contrast to the negative correlation observed for conserved proteins. Furthermore, the performance of structure predictors for de novo and random proteins is hampered by the lack of sequence identity. We also observe fluctuating median predicted disorder among different sequence length quartiles for random proteins, suggesting an influence of sequence length on disorder predictions. In conclusion, while structure predictors provide initial insights into the structural composition of de novo and random proteins, their accuracy and applicability to such proteins remain limited. Experimental determination of their structures is necessary for a comprehensive understanding. The positive correlation between disorder and pLDDT could imply a potential for conditional folding and transient binding interactions of de novo and random proteins.
Affiliation(s)
- Lasse Middendorf
- Institute for Evolution and Biodiversity, University of Muenster, Muenster, Germany
| | - Lars A Eicholt
- Institute for Evolution and Biodiversity, University of Muenster, Muenster, Germany
|
26
|
Abramson J, Adler J, Dunger J, Evans R, Green T, Pritzel A, Ronneberger O, Willmore L, Ballard AJ, Bambrick J, Bodenstein SW, Evans DA, Hung CC, O'Neill M, Reiman D, Tunyasuvunakool K, Wu Z, Žemgulytė A, Arvaniti E, Beattie C, Bertolli O, Bridgland A, Cherepanov A, Congreve M, Cowen-Rivers AI, Cowie A, Figurnov M, Fuchs FB, Gladman H, Jain R, Khan YA, Low CMR, Perlin K, Potapenko A, Savy P, Singh S, Stecula A, Thillaisundaram A, Tong C, Yakneen S, Zhong ED, Zielinski M, Žídek A, Bapst V, Kohli P, Jaderberg M, Hassabis D, Jumper JM. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 2024; 630:493-500. [PMID: 38718835 PMCID: PMC11168924 DOI: 10.1038/s41586-024-07487-w] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Received: 12/19/2023] [Accepted: 04/29/2024] [Indexed: 06/13/2024]
Abstract
The introduction of AlphaFold 2 [1] has spurred a revolution in modelling the structure of proteins and their interactions, enabling a huge range of applications in protein modelling and design [2-6]. Here we describe our AlphaFold 3 model with a substantially updated diffusion-based architecture that is capable of predicting the joint structure of complexes including proteins, nucleic acids, small molecules, ions and modified residues. The new AlphaFold model demonstrates substantially improved accuracy over many previous specialized tools: far greater accuracy for protein-ligand interactions compared with state-of-the-art docking tools, much higher accuracy for protein-nucleic acid interactions compared with nucleic-acid-specific predictors and substantially higher antibody-antigen prediction accuracy compared with AlphaFold-Multimer v.2.3 [7,8]. Together, these results show that high-accuracy modelling across biomolecular space is possible within a single unified deep-learning framework.
Affiliation(s)
- Jonas Adler
- Core Contributor, Google DeepMind, London, UK
| | - Jack Dunger
- Core Contributor, Google DeepMind, London, UK
| | - Tim Green
- Core Contributor, Google DeepMind, London, UK
| | - Zachary Wu
- Core Contributor, Google DeepMind, London, UK
| | - Yousuf A Khan
- Google DeepMind, London, UK
- Department of Molecular and Cellular Physiology, Stanford University, Stanford, CA, USA
| | - Ellen D Zhong
- Google DeepMind, London, UK
- Department of Computer Science, Princeton University, Princeton, NJ, USA
| | - Demis Hassabis
- Core Contributor, Google DeepMind, London, UK.
- Core Contributor, Isomorphic Labs, London, UK.
|
27
|
Raisinghani N, Alshahrani M, Gupta G, Tian H, Xiao S, Tao P, Verkhivker G. Prediction of Conformational Ensembles and Structural Effects of State-Switching Allosteric Mutants in the Protein Kinases Using Comparative Analysis of AlphaFold2 Adaptations with Sequence Masking and Shallow Subsampling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.17.594786. [PMID: 38798650 PMCID: PMC11118581 DOI: 10.1101/2024.05.17.594786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/29/2024]
Abstract
Despite the success of AlphaFold2 approaches in predicting single protein structures, these methods showed intrinsic limitations in predicting multiple functional conformations of allosteric proteins and have been challenged to accurately capture the effects of single point mutations that induced significant structural changes. We systematically examined several implementations of AlphaFold2 methods to predict conformational ensembles for state-switching mutants of the ABL kinase. The results revealed that a combination of randomized alanine sequence masking with shallow multiple sequence alignment subsampling can significantly expand the conformational diversity of the predicted structural ensembles and capture shifts in populations of the active and inactive ABL states. Consistent with the NMR experiments, the predicted conformational ensembles for M309L/L320I and M309L/H415P ABL mutants that perturb the regulatory spine networks featured the increased population of the fully closed inactive state. On the other hand, the predicted conformational ensembles for the G269E/M309L/T334I and M309L/L320I/T334I triple ABL mutants that share activating T334I gate-keeper substitution are dominated by the active ABL form. The proposed adaptation of AlphaFold can reproduce the experimentally observed mutation-induced redistributions in the relative populations of the active and inactive ABL states and capture the effects of regulatory mutations on allosteric structural rearrangements of the kinase domain. The ensemble-based network analysis complemented AlphaFold predictions by revealing allosteric mediating centers that often directly correspond to state-switching mutational sites or reside in their immediate local structural proximity, which may explain the global effect of regulatory mutations on structural changes between the ABL states.
This study suggested that attention-based learning of long-range dependencies between sequence positions in homologous folds and deciphering patterns of allosteric interactions may further augment the predictive abilities of AlphaFold methods for modeling of alternative protein states, conformational ensembles and mutation-induced structural transformations.
|
28
|
Yee SW, Macdonald CB, Mitrovic D, Zhou X, Koleske ML, Yang J, Buitrago Silva D, Rockefeller Grimes P, Trinidad DD, More SS, Kachuri L, Witte JS, Delemotte L, Giacomini KM, Coyote-Maestas W. The full spectrum of SLC22 OCT1 mutations illuminates the bridge between drug transporter biophysics and pharmacogenomics. Mol Cell 2024; 84:1932-1947.e10. [PMID: 38703769 DOI: 10.1016/j.molcel.2024.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/11/2023] [Revised: 01/04/2024] [Accepted: 04/15/2024] [Indexed: 05/06/2024]
Abstract
Mutations in transporters can impact an individual's response to drugs and cause many diseases. Few variants in transporters have been evaluated for their functional impact. Here, we combine saturation mutagenesis and multi-phenotypic screening to dissect the impact of 11,213 missense, single-amino-acid deletion, and synonymous variants across the 554 residues of OCT1, a key liver xenobiotic transporter. By quantifying in parallel expression and substrate uptake, we find that most variants exert their primary effect on protein abundance, a phenotype not commonly measured alongside function. Using our mutagenesis results combined with structure prediction and molecular dynamic simulations, we develop accurate structure-function models of the entire transport cycle, providing biophysical characterization of all known and possible human OCT1 polymorphisms. This work provides a complete functional map of OCT1 variants along with a framework for integrating functional genomics, biophysical modeling, and human genetics to predict variant effects on disease and drug efficacy.
Affiliation(s)
- Sook Wah Yee
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Christian B Macdonald
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Darko Mitrovic
- Science for Life Laboratory, Department of Applied Physics, KTH Royal Institute of Technology, 12121 Solna, Stockholm, Stockholm County 114 28, Sweden
| | - Xujia Zhou
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Megan L Koleske
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Jia Yang
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Dina Buitrago Silva
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Patrick Rockefeller Grimes
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Donovan D Trinidad
- Department of Medicine, Division of Infectious Disease, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Swati S More
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Linda Kachuri
- Department of Epidemiology and Population Health, Stanford University, Stanford, CA 94305, USA; Stanford Cancer Institute, Stanford University, Stanford, CA 94305, USA
| | - John S Witte
- Department of Epidemiology and Population Health, Stanford University, Stanford, CA 94305, USA; Stanford Cancer Institute, Stanford University, Stanford, CA 94305, USA
| | - Lucie Delemotte
- Science for Life Laboratory, Department of Applied Physics, KTH Royal Institute of Technology, 12121 Solna, Stockholm, Stockholm County 114 28, Sweden.
| | - Kathleen M Giacomini
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA.
| | - Willow Coyote-Maestas
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA; Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94143, USA; Chan Zuckerberg Biohub, San Francisco, CA 94148, USA.
|
29
|
Raisinghani N, Alshahrani M, Gupta G, Xiao S, Tao P, Verkhivker G. AlphaFold2 Predictions of Conformational Ensembles and Atomistic Simulations of the SARS-CoV-2 Spike XBB Lineages Reveal Epistatic Couplings between Convergent Mutational Hotspots that Control ACE2 Affinity. J Phys Chem B 2024; 128:4696-4715. [PMID: 38696745 DOI: 10.1021/acs.jpcb.4c01341] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/04/2024]
Abstract
In this study, we combined AlphaFold-based atomistic structural modeling, microsecond molecular simulations, mutational profiling, and network analysis to characterize binding mechanisms of the SARS-CoV-2 spike protein with the host receptor ACE2 for a series of Omicron XBB variants including XBB.1.5, XBB.1.5+L455F, XBB.1.5+F456L, and XBB.1.5+L455F+F456L. AlphaFold-based structural and dynamic modeling of SARS-CoV-2 Spike XBB lineages can accurately predict the experimental structures and characterize conformational ensembles of the spike protein complexes with the ACE2. Microsecond molecular dynamics simulations identified important differences in the conformational landscapes and equilibrium ensembles of the XBB variants, suggesting that combining AlphaFold predictions of multiple conformations with molecular dynamics simulations can provide a complementary approach for the characterization of functional protein states and binding mechanisms. Using the ensemble-based mutational profiling of protein residues and physics-based rigorous calculations of binding affinities, we identified binding energy hotspots and characterized the molecular basis underlying epistatic couplings between convergent mutational hotspots. Consistent with the experiments, the results revealed the mediating role of the Q493 hotspot in the synchronization of epistatic couplings between L455F and F456L mutations, providing a quantitative insight into the energetic determinants underlying binding differences between XBB lineages. We also proposed a network-based perturbation approach for mutational profiling of allosteric communications and uncovered the important relationships between allosteric centers mediating long-range communication and binding hotspots of epistatic couplings. The results of this study support a mechanism in which the binding mechanisms of the XBB variants may be determined by epistatic effects between convergent evolutionary hotspots that control ACE2 binding.
Affiliation(s)
- Nishank Raisinghani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
| | - Mohammed Alshahrani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
| | - Grace Gupta
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
| | - Sian Xiao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75275, United States
| | - Peng Tao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75275, United States
| | - Gennady Verkhivker
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
- Department of Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, Irvine, California 92618, United States
|
30
|
Ellaway JIJ, Anyango S, Nair S, Zaki HA, Nadzirin N, Powell HR, Gutmanas A, Varadi M, Velankar S. Identifying protein conformational states in the Protein Data Bank: Toward unlocking the potential of integrative dynamics studies. STRUCTURAL DYNAMICS (MELVILLE, N.Y.) 2024; 11:034701. [PMID: 38774441 PMCID: PMC11106648 DOI: 10.1063/4.0000251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/07/2024] [Accepted: 05/08/2024] [Indexed: 05/24/2024]
Abstract
Studying protein dynamics and conformational heterogeneity is crucial for understanding biomolecular systems and treating disease. Despite the deposition of over 215 000 macromolecular structures in the Protein Data Bank and the advent of AI-based structure prediction tools such as AlphaFold2, RoseTTAFold, and ESMFold, static representations are typically produced, which fail to fully capture macromolecular motion. Here, we discuss the importance of integrating experimental structures with computational clustering to explore the conformational landscapes that manifest protein function. We describe the method developed by the Protein Data Bank in Europe - Knowledge Base to identify distinct conformational states, demonstrate the resource's primary use cases, through examples, and discuss the need for further efforts to annotate protein conformations with functional information. Such initiatives will be crucial in unlocking the potential of protein dynamics data, expediting drug discovery research, and deepening our understanding of macromolecular mechanisms.
Affiliation(s)
- Joseph I. J. Ellaway
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Stephen Anyango
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Sreenath Nair
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Hossam A. Zaki
- The Warren Alpert Medical School of Brown University, Providence, Rhode Island 02903, USA
| | - Nurul Nadzirin
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Harold R. Powell
- Imperial College London, Department of Life Sciences, London, United Kingdom
| | - Aleksandras Gutmanas
- WaveBreak Therapeutics Ltd., Clarendon House, Clarendon Road, Cambridge, United Kingdom
| | - Mihaly Varadi
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Sameer Velankar
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
|
31
|
Xie T, Huang J. Can Protein Structure Prediction Methods Capture Alternative Conformations of Membrane Transporters? J Chem Inf Model 2024; 64:3524-3536. [PMID: 38564295 DOI: 10.1021/acs.jcim.3c01936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/04/2024]
Abstract
Understanding the conformational dynamics of proteins, such as the inward-facing (IF) and outward-facing (OF) transition observed in transporters, is vital for elucidating their functional mechanisms. Despite significant advances in protein structure prediction (PSP) over the past three decades, most efforts have been focused on single-state prediction, leaving multistate or alternative conformation prediction (ACP) relatively unexplored. This discrepancy has led to the development of highly accurate PSP methods such as AlphaFold, yet their capabilities for ACP remain limited. To investigate the performance of current PSP methods in ACP, we curated a data set, named IOMemP, consisting of 32 experimentally determined high-resolution IF and OF structures of 16 membrane proteins with substantial conformational changes. We benchmarked 12 representative PSP methods, along with two recent multistate methods based on AlphaFold, against this data set. Our findings reveal a remarkably consistent preference for specific states across various PSP methods. We elucidated how coevolution information in MSAs influences state preference. Moreover, we showed that AlphaFold, when excluding coevolution information, estimated similar energies between the experimental IF and OF conformations, indicating that the energy model learned by AlphaFold is not biased toward any particular state. Our IOMemP data set and benchmark results are anticipated to advance the development of robust ACP methods.
Affiliation(s)
- Tengyu Xie
- College of Life Science, Zhejiang University, HangZhou Zhejiang 310058, China
- Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, HangZhou Zhejiang 310024, China
- Westlake AI Therapeutics Lab, Westlake Laboratory of Life Sciences and Biomedicine, HangZhou Zhejiang 310024, China
| | - Jing Huang
- College of Life Science, Zhejiang University, HangZhou Zhejiang 310058, China
- Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, HangZhou Zhejiang 310024, China
- Westlake AI Therapeutics Lab, Westlake Laboratory of Life Sciences and Biomedicine, HangZhou Zhejiang 310024, China
| |
Collapse
|
32
|
Tripp A, Braun M, Wieser F, Oberdorfer G, Lechner H. Click, Compute, Create: A Review of Web-based Tools for Enzyme Engineering. Chembiochem 2024:e202400092. [PMID: 38634409 DOI: 10.1002/cbic.202400092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/31/2024] [Revised: 04/14/2024] [Accepted: 04/15/2024] [Indexed: 04/19/2024]
Abstract
Enzyme engineering, though pivotal across various biotechnological domains, is often plagued by its time-consuming and labor-intensive nature. This review aims to offer an overview of supportive in silico methodologies for this demanding endeavor. Starting from methods to predict protein structures, classify their activity, and even discover new enzymes, we continue by describing tools used to increase thermostability and production yields of selected targets. Subsequently, we discuss computational methods to modulate both the activity and the selectivity of enzymes. Last, we present recent approaches based on cutting-edge machine learning methods to redesign enzymes. With the exception of the last chapter, there is a strong focus on methods easily accessible via web interfaces or simple Python scripts, and therefore readily usable by a diverse and broad community.
Affiliation(s)
- Adrian Tripp
- Institute of Biochemistry, Graz University of Technology, Petersgasse 12/2, 8010, Graz, Austria
| | - Markus Braun
- Institute of Biochemistry, Graz University of Technology, Petersgasse 12/2, 8010, Graz, Austria
| | - Florian Wieser
- Institute of Biochemistry, Graz University of Technology, Petersgasse 12/2, 8010, Graz, Austria
| | - Gustav Oberdorfer
- Institute of Biochemistry, Graz University of Technology, Petersgasse 12/2, 8010, Graz, Austria
- BioTechMed, Graz, Austria
| | - Horst Lechner
- Institute of Biochemistry, Graz University of Technology, Petersgasse 12/2, 8010, Graz, Austria
- BioTechMed, Graz, Austria
| |
|
33
|
Vani BP, Aranganathan A, Tiwary P. Exploring Kinase Asp-Phe-Gly (DFG) Loop Conformational Stability with AlphaFold2-RAVE. J Chem Inf Model 2024; 64:2789-2797. [PMID: 37981824 PMCID: PMC11001530 DOI: 10.1021/acs.jcim.3c01436] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Indexed: 11/21/2023]
Abstract
Kinases compose one of the largest fractions of the human proteome, and their misfunction is implicated in many diseases, in particular, cancers. The ubiquitousness and structural similarities of kinases make specific and effective drug design difficult. In particular, conformational variability due to the evolutionarily conserved Asp-Phe-Gly (DFG) motif adopting in and out conformations and the relative stabilities thereof are key in structure-based drug design for ATP competitive drugs. These relative conformational stabilities are extremely sensitive to small changes in sequence and provide an important problem for sampling method development. Since the invention of AlphaFold2, the world of structure-based drug design has noticeably changed. In spite of it being limited to crystal-like structure prediction, several methods have also leveraged its underlying architecture to improve dynamics and enhanced sampling of conformational ensembles, including AlphaFold2-RAVE. Here, we extend AlphaFold2-RAVE and apply it to a set of kinases: the wild type DDR1 sequence and three mutants with single point mutations that are known to behave drastically differently. We show that AlphaFold2-RAVE is able to efficiently recover the changes in relative stability using transferable learned order parameters and potentials, thereby supplementing AlphaFold2 as a tool for exploration of Boltzmann-weighted protein conformations (Meller, A.; Bhakat, S.; Solieva, S.; Bowman, G. R. Accelerating Cryptic Pocket Discovery Using AlphaFold. J. Chem. Theory Comput. 2023, 19, 4355-4363).
Affiliation(s)
- Bodhi P. Vani
- Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA
| | - Akashnathan Aranganathan
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA
| | - Pratyush Tiwary
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA
| |
|
34
|
Raisinghani N, Alshahrani M, Gupta G, Xiao S, Tao P, Verkhivker G. Predicting Functional Conformational Ensembles and Binding Mechanisms of Convergent Evolution for SARS-CoV-2 Spike Omicron Variants Using AlphaFold2 Sequence Scanning Adaptations and Molecular Dynamics Simulations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.02.587850. [PMID: 38617283 PMCID: PMC11014522 DOI: 10.1101/2024.04.02.587850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/16/2024]
Abstract
In this study, we combined AlphaFold-based approaches for atomistic modeling of multiple protein states and microsecond molecular simulations to accurately characterize conformational ensembles and binding mechanisms of convergent evolution for the SARS-CoV-2 Spike Omicron variants BA.1, BA.2, BA.2.75, BA.3, BA.4/BA.5 and BQ.1.1. We employed and validated several different adaptations of the AlphaFold methodology for modeling of conformational ensembles including the introduced randomized full sequence scanning for manipulation of sequence variations to systematically explore conformational dynamics of Omicron Spike protein complexes with the ACE2 receptor. Microsecond atomistic molecular dynamic simulations provide a detailed characterization of the conformational landscapes and thermodynamic stability of the Omicron variant complexes. By integrating the predictions of conformational ensembles from different AlphaFold adaptations and applying statistical confidence metrics we can expand characterization of the conformational ensembles and identify functional protein conformations that determine the equilibrium dynamics for the Omicron Spike complexes with the ACE2. Conformational ensembles of the Omicron RBD-ACE2 complexes obtained using AlphaFold-based approaches for modeling protein states and molecular dynamics simulations are employed for accurate comparative prediction of the binding energetics revealing an excellent agreement with the experimental data. In particular, the results demonstrated that AlphaFold-generated extended conformational ensembles can produce accurate binding energies for the Omicron RBD-ACE2 complexes. 
The results of this study suggest complementarities and potential synergies between AlphaFold predictions of protein conformational ensembles and molecular dynamics simulations, showing that integrating information from both methods can potentially yield a more adequate characterization of the conformational landscapes for the Omicron RBD-ACE2 complexes. This study provides insights into the interplay between conformational dynamics and binding, showing that evolution of Omicron variants through acquisition of convergent mutational sites may leverage conformational adaptability and dynamic couplings between key binding energy hotspots to optimize ACE2 binding affinity and enable immune evasion.
|
35
|
Monté D, Lens Z, Dewitte F, Villeret V, Verger A. Assessment of machine-learning predictions for the Mediator complex subunit MED25 ACID domain interactions with transactivation domains. FEBS Lett 2024; 598:758-773. [PMID: 38436147 DOI: 10.1002/1873-3468.14837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/30/2023] [Revised: 02/01/2024] [Accepted: 02/10/2024] [Indexed: 03/05/2024]
Abstract
The human Mediator complex subunit MED25 binds transactivation domains (TADs) present in various cellular and viral proteins using two binding interfaces, named H1 and H2, which are found on opposite sides of its ACID domain. Here, we use and compare deep learning methods to characterize human MED25-TAD interfaces and assess the predicted models against published experimental data. For the H1 interface, AlphaFold produces predictions with high-reliability scores that agree well with experimental data, while the H2 interface predictions appear inconsistent, preventing reliable assignment of binding modes. Despite these limitations, we experimentally assess the validity of MED25 interface predictions with the viral transcriptional activators Lana-1 and IE62. AlphaFold predictions also suggest the existence of a unique hydrophobic pocket for the Arabidopsis MED25 ACID domain.
Affiliation(s)
- Didier Monté
- CNRS EMR 9002 Integrative Structural Biology, Inserm U 1167 - RID-AGE, Univ. Lille, CHU Lille, Institut Pasteur de Lille, France
| | - Zoé Lens
- CNRS EMR 9002 Integrative Structural Biology, Inserm U 1167 - RID-AGE, Univ. Lille, CHU Lille, Institut Pasteur de Lille, France
| | - Frédérique Dewitte
- CNRS EMR 9002 Integrative Structural Biology, Inserm U 1167 - RID-AGE, Univ. Lille, CHU Lille, Institut Pasteur de Lille, France
| | - Vincent Villeret
- CNRS EMR 9002 Integrative Structural Biology, Inserm U 1167 - RID-AGE, Univ. Lille, CHU Lille, Institut Pasteur de Lille, France
| | - Alexis Verger
- CNRS EMR 9002 Integrative Structural Biology, Inserm U 1167 - RID-AGE, Univ. Lille, CHU Lille, Institut Pasteur de Lille, France
| |
|
36
|
Monteiro da Silva G, Cui JY, Dalgarno DC, Lisi GP, Rubenstein BM. High-throughput prediction of protein conformational distributions with subsampled AlphaFold2. Nat Commun 2024; 15:2464. [PMID: 38538622 PMCID: PMC10973385 DOI: 10.1038/s41467-024-46715-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/03/2023] [Accepted: 02/28/2024] [Indexed: 04/12/2024] Open
Abstract
This paper presents an innovative approach for predicting the relative populations of protein conformations using AlphaFold 2, an AI-powered method that has revolutionized biology by enabling the accurate prediction of protein structures. While AlphaFold 2 has shown exceptional accuracy and speed, it is designed to predict proteins' ground state conformations and is limited in its ability to predict conformational landscapes. Here, we demonstrate how AlphaFold 2 can directly predict the relative populations of different protein conformations by subsampling multiple sequence alignments. We tested our method against nuclear magnetic resonance experiments on two proteins with drastically different amounts of available sequence data, Abl1 kinase and the granulocyte-macrophage colony-stimulating factor, and predicted changes in their relative state populations with more than 80% accuracy. Our subsampling approach worked best when used to qualitatively predict the effects of mutations or evolution on the conformational landscape and well-populated states of proteins. It thus offers a fast and cost-effective way to predict the relative populations of protein conformations at even single-point mutation resolution, making it a useful tool for pharmacology, analysis of experimental results, and predicting evolution.
Affiliation(s)
| | - Jennifer Y Cui
- Brown University Department of Molecular and Cell Biology and Biochemistry, Providence, RI, USA
| | | | - George P Lisi
- Brown University Department of Molecular and Cell Biology and Biochemistry, Providence, RI, USA
- Brown University Department of Chemistry, Providence, RI, USA
| | - Brenda M Rubenstein
- Brown University Department of Molecular and Cell Biology and Biochemistry, Providence, RI, USA.
- Brown University Department of Chemistry, Providence, RI, USA.
| |
|
37
|
Tu G, Fu T, Zheng G, Xu B, Gou R, Luo D, Wang P, Xue W. Computational Chemistry in Structure-Based Solute Carrier Transporter Drug Design: Recent Advances and Future Perspectives. J Chem Inf Model 2024; 64:1433-1455. [PMID: 38294194 DOI: 10.1021/acs.jcim.3c01736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/01/2024]
Abstract
Solute carrier transporters (SLCs) are a class of important transmembrane proteins that are involved in the transportation of diverse solute ions and small molecules into cells. There are approximately 450 SLCs within the human body, and more than a quarter of them are emerging as attractive therapeutic targets for multiple complex diseases, e.g., depression, cancer, and diabetes. However, only 44 unique transporters (∼9.8% of the SLC superfamily) with 3D structures and specific binding sites have been reported. To design innovative and effective drugs targeting diverse SLCs, a number of obstacles need to be overcome. Computational chemistry, including physics-based molecular modeling and machine learning- and deep learning-based artificial intelligence (AI), provides an alternative and complementary route to the classical drug discovery approach. Here, we present a comprehensive overview of recent advances and existing challenges of the computational techniques in structure-based drug design of SLCs from three main aspects: (i) characterizing multiple conformations of the proteins during the functional process of transportation, (ii) identifying druggable sites, especially cryptic allosteric ones, on the transporters for substrate and drug binding, and (iii) discovering diverse small molecules or synthetic protein binders targeting the binding sites. This work is expected to provide guidelines for a deep understanding of the structure and function of the SLC superfamily to facilitate rational design of novel modulators of the transporters with the aid of state-of-the-art computational chemistry technologies including artificial intelligence.
Affiliation(s)
- Gao Tu
- Chongqing Key Laboratory of Natural Product Synthesis and Drug Research, School of Pharmaceutical Sciences, Chongqing University, Chongqing 401331, China
| | - Tingting Fu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | | | - Binbin Xu
- Chengdu Sintanovo Biotechnology Co., Ltd., Chengdu 610200, China
| | - Rongpei Gou
- Chongqing Key Laboratory of Natural Product Synthesis and Drug Research, School of Pharmaceutical Sciences, Chongqing University, Chongqing 401331, China
| | - Ding Luo
- Chongqing Key Laboratory of Natural Product Synthesis and Drug Research, School of Pharmaceutical Sciences, Chongqing University, Chongqing 401331, China
| | - Panpan Wang
- College of Chemistry and Pharmaceutical Engineering, Huanghuai University, Zhumadian 463000, China
| | - Weiwei Xue
- Chongqing Key Laboratory of Natural Product Synthesis and Drug Research, School of Pharmaceutical Sciences, Chongqing University, Chongqing 401331, China
| |
|
38
|
Raisinghani N, Alshahrani M, Gupta G, Xiao S, Tao P, Verkhivker G. AlphaFold2-Enabled Atomistic Modeling of Structure, Conformational Ensembles, and Binding Energetics of the SARS-CoV-2 Omicron BA.2.86 Spike Protein with ACE2 Host Receptor and Antibodies: Compensatory Functional Effects of Binding Hotspots in Modulating Mechanisms of Receptor Binding and Immune Escape. J Chem Inf Model 2024; 64:1657-1681. [PMID: 38373700 DOI: 10.1021/acs.jcim.3c01857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/21/2024]
Abstract
The latest wave of SARS-CoV-2 Omicron variants displayed a growth advantage and increased viral fitness through convergent evolution of functional hotspots that work synchronously to balance fitness requirements for productive receptor binding and efficient immune evasion. In this study, we combined AlphaFold2-based structural modeling approaches with atomistic simulations and mutational profiling of binding energetics and stability for prediction and comprehensive analysis of the structure, dynamics, and binding of the SARS-CoV-2 Omicron BA.2.86 spike variant with ACE2 host receptor and distinct classes of antibodies. We adapted several AlphaFold2 approaches to predict both the structure and conformational ensembles of the Omicron BA.2.86 spike protein in the complex with the host receptor. The results showed that the AlphaFold2-predicted structural ensemble of the BA.2.86 spike protein complex with ACE2 can accurately capture the main conformational states of the Omicron variant. Complementary to AlphaFold2 structural predictions, microsecond molecular dynamics simulations reveal the details of the conformational landscape and produced equilibrium ensembles of the BA.2.86 structures that are used to perform mutational scanning of spike residues and characterize structural stability and binding energy hotspots. The ensemble-based mutational profiling of the receptor binding domain residues in the BA.2 and BA.2.86 spike complexes with ACE2 revealed a group of conserved hydrophobic hotspots and critical variant-specific contributions of the BA.2.86 convergent mutational hotspots R403K, F486P, and R493Q. To examine the immune evasion properties of BA.2.86 in atomistic detail, we performed structure-based mutational profiling of the spike protein binding interfaces with distinct classes of antibodies that displayed significantly reduced neutralization against the BA.2.86 variant. 
The results revealed the molecular basis of compensatory functional effects of the binding hotspots, showing that the BA.2.86 lineage may have evolved to outcompete other Omicron subvariants by improving immune evasion while preserving binding affinity with ACE2 through a compensatory effect of the R493Q and F486P convergent mutational hotspots. This study demonstrated that an integrative approach combining AlphaFold2 predictions with complementary atomistic molecular dynamics simulations and robust ensemble-based mutational profiling of spike residues can enable accurate and comprehensive characterization of structure, dynamics, and binding mechanisms of newly emerging Omicron variants.
Affiliation(s)
- Nishank Raisinghani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States of America
| | - Mohammed Alshahrani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States of America
| | - Grace Gupta
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States of America
| | - Sian Xiao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75275, United States of America
| | - Peng Tao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75275, United States of America
| | - Gennady Verkhivker
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States of America
- Department of Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, Irvine, California 92618, United States of America
| |
|
39
|
Martin J. AlphaFold2 Predicts Whether Proteins Interact Amidst Confounding Structural Compatibility. J Chem Inf Model 2024; 64:1473-1480. [PMID: 38373070 DOI: 10.1021/acs.jcim.3c01805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/21/2024]
Abstract
Predicting whether two proteins physically interact is one of the holy grails of computational biology, galvanized by rapid advancements in deep learning. AlphaFold2, although not developed with this goal, is promising in this respect. Here, I test the prediction capability of AlphaFold2 on a very challenging data set, where proteins are structurally compatible, even when they do not interact. AlphaFold2 achieves high discrimination between interacting and non-interacting proteins, and the cases of misclassifications can either be rescued by revisiting the input sequences or can suggest false positives and negatives in the data set. AlphaFold2 is thus not impaired by the compatibility between protein structures and has the potential to be applied on a large scale.
Affiliation(s)
- Juliette Martin
- Univ Lyon, CNRS, UMR 5086 MMSB, 7 passage du Vercors F-69367, Lyon, France
- Laboratory of Biology and Modeling of the Cell, Ecole Normale Supérieure de Lyon, CNRS UMR 5239, Inserm U1293, University Claude Bernard Lyon 1, 69364, Lyon, France
| |
|
40
|
Hong L, Kortemme T. An integrative approach to protein sequence design through multiobjective optimization. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.01.582670. [PMID: 38496480 PMCID: PMC10942313 DOI: 10.1101/2024.03.01.582670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/19/2024]
Abstract
With recent methodological advances in the field of computational protein design, in particular those based on deep learning, there is an increasing need for frameworks that allow for coherent, direct integration of different models and objective functions into the generative design process. Here we demonstrate how evolutionary multiobjective optimization techniques can be adapted to provide such an approach. With the established Non-dominated Sorting Genetic Algorithm II (NSGA-II) as the optimization framework, we use AlphaFold2 and ProteinMPNN confidence metrics to define the objective space, and a mutation operator composed of ESM-1v and ProteinMPNN to rank and then redesign the least favorable positions. Using the multistate design problem of the foldswitching protein RfaH as an in-depth case study, we show that the evolutionary multiobjective optimization approach leads to significant reduction in the bias and variance in RfaH native sequence recovery, compared to a direct application of ProteinMPNN. We suggest that this improvement is due to three factors: (i) the use of an informative mutation operator that accelerates the sequence space exploration, (ii) the parallel, iterative design process inherent to the genetic algorithm that improves upon the ProteinMPNN autoregressive sequence decoding scheme, and (iii) the explicit approximation of the Pareto front that leads to optimal design candidates representing diverse tradeoff conditions. We anticipate this approach to be readily adaptable to different models and broadly relevant for protein design tasks with complex specifications.
Affiliation(s)
- Lu Hong
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
- Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94158, USA
- Chan Zuckerberg Biohub, San Francisco, CA 94158, USA
| |
|
41
|
Jänes J, Beltrao P. Deep learning for protein structure prediction and design-progress and applications. Mol Syst Biol 2024; 20:162-169. [PMID: 38291232 PMCID: PMC10912668 DOI: 10.1038/s44320-024-00016-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/26/2023] [Revised: 12/21/2023] [Accepted: 01/11/2024] [Indexed: 02/01/2024] Open
Abstract
Proteins are the key molecular machines that orchestrate all biological processes of the cell. Most proteins fold into three-dimensional shapes that are critical for their function. Studying the 3D shape of proteins can inform us of the mechanisms that underlie biological processes in living cells and can have practical applications in the study of disease mutations or the discovery of novel drug treatments. Here, we review the progress made in sequence-based prediction of protein structures with a focus on applications that go beyond the prediction of single monomer structures. This includes the application of deep learning methods for the prediction of structures of protein complexes, different conformations, the evolution of protein structures and the application of these methods to protein design. These developments create new opportunities for research that will have impact across many areas of biomedical research.
Affiliation(s)
- Jürgen Jänes
- Institute of Molecular Systems Biology, ETH Zürich, 8093, Zürich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Pedro Beltrao
- Institute of Molecular Systems Biology, ETH Zürich, 8093, Zürich, Switzerland.
- Swiss Institute of Bioinformatics, Lausanne, Switzerland.
| |
|
42
|
Manalastas-Cantos K, Adoni KR, Pfeifer M, Märtens B, Grünewald K, Thalassinos K, Topf M. Modeling Flexible Protein Structure With AlphaFold2 and Crosslinking Mass Spectrometry. Mol Cell Proteomics 2024; 23:100724. [PMID: 38266916 PMCID: PMC10884514 DOI: 10.1016/j.mcpro.2024.100724] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/13/2023] [Revised: 12/23/2023] [Accepted: 12/27/2023] [Indexed: 01/26/2024] Open
Abstract
We propose a pipeline that combines AlphaFold2 (AF2) and crosslinking mass spectrometry (XL-MS) to model the structure of proteins with multiple conformations. The pipeline consists of two main steps: ensemble generation using AF2 and conformer selection using XL-MS data. For conformer selection, we developed two scores, the monolink probability score (MP) and the crosslink probability score (XLP), both of which are based on residue depth from the protein surface. We benchmarked MP and XLP on a large dataset of decoy protein structures and showed that our scores outperform previously developed scores. We then tested our methodology on three proteins having an open and closed conformation in the Protein Data Bank: Complement component 3 (C3), luciferase, and glutamine-binding periplasmic protein, first generating ensembles using AF2, which were then screened for the open and closed conformations using experimental XL-MS data. In five out of six cases, the most accurate model within the AF2 ensembles (or a conformation within 1 Å of this model) was identified using crosslinks, as assessed through the XLP score. In the remaining case, only the monolinks (assessed through the MP score) successfully identified the open conformation of glutamine-binding periplasmic protein, and these results were further improved by including the "occupancy" of the monolinks. This serves as a compelling proof-of-concept for the effectiveness of monolinks. In contrast, the AF2 assessment score was only able to identify the most accurate conformation in two out of six cases. Our results highlight the complementarity of AF2 with experimental methods like XL-MS, with the MP and XLP scores providing reliable metrics to assess the quality of the predicted models. The MP and XLP scoring functions mentioned above are available at https://gitlab.com/topf-lab/xlms-tools.
Affiliation(s)
- Karen Manalastas-Cantos
- Center for Data and Computing in Natural Sciences, Universität Hamburg, Hamburg, Germany; Department of Integrative Virology, Leibniz-Institut für Virologie (LIV), Centre for Structural Systems Biology (CSSB), Hamburg, Germany
| | - Kish R Adoni
- Institute of Structural and Molecular Biology, Division of Biosciences, University College London, London, UK; Institute of Structural and Molecular Biology, Birkbeck College, University of London, London, United Kingdom
| | - Matthias Pfeifer
- Department of Integrative Virology, Leibniz-Institut für Virologie (LIV), Centre for Structural Systems Biology (CSSB), Hamburg, Germany; Universitätsklinikum Hamburg Eppendorf (UKE), Hamburg, Germany
| | - Birgit Märtens
- Department of Integrative Virology, Leibniz-Institut für Virologie (LIV), Centre for Structural Systems Biology (CSSB), Hamburg, Germany; Universitätsklinikum Hamburg Eppendorf (UKE), Hamburg, Germany
| | - Kay Grünewald
- Department of Integrative Virology, Leibniz-Institut für Virologie (LIV), Centre for Structural Systems Biology (CSSB), Hamburg, Germany; Department of Chemistry, Universität Hamburg, Hamburg, Germany
| | - Konstantinos Thalassinos
- Institute of Structural and Molecular Biology, Division of Biosciences, University College London, London, UK; Institute of Structural and Molecular Biology, Birkbeck College, University of London, London, United Kingdom
| | - Maya Topf
- Department of Integrative Virology, Leibniz-Institut für Virologie (LIV), Centre for Structural Systems Biology (CSSB), Hamburg, Germany; Universitätsklinikum Hamburg Eppendorf (UKE), Hamburg, Germany.
| |
|
43
|
Meller A, Kelly D, Smith LG, Bowman GR. Toward physics-based precision medicine: Exploiting protein dynamics to design new therapeutics and interpret variants. Protein Sci 2024; 33:e4902. [PMID: 38358129 PMCID: PMC10868452 DOI: 10.1002/pro.4902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/01/2023] [Revised: 12/01/2023] [Accepted: 01/04/2024] [Indexed: 02/16/2024]
Abstract
The goal of precision medicine is to utilize our knowledge of the molecular causes of disease to better diagnose and treat patients. However, there is a substantial mismatch between the small number of Food and Drug Administration (FDA)-approved drugs and annotated coding variants compared to the needs of precision medicine. This review introduces the concept of physics-based precision medicine, a scalable framework that promises to improve our understanding of sequence-function relationships and accelerate drug discovery. We show that accounting for the ensemble of structures a protein adopts in solution with computer simulations overcomes many of the limitations imposed by assuming a single protein structure. We highlight studies of protein dynamics and recent methods for the analysis of structural ensembles. These studies demonstrate that differences in conformational distributions predict functional differences within protein families and between variants. Thanks to new computational tools that are providing unprecedented access to protein structural ensembles, this insight may enable accurate predictions of variant pathogenicity for entire libraries of variants. We further show that explicitly accounting for protein ensembles, with methods like alchemical free energy calculations or docking to Markov state models, can uncover novel lead compounds. To conclude, we demonstrate that cryptic pockets, or cavities absent in experimental structures, provide an avenue to target proteins that are currently considered undruggable. Taken together, our review provides a roadmap for the field of protein science to accelerate precision medicine.
Affiliation(s)
- Artur Meller
- Department of Biochemistry and Molecular Biophysics, Washington University in St. Louis, St. Louis, Missouri, USA
- Medical Scientist Training Program, Washington University in St. Louis, St. Louis, Missouri, USA
- Departments of Biochemistry & Biophysics and Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Devin Kelly
- Departments of Biochemistry & Biophysics and Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Louis G. Smith
- Departments of Biochemistry & Biophysics and Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Gregory R. Bowman
- Departments of Biochemistry & Biophysics and Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| |
|
44
|
Yao H, Wang X, Chi J, Chen H, Liu Y, Yang J, Yu J, Ruan Y, Xiang X, Pi J, Xu JF. Exploring Novel Antidepressants Targeting G Protein-Coupled Receptors and Key Membrane Receptors Based on Molecular Structures. Molecules 2024; 29:964. [PMID: 38474476 DOI: 10.3390/molecules29050964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/17/2023] [Revised: 01/29/2024] [Accepted: 02/09/2024] [Indexed: 03/14/2024] Open
Abstract
Major Depressive Disorder (MDD) is a complex mental disorder that involves alterations in signal transmission across multiple scales and structural abnormalities. The development of effective antidepressants (ADs) has been hindered by the dominance of the monoamine hypothesis, resulting in slow progress. Traditional ADs have undesirable traits like delayed onset of action, limited efficacy, and severe side effects. Recently, two categories of fast-acting antidepressant compounds have surfaced: the dissociative anesthetic S-ketamine and its metabolites, and psychedelics such as lysergic acid diethylamide (LSD). This has led to structural research and drug development of the receptors that they target. This review summarizes breakthroughs and achievements in the structure of depression-related receptors and novel ADs based on these. Cryo-electron microscopy (cryo-EM) has enabled researchers to identify the structures of membrane receptors, including the N-methyl-D-aspartate receptor (NMDAR) and the 5-hydroxytryptamine 2A (5-HT2A) receptor. These high-resolution structures can be used for the development of novel ADs using virtual drug screening (VDS). Moreover, the unique antidepressant effects of 5-HT1A receptors in various brain regions, and the pivotal roles of the α-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid receptor (AMPAR) and tyrosine kinase receptor 2 (TrkB) in regulating synaptic plasticity, emphasize their potential as therapeutic targets. Using structural information, a series of highly selective ADs were designed based on the different roles of receptors in MDD. These molecules have the favorable characteristics of rapid onset and low adverse drug reactions. This review offers researchers guidance and a methodological framework for the structure-based design of ADs.
Affiliation(s)
- Hanbo Yao
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Xiaodong Wang
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Jiaxin Chi
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Haorong Chen
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Yilin Liu
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Jiayi Yang
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Jiaqi Yu
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Yongdui Ruan
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
| | - Xufu Xiang
- The Key Laboratory for Biomedical Photonics of MOE at Wuhan National Laboratory for Optoelectronics-Hubei Bioinformatics and Molecular Imaging Key Laboratory, Systems Biology Theme, Department of Biomedical Engineering, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Jiang Pi
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Jun-Fa Xu
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| |
|
45
|
Corum MR, Venkannagari H, Hryc CF, Baker ML. Predictive modeling and cryo-EM: A synergistic approach to modeling macromolecular structure. Biophys J 2024; 123:435-450. [PMID: 38268190 PMCID: PMC10912932 DOI: 10.1016/j.bpj.2024.01.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/19/2023] [Revised: 01/09/2024] [Accepted: 01/18/2024] [Indexed: 01/26/2024] Open
Abstract
Over the last 15 years, structural biology has seen unprecedented development and improvement in two areas: electron cryo-microscopy (cryo-EM) and predictive modeling. Once relegated to low resolutions, single-particle cryo-EM is now capable of achieving near-atomic resolutions of a wide variety of macromolecular complexes. Ushered in by AlphaFold, machine learning has powered the current generation of predictive modeling tools, which can accurately and reliably predict models for proteins and some complexes directly from the sequence alone. Although they offer new opportunities individually, there is an inherent synergy between these techniques, allowing for the construction of large, complex macromolecular models. Here, we give a brief overview of these approaches in addition to illustrating works that combine these techniques for model building. These examples provide insight into model building, assessment, and limitations when integrating predictive modeling with cryo-EM density maps. Together, these approaches offer the potential to greatly accelerate the generation of macromolecular structural insights, particularly when coupled with experimental data.
Affiliation(s)
- Michael R Corum
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Harikanth Venkannagari
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Corey F Hryc
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| | - Matthew L Baker
- Department of Biochemistry and Molecular Biology, McGovern Medical School at the University of Texas Health Science Center, Houston, Texas
| |
|
46
|
Raisinghani N, Alshahrani M, Gupta G, Tian H, Xiao S, Tao P, Verkhivker G. Interpretable Atomistic Prediction and Functional Analysis of Conformational Ensembles and Allosteric States in Protein Kinases Using AlphaFold2 Adaptation with Randomized Sequence Scanning and Local Frustration Profiling. bioRxiv 2024:2024.02.15.580591. [PMID: 38496487 PMCID: PMC10942451 DOI: 10.1101/2024.02.15.580591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/19/2024]
Abstract
The groundbreaking achievements of AlphaFold2 (AF2) approaches in protein structure modeling marked a transformative era in structural biology. Despite the success of AF2 tools in predicting single protein structures, these methods showed intrinsic limitations in predicting multiple functional conformations of allosteric proteins and fold-switching systems. The recent NMR-based structural determination of the unbound ABL kinase in the active state and two inactive low-populated functional conformations that are unique for ABL kinase presents an ideal challenge for AF2 approaches. In the current study we employ several implementations of AF2 methods to predict protein conformational ensembles and allosteric states of the ABL kinase, including (a) a multiple sequence alignment (MSA) subsampling approach; (b) the SPEACH_AF approach, in which alanine scanning is performed on generated MSAs; and (c) randomized full-sequence mutational scanning, introduced in this study, for manipulation of sequence variations combined with MSA subsampling. We show that the proposed AF2 adaptation combined with local frustration mapping of conformational states enables accurate prediction of the ABL active and intermediate structures and conformational ensembles, also offering a robust approach for interpretable characterization of the AF2 predictions and limitations in detecting hidden allosteric states. We found that the large high-frustration residue clusters are uniquely characteristic of the low-populated, fully inactive ABL form and can define energetically frustrated cracking sites of conformational transitions, presenting difficult targets for AF2 methods.
This study uncovered previously unappreciated, fundamental connections between distinct patterns of local frustration in functional kinase states and AF2 successes/limitations in detecting low-populated frustrated conformations, providing a better understanding of benefits and limitations of current AF2-based adaptations in modeling of conformational ensembles.
|
47
|
Wuyun Q, Chen Y, Shen Y, Cao Y, Hu G, Cui W, Gao J, Zheng W. Recent Progress of Protein Tertiary Structure Prediction. Molecules 2024; 29:832. [PMID: 38398585 PMCID: PMC10893003 DOI: 10.3390/molecules29040832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/30/2023] [Revised: 02/06/2024] [Accepted: 02/08/2024] [Indexed: 02/25/2024] Open
Abstract
The prediction of three-dimensional (3D) protein structure from amino acid sequences has stood as a significant challenge in computational and structural bioinformatics for decades. Recently, the widespread integration of artificial intelligence (AI) algorithms has substantially expedited advancements in protein structure prediction, yielding numerous significant milestones. In particular, the end-to-end deep learning method AlphaFold2 has facilitated the rise of structure prediction performance to new heights, regularly competitive with experimental structures in the 14th Critical Assessment of Protein Structure Prediction (CASP14). To provide a comprehensive understanding and guide future research in the field of protein structure prediction for researchers, this review describes various methodologies, assessments, and databases in protein structure prediction, including traditionally used protein structure prediction methods, such as template-based modeling (TBM) and template-free modeling (FM) approaches; recently developed deep learning-based methods, such as contact/distance-guided methods, end-to-end folding methods, and protein language model (PLM)-based methods; multi-domain protein structure prediction methods; the CASP experiments and related assessments; and the recently released AlphaFold Protein Structure Database (AlphaFold DB). We discuss their advantages, disadvantages, and application scopes, aiming to provide researchers with insights through which to understand the limitations, contexts, and effective selections of protein structure prediction methods in protein-related fields.
Affiliation(s)
- Qiqige Wuyun
- Department of Computer Science and Engineering, Michigan State University, East Lansing, MI 48824, USA
| | - Yihan Chen
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin 300071, China
| | - Yifeng Shen
- Faculty of Environment and Information Studies, Keio University, Fujisawa 252-0882, Kanagawa, Japan
| | - Yang Cao
- College of Life Sciences, Sichuan University, Chengdu 610065, China
| | - Gang Hu
- NITFID, School of Statistics and Data Science, LPMC and KLMDASR, Nankai University, Tianjin 300071, China
| | - Wei Cui
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin 300071, China
| | - Jianzhao Gao
- School of Mathematical Sciences and LPMC, Nankai University, Tianjin 300071, China
| | - Wei Zheng
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| |
|
48
|
Brown BP, Stein RA, Meiler J, Mchaourab HS. Approximating Projections of Conformational Boltzmann Distributions with AlphaFold2 Predictions: Opportunities and Limitations. J Chem Theory Comput 2024; 20:1434-1447. [PMID: 38215214 PMCID: PMC10867840 DOI: 10.1021/acs.jctc.3c01081] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 09/29/2023] [Revised: 12/13/2023] [Accepted: 12/13/2023] [Indexed: 01/14/2024]
Abstract
Protein thermodynamics is intimately tied to biological function and can enable processes such as signal transduction, enzyme catalysis, and molecular recognition. The relative free energies of conformations that contribute to these functional equilibria evolved for the physiology of the organism. Despite the importance of these equilibria for understanding biological function and developing treatments for disease, computational and experimental methods capable of quantifying the energetic determinants of these equilibria are limited to systems of modest size. Recently, it has been demonstrated that the artificial intelligence system AlphaFold2 can be manipulated to produce structurally valid protein conformational ensembles. Here, we extend these studies and explore the extent to which AlphaFold2 contact distance distributions can approximate projections of the conformational Boltzmann distributions. For this purpose, we examine the joint probability distributions of inter-residue contact distances along functionally relevant collective variables of several protein systems. Our studies suggest that AlphaFold2 normalized contact distance distributions can correlate with conformation probabilities obtained with other methods but that they suffer from peak broadening. We also find that the AlphaFold2 contact distance distributions can be sensitive to point mutations. Overall, we anticipate that our findings will be valuable as the community seeks to model the thermodynamics of conformational changes in large biomolecular systems.
Affiliation(s)
- Benjamin P. Brown
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37232, United States
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee 37232, United States
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, Tennessee 37232, United States
| | - Richard A. Stein
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, Tennessee 37232, United States
- Department of Molecular Physiology and Biophysics, Vanderbilt University School of Medicine, Nashville, Tennessee 37232, United States
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37232, United States
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee 37232, United States
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, Tennessee 37232, United States
- Institute for Drug Discovery, Leipzig University Medical School, Leipzig, SAC 04103, Germany
| | - Hassane S. Mchaourab
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee 37232, United States
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, Tennessee 37232, United States
- Department of Molecular Physiology and Biophysics, Vanderbilt University School of Medicine, Nashville, Tennessee 37232, United States
| |
|
49
|
Ragonis-Bachar P, Axel G, Blau S, Ben-Tal N, Kolodny R, Landau M. What can AlphaFold do for antimicrobial amyloids? Proteins 2024; 92:265-281. [PMID: 37855235 DOI: 10.1002/prot.26618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/13/2023] [Revised: 09/05/2023] [Accepted: 10/05/2023] [Indexed: 10/20/2023]
Abstract
Amyloids, protein, and peptide assemblies in various organisms are crucial in physiological and pathological processes. Their intricate structures, however, present significant challenges, limiting our understanding of their functions, regulatory mechanisms, and potential applications in biomedicine and technology. This study evaluated the AlphaFold2 ColabFold method's structure predictions for antimicrobial amyloids, using eight antimicrobial peptides (AMPs), including those with experimentally determined structures and AMPs known for their distinct amyloidogenic morphological features. Additionally, two well-known human amyloids, amyloid-β and islet amyloid polypeptide, were included in the analysis due to their disease relevance, short sequences, and antimicrobial properties. Amyloids typically exhibit tightly mated β-strand sheets forming a cross-β configuration. However, certain amphipathic α-helical subunits can also form amyloid fibrils adopting a cross-α structure. Some AMPs in the study exhibited a combination of cross-α and cross-β amyloid fibrils, adding complexity to structure prediction. The results showed that the AlphaFold2 ColabFold models favored α-helical structures in the tested amyloids, successfully predicting the presence of α-helical mated sheets and a hydrophobic core resembling the cross-α configuration. This implies that the AI-based algorithms prefer assemblies of the monomeric state, which was frequently predicted as helical, or capture an α-helical membrane-active form of toxic peptides, which is triggered upon interaction with lipid membranes.
Affiliation(s)
| | - Gabriel Axel
- George S. Wise Faculty of Life Sciences, Department of Biochemistry and Molecular Biology, Tel Aviv University, Tel Aviv, Israel
| | - Shahar Blau
- Department of Biology, Technion-Israel Institute of Technology, Haifa, Israel
| | - Nir Ben-Tal
- George S. Wise Faculty of Life Sciences, Department of Biochemistry and Molecular Biology, Tel Aviv University, Tel Aviv, Israel
| | - Rachel Kolodny
- Department of Computer Science, University of Haifa, Haifa, Israel
| | - Meytal Landau
- Department of Biology, Technion-Israel Institute of Technology, Haifa, Israel
- CSSB Centre for Structural Systems Biology, Deutsches Elektronen-Synchrotron DESY, Hamburg, Germany
- The Center for Experimental Medicine, Universitätsklinikum Hamburg-Eppendorf (UKE), Hamburg, Germany
- European Molecular Biology Laboratory (EMBL), Hamburg, Germany
| |
|
50
|
Gopinath A, Rath T, Morgner N, Joseph B. Lateral gating mechanism and plasticity of the β-barrel assembly machinery complex in micelles and Escherichia coli. PNAS Nexus 2024; 3:pgae019. [PMID: 38312222 PMCID: PMC10833450 DOI: 10.1093/pnasnexus/pgae019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/09/2023] [Accepted: 01/08/2024] [Indexed: 02/06/2024]
Abstract
The β-barrel assembly machinery (BAM) mediates the folding and insertion of the majority of outer membrane proteins (OMPs) in gram-negative bacteria. BAM is a penta-heterooligomeric complex consisting of the central β-barrel BamA and four interacting lipoproteins BamB, C, D, and E. The conformational switching of BamA between inward-open (IO) and lateral-open (LO) conformations is required for substrate recognition and folding. However, the mechanism for the lateral gating or how the structural details observed in vitro correspond with the cellular environment remains elusive. In this study, we addressed these questions by characterizing the conformational heterogeneity of BamAB, BamACDE, and BamABCDE complexes in detergent micelles and/or Escherichia coli using pulsed dipolar electron spin resonance spectroscopy (PDS). We show that the binding of BamB does not induce any visible changes in BamA, and the BamAB complex exists in the IO conformation. The BamCDE complex induces an IO to LO transition through a coordinated movement along the BamA barrel. However, the extracellular loop 6 (L6) is unaffected by the presence of lipoproteins and exhibits large segmental dynamics extending to the exit pore. PDS experiments with the BamABCDE complex in intact E. coli confirmed the dynamic behavior of both the lateral gate and the L6 in the native environment. Our results demonstrate that the BamCDE complex plays a key role in the function by regulating lateral gating in BamA.
Affiliation(s)
- Aathira Gopinath
- Department of Physics, Freie Universität Berlin, Berlin, 14195, Germany
- Institute of Biophysics, Goethe Universität Frankfurt, Frankfurt, 60438, Germany
| | - Tobias Rath
- Institute of Physical and Theoretical Chemistry, Goethe Universität Frankfurt, Frankfurt, 60438, Germany
| | - Nina Morgner
- Institute of Physical and Theoretical Chemistry, Goethe Universität Frankfurt, Frankfurt, 60438, Germany
| | - Benesh Joseph
- Department of Physics, Freie Universität Berlin, Berlin, 14195, Germany
| |
|