51
|
Winski A, Ludwiczak J, Orlowska M, Madaj R, Kaminski K, Dunin‐Horkawicz S. AlphaFold2 captures the conformational landscape of the HAMP signaling domain. Protein Sci 2024; 33:e4846. [PMID: 38010737 PMCID: PMC10731501 DOI: 10.1002/pro.4846] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Revised: 10/30/2023] [Accepted: 11/19/2023] [Indexed: 11/29/2023]
Abstract
In this study, we present a conformational landscape of 5000 AlphaFold2 models of the Histidine kinases, Adenyl cyclases, Methyl-accepting proteins and Phosphatases (HAMP) domain, a short helical bundle that transduces signals from sensors to effectors in two-component signaling proteins such as sensory histidine kinases and chemoreceptors. The landscape reveals the conformational variability of the HAMP domain, including rotations, shifts, displacements, and tilts of helices, many combinations of which have not been observed in experimental structures. HAMP domains belonging to a single family tend to occupy a defined region of the landscape, even when their sequence similarity is low, suggesting that individual HAMP families have evolved to operate in a specific conformational range. The functional importance of this structural conservation is illustrated by poly-HAMP arrays, in which HAMP domains from families with opposite conformational preferences alternate, consistent with the rotational model of signal transduction. The only poly-HAMP arrays that violate this rule are predicted to be of recent evolutionary origin and structurally unstable. Finally, we identify a family of HAMP domains that are likely to be dynamic due to the presence of a conserved pi-helical bulge. All code associated with this work, including a tool for rapid sequence-based prediction of the rotational state in HAMP domains, is deposited at https://github.com/labstructbioinf/HAMPpred.
Collapse
Affiliation(s)
- Aleksander Winski
- Laboratory of Structural Bioinformatics, Centre of New TechnologiesUniversity of WarsawWarsawPoland
| | - Jan Ludwiczak
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research CentreUniversity of WarsawWarsawPoland
- Present address:
Prescient Design, Genentech Research & Early DevelopmentRoche GroupBaselSwitzerland
| | - Malgorzata Orlowska
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research CentreUniversity of WarsawWarsawPoland
| | - Rafal Madaj
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research CentreUniversity of WarsawWarsawPoland
| | - Kamil Kaminski
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research CentreUniversity of WarsawWarsawPoland
| | - Stanislaw Dunin‐Horkawicz
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research CentreUniversity of WarsawWarsawPoland
- Department of Protein EvolutionMax Planck Institute for Biology TübingenTübingenGermany
| |
Collapse
|
52
|
Kleiman DE, Nadeem H, Shukla D. Adaptive Sampling Methods for Molecular Dynamics in the Era of Machine Learning. J Phys Chem B 2023; 127:10669-10681. [PMID: 38081185 DOI: 10.1021/acs.jpcb.3c04843] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2023]
Abstract
Molecular dynamics (MD) simulations are fundamental computational tools for the study of proteins and their free energy landscapes. However, sampling protein conformational changes through MD simulations is challenging due to the relatively long time scales of these processes. Many enhanced sampling approaches have emerged to tackle this problem, including biased sampling and path-sampling methods. In this Perspective, we focus on adaptive sampling algorithms. These techniques differ from other approaches because the thermodynamic ensemble is preserved and the sampling is enhanced solely by restarting MD trajectories at particularly chosen seeds rather than introducing biasing forces. We begin our treatment with an overview of theoretically transparent methods, where we discuss principles and guidelines for adaptive sampling. Then, we present a brief summary of select methods that have been applied to realistic systems in the past. Finally, we discuss recent advances in adaptive sampling methodology powered by deep learning techniques, as well as their shortcomings.
Collapse
Affiliation(s)
- Diego E Kleiman
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Hassan Nadeem
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Diwakar Shukla
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| |
Collapse
|
53
|
da Silva GM, Cui JY, Dalgarno DC, Lisi GP, Rubenstein BM. Predicting Relative Populations of Protein Conformations without a Physics Engine Using AlphaFold 2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.25.550545. [PMID: 37546747 PMCID: PMC10402055 DOI: 10.1101/2023.07.25.550545] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
This paper presents a novel approach for predicting the relative populations of protein conformations using AlphaFold 2, an AI-powered method that has revolutionized biology by enabling the accurate prediction of protein structures. While AlphaFold 2 has shown exceptional accuracy and speed, it is designed to predict proteins' ground state conformations and is limited in its ability to predict conformational landscapes. Here, we demonstrate how AlphaFold 2 can directly predict the relative populations of different protein conformations by subsampling multiple sequence alignments. We tested our method against NMR experiments on two proteins with drastically different amounts of available sequence data, Abl1 kinase and the granulocyte-macrophage colony-stimulating factor, and predicted changes in their relative state populations with more than 80% accuracy. Our subsampling approach worked best when used to qualitatively predict the effects of mutations or evolution on the conformational landscape and well-populated states of proteins. It thus offers a fast and cost-effective way to predict the relative populations of protein conformations at even single-point mutation resolution, making it a useful tool for pharmacology, NMR analysis, and evolution.
Collapse
Affiliation(s)
- Gabriel Monteiro da Silva
- Brown University Department of Molecular Biology, Cell Biology, and Biochemistry, Providence, RI, USA
| | - Jennifer Y Cui
- Brown University Department of Molecular Biology, Cell Biology, and Biochemistry, Providence, RI, USA
| | | | - George P Lisi
- Brown University Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University Department of Chemistry, Providence, RI, USA
| | - Brenda M Rubenstein
- Brown University Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University Department of Chemistry, Providence, RI, USA
| |
Collapse
|
54
|
Chakravarty D, Schafer JW, Chen EA, Thole JR, Porter LL. AlphaFold2 has more to learn about protein energy landscapes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.12.571380. [PMID: 38168383 PMCID: PMC10760193 DOI: 10.1101/2023.12.12.571380] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
Recent work suggests that AlphaFold2 (AF2)-a deep learning-based model that can accurately infer protein structure from sequence-may discern important features of folded protein energy landscapes, defined by the diversity and frequency of different conformations in the folded state. Here, we test the limits of its predictive power on fold-switching proteins, which assume two structures with regions of distinct secondary and/or tertiary structure. Using several implementations of AF2, including two published enhanced sampling approaches, we generated >280,000 models of 93 fold-switching proteins whose experimentally determined conformations were likely in AF2's training set. Combining all models, AF2 predicted fold switching with a modest success rate of ~25%, indicating that it does not readily sample both experimentally characterized conformations of most fold switchers. Further, AF2's confidence metrics selected against models consistent with experimentally determined fold-switching conformations in favor of inconsistent models. Accordingly, these confidence metrics-though suggested to evaluate protein energetics reliably-did not discriminate between low and high energy states of fold-switching proteins. We then evaluated AF2's performance on seven fold-switching proteins outside of its training set, generating >159,000 models in total. Fold switching was accurately predicted in one of seven targets with moderate confidence. Further, AF2 demonstrated no ability to predict alternative conformations of two newly discovered targets without homologs in the set of 93 fold switchers. These results indicate that AF2 has more to learn about the underlying energetics of protein ensembles and highlight the need for further developments of methods that readily predict multiple protein conformations.
Collapse
Affiliation(s)
- Devlina Chakravarty
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
| | - Joseph W. Schafer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
| | - Ethan A. Chen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
| | - Joseph R. Thole
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
- Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20892
| | - Lauren L. Porter
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
- Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20892
| |
Collapse
|
55
|
Raisinghani N, Alshahrani M, Gupta G, Xiao S, Tao P, Verkhivker G. AlphaFold2-Enabled Atomistic Modeling of Epistatic Binding Mechanisms for the SARS-CoV-2 Spike Omicron XBB.1.5, EG.5 and FLip Variants: Convergent Evolution Hotspots Cooperate to Control Stability and Conformational Adaptability in Balancing ACE2 Binding and Antibody Resistance. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.11.571185. [PMID: 38168257 PMCID: PMC10760024 DOI: 10.1101/2023.12.11.571185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
In this study, we combined AI-based atomistic structural modeling and microsecond molecular simulations of the SARS-CoV-2 Spike complexes with the host receptor ACE2 for XBB.1.5+L455F, XBB.1.5+F456L(EG.5) and XBB.1.5+L455F/F456L (FLip) lineages to examine the mechanisms underlying the role of convergent evolution hotspots in balancing ACE2 binding and antibody evasion. Using the ensemble-based mutational scanning of the spike protein residues and physics-based rigorous computations of binding affinities, we identified binding energy hotspots and characterized molecular basis underlying epistatic couplings between convergent mutational hotspots. Consistent with the experiments, the results revealed the mediating role of Q493 hotspot in synchronization of epistatic couplings between L455F and F456L mutations providing a quantitative insight into the mechanism underlying differences between XBB lineages. Mutational profiling is combined with network-based model of epistatic couplings showing that the Q493, L455 and F456 sites mediate stable communities at the binding interface with ACE2 and can serve as stable mediators of non-additive couplings. Structure-based mutational analysis of Spike protein binding with the class 1 antibodies quantified the critical role of F456L and F486P mutations in eliciting strong immune evasion response. The results of this analysis support a mechanism in which the emergence of EG.5 and FLip variants may have been dictated by leveraging strong epistatic effects between several convergent revolutionary hotspots that provide synergy between the improved ACE2 binding and broad neutralization resistance. This interpretation is consistent with the notion that functionally balanced substitutions which simultaneously optimize immune evasion and high ACE2 affinity may continue to emerge through lineages with beneficial pair or triplet combinations of RBD mutations involving mediators of epistatic couplings and sites in highly adaptable RBD regions.
Collapse
|
56
|
Ramelot TA, Tejero R, Montelione GT. Representing structures of the multiple conformational states of proteins. Curr Opin Struct Biol 2023; 83:102703. [PMID: 37776602 PMCID: PMC10841472 DOI: 10.1016/j.sbi.2023.102703] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 08/18/2023] [Accepted: 08/23/2023] [Indexed: 10/02/2023]
Abstract
Biomolecules exhibit dynamic behavior that single-state models of their structures cannot fully capture. We review some recent advances for investigating multiple conformations of biomolecules, including experimental methods, molecular dynamics simulations, and machine learning. We also address the challenges associated with representing single- and multiple-state models in data archives, with a particular focus on NMR structures. Establishing standardized representations and annotations will facilitate effective communication and understanding of these complex models to the broader scientific community.
Collapse
Affiliation(s)
- Theresa A Ramelot
- Dept of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA.
| | - Roberto Tejero
- Dept of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
| | - Gaetano T Montelione
- Dept of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA.
| |
Collapse
|
57
|
Ahmed M, Maldonado AM, Durrant JD. From Byte to Bench to Bedside: Molecular Dynamics Simulations and Drug Discovery. ARXIV 2023:arXiv:2311.16946v1. [PMID: 38076508 PMCID: PMC10705576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Molecular dynamics (MD) simulations and computer-aided drug design (CADD) have advanced substantially over the past two decades, thanks to continuous computer hardware and software improvements. Given these advancements, MD simulations are poised to become even more powerful tools for investigating the dynamic interactions between potential small-molecule drugs and their target proteins, with significant implications for pharmacological research.
Collapse
Affiliation(s)
- Mayar Ahmed
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Alex M. Maldonado
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Jacob D. Durrant
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| |
Collapse
|
58
|
Vögele M, Zhang BW, Kaindl J, Wang L. Is the Functional Response of a Receptor Determined by the Thermodynamics of Ligand Binding? J Chem Theory Comput 2023; 19:8414-8422. [PMID: 37943175 DOI: 10.1021/acs.jctc.3c00899] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2023]
Abstract
For an effective drug, strong binding to the target protein is a prerequisite, but it is not enough. To produce a particular functional response, drugs need to either block the proteins' functions or modulate their activities by changing their conformational equilibrium. The binding free energy of a compound to its target is routinely calculated, but the timescales for the protein conformational changes are prohibitively long to be efficiently modeled via physics-based simulations. Thermodynamic principles suggest that the binding free energies of the ligands with different receptor conformations may infer their efficacy. However, this hypothesis has not been thoroughly validated. We present an actionable protocol and a comprehensive study to show that binding thermodynamics provides a strong predictor of the efficacy of a ligand. We apply the absolute binding free energy perturbation method to ligands bound to active and inactive states of eight G protein-coupled receptors and a nuclear receptor and then compare the resulting binding free energies. We find that carefully designed restraints are often necessary to efficiently model the corresponding conformational ensembles for each state. Our method achieves unprecedented performance in classifying ligands as agonists or antagonists across the various investigated receptors, all of which are important drug targets.
Collapse
Affiliation(s)
- Martin Vögele
- Schrödinger, Inc., 1540 Broadway 24th Floor, New York, New York 10036, United States
| | - Bin W Zhang
- Schrödinger, Inc., 1540 Broadway 24th Floor, New York, New York 10036, United States
| | - Jonas Kaindl
- Schrödinger GmbH, Glücksteinallee 25, Mannheim 68163, Germany
| | - Lingle Wang
- Schrödinger, Inc., 1540 Broadway 24th Floor, New York, New York 10036, United States
| |
Collapse
|
59
|
Buller R, Lutz S, Kazlauskas RJ, Snajdrova R, Moore JC, Bornscheuer UT. From nature to industry: Harnessing enzymes for biocatalysis. Science 2023; 382:eadh8615. [PMID: 37995253 DOI: 10.1126/science.adh8615] [Citation(s) in RCA: 59] [Impact Index Per Article: 59.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 10/17/2023] [Indexed: 11/25/2023]
Abstract
Biocatalysis harnesses enzymes to make valuable products. This green technology is used in countless applications from bench scale to industrial production and allows practitioners to access complex organic molecules, often with fewer synthetic steps and reduced waste. The last decade has seen an explosion in the development of experimental and computational tools to tailor enzymatic properties, equipping enzyme engineers with the ability to create biocatalysts that perform reactions not present in nature. By using (chemo)-enzymatic synthesis routes or orchestrating intricate enzyme cascades, scientists can synthesize elaborate targets ranging from DNA and complex pharmaceuticals to starch made in vitro from CO2-derived methanol. In addition, new chemistries have emerged through the combination of biocatalysis with transition metal catalysis, photocatalysis, and electrocatalysis. This review highlights recent key developments, identifies current limitations, and provides a future prospect for this rapidly developing technology.
Collapse
Affiliation(s)
- R Buller
- Competence Center for Biocatalysis, Institute of Chemistry and Biotechnology, Zurich University of Applied Sciences, 8820 Wädenswil, Switzerland
| | - S Lutz
- Codexis Incorporated, Redwood City, CA 94063, USA
| | - R J Kazlauskas
- Department of Biochemistry, Molecular Biology and Biophysics, Biotechnology Institute, University of Minnesota, Saint Paul, MN 55108, USA
| | - R Snajdrova
- Novartis Institutes for BioMedical Research, Global Discovery Chemistry, 4056 Basel, Switzerland
| | - J C Moore
- MRL, Merck & Co., Rahway, NJ 07065, USA
| | - U T Bornscheuer
- Institute of Biochemistry, Dept. of Biotechnology and Enzyme Catalysis, Greifswald University, Greifswald, Germany
| |
Collapse
|
60
|
Raisinghani N, Alshahrani M, Gupta G, Xiao S, Tao P, Verkhivker G. Accurate Characterization of Conformational Ensembles and Binding Mechanisms of the SARS-CoV-2 Omicron BA.2 and BA.2.86 Spike Protein with the Host Receptor and Distinct Classes of Antibodies Using AlphaFold2-Augmented Integrative Computational Modeling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.18.567697. [PMID: 38045395 PMCID: PMC10690158 DOI: 10.1101/2023.11.18.567697] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
The latest wave SARS-CoV-2 Omicron variants displayed a growth advantage and the increased viral fitness through convergent evolution of functional hotspots that work synchronously to balance fitness requirements for productive receptor binding and efficient immune evasion. In this study, we combined AlphaFold2-based structural modeling approaches with all-atom MD simulations and mutational profiling of binding energetics and stability for prediction and comprehensive analysis of the structure, dynamics, and binding of the SARS-CoV-2 Omicron BA.2.86 spike variant with ACE2 host receptor and distinct classes of antibodies. We adapted several AlphaFold2 approaches to predict both structure and conformational ensembles of the Omicron BA.2.86 spike protein in the complex with the host receptor. The results showed that AlphaFold2-predicted conformational ensemble of the BA.2.86 spike protein complex can accurately capture the main dynamics signatures obtained from microscond molecular dynamics simulations. The ensemble-based dynamic mutational scanning of the receptor binding domain residues in the BA.2 and BA.2.86 spike complexes with ACE2 dissected the role of the BA.2 and BA.2.86 backgrounds in modulating binding free energy changes revealing a group of conserved hydrophobic hotspots and critical variant-specific contributions of the BA.2.86 mutational sites R403K, F486P and R493Q. To examine immune evasion properties of BA.2.86 in atomistic detail, we performed large scale structure-based mutational profiling of the S protein binding interfaces with distinct classes of antibodies that displayed significantly reduced neutralization against BA.2.86 variant. The results quantified specific function of the BA.2.86 mutations to ensure broad resistance against different classes of RBD antibodies. This study revealed the molecular basis of compensatory functional effects of the binding hotspots, showing that BA.2.86 lineage may have primarily evolved to improve immune escape while modulating binding affinity with ACE2 through cooperative effect of R403K, F486P and R493Q mutations. The study supports a hypothesis that the impact of the increased ACE2 binding affinity on viral fitness is more universal and is mediated through cross-talk between convergent mutational hotspots, while the effect of immune evasion could be more variant-dependent.
Collapse
|
61
|
Varadi M, Tsenkov M, Velankar S. Challenges in bridging the gap between protein structure prediction and functional interpretation. Proteins 2023. [PMID: 37850517 DOI: 10.1002/prot.26614] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 09/26/2023] [Accepted: 10/04/2023] [Indexed: 10/19/2023]
Abstract
The rapid evolution of protein structure prediction tools has significantly broadened access to protein structural data. Although predicted structure models have the potential to accelerate and impact fundamental and translational research significantly, it is essential to note that they are not validated and cannot be considered the ground truth. Thus, challenges persist, particularly in capturing protein dynamics, predicting multi-chain structures, interpreting protein function, and assessing model quality. Interdisciplinary collaborations are crucial to overcoming these obstacles. Databases like the AlphaFold Protein Structure Database, the ESM Metagenomic Atlas, and initiatives like the 3D-Beacons Network provide FAIR access to these data, enabling their interpretation and application across a broader scientific community. Whilst substantial advancements have been made in protein structure prediction, further progress is required to address the remaining challenges. Developing training materials, nurturing collaborations, and ensuring open data sharing will be paramount in this pursuit. The continued evolution of these tools and methodologies will deepen our understanding of protein function and accelerate disease pathogenesis and drug development discoveries.
Collapse
Affiliation(s)
- Mihaly Varadi
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Maxim Tsenkov
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, UK
| |
Collapse
|
62
|
Guo X, Chen R, Cao L. Molecular docking-based virtual screening and dynamics simulation study of novel and potential SIRT7 inhibitors. Chem Biol Drug Des 2023; 102:707-717. [PMID: 37287091 DOI: 10.1111/cbdd.14277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Revised: 05/11/2023] [Accepted: 05/22/2023] [Indexed: 06/09/2023]
Abstract
In cancer cells, short for sirtuin (SIRT7) stabilizes the transformed state via its nicotinamide adenine dinucleotide (NAD+)-dependent deacetylase activity. Epigenetic factor SIRT7 plays important roles in cancer biology, reversing cancer phenotypes and suppressing tumor growth when inactive. In the present study, we got the SIRT7 protein structure from Alpha Fold2 Database and performed structure-based virtual screening to develop specific SIRT7 inhibitors using the SIRT7 inhibitor 97,491 interaction mechanism. As candidates for specific SIRT7 inhibitors, compounds with high affinities to SIRT7 were chosen. ZINC000001910616 and ZINC000014708529, two of our leading compounds, showed strong interactions with SIRT7. Our MD simulation results also revealed that the 5-hydroxy-4H-thioxen-4-one group and terminal carboxyl group were critical groups responsible for interaction of small molecules with SIRT7. In our study, we demonstrated that targeting SIRT7 may offer novel therapeutic options for cancer treatment. Compounds ZINC000001910616 and ZINC000014708529 can serve as chemical probes to investigate SIRT7 biological functions and provide starting points for the development of novel therapeutics against cancers.
Collapse
Affiliation(s)
- Xinli Guo
- Department of Operating Room, Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Hangzhou, China
| | - Rui Chen
- State Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi, China
| | - Liping Cao
- Department of General Surgery, Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Hangzhou, China
- Zhejiang Engineering Research Center of Cognitive Healthcare, Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Hangzhou, China
| |
Collapse
|
63
|
Shui S, Buckley S, Scheller L, Correia BE. Rational design of small-molecule responsive protein switches. Protein Sci 2023; 32:e4774. [PMID: 37656809 PMCID: PMC10510469 DOI: 10.1002/pro.4774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 08/26/2023] [Accepted: 08/29/2023] [Indexed: 09/03/2023]
Abstract
Small-molecule responsive protein switches are powerful tools for controlling cellular processes. These switches are designed to respond rapidly and specifically to their inducer. They have been used in numerous applications, including the regulation of gene expression, post-translational protein modification, and signal transduction. Typically, small-molecule responsive protein switches consist of two proteins that interact with each other in the presence or absence of a small molecule. Recent advances in computational protein design already contributed to the development of protein switches with an expanded range of small-molecule inducers and increasingly sophisticated switch mechanisms. Further progress in the engineering of small-molecule responsive switches is fueled by cutting-edge computational design approaches, which will enable more complex and precise control over cellular processes and advance synthetic biology applications in biotechnology and medicine. Here, we discuss recent milestones and how technological advances are impacting the development of chemical switches.
Collapse
Affiliation(s)
- Sailan Shui
- Laboratory of Protein Design and Immunoengineering (LPDI)STI, EPFLLausanneSwitzerland
- Swiss Institute of Bioinformatics (SIB)LausanneSwitzerland
| | - Stephen Buckley
- Laboratory of Protein Design and Immunoengineering (LPDI)STI, EPFLLausanneSwitzerland
- Swiss Institute of Bioinformatics (SIB)LausanneSwitzerland
| | - Leo Scheller
- Laboratory of Protein Design and Immunoengineering (LPDI)STI, EPFLLausanneSwitzerland
- Swiss Institute of Bioinformatics (SIB)LausanneSwitzerland
| | - Bruno E. Correia
- Laboratory of Protein Design and Immunoengineering (LPDI)STI, EPFLLausanneSwitzerland
- Swiss Institute of Bioinformatics (SIB)LausanneSwitzerland
| |
Collapse
|
64
|
Kao TY, Chiang YW. DEERefiner-assisted structural refinement using pulsed dipolar spectroscopy: a study on multidrug transporter LmrP. Phys Chem Chem Phys 2023; 25:24508-24517. [PMID: 37656008 DOI: 10.1039/d3cp02569a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
Abstract
Pulsed dipolar spectroscopy, such as double electron-electron resonance (DEER), has been underutilized in protein structure determination, despite its ability to provide valuable spatial information. In this study, we present DEERefiner, a user-friendly MATLAB-based GUI program that enables the modeling of protein structures by combining an initial structure and DEER distance restraints. We illustrate the effectiveness of DEERefiner by successfully modeling the ligand-dependent conformational changes of the proton-drug antiporter LmrP to an extracellular-open-like conformation with an impressive precision of 0.76 Å. Additionally, DEERefiner was able to uncover a previously hypothesized but experimentally unresolved proton-dependent conformation of LmrP, characterized as an extracellular-closed/partially intracellular-open conformation, with a precision of 1.16 Å. Our work not only highlights the ability of DEER spectroscopy to model protein structures but also reveals the potential of DEERefiner to advance the field by providing an accessible and applicable tool for precise protein structure modeling, thereby paving the way for deeper insights into protein function.
Collapse
Affiliation(s)
- Te-Yu Kao
- Department of Chemistry, National Tsing Hua University, Hsinchu 300-044, Taiwan.
| | - Yun-Wei Chiang
- Department of Chemistry, National Tsing Hua University, Hsinchu 300-044, Taiwan.
| |
Collapse
|
65
|
Schafer JW, Porter LL. Evolutionary selection of proteins with two folds. Nat Commun 2023; 14:5478. [PMID: 37673981 PMCID: PMC10482954 DOI: 10.1038/s41467-023-41237-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 08/24/2023] [Indexed: 09/08/2023] Open
Abstract
Although most globular proteins fold into a single stable structure, an increasing number have been shown to remodel their secondary and tertiary structures in response to cellular stimuli. State-of-the-art algorithms predict that these fold-switching proteins adopt only one stable structure, missing their functionally critical alternative folds. Why these algorithms predict a single fold is unclear, but all of them infer protein structure from coevolved amino acid pairs. Here, we hypothesize that coevolutionary signatures are being missed. Suspecting that single-fold variants could be masking these signatures, we developed an approach, called Alternative Contact Enhancement (ACE), to search both highly diverse protein superfamilies-composed of single-fold and fold-switching variants-and protein subfamilies with more fold-switching variants. ACE successfully revealed coevolution of amino acid pairs uniquely corresponding to both conformations of 56/56 fold-switching proteins from distinct families. Then, we used ACE-derived contacts to (1) predict two experimentally consistent conformations of a candidate protein with unsolved structure and (2) develop a blind prediction pipeline for fold-switching proteins. The discovery of widespread dual-fold coevolution indicates that fold-switching sequences have been preserved by natural selection, implying that their functionalities provide evolutionary advantage and paving the way for predictions of diverse protein structures from single sequences.
Collapse
Affiliation(s)
- Joseph W Schafer
- National Library of Medicine, National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Lauren L Porter
- National Library of Medicine, National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA.
- National Heart, Lung, and Blood Institute, Biochemistry and Biophysics Center, National Institutes of Health, Bethesda, MD, 20892, USA.
| |
Collapse
|
66
|
Herrington NB, Stein D, Li YC, Pandey G, Schlessinger A. Exploring the Druggable Conformational Space of Protein Kinases Using AI-Generated Structures. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.31.555779. [PMID: 37693436 PMCID: PMC10491245 DOI: 10.1101/2023.08.31.555779] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Protein kinase function and interactions with drugs are controlled in part by the movement of the DFG and ɑC-Helix motifs, which enable kinases to adopt various conformational states. Small molecule ligands elicit therapeutic effects with distinct selectivity profiles and residence times that often depend on the kinase conformation(s) they bind. However, the limited availability of experimentally determined structural data for kinases in inactive states restricts drug discovery efforts for this major protein family. Modern AI-based structural modeling methods hold potential for exploring the previously experimentally uncharted druggable conformational space for kinases. Here, we first evaluated the currently explored conformational space of kinases in the PDB and models generated by AlphaFold2 (AF2) (1) and ESMFold (2), two prominent AI-based structure prediction methods. We then investigated AF2's ability to predict kinase structures in different conformations at various multiple sequence alignment (MSA) depths, based on this parameter's ability to explore conformational diversity. Our results showed a bias within the PDB and predicted structural models generated by AF2 and ESMFold toward structures of kinases in the active state over alternative conformations, particularly those conformations controlled by the DFG motif. Finally, we demonstrate that predicting kinase structures using AF2 at lower MSA depths allows the exploration of the space of these alternative conformations, including identifying previously unobserved conformations for 398 kinases. The results of our analysis of structural modeling by AF2 create a new avenue for the pursuit of new therapeutic agents against a notoriously difficult-to-target family of proteins. Significance Statement Greater abundance of kinase structural data in inactive conformations, currently lacking in structural databases, would improve our understanding of how protein kinases function and expand drug discovery and development for this family of therapeutic targets. Modern approaches utilizing artificial intelligence and machine learning have potential for efficiently capturing novel protein conformations. We provide evidence for a bias within AlphaFold2 and ESMFold to predict structures of kinases in their active states, similar to their overrepresentation in the PDB. We show that lowering the AlphaFold2 algorithm's multiple sequence alignment depth can help explore kinase conformational space more broadly. It can also enable the prediction of hundreds of kinase structures in novel conformations, many of whose models are likely viable for drug discovery.
Collapse
|
67
|
Simpkin AJ, Caballero I, McNicholas S, Stevenson K, Jiménez E, Sánchez Rodríguez F, Fando M, Uski V, Ballard C, Chojnowski G, Lebedev A, Krissinel E, Usón I, Rigden DJ, Keegan RM. Predicted models and CCP4. Acta Crystallogr D Struct Biol 2023; 79:806-819. [PMID: 37594303 PMCID: PMC10478639 DOI: 10.1107/s2059798323006289] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 07/19/2023] [Indexed: 08/19/2023] Open
Abstract
In late 2020, the results of CASP14, the 14th event in a series of competitions to assess the latest developments in computational protein structure-prediction methodology, revealed the giant leap forward that had been made by Google's Deepmind in tackling the prediction problem. The level of accuracy in their predictions was the first instance of a competitor achieving a global distance test score of better than 90 across all categories of difficulty. This achievement represents both a challenge and an opportunity for the field of experimental structural biology. For structure determination by macromolecular X-ray crystallography, access to highly accurate structure predictions is of great benefit, particularly when it comes to solving the phase problem. Here, details of new utilities and enhanced applications in the CCP4 suite, designed to allow users to exploit predicted models in determining macromolecular structures from X-ray diffraction data, are presented. The focus is mainly on applications that can be used to solve the phase problem through molecular replacement.
Collapse
Affiliation(s)
- Adam J. Simpkin
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Iracema Caballero
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona, Spain
| | - Stuart McNicholas
- York Structural Biology Laboratory, Department of Chemistry, The University of York, York YO10 5DD, United Kingdom
| | - Kyle Stevenson
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Elisabet Jiménez
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona, Spain
| | - Filomeno Sánchez Rodríguez
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
- York Structural Biology Laboratory, Department of Chemistry, The University of York, York YO10 5DD, United Kingdom
| | - Maria Fando
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Ville Uski
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Charles Ballard
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Grzegorz Chojnowski
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
| | - Andrey Lebedev
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Eugene Krissinel
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Isabel Usón
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona, Spain
- ICREA, Institució Catalana de Recerca i Estudis Avançats, Passeig Lluís Companys 23, 08003 Barcelona, Spain
| | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Ronan M. Keegan
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| |
Collapse
|
68
|
Sinclair M, Stein RA, Sheehan JH, Hawes EM, O'Brien RM, Tajkhorshid E, Claxton DP. Molecular mechanisms of catalytic inhibition for active site mutations in glucose-6-phosphatase catalytic subunit 1 linked to glycogen storage disease. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.13.532485. [PMID: 36993754 PMCID: PMC10054992 DOI: 10.1101/2023.03.13.532485] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Mediating the terminal reaction of gluconeogenesis and glycogenolysis, the integral membrane protein G6PC1 regulates hepatic glucose production by catalyzing hydrolysis of glucose-6-phosphate (G6P) within the lumen of the endoplasmic reticulum. Consistent with its vital contribution to glucose homeostasis, inactivating mutations in G6PC1 cause glycogen storage disease (GSD) type 1a characterized by hepatomegaly and severe hypoglycemia. Despite its physiological importance, the structural basis of G6P binding to G6PC1 and the molecular disruptions induced by missense mutations within the active site that give rise to GSD type 1a are unknown. Exploiting a computational model of G6PC1 derived from the groundbreaking structure prediction algorithm AlphaFold2 (AF2), we combine molecular dynamics (MD) simulations and computational predictions of thermodynamic stability with a robust in vitro screening platform to define the atomic interactions governing G6P binding as well as explore the energetic perturbations imposed by disease-linked variants. We identify a collection of side chains, including conserved residues from the signature phosphatidic acid phosphatase motif, that contribute to a hydrogen bonding and van der Waals network stabilizing G6P in the active site. Introduction of GSD type 1a mutations into the G6PC1 sequence elicits changes in G6P binding energy, thermostability and structural properties, suggesting multiple pathways of catalytic impairment. Our results, which corroborate the high quality of the AF2 model as a guide for experimental design and to interpret outcomes, not only confirm active site structural organization but also suggest novel mechanistic contributions of catalytic and non-catalytic side chains.
Collapse
|
69
|
Brown BP, Stein RA, Meiler J, Mchaourab H. Approximating conformational Boltzmann distributions with AlphaFold2 predictions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.06.552168. [PMID: 37609301 PMCID: PMC10441281 DOI: 10.1101/2023.08.06.552168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
Protein dynamics are intimately tied to biological function and can enable processes such as signal transduction, enzyme catalysis, and molecular recognition. The relative free energies of conformations that contribute to these functional equilibria are evolved for the physiology of the organism. Despite the importance of these equilibria for understanding biological function and developing treatments for disease, the computational and experimental methods capable of quantifying them are limited to systems of modest size. Here, we demonstrate that AlphaFold2 contact distance distributions can approximate conformational Boltzmann distributions, which we evaluate through examination of the joint probability distributions of inter-residue contact distances along functionally relevant collective variables of several protein systems. Further, we show that contact distance probability distributions generated by AlphaFold2 are sensitive to points mutations thus AF2 can predict the structural effects of mutations in some systems. We anticipate that our approach will be a valuable tool to model the thermodynamics of conformational changes in large biomolecular systems.
Collapse
Affiliation(s)
- Benjamin P. Brown
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
| | - Richard A. Stein
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Department of Molecular Physiology and Biophysics, Vanderbilt University School of Medicine, Nashville, TN, USA. Nashville, TN 37232, USA
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Institute for Drug Discovery, Leipzig University Medical School, Leipzig, SAC 04103, Germany
| | - Hassane Mchaourab
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Department of Molecular Physiology and Biophysics, Vanderbilt University School of Medicine, Nashville, TN, USA. Nashville, TN 37232, USA
| |
Collapse
|
70
|
Sala D, Engelberger F, Mchaourab HS, Meiler J. Modeling conformational states of proteins with AlphaFold. Curr Opin Struct Biol 2023; 81:102645. [PMID: 37392556 DOI: 10.1016/j.sbi.2023.102645] [Citation(s) in RCA: 56] [Impact Index Per Article: 56.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 05/16/2023] [Accepted: 06/01/2023] [Indexed: 07/03/2023]
Abstract
Many proteins exert their function by switching among different structures. Knowing the conformational ensembles affiliated with these states is critical to elucidate key mechanistic aspects that govern protein function. While experimental determination efforts are still bottlenecked by cost, time, and technical challenges, the machine-learning technology AlphaFold showed near experimental accuracy in predicting the three-dimensional structure of monomeric proteins. However, an AlphaFold ensemble of models usually represents a single conformational state with minimal structural heterogeneity. Consequently, several pipelines have been proposed to either expand the structural breadth of an ensemble or bias the prediction toward a desired conformational state. Here, we analyze how those pipelines work, what they can and cannot predict, and future directions.
Collapse
Affiliation(s)
- D Sala
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany. https://twitter.com/sala_davide
| | - F Engelberger
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany. https://twitter.com/fengel97
| | - H S Mchaourab
- Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, TN, USA. https://twitter.com/Mchaourablab
| | - J Meiler
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany; Center for Structural Biology, Vanderbilt University, Nashville, TN 37240, USA; Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), Dresden/Leipzig, Germany.
| |
Collapse
|
71
|
Mondal A, Lenz S, MacCallum JL, Perez A. Hybrid computational methods combining experimental information with molecular dynamics. Curr Opin Struct Biol 2023; 81:102609. [PMID: 37224642 DOI: 10.1016/j.sbi.2023.102609] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Revised: 04/12/2023] [Accepted: 04/23/2023] [Indexed: 05/26/2023]
Abstract
A goal of structural biology is to understand how macromolecules carry out their biological roles by identifying their metastable states, mechanisms of action, pathways leading to conformational changes, and the thermodynamic and kinetic relationships between those states. Integrative modeling brings structural insights into systems where traditional structure determination approaches cannot help. We focus on the synergies and challenges of integrative modeling combining experimental data with molecular dynamics simulations.
Collapse
Affiliation(s)
- Arup Mondal
- Quantum Theory Project, Department of Chemistry, University of Florida, Leigh, UK. https://twitter.com/@amondal_chem
| | - Stefan Lenz
- Department of Chemistry, University of Calgary, 2500 University Drive, Canada
| | - Justin L MacCallum
- Department of Chemistry, University of Calgary, 2500 University Drive, Canada. https://twitter.com/@jlmaccal
| | - Alberto Perez
- Quantum Theory Project, Department of Chemistry, University of Florida, Leigh, UK.
| |
Collapse
|
72
|
Habeck M. Bayesian methods in integrative structure modeling. Biol Chem 2023; 404:741-754. [PMID: 37505205 DOI: 10.1515/hsz-2023-0145] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 07/07/2023] [Indexed: 07/29/2023]
Abstract
There is a growing interest in characterizing the structure and dynamics of large biomolecular assemblies and their interactions within the cellular environment. A diverse array of experimental techniques allows us to study biomolecular systems on a variety of length and time scales. These techniques range from imaging with light, X-rays or electrons, to spectroscopic methods, cross-linking mass spectrometry and functional genomics approaches, and are complemented by AI-assisted protein structure prediction methods. A challenge is to integrate all of these data into a model of the system and its functional dynamics. This review focuses on Bayesian approaches to integrative structure modeling. We sketch the principles of Bayesian inference, highlight recent applications to integrative modeling and conclude with a discussion of current challenges and future perspectives.
Collapse
Affiliation(s)
- Michael Habeck
- Microscopic Image Analysis Group, Jena University Hospital, D-07743 Jena, Germany
- Max Planck Institute for Multidisciplinary Sciences, d-37077 Göttingen, Germany
| |
Collapse
|
73
|
Casadevall G, Duran C, Osuna S. AlphaFold2 and Deep Learning for Elucidating Enzyme Conformational Flexibility and Its Application for Design. JACS AU 2023; 3:1554-1562. [PMID: 37388680 PMCID: PMC10302747 DOI: 10.1021/jacsau.3c00188] [Citation(s) in RCA: 14] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 05/22/2023] [Accepted: 05/22/2023] [Indexed: 07/01/2023]
Abstract
The recent success of AlphaFold2 (AF2) and other deep learning (DL) tools in accurately predicting the folded three-dimensional (3D) structure of proteins and enzymes has revolutionized the structural biology and protein design fields. The 3D structure indeed reveals key information on the arrangement of the catalytic machinery of enzymes and which structural elements gate the active site pocket. However, comprehending enzymatic activity requires a detailed knowledge of the chemical steps involved along the catalytic cycle and the exploration of the multiple thermally accessible conformations that enzymes adopt when in solution. In this Perspective, some of the recent studies showing the potential of AF2 in elucidating the conformational landscape of enzymes are provided. Selected examples of the key developments of AF2-based and DL methods for protein design are discussed, as well as a few enzyme design cases. These studies show the potential of AF2 and DL for allowing the routine computational design of efficient enzymes.
Collapse
Affiliation(s)
- Guillem Casadevall
- Institut
de Química Computacional i Catàlisi (IQCC) and Departament
de Química, Universitat de Girona, Maria Aurèlia Capmany 69, 17003 Girona, Spain
| | - Cristina Duran
- Institut
de Química Computacional i Catàlisi (IQCC) and Departament
de Química, Universitat de Girona, Maria Aurèlia Capmany 69, 17003 Girona, Spain
| | - Sílvia Osuna
- Institut
de Química Computacional i Catàlisi (IQCC) and Departament
de Química, Universitat de Girona, Maria Aurèlia Capmany 69, 17003 Girona, Spain
- ICREA, Passeig Lluís Companys 23, 08010 Barcelona, Spain
| |
Collapse
|
74
|
Nussinov R, Zhang M, Liu Y, Jang H. AlphaFold, allosteric, and orthosteric drug discovery: Ways forward. Drug Discov Today 2023; 28:103551. [PMID: 36907321 PMCID: PMC10238671 DOI: 10.1016/j.drudis.2023.103551] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 02/27/2023] [Accepted: 03/07/2023] [Indexed: 03/13/2023]
Abstract
Drug discovery is arguably a highly challenging and significant interdisciplinary aim. The stunning success of the artificial intelligence-powered AlphaFold, whose latest version is buttressed by an innovative machine-learning approach that integrates physical and biological knowledge about protein structures, raised drug discovery hopes that unsurprisingly, have not come to bear. Even though accurate, the models are rigid, including the drug pockets. AlphaFold's mixed performance poses the question of how its power can be harnessed in drug discovery. Here we discuss possible ways of going forward wielding its strengths, while bearing in mind what AlphaFold can and cannot do. For kinases and receptors, an input enriched in active (ON) state models can better AlphaFold's chance of rational drug design success.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA; Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel.
| | - Mingzhen Zhang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
| | - Yonglan Liu
- Cancer Innovation Laboratory, National Cancer Institute, Frederick, MD 21702, USA
| | - Hyunbum Jang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
| |
Collapse
|
75
|
Escobedo N, Monzon AM, Fornasari MS, Palopoli N, Parisi G. Combining Protein Conformational Diversity and Phylogenetic Information Using CoDNaS and CoDNaS-Q. Curr Protoc 2023; 3:e764. [PMID: 37184204 DOI: 10.1002/cpz1.764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
CoDNaS (http://ufq.unq.edu.ar/codnas/) and CoDNaS-Q (http://ufq.unq.edu.ar/codnasq) are repositories of proteins with different degrees of conformational diversity. Following the ensemble nature of the native state, conformational diversity represents the structural differences between the conformers in the ensemble. Each entry in CoDNaS and CoDNaS-Q contains a redundant collection of experimentally determined conformers obtained under different conditions. These conformers represent snapshots of the protein dynamism. While CoDNaS contains examples of conformational diversity at the tertiary level, a recent development, CoDNaS-Q, contains examples at the quaternary level. In the emerging age of accurate protein structure prediction by machine learning approaches, many questions remain open regarding the characterization of protein dynamism. In this context, most bioinformatics resources take advantage of distinct features derived from protein alignments, however, the complexity and heterogeneity of information makes it difficult to recover reliable biological signatures. Here we present five protocols to explore tertiary and quaternary conformational diversity at the individual protein level as well as for the characterization of the distribution of conformational diversity at the protein family level in a phylogenetic context. These protocols can provide curated protein families with experimentally known conformational diversity, facilitating the exploration of sequence determinants of protein dynamism. © 2023 Wiley Periodicals LLC. Basic Protocol 1: Assessing conformational diversity with CoDNaS Alternate Protocol 1: Assessing conformational diversity at the quaternary level with CoDNaS-Q Basic Protocol 2: Exploring conformational diversity in a protein family Alternate Protocol 2: Exploring quaternary conformational diversity in a protein family Basic Protocol 3: Representing conformational diversity in a phylogenetic context.
Collapse
Affiliation(s)
- Nahuel Escobedo
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | | | - María Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| |
Collapse
|
76
|
McDonnell RT, Patel N, Wehrspan ZJ, Elcock AH. Atomic Models of All Major Trans-Envelope Complexes Involved in Lipid Trafficking in Escherichia Coli Constructed Using a Combination of AlphaFold2, AF2Complex, and Membrane Morphing Simulations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.28.538765. [PMID: 37162969 PMCID: PMC10168319 DOI: 10.1101/2023.04.28.538765] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
In Gram-negative bacteria, several trans-envelope complexes (TECs) have been identified that span the periplasmic space in order to facilitate lipid transport between the inner- and outer- membranes. While partial or near-complete structures of some of these TECs have been solved by conventional experimental techniques, most remain incomplete. Here we describe how a combination of computational approaches, constrained by experimental data, can be used to build complete atomic models for four TECs implicated in lipid transport in Escherichia coli . We use DeepMind's protein structure prediction algorithm, AlphaFold2, and a variant of it designed to predict protein complexes, AF2Complex, to predict the oligomeric states of key components of TECs and their likely interfaces with other components. After obtaining initial models of the complete TECs by superimposing predicted structures of subcomplexes, we use the membrane orientation prediction algorithm OPM to predict the likely orientations of the inner- and outer- membrane components in each TEC. Since, in all cases, the predicted membrane orientations in these initial models are tilted relative to each other, we devise a novel molecular mechanics-based strategy that we call "membrane morphing" that adjusts each TEC model until the two membranes are properly aligned with each other and separated by a distance consistent with estimates of the periplasmic width in E. coli . The study highlights the potential power of combining computational methods, operating within limits set by both experimental data and by cell physiology, for producing useable atomic structures of very large protein complexes.
Collapse
|
77
|
Koehler Leman J, Künze G. Recent Advances in NMR Protein Structure Prediction with ROSETTA. Int J Mol Sci 2023; 24:ijms24097835. [PMID: 37175539 PMCID: PMC10178863 DOI: 10.3390/ijms24097835] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 04/15/2023] [Accepted: 04/21/2023] [Indexed: 05/15/2023] Open
Abstract
Nuclear magnetic resonance (NMR) spectroscopy is a powerful method for studying the structure and dynamics of proteins in their native state. For high-resolution NMR structure determination, the collection of a rich restraint dataset is necessary. This can be difficult to achieve for proteins with high molecular weight or a complex architecture. Computational modeling techniques can complement sparse NMR datasets (<1 restraint per residue) with additional structural information to elucidate protein structures in these difficult cases. The Rosetta software for protein structure modeling and design is used by structural biologists for structure determination tasks in which limited experimental data is available. This review gives an overview of the computational protocols available in the Rosetta framework for modeling protein structures from NMR data. We explain the computational algorithms used for the integration of different NMR data types in Rosetta. We also highlight new developments, including modeling tools for data from paramagnetic NMR and hydrogen-deuterium exchange, as well as chemical shifts in CS-Rosetta. Furthermore, strategies are discussed to complement and improve structure predictions made by the current state-of-the-art AlphaFold2 program using NMR-guided Rosetta modeling.
Collapse
Affiliation(s)
- Julia Koehler Leman
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY 10010, USA
| | - Georg Künze
- Institute for Drug Discovery, Medical Faculty, University of Leipzig, Brüderstr. 34, D-04103 Leipzig, Germany
- Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstr. 16-18, D-04107 Leipzig, Germany
| |
Collapse
|
78
|
Nguyen PH, Sterpone F, Derreumaux P. Metastable alpha-rich and beta-rich conformations of small Aβ42 peptide oligomers. Proteins 2023. [PMID: 37038252 DOI: 10.1002/prot.26495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 02/15/2023] [Accepted: 03/23/2023] [Indexed: 04/12/2023]
Abstract
Probing the structures of amyloid-β (Aβ) peptides in the early steps of aggregation is extremely difficult experimentally and computationally. Yet, this knowledge is extremely important as small oligomers are the most toxic species. Experiments and simulations on Aβ42 monomer point to random coil conformations with either transient helical or β-strand content. Our current conformational description of small Aβ42 oligomers is funneled toward amorphous aggregates with some β-sheet content and rare high energy states with well-ordered assemblies of β-sheets. In this study, we emphasize another view based on metastable α-helix bundle oligomers spanning the C-terminal residues, which are predicted by the machine-learning AlphaFold2 method and supported indirectly by low-resolution experimental data on many amyloid polypeptides. This finding has consequences in developing novel chemical tools and to design potential therapies to reduce aggregation and toxicity.
Collapse
Affiliation(s)
- Phuong H Nguyen
- Laboratoire de Biochimie Théorique, UPR 9080, CNRS, Université Paris Cité, Paris, France
- Institut de Biologie Physico-Chimique, Fondation Edmond de Rothschild, 13 rue Pierre et Marie Curie, Paris, 75005, France
| | - Fabio Sterpone
- Laboratoire de Biochimie Théorique, UPR 9080, CNRS, Université Paris Cité, Paris, France
- Institut de Biologie Physico-Chimique, Fondation Edmond de Rothschild, 13 rue Pierre et Marie Curie, Paris, 75005, France
| | - Philippe Derreumaux
- Laboratoire de Biochimie Théorique, UPR 9080, CNRS, Université Paris Cité, Paris, France
- Institut de Biologie Physico-Chimique, Fondation Edmond de Rothschild, 13 rue Pierre et Marie Curie, Paris, 75005, France
- Institut Universitaire de France (IUF), Paris, 75005, France
| |
Collapse
|
79
|
Meller A, Bhakat S, Solieva S, Bowman GR. Accelerating Cryptic Pocket Discovery Using AlphaFold. J Chem Theory Comput 2023. [PMID: 36948209 PMCID: PMC10373493 DOI: 10.1021/acs.jctc.2c01189] [Citation(s) in RCA: 32] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/24/2023]
Abstract
Cryptic pockets, or pockets absent in ligand-free, experimentally determined structures, hold great potential as drug targets. However, cryptic pocket openings are often beyond the reach of conventional biomolecular simulations because certain cryptic pocket openings involve slow motions. Here, we investigate whether AlphaFold can be used to accelerate cryptic pocket discovery either by generating structures with open pockets directly or generating structures with partially open pockets that can be used as starting points for simulations. We use AlphaFold to generate ensembles for 10 known cryptic pocket examples, including five that were deposited after AlphaFold's training data were extracted from the PDB. We find that in 6 out of 10 cases AlphaFold samples the open state. For plasmepsin II, an aspartic protease from the causative agent of malaria, AlphaFold only captures a partial pocket opening. As a result, we ran simulations from an ensemble of AlphaFold-generated structures and show that this strategy samples cryptic pocket opening, even though an equivalent amount of simulations launched from a ligand-free experimental structure fails to do so. Markov state models (MSMs) constructed from the AlphaFold-seeded simulations quickly yield a free energy landscape of cryptic pocket opening that is in good agreement with the same landscape generated with well-tempered metadynamics. Taken together, our results demonstrate that AlphaFold has a useful role to play in cryptic pocket discovery but that many cryptic pockets may remain difficult to sample using AlphaFold alone.
Collapse
Affiliation(s)
- Artur Meller
- Department of Biochemistry and Molecular Biophysics, Washington University in St. Louis, 660 S. Euclid Ave., St. Louis, Missouri 63110, United States
- Medical Scientist Training Program, Washington University in St. Louis, 660 S. Euclid Ave., St. Louis, Missouri 63110, United States
| | - Soumendranath Bhakat
- Department of Biochemistry and Molecular Biophysics, Washington University in St. Louis, 660 S. Euclid Ave., St. Louis, Missouri 63110, United States
- Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - Shahlo Solieva
- Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - Gregory R Bowman
- Department of Biochemistry and Molecular Biophysics, Washington University in St. Louis, 660 S. Euclid Ave., St. Louis, Missouri 63110, United States
- Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| |
Collapse
|
80
|
Agajanian S, Alshahrani M, Bai F, Tao P, Verkhivker GM. Exploring and Learning the Universe of Protein Allostery Using Artificial Intelligence Augmented Biophysical and Computational Approaches. J Chem Inf Model 2023; 63:1413-1428. [PMID: 36827465 PMCID: PMC11162550 DOI: 10.1021/acs.jcim.2c01634] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/26/2023]
Abstract
Allosteric mechanisms are commonly employed regulatory tools used by proteins to orchestrate complex biochemical processes and control communications in cells. The quantitative understanding and characterization of allosteric molecular events are among major challenges in modern biology and require integration of innovative computational experimental approaches to obtain atomistic-level knowledge of the allosteric states, interactions, and dynamic conformational landscapes. The growing body of computational and experimental studies empowered by emerging artificial intelligence (AI) technologies has opened up new paradigms for exploring and learning the universe of protein allostery from first principles. In this review we analyze recent developments in high-throughput deep mutational scanning of allosteric protein functions; applications and latest adaptations of Alpha-fold structural prediction methods for studies of protein dynamics and allostery; new frontiers in integrating machine learning and enhanced sampling techniques for characterization of allostery; and recent advances in structural biology approaches for studies of allosteric systems. We also highlight recent computational and experimental studies of the SARS-CoV-2 spike (S) proteins revealing an important and often hidden role of allosteric regulation driving functional conformational changes, binding interactions with the host receptor, and mutational escape mechanisms of S proteins which are critical for viral infection. We conclude with a summary and outlook of future directions suggesting that AI-augmented biophysical and computer simulation approaches are beginning to transform studies of protein allostery toward systematic characterization of allosteric landscapes, hidden allosteric states, and mechanisms which may bring about a new revolution in molecular biology and drug discovery.
Collapse
Affiliation(s)
- Steve Agajanian
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
| | - Mohammed Alshahrani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
| | - Fang Bai
- Shanghai Institute for Advanced Immunochemical Studies, School of Life Science and Technology and Information Science and Technology, Shanghai Tech University, 393 Middle Huaxia Road, Shanghai 201210, China
| | - Peng Tao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75205, United States
| | - Gennady M Verkhivker
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
- Department of Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, Irvine, California 92618, United States
| |
Collapse
|
81
|
Sala D, Batebi H, Ledwitch K, Hildebrand PW, Meiler J. Targeting in silico GPCR conformations with ultra-large library screening for hit discovery. Trends Pharmacol Sci 2023; 44:150-161. [PMID: 36669974 PMCID: PMC9974811 DOI: 10.1016/j.tips.2022.12.006] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 12/23/2022] [Accepted: 12/27/2022] [Indexed: 01/20/2023]
Abstract
The use of deep machine learning (ML) in protein structure prediction has made it possible to easily access a large number of annotated conformations that can potentially compensate for missing experimental structures in structure-based drug discovery (SBDD). However, it is still unclear whether the accuracy of these predicted conformations is sufficient for screening chemical compounds that will effectively interact with a protein target for pharmacological purposes. In this opinion article, we examine the potential benefits and limitations of using state-annotated conformations for ultra-large library screening (ULLS) in light of the growing size of ultra-large libraries (ULLs). We believe that targeting different conformational states of common drug targets like G-protein-coupled receptors (GPCRs), which can regulate human physiology by switching between different conformations, can offer multiple advantages.
Collapse
Affiliation(s)
- D Sala
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany
| | - H Batebi
- Institute of Medical Physics and Biophysics, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany
| | - K Ledwitch
- Center for Structural Biology, Vanderbilt University, Nashville, TN 37240, USA; Department of Chemistry, Vanderbilt University, Nashville, TN 37235, USA
| | - P W Hildebrand
- Institute of Medical Physics and Biophysics, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany
| | - J Meiler
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany; Center for Structural Biology, Vanderbilt University, Nashville, TN 37240, USA; Department of Chemistry, Vanderbilt University, Nashville, TN 37235, USA.
| |
Collapse
|
82
|
Chakravarty D, Schafer JW, Porter LL. Distinguishing features of fold-switching proteins. Protein Sci 2023; 32:e4596. [PMID: 36782353 PMCID: PMC9951197 DOI: 10.1002/pro.4596] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 01/30/2023] [Accepted: 02/09/2023] [Indexed: 02/15/2023]
Abstract
Though many folded proteins assume one stable structure that performs one function, a small-but-increasing number remodel their secondary and tertiary structures and change their functions in response to cellular stimuli. These fold-switching proteins regulate biological processes and are associated with autoimmune dysfunction, severe acute respiratory syndrome coronavirus-2 infection, and more. Despite their biological importance, it is difficult to computationally predict fold switching. With the aim of advancing computational prediction and experimental characterization of fold switchers, this review discusses several features that distinguish fold-switching proteins from their single-fold and intrinsically disordered counterparts. First, the isolated structures of fold switchers are less stable and more heterogeneous than single folders but more stable and less heterogeneous than intrinsically disordered proteins (IDPs). Second, the sequences of single fold, fold switching, and intrinsically disordered proteins can evolve at distinct rates. Third, proteins from these three classes are best predicted using different computational techniques. Finally, late-breaking results suggest that single folders, fold switchers, and IDPs have distinct patterns of residue-residue coevolution. The review closes by discussing high-throughput and medium-throughput experimental approaches that might be used to identify new fold-switching proteins.
Collapse
Affiliation(s)
- Devlina Chakravarty
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of HealthBethesdaMarylandUSA
| | - Joseph W. Schafer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of HealthBethesdaMarylandUSA
| | - Lauren L. Porter
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of HealthBethesdaMarylandUSA
- Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of HealthBethesdaMarylandUSA
| |
Collapse
|
83
|
Sala D, Hildebrand PW, Meiler J. Biasing AlphaFold2 to predict GPCRs and kinases with user-defined functional or structural properties. Front Mol Biosci 2023; 10:1121962. [PMID: 36876042 PMCID: PMC9978208 DOI: 10.3389/fmolb.2023.1121962] [Citation(s) in RCA: 28] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Accepted: 01/31/2023] [Indexed: 02/18/2023] Open
Abstract
Determining the three-dimensional structure of proteins in their native functional states has been a longstanding challenge in structural biology. While integrative structural biology has been the most effective way to get a high-accuracy structure of different conformations and mechanistic insights for larger proteins, advances in deep machine-learning algorithms have paved the way to fully computational predictions. In this field, AlphaFold2 (AF2) pioneered ab initio high-accuracy single-chain modeling. Since then, different customizations have expanded the number of conformational states accessible through AF2. Here, we further expanded AF2 with the aim of enriching an ensemble of models with user-defined functional or structural features. We tackled two common protein families for drug discovery, G-protein-coupled receptors (GPCRs) and kinases. Our approach automatically identifies the best templates satisfying the specified features and combines those with genetic information. We also introduced the possibility of shuffling the selected templates to expand the space of solutions. In our benchmark, models showed the intended bias and great accuracy. Our protocol can thus be exploited for modeling user-defined conformational states in an automatic fashion.
Collapse
Affiliation(s)
- Davide Sala
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, Leipzig, Germany
| | - Peter W. Hildebrand
- Institute of Medical Physics and Biophysics, Faculty of Medicine, University of Leipzig, Leipzig, Germany
| | - Jens Meiler
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, Leipzig, Germany
- Center for Structural Biology, Vanderbilt University, Nashville, TN, United States
- Department of Chemistry, Vanderbilt University, Nashville, TN, United States
| |
Collapse
|
84
|
Duran-Frigola M, Cigler M, Winter GE. Advancing Targeted Protein Degradation via Multiomics Profiling and Artificial Intelligence. J Am Chem Soc 2023; 145:2711-2732. [PMID: 36706315 PMCID: PMC9912273 DOI: 10.1021/jacs.2c11098] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2022] [Indexed: 01/28/2023]
Abstract
Only around 20% of the human proteome is considered to be druggable with small-molecule antagonists. This leaves some of the most compelling therapeutic targets outside the reach of ligand discovery. The concept of targeted protein degradation (TPD) promises to overcome some of these limitations. In brief, TPD is dependent on small molecules that induce the proximity between a protein of interest (POI) and an E3 ubiquitin ligase, causing ubiquitination and degradation of the POI. In this perspective, we want to reflect on current challenges in the field, and discuss how advances in multiomics profiling, artificial intelligence, and machine learning (AI/ML) will be vital in overcoming them. The presented roadmap is discussed in the context of small-molecule degraders but is equally applicable for other emerging proximity-inducing modalities.
Collapse
Affiliation(s)
- Miquel Duran-Frigola
- CeMM
Research Center for Molecular Medicine of the Austrian Academy of
Sciences, 1090 Vienna, Austria
- Ersilia
Open Source Initiative, 28 Belgrave Road, CB1 3DE, Cambridge, United Kingdom
| | - Marko Cigler
- CeMM
Research Center for Molecular Medicine of the Austrian Academy of
Sciences, 1090 Vienna, Austria
| | - Georg E. Winter
- CeMM
Research Center for Molecular Medicine of the Austrian Academy of
Sciences, 1090 Vienna, Austria
| |
Collapse
|
85
|
Brender JR, Ramamoorthy A, Gursky O, Bhunia A. Intrinsic disorder and structural biology: Searching where the light isn't. Biophys Chem 2023; 292:106912. [PMID: 36335754 DOI: 10.1016/j.bpc.2022.106912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Jeffrey R Brender
- Radiation Biology Branch, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD, USA.
| | - Ayyalusamy Ramamoorthy
- Biophysics, Department of Chemistry, Biomedical Engineering, and Macromolecular Science and Engineering, University of Michigan, Ann Arbor, MI 48109-1055, USA
| | - Olga Gursky
- Boston University School of Medicine, Department of Physiology & Biophysics, W302, 700 Albany St, Boston, MA 02118, USA
| | - Anirban Bhunia
- Biomolecular NMR and Drug Design Laboratory, Department of Biophysics, Bose Institute, P-1/12 CIT Scheme VII (M), Kolkata 700054, India
| |
Collapse
|
86
|
Terwilliger TC, Poon BK, Afonine PV, Schlicksup CJ, Croll TI, Millán C, Richardson JS, Read RJ, Adams PD. Improved AlphaFold modeling with implicit experimental information. Nat Methods 2022; 19:1376-1382. [PMID: 36266465 PMCID: PMC9636017 DOI: 10.1038/s41592-022-01645-6] [Citation(s) in RCA: 64] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2022] [Accepted: 09/09/2022] [Indexed: 12/02/2022]
Abstract
Machine-learning prediction algorithms such as AlphaFold and RoseTTAFold can create remarkably accurate protein models, but these models usually have some regions that are predicted with low confidence or poor accuracy. We hypothesized that by implicitly including new experimental information such as a density map, a greater portion of a model could be predicted accurately, and that this might synergistically improve parts of the model that were not fully addressed by either machine learning or experiment alone. An iterative procedure was developed in which AlphaFold models are automatically rebuilt on the basis of experimental density maps and the rebuilt models are used as templates in new AlphaFold predictions. We show that including experimental information improves prediction beyond the improvement obtained with simple rebuilding guided by the experimental data. This procedure for AlphaFold modeling with density has been incorporated into an automated procedure for interpretation of crystallographic and electron cryo-microscopy maps.
Collapse
Affiliation(s)
- Thomas C Terwilliger
- New Mexico Consortium, Los Alamos, NM, USA.
- Los Alamos National Laboratory, Los Alamos, NM, USA.
| | - Billy K Poon
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Pavel V Afonine
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Christopher J Schlicksup
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Tristan I Croll
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, UK
| | - Claudia Millán
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, UK
| | | | - Randy J Read
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, UK
| | - Paul D Adams
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Department of Bioengineering, University of California, Berkeley, CA, USA
| |
Collapse
|
87
|
Cellier MFM. Nramp: Deprive and conquer? Front Cell Dev Biol 2022; 10:988866. [PMID: 36313567 PMCID: PMC9606685 DOI: 10.3389/fcell.2022.988866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Accepted: 09/20/2022] [Indexed: 11/13/2022] Open
Abstract
Solute carriers 11 (Slc11) evolved from bacterial permease (MntH) to eukaryotic antibacterial defense (Nramp) while continuously mediating proton (H+)-dependent manganese (Mn2+) import. Also, Nramp horizontal gene transfer (HGT) toward bacteria led to mntH polyphyly. Prior demonstration that evolutionary rate-shifts distinguishing Slc11 from outgroup carriers dictate catalytic specificity suggested that resolving Slc11 family tree may provide a function-aware phylogenetic framework. Hence, MntH C (MC) subgroups resulted from HGTs of prototype Nramp (pNs) parologs while archetype Nramp (aNs) correlated with phagocytosis. PHI-Blast based taxonomic profiling confirmed MntH B phylogroup is confined to anaerobic bacteria vs. MntH A (MA)’s broad distribution; suggested niche-related spread of MC subgroups; established that MA-variant MH, which carries ‘eukaryotic signature’ marks, predominates in archaea. Slc11 phylogeny shows MH is sister to Nramp. Site-specific analysis of Slc11 charge network known to interact with the protonmotive force demonstrates sequential rate-shifts that recapitulate Slc11 evolution. 3D mapping of similarly coevolved sites across Slc11 hydrophobic core revealed successive targeting of discrete areas. The data imply that pN HGT could advantage recipient bacteria for H+-dependent Mn2+ acquisition and Alphafold 3D models suggest conformational divergence among MC subgroups. It is proposed that Slc11 originated as a bacterial stress resistance function allowing Mn2+-dependent persistence in conditions adverse for growth, and that archaeal MH could contribute to eukaryogenesis as a Mn2+ sequestering defense perhaps favoring intracellular growth-competent bacteria.
Collapse
|