51
|
Raisinghani N, Alshahrani M, Gupta G, Xiao S, Tao P, Verkhivker G. Predicting Functional Conformational Ensembles and Binding Mechanisms of Convergent Evolution for SARS-CoV-2 Spike Omicron Variants Using AlphaFold2 Sequence Scanning Adaptations and Molecular Dynamics Simulations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.02.587850. [PMID: 38617283 PMCID: PMC11014522 DOI: 10.1101/2024.04.02.587850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/16/2024]
Abstract
In this study, we combined AlphaFold-based approaches for atomistic modeling of multiple protein states and microsecond molecular simulations to accurately characterize conformational ensembles and binding mechanisms of convergent evolution for the SARS-CoV-2 Spike Omicron variants BA.1, BA.2, BA.2.75, BA.3, BA.4/BA.5 and BQ.1.1. We employed and validated several different adaptations of the AlphaFold methodology for modeling of conformational ensembles including the introduced randomized full sequence scanning for manipulation of sequence variations to systematically explore conformational dynamics of Omicron Spike protein complexes with the ACE2 receptor. Microsecond atomistic molecular dynamic simulations provide a detailed characterization of the conformational landscapes and thermodynamic stability of the Omicron variant complexes. By integrating the predictions of conformational ensembles from different AlphaFold adaptations and applying statistical confidence metrics we can expand characterization of the conformational ensembles and identify functional protein conformations that determine the equilibrium dynamics for the Omicron Spike complexes with the ACE2. Conformational ensembles of the Omicron RBD-ACE2 complexes obtained using AlphaFold-based approaches for modeling protein states and molecular dynamics simulations are employed for accurate comparative prediction of the binding energetics revealing an excellent agreement with the experimental data. In particular, the results demonstrated that AlphaFold-generated extended conformational ensembles can produce accurate binding energies for the Omicron RBD-ACE2 complexes. The results of this study suggested complementarities and potential synergies between AlphaFold predictions of protein conformational ensembles and molecular dynamics simulations showing that integrating information from both methods can potentially yield a more adequate characterization of the conformational landscapes for the Omicron RBD-ACE2 complexes. This study provides insights in the interplay between conformational dynamics and binding, showing that evolution of Omicron variants through acquisition of convergent mutational sites may leverage conformational adaptability and dynamic couplings between key binding energy hotspots to optimize ACE2 binding affinity and enable immune evasion.
Collapse
|
52
|
Monteiro da Silva G, Cui JY, Dalgarno DC, Lisi GP, Rubenstein BM. High-throughput prediction of protein conformational distributions with subsampled AlphaFold2. Nat Commun 2024; 15:2464. [PMID: 38538622 PMCID: PMC10973385 DOI: 10.1038/s41467-024-46715-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 02/28/2024] [Indexed: 04/12/2024] Open
Abstract
This paper presents an innovative approach for predicting the relative populations of protein conformations using AlphaFold 2, an AI-powered method that has revolutionized biology by enabling the accurate prediction of protein structures. While AlphaFold 2 has shown exceptional accuracy and speed, it is designed to predict proteins' ground state conformations and is limited in its ability to predict conformational landscapes. Here, we demonstrate how AlphaFold 2 can directly predict the relative populations of different protein conformations by subsampling multiple sequence alignments. We tested our method against nuclear magnetic resonance experiments on two proteins with drastically different amounts of available sequence data, Abl1 kinase and the granulocyte-macrophage colony-stimulating factor, and predicted changes in their relative state populations with more than 80% accuracy. Our subsampling approach worked best when used to qualitatively predict the effects of mutations or evolution on the conformational landscape and well-populated states of proteins. It thus offers a fast and cost-effective way to predict the relative populations of protein conformations at even single-point mutation resolution, making it a useful tool for pharmacology, analysis of experimental results, and predicting evolution.
Collapse
Affiliation(s)
| | - Jennifer Y Cui
- Brown University Department of Molecular and Cell Biology and Biochemistry, Providence, RI, USA
| | | | - George P Lisi
- Brown University Department of Molecular and Cell Biology and Biochemistry, Providence, RI, USA
- Brown University Department of Chemistry, Providence, RI, USA
| | - Brenda M Rubenstein
- Brown University Department of Molecular and Cell Biology and Biochemistry, Providence, RI, USA.
- Brown University Department of Chemistry, Providence, RI, USA.
| |
Collapse
|
53
|
Carbery A, Buttenschoen M, Skyner R, von Delft F, Deane CM. Learnt representations of proteins can be used for accurate prediction of small molecule binding sites on experimentally determined and predicted protein structures. J Cheminform 2024; 16:32. [PMID: 38486231 PMCID: PMC10941399 DOI: 10.1186/s13321-024-00821-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Accepted: 03/01/2024] [Indexed: 03/17/2024] Open
Abstract
Protein-ligand binding site prediction is a useful tool for understanding the functional behaviour and potential drug-target interactions of a novel protein of interest. However, most binding site prediction methods are tested by providing crystallised ligand-bound (holo) structures as input. This testing regime is insufficient to understand the performance on novel protein targets where experimental structures are not available. An alternative option is to provide computationally predicted protein structures, but this is not commonly tested. However, due to the training data used, computationally-predicted protein structures tend to be extremely accurate, and are often biased toward a holo conformation. In this study we describe and benchmark IF-SitePred, a protein-ligand binding site prediction method which is based on the labelling of ESM-IF1 protein language model embeddings combined with point cloud annotation and clustering. We show that not only is IF-SitePred competitive with state-of-the-art methods when predicting binding sites on experimental structures, but it performs better on proxies for novel proteins where low accuracy has been simulated by molecular dynamics. Finally, IF-SitePred outperforms other methods if ensembles of predicted protein structures are generated.
Collapse
Affiliation(s)
- Anna Carbery
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford, OX1 3LB, UK
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
| | - Martin Buttenschoen
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford, OX1 3LB, UK
| | - Rachael Skyner
- OMass Therapeutics, Building 4000, Chancellor Court, John Smith Drive, ARC Oxford, OX4 2GX, UK
| | - Frank von Delft
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Centre for Medicines Discovery, University of Oxford, Oxford, OX3 7DQ, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot, OX11 0FA, United Kingdom
- Department of Biochemistry, University of Johannesburg, Johannesburg, 2006, South Africa
| | - Charlotte M Deane
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford, OX1 3LB, UK.
| |
Collapse
|
54
|
Raisinghani N, Alshahrani M, Gupta G, Xiao S, Tao P, Verkhivker G. AlphaFold2-Enabled Atomistic Modeling of Structure, Conformational Ensembles, and Binding Energetics of the SARS-CoV-2 Omicron BA.2.86 Spike Protein with ACE2 Host Receptor and Antibodies: Compensatory Functional Effects of Binding Hotspots in Modulating Mechanisms of Receptor Binding and Immune Escape. J Chem Inf Model 2024; 64:1657-1681. [PMID: 38373700 DOI: 10.1021/acs.jcim.3c01857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2024]
Abstract
The latest wave of SARS-CoV-2 Omicron variants displayed a growth advantage and increased viral fitness through convergent evolution of functional hotspots that work synchronously to balance fitness requirements for productive receptor binding and efficient immune evasion. In this study, we combined AlphaFold2-based structural modeling approaches with atomistic simulations and mutational profiling of binding energetics and stability for prediction and comprehensive analysis of the structure, dynamics, and binding of the SARS-CoV-2 Omicron BA.2.86 spike variant with ACE2 host receptor and distinct classes of antibodies. We adapted several AlphaFold2 approaches to predict both the structure and conformational ensembles of the Omicron BA.2.86 spike protein in the complex with the host receptor. The results showed that the AlphaFold2-predicted structural ensemble of the BA.2.86 spike protein complex with ACE2 can accurately capture the main conformational states of the Omicron variant. Complementary to AlphaFold2 structural predictions, microsecond molecular dynamics simulations reveal the details of the conformational landscape and produced equilibrium ensembles of the BA.2.86 structures that are used to perform mutational scanning of spike residues and characterize structural stability and binding energy hotspots. The ensemble-based mutational profiling of the receptor binding domain residues in the BA.2 and BA.2.86 spike complexes with ACE2 revealed a group of conserved hydrophobic hotspots and critical variant-specific contributions of the BA.2.86 convergent mutational hotspots R403K, F486P, and R493Q. To examine the immune evasion properties of BA.2.86 in atomistic detail, we performed structure-based mutational profiling of the spike protein binding interfaces with distinct classes of antibodies that displayed significantly reduced neutralization against the BA.2.86 variant. The results revealed the molecular basis of compensatory functional effects of the binding hotspots, showing that BA.2.86 lineage may have evolved to outcompete other Omicron subvariants by improving immune evasion while preserving binding affinity with ACE2 via through a compensatory effect of R493Q and F486P convergent mutational hotspots. This study demonstrated that an integrative approach combining AlphaFold2 predictions with complementary atomistic molecular dynamics simulations and robust ensemble-based mutational profiling of spike residues can enable accurate and comprehensive characterization of structure, dynamics, and binding mechanisms of newly emerging Omicron variants.
Collapse
Affiliation(s)
- Nishank Raisinghani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States of America
| | - Mohammed Alshahrani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States of America
| | - Grace Gupta
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States of America
| | - Sian Xiao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75275, United States of America
| | - Peng Tao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75275, United States of America
| | - Gennady Verkhivker
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States of America
- Department of Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, Irvine, California 92618, United States of America
| |
Collapse
|
55
|
Jänes J, Beltrao P. Deep learning for protein structure prediction and design-progress and applications. Mol Syst Biol 2024; 20:162-169. [PMID: 38291232 PMCID: PMC10912668 DOI: 10.1038/s44320-024-00016-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 12/21/2023] [Accepted: 01/11/2024] [Indexed: 02/01/2024] Open
Abstract
Proteins are the key molecular machines that orchestrate all biological processes of the cell. Most proteins fold into three-dimensional shapes that are critical for their function. Studying the 3D shape of proteins can inform us of the mechanisms that underlie biological processes in living cells and can have practical applications in the study of disease mutations or the discovery of novel drug treatments. Here, we review the progress made in sequence-based prediction of protein structures with a focus on applications that go beyond the prediction of single monomer structures. This includes the application of deep learning methods for the prediction of structures of protein complexes, different conformations, the evolution of protein structures and the application of these methods to protein design. These developments create new opportunities for research that will have impact across many areas of biomedical research.
Collapse
Affiliation(s)
- Jürgen Jänes
- Institute of Molecular Systems Biology, ETH Zürich, 8093, Zürich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Pedro Beltrao
- Institute of Molecular Systems Biology, ETH Zürich, 8093, Zürich, Switzerland.
- Swiss Institute of Bioinformatics, Lausanne, Switzerland.
| |
Collapse
|
56
|
Yao H, Wang X, Chi J, Chen H, Liu Y, Yang J, Yu J, Ruan Y, Xiang X, Pi J, Xu JF. Exploring Novel Antidepressants Targeting G Protein-Coupled Receptors and Key Membrane Receptors Based on Molecular Structures. Molecules 2024; 29:964. [PMID: 38474476 DOI: 10.3390/molecules29050964] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 01/29/2024] [Accepted: 02/09/2024] [Indexed: 03/14/2024] Open
Abstract
Major Depressive Disorder (MDD) is a complex mental disorder that involves alterations in signal transmission across multiple scales and structural abnormalities. The development of effective antidepressants (ADs) has been hindered by the dominance of monoamine hypothesis, resulting in slow progress. Traditional ADs have undesirable traits like delayed onset of action, limited efficacy, and severe side effects. Recently, two categories of fast-acting antidepressant compounds have surfaced, dissociative anesthetics S-ketamine and its metabolites, as well as psychedelics such as lysergic acid diethylamide (LSD). This has led to structural research and drug development of the receptors that they target. This review provides breakthroughs and achievements in the structure of depression-related receptors and novel ADs based on these. Cryo-electron microscopy (cryo-EM) has enabled researchers to identify the structures of membrane receptors, including the N-methyl-D-aspartate receptor (NMDAR) and the 5-hydroxytryptamine 2A (5-HT2A) receptor. These high-resolution structures can be used for the development of novel ADs using virtual drug screening (VDS). Moreover, the unique antidepressant effects of 5-HT1A receptors in various brain regions, and the pivotal roles of the α-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid receptor (AMPAR) and tyrosine kinase receptor 2 (TrkB) in regulating synaptic plasticity, emphasize their potential as therapeutic targets. Using structural information, a series of highly selective ADs were designed based on the different role of receptors in MDD. These molecules have the favorable characteristics of rapid onset and low adverse drug reactions. This review offers researchers guidance and a methodological framework for the structure-based design of ADs.
Collapse
Affiliation(s)
- Hanbo Yao
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Xiaodong Wang
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Jiaxin Chi
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Haorong Chen
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Yilin Liu
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Jiayi Yang
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Jiaqi Yu
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Yongdui Ruan
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
| | - Xufu Xiang
- The Key Laboratory for Biomedical Photonics of MOE at Wuhan National Laboratory for Optoelectronics-Hubei Bioinformatics and Molecular Imaging Key Laboratory, Systems Biology Theme, Department of Biomedical Engineering, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Jiang Pi
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| | - Jun-Fa Xu
- Guangdong Provincial Key Laboratory of Medical Molecular Diagnostics, The First Dongguan Affiliated Hospital, Guangdong Medical University, Dongguan 523808, China
- Institute of Laboratory Medicine, School of Medical Technology, Guangdong Medical University, Dongguan 523808, China
| |
Collapse
|
57
|
Raisinghani N, Alshahrani M, Gupta G, Tian H, Xiao S, Tao P, Verkhivker G. Interpretable Atomistic Prediction and Functional Analysis of Conformational Ensembles and Allosteric States in Protein Kinases Using AlphaFold2 Adaptation with Randomized Sequence Scanning and Local Frustration Profiling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.15.580591. [PMID: 38496487 PMCID: PMC10942451 DOI: 10.1101/2024.02.15.580591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
The groundbreaking achievements of AlphaFold2 (AF2) approaches in protein structure modeling marked a transformative era in structural biology. Despite the success of AF2 tools in predicting single protein structures, these methods showed intrinsic limitations in predicting multiple functional conformations of allosteric proteins and fold-switching systems. The recent NMR-based structural determination of the unbound ABL kinase in the active state and two inactive low-populated functional conformations that are unique for ABL kinase presents an ideal challenge for AF2 approaches. In the current study we employ several implementations of AF2 methods to predict protein conformational ensembles and allosteric states of the ABL kinase including (a) multiple sequence alignments (MSA) subsampling approach; (b) SPEACH_AF approach in which alanine scanning is performed on generated MSAs; and (c) introduced in this study randomized full sequence mutational scanning for manipulation of sequence variations combined with the MSA subsampling. We show that the proposed AF2 adaptation combined with local frustration mapping of conformational states enable accurate prediction of the ABL active and intermediate structures and conformational ensembles, also offering a robust approach for interpretable characterization of the AF2 predictions and limitations in detecting hidden allosteric states. We found that the large high frustration residue clusters are uniquely characteristic of the low-populated, fully inactive ABL form and can define energetically frustrated cracking sites of conformational transitions, presenting difficult targets for AF2 methods. This study uncovered previously unappreciated, fundamental connections between distinct patterns of local frustration in functional kinase states and AF2 successes/limitations in detecting low-populated frustrated conformations, providing a better understanding of benefits and limitations of current AF2-based adaptations in modeling of conformational ensembles.
Collapse
|
58
|
Brown BP, Stein RA, Meiler J, Mchaourab HS. Approximating Projections of Conformational Boltzmann Distributions with AlphaFold2 Predictions: Opportunities and Limitations. J Chem Theory Comput 2024; 20:1434-1447. [PMID: 38215214 PMCID: PMC10867840 DOI: 10.1021/acs.jctc.3c01081] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 12/13/2023] [Accepted: 12/13/2023] [Indexed: 01/14/2024]
Abstract
Protein thermodynamics is intimately tied to biological function and can enable processes such as signal transduction, enzyme catalysis, and molecular recognition. The relative free energies of conformations that contribute to these functional equilibria evolved for the physiology of the organism. Despite the importance of these equilibria for understanding biological function and developing treatments for disease, computational and experimental methods capable of quantifying the energetic determinants of these equilibria are limited to systems of modest size. Recently, it has been demonstrated that the artificial intelligence system AlphaFold2 can be manipulated to produce structurally valid protein conformational ensembles. Here, we extend these studies and explore the extent to which AlphaFold2 contact distance distributions can approximate projections of the conformational Boltzmann distributions. For this purpose, we examine the joint probability distributions of inter-residue contact distances along functionally relevant collective variables of several protein systems. Our studies suggest that AlphaFold2 normalized contact distance distributions can correlate with conformation probabilities obtained with other methods but that they suffer from peak broadening. We also find that the AlphaFold2 contact distance distributions can be sensitive to point mutations. Overall, we anticipate that our findings will be valuable as the community seeks to model the thermodynamics of conformational changes in large biomolecular systems.
Collapse
Affiliation(s)
- Benjamin P. Brown
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37232, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37232, United States
- Center
for Applied AI in Protein Dynamics, Vanderbilt
University, Nashville, Tennessee 37232, United States
| | - Richard A. Stein
- Center
for Applied AI in Protein Dynamics, Vanderbilt
University, Nashville, Tennessee 37232, United States
- Department
of Molecular Physiology and Biophysics, Vanderbilt University School of Medicine, Nashville, Tennessee 37232, United States
| | - Jens Meiler
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37232, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37232, United States
- Center
for Applied AI in Protein Dynamics, Vanderbilt
University, Nashville, Tennessee 37232, United States
- Institute
for Drug Discovery, Leipzig University Medical
School, Leipzig, SAC 04103, Germany
| | - Hassane S. Mchaourab
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37232, United States
- Center
for Applied AI in Protein Dynamics, Vanderbilt
University, Nashville, Tennessee 37232, United States
- Department
of Molecular Physiology and Biophysics, Vanderbilt University School of Medicine, Nashville, Tennessee 37232, United States
| |
Collapse
|
59
|
Choudhury J, Yonezawa K, Anu A, Shimizu N, Chaudhuri B. SAXS/WAXS data of conformationally flexible ribose binding protein. Data Brief 2024; 52:109932. [PMID: 38178847 PMCID: PMC10764985 DOI: 10.1016/j.dib.2023.109932] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Revised: 12/03/2023] [Accepted: 12/05/2023] [Indexed: 01/06/2024] Open
Abstract
Modern artificial intelligence-based protein structure prediction methods, such as Alphafold2, can predict structures of folded proteins with reasonable accuracy. However, Alphafold2 provides a static view of a protein, which does not show the conformational variability of the protein, domain movement in a multi-domain protein, or ligand-induced conformational changes it might undergo in solution. Small-angle X-ay scattering (SAXS) and wide-angle X-ray scattering (WAXS) are solution techniques that can aid in integrative modeling of conformationally flexible proteins, or in validating their predicted ensemble structures. While SAXS is sensitive to global structural features, WAXS can expand the scope of structural modeling by including information about local structural changes. We present SAXS and WAXS datasets obtained from conformationally flexible d-ribose binding protein (RBP) from Escherichia coli in the ribose bound and unbound forms. SAXS/WAXS datasets of RBP provided here may aid in method development efforts for more accurate prediction of structural ensembles of conformationally flexible proteins, and their conformational changes.
Collapse
Affiliation(s)
- Jagrity Choudhury
- GN Ramachandran Protein Center, CSIR Institute of Microbial Technology, Chandigarh 160036, India
| | - Kento Yonezawa
- Structural Biology Research Center, Institute of Materials Structure Science, High Energy Accelerator Research Organization (KEK), 1-1 Oho, Tsukuba, Ibaraki 305-0801, Japan
| | - Anu Anu
- GN Ramachandran Protein Center, CSIR Institute of Microbial Technology, Chandigarh 160036, India
- Academy of Scientific and Innovative Research (AcSIR), Anusandhan Bhawan, 2 Rafi Marg, New Delhi 110001, India
| | - Nobutaka Shimizu
- Structural Biology Research Center, Institute of Materials Structure Science, High Energy Accelerator Research Organization (KEK), 1-1 Oho, Tsukuba, Ibaraki 305-0801, Japan
- Photon Factory, Institute of Materials Structure Science, High Energy Accelerator Research Organization (KEK), 1-1 Oho, Tsukuba, Ibaraki 305-0801, Japan
| | - Barnali Chaudhuri
- GN Ramachandran Protein Center, CSIR Institute of Microbial Technology, Chandigarh 160036, India
- Academy of Scientific and Innovative Research (AcSIR), Anusandhan Bhawan, 2 Rafi Marg, New Delhi 110001, India
| |
Collapse
|
60
|
Li H, Sun X, Cui W, Xu M, Dong J, Ekundayo BE, Ni D, Rao Z, Guo L, Stahlberg H, Yuan S, Vogel H. Computational drug development for membrane protein targets. Nat Biotechnol 2024; 42:229-242. [PMID: 38361054 DOI: 10.1038/s41587-023-01987-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Accepted: 09/13/2023] [Indexed: 02/17/2024]
Abstract
The application of computational biology in drug development for membrane protein targets has experienced a boost from recent developments in deep learning-driven structure prediction, increased speed and resolution of structure elucidation, machine learning structure-based design and the evaluation of big data. Recent protein structure predictions based on machine learning tools have delivered surprisingly reliable results for water-soluble and membrane proteins but have limitations for development of drugs that target membrane proteins. Structural transitions of membrane proteins have a central role during transmembrane signaling and are often influenced by therapeutic compounds. Resolving the structural and functional basis of dynamic transmembrane signaling networks, especially within the native membrane or cellular environment, remains a central challenge for drug development. Tackling this challenge will require an interplay between experimental and computational tools, such as super-resolution optical microscopy for quantification of the molecular interactions of cellular signaling networks and their modulation by potential drugs, cryo-electron microscopy for determination of the structural transitions of proteins in native cell membranes and entire cells, and computational tools for data analysis and prediction of the structure and function of cellular signaling networks, as well as generation of promising drug candidates.
Collapse
Affiliation(s)
- Haijian Li
- Center for Computer-Aided Drug Discovery, Faculty of Pharmaceutical Sciences, Shenzhen Institute of Advanced Technology/Chinese Academy of Sciences (SIAT/CAS), Shenzhen, China
| | - Xiaolin Sun
- Center for Computer-Aided Drug Discovery, Faculty of Pharmaceutical Sciences, Shenzhen Institute of Advanced Technology/Chinese Academy of Sciences (SIAT/CAS), Shenzhen, China
| | - Wenqiang Cui
- Center for Computer-Aided Drug Discovery, Faculty of Pharmaceutical Sciences, Shenzhen Institute of Advanced Technology/Chinese Academy of Sciences (SIAT/CAS), Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Marc Xu
- Center for Computer-Aided Drug Discovery, Faculty of Pharmaceutical Sciences, Shenzhen Institute of Advanced Technology/Chinese Academy of Sciences (SIAT/CAS), Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Junlin Dong
- Center for Computer-Aided Drug Discovery, Faculty of Pharmaceutical Sciences, Shenzhen Institute of Advanced Technology/Chinese Academy of Sciences (SIAT/CAS), Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Babatunde Edukpe Ekundayo
- Laboratory of Biological Electron Microscopy, IPHYS, SB, EPFL and Department of Fundamental Microbiology, Faculty of Biology and Medicine, University of Lausanne, Lausanne, Switzerland
| | - Dongchun Ni
- Laboratory of Biological Electron Microscopy, IPHYS, SB, EPFL and Department of Fundamental Microbiology, Faculty of Biology and Medicine, University of Lausanne, Lausanne, Switzerland
| | - Zhili Rao
- Center for Computer-Aided Drug Discovery, Faculty of Pharmaceutical Sciences, Shenzhen Institute of Advanced Technology/Chinese Academy of Sciences (SIAT/CAS), Shenzhen, China
| | - Liwei Guo
- Center for Computer-Aided Drug Discovery, Faculty of Pharmaceutical Sciences, Shenzhen Institute of Advanced Technology/Chinese Academy of Sciences (SIAT/CAS), Shenzhen, China
| | - Henning Stahlberg
- Laboratory of Biological Electron Microscopy, IPHYS, SB, EPFL and Department of Fundamental Microbiology, Faculty of Biology and Medicine, University of Lausanne, Lausanne, Switzerland.
| | - Shuguang Yuan
- Center for Computer-Aided Drug Discovery, Faculty of Pharmaceutical Sciences, Shenzhen Institute of Advanced Technology/Chinese Academy of Sciences (SIAT/CAS), Shenzhen, China.
| | - Horst Vogel
- Center for Computer-Aided Drug Discovery, Faculty of Pharmaceutical Sciences, Shenzhen Institute of Advanced Technology/Chinese Academy of Sciences (SIAT/CAS), Shenzhen, China.
- Institut des Sciences et Ingénierie Chimiques (ISIC), Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland.
| |
Collapse
|
61
|
Stein RA, Mchaourab HS. Rosetta Energy Analysis of AlphaFold2 models: Point Mutations and Conformational Ensembles. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.09.05.556364. [PMID: 37732281 PMCID: PMC10508732 DOI: 10.1101/2023.09.05.556364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/22/2023]
Abstract
There has been an explosive growth in the applications of AlphaFold2, and other structure prediction platforms, to accurately predict protein structures from a multiple sequence alignment (MSA) for downstream structural analysis. However, two outstanding questions persist in the field regarding the robustness of AlphaFold2 predictions of the consequences of point mutations and the completeness of its prediction of protein conformational ensembles. We combined our previously developed method SPEACH_AF with model relaxation and energetic analysis with Rosetta to address these questions. SPEACH_AF introduces residue substitutions across the MSA and not just within the input sequence. With respect to conformational ensembles, we combined SPEACH_AF and a new MSA subsampling method, AF_cluster, and for a benchmarked set of proteins, we found that the energetics of the conformational ensembles generated by AlphaFold2 correspond to those of experimental structures and explored by standard molecular dynamic methods. With respect to point mutations, we compared the structural and energetic consequences of having the mutation(s) in the input sequence versus in the whole MSA (SPEACH_AF). Both methods yielded models different from the wild-type sequence, with more robust changes when the mutation(s) were in the whole MSA. While our findings demonstrate the robustness of AlphaFold2 in analyzing point mutations and exploring conformational ensembles, they highlight the need for multi parameter structural and energetic analyses of these models to generate experimentally testable hypotheses.
Collapse
Affiliation(s)
- Richard A Stein
- Department of Molecular Physiology and Biophysics and Center for Applied AI in Protein Dynamics Vanderbilt University
| | - Hassane S Mchaourab
- Department of Molecular Physiology and Biophysics and Center for Applied AI in Protein Dynamics Vanderbilt University
| |
Collapse
|
62
|
Ohnuki J, Jaunet-Lahary T, Yamashita A, Okazaki KI. Accelerated Molecular Dynamics and AlphaFold Uncover a Missing Conformational State of Transporter Protein OxlT. J Phys Chem Lett 2024; 15:725-732. [PMID: 38215403 DOI: 10.1021/acs.jpclett.3c03052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2024]
Abstract
Transporter proteins change their conformations to carry their substrate across the cell membrane. The conformational dynamics is vital to understanding the transport function. We have studied the oxalate transporter (OxlT), an oxalate:formate antiporter from Oxalobacter formigenes, significant in avoiding kidney stone formation. The atomic structure of OxlT has been recently solved in the outward-open and occluded states. However, the inward-open conformation is still missing, hindering a complete understanding of the transporter. Here, we performed a Gaussian accelerated molecular dynamics simulation to sample the extensive conformational space of OxlT and successfully predicted the inward-open conformation where cytoplasmic substrate formate binding was preferred over oxalate binding. We also identified critical interactions for the inward-open conformation. The results were complemented by an AlphaFold2 structure prediction. Although AlphaFold2 solely predicted OxlT in the outward-open conformation, mutation of the identified critical residues made it partly predict the inward-open conformation, identifying possible state-shifting mutations.
Collapse
Affiliation(s)
- Jun Ohnuki
- Research Center for Computational Science, Institute for Molecular Science, National Institutes of Natural Sciences, Okazaki 444-8585, Japan
- Graduate Institute for Advanced Studies, SOKENDAI, Okazaki, Aichi 444-8585, Japan
| | - Titouan Jaunet-Lahary
- Research Center for Computational Science, Institute for Molecular Science, National Institutes of Natural Sciences, Okazaki 444-8585, Japan
| | - Atsuko Yamashita
- Graduate School of Medicine, Dentistry and Pharmaceutical Sciences, Okayama University, Okayama 700-8530, Japan
| | - Kei-Ichi Okazaki
- Research Center for Computational Science, Institute for Molecular Science, National Institutes of Natural Sciences, Okazaki 444-8585, Japan
- Graduate Institute for Advanced Studies, SOKENDAI, Okazaki, Aichi 444-8585, Japan
| |
Collapse
|
63
|
Amaya-Rodriguez CA, Carvajal-Zamorano K, Bustos D, Alegría-Arcos M, Castillo K. A journey from molecule to physiology and in silico tools for drug discovery targeting the transient receptor potential vanilloid type 1 (TRPV1) channel. Front Pharmacol 2024; 14:1251061. [PMID: 38328578 PMCID: PMC10847257 DOI: 10.3389/fphar.2023.1251061] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 12/14/2023] [Indexed: 02/09/2024] Open
Abstract
The heat and capsaicin receptor TRPV1 channel is widely expressed in nerve terminals of dorsal root ganglia (DRGs) and trigeminal ganglia innervating the body and face, respectively, as well as in other tissues and organs including central nervous system. The TRPV1 channel is a versatile receptor that detects harmful heat, pain, and various internal and external ligands. Hence, it operates as a polymodal sensory channel. Many pathological conditions including neuroinflammation, cancer, psychiatric disorders, and pathological pain, are linked to the abnormal functioning of the TRPV1 in peripheral tissues. Intense biomedical research is underway to discover compounds that can modulate the channel and provide pain relief. The molecular mechanisms underlying temperature sensing remain largely unknown, although they are closely linked to pain transduction. Prolonged exposure to capsaicin generates analgesia, hence numerous capsaicin analogs have been developed to discover efficient analgesics for pain relief. The emergence of in silico tools offered significant techniques for molecular modeling and machine learning algorithms to indentify druggable sites in the channel and for repositioning of current drugs aimed at TRPV1. Here we recapitulate the physiological and pathophysiological functions of the TRPV1 channel, including structural models obtained through cryo-EM, pharmacological compounds tested on TRPV1, and the in silico tools for drug discovery and repositioning.
Collapse
Affiliation(s)
- Cesar A. Amaya-Rodriguez
- Centro Interdisciplinario de Neurociencia de Valparaíso, Facultad de Ciencias, Universidad de Valparaíso, Valparaíso, Chile
- Departamento de Fisiología y Comportamiento Animal, Facultad de Ciencias Naturales, Exactas y Tecnología, Universidad de Panamá, Ciudad de Panamá, Panamá
| | - Karina Carvajal-Zamorano
- Centro Interdisciplinario de Neurociencia de Valparaíso, Facultad de Ciencias, Universidad de Valparaíso, Valparaíso, Chile
| | - Daniel Bustos
- Centro de Investigación de Estudios Avanzados del Maule (CIEAM), Vicerrectoría de Investigación y Postgrado Universidad Católica del Maule, Talca, Chile
- Laboratorio de Bioinformática y Química Computacional, Departamento de Medicina Traslacional, Facultad de Medicina, Universidad Católica del Maule, Talca, Chile
| | - Melissa Alegría-Arcos
- Núcleo de Investigación en Data Science, Facultad de Ingeniería y Negocios, Universidad de las Américas, Santiago, Chile
| | - Karen Castillo
- Centro Interdisciplinario de Neurociencia de Valparaíso, Facultad de Ciencias, Universidad de Valparaíso, Valparaíso, Chile
- Centro de Investigación de Estudios Avanzados del Maule (CIEAM), Vicerrectoría de Investigación y Postgrado Universidad Católica del Maule, Talca, Chile
| |
Collapse
|
64
|
Bret H, Gao J, Zea DJ, Andreani J, Guerois R. From interaction networks to interfaces, scanning intrinsically disordered regions using AlphaFold2. Nat Commun 2024; 15:597. [PMID: 38238291 PMCID: PMC10796318 DOI: 10.1038/s41467-023-44288-7] [Citation(s) in RCA: 26] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Accepted: 12/07/2023] [Indexed: 01/22/2024] Open
Abstract
The revolution brought about by AlphaFold2 opens promising perspectives to unravel the complexity of protein-protein interaction networks. The analysis of interaction networks obtained from proteomics experiments does not systematically provide the delimitations of the interaction regions. This is of particular concern in the case of interactions mediated by intrinsically disordered regions, in which the interaction site is generally small. Using a dataset of protein-peptide complexes involving intrinsically disordered regions that are non-redundant with the structures used in AlphaFold2 training, we show that when using the full sequences of the proteins, AlphaFold2-Multimer only achieves 40% success rate in identifying the correct site and structure of the interface. By delineating the interaction region into fragments of decreasing size and combining different strategies for integrating evolutionary information, we manage to raise this success rate up to 90%. We obtain similar success rates using a much larger dataset of protein complexes taken from the ELM database. Beyond the correct identification of the interaction site, our study also explores specificity issues. We show the advantages and limitations of using the AlphaFold2 confidence score to discriminate between alternative binding partners, a task that can be particularly challenging in the case of small interaction motifs.
Collapse
Affiliation(s)
- Hélène Bret
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Jinmei Gao
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Diego Javier Zea
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Jessica Andreani
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France.
| | - Raphaël Guerois
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France.
| |
Collapse
|
65
|
Versini R, Sritharan S, Aykac Fas B, Tubiana T, Aimeur SZ, Henri J, Erard M, Nüsse O, Andreani J, Baaden M, Fuchs P, Galochkina T, Chatzigoulas A, Cournia Z, Santuz H, Sacquin-Mora S, Taly A. A Perspective on the Prospective Use of AI in Protein Structure Prediction. J Chem Inf Model 2024; 64:26-41. [PMID: 38124369 DOI: 10.1021/acs.jcim.3c01361] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
AlphaFold2 (AF2) and RoseTTaFold (RF) have revolutionized structural biology, serving as highly reliable and effective methods for predicting protein structures. This article explores their impact and limitations, focusing on their integration into experimental pipelines and their application in diverse protein classes, including membrane proteins, intrinsically disordered proteins (IDPs), and oligomers. In experimental pipelines, AF2 models help X-ray crystallography in resolving the phase problem, while complementarity with mass spectrometry and NMR data enhances structure determination and protein flexibility prediction. Predicting the structure of membrane proteins remains challenging for both AF2 and RF due to difficulties in capturing conformational ensembles and interactions with the membrane. Improvements in incorporating membrane-specific features and predicting the structural effect of mutations are crucial. For intrinsically disordered proteins, AF2's confidence score (pLDDT) serves as a competitive disorder predictor, but integrative approaches including molecular dynamics (MD) simulations or hydrophobic cluster analyses are advocated for accurate dynamics representation. AF2 and RF show promising results for oligomeric models, outperforming traditional docking methods, with AlphaFold-Multimer showing improved performance. However, some caveats remain in particular for membrane proteins. Real-life examples demonstrate AF2's predictive capabilities in unknown protein structures, but models should be evaluated for their agreement with experimental data. Furthermore, AF2 models can be used complementarily with MD simulations. In this Perspective, we propose a "wish list" for improving deep-learning-based protein folding prediction models, including using experimental data as constraints and modifying models with binding partners or post-translational modifications. Additionally, a meta-tool for ranking and suggesting composite models is suggested, driving future advancements in this rapidly evolving field.
Collapse
Affiliation(s)
- Raphaelle Versini
- Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
| | - Sujith Sritharan
- Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
| | - Burcu Aykac Fas
- Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
| | - Thibault Tubiana
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Sana Zineb Aimeur
- Université Paris-Saclay, CNRS, Institut de Chimie Physique, 91405 Orsay, France
| | - Julien Henri
- Sorbonne Université, CNRS, Laboratoire de Biologie, Computationnelle et Quantitative UMR 7238, Institut de Biologie Paris-Seine, 4 Place Jussieu, F-75005 Paris, France
| | - Marie Erard
- Université Paris-Saclay, CNRS, Institut de Chimie Physique, 91405 Orsay, France
| | - Oliver Nüsse
- Université Paris-Saclay, CNRS, Institut de Chimie Physique, 91405 Orsay, France
| | - Jessica Andreani
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Marc Baaden
- Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
| | - Patrick Fuchs
- Sorbonne Université, École Normale Supérieure, PSL University, CNRS, Laboratoire des Biomolécules, LBM, 75005 Paris, France
- Université de Paris, UFR Sciences du Vivant, 75013 Paris, France
| | - Tatiana Galochkina
- Université Paris Cité and Université des Antilles and Université de la Réunion, INSERM, BIGR, F-75014 Paris, France
| | - Alexios Chatzigoulas
- Biomedical Research Foundation, Academy of Athens, 11527 Athens, Greece
- Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, 15784 Athens, Greece
| | - Zoe Cournia
- Biomedical Research Foundation, Academy of Athens, 11527 Athens, Greece
- Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, 15784 Athens, Greece
| | - Hubert Santuz
- Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
| | - Sophie Sacquin-Mora
- Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
| | - Antoine Taly
- Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
| |
Collapse
|
66
|
Li J, Wang L, Zhu Z, Song C. Exploring the Alternative Conformation of a Known Protein Structure Based on Contact Map Prediction. J Chem Inf Model 2024; 64:301-315. [PMID: 38117138 PMCID: PMC10777399 DOI: 10.1021/acs.jcim.3c01381] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 12/03/2023] [Accepted: 12/05/2023] [Indexed: 12/21/2023]
Abstract
The rapid development of deep learning-based methods has considerably advanced the field of protein structure prediction. The accuracy of predicting the 3D structures of simple proteins is comparable to that of experimentally determined structures, providing broad possibilities for structure-based biological studies. Another critical question is whether and how multistate structures can be predicted from a given protein sequence. In this study, analysis of tens of two-state proteins demonstrated that deep learning-based contact map predictions contain structural information on both states, which suggests that it is probably appropriate to change the target of deep learning-based protein structure prediction from one specific structure to multiple likely structures. Furthermore, by combining deep learning- and physics-based computational methods, we developed a protocol for exploring alternative conformations from a known structure of a given protein, by which we successfully approached the holo-state conformations of multiple representative proteins from their apo-state structures.
Collapse
Affiliation(s)
- Jiaxuan Li
- Center
for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Lei Wang
- Center
for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- Peking-Tsinghua
Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Zefeng Zhu
- Center
for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- Peking-Tsinghua
Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Chen Song
- Center
for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- Peking-Tsinghua
Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| |
Collapse
|
67
|
Ghafouri H, Lazar T, Del Conte A, Tenorio Ku LG, Tompa P, Tosatto SCE, Monzon AM. PED in 2024: improving the community deposition of structural ensembles for intrinsically disordered proteins. Nucleic Acids Res 2024; 52:D536-D544. [PMID: 37904608 PMCID: PMC10767937 DOI: 10.1093/nar/gkad947] [Citation(s) in RCA: 20] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 10/10/2023] [Accepted: 10/13/2023] [Indexed: 11/01/2023] Open
Abstract
The Protein Ensemble Database (PED) (URL: https://proteinensemble.org) is the primary resource for depositing structural ensembles of intrinsically disordered proteins. This updated version of PED reflects advancements in the field, denoting a continual expansion with a total of 461 entries and 538 ensembles, including those generated without explicit experimental data through novel machine learning (ML) techniques. With this significant increment in the number of ensembles, a few yet-unprecedented new entries entered the database, including those also determined or refined by electron paramagnetic resonance or circular dichroism data. In addition, PED was enriched with several new features, including a novel deposition service, improved user interface, new database cross-referencing options and integration with the 3D-Beacons network-all representing efforts to improve the FAIRness of the database. Foreseeably, PED will keep growing in size and expanding with new types of ensembles generated by accurate and fast ML-based generative models and coarse-grained simulations. Therefore, among future efforts, priority will be given to further develop the database to be compatible with ensembles modeled at a coarse-grained level.
Collapse
Affiliation(s)
| | - Tamas Lazar
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie (VIB), Brussels, Belgium
- Structural Biology Brussels, Department of Bioengineering, Vrije Universiteit Brussel (VUB), Brussels, Belgium
| | - Alessio Del Conte
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | | | - Peter Tompa
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie (VIB), Brussels, Belgium
- Structural Biology Brussels, Department of Bioengineering, Vrije Universiteit Brussel (VUB), Brussels, Belgium
- Institute of Enzymology, Research Centre for Natural Sciences (RCNS), Budapest, Hungary
| | | | | |
Collapse
|
68
|
Terwilliger TC, Liebschner D, Croll TI, Williams CJ, McCoy AJ, Poon BK, Afonine PV, Oeffner RD, Richardson JS, Read RJ, Adams PD. AlphaFold predictions are valuable hypotheses and accelerate but do not replace experimental structure determination. Nat Methods 2024; 21:110-116. [PMID: 38036854 PMCID: PMC10776388 DOI: 10.1038/s41592-023-02087-4] [Citation(s) in RCA: 79] [Impact Index Per Article: 79.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Accepted: 10/11/2023] [Indexed: 12/02/2023]
Abstract
Artificial intelligence-based protein structure prediction methods such as AlphaFold have revolutionized structural biology. The accuracies of these predictions vary, however, and they do not take into account ligands, covalent modifications or other environmental factors. Here, we evaluate how well AlphaFold predictions can be expected to describe the structure of a protein by comparing predictions directly with experimental crystallographic maps. In many cases, AlphaFold predictions matched experimental maps remarkably closely. In other cases, even very high-confidence predictions differed from experimental maps on a global scale through distortion and domain orientation, and on a local scale in backbone and side-chain conformation. We suggest considering AlphaFold predictions as exceptionally useful hypotheses. We further suggest that it is important to consider the confidence in prediction when interpreting AlphaFold predictions and to carry out experimental structure determination to verify structural details, particularly those that involve interactions not included in the prediction.
Collapse
Affiliation(s)
- Thomas C Terwilliger
- New Mexico Consortium, Los Alamos, NM, USA.
- Los Alamos National Laboratory, Los Alamos, NM, USA.
| | - Dorothee Liebschner
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Tristan I Croll
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, UK
| | | | - Airlie J McCoy
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, UK
| | - Billy K Poon
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Pavel V Afonine
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Robert D Oeffner
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, UK
| | | | - Randy J Read
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, UK
| | - Paul D Adams
- Molecular Biophysics & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Department of Bioengineering, University of California, Berkeley, CA, USA
| |
Collapse
|
69
|
Winski A, Ludwiczak J, Orlowska M, Madaj R, Kaminski K, Dunin‐Horkawicz S. AlphaFold2 captures the conformational landscape of the HAMP signaling domain. Protein Sci 2024; 33:e4846. [PMID: 38010737 PMCID: PMC10731501 DOI: 10.1002/pro.4846] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Revised: 10/30/2023] [Accepted: 11/19/2023] [Indexed: 11/29/2023]
Abstract
In this study, we present a conformational landscape of 5000 AlphaFold2 models of the Histidine kinases, Adenyl cyclases, Methyl-accepting proteins and Phosphatases (HAMP) domain, a short helical bundle that transduces signals from sensors to effectors in two-component signaling proteins such as sensory histidine kinases and chemoreceptors. The landscape reveals the conformational variability of the HAMP domain, including rotations, shifts, displacements, and tilts of helices, many combinations of which have not been observed in experimental structures. HAMP domains belonging to a single family tend to occupy a defined region of the landscape, even when their sequence similarity is low, suggesting that individual HAMP families have evolved to operate in a specific conformational range. The functional importance of this structural conservation is illustrated by poly-HAMP arrays, in which HAMP domains from families with opposite conformational preferences alternate, consistent with the rotational model of signal transduction. The only poly-HAMP arrays that violate this rule are predicted to be of recent evolutionary origin and structurally unstable. Finally, we identify a family of HAMP domains that are likely to be dynamic due to the presence of a conserved pi-helical bulge. All code associated with this work, including a tool for rapid sequence-based prediction of the rotational state in HAMP domains, is deposited at https://github.com/labstructbioinf/HAMPpred.
Collapse
Affiliation(s)
- Aleksander Winski
- Laboratory of Structural Bioinformatics, Centre of New TechnologiesUniversity of WarsawWarsawPoland
| | - Jan Ludwiczak
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research CentreUniversity of WarsawWarsawPoland
- Present address:
Prescient Design, Genentech Research & Early DevelopmentRoche GroupBaselSwitzerland
| | - Malgorzata Orlowska
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research CentreUniversity of WarsawWarsawPoland
| | - Rafal Madaj
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research CentreUniversity of WarsawWarsawPoland
| | - Kamil Kaminski
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research CentreUniversity of WarsawWarsawPoland
| | - Stanislaw Dunin‐Horkawicz
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research CentreUniversity of WarsawWarsawPoland
- Department of Protein EvolutionMax Planck Institute for Biology TübingenTübingenGermany
| |
Collapse
|
70
|
Kleiman DE, Nadeem H, Shukla D. Adaptive Sampling Methods for Molecular Dynamics in the Era of Machine Learning. J Phys Chem B 2023; 127:10669-10681. [PMID: 38081185 DOI: 10.1021/acs.jpcb.3c04843] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2023]
Abstract
Molecular dynamics (MD) simulations are fundamental computational tools for the study of proteins and their free energy landscapes. However, sampling protein conformational changes through MD simulations is challenging due to the relatively long time scales of these processes. Many enhanced sampling approaches have emerged to tackle this problem, including biased sampling and path-sampling methods. In this Perspective, we focus on adaptive sampling algorithms. These techniques differ from other approaches because the thermodynamic ensemble is preserved and the sampling is enhanced solely by restarting MD trajectories at particularly chosen seeds rather than introducing biasing forces. We begin our treatment with an overview of theoretically transparent methods, where we discuss principles and guidelines for adaptive sampling. Then, we present a brief summary of select methods that have been applied to realistic systems in the past. Finally, we discuss recent advances in adaptive sampling methodology powered by deep learning techniques, as well as their shortcomings.
Collapse
Affiliation(s)
- Diego E Kleiman
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Hassan Nadeem
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Diwakar Shukla
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
- Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| |
Collapse
|
71
|
da Silva GM, Cui JY, Dalgarno DC, Lisi GP, Rubenstein BM. Predicting Relative Populations of Protein Conformations without a Physics Engine Using AlphaFold 2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.25.550545. [PMID: 37546747 PMCID: PMC10402055 DOI: 10.1101/2023.07.25.550545] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]
Abstract
This paper presents a novel approach for predicting the relative populations of protein conformations using AlphaFold 2, an AI-powered method that has revolutionized biology by enabling the accurate prediction of protein structures. While AlphaFold 2 has shown exceptional accuracy and speed, it is designed to predict proteins' ground state conformations and is limited in its ability to predict conformational landscapes. Here, we demonstrate how AlphaFold 2 can directly predict the relative populations of different protein conformations by subsampling multiple sequence alignments. We tested our method against NMR experiments on two proteins with drastically different amounts of available sequence data, Abl1 kinase and the granulocyte-macrophage colony-stimulating factor, and predicted changes in their relative state populations with more than 80% accuracy. Our subsampling approach worked best when used to qualitatively predict the effects of mutations or evolution on the conformational landscape and well-populated states of proteins. It thus offers a fast and cost-effective way to predict the relative populations of protein conformations at even single-point mutation resolution, making it a useful tool for pharmacology, NMR analysis, and evolution.
Collapse
Affiliation(s)
- Gabriel Monteiro da Silva
- Brown University Department of Molecular Biology, Cell Biology, and Biochemistry, Providence, RI, USA
| | - Jennifer Y Cui
- Brown University Department of Molecular Biology, Cell Biology, and Biochemistry, Providence, RI, USA
| | | | - George P Lisi
- Brown University Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University Department of Chemistry, Providence, RI, USA
| | - Brenda M Rubenstein
- Brown University Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University Department of Chemistry, Providence, RI, USA
| |
Collapse
|
72
|
Chakravarty D, Schafer JW, Chen EA, Thole JR, Porter LL. AlphaFold2 has more to learn about protein energy landscapes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.12.571380. [PMID: 38168383 PMCID: PMC10760193 DOI: 10.1101/2023.12.12.571380] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
Recent work suggests that AlphaFold2 (AF2)-a deep learning-based model that can accurately infer protein structure from sequence-may discern important features of folded protein energy landscapes, defined by the diversity and frequency of different conformations in the folded state. Here, we test the limits of its predictive power on fold-switching proteins, which assume two structures with regions of distinct secondary and/or tertiary structure. Using several implementations of AF2, including two published enhanced sampling approaches, we generated >280,000 models of 93 fold-switching proteins whose experimentally determined conformations were likely in AF2's training set. Combining all models, AF2 predicted fold switching with a modest success rate of ~25%, indicating that it does not readily sample both experimentally characterized conformations of most fold switchers. Further, AF2's confidence metrics selected against models consistent with experimentally determined fold-switching conformations in favor of inconsistent models. Accordingly, these confidence metrics-though suggested to evaluate protein energetics reliably-did not discriminate between low and high energy states of fold-switching proteins. We then evaluated AF2's performance on seven fold-switching proteins outside of its training set, generating >159,000 models in total. Fold switching was accurately predicted in one of seven targets with moderate confidence. Further, AF2 demonstrated no ability to predict alternative conformations of two newly discovered targets without homologs in the set of 93 fold switchers. These results indicate that AF2 has more to learn about the underlying energetics of protein ensembles and highlight the need for further developments of methods that readily predict multiple protein conformations.
Collapse
Affiliation(s)
- Devlina Chakravarty
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
| | - Joseph W. Schafer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
| | - Ethan A. Chen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
| | - Joseph R. Thole
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
- Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20892
| | - Lauren L. Porter
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
- Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20892
| |
Collapse
|
73
|
Raisinghani N, Alshahrani M, Gupta G, Xiao S, Tao P, Verkhivker G. AlphaFold2-Enabled Atomistic Modeling of Epistatic Binding Mechanisms for the SARS-CoV-2 Spike Omicron XBB.1.5, EG.5 and FLip Variants: Convergent Evolution Hotspots Cooperate to Control Stability and Conformational Adaptability in Balancing ACE2 Binding and Antibody Resistance. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.11.571185. [PMID: 38168257 PMCID: PMC10760024 DOI: 10.1101/2023.12.11.571185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
In this study, we combined AI-based atomistic structural modeling and microsecond molecular simulations of the SARS-CoV-2 Spike complexes with the host receptor ACE2 for XBB.1.5+L455F, XBB.1.5+F456L(EG.5) and XBB.1.5+L455F/F456L (FLip) lineages to examine the mechanisms underlying the role of convergent evolution hotspots in balancing ACE2 binding and antibody evasion. Using the ensemble-based mutational scanning of the spike protein residues and physics-based rigorous computations of binding affinities, we identified binding energy hotspots and characterized molecular basis underlying epistatic couplings between convergent mutational hotspots. Consistent with the experiments, the results revealed the mediating role of Q493 hotspot in synchronization of epistatic couplings between L455F and F456L mutations providing a quantitative insight into the mechanism underlying differences between XBB lineages. Mutational profiling is combined with network-based model of epistatic couplings showing that the Q493, L455 and F456 sites mediate stable communities at the binding interface with ACE2 and can serve as stable mediators of non-additive couplings. Structure-based mutational analysis of Spike protein binding with the class 1 antibodies quantified the critical role of F456L and F486P mutations in eliciting strong immune evasion response. The results of this analysis support a mechanism in which the emergence of EG.5 and FLip variants may have been dictated by leveraging strong epistatic effects between several convergent revolutionary hotspots that provide synergy between the improved ACE2 binding and broad neutralization resistance. This interpretation is consistent with the notion that functionally balanced substitutions which simultaneously optimize immune evasion and high ACE2 affinity may continue to emerge through lineages with beneficial pair or triplet combinations of RBD mutations involving mediators of epistatic couplings and sites in highly adaptable RBD regions.
Collapse
|
74
|
Ramelot TA, Tejero R, Montelione GT. Representing structures of the multiple conformational states of proteins. Curr Opin Struct Biol 2023; 83:102703. [PMID: 37776602 PMCID: PMC10841472 DOI: 10.1016/j.sbi.2023.102703] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 08/18/2023] [Accepted: 08/23/2023] [Indexed: 10/02/2023]
Abstract
Biomolecules exhibit dynamic behavior that single-state models of their structures cannot fully capture. We review some recent advances for investigating multiple conformations of biomolecules, including experimental methods, molecular dynamics simulations, and machine learning. We also address the challenges associated with representing single- and multiple-state models in data archives, with a particular focus on NMR structures. Establishing standardized representations and annotations will facilitate effective communication and understanding of these complex models to the broader scientific community.
Collapse
Affiliation(s)
- Theresa A Ramelot
- Dept of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA.
| | - Roberto Tejero
- Dept of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
| | - Gaetano T Montelione
- Dept of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA.
| |
Collapse
|
75
|
Ahmed M, Maldonado AM, Durrant JD. From Byte to Bench to Bedside: Molecular Dynamics Simulations and Drug Discovery. ARXIV 2023:arXiv:2311.16946v1. [PMID: 38076508 PMCID: PMC10705576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Molecular dynamics (MD) simulations and computer-aided drug design (CADD) have advanced substantially over the past two decades, thanks to continuous computer hardware and software improvements. Given these advancements, MD simulations are poised to become even more powerful tools for investigating the dynamic interactions between potential small-molecule drugs and their target proteins, with significant implications for pharmacological research.
Collapse
Affiliation(s)
- Mayar Ahmed
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Alex M. Maldonado
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Jacob D. Durrant
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| |
Collapse
|
76
|
Vögele M, Zhang BW, Kaindl J, Wang L. Is the Functional Response of a Receptor Determined by the Thermodynamics of Ligand Binding? J Chem Theory Comput 2023; 19:8414-8422. [PMID: 37943175 DOI: 10.1021/acs.jctc.3c00899] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2023]
Abstract
For an effective drug, strong binding to the target protein is a prerequisite, but it is not enough. To produce a particular functional response, drugs need to either block the proteins' functions or modulate their activities by changing their conformational equilibrium. The binding free energy of a compound to its target is routinely calculated, but the timescales for the protein conformational changes are prohibitively long to be efficiently modeled via physics-based simulations. Thermodynamic principles suggest that the binding free energies of the ligands with different receptor conformations may infer their efficacy. However, this hypothesis has not been thoroughly validated. We present an actionable protocol and a comprehensive study to show that binding thermodynamics provides a strong predictor of the efficacy of a ligand. We apply the absolute binding free energy perturbation method to ligands bound to active and inactive states of eight G protein-coupled receptors and a nuclear receptor and then compare the resulting binding free energies. We find that carefully designed restraints are often necessary to efficiently model the corresponding conformational ensembles for each state. Our method achieves unprecedented performance in classifying ligands as agonists or antagonists across the various investigated receptors, all of which are important drug targets.
Collapse
Affiliation(s)
- Martin Vögele
- Schrödinger, Inc., 1540 Broadway 24th Floor, New York, New York 10036, United States
| | - Bin W Zhang
- Schrödinger, Inc., 1540 Broadway 24th Floor, New York, New York 10036, United States
| | - Jonas Kaindl
- Schrödinger GmbH, Glücksteinallee 25, Mannheim 68163, Germany
| | - Lingle Wang
- Schrödinger, Inc., 1540 Broadway 24th Floor, New York, New York 10036, United States
| |
Collapse
|
77
|
Buller R, Lutz S, Kazlauskas RJ, Snajdrova R, Moore JC, Bornscheuer UT. From nature to industry: Harnessing enzymes for biocatalysis. Science 2023; 382:eadh8615. [PMID: 37995253 DOI: 10.1126/science.adh8615] [Citation(s) in RCA: 91] [Impact Index Per Article: 45.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 10/17/2023] [Indexed: 11/25/2023]
Abstract
Biocatalysis harnesses enzymes to make valuable products. This green technology is used in countless applications from bench scale to industrial production and allows practitioners to access complex organic molecules, often with fewer synthetic steps and reduced waste. The last decade has seen an explosion in the development of experimental and computational tools to tailor enzymatic properties, equipping enzyme engineers with the ability to create biocatalysts that perform reactions not present in nature. By using (chemo)-enzymatic synthesis routes or orchestrating intricate enzyme cascades, scientists can synthesize elaborate targets ranging from DNA and complex pharmaceuticals to starch made in vitro from CO2-derived methanol. In addition, new chemistries have emerged through the combination of biocatalysis with transition metal catalysis, photocatalysis, and electrocatalysis. This review highlights recent key developments, identifies current limitations, and provides a future prospect for this rapidly developing technology.
Collapse
Affiliation(s)
- R Buller
- Competence Center for Biocatalysis, Institute of Chemistry and Biotechnology, Zurich University of Applied Sciences, 8820 Wädenswil, Switzerland
| | - S Lutz
- Codexis Incorporated, Redwood City, CA 94063, USA
| | - R J Kazlauskas
- Department of Biochemistry, Molecular Biology and Biophysics, Biotechnology Institute, University of Minnesota, Saint Paul, MN 55108, USA
| | - R Snajdrova
- Novartis Institutes for BioMedical Research, Global Discovery Chemistry, 4056 Basel, Switzerland
| | - J C Moore
- MRL, Merck & Co., Rahway, NJ 07065, USA
| | - U T Bornscheuer
- Institute of Biochemistry, Dept. of Biotechnology and Enzyme Catalysis, Greifswald University, Greifswald, Germany
| |
Collapse
|
78
|
Raisinghani N, Alshahrani M, Gupta G, Xiao S, Tao P, Verkhivker G. Accurate Characterization of Conformational Ensembles and Binding Mechanisms of the SARS-CoV-2 Omicron BA.2 and BA.2.86 Spike Protein with the Host Receptor and Distinct Classes of Antibodies Using AlphaFold2-Augmented Integrative Computational Modeling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.18.567697. [PMID: 38045395 PMCID: PMC10690158 DOI: 10.1101/2023.11.18.567697] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
The latest wave SARS-CoV-2 Omicron variants displayed a growth advantage and the increased viral fitness through convergent evolution of functional hotspots that work synchronously to balance fitness requirements for productive receptor binding and efficient immune evasion. In this study, we combined AlphaFold2-based structural modeling approaches with all-atom MD simulations and mutational profiling of binding energetics and stability for prediction and comprehensive analysis of the structure, dynamics, and binding of the SARS-CoV-2 Omicron BA.2.86 spike variant with ACE2 host receptor and distinct classes of antibodies. We adapted several AlphaFold2 approaches to predict both structure and conformational ensembles of the Omicron BA.2.86 spike protein in the complex with the host receptor. The results showed that AlphaFold2-predicted conformational ensemble of the BA.2.86 spike protein complex can accurately capture the main dynamics signatures obtained from microscond molecular dynamics simulations. The ensemble-based dynamic mutational scanning of the receptor binding domain residues in the BA.2 and BA.2.86 spike complexes with ACE2 dissected the role of the BA.2 and BA.2.86 backgrounds in modulating binding free energy changes revealing a group of conserved hydrophobic hotspots and critical variant-specific contributions of the BA.2.86 mutational sites R403K, F486P and R493Q. To examine immune evasion properties of BA.2.86 in atomistic detail, we performed large scale structure-based mutational profiling of the S protein binding interfaces with distinct classes of antibodies that displayed significantly reduced neutralization against BA.2.86 variant. The results quantified specific function of the BA.2.86 mutations to ensure broad resistance against different classes of RBD antibodies. This study revealed the molecular basis of compensatory functional effects of the binding hotspots, showing that BA.2.86 lineage may have primarily evolved to improve immune escape while modulating binding affinity with ACE2 through cooperative effect of R403K, F486P and R493Q mutations. The study supports a hypothesis that the impact of the increased ACE2 binding affinity on viral fitness is more universal and is mediated through cross-talk between convergent mutational hotspots, while the effect of immune evasion could be more variant-dependent.
Collapse
|
79
|
Guo X, Chen R, Cao L. Molecular docking-based virtual screening and dynamics simulation study of novel and potential SIRT7 inhibitors. Chem Biol Drug Des 2023; 102:707-717. [PMID: 37287091 DOI: 10.1111/cbdd.14277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Revised: 05/11/2023] [Accepted: 05/22/2023] [Indexed: 06/09/2023]
Abstract
In cancer cells, short for sirtuin (SIRT7) stabilizes the transformed state via its nicotinamide adenine dinucleotide (NAD+)-dependent deacetylase activity. Epigenetic factor SIRT7 plays important roles in cancer biology, reversing cancer phenotypes and suppressing tumor growth when inactive. In the present study, we got the SIRT7 protein structure from Alpha Fold2 Database and performed structure-based virtual screening to develop specific SIRT7 inhibitors using the SIRT7 inhibitor 97,491 interaction mechanism. As candidates for specific SIRT7 inhibitors, compounds with high affinities to SIRT7 were chosen. ZINC000001910616 and ZINC000014708529, two of our leading compounds, showed strong interactions with SIRT7. Our MD simulation results also revealed that the 5-hydroxy-4H-thioxen-4-one group and terminal carboxyl group were critical groups responsible for interaction of small molecules with SIRT7. In our study, we demonstrated that targeting SIRT7 may offer novel therapeutic options for cancer treatment. Compounds ZINC000001910616 and ZINC000014708529 can serve as chemical probes to investigate SIRT7 biological functions and provide starting points for the development of novel therapeutics against cancers.
Collapse
Affiliation(s)
- Xinli Guo
- Department of Operating Room, Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Hangzhou, China
| | - Rui Chen
- State Key Laboratory of Food Science and Technology, Jiangnan University, Wuxi, China
| | - Liping Cao
- Department of General Surgery, Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Hangzhou, China
- Zhejiang Engineering Research Center of Cognitive Healthcare, Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Hangzhou, China
| |
Collapse
|
80
|
Shui S, Buckley S, Scheller L, Correia BE. Rational design of small-molecule responsive protein switches. Protein Sci 2023; 32:e4774. [PMID: 37656809 PMCID: PMC10510469 DOI: 10.1002/pro.4774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 08/26/2023] [Accepted: 08/29/2023] [Indexed: 09/03/2023]
Abstract
Small-molecule responsive protein switches are powerful tools for controlling cellular processes. These switches are designed to respond rapidly and specifically to their inducer. They have been used in numerous applications, including the regulation of gene expression, post-translational protein modification, and signal transduction. Typically, small-molecule responsive protein switches consist of two proteins that interact with each other in the presence or absence of a small molecule. Recent advances in computational protein design already contributed to the development of protein switches with an expanded range of small-molecule inducers and increasingly sophisticated switch mechanisms. Further progress in the engineering of small-molecule responsive switches is fueled by cutting-edge computational design approaches, which will enable more complex and precise control over cellular processes and advance synthetic biology applications in biotechnology and medicine. Here, we discuss recent milestones and how technological advances are impacting the development of chemical switches.
Collapse
Affiliation(s)
- Sailan Shui
- Laboratory of Protein Design and Immunoengineering (LPDI)STI, EPFLLausanneSwitzerland
- Swiss Institute of Bioinformatics (SIB)LausanneSwitzerland
| | - Stephen Buckley
- Laboratory of Protein Design and Immunoengineering (LPDI)STI, EPFLLausanneSwitzerland
- Swiss Institute of Bioinformatics (SIB)LausanneSwitzerland
| | - Leo Scheller
- Laboratory of Protein Design and Immunoengineering (LPDI)STI, EPFLLausanneSwitzerland
- Swiss Institute of Bioinformatics (SIB)LausanneSwitzerland
| | - Bruno E. Correia
- Laboratory of Protein Design and Immunoengineering (LPDI)STI, EPFLLausanneSwitzerland
- Swiss Institute of Bioinformatics (SIB)LausanneSwitzerland
| |
Collapse
|
81
|
Kao TY, Chiang YW. DEERefiner-assisted structural refinement using pulsed dipolar spectroscopy: a study on multidrug transporter LmrP. Phys Chem Chem Phys 2023; 25:24508-24517. [PMID: 37656008 DOI: 10.1039/d3cp02569a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/02/2023]
Abstract
Pulsed dipolar spectroscopy, such as double electron-electron resonance (DEER), has been underutilized in protein structure determination, despite its ability to provide valuable spatial information. In this study, we present DEERefiner, a user-friendly MATLAB-based GUI program that enables the modeling of protein structures by combining an initial structure and DEER distance restraints. We illustrate the effectiveness of DEERefiner by successfully modeling the ligand-dependent conformational changes of the proton-drug antiporter LmrP to an extracellular-open-like conformation with an impressive precision of 0.76 Å. Additionally, DEERefiner was able to uncover a previously hypothesized but experimentally unresolved proton-dependent conformation of LmrP, characterized as an extracellular-closed/partially intracellular-open conformation, with a precision of 1.16 Å. Our work not only highlights the ability of DEER spectroscopy to model protein structures but also reveals the potential of DEERefiner to advance the field by providing an accessible and applicable tool for precise protein structure modeling, thereby paving the way for deeper insights into protein function.
Collapse
Affiliation(s)
- Te-Yu Kao
- Department of Chemistry, National Tsing Hua University, Hsinchu 300-044, Taiwan.
| | - Yun-Wei Chiang
- Department of Chemistry, National Tsing Hua University, Hsinchu 300-044, Taiwan.
| |
Collapse
|
82
|
Schafer JW, Porter LL. Evolutionary selection of proteins with two folds. Nat Commun 2023; 14:5478. [PMID: 37673981 PMCID: PMC10482954 DOI: 10.1038/s41467-023-41237-2] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 08/24/2023] [Indexed: 09/08/2023] Open
Abstract
Although most globular proteins fold into a single stable structure, an increasing number have been shown to remodel their secondary and tertiary structures in response to cellular stimuli. State-of-the-art algorithms predict that these fold-switching proteins adopt only one stable structure, missing their functionally critical alternative folds. Why these algorithms predict a single fold is unclear, but all of them infer protein structure from coevolved amino acid pairs. Here, we hypothesize that coevolutionary signatures are being missed. Suspecting that single-fold variants could be masking these signatures, we developed an approach, called Alternative Contact Enhancement (ACE), to search both highly diverse protein superfamilies-composed of single-fold and fold-switching variants-and protein subfamilies with more fold-switching variants. ACE successfully revealed coevolution of amino acid pairs uniquely corresponding to both conformations of 56/56 fold-switching proteins from distinct families. Then, we used ACE-derived contacts to (1) predict two experimentally consistent conformations of a candidate protein with unsolved structure and (2) develop a blind prediction pipeline for fold-switching proteins. The discovery of widespread dual-fold coevolution indicates that fold-switching sequences have been preserved by natural selection, implying that their functionalities provide evolutionary advantage and paving the way for predictions of diverse protein structures from single sequences.
Collapse
Affiliation(s)
- Joseph W Schafer
- National Library of Medicine, National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Lauren L Porter
- National Library of Medicine, National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA.
- National Heart, Lung, and Blood Institute, Biochemistry and Biophysics Center, National Institutes of Health, Bethesda, MD, 20892, USA.
| |
Collapse
|
83
|
Herrington NB, Stein D, Li YC, Pandey G, Schlessinger A. Exploring the Druggable Conformational Space of Protein Kinases Using AI-Generated Structures. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.31.555779. [PMID: 37693436 PMCID: PMC10491245 DOI: 10.1101/2023.08.31.555779] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Protein kinase function and interactions with drugs are controlled in part by the movement of the DFG and ɑC-Helix motifs, which enable kinases to adopt various conformational states. Small molecule ligands elicit therapeutic effects with distinct selectivity profiles and residence times that often depend on the kinase conformation(s) they bind. However, the limited availability of experimentally determined structural data for kinases in inactive states restricts drug discovery efforts for this major protein family. Modern AI-based structural modeling methods hold potential for exploring the previously experimentally uncharted druggable conformational space for kinases. Here, we first evaluated the currently explored conformational space of kinases in the PDB and models generated by AlphaFold2 (AF2) (1) and ESMFold (2), two prominent AI-based structure prediction methods. We then investigated AF2's ability to predict kinase structures in different conformations at various multiple sequence alignment (MSA) depths, based on this parameter's ability to explore conformational diversity. Our results showed a bias within the PDB and predicted structural models generated by AF2 and ESMFold toward structures of kinases in the active state over alternative conformations, particularly those conformations controlled by the DFG motif. Finally, we demonstrate that predicting kinase structures using AF2 at lower MSA depths allows the exploration of the space of these alternative conformations, including identifying previously unobserved conformations for 398 kinases. The results of our analysis of structural modeling by AF2 create a new avenue for the pursuit of new therapeutic agents against a notoriously difficult-to-target family of proteins. Significance Statement Greater abundance of kinase structural data in inactive conformations, currently lacking in structural databases, would improve our understanding of how protein kinases function and expand drug discovery and development for this family of therapeutic targets. Modern approaches utilizing artificial intelligence and machine learning have potential for efficiently capturing novel protein conformations. We provide evidence for a bias within AlphaFold2 and ESMFold to predict structures of kinases in their active states, similar to their overrepresentation in the PDB. We show that lowering the AlphaFold2 algorithm's multiple sequence alignment depth can help explore kinase conformational space more broadly. It can also enable the prediction of hundreds of kinase structures in novel conformations, many of whose models are likely viable for drug discovery.
Collapse
|
84
|
Simpkin AJ, Caballero I, McNicholas S, Stevenson K, Jiménez E, Sánchez Rodríguez F, Fando M, Uski V, Ballard C, Chojnowski G, Lebedev A, Krissinel E, Usón I, Rigden DJ, Keegan RM. Predicted models and CCP4. Acta Crystallogr D Struct Biol 2023; 79:806-819. [PMID: 37594303 PMCID: PMC10478639 DOI: 10.1107/s2059798323006289] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 07/19/2023] [Indexed: 08/19/2023] Open
Abstract
In late 2020, the results of CASP14, the 14th event in a series of competitions to assess the latest developments in computational protein structure-prediction methodology, revealed the giant leap forward that had been made by Google's Deepmind in tackling the prediction problem. The level of accuracy in their predictions was the first instance of a competitor achieving a global distance test score of better than 90 across all categories of difficulty. This achievement represents both a challenge and an opportunity for the field of experimental structural biology. For structure determination by macromolecular X-ray crystallography, access to highly accurate structure predictions is of great benefit, particularly when it comes to solving the phase problem. Here, details of new utilities and enhanced applications in the CCP4 suite, designed to allow users to exploit predicted models in determining macromolecular structures from X-ray diffraction data, are presented. The focus is mainly on applications that can be used to solve the phase problem through molecular replacement.
Collapse
Affiliation(s)
- Adam J. Simpkin
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Iracema Caballero
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona, Spain
| | - Stuart McNicholas
- York Structural Biology Laboratory, Department of Chemistry, The University of York, York YO10 5DD, United Kingdom
| | - Kyle Stevenson
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Elisabet Jiménez
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona, Spain
| | - Filomeno Sánchez Rodríguez
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
- York Structural Biology Laboratory, Department of Chemistry, The University of York, York YO10 5DD, United Kingdom
| | - Maria Fando
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Ville Uski
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Charles Ballard
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Grzegorz Chojnowski
- European Molecular Biology Laboratory, Hamburg Unit, Notkestrasse 85, 22607 Hamburg, Germany
| | - Andrey Lebedev
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Eugene Krissinel
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Isabel Usón
- Crystallographic Methods, Institute of Molecular Biology of Barcelona (IBMB–CSIC), Barcelona, Spain
- ICREA, Institució Catalana de Recerca i Estudis Avançats, Passeig Lluís Companys 23, 08003 Barcelona, Spain
| | - Daniel J. Rigden
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Ronan M. Keegan
- Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
- UKRI–STFC, Rutherford Appleton Laboratory, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| |
Collapse
|
85
|
Sinclair M, Stein RA, Sheehan JH, Hawes EM, O'Brien RM, Tajkhorshid E, Claxton DP. Molecular mechanisms of catalytic inhibition for active site mutations in glucose-6-phosphatase catalytic subunit 1 linked to glycogen storage disease. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.13.532485. [PMID: 36993754 PMCID: PMC10054992 DOI: 10.1101/2023.03.13.532485] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Mediating the terminal reaction of gluconeogenesis and glycogenolysis, the integral membrane protein G6PC1 regulates hepatic glucose production by catalyzing hydrolysis of glucose-6-phosphate (G6P) within the lumen of the endoplasmic reticulum. Consistent with its vital contribution to glucose homeostasis, inactivating mutations in G6PC1 cause glycogen storage disease (GSD) type 1a characterized by hepatomegaly and severe hypoglycemia. Despite its physiological importance, the structural basis of G6P binding to G6PC1 and the molecular disruptions induced by missense mutations within the active site that give rise to GSD type 1a are unknown. Exploiting a computational model of G6PC1 derived from the groundbreaking structure prediction algorithm AlphaFold2 (AF2), we combine molecular dynamics (MD) simulations and computational predictions of thermodynamic stability with a robust in vitro screening platform to define the atomic interactions governing G6P binding as well as explore the energetic perturbations imposed by disease-linked variants. We identify a collection of side chains, including conserved residues from the signature phosphatidic acid phosphatase motif, that contribute to a hydrogen bonding and van der Waals network stabilizing G6P in the active site. Introduction of GSD type 1a mutations into the G6PC1 sequence elicits changes in G6P binding energy, thermostability and structural properties, suggesting multiple pathways of catalytic impairment. Our results, which corroborate the high quality of the AF2 model as a guide for experimental design and to interpret outcomes, not only confirm active site structural organization but also suggest novel mechanistic contributions of catalytic and non-catalytic side chains.
Collapse
|
86
|
Brown BP, Stein RA, Meiler J, Mchaourab H. Approximating conformational Boltzmann distributions with AlphaFold2 predictions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.06.552168. [PMID: 37609301 PMCID: PMC10441281 DOI: 10.1101/2023.08.06.552168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
Protein dynamics are intimately tied to biological function and can enable processes such as signal transduction, enzyme catalysis, and molecular recognition. The relative free energies of conformations that contribute to these functional equilibria are evolved for the physiology of the organism. Despite the importance of these equilibria for understanding biological function and developing treatments for disease, the computational and experimental methods capable of quantifying them are limited to systems of modest size. Here, we demonstrate that AlphaFold2 contact distance distributions can approximate conformational Boltzmann distributions, which we evaluate through examination of the joint probability distributions of inter-residue contact distances along functionally relevant collective variables of several protein systems. Further, we show that contact distance probability distributions generated by AlphaFold2 are sensitive to points mutations thus AF2 can predict the structural effects of mutations in some systems. We anticipate that our approach will be a valuable tool to model the thermodynamics of conformational changes in large biomolecular systems.
Collapse
Affiliation(s)
- Benjamin P. Brown
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
| | - Richard A. Stein
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Department of Molecular Physiology and Biophysics, Vanderbilt University School of Medicine, Nashville, TN, USA. Nashville, TN 37232, USA
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Institute for Drug Discovery, Leipzig University Medical School, Leipzig, SAC 04103, Germany
| | - Hassane Mchaourab
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Department of Molecular Physiology and Biophysics, Vanderbilt University School of Medicine, Nashville, TN, USA. Nashville, TN 37232, USA
| |
Collapse
|
87
|
Sala D, Engelberger F, Mchaourab HS, Meiler J. Modeling conformational states of proteins with AlphaFold. Curr Opin Struct Biol 2023; 81:102645. [PMID: 37392556 DOI: 10.1016/j.sbi.2023.102645] [Citation(s) in RCA: 66] [Impact Index Per Article: 33.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 05/16/2023] [Accepted: 06/01/2023] [Indexed: 07/03/2023]
Abstract
Many proteins exert their function by switching among different structures. Knowing the conformational ensembles affiliated with these states is critical to elucidate key mechanistic aspects that govern protein function. While experimental determination efforts are still bottlenecked by cost, time, and technical challenges, the machine-learning technology AlphaFold showed near experimental accuracy in predicting the three-dimensional structure of monomeric proteins. However, an AlphaFold ensemble of models usually represents a single conformational state with minimal structural heterogeneity. Consequently, several pipelines have been proposed to either expand the structural breadth of an ensemble or bias the prediction toward a desired conformational state. Here, we analyze how those pipelines work, what they can and cannot predict, and future directions.
Collapse
Affiliation(s)
- D Sala
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany. https://twitter.com/sala_davide
| | - F Engelberger
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany. https://twitter.com/fengel97
| | - H S Mchaourab
- Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, TN, USA. https://twitter.com/Mchaourablab
| | - J Meiler
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany; Center for Structural Biology, Vanderbilt University, Nashville, TN 37240, USA; Center for Scalable Data Analytics and Artificial Intelligence (ScaDS.AI), Dresden/Leipzig, Germany.
| |
Collapse
|
88
|
Mondal A, Lenz S, MacCallum JL, Perez A. Hybrid computational methods combining experimental information with molecular dynamics. Curr Opin Struct Biol 2023; 81:102609. [PMID: 37224642 DOI: 10.1016/j.sbi.2023.102609] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2023] [Revised: 04/12/2023] [Accepted: 04/23/2023] [Indexed: 05/26/2023]
Abstract
A goal of structural biology is to understand how macromolecules carry out their biological roles by identifying their metastable states, mechanisms of action, pathways leading to conformational changes, and the thermodynamic and kinetic relationships between those states. Integrative modeling brings structural insights into systems where traditional structure determination approaches cannot help. We focus on the synergies and challenges of integrative modeling combining experimental data with molecular dynamics simulations.
Collapse
Affiliation(s)
- Arup Mondal
- Quantum Theory Project, Department of Chemistry, University of Florida, Leigh, UK. https://twitter.com/@amondal_chem
| | - Stefan Lenz
- Department of Chemistry, University of Calgary, 2500 University Drive, Canada
| | - Justin L MacCallum
- Department of Chemistry, University of Calgary, 2500 University Drive, Canada. https://twitter.com/@jlmaccal
| | - Alberto Perez
- Quantum Theory Project, Department of Chemistry, University of Florida, Leigh, UK.
| |
Collapse
|
89
|
Habeck M. Bayesian methods in integrative structure modeling. Biol Chem 2023; 404:741-754. [PMID: 37505205 DOI: 10.1515/hsz-2023-0145] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 07/07/2023] [Indexed: 07/29/2023]
Abstract
There is a growing interest in characterizing the structure and dynamics of large biomolecular assemblies and their interactions within the cellular environment. A diverse array of experimental techniques allows us to study biomolecular systems on a variety of length and time scales. These techniques range from imaging with light, X-rays or electrons, to spectroscopic methods, cross-linking mass spectrometry and functional genomics approaches, and are complemented by AI-assisted protein structure prediction methods. A challenge is to integrate all of these data into a model of the system and its functional dynamics. This review focuses on Bayesian approaches to integrative structure modeling. We sketch the principles of Bayesian inference, highlight recent applications to integrative modeling and conclude with a discussion of current challenges and future perspectives.
Collapse
Affiliation(s)
- Michael Habeck
- Microscopic Image Analysis Group, Jena University Hospital, D-07743 Jena, Germany
- Max Planck Institute for Multidisciplinary Sciences, d-37077 Göttingen, Germany
| |
Collapse
|
90
|
Casadevall G, Duran C, Osuna S. AlphaFold2 and Deep Learning for Elucidating Enzyme Conformational Flexibility and Its Application for Design. JACS AU 2023; 3:1554-1562. [PMID: 37388680 PMCID: PMC10302747 DOI: 10.1021/jacsau.3c00188] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 05/22/2023] [Accepted: 05/22/2023] [Indexed: 07/01/2023]
Abstract
The recent success of AlphaFold2 (AF2) and other deep learning (DL) tools in accurately predicting the folded three-dimensional (3D) structure of proteins and enzymes has revolutionized the structural biology and protein design fields. The 3D structure indeed reveals key information on the arrangement of the catalytic machinery of enzymes and which structural elements gate the active site pocket. However, comprehending enzymatic activity requires a detailed knowledge of the chemical steps involved along the catalytic cycle and the exploration of the multiple thermally accessible conformations that enzymes adopt when in solution. In this Perspective, some of the recent studies showing the potential of AF2 in elucidating the conformational landscape of enzymes are provided. Selected examples of the key developments of AF2-based and DL methods for protein design are discussed, as well as a few enzyme design cases. These studies show the potential of AF2 and DL for allowing the routine computational design of efficient enzymes.
Collapse
Affiliation(s)
- Guillem Casadevall
- Institut
de Química Computacional i Catàlisi (IQCC) and Departament
de Química, Universitat de Girona, Maria Aurèlia Capmany 69, 17003 Girona, Spain
| | - Cristina Duran
- Institut
de Química Computacional i Catàlisi (IQCC) and Departament
de Química, Universitat de Girona, Maria Aurèlia Capmany 69, 17003 Girona, Spain
| | - Sílvia Osuna
- Institut
de Química Computacional i Catàlisi (IQCC) and Departament
de Química, Universitat de Girona, Maria Aurèlia Capmany 69, 17003 Girona, Spain
- ICREA, Passeig Lluís Companys 23, 08010 Barcelona, Spain
| |
Collapse
|
91
|
Nussinov R, Zhang M, Liu Y, Jang H. AlphaFold, allosteric, and orthosteric drug discovery: Ways forward. Drug Discov Today 2023; 28:103551. [PMID: 36907321 PMCID: PMC10238671 DOI: 10.1016/j.drudis.2023.103551] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 02/27/2023] [Accepted: 03/07/2023] [Indexed: 03/13/2023]
Abstract
Drug discovery is arguably a highly challenging and significant interdisciplinary aim. The stunning success of the artificial intelligence-powered AlphaFold, whose latest version is buttressed by an innovative machine-learning approach that integrates physical and biological knowledge about protein structures, raised drug discovery hopes that unsurprisingly, have not come to bear. Even though accurate, the models are rigid, including the drug pockets. AlphaFold's mixed performance poses the question of how its power can be harnessed in drug discovery. Here we discuss possible ways of going forward wielding its strengths, while bearing in mind what AlphaFold can and cannot do. For kinases and receptors, an input enriched in active (ON) state models can better AlphaFold's chance of rational drug design success.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA; Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel.
| | - Mingzhen Zhang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
| | - Yonglan Liu
- Cancer Innovation Laboratory, National Cancer Institute, Frederick, MD 21702, USA
| | - Hyunbum Jang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
| |
Collapse
|
92
|
Escobedo N, Monzon AM, Fornasari MS, Palopoli N, Parisi G. Combining Protein Conformational Diversity and Phylogenetic Information Using CoDNaS and CoDNaS-Q. Curr Protoc 2023; 3:e764. [PMID: 37184204 DOI: 10.1002/cpz1.764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
CoDNaS (http://ufq.unq.edu.ar/codnas/) and CoDNaS-Q (http://ufq.unq.edu.ar/codnasq) are repositories of proteins with different degrees of conformational diversity. Following the ensemble nature of the native state, conformational diversity represents the structural differences between the conformers in the ensemble. Each entry in CoDNaS and CoDNaS-Q contains a redundant collection of experimentally determined conformers obtained under different conditions. These conformers represent snapshots of the protein dynamism. While CoDNaS contains examples of conformational diversity at the tertiary level, a recent development, CoDNaS-Q, contains examples at the quaternary level. In the emerging age of accurate protein structure prediction by machine learning approaches, many questions remain open regarding the characterization of protein dynamism. In this context, most bioinformatics resources take advantage of distinct features derived from protein alignments, however, the complexity and heterogeneity of information makes it difficult to recover reliable biological signatures. Here we present five protocols to explore tertiary and quaternary conformational diversity at the individual protein level as well as for the characterization of the distribution of conformational diversity at the protein family level in a phylogenetic context. These protocols can provide curated protein families with experimentally known conformational diversity, facilitating the exploration of sequence determinants of protein dynamism. © 2023 Wiley Periodicals LLC. Basic Protocol 1: Assessing conformational diversity with CoDNaS Alternate Protocol 1: Assessing conformational diversity at the quaternary level with CoDNaS-Q Basic Protocol 2: Exploring conformational diversity in a protein family Alternate Protocol 2: Exploring quaternary conformational diversity in a protein family Basic Protocol 3: Representing conformational diversity in a phylogenetic context.
Collapse
Affiliation(s)
- Nahuel Escobedo
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | | | - María Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| |
Collapse
|
93
|
McDonnell RT, Patel N, Wehrspan ZJ, Elcock AH. Atomic Models of All Major Trans-Envelope Complexes Involved in Lipid Trafficking in Escherichia Coli Constructed Using a Combination of AlphaFold2, AF2Complex, and Membrane Morphing Simulations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.28.538765. [PMID: 37162969 PMCID: PMC10168319 DOI: 10.1101/2023.04.28.538765] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
In Gram-negative bacteria, several trans-envelope complexes (TECs) have been identified that span the periplasmic space in order to facilitate lipid transport between the inner- and outer- membranes. While partial or near-complete structures of some of these TECs have been solved by conventional experimental techniques, most remain incomplete. Here we describe how a combination of computational approaches, constrained by experimental data, can be used to build complete atomic models for four TECs implicated in lipid transport in Escherichia coli . We use DeepMind's protein structure prediction algorithm, AlphaFold2, and a variant of it designed to predict protein complexes, AF2Complex, to predict the oligomeric states of key components of TECs and their likely interfaces with other components. After obtaining initial models of the complete TECs by superimposing predicted structures of subcomplexes, we use the membrane orientation prediction algorithm OPM to predict the likely orientations of the inner- and outer- membrane components in each TEC. Since, in all cases, the predicted membrane orientations in these initial models are tilted relative to each other, we devise a novel molecular mechanics-based strategy that we call "membrane morphing" that adjusts each TEC model until the two membranes are properly aligned with each other and separated by a distance consistent with estimates of the periplasmic width in E. coli . The study highlights the potential power of combining computational methods, operating within limits set by both experimental data and by cell physiology, for producing useable atomic structures of very large protein complexes.
Collapse
|
94
|
Koehler Leman J, Künze G. Recent Advances in NMR Protein Structure Prediction with ROSETTA. Int J Mol Sci 2023; 24:ijms24097835. [PMID: 37175539 PMCID: PMC10178863 DOI: 10.3390/ijms24097835] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 04/15/2023] [Accepted: 04/21/2023] [Indexed: 05/15/2023] Open
Abstract
Nuclear magnetic resonance (NMR) spectroscopy is a powerful method for studying the structure and dynamics of proteins in their native state. For high-resolution NMR structure determination, the collection of a rich restraint dataset is necessary. This can be difficult to achieve for proteins with high molecular weight or a complex architecture. Computational modeling techniques can complement sparse NMR datasets (<1 restraint per residue) with additional structural information to elucidate protein structures in these difficult cases. The Rosetta software for protein structure modeling and design is used by structural biologists for structure determination tasks in which limited experimental data is available. This review gives an overview of the computational protocols available in the Rosetta framework for modeling protein structures from NMR data. We explain the computational algorithms used for the integration of different NMR data types in Rosetta. We also highlight new developments, including modeling tools for data from paramagnetic NMR and hydrogen-deuterium exchange, as well as chemical shifts in CS-Rosetta. Furthermore, strategies are discussed to complement and improve structure predictions made by the current state-of-the-art AlphaFold2 program using NMR-guided Rosetta modeling.
Collapse
Affiliation(s)
- Julia Koehler Leman
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY 10010, USA
| | - Georg Künze
- Institute for Drug Discovery, Medical Faculty, University of Leipzig, Brüderstr. 34, D-04103 Leipzig, Germany
- Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstr. 16-18, D-04107 Leipzig, Germany
| |
Collapse
|
95
|
Nguyen PH, Sterpone F, Derreumaux P. Metastable alpha-rich and beta-rich conformations of small Aβ42 peptide oligomers. Proteins 2023. [PMID: 37038252 DOI: 10.1002/prot.26495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 02/15/2023] [Accepted: 03/23/2023] [Indexed: 04/12/2023]
Abstract
Probing the structures of amyloid-β (Aβ) peptides in the early steps of aggregation is extremely difficult experimentally and computationally. Yet, this knowledge is extremely important as small oligomers are the most toxic species. Experiments and simulations on Aβ42 monomer point to random coil conformations with either transient helical or β-strand content. Our current conformational description of small Aβ42 oligomers is funneled toward amorphous aggregates with some β-sheet content and rare high energy states with well-ordered assemblies of β-sheets. In this study, we emphasize another view based on metastable α-helix bundle oligomers spanning the C-terminal residues, which are predicted by the machine-learning AlphaFold2 method and supported indirectly by low-resolution experimental data on many amyloid polypeptides. This finding has consequences in developing novel chemical tools and to design potential therapies to reduce aggregation and toxicity.
Collapse
Affiliation(s)
- Phuong H Nguyen
- Laboratoire de Biochimie Théorique, UPR 9080, CNRS, Université Paris Cité, Paris, France
- Institut de Biologie Physico-Chimique, Fondation Edmond de Rothschild, 13 rue Pierre et Marie Curie, Paris, 75005, France
| | - Fabio Sterpone
- Laboratoire de Biochimie Théorique, UPR 9080, CNRS, Université Paris Cité, Paris, France
- Institut de Biologie Physico-Chimique, Fondation Edmond de Rothschild, 13 rue Pierre et Marie Curie, Paris, 75005, France
| | - Philippe Derreumaux
- Laboratoire de Biochimie Théorique, UPR 9080, CNRS, Université Paris Cité, Paris, France
- Institut de Biologie Physico-Chimique, Fondation Edmond de Rothschild, 13 rue Pierre et Marie Curie, Paris, 75005, France
- Institut Universitaire de France (IUF), Paris, 75005, France
| |
Collapse
|
96
|
Meller A, Bhakat S, Solieva S, Bowman GR. Accelerating Cryptic Pocket Discovery Using AlphaFold. J Chem Theory Comput 2023. [PMID: 36948209 PMCID: PMC10373493 DOI: 10.1021/acs.jctc.2c01189] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/24/2023]
Abstract
Cryptic pockets, or pockets absent in ligand-free, experimentally determined structures, hold great potential as drug targets. However, cryptic pocket openings are often beyond the reach of conventional biomolecular simulations because certain cryptic pocket openings involve slow motions. Here, we investigate whether AlphaFold can be used to accelerate cryptic pocket discovery either by generating structures with open pockets directly or generating structures with partially open pockets that can be used as starting points for simulations. We use AlphaFold to generate ensembles for 10 known cryptic pocket examples, including five that were deposited after AlphaFold's training data were extracted from the PDB. We find that in 6 out of 10 cases AlphaFold samples the open state. For plasmepsin II, an aspartic protease from the causative agent of malaria, AlphaFold only captures a partial pocket opening. As a result, we ran simulations from an ensemble of AlphaFold-generated structures and show that this strategy samples cryptic pocket opening, even though an equivalent amount of simulations launched from a ligand-free experimental structure fails to do so. Markov state models (MSMs) constructed from the AlphaFold-seeded simulations quickly yield a free energy landscape of cryptic pocket opening that is in good agreement with the same landscape generated with well-tempered metadynamics. Taken together, our results demonstrate that AlphaFold has a useful role to play in cryptic pocket discovery but that many cryptic pockets may remain difficult to sample using AlphaFold alone.
Collapse
Affiliation(s)
- Artur Meller
- Department of Biochemistry and Molecular Biophysics, Washington University in St. Louis, 660 S. Euclid Ave., St. Louis, Missouri 63110, United States
- Medical Scientist Training Program, Washington University in St. Louis, 660 S. Euclid Ave., St. Louis, Missouri 63110, United States
| | - Soumendranath Bhakat
- Department of Biochemistry and Molecular Biophysics, Washington University in St. Louis, 660 S. Euclid Ave., St. Louis, Missouri 63110, United States
- Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - Shahlo Solieva
- Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| | - Gregory R Bowman
- Department of Biochemistry and Molecular Biophysics, Washington University in St. Louis, 660 S. Euclid Ave., St. Louis, Missouri 63110, United States
- Department of Biochemistry and Biophysics, University of Pennsylvania, Philadelphia, Pennsylvania 19104, United States
| |
Collapse
|
97
|
Agajanian S, Alshahrani M, Bai F, Tao P, Verkhivker GM. Exploring and Learning the Universe of Protein Allostery Using Artificial Intelligence Augmented Biophysical and Computational Approaches. J Chem Inf Model 2023; 63:1413-1428. [PMID: 36827465 PMCID: PMC11162550 DOI: 10.1021/acs.jcim.2c01634] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/26/2023]
Abstract
Allosteric mechanisms are commonly employed regulatory tools used by proteins to orchestrate complex biochemical processes and control communications in cells. The quantitative understanding and characterization of allosteric molecular events are among major challenges in modern biology and require integration of innovative computational experimental approaches to obtain atomistic-level knowledge of the allosteric states, interactions, and dynamic conformational landscapes. The growing body of computational and experimental studies empowered by emerging artificial intelligence (AI) technologies has opened up new paradigms for exploring and learning the universe of protein allostery from first principles. In this review we analyze recent developments in high-throughput deep mutational scanning of allosteric protein functions; applications and latest adaptations of Alpha-fold structural prediction methods for studies of protein dynamics and allostery; new frontiers in integrating machine learning and enhanced sampling techniques for characterization of allostery; and recent advances in structural biology approaches for studies of allosteric systems. We also highlight recent computational and experimental studies of the SARS-CoV-2 spike (S) proteins revealing an important and often hidden role of allosteric regulation driving functional conformational changes, binding interactions with the host receptor, and mutational escape mechanisms of S proteins which are critical for viral infection. We conclude with a summary and outlook of future directions suggesting that AI-augmented biophysical and computer simulation approaches are beginning to transform studies of protein allostery toward systematic characterization of allosteric landscapes, hidden allosteric states, and mechanisms which may bring about a new revolution in molecular biology and drug discovery.
Collapse
Affiliation(s)
- Steve Agajanian
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
| | - Mohammed Alshahrani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
| | - Fang Bai
- Shanghai Institute for Advanced Immunochemical Studies, School of Life Science and Technology and Information Science and Technology, Shanghai Tech University, 393 Middle Huaxia Road, Shanghai 201210, China
| | - Peng Tao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75205, United States
| | - Gennady M Verkhivker
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
- Department of Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, Irvine, California 92618, United States
| |
Collapse
|
98
|
Sala D, Batebi H, Ledwitch K, Hildebrand PW, Meiler J. Targeting in silico GPCR conformations with ultra-large library screening for hit discovery. Trends Pharmacol Sci 2023; 44:150-161. [PMID: 36669974 PMCID: PMC9974811 DOI: 10.1016/j.tips.2022.12.006] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2022] [Revised: 12/23/2022] [Accepted: 12/27/2022] [Indexed: 01/20/2023]
Abstract
The use of deep machine learning (ML) in protein structure prediction has made it possible to easily access a large number of annotated conformations that can potentially compensate for missing experimental structures in structure-based drug discovery (SBDD). However, it is still unclear whether the accuracy of these predicted conformations is sufficient for screening chemical compounds that will effectively interact with a protein target for pharmacological purposes. In this opinion article, we examine the potential benefits and limitations of using state-annotated conformations for ultra-large library screening (ULLS) in light of the growing size of ultra-large libraries (ULLs). We believe that targeting different conformational states of common drug targets like G-protein-coupled receptors (GPCRs), which can regulate human physiology by switching between different conformations, can offer multiple advantages.
Collapse
Affiliation(s)
- D Sala
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany
| | - H Batebi
- Institute of Medical Physics and Biophysics, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany
| | - K Ledwitch
- Center for Structural Biology, Vanderbilt University, Nashville, TN 37240, USA; Department of Chemistry, Vanderbilt University, Nashville, TN 37235, USA
| | - P W Hildebrand
- Institute of Medical Physics and Biophysics, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany
| | - J Meiler
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, 04103 Leipzig, Germany; Center for Structural Biology, Vanderbilt University, Nashville, TN 37240, USA; Department of Chemistry, Vanderbilt University, Nashville, TN 37235, USA.
| |
Collapse
|
99
|
Chakravarty D, Schafer JW, Porter LL. Distinguishing features of fold-switching proteins. Protein Sci 2023; 32:e4596. [PMID: 36782353 PMCID: PMC9951197 DOI: 10.1002/pro.4596] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 01/30/2023] [Accepted: 02/09/2023] [Indexed: 02/15/2023]
Abstract
Though many folded proteins assume one stable structure that performs one function, a small-but-increasing number remodel their secondary and tertiary structures and change their functions in response to cellular stimuli. These fold-switching proteins regulate biological processes and are associated with autoimmune dysfunction, severe acute respiratory syndrome coronavirus-2 infection, and more. Despite their biological importance, it is difficult to computationally predict fold switching. With the aim of advancing computational prediction and experimental characterization of fold switchers, this review discusses several features that distinguish fold-switching proteins from their single-fold and intrinsically disordered counterparts. First, the isolated structures of fold switchers are less stable and more heterogeneous than single folders but more stable and less heterogeneous than intrinsically disordered proteins (IDPs). Second, the sequences of single fold, fold switching, and intrinsically disordered proteins can evolve at distinct rates. Third, proteins from these three classes are best predicted using different computational techniques. Finally, late-breaking results suggest that single folders, fold switchers, and IDPs have distinct patterns of residue-residue coevolution. The review closes by discussing high-throughput and medium-throughput experimental approaches that might be used to identify new fold-switching proteins.
Collapse
Affiliation(s)
- Devlina Chakravarty
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of HealthBethesdaMarylandUSA
| | - Joseph W. Schafer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of HealthBethesdaMarylandUSA
| | - Lauren L. Porter
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of HealthBethesdaMarylandUSA
- Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of HealthBethesdaMarylandUSA
| |
Collapse
|
100
|
Sala D, Hildebrand PW, Meiler J. Biasing AlphaFold2 to predict GPCRs and kinases with user-defined functional or structural properties. Front Mol Biosci 2023; 10:1121962. [PMID: 36876042 PMCID: PMC9978208 DOI: 10.3389/fmolb.2023.1121962] [Citation(s) in RCA: 38] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2022] [Accepted: 01/31/2023] [Indexed: 02/18/2023] Open
Abstract
Determining the three-dimensional structure of proteins in their native functional states has been a longstanding challenge in structural biology. While integrative structural biology has been the most effective way to get a high-accuracy structure of different conformations and mechanistic insights for larger proteins, advances in deep machine-learning algorithms have paved the way to fully computational predictions. In this field, AlphaFold2 (AF2) pioneered ab initio high-accuracy single-chain modeling. Since then, different customizations have expanded the number of conformational states accessible through AF2. Here, we further expanded AF2 with the aim of enriching an ensemble of models with user-defined functional or structural features. We tackled two common protein families for drug discovery, G-protein-coupled receptors (GPCRs) and kinases. Our approach automatically identifies the best templates satisfying the specified features and combines those with genetic information. We also introduced the possibility of shuffling the selected templates to expand the space of solutions. In our benchmark, models showed the intended bias and great accuracy. Our protocol can thus be exploited for modeling user-defined conformational states in an automatic fashion.
Collapse
Affiliation(s)
- Davide Sala
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, Leipzig, Germany
| | - Peter W. Hildebrand
- Institute of Medical Physics and Biophysics, Faculty of Medicine, University of Leipzig, Leipzig, Germany
| | - Jens Meiler
- Institute of Drug Discovery, Faculty of Medicine, University of Leipzig, Leipzig, Germany
- Center for Structural Biology, Vanderbilt University, Nashville, TN, United States
- Department of Chemistry, Vanderbilt University, Nashville, TN, United States
| |
Collapse
|