1
|
Yang ZJ, Shao Q, Jiang Y, Jurich C, Ran X, Juarez RJ, Yan B, Stull SL, Gollu A, Ding N. Mutexa: A Computational Ecosystem for Intelligent Protein Engineering. J Chem Theory Comput 2023; 19:7459-7477. [PMID: 37828731 PMCID: PMC10653112 DOI: 10.1021/acs.jctc.3c00602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Indexed: 10/14/2023]
Abstract
Protein engineering holds immense promise in shaping the future of biomedicine and biotechnology. This Review focuses on our ongoing development of Mutexa, a computational ecosystem designed to enable "intelligent protein engineering". In this vision, researchers will seamlessly acquire sequences of protein variants with desired functions as biocatalysts, therapeutic peptides, and diagnostic proteins through a finely-tuned computational machine, akin to Amazon Alexa's role as a versatile virtual assistant. The technical foundation of Mutexa has been established through the development of a database that combines and relates enzyme structures and their respective functions (e.g., IntEnzyDB), workflow software packages that enable high-throughput protein modeling (e.g., EnzyHTP and LassoHTP), and scoring functions that map the sequence-structure-function relationship of proteins (e.g., EnzyKR and DeepLasso). We will showcase the applications of these tools in benchmarking the convergence conditions of enzyme functional descriptors across mutants, investigating protein electrostatics and cavity distributions in SAM-dependent methyltransferases, and understanding the role of nonelectrostatic dynamic effects in enzyme catalysis. Finally, we will conclude by addressing the future steps and fundamental challenges in our endeavor to develop new Mutexa applications that assist the identification of beneficial mutants in protein engineering.
Collapse
Affiliation(s)
- Zhongyue J. Yang
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37235, United States
- Vanderbilt
Institute of Chemical Biology, Vanderbilt
University, Nashville, Tennessee 37235, United States
- Department
of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, Tennessee 37235, United States
- Data
Science Institute, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Qianzhen Shao
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Yaoyukun Jiang
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Christopher Jurich
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
- Vanderbilt
Institute of Chemical Biology, Vanderbilt
University, Nashville, Tennessee 37235, United States
| | - Xinchun Ran
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Reecan J. Juarez
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
- Chemical
and Physical Biology Program, Vanderbilt
University, Nashville, Tennessee 37235, United States
| | - Bailu Yan
- Department
of Biostatistics, Vanderbilt University, Nashville, Tennessee 37205, United States
| | - Sebastian L. Stull
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Anvita Gollu
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Ning Ding
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| |
Collapse
|
2
|
Liu C, Kutchukian P, Nguyen ND, AlQuraishi M, Sorger PK. A Hybrid Structure-Based Machine Learning Approach for Predicting Kinase Inhibition by Small Molecules. J Chem Inf Model 2023; 63:5457-5472. [PMID: 37595065 PMCID: PMC10498990 DOI: 10.1021/acs.jcim.3c00347] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Indexed: 08/20/2023]
Abstract
Kinases have been the focus of drug discovery programs for three decades leading to over 70 therapeutic kinase inhibitors and biophysical affinity measurements for over 130,000 kinase-compound pairs. Nonetheless, the precise target spectrum for many kinases remains only partly understood. In this study, we describe a computational approach to unlocking qualitative and quantitative kinome-wide binding measurements for structure-based machine learning. Our study has three components: (i) a Kinase Inhibitor Complex (KinCo) data set comprising in silico predicted kinase structures paired with experimental binding constants, (ii) a machine learning loss function that integrates qualitative and quantitative data for model training, and (iii) a structure-based machine learning model trained on KinCo. We show that our approach outperforms methods trained on crystal structures alone in predicting binary and quantitative kinase-compound interaction affinities; relative to structure-free methods, our approach also captures known kinase biochemistry and more successfully generalizes to distant kinase sequences and compound scaffolds.
Collapse
Affiliation(s)
- Changchang Liu
- Laboratory
of Systems Pharmacology, Department of Systems Biology, Harvard Program
in Therapeutic Science, Harvard Medical
School, Boston, Massachusetts 02115, United States
| | - Peter Kutchukian
- Novartis
Institutes for Biomedical Research, Cambridge, Massachusetts 02139, United States
| | - Nhan D. Nguyen
- Pritzker
School of Molecular Engineering, University
of Chicago, Chicago, Illinois 60637, United
States
| | - Mohammed AlQuraishi
- Department
of Systems Biology, Columbia University, New York, New York 10032, United States
| | - Peter K. Sorger
- Laboratory
of Systems Pharmacology, Department of Systems Biology, Harvard Program
in Therapeutic Science, Harvard Medical
School, Boston, Massachusetts 02115, United States
| |
Collapse
|
3
|
Lalmansingh JM, Keeley AT, Ruff KM, Pappu RV, Holehouse AS. SOURSOP: A Python Package for the Analysis of Simulations of Intrinsically Disordered Proteins. J Chem Theory Comput 2023; 19:5609-5620. [PMID: 37463458 PMCID: PMC11188088 DOI: 10.1021/acs.jctc.3c00190] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/20/2023]
Abstract
Conformational heterogeneity is a defining hallmark of intrinsically disordered proteins and protein regions (IDRs). The functions of IDRs and the emergent cellular phenotypes they control are associated with sequence-specific conformational ensembles. Simulations of conformational ensembles that are based on atomistic and coarse-grained models are routinely used to uncover the sequence-specific interactions that may contribute to IDR functions. These simulations are performed either independently or in conjunction with data from experiments. Functionally relevant features of IDRs can span a range of length scales. Extracting these features requires analysis routines that quantify a range of properties. Here, we describe a new analysis suite simulation analysis of unfolded regions of proteins (SOURSOP), an object-oriented and open-source toolkit designed for the analysis of simulated conformational ensembles of IDRs. SOURSOP implements several analysis routines motivated by principles in polymer physics, offering a unique collection of simple-to-use functions to characterize IDR ensembles. As an extendable framework, SOURSOP supports the development and implementation of new analysis routines that can be easily packaged and shared.
Collapse
Affiliation(s)
- Jared M. Lalmansingh
- Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO 63130, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Alex T. Keeley
- Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO 63130, USA
- Department of Chemistry, University of Illinois Urbana-Champaign, Urbana-Champaign, IL 61801, USA
| | - Kiersten M. Ruff
- Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO 63130, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Rohit V. Pappu
- Department of Biomedical Engineering, Washington University in St. Louis, St. Louis, MO 63130, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Alex S. Holehouse
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO 63130, USA
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, 63110, USA
| |
Collapse
|
4
|
Doh CY, Kampourakis T, Campbell KS, Stelzer JE. Basic science methods for the characterization of variants of uncertain significance in hypertrophic cardiomyopathy. Front Cardiovasc Med 2023; 10:1238515. [PMID: 37600050 PMCID: PMC10432852 DOI: 10.3389/fcvm.2023.1238515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2023] [Accepted: 07/20/2023] [Indexed: 08/22/2023] Open
Abstract
With the advent of next-generation whole genome sequencing, many variants of uncertain significance (VUS) have been identified in individuals suffering from inheritable hypertrophic cardiomyopathy (HCM). Unfortunately, this classification of a genetic variant results in ambiguity in interpretation, risk stratification, and clinical practice. Here, we aim to review some basic science methods to gain a more accurate characterization of VUS in HCM. Currently, many genomic data-based computational methods have been developed and validated against each other to provide a robust set of resources for researchers. With the continual improvement in computing speed and accuracy, in silico molecular dynamic simulations can also be applied in mutational studies and provide valuable mechanistic insights. In addition, high throughput in vitro screening can provide more biologically meaningful insights into the structural and functional effects of VUS. Lastly, multi-level mathematical modeling can predict how the mutations could cause clinically significant organ-level dysfunction. We discuss emerging technologies that will aid in better VUS characterization and offer a possible basic science workflow for exploring the pathogenicity of VUS in HCM. Although the focus of this mini review was on HCM, these basic science methods can be applied to research in dilated cardiomyopathy (DCM), restrictive cardiomyopathy (RCM), arrhythmogenic cardiomyopathy (ACM), or other genetic cardiomyopathies.
Collapse
Affiliation(s)
- Chang Yoon Doh
- School of Medicine, Case Western Reserve University, Cleveland, OH, United States
| | - Thomas Kampourakis
- Randall Centre for Cell and Molecular Biophysics, and British Heart Foundation Centre of Research Excellence, King’s College London, London, United Kingdom
| | - Kenneth S. Campbell
- Division of Cardiovascular Medicine, University of Kentucky, Lexington, KY, United States
| | - Julian E. Stelzer
- Department of Physiology and Biophysics, School of Medicine, Case Western Reserve University, Cleveland, OH, United States
| |
Collapse
|
5
|
Lalmansingh JM, Keeley AT, Ruff KM, Pappu RV, Holehouse AS. SOURSOP: A Python package for the analysis of simulations of intrinsically disordered proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.16.528879. [PMID: 36824878 PMCID: PMC9949127 DOI: 10.1101/2023.02.16.528879] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/19/2023]
Abstract
Conformational heterogeneity is a defining hallmark of intrinsically disordered proteins and protein regions (IDRs). The functions of IDRs and the emergent cellular phenotypes they control are associated with sequence-specific conformational ensembles. Simulations of conformational ensembles that are based on atomistic and coarse-grained models are routinely used to uncover the sequence-specific interactions that may contribute to IDR functions. These simulations are performed either independently or in conjunction with data from experiments. Functionally relevant features of IDRs can span a range of length scales. Extracting these features requires analysis routines that quantify a range of properties. Here, we describe a new analysis suite SOURSOP, an object-oriented and open-source toolkit designed for the analysis of simulated conformational ensembles of IDRs. SOURSOP implements several analysis routines motivated by principles in polymer physics, offering a unique collection of simple-to-use functions to characterize IDR ensembles. As an extendable framework, SOURSOP supports the development and implementation of new analysis routines that can be easily packaged and shared.
Collapse
|
6
|
Zimmermann MT. Molecular Modeling is an Enabling Approach to Complement and Enhance Channelopathy Research. Compr Physiol 2022; 12:3141-3166. [PMID: 35578963 DOI: 10.1002/cphy.c190047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Hundreds of human membrane proteins form channels that transport necessary ions and compounds, including drugs and metabolites, yet details of their normal function or how function is altered by genetic variants to cause diseases are often unknown. Without this knowledge, researchers are less equipped to develop approaches to diagnose and treat channelopathies. High-resolution computational approaches such as molecular modeling enable researchers to investigate channelopathy protein function, facilitate detailed hypothesis generation, and produce data that is difficult to gather experimentally. Molecular modeling can be tailored to each physiologic context that a protein may act within, some of which may currently be difficult or impossible to assay experimentally. Because many genomic variants are observed in channelopathy proteins from high-throughput sequencing studies, methods with mechanistic value are needed to interpret their effects. The eminent field of structural bioinformatics integrates techniques from multiple disciplines including molecular modeling, computational chemistry, biophysics, and biochemistry, to develop mechanistic hypotheses and enhance the information available for understanding function. Molecular modeling and simulation access 3D and time-dependent information, not currently predictable from sequence. Thus, molecular modeling is valuable for increasing the resolution with which the natural function of protein channels can be investigated, and for interpreting how genomic variants alter them to produce physiologic changes that manifest as channelopathies. © 2022 American Physiological Society. Compr Physiol 12:3141-3166, 2022.
Collapse
Affiliation(s)
- Michael T Zimmermann
- Bioinformatics Research and Development Laboratory, Genomic Sciences and Precision Medicine Center, Medical College of Wisconsin, Milwaukee, Wisconsin, USA.,Clinical and Translational Sciences Institute, Medical College of Wisconsin, Milwaukee, Wisconsin, USA.,Department of Biochemistry, Medical College of Wisconsin, Milwaukee, Wisconsin, USA
| |
Collapse
|
7
|
Shao Q, Jiang Y, Yang ZJ. EnzyHTP: A High-Throughput Computational Platform for Enzyme Modeling. J Chem Inf Model 2022; 62:647-655. [DOI: 10.1021/acs.jcim.1c01424] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Affiliation(s)
- Qianzhen Shao
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Yaoyukun Jiang
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Zhongyue J. Yang
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United States
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee 37235, United States
- Vanderbilt Institute of Chemical Biology, Vanderbilt University, Nashville, Tennessee 37235, United States
- Data Science Institute, Vanderbilt University, Nashville, Tennessee 37235, United States
| |
Collapse
|
8
|
Szabó PB, Sabanés Zariquiey F, Nogueira JJ. Cosolvent and Dynamic Effects in Binding Pocket Search by Docking Simulations. J Chem Inf Model 2021; 61:5508-5523. [PMID: 34730967 PMCID: PMC8659376 DOI: 10.1021/acs.jcim.1c00924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Indexed: 11/30/2022]
Abstract
The lack of conformational sampling in virtual screening projects can lead to inefficient results because many of the potential drugs may not be able to bind to the target protein during the static docking simulations. Here, we performed ensemble docking for around 2000 United States Food and Drug Administration (FDA)-approved drugs with the RNA-dependent RNA polymerase (RdRp) protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) as a target. The representative protein structures were generated by clustering classical molecular dynamics trajectories, which were evolved using three solvent scenarios, namely, pure water, benzene/water and phenol/water mixtures. The introduction of dynamic effects in the theoretical model showed improvement in docking results in terms of the number of strong binders and binding sites in the protein. Some of the discovered pockets were found only for the cosolvent simulations, where the nonpolar probes induced local conformational changes in the protein that lead to the opening of transient pockets. In addition, the selection of the ligands based on a combination of the binding free energy and binding free energy gap between the best two poses for each ligand provided more suitable binders than the selection of ligands based solely on one of the criteria. The application of cosolvent molecular dynamics to enhance the sampling of the configurational space is expected to improve the efficacy of virtual screening campaigns of future drug discovery projects.
Collapse
Affiliation(s)
- P. Bernát Szabó
- Department
of Chemistry, KU Leuven, Celestijnenlaan 200F, 3001 Leuven, Belgium
- Department
of Chemistry, Universidad Autónoma
de Madrid, Calle Francisco Tomás y Valiente, 7, 28049 Madrid, Spain
| | | | - Juan J. Nogueira
- Department
of Chemistry, Universidad Autónoma
de Madrid, Calle Francisco Tomás y Valiente, 7, 28049 Madrid, Spain
- IADCHEM,
Institute for Advanced Research in Chemistry, Universidad Autónoma de Madrid, Calle Francisco Tomás y Valiente, 7, 28049 Madrid, Spain
| |
Collapse
|
9
|
Abstract
Markov chain Monte Carlo methods are a powerful tool for sampling equilibrium configurations in complex systems. One problem these methods often face is slow convergence over large energy barriers. In this work, we propose a novel method that increases convergence in systems composed of many metastable states. This method aims to connect metastable regions directly using generative neural networks in order to propose new configurations in the Markov chain and optimizes the acceptance probability of large jumps between modes in the configuration space. We provide a comprehensive theory as well as a training scheme for the network and demonstrate the method on example systems.
Collapse
Affiliation(s)
- Luigi Sbailò
- Freie Universität Berlin, Department of Mathematics and Computer Science, Arnimallee 6, 14195 Berlin, Germany
| | - Manuel Dibak
- Freie Universität Berlin, Department of Mathematics and Computer Science, Arnimallee 6, 14195 Berlin, Germany
| | - Frank Noé
- Freie Universität Berlin, Department of Mathematics and Computer Science, Arnimallee 6, 14195 Berlin, Germany
| |
Collapse
|
10
|
Pietrek LM, Stelzl LS, Hummer G. Hierarchical Ensembles of Intrinsically Disordered Proteins at Atomic Resolution in Molecular Dynamics Simulations. J Chem Theory Comput 2019; 16:725-737. [PMID: 31809054 DOI: 10.1021/acs.jctc.9b00809] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Intrinsically disordered proteins (IDPs) constitute a large fraction of the human proteome and are critical in the regulation of cellular processes. A detailed understanding of the conformational dynamics of IDPs could help to elucidate their roles in health and disease. However, the inherent flexibility of IDPs makes structural studies and their interpretation challenging. Molecular dynamics (MD) simulations could address this challenge in principle, but inaccuracies in the simulation models and the need for long simulations have stymied progress. To overcome these limitations, we adopt a hierarchical approach that builds on the "flexible-meccano" model reported by Bernadó et al. (J. Am. Chem. Soc. 2005, 127, 17968-17969). First, we exhaustively sample small IDP fragments in all-atom simulations to capture their local structures. Then, we assemble the fragments into full-length IDPs to explore the stereochemically possible global structures of IDPs. The resulting ensembles of three-dimensional structures of full-length IDPs are highly diverse, much more so than in standard MD simulation. For the paradigmatic IDP α-synuclein, our ensemble captures both the local structure, as probed by nuclear magnetic resonance spectroscopy, and its overall dimension, as obtained from small-angle X-ray scattering in solution. By generating representative and meaningful starting ensembles, we can begin to exploit the massive parallelism afforded by current and future high-performance computing resources for atomic-resolution characterization of IDPs.
Collapse
Affiliation(s)
- Lisa M Pietrek
- Department of Theoretical Biophysics , Max Planck Institute of Biophysics , Max-von-Laue Straße 3 , 60438 Frankfurt am Main , Germany
| | - Lukas S Stelzl
- Department of Theoretical Biophysics , Max Planck Institute of Biophysics , Max-von-Laue Straße 3 , 60438 Frankfurt am Main , Germany
| | - Gerhard Hummer
- Department of Theoretical Biophysics , Max Planck Institute of Biophysics , Max-von-Laue Straße 3 , 60438 Frankfurt am Main , Germany.,Institute for Biophysics , Goethe University Frankfurt , 60438 Frankfurt am Main , Germany
| |
Collapse
|
11
|
A deep learning framework to predict binding preference of RNA constituents on protein surface. Nat Commun 2019; 10:4941. [PMID: 31666519 PMCID: PMC6821705 DOI: 10.1038/s41467-019-12920-0] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Accepted: 10/08/2019] [Indexed: 12/21/2022] Open
Abstract
Protein-RNA interaction plays important roles in post-transcriptional regulation. However, the task of predicting these interactions given a protein structure is difficult. Here we show that, by leveraging a deep learning model NucleicNet, attributes such as binding preference of RNA backbone constituents and different bases can be predicted from local physicochemical characteristics of protein structure surface. On a diverse set of challenging RNA-binding proteins, including Fem-3-binding-factor 2, Argonaute 2 and Ribonuclease III, NucleicNet can accurately recover interaction modes discovered by structural biology experiments. Furthermore, we show that, without seeing any in vitro or in vivo assay data, NucleicNet can still achieve consistency with experiments, including RNAcompete, Immunoprecipitation Assay, and siRNA Knockdown Benchmark. NucleicNet can thus serve to provide quantitative fitness of RNA sequences for given binding pockets or to predict potential binding pockets and binding RNAs for previously unknown RNA binding proteins.
Collapse
|
12
|
Schwarz D, Merget B, Deane C, Fulle S. Modeling conformational flexibility of kinases in inactive states. Proteins 2019; 87:943-951. [PMID: 31168936 PMCID: PMC6852311 DOI: 10.1002/prot.25756] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2019] [Accepted: 05/26/2019] [Indexed: 02/02/2023]
Abstract
Kinase structures in the inactive "DFG-out" state provide a wealth of druggable binding site variants. The conformational plasticity of this state can be mainly described by different conformations of binding site-forming elements such as DFG motif, A-loop, P-loop, and αC-helix. Compared to DFG-in structures, DFG-out structures are largely underrepresented in the Protein Data Bank (PDB). Thus, structure-based drug design efforts for DFG-out inhibitors may benefit from an efficient approach to generate an ensemble of DFG-out structures. Accordingly, the presented modeling pipeline systematically generates homology models of kinases in several DFG-out conformations based on a sophisticated creation of template structures that represent the major states of the flexible structural elements. Eighteen template classes were initially selected from all available kinase structures in the PDB and subsequently employed for modeling the entire kinome in different DFG-out variants by fusing individual structural elements to multiple chimeric template structures. Molecular dynamics simulations revealed that conformational transitions between the different DFG-out states generally do not occur within trajectories of a few hundred nanoseconds length. This underlines the benefits of the presented homology modeling pipeline to generate relevant conformations of "DFG-out" kinase structures for subsequent in silico screening or binding site analysis studies.
Collapse
Affiliation(s)
- Dominik Schwarz
- BioMed X Innovation Center, Heidelberg, Germany.,Department of Statistics, University of Oxford, Oxford, UK
| | | | | | | |
Collapse
|
13
|
Chen S, Wiewiora RP, Meng F, Babault N, Ma A, Yu W, Qian K, Hu H, Zou H, Wang J, Fan S, Blum G, Pittella-Silva F, Beauchamp KA, Tempel W, Jiang H, Chen K, Skene RJ, Zheng YG, Brown PJ, Jin J, Luo C, Chodera JD, Luo M. The dynamic conformational landscape of the protein methyltransferase SETD8. eLife 2019; 8:45403. [PMID: 31081496 PMCID: PMC6579520 DOI: 10.7554/elife.45403] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2019] [Accepted: 05/08/2019] [Indexed: 12/27/2022] Open
Abstract
Elucidating the conformational heterogeneity of proteins is essential for understanding protein function and developing exogenous ligands. With the rapid development of experimental and computational methods, it is of great interest to integrate these approaches to illuminate the conformational landscapes of target proteins. SETD8 is a protein lysine methyltransferase (PKMT), which functions in vivo via the methylation of histone and nonhistone targets. Utilizing covalent inhibitors and depleting native ligands to trap hidden conformational states, we obtained diverse X-ray structures of SETD8. These structures were used to seed distributed atomistic molecular dynamics simulations that generated a total of six milliseconds of trajectory data. Markov state models, built via an automated machine learning approach and corroborated experimentally, reveal how slow conformational motions and conformational states are relevant to catalysis. These findings provide molecular insight on enzymatic catalysis and allosteric mechanisms of a PKMT via its detailed conformational landscape. Our cells contain thousands of proteins that perform many different tasks. Such tasks often involve significant changes in the shape of a protein that allow it to interact with other proteins or ligands. Understanding these shape changes can be an essential step for predicting and manipulating how proteins work or designing new drugs. Some changes in protein shape happen quickly, whereas others take longer. Existing experimental approaches generally only capture some, but not all, of the different shapes an individual protein adopts. A family of proteins known as protein lysine methyltransferases (PKMTs) help to regulate the activities of other proteins by adding small tags called methyl groups to specific positions on their target proteins. PKMTs play important roles in many life processes including in activating genes, maintaining stem cells and controlling how organs develop. It is important for cells to properly control the activity of PKMTs because too much, or too little, activity can promote cancers and neurological diseases. For example, genetic mutations that increase the levels of a PKMT known as SETD8 appear to promote the progression of some breast cancers and childhood leukemia. There is a pressing need to develop new drugs that can inhibit SETD8 and other PKMTs in human patients. However, these efforts are hindered by the lack of understanding of exactly how the shape of PKMT proteins change as they operate in cells. Chen, Wiewiora et al. used a technique called X-ray crystallography to generate structural models of the human SETD8 protein in the presence or absence of native or foreign ligands. These models were used to develop computer simulations of how the shape of SETD8 changes as it operates. Further computational analysis and laboratory experiments revealed how slow changes in the shape of SETD8 contribute to the ability of the protein to attach methyl groups to other proteins. This work is a significant stepping-stone to developing a complete model of how the SETD8 protein works, as well as understanding how genetic mutations may affect the protein’s role in the body. The next step is to refine the model by integrating data from other approaches including biophysical models and mathematical calculations of the energy associated with the shape changes, with a long-term goal to better understand and then manipulate the function of SETD8.
Collapse
Affiliation(s)
- Shi Chen
- Tri-Institutional PhD Program in Chemical Biology, Memorial Sloan Kettering Cancer Center, New York, United States.,Chemical Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Rafal P Wiewiora
- Tri-Institutional PhD Program in Chemical Biology, Memorial Sloan Kettering Cancer Center, New York, United States.,Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Fanwang Meng
- Drug Discovery and Design Center, CAS Key Laboratory of Receptor Research, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China
| | - Nicolas Babault
- Mount Sinai Center for Therapeutics Discovery, Icahn School of Medicine at Mount Sinai, New York, United States.,Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, United States.,Department of Oncological Sciences, Icahn School of Medicine at Mount Sinai, New York, United States.,Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, United States
| | - Anqi Ma
- Mount Sinai Center for Therapeutics Discovery, Icahn School of Medicine at Mount Sinai, New York, United States.,Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, United States.,Department of Oncological Sciences, Icahn School of Medicine at Mount Sinai, New York, United States.,Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, United States
| | - Wenyu Yu
- Structural Genomics Consortium, University of Toronto, Toronto, Canada
| | - Kun Qian
- Department of Pharmaceutical and Biomedical Sciences, University of Georgia, Athens, United States
| | - Hao Hu
- Department of Pharmaceutical and Biomedical Sciences, University of Georgia, Athens, United States
| | - Hua Zou
- Takeda California, Science Center Drive, San Diego, United States
| | - Junyi Wang
- Chemical Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Shijie Fan
- Drug Discovery and Design Center, CAS Key Laboratory of Receptor Research, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Gil Blum
- Chemical Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Fabio Pittella-Silva
- Chemical Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Kyle A Beauchamp
- Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Wolfram Tempel
- Structural Genomics Consortium, University of Toronto, Toronto, Canada
| | - Hualiang Jiang
- Drug Discovery and Design Center, CAS Key Laboratory of Receptor Research, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Kaixian Chen
- Drug Discovery and Design Center, CAS Key Laboratory of Receptor Research, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Robert J Skene
- Takeda California, Science Center Drive, San Diego, United States
| | - Yujun George Zheng
- Department of Pharmaceutical and Biomedical Sciences, University of Georgia, Athens, United States
| | - Peter J Brown
- Structural Genomics Consortium, University of Toronto, Toronto, Canada
| | - Jian Jin
- Mount Sinai Center for Therapeutics Discovery, Icahn School of Medicine at Mount Sinai, New York, United States.,Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, United States.,Department of Oncological Sciences, Icahn School of Medicine at Mount Sinai, New York, United States.,Tisch Cancer Institute, Icahn School of Medicine at Mount Sinai, New York, United States
| | - Cheng Luo
- Drug Discovery and Design Center, CAS Key Laboratory of Receptor Research, State Key Laboratory of Drug Research, Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China.,University of Chinese Academy of Sciences, Beijing, China
| | - John D Chodera
- Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States
| | - Minkui Luo
- Chemical Biology Program, Memorial Sloan Kettering Cancer Center, New York, United States.,Program of Pharmacology, Weill Cornell Medical College of Cornell University, New York, United States
| |
Collapse
|
14
|
Hanson SM, Georghiou G, Thakur MK, Miller WT, Rest JS, Chodera JD, Seeliger MA. What Makes a Kinase Promiscuous for Inhibitors? Cell Chem Biol 2019; 26:390-399.e5. [PMID: 30612951 DOI: 10.1016/j.chembiol.2018.11.005] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2018] [Revised: 09/13/2018] [Accepted: 11/06/2018] [Indexed: 10/27/2022]
Abstract
ATP-competitive kinase inhibitors often bind several kinases due to the high conservation of the ATP binding pocket. Through clustering analysis of a large kinome profiling dataset, we found a cluster of eight promiscuous kinases that on average bind more than five times more kinase inhibitors than the other 398 kinases in the dataset. To understand the structural basis of promiscuous inhibitor binding, we determined the co-crystal structure of the receptor tyrosine kinase DDR1 with the type I inhibitors dasatinib and VX-680. Surprisingly, we find that DDR1 binds these type I inhibitors in an inactive conformation typically reserved for type II inhibitors. Our computational and biochemical studies show that DDR1 is unusually stable in this inactive conformation, giving a mechanistic explanation for inhibitor promiscuity. This phenotypic clustering analysis provides a strategy to obtain functional insights not available by sequence comparison alone.
Collapse
Affiliation(s)
- Sonya M Hanson
- Department of Pharmacological Sciences, Stony Brook University, Stony Brook, NY 11794-8651, USA; Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY 10065-1115, USA
| | - George Georghiou
- Department of Pharmacological Sciences, Stony Brook University, Stony Brook, NY 11794-8651, USA
| | - Manish K Thakur
- Department of Pharmacological Sciences, Stony Brook University, Stony Brook, NY 11794-8651, USA
| | - W Todd Miller
- Department of Physiology and Biophysics, Stony Brook University, Stony Brook, NY 11794-8651, USA
| | - Joshua S Rest
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794-5245, USA
| | - John D Chodera
- Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY 10065-1115, USA.
| | - Markus A Seeliger
- Department of Pharmacological Sciences, Stony Brook University, Stony Brook, NY 11794-8651, USA.
| |
Collapse
|
15
|
Amaro RE, Baudry J, Chodera J, Demir Ö, McCammon JA, Miao Y, Smith JC. Ensemble Docking in Drug Discovery. Biophys J 2018; 114:2271-2278. [PMID: 29606412 DOI: 10.1016/j.bpj.2018.02.038] [Citation(s) in RCA: 263] [Impact Index Per Article: 43.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Revised: 02/13/2018] [Accepted: 02/20/2018] [Indexed: 12/11/2022] Open
Abstract
Ensemble docking corresponds to the generation of an "ensemble" of drug target conformations in computational structure-based drug discovery, often obtained by using molecular dynamics simulation, that is used in docking candidate ligands. This approach is now well established in the field of early-stage drug discovery. This review gives a historical account of the development of ensemble docking and discusses some pertinent methodological advances in conformational sampling.
Collapse
Affiliation(s)
- Rommie E Amaro
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California
| | - Jerome Baudry
- University of Alabama at Huntsville, Huntsville, Alabama
| | - John Chodera
- University of California, Berkeley, Berkeley, California
| | - Özlem Demir
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California
| | - J Andrew McCammon
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California
| | - Yinglong Miao
- Department of Computational Biology and Molecular Biosciences, University of Kansas, Lawrence, Kansas
| | - Jeremy C Smith
- UT/ORNL Center for Molecular Biophysics, Oak Ridge National Laboratory, Oak Ridge, Tennessee; Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, Tennessee.
| |
Collapse
|
16
|
Membrane proteins structures: A review on computational modeling tools. BIOCHIMICA ET BIOPHYSICA ACTA-BIOMEMBRANES 2017; 1859:2021-2039. [DOI: 10.1016/j.bbamem.2017.07.008] [Citation(s) in RCA: 62] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/23/2017] [Revised: 07/04/2017] [Accepted: 07/13/2017] [Indexed: 01/02/2023]
|
17
|
Doerr S, Giorgino T, Martínez-Rosell G, Damas JM, De Fabritiis G. High-Throughput Automated Preparation and Simulation of Membrane Proteins with HTMD. J Chem Theory Comput 2017; 13:4003-4011. [DOI: 10.1021/acs.jctc.7b00480] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Affiliation(s)
- Stefan Doerr
- Computational
Biophysics Laboratory (GRIB-IMIM), Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C/Doctor Aiguader 88, 08003 Barcelona, Spain
| | - Toni Giorgino
- Institute
of Neurosciences, National Research Council of Italy (IN-CNR), 35127 Padua, Italy
| | - Gerard Martínez-Rosell
- Computational
Biophysics Laboratory (GRIB-IMIM), Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C/Doctor Aiguader 88, 08003 Barcelona, Spain
| | - João M. Damas
- Acellera, Barcelona Biomedical Research Park
(PRBB), C/Doctor Aiguader 88, 08003 Barcelona, Spain
| | - Gianni De Fabritiis
- Computational
Biophysics Laboratory (GRIB-IMIM), Universitat Pompeu Fabra, Barcelona Biomedical Research Park (PRBB), C/Doctor Aiguader 88, 08003 Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluis Companys 23, Barcelona 08010, Spain
| |
Collapse
|
18
|
McGibbon RT, Beauchamp KA, Harrigan MP, Klein C, Swails JM, Hernández CX, Schwantes CR, Wang LP, Lane TJ, Pande VS. MDTraj: A Modern Open Library for the Analysis of Molecular Dynamics Trajectories. Biophys J 2016; 109:1528-32. [PMID: 26488642 DOI: 10.1016/j.bpj.2015.08.015] [Citation(s) in RCA: 1292] [Impact Index Per Article: 161.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2015] [Revised: 07/24/2015] [Accepted: 08/10/2015] [Indexed: 10/22/2022] Open
Abstract
As molecular dynamics (MD) simulations continue to evolve into powerful computational tools for studying complex biomolecular systems, the necessity of flexible and easy-to-use software tools for the analysis of these simulations is growing. We have developed MDTraj, a modern, lightweight, and fast software package for analyzing MD simulations. MDTraj reads and writes trajectory data in a wide variety of commonly used formats. It provides a large number of trajectory analysis capabilities including minimal root-mean-square-deviation calculations, secondary structure assignment, and the extraction of common order parameters. The package has a strong focus on interoperability with the wider scientific Python ecosystem, bridging the gap between MD data and the rapidly growing collection of industry-standard statistical analysis and visualization tools in Python. MDTraj is a powerful and user-friendly software package that simplifies the analysis of MD data and connects these datasets with the modern interactive data science software ecosystem in Python.
Collapse
Affiliation(s)
| | - Kyle A Beauchamp
- Computational Biology Program, Sloan-Kettering Institute, New York, New York
| | | | - Christoph Klein
- Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville, Tennessee
| | - Jason M Swails
- Department of Chemistry, Rutgers University, Piscataway, New Jersey
| | | | | | - Lee-Ping Wang
- Department of Chemistry, University of California, Davis, Davis, California
| | - Thomas J Lane
- SLAC National Accelerator Laboratory, Menlo Park, California
| | - Vijay S Pande
- Department of Chemistry, Stanford University, Stanford, California; Biophysics Program, Stanford University, Stanford, California
| |
Collapse
|