1
|
Ravichandran A, Araque JC, Lawson JW. Predicting the functional state of protein kinases using interpretable graph neural networks from sequence and structural data. Proteins 2024; 92:623-636. [PMID: 38083830 DOI: 10.1002/prot.26641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2023] [Revised: 10/13/2023] [Accepted: 11/09/2023] [Indexed: 04/13/2024]
Abstract
Protein kinases are central to cellular activities and are actively pursued as drug targets for several conditions including cancer and autoimmune diseases. Despite the availability of a large structural database for kinases, methodologies to elucidate the structure-function relationship of these proteins (without manual intervention) are lacking. Such techniques are essential in structural biology and to accelerate drug discovery efforts. Here, we implement an interpretable graph neural network (GNN) framework for classifying the functionally active and inactive states of a large set of protein kinases by only using their tertiary structure and amino acid sequence. We show that the GNN models can classify kinase structures with high accuracy (>97%). We implement the Gradient-weighted Class Activation Mapping for graphs (Graph Grad-CAM) to automatically identify structurally important residues and residue-residue contacts of the kinases without any a priori input. We show that the motifs identified through the Graph Grad-CAM methodology are functionally critical, consistent with the existing kinase literature. Notably, the highly conserved DFG and HRD motifs of the well-known hydrophobic spine are identified by the interpretable framework in addition to some of the lesser known motifs. Further, using Grad-CAM maps as the vector embedding of the protein structures, we identify the subtle differences in the crystal structures among different sub-classes of kinases in the Protein Data Bank (PDB). Frameworks such as the one implemented here, for high-throughput identification of protein structure-function relationships are essential in designing targeted small molecules therapies as well as in engineering new proteins for novel applications.
Collapse
Affiliation(s)
- Ashwin Ravichandran
- KBR Inc., Intelligent Systems Division, NASA Ames Research Center, Moffett Field, California, USA
| | - Juan C Araque
- KBR Inc., Intelligent Systems Division, NASA Ames Research Center, Moffett Field, California, USA
| | - John W Lawson
- Intelligent Systems Division, NASA Ames Research Center, Moffett Field, California, USA
| |
Collapse
|
2
|
Reveguk I, Simonson T. Classifying protein kinase conformations with machine learning. Protein Sci 2024; 33:e4918. [PMID: 38501429 PMCID: PMC10962494 DOI: 10.1002/pro.4918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 01/02/2024] [Accepted: 01/22/2024] [Indexed: 03/20/2024]
Abstract
Protein kinases are key actors of signaling networks and important drug targets. They cycle between active and inactive conformations, distinguished by a few elements within the catalytic domain. One is the activation loop, whose conserved DFG motif can occupy DFG-in, DFG-out, and some rarer conformations. Annotation and classification of the structural kinome are important, as different conformations can be targeted by different inhibitors and activators. Valuable resources exist; however, large-scale applications will benefit from increased automation and interpretability of structural annotation. Interpretable machine learning models are described for this purpose, based on ensembles of decision trees. To train them, a set of catalytic domain sequences and structures was collected, somewhat larger and more diverse than existing resources. The structures were clustered based on the DFG conformation and manually annotated. They were then used as training input. Two main models were constructed, which distinguished active/inactive and in/out/other DFG conformations. They considered initially 1692 structural variables, spanning the whole catalytic domain, then identified ("learned") a small subset that sufficed for accurate classification. The first model correctly labeled all but 3 of 3289 structures as active or inactive, while the second assigned the correct DFG label to all but 17 of 8826 structures. The most potent classifying variables were all related to well-known structural elements in or near the activation loop and their ranking gives insights into the conformational preferences. The models were used to automatically annotate 3850 kinase structures predicted recently with the Alphafold2 tool, showing that Alphafold2 reproduced the active/inactive but not the DFG-in proportions seen in the Protein Data Bank. We expect the models will be useful for understanding and engineering kinases.
Collapse
Affiliation(s)
- Ivan Reveguk
- Laboratoire de Biologie Structurale de la Cellule (CNRS UMR7654)Ecole PolytechniquePalaiseauFrance
| | - Thomas Simonson
- Laboratoire de Biologie Structurale de la Cellule (CNRS UMR7654)Ecole PolytechniquePalaiseauFrance
| |
Collapse
|
3
|
Pratap Reddy Gajulapalli V. Development of Kinase-Centric Drugs: A Computational Perspective. ChemMedChem 2023; 18:e202200693. [PMID: 37442809 DOI: 10.1002/cmdc.202200693] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Revised: 07/12/2023] [Accepted: 07/12/2023] [Indexed: 07/15/2023]
Abstract
Kinases are prominent drug targets in the pharmaceutical and research community due to their involvement in signal transduction, physiological responses, and upon dysregulation, in diseases such as cancer, neurological and autoimmune disorders. Several FDA-approved small-molecule drugs have been developed to combat human diseases since Gleevec was approved for the treatment of chronic myelogenous leukemia. Kinases were considered "undruggable" in the beginning. Several FDA-approved small-molecule drugs have become available in recent years. Most of these drugs target ATP-binding sites, but a few target allosteric sites. Among kinases that belong to the same family, the catalytic domain shows high structural and sequence conservation. Inhibitors of ATP-binding sites can cause off-target binding. Because members of the same family have similar sequences and structural patterns, often complex relationships between kinases and inhibitors are observed. To design and develop drugs with desired selectivity, it is essential to understand the target selectivity for kinase inhibitors. To create new inhibitors with the desired selectivity, several experimental methods have been designed to profile the kinase selectivity of small molecules. Experimental approaches are often expensive, laborious, time-consuming, and limited by the available kinases. Researchers have used computational methodologies to address these limitations in the design and development of effective therapeutics. Many computational methods have been developed over the last few decades, either to complement experimental findings or to forecast kinase inhibitor activity and selectivity. The purpose of this review is to provide insight into recent advances in theoretical/computational approaches for the design of new kinase inhibitors with the desired selectivity and optimization of existing inhibitors.
Collapse
|
4
|
Majumdar S, Di Palma F, Spyrakis F, Decherchi S, Cavalli A. Molecular Dynamics and Machine Learning Give Insights on the Flexibility-Activity Relationships in Tyrosine Kinome. J Chem Inf Model 2023; 63:4814-4826. [PMID: 37462363 PMCID: PMC10428216 DOI: 10.1021/acs.jcim.3c00738] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Indexed: 08/15/2023]
Abstract
Tyrosine kinases are a subfamily of kinases with critical roles in cellular machinery. Dysregulation of their active or inactive forms is associated with diseases like cancer. This study aimed to holistically understand their flexibility-activity relationships, focusing on pockets and fluctuations. We studied 43 different tyrosine kinases by collecting 120 μs of molecular dynamics simulations, pocket and residue fluctuation analysis, and a complementary machine learning approach. We found that the inactive forms often have increased flexibility, particularly at the DFG motif level. Noteworthy, thanks to these long simulations combined with a decision tree, we identified a semiquantitative fluctuation threshold of the DGF+3 residue over which the kinase has a higher probability to be in the inactive form.
Collapse
Affiliation(s)
- Sarmistha Majumdar
- Computational
& Chemical Biology, Fondazione Istituto
Italiano di Tecnologia, Via Morego 30, I-16163 Genova, Italy
| | - Francesco Di Palma
- Computational
& Chemical Biology, Fondazione Istituto
Italiano di Tecnologia, Via Morego 30, I-16163 Genova, Italy
| | - Francesca Spyrakis
- Department
of Drug Science and Technology, University
of Turin, via Giuria
9, I-10125 Turin, Italy
| | - Sergio Decherchi
- Data
Science and Computation, Fondazione Istituto
Italiano di Tecnologia, Via Morego 30, I-16163 Genova, Italy
| | - Andrea Cavalli
- Computational
& Chemical Biology, Fondazione Istituto
Italiano di Tecnologia, Via Morego 30, I-16163 Genova, Italy
- Department
of Pharmacy and Biotechnology, University
of Bologna, Via Belmeloro 6, I-40126 Bologna, Italy
| |
Collapse
|
5
|
Durairaj J, de Ridder D, van Dijk AD. Beyond sequence: Structure-based machine learning. Comput Struct Biotechnol J 2022; 21:630-643. [PMID: 36659927 PMCID: PMC9826903 DOI: 10.1016/j.csbj.2022.12.039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Revised: 12/21/2022] [Accepted: 12/21/2022] [Indexed: 12/31/2022] Open
Abstract
Recent breakthroughs in protein structure prediction demarcate the start of a new era in structural bioinformatics. Combined with various advances in experimental structure determination and the uninterrupted pace at which new structures are published, this promises an age in which protein structure information is as prevalent and ubiquitous as sequence. Machine learning in protein bioinformatics has been dominated by sequence-based methods, but this is now changing to make use of the deluge of rich structural information as input. Machine learning methods making use of structures are scattered across literature and cover a number of different applications and scopes; while some try to address questions and tasks within a single protein family, others aim to capture characteristics across all available proteins. In this review, we look at the variety of structure-based machine learning approaches, how structures can be used as input, and typical applications of these approaches in protein biology. We also discuss current challenges and opportunities in this all-important and increasingly popular field.
Collapse
Affiliation(s)
- Janani Durairaj
- Biozentrum, University of Basel, Basel, Switzerland
- Bioinformatics Group, Department of Plant Sciences, Wageningen University and Research, Wageningen, the Netherlands
| | - Dick de Ridder
- Bioinformatics Group, Department of Plant Sciences, Wageningen University and Research, Wageningen, the Netherlands
| | - Aalt D.J. van Dijk
- Bioinformatics Group, Department of Plant Sciences, Wageningen University and Research, Wageningen, the Netherlands
| |
Collapse
|
6
|
Burastero O, Cabrera M, Lopez ED, Defelipe LA, Arcon JP, Durán R, Marti MA, Turjanski AG. Specificity and Reactivity of Mycobacterium tuberculosis Serine/Threonine Kinases PknG and PknB. J Chem Inf Model 2022; 62:1723-1733. [PMID: 35319884 DOI: 10.1021/acs.jcim.1c01358] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Mycobacterium tuberculosis (Mtb), the causative agent of Tuberculosis, has 11 eukaryotic-like serine/threonine protein kinases, which play essential roles in cell growth, signal transduction, and pathogenesis. Protein kinase G (PknG) regulates the carbon and nitrogen metabolism by phosphorylation of the glycogen accumulation regulator (GarA) protein at Thr21. Protein kinase B (PknB) is involved in cell wall synthesis and cell shape, as well as phosphorylates GarA but at Thr22. While PknG seems to be constitutively activated and recognition of GarA requires phosphorylation in its unstructured tail, PknB activation is triggered by phosphorylation of its activation loop, which allows binding of the forkhead-associated domain of GarA. In the present work, we used molecular dynamics and quantum-mechanics/molecular mechanics simulations of the catalytically competent complex and kinase activity assays to understand PknG/PknB specificity and reactivity toward GarA. Two hydrophobic residues in GarA, Val24 and Phe25, seem essential for PknG binding and allow specificity for Thr21 phosphorylation. On the other hand, phosphorylated residues in PknB bind Arg26 in GarA and regulate its specificity for Thr22. We also provide a detailed analysis of the free energy profile for the phospho-transfer reaction and show why PknG has a constitutively active conformation not requiring priming phosphorylation in contrast to PknB. Our results provide new insights into these two key enzymes relevant for Mtb and the mechanisms of serine/threonine phosphorylation in bacteria.
Collapse
Affiliation(s)
- Osvaldo Burastero
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Pabellón 2, C1428EGA Buenos Aires, Argentina.,Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Ciudad Universitaria, Pabellón 2, C1428EGA Buenos Aires, Argentina
| | - Marisol Cabrera
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Pabellón 2, C1428EGA Buenos Aires, Argentina.,Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Ciudad Universitaria, Pabellón 2, C1428EGA Buenos Aires, Argentina
| | - Elias D Lopez
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Pabellón 2, C1428EGA Buenos Aires, Argentina.,Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Ciudad Universitaria, Pabellón 2, C1428EGA Buenos Aires, Argentina
| | - Lucas A Defelipe
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Pabellón 2, C1428EGA Buenos Aires, Argentina.,Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Ciudad Universitaria, Pabellón 2, C1428EGA Buenos Aires, Argentina
| | - Juan Pablo Arcon
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Pabellón 2, C1428EGA Buenos Aires, Argentina.,Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Ciudad Universitaria, Pabellón 2, C1428EGA Buenos Aires, Argentina
| | - Rosario Durán
- Institut Pasteur de Montevideo, 11400 Montevideo, Uruguay.,Instituto de Investigaciones BiológicasClemente Estable, 11600 Montevideo, Uruguay
| | - Marcelo A Marti
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Pabellón 2, C1428EGA Buenos Aires, Argentina.,Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Ciudad Universitaria, Pabellón 2, C1428EGA Buenos Aires, Argentina
| | - Adrian G Turjanski
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, Pabellón 2, C1428EGA Buenos Aires, Argentina.,Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Ciudad Universitaria, Pabellón 2, C1428EGA Buenos Aires, Argentina
| |
Collapse
|
7
|
Modi V, Dunbrack RL. Kincore: a web resource for structural classification of protein kinases and their inhibitors. Nucleic Acids Res 2022; 50:D654-D664. [PMID: 34643709 PMCID: PMC8728253 DOI: 10.1093/nar/gkab920] [Citation(s) in RCA: 35] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 09/21/2021] [Accepted: 09/28/2021] [Indexed: 11/13/2022] Open
Abstract
The active form of kinases is shared across different family members, as are several commonly observed inactive forms. We previously performed a clustering of the conformation of the activation loop of all protein kinase structures in the Protein Data Bank (PDB) into eight classes based on the dihedral angles that place the Phe side chain of the DFG motif at the N-terminus of the activation loop. Our clusters are strongly associated with the placement of the activation loop, the C-helix, and other structural elements of kinases. We present Kincore, a web resource providing access to our conformational assignments for kinase structures in the PDB. While other available databases provide conformational states or drug type but not both, KinCore includes the conformational state and the inhibitor type (Type 1, 1.5, 2, 3, allosteric) for each kinase chain. The user can query and browse the database using these attributes or determine the conformational labels of a kinase structure using the web server or a standalone program. The database and labeled structure files can be downloaded from the server. Kincore will help in understanding the conformational dynamics of these proteins and guide development of inhibitors targeting specific states. Kincore is available at http://dunbrack.fccc.edu/kincore.
Collapse
Affiliation(s)
- Vivek Modi
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA 19148, USA
| | - Roland L Dunbrack
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA 19148, USA
| |
Collapse
|
8
|
Bhati AP, Wan S, Alfè D, Clyde AR, Bode M, Tan L, Titov M, Merzky A, Turilli M, Jha S, Highfield RR, Rocchia W, Scafuri N, Succi S, Kranzlmüller D, Mathias G, Wifling D, Donon Y, Di Meglio A, Vallecorsa S, Ma H, Trifan A, Ramanathan A, Brettin T, Partin A, Xia F, Duan X, Stevens R, Coveney PV. Pandemic drugs at pandemic speed: infrastructure for accelerating COVID-19 drug discovery with hybrid machine learning- and physics-based simulations on high-performance computers. Interface Focus 2021; 11:20210018. [PMID: 34956592 PMCID: PMC8504892 DOI: 10.1098/rsfs.2021.0018] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/07/2021] [Indexed: 12/13/2022] Open
Abstract
The race to meet the challenges of the global pandemic has served as a reminder that the existing drug discovery process is expensive, inefficient and slow. There is a major bottleneck screening the vast number of potential small molecules to shortlist lead compounds for antiviral drug development. New opportunities to accelerate drug discovery lie at the interface between machine learning methods, in this case, developed for linear accelerators, and physics-based methods. The two in silico methods, each have their own advantages and limitations which, interestingly, complement each other. Here, we present an innovative infrastructural development that combines both approaches to accelerate drug discovery. The scale of the potential resulting workflow is such that it is dependent on supercomputing to achieve extremely high throughput. We have demonstrated the viability of this workflow for the study of inhibitors for four COVID-19 target proteins and our ability to perform the required large-scale calculations to identify lead antiviral compounds through repurposing on a variety of supercomputers.
Collapse
Affiliation(s)
- Agastya P. Bhati
- Centre for Computational Science, University College London, Gordon Street, London WC1H 0AJ, UK
| | - Shunzhou Wan
- Centre for Computational Science, University College London, Gordon Street, London WC1H 0AJ, UK
| | - Dario Alfè
- Department of Earth Sciences, London Centre for Nanotechnology and Thomas Young Centre at University College London, University College London, Gower Street, London WC1E 6BT, UK
- Dipartimento di Fisica Ettore Pancini, Università di Napoli Federico II, Monte Sant'Angelo, Napoli 80126, Italy
| | - Austin R. Clyde
- Department of Computer Science, University of Chicago, Chicago, IL, USA
| | - Mathis Bode
- Institute for Combustion Technology, RWTH Aachen University, Aachen 52056, Germany
| | - Li Tan
- Brookhaven National Laboratory, Upton, NY 11973, USA
| | - Mikhail Titov
- Department of Electrical and Computer Engineering, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA
| | - Andre Merzky
- Department of Electrical and Computer Engineering, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA
| | - Matteo Turilli
- Department of Electrical and Computer Engineering, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA
| | - Shantenu Jha
- Brookhaven National Laboratory, Upton, NY 11973, USA
- Department of Electrical and Computer Engineering, Rutgers, the State University of New Jersey, Piscataway, NJ 08854, USA
| | | | - Walter Rocchia
- Concept Lab, Italian Institute of Technology, Via Melen, Genova, Italy
| | - Nicola Scafuri
- Concept Lab, Italian Institute of Technology, Via Melen, Genova, Italy
| | - Sauro Succi
- Center for Life Nanosciences at La Sapienza, Italian Institute of Technology, viale Regina Elena, Roma, Italy
| | - Dieter Kranzlmüller
- Leibniz Supercomputing Centre (LRZ) of the Bavarian Academy of Sciences and Humanities, Boltzmannstrasse 1, Garching bei München 85748, Germany
| | - Gerald Mathias
- Leibniz Supercomputing Centre (LRZ) of the Bavarian Academy of Sciences and Humanities, Boltzmannstrasse 1, Garching bei München 85748, Germany
| | - David Wifling
- Leibniz Supercomputing Centre (LRZ) of the Bavarian Academy of Sciences and Humanities, Boltzmannstrasse 1, Garching bei München 85748, Germany
| | | | | | | | - Heng Ma
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439, USA
| | - Anda Trifan
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439, USA
| | - Arvind Ramanathan
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439, USA
| | - Tom Brettin
- Computing, Environment and Life Sciences Directorate, Argonne National Laboratory, Lemont, IL 60439, USA
| | - Alexander Partin
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439, USA
| | - Fangfang Xia
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439, USA
| | - Xiaotan Duan
- Department of Computer Science, University of Chicago, Chicago, IL, USA
| | - Rick Stevens
- Computing, Environment and Life Sciences Directorate, Argonne National Laboratory, Lemont, IL 60439, USA
| | - Peter V. Coveney
- Centre for Computational Science, University College London, Gordon Street, London WC1H 0AJ, UK
- Institute for Informatics, University of Amsterdam, Science Park 904, Amsterdam 1098 XH, The Netherlands
| |
Collapse
|
9
|
Paul A, Subhadarshini S, Srinivasan N. Pseudokinases repurpose flexibility signatures associated with the protein kinase fold for noncatalytic roles. Proteins 2021; 90:747-764. [PMID: 34708889 DOI: 10.1002/prot.26271] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Revised: 09/22/2021] [Accepted: 10/18/2021] [Indexed: 01/27/2023]
Abstract
The bilobal protein kinase-like fold in pseudokinases lack one or more catalytic residues, conserved in canonical protein kinases, and are considered enzymatically deficient. Tertiary structures of pseudokinases reveal that their loops topologically equivalent to activation segments of kinases adopt contracted configurations, which is typically extended in active conformation of kinases. Herein, anisotropic network model based normal mode analysis (NMA) was conducted on 51 active conformation structures of protein kinases and 26 crystal structures of pseudokinases. Our observations indicate that although backbone fluctuation profiles are similar for individual kinase-pseudokinase families, low intensity mean square fluctuations in pseudo-activation segment and other sub-structures impart rigidity to pseudokinases. Analyses of collective motions from functional modes reveal that pseudokinases, compared to active kinases, undergo distinct conformational transitions using the same structural fold. All-atom NMA of protein kinase-pseudokinase pairs from each family, sharing high amino acid sequence identities, yielded distinct community clusters, partitioned by residues exhibiting highly correlated fluctuations. It appears that atomic fluctuations from equivalent activation segments guide community membership and network topologies for respective kinase and pseudokinase. Our findings indicate that such adaptations in backbone and side-chain fluctuations render pseudokinases competent for catalysis-independent roles.
Collapse
Affiliation(s)
- Anindita Paul
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, Karnataka, India
| | | | | |
Collapse
|
10
|
Maloney RC, Zhang M, Jang H, Nussinov R. The mechanism of activation of monomeric B-Raf V600E. Comput Struct Biotechnol J 2021; 19:3349-3363. [PMID: 34188782 PMCID: PMC8215184 DOI: 10.1016/j.csbj.2021.06.007] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2021] [Revised: 05/30/2021] [Accepted: 06/02/2021] [Indexed: 02/07/2023] Open
Abstract
Oncogenic mutations in the serine/threonine kinase B-Raf, particularly the V600E mutation, are frequent in cancer, making it a major drug target. Although much is known about B-Raf's active and inactive states, questions remain about the mechanism by which the protein changes between these two states. Here, we utilize molecular dynamics to investigate both wild-type and V600E B-Raf to gain mechanistic insights into the impact of the Val to Glu mutation. The results show that the wild-type and mutant follow similar activation pathways involving an extension of the activation loop and an inward motion of the αC-helix. The V600E mutation, however, destabilizes the inactive state by disrupting hydrophobic interactions present in the wild-type structure while the active state is stabilized through the formation of a salt bridge between Glu600 and Lys507. Additionally, when the activation loop is extended, the αC-helix is able to move between an inward and outward orientation as long as the DFG motif adopts a specific orientation. In that orientation Phe595 rotates away from the αC-helix, allowing the formation of a salt bridge between Lys483 and Glu501. These mechanistic insights have implications for the development of new Raf inhibitors.
Collapse
Affiliation(s)
- Ryan C. Maloney
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA
| | - Mingzhen Zhang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA
| | - Hyunbum Jang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA
| | - Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA
- Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
- Corresponding author at: Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Laboratory of Cancer Immunometabolism, National Cancer Institute, Frederick, MD 21702, USA.
| |
Collapse
|
11
|
Kharkar PS. Computational Approaches for the Design of (Mutant-)Selective Tyrosine Kinase Inhibitors: State-of-the-Art and Future Prospects. Curr Top Med Chem 2021; 20:1564-1575. [PMID: 32357816 DOI: 10.2174/1568026620666200502005853] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2020] [Revised: 03/10/2020] [Accepted: 03/26/2020] [Indexed: 02/08/2023]
Abstract
Kinases remain one of the major attractive therapeutic targets for a large number of indications such as cancer, rheumatoid arthritis, cardiac failure and many others. Design and development of kinase inhibitors (ATP-competitive, allosteric or covalent) is a clinically validated and successful strategy in the pharmaceutical industry. The perks come with limitations, particularly the development of resistance to highly potent and selective inhibitors. When this happens, the cycle needs to be repeated, i.e., the design and development of kinase inhibitors active against the mutated forms. The complexity of tumor milieu makes it awfully difficult for these molecularly-targeted therapies to work. Every year newer and better versions of these agents are introduced in the clinic. Several computational approaches such as structure-, ligand-based or hybrid ones continue to live up to their potential in discovering novel kinase inhibitors. New schools of thought in this area continue to emerge, e.g., development of dual-target kinase inhibitors. But there are fundamental issues with this approach. It is indeed difficult to selectively optimize binding at two entirely different or related kinases. In addition to the conventional strategies, modern technologies (machine learning, deep learning, artificial intelligence, etc.) started yielding the results and building success stories. Computational tools invariably played a critical role in catalysing the phenomenal progress in kinase drug discovery field. The present review summarized the progress in utilizing computational methods and tools for discovering (mutant-)selective tyrosine kinase inhibitor drugs in the last three years (2017-2019). Representative investigations have been discussed, while others are merely listed. The author believes that the enthusiastic reader will be inspired to dig out the cited literature extensively to appreciate the progress made so far and the future prospects of the field.
Collapse
Affiliation(s)
- Prashant S Kharkar
- Department of Pharmaceutical Sciences and Technology, Institute of Chemical Technology, Nathalal Parekh Marg, Matunga, Mumbai 400 019, India
| |
Collapse
|
12
|
Spitaleri A, Zia SR, Di Micco P, Al-Lazikani B, Soler MA, Rocchia W. Tuning Local Hydration Enables a Deeper Understanding of Protein-Ligand Binding: The PP1-Src Kinase Case. J Phys Chem Lett 2021; 12:49-58. [PMID: 33300337 PMCID: PMC7812613 DOI: 10.1021/acs.jpclett.0c03075] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2020] [Accepted: 12/03/2020] [Indexed: 05/13/2023]
Abstract
Water plays a key role in biomolecular recognition and binding. Despite the development of several computational and experimental approaches, it is still challenging to comprehensively characterize water-mediated effects on the binding process. Here, we investigate how water affects the binding of Src kinase to one of its inhibitors, PP1. Src kinase is a target for treating several diseases, including cancer. We use biased molecular dynamics simulations, where the hydration of predetermined regions is tuned at will. This computational technique efficiently accelerates the SRC-PP1 binding simulation and allows us to identify several key and yet unexplored aspects of the solvent's role. This study provides a further perspective on the binding phenomenon, which may advance the current drug design approaches for the development of new kinase inhibitors.
Collapse
Affiliation(s)
- Andrea Spitaleri
- CONCEPT
Lab, Istituto Italiano di Tecnologia, via Morego 30, Genoa I-16163, Italy
- Center
for Omics Sciences, Emerging Bacterial Pathogens Unit, IRCCS San Raffaele Scientific Institute, Milan, Italy
| | - Syeda R. Zia
- CONCEPT
Lab, Istituto Italiano di Tecnologia, via Morego 30, Genoa I-16163, Italy
- Dr.
Panjwani Center for Molecular Medicine and Drug Research, International
Center for Chemical and Biological Sciences, University of Karachi, Karachi 75270, Pakistan
| | - Patrizio Di Micco
- Cancer
Research UK Cancer Therapeutics Unit, The
Institute of Cancer Research, London SM2 5NG, U.K.
| | - Bissan Al-Lazikani
- Cancer
Research UK Cancer Therapeutics Unit, The
Institute of Cancer Research, London SM2 5NG, U.K.
| | - Miguel A. Soler
- CONCEPT
Lab, Istituto Italiano di Tecnologia, via Morego 30, Genoa I-16163, Italy
| | - Walter Rocchia
- CONCEPT
Lab, Istituto Italiano di Tecnologia, via Morego 30, Genoa I-16163, Italy
| |
Collapse
|
13
|
Abdelbaky I, Tayara H, Chong KT. Prediction of kinase inhibitors binding modes with machine learning and reduced descriptor sets. Sci Rep 2021; 11:706. [PMID: 33436888 PMCID: PMC7804204 DOI: 10.1038/s41598-020-80758-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2020] [Accepted: 12/28/2020] [Indexed: 02/06/2023] Open
Abstract
Protein kinases are receiving wide research interest, from drug perspective, due to their important roles in human body. Available kinase-inhibitor data, including crystallized structures, revealed many details about the mechanism of inhibition and binding modes. The understanding and analysis of these binding modes are expected to support the discovery of kinase-targeting drugs. The huge amounts of data made it possible to utilize computational techniques, including machine learning, to help in the discovery of kinase-targeting drugs. Machine learning gave reasonable predictions when applied to differentiate between the binding modes of kinase inhibitors, promoting a wider application in that domain. In this study, we applied machine learning supported by feature selection techniques to classify kinase inhibitors according to their binding modes. We represented inhibitors as a large number of molecular descriptors, as features, and systematically reduced these features in a multi-step manner while trying to attain high classification accuracy. Our predictive models could satisfy both goals by achieving high accuracy while utilizing at most 5% of the modeling features. The models could differentiate between binding mode types with MCC values between 0.67 and 0.92, and balanced accuracy values between 0.78 and 0.97 for independent test sets.
Collapse
Affiliation(s)
- Ibrahim Abdelbaky
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju, 54896, South Korea.,Agricultural Research Center, Giza, 12619, Egypt.,Advanced Electronics and Information Research Center, Jeonbuk National University, Jeonju, 54896, South Korea
| | - Hilal Tayara
- School of International Engineering and Science, Jeonbuk National University, Jeonju, 54896, South Korea.
| | - Kil To Chong
- Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju, 54896, South Korea. .,Advanced Electronics and Information Research Center, Jeonbuk National University, Jeonju, 54896, South Korea.
| |
Collapse
|
14
|
Jonniya NA, Sk MF, Kar P. Characterizing an allosteric inhibitor-induced inactive state in with-no-lysine kinase 1 using Gaussian accelerated molecular dynamics simulations. Phys Chem Chem Phys 2021; 23:7343-7358. [DOI: 10.1039/d0cp05733a] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
The binding of an allosteric inhibitor in WNK1 leads to the inactive state.
Collapse
Affiliation(s)
- Nisha Amarnath Jonniya
- Department of Biosciences and Biomedical Engineering, Indian Institute of Technology Indore, Khandwa Road, MP
- India
| | - Md Fulbabu Sk
- Department of Biosciences and Biomedical Engineering, Indian Institute of Technology Indore, Khandwa Road, MP
- India
| | - Parimal Kar
- Department of Biosciences and Biomedical Engineering, Indian Institute of Technology Indore, Khandwa Road, MP
- India
| |
Collapse
|
15
|
Paul F, Thomas T, Roux B. Diversity of Long-Lived Intermediates along the Binding Pathway of Imatinib to Abl Kinase Revealed by MD Simulations. J Chem Theory Comput 2020; 16:7852-7865. [PMID: 33147951 DOI: 10.1021/acs.jctc.0c00739] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Imatinib, a drug used for the treatment of chronic myeloid leukemia and other cancers, works by blocking the catalytic site of pathological constitutively active Abl kinase. While the binding pose is known from X-ray crystallography, the different steps leading to the formation of the complex are not well understood. The results from extensive molecular dynamics simulations show that imatinib can primarily exit the known crystallographic binding pose through the cleft of the binding site or by sliding under the αC helix. Once displaced from the crystallographic binding pose, imatinib becomes trapped in intermediate states. These intermediates are characterized by a high diversity of ligand orientations and conformations, and relaxation timescales within this region may exceed 3-4 ms. Analysis indicates that the metastable intermediate states should be spectroscopically indistinguishable from the crystallographic binding pose, in agreement with tryptophan stopped-flow fluorescence experiments.
Collapse
Affiliation(s)
- Fabian Paul
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, Illinois 60637, United States
| | - Trayder Thomas
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, Illinois 60637, United States
| | - Benoît Roux
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
16
|
Shrestha S, Byrne DP, Harris JA, Kannan N, Eyers PA. Cataloguing the dead: breathing new life into pseudokinase research. FEBS J 2020; 287:4150-4169. [PMID: 32053275 PMCID: PMC7586955 DOI: 10.1111/febs.15246] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Revised: 01/22/2020] [Accepted: 02/11/2020] [Indexed: 12/22/2022]
Abstract
Pseudoenzymes are present within many, but not all, known enzyme families and lack one or more conserved canonical amino acids that help define their catalytically active counterparts. Recent findings in the pseudokinase field confirm that evolutionary repurposing of the structurally defined bilobal protein kinase fold permits distinct biological functions to emerge, many of which rely on conformational switching, as opposed to canonical catalysis. In this analysis, we evaluate progress in evaluating several members of the 'dark' pseudokinome that are pertinent to help drive this expanding field. Initially, we discuss how adaptions in erythropoietin-producing hepatocellular carcinoma (Eph) receptor tyrosine kinase domains resulted in two vertebrate pseudokinases, EphA10 and EphB6, in which co-evolving sequences generate new motifs that are likely to be important for both nucleotide binding and catalysis-independent signalling. Secondly, we discuss how conformationally flexible Tribbles pseudokinases, which have radiated in the complex vertebrates, control fundamental aspects of cell signalling that may be targetable with covalent small molecules. Finally, we show how species-level adaptions in the duplicated canonical kinase protein serine kinase histone (PSKH)1 sequence have led to the appearance of the pseudokinase PSKH2, whose physiological role remains mysterious. In conclusion, we show how the patterns we discover are selectively conserved within specific pseudokinases, and that when they are modelled alongside closely related canonical kinases, many are found to be located in functionally important regions of the conserved kinase fold. Interrogation of these patterns will be useful for future evaluation of these, and other, members of the unstudied human kinome.
Collapse
Affiliation(s)
- Safal Shrestha
- Institute of BioinformaticsUniversity of GeorgiaAthensGAUSA
- Department of Biochemistry & Molecular BiologyUniversity of GeorgiaAthensGAUSA
| | - Dominic P. Byrne
- Department of BiochemistryInstitute of Integrative BiologyUniversity of LiverpoolLiverpoolUK
| | - John A. Harris
- Department of BiochemistryInstitute of Integrative BiologyUniversity of LiverpoolLiverpoolUK
| | - Natarajan Kannan
- Institute of BioinformaticsUniversity of GeorgiaAthensGAUSA
- Department of Biochemistry & Molecular BiologyUniversity of GeorgiaAthensGAUSA
| | - Patrick A. Eyers
- Department of BiochemistryInstitute of Integrative BiologyUniversity of LiverpoolLiverpoolUK
| |
Collapse
|
17
|
Savage SR, Zhang B. Using phosphoproteomics data to understand cellular signaling: a comprehensive guide to bioinformatics resources. Clin Proteomics 2020; 17:27. [PMID: 32676006 PMCID: PMC7353784 DOI: 10.1186/s12014-020-09290-x] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Accepted: 07/04/2020] [Indexed: 12/19/2022] Open
Abstract
Mass spectrometry-based phosphoproteomics is becoming an essential methodology for the study of global cellular signaling. Numerous bioinformatics resources are available to facilitate the translation of phosphopeptide identification and quantification results into novel biological and clinical insights, a critical step in phosphoproteomics data analysis. These resources include knowledge bases of kinases and phosphatases, phosphorylation sites, kinase inhibitors, and sequence variants affecting kinase function, and bioinformatics tools that can predict phosphorylation sites in addition to the kinase that phosphorylates them, infer kinase activity, and predict the effect of mutations on kinase signaling. However, these resources exist in silos and it is challenging to select among multiple resources with similar functions. Therefore, we put together a comprehensive collection of resources related to phosphoproteomics data interpretation, compared the use of tools with similar functions, and assessed the usability from the standpoint of typical biologists or clinicians. Overall, tools could be improved by standardization of enzyme names, flexibility of data input and output format, consistent maintenance, and detailed manuals.
Collapse
Affiliation(s)
- Sara R. Savage
- Department of Biomedical Informatics, Vanderbilt University, Nashville, TN USA
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX USA
| |
Collapse
|
18
|
Akdel M, Durairaj J, de Ridder D, van Dijk ADJ. Caretta - A multiple protein structure alignment and feature extraction suite. Comput Struct Biotechnol J 2020; 18:981-992. [PMID: 32368333 PMCID: PMC7186369 DOI: 10.1016/j.csbj.2020.03.011] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Revised: 02/01/2020] [Accepted: 03/13/2020] [Indexed: 02/06/2023] Open
Abstract
The vast number of protein structures currently available opens exciting opportunities for machine learning on proteins, aimed at predicting and understanding functional properties. In particular, in combination with homology modelling, it is now possible to not only use sequence features as input for machine learning, but also structure features. However, in order to do so, robust multiple structure alignments are imperative. Here we present Caretta, a multiple structure alignment suite meant for homologous but sequentially divergent protein families which consistently returns accurate alignments with a higher coverage than current state-of-the-art tools. Caretta is available as a GUI and command-line application and additionally outputs an aligned structure feature matrix for a given set of input structures, which can readily be used in downstream steps for supervised or unsupervised machine learning. We show Caretta’s performance on two benchmark datasets, and present an example application of Caretta in predicting the conformational state of cyclin-dependent kinases.
Collapse
Affiliation(s)
- Mehmet Akdel
- Bioinformatics Group, Department of Plant Sciences, Wageningen University and Research, The Netherlands
| | - Janani Durairaj
- Bioinformatics Group, Department of Plant Sciences, Wageningen University and Research, The Netherlands
| | - Dick de Ridder
- Bioinformatics Group, Department of Plant Sciences, Wageningen University and Research, The Netherlands
| | - Aalt D J van Dijk
- Bioinformatics Group, Department of Plant Sciences, Wageningen University and Research, The Netherlands.,Mathematical and Statistical Methods - Biometris, Department of Plant Sciences, Wageningen University and Research, The Netherlands
| |
Collapse
|
19
|
Paul F, Meng Y, Roux B. Identification of Druggable Kinase Target Conformations Using Markov Model Metastable States Analysis of apo-Abl. J Chem Theory Comput 2020; 16:1896-1912. [PMID: 31999924 DOI: 10.1021/acs.jctc.9b01158] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Kinases are important targets for drug development. However, accounting for the impact of possible structural rearrangements on the binding of kinase inhibitors is complicated by the extensive flexibility of their catalytic domain. The dynamic N-lobe contains four particular mobile structural elements: the Asp-Phe-Gly (DFG) motif, the phosphate (P) positioning loop, the activation (A) loop, and the αC helix. In our previous study [Meng et al. J. Chem. Theory Comput. 2018 14, 2721-2732], we combined various simulation techniques with Markov state modeling (MSM) to explore the free energy landscape of Abl kinase beyond conformations that are known from X-ray crystallography. Here we examine the resulting Markov model in greater detail by analyzing its metastable states. A characterization of the states in terms of their DFG state, P-loop, and αC conformations is presented and compared to existing classification schemes. Several metastable states are found to be structurally close to known crystal structures of different kinases in complex with a variety of inhibitors. These results suggest that the set of conformations accessible to tyrosine kinases may be shared within the entire family and that the conformational dynamics of one kinase in the absence of any ligand can provide meaningful information about possible target conformations for inhibitors of any member of the kinase family.
Collapse
Affiliation(s)
- Fabian Paul
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois 60637-1454, United States
| | - Yilin Meng
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois 60637-1454, United States
| | - Benoît Roux
- Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, Illinois 60637-1454, United States
| |
Collapse
|
20
|
Structural Consequences of Multisite Phosphorylation in the BAK1 Kinase Domain. Biophys J 2020; 118:698-707. [PMID: 31962105 DOI: 10.1016/j.bpj.2019.12.026] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2019] [Revised: 11/09/2019] [Accepted: 12/10/2019] [Indexed: 12/26/2022] Open
Abstract
Multisite phosphorylation is an important mechanism of post-translational control of protein kinases. The effects of combinations of possible phosphorylation states on protein kinase activity are difficult to study experimentally because of challenges in isolating a particular phosphorylation state; surprising little effort on this topic has been expended in computational studies. To understand the effects of multisite phosphorylation on the plant protein kinase brassinosteroid insensitive 1-associated kinase 1 (BAK1) conformational ensemble, we performed Gaussian accelerated molecular dynamics simulations on eight BAK1 mod-forms involving phosphorylation of the four activation-loop threonine residues and binding of ATP-Mg2+. We find that unphosphorylated BAK1 transitions into an inactive conformation with a "cracked" activation loop and with the αC helix swung away from the active site. T450 phosphorylation can prevent the activation loop from cracking and keep the αC helix in an active-like conformation, whereas phosphorylation of T455 only slightly stabilizes the activation loop. There is a general trend of reduced flexibility in interlobe motion with increased phosphorylation. Interestingly, the αC helix is destabilized when the activation loop is fully phosphorylated but is again stabilized with ATP-Mg2+ bound. Our results provide insight into the mechanism of phosphorylation-controlled BAK1 activation while at the same time represent the first, to our knowledge, comprehensive, comparative study of the effects of combinatorial phosphorylation states on protein kinase conformational dynamics.
Collapse
|
21
|
Modi V, Dunbrack RL. A Structurally-Validated Multiple Sequence Alignment of 497 Human Protein Kinase Domains. Sci Rep 2019; 9:19790. [PMID: 31875044 PMCID: PMC6930252 DOI: 10.1038/s41598-019-56499-4] [Citation(s) in RCA: 63] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2019] [Accepted: 11/14/2019] [Indexed: 12/21/2022] Open
Abstract
Studies on the structures and functions of individual kinases have been used to understand the biological properties of other kinases that do not yet have experimental structures. The key factor in accurate inference by homology is an accurate sequence alignment. We present a parsimonious, structure-based multiple sequence alignment (MSA) of 497 human protein kinase domains excluding atypical kinases. The alignment is arranged in 17 blocks of conserved regions and unaligned blocks in between that contain insertions of varying lengths present in only a subset of kinases. The aligned blocks contain well-conserved elements of secondary structure and well-known functional motifs, such as the DFG and HRD motifs. From pairwise, all-against-all alignment of 272 human kinase structures, we estimate the accuracy of our MSA to be 97%. The remaining inaccuracy comes from a few structures with shifted elements of secondary structure, and from the boundaries of aligned and unaligned regions, where compromises need to be made to encompass the majority of kinases. A new phylogeny of the protein kinase domains in the human genome based on our alignment indicates that ten kinases previously labeled as "OTHER" can be confidently placed into the CAMK group. These kinases comprise the Aurora kinases, Polo kinases, and calcium/calmodulin-dependent kinase kinases.
Collapse
Affiliation(s)
- Vivek Modi
- Fox Chase Cancer Center, 333 Cottman Avenue, Philadelphia, PA, 19111, USA
| | - Roland L Dunbrack
- Fox Chase Cancer Center, 333 Cottman Avenue, Philadelphia, PA, 19111, USA.
| |
Collapse
|
22
|
Lopez ED, Burastero O, Arcon JP, Defelipe LA, Ahn NG, Marti MA, Turjanski AG. Kinase Activation by Small Conformational Changes. J Chem Inf Model 2019; 60:821-832. [PMID: 31714778 DOI: 10.1021/acs.jcim.9b00782] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Protein kinases (PKs) are allosteric enzymes that play an essential role in signal transduction by regulating a variety of key cellular processes. Most PKs suffer conformational rearrangements upon phosphorylation that strongly enhance the catalytic activity. Generally, it involves the movement of the phosphorylated loop toward the active site and the rotation of the whole C-terminal lobe. However, not all kinases undergo such a large configurational change: The MAPK extracellular signal-regulated protein kinases ERK1 and ERK2 achieve a 50 000 fold increase in kinase activity with only a small motion of the C-terminal region. In the present work, we used a combination of molecular simulation tools to characterize the conformational landscape of ERK2 in the active (phosphorylated) and inactive (unphosphorylated) states in solution in agreement with NMR experiments. We show that the chemical reaction barrier is strongly dependent on ATP conformation and that the "active" low-barrier configuration is subtly regulated by phosphorylation, which stabilizes a key salt bridge between the conserved Lys52 and Glu69 belonging to helix-C and promotes binding of a second Mg ion. Our study highlights that the on-off switch embedded in the kinase fold can be regulated by small, medium, and large conformational changes.
Collapse
Affiliation(s)
- Elias D Lopez
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales , Universidad de Buenos Aires , Ciudad Autónoma de Buenos Aires , Argentina
| | - Osvaldo Burastero
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales , Universidad de Buenos Aires , Ciudad Autónoma de Buenos Aires , Argentina
| | - Juan P Arcon
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales , Universidad de Buenos Aires , Ciudad Autónoma de Buenos Aires , Argentina
| | - Lucas A Defelipe
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales , Universidad de Buenos Aires , Ciudad Autónoma de Buenos Aires , Argentina
| | - Natalie G Ahn
- Department of Chemistry and Biochemistry , University of Colorado , Boulder , Colorado 80309 , United States
| | - Marcelo A Marti
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales , Universidad de Buenos Aires , Ciudad Autónoma de Buenos Aires , Argentina
| | - Adrian G Turjanski
- Departamento de Química Biológica, Facultad de Ciencias Exactas y Naturales , Universidad de Buenos Aires , Ciudad Autónoma de Buenos Aires , Argentina
| |
Collapse
|
23
|
Jiang YY, Maier W, Baumeister R, Minevich G, Joachimiak E, Wloga D, Ruan Z, Kannan N, Bocarro S, Bahraini A, Vasudevan KK, Lechtreck K, Orias E, Gaertig J. LF4/MOK and a CDK-related kinase regulate the number and length of cilia in Tetrahymena. PLoS Genet 2019; 15:e1008099. [PMID: 31339880 PMCID: PMC6682161 DOI: 10.1371/journal.pgen.1008099] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2019] [Revised: 08/05/2019] [Accepted: 06/13/2019] [Indexed: 11/18/2022] Open
Abstract
The length of cilia is controlled by a poorly understood mechanism that involves members of the conserved RCK kinase group, and among them, the LF4/MOK kinases. The multiciliated protist model, Tetrahymena, carries two types of cilia (oral and locomotory) and the length of the locomotory cilia is dependent on their position with the cell. In Tetrahymena, loss of an LF4/MOK ortholog, LF4A, lengthened the locomotory cilia, but also reduced their number. Without LF4A, cilia assembled faster and showed signs of increased intraflagellar transport (IFT). Consistently, overproduced LF4A shortened cilia and downregulated IFT. GFP-tagged LF4A, expressed in the native locus and imaged by total internal reflection microscopy, was enriched at the basal bodies and distributed along the shafts of cilia. Within cilia, most LF4A-GFP particles were immobile and a few either diffused or moved by IFT. We suggest that the distribution of LF4/MOK along the cilium delivers a uniform dose of inhibition to IFT trains that travel from the base to the tip. In a longer cilium, the IFT machinery may experience a higher cumulative dose of inhibition by LF4/MOK. Thus, LF4/MOK activity could be a readout of cilium length that helps to balance the rate of IFT-driven assembly with the rate of disassembly at steady state. We used a forward genetic screen to identify a CDK-related kinase, CDKR1, whose loss-of-function suppressed the shortening of cilia caused by overexpression of LF4A, by reducing its kinase activity. Loss of CDKR1 alone lengthened both the locomotory and oral cilia. CDKR1 resembles other known ciliary CDK-related kinases: LF2 of Chlamydomonas, mammalian CCRK and DYF-18 of C. elegans, in lacking the cyclin-binding motif and acting upstream of RCKs. The new genetic tools we developed here for Tetrahymena have potential for further dissection of the principles of cilia length regulation in multiciliated cells.
Collapse
Affiliation(s)
- Yu-Yang Jiang
- Department of Cellular Biology, University of Georgia, Athens, Georgia, United States of America
| | - Wolfgang Maier
- Bio 3/Bioinformatics and Molecular Genetics, Faculty of Biology and ZBMZ, Faculty of Medicine, Albert-Ludwigs-University of Freiburg, Freiburg, Germany
| | - Ralf Baumeister
- Bio 3/Bioinformatics and Molecular Genetics, Faculty of Biology and ZBMZ, Faculty of Medicine, Albert-Ludwigs-University of Freiburg, Freiburg, Germany
| | - Gregory Minevich
- Department of Biochemistry and Molecular Biophysics, Columbia University Medical Center, New York, New York, United States of America
| | - Ewa Joachimiak
- Laboratory of Cytoskeleton and Cilia Biology, Nencki Institute of Experimental Biology of Polish Academy of Sciences, Warsaw, Poland
| | - Dorota Wloga
- Laboratory of Cytoskeleton and Cilia Biology, Nencki Institute of Experimental Biology of Polish Academy of Sciences, Warsaw, Poland
| | - Zheng Ruan
- Institute of Bioinformatics, University of Georgia, Athens, Georgia, United States of America
| | - Natarajan Kannan
- Institute of Bioinformatics, University of Georgia, Athens, Georgia, United States of America
- Department of Biochemistry and Molecular Biology, University of Georgia, Athens, Georgia, United States of America
| | - Stephen Bocarro
- Department of Cellular Biology, University of Georgia, Athens, Georgia, United States of America
| | - Anoosh Bahraini
- Department of Cellular Biology, University of Georgia, Athens, Georgia, United States of America
| | - Krishna Kumar Vasudevan
- Department of Cellular Biology, University of Georgia, Athens, Georgia, United States of America
| | - Karl Lechtreck
- Department of Cellular Biology, University of Georgia, Athens, Georgia, United States of America
| | - Eduardo Orias
- Department of Molecular, Cellular and Developmental Biology, University of California Santa Barbara, Santa Barbara, California, United States of America
| | - Jacek Gaertig
- Department of Cellular Biology, University of Georgia, Athens, Georgia, United States of America
- * E-mail:
| |
Collapse
|
24
|
Two Antagonistic Hippo Signaling Circuits Set the Division Plane at the Medial Position in the Ciliate Tetrahymena. Genetics 2018; 211:651-663. [PMID: 30593491 DOI: 10.1534/genetics.118.301889] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2018] [Accepted: 12/21/2018] [Indexed: 12/17/2022] Open
Abstract
In a single cell, ciliates maintain a complex pattern of cortical organelles that are arranged along the anteroposterior and circumferential axes. The underlying molecular mechanisms of intracellular pattern formation in ciliates are largely unknown. Ciliates divide by tandem duplication, a process that remodels the parental cell into two daughters aligned head-to-tail. In the elo1-1 mutant of Tetrahymena thermophila, the segmentation boundary/division plane forms too close to the posterior end of the parental cell, producing a large anterior and a small posterior daughter cell, respectively. We show that ELO1 encodes a Lats/NDR kinase that marks the posterior segment of the cell cortex, where the division plane does not form in the wild-type. Elo1 acts independently of CdaI, a Hippo/Mst kinase that marks the anterior half of the parental cell, and whose loss shifts the division plane anteriorly. We propose that, in Tetrahymena, two antagonistic Hippo circuits focus the segmentation boundary/division plane at the equatorial position, by excluding divisional morphogenesis from the cortical areas that are too close to cell ends.
Collapse
|
25
|
Feltes BC, Grisci BI, Poloni JDF, Dorn M. Perspectives and applications of machine learning for evolutionary developmental biology. Mol Omics 2018; 14:289-306. [PMID: 30168572 DOI: 10.1039/c8mo00111a] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Evolutionary Developmental Biology (Evo-Devo) is an ever-expanding field that aims to understand how development was modulated by the evolutionary process. In this sense, "omic" studies emerged as a powerful ally to unravel the molecular mechanisms underlying development. In this scenario, bioinformatics tools become necessary to analyze the growing amount of information. Among computational approaches, machine learning stands out as a promising field to generate knowledge and trace new research perspectives for bioinformatics. In this review, we aim to expose the current advances of machine learning applied to evolution and development. We draw clear perspectives and argue how evolution impacted machine learning techniques.
Collapse
Affiliation(s)
- Bruno César Feltes
- Institute of Informatics, Federal University of Rio Grande do Sul, Porto Alegre, Brazil.
| | | | | | | |
Collapse
|
26
|
Altered conformational landscape and dimerization dependency underpins the activation of EGFR by αC- β4 loop insertion mutations. Proc Natl Acad Sci U S A 2018; 115:E8162-E8171. [PMID: 30104348 DOI: 10.1073/pnas.1803152115] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Mutational activation of epidermal growth factor receptor (EGFR) in human cancers involves both point mutations and complex mutations (insertions and deletions). In particular, short in-frame insertion mutations within a conserved αC-β4 loop in the EGFR kinase domain are frequently observed in tumor samples and patients harboring these mutations are insensitive to first-generation EGFR inhibitors. Despite the prevalence and clinical relevance of insertion mutations, the mechanisms by which these mutations regulate EGFR activity and contribute to drug sensitivity are poorly understood. Using cell-based mutation screening, we find that the precise location, length, and sequence of the inserted segment are critical for ligand-independent EGFR activation and downstream signaling. We identify three insertion mutations (N771_P772insN, D770_N771insG, and D770>GY) that activate EGFR in a unique way by relying more on the "acceptor" interface for kinase activation. Our drug inhibition studies indicate that these activating insertion mutations respond more favorably to osimertinib, a recently Food and Drug Administration-approved EGFR inhibitor for T790M-positive patients with lung cancer. Molecular dynamics simulations and umbrella sampling of WT and mutant EGFR suggest a model in which activating insertion mutations increase catalytic activity by relieving key autoinhibitory interactions associated with αC-helix movement and by lowering the transition free energy ([Formula: see text]) between active and inactive states. Our studies also identify a transition state sampled by activating insertion mutations that can be exploited in the design of mutant-selective EGFR inhibitors.
Collapse
|
27
|
Ung PMU, Rahman R, Schlessinger A. Redefining the Protein Kinase Conformational Space with Machine Learning. Cell Chem Biol 2018; 25:916-924.e2. [PMID: 29861272 DOI: 10.1016/j.chembiol.2018.05.002] [Citation(s) in RCA: 59] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2017] [Revised: 02/28/2018] [Accepted: 05/01/2018] [Indexed: 02/07/2023]
Abstract
Protein kinases are dynamic, adopting different conformational states that are critical for their catalytic activity. We assess a range of structural features derived from the conserved αC helix and DFG motif to define the conformational space of the catalytic domain of protein kinases. We then construct Kinformation, a random forest classifier, to annotate the conformation of 3,708 kinase structures in the PDB. Our classification scheme captures known active and inactive kinase conformations and defines an additional conformational state, thereby refining the current understanding of the kinase conformational space. Furthermore, network analysis of the small molecules recognized by each conformation captures chemical substructures that are associated with each conformation type. Our description of the kinase conformational space is expected to improve modeling of protein kinase structures, as well as guide the development of conformation-specific kinase inhibitors with optimal pharmacological profiles.
Collapse
Affiliation(s)
- Peter Man-Un Ung
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
| | - Rayees Rahman
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Avner Schlessinger
- Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
| |
Collapse
|
28
|
Mittal S, Shukla D. Recruiting machine learning methods for molecular simulations of proteins. MOLECULAR SIMULATION 2018. [DOI: 10.1080/08927022.2018.1448976] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Affiliation(s)
- Shriyaa Mittal
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign , Urbana, IL, USA
| | - Diwakar Shukla
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign , Urbana, IL, USA
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign , Urbana, IL, USA
| |
Collapse
|